This project explores the application of deep learning models for classifying Alzheimer's disease stages using brain imaging data. We compare the performance of multiple state-of-the-art convolutional neural network architectures across two datasets: a standard dataset and a pseudo-RGB augmented dataset.
- Standard Dataset: Original brain imaging data without augmentation
- Pseudo-RGB Dataset: Augmented version of the brain imaging data
The pseudo-RGB dataset preparation process is detailed in a linked repository: Pseudo-RGB Dataset Preparation.
We benchmarked several cutting-edge convolutional neural network architectures, including:
- VGG16
- ResNet50
- InceptionV3
- MobileNetV2
- Xception
-
Data Preparation:
- Split the datasets into train, validation, and test sets (80%, 10%, 10%)
- Applied data augmentation techniques including rotation, width/height shifts, shear, zoom, and horizontal flip
-
Model Architecture:
- Used transfer learning with pre-trained weights from ImageNet
- Customized the top layers for our specific classification task
- Implemented fine-tuning by unfreezing and training the last few layers of the base model
-
Training:
- Utilized categorical cross-entropy loss and various optimizers
- Implemented callbacks for early stopping, model checkpointing, and learning rate reduction
- Trained for a maximum of 200 epochs with a batch size of 32
-
Evaluation:
- Computed comprehensive metrics including accuracy, precision, recall, F1-score, Cohen's Kappa, log loss, and Brier score
- Generated confusion matrices and ROC curves for multi-class classification
- Analyzed per-class performance and misclassification rates
For each model, the following files are generated:
[ModelName].ipynb
: Jupyter notebook containing all the code for data processing, model training, and evaluationmodel_summary.txt
: Architecture summary of the model[Optimizer]_metrics_[ModelName].csv
: Detailed metrics for the model trained with specific optimizerbest_optimizer_metrics_[Optimizer]_[ModelName].csv
: Best metrics achieved by the model with the best optimizerhyperparameter.csv
: Hyperparameters used for model trainingsegmented_multiclass_metrics.csv
: Detailed metrics for all evaluated modelssummary_metrics_[ModelName].csv
: Summary of metrics for the specific modeltraining_history.csv
: Epoch-wise training and validation metricsmulticlass_confusion_matrix.png
: Visualization of the confusion matrixmulticlass_roc_curves.png
: ROC curves for multi-class classification
To replicate this study:
- Clone this repository
- Install the required dependencies (list them here or include a
requirements.txt
file) - Run the desired
[ModelName].ipynb
notebook - Explore the generated CSV files for detailed metrics and PNG files for visualizations
[Optimizer]_metrics_[ModelName].csv
: Contains epoch-by-epoch metrics for the model trained with a specific optimizer.best_optimizer_metrics_[Optimizer]_[ModelName].csv
: Presents the best metrics achieved by the model using the optimal optimizer.hyperparameter.csv
: Lists all hyperparameters used across different model trainings.segmented_multiclass_metrics.csv
: Provides a comprehensive breakdown of metrics for each class and overall model performance.summary_metrics_[ModelName].csv
: Offers a condensed view of key performance indicators for the specific model.training_history.csv
: Records the training and validation metrics for each epoch during model training.
- Explore ensemble methods combining multiple model architectures
- Investigate the impact of different data augmentation techniques
- Extend the study to include additional neuroimaging modalities
We acknowledge the contributions of various datasets, libraries, and tools used in this project. Special thanks to the creators of the pseudo-RGB dataset preparation process detailed in the linked repository.