A Comparative Study of Machine Learning and Deep Learning Models for Automatic Parkinson’s Disease Detection from Electroencephalogram Signals
Abstract
:1. Introduction
2. Related Study
3. Materials and Methods
3.1. Datasets
3.1.1. Dataset-1
- Frontal and Prefrontal Regions: Fp1, Fp2, AF3, AF4, F3, F4, F7, F8, Fz;
- Frontocentral and Central Regions: FC1, FC2, FC5, FC6, C3, C4, Cz, T7, T8;
- Centroparietal and Parietal Regions: CP1, CP2, CP5, CP6, P3, P4, P7, P8, Pz;
- Parieto-Occipital and Occipital Regions: PO3, PO4, O1, O2, Oz.
3.1.2. Dataset-2
3.2. Dataset Pre-Processing
3.2.1. Band-Pass Filtering
- Alpha (frequency range 8–13 Hz): Connected with calmness and wakeful rest;
- Beta (frequency range 13–30 Hz): Related to problem-solving, active thinking, etc.;
- Gamma (frequency range 30–48 Hz): Related to perception, attention, memory, consciousness, etc.;
- Theta (frequency range 4–8 Hz): Linked to drowsiness, relaxation, etc.;
- Delta (frequency range 1–4 Hz): Commonly associated with restorative processes and deep sleep [26].
3.2.2. Artifact Removal
3.2.3. Segmentation
3.2.4. Extraction of Features
3.3. Classification
- Case 1: PD_ON vs. HC (distinguishing PD patients with medication from non-PD patients);
- Case 2: PD_OFF vs. HC (distinguishing PD patients without medication from non-PD patients);
- Case 3: PD vs. HC (distinguishing PD patients from non-PD patients).
3.3.1. Machine Learning Baseline
3.3.2. Deep Learning Classification
- Input Layer: Our model inputs a feature matrix of shape 5580, 32 × 5 (number of segments, channels × bands), where each segment represents a 1 s epoch of EEG data with PSD values from 32 channels and five bands.
- Convolutional Layers: Two convolutional layers with 3 × 3 kernels are stacked to extract spatial features across channels and bands. Then, Rectified Linear Unit (ReLU) activations are applied to these layers to introduce some non-linearity.
- Pooling Layers: We used two max pooling layers of 2 × 1 in size, resulting in an output shape of 16 × 5 in each segment.
- Fully Connected Layers: After two pairs of convolutional and max pooling layers, the flattened feature map is fed into a fully connected layer (FC) with the same activation ReLU. These layers integrate the high-level features learned from the convolutions to produce a final classification decision.
- Output Layer: A sigmoid layer is used as an output layer for binary classification, which generates the probability of each class (PD or healthy control).
4. Experimental Results
4.1. Results Using Machine Learning-Based Classification
4.2. Results Using Deep Learning-Based Classification
4.3. Ablation Study: Impact of Model Components
Experimental Setup
- The 2D-CNN Model: Trained using the combined features from all five frequency bands;
- The 1D-CNN Model: Five separate models trained individually on five separate bands.
5. Discussion
5.1. Significance of the Findings
5.2. Strengths of the Deep Learning Approach
5.3. Limitations
5.4. Future Directions
- Use of data augmentation techniques, transfer learning approaches, and inclusion of more diverse populations into the dataset to improve model generalizability.
- Incorporating advanced feature extraction techniques, such as wavelet transforms and entropy measures, along with other statistical measures like standard deviation, Kurtosis, etc., to capture more nuanced signal characteristics.
- Exploring alternative models, such as LSTM, to capture more complex sequences of EEG signals and attention mechanisms to better account for temporal dependencies in EEG signals.
- Employing transfer learning from pre-trained models, or hybrid architectures combining CNNs with traditional feature-based methods with subject independency to improve performance on small EEG datasets.
6. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
Abbreviations
PD | Parkinson’s disease |
SVM | Support vector machine |
CNN | Convolutional neural network |
PSD | Power spectral density |
EEG | Electroencephalogram |
PLV | phase-locked value |
References
- Goetz, C.G. The History of Parkinson’s Disease: Early clinical descriptions and neurological therapies. Cold Spring Harb. Perspect. Med. 2011, 1, a008862. [Google Scholar] [CrossRef]
- Statistics. Parkinson’s Foundation. Available online: https://www.parkinson.org/understanding-parkinsons/statistics (accessed on 15 January 2025).
- Ugrumov, M. Development of early diagnosis of Parkinson’s disease: Illusion or reality? CNS Neurosci. Ther. 2020, 26, 997. [Google Scholar] [CrossRef] [PubMed]
- Mari, Z.; Mestre, T.A. The Disease Modification Conundrum in Parkinson’s Disease: Failures and Hopes. Front. Aging Neurosci. 2022, 14, 810860. [Google Scholar] [CrossRef] [PubMed]
- Biasiucci, A.; Franceschiello, B.; Murray, M.M. Electroencephalography. Curr. Biol. 2019, 29, R80–R85. [Google Scholar] [CrossRef]
- Aljalal, M.; Aldosari, S.A.; AlSharabi, K.; Abdurraqeeb, A.M.; Alturki, F.A. Parkinson’s Disease Detection from Resting-State EEG Signals Using Common Spatial Pattern, Entropy, and Machine Learning Techniques. Diagnostics 2022, 12, 1033. [Google Scholar] [CrossRef]
- Mei, J.; Desrosiers, C.; Frasnelli, J. Machine learning for the diagnosis of Parkinson’s disease: A review of literature. Front. Aging Neurosci. 2021, 13, 633752. [Google Scholar] [CrossRef]
- Hari, R.; Puce, A. MEG-EEG Primer; Oxford University Press eBooks: Oxford, UK, 2017. [Google Scholar] [CrossRef]
- Pirrone, D.; Weitschek, E.; Di Paolo, P.; De Salvo, S.; De Cola, M.C. EEG signal processing and supervised machine learning to early diagnose alzheimer’s disease. Appl. Sci. 2022, 12, 5413. [Google Scholar] [CrossRef]
- Palop, J.J. Epilepsy and cognitive impairments in Alzheimer disease. Arch. Neurol. 2009, 66, 435. [Google Scholar] [CrossRef]
- Suhaimi, N.S.; Mountstephens, J.; Teo, J. EEG-Based Emotion Recognition: A State-of-the-Art review of current trends and opportunities. Comput. Intell. Neurosci. 2020, 2020, 1–19. [Google Scholar] [CrossRef]
- Dhara, T.; Singh, P.K.; Mahmud, M. A fuzzy ensemble-based deep learning model for EEG-based emotion recognition. Cogn. Comput. 2024, 16, 1364–1378. [Google Scholar] [CrossRef]
- Dave, M.; Mukherjee, S.K.; Singh, P.K.; Mahmud, M. Deep Ensemble Learning Approach for Multimodal Emotion Recognition. In Proceedings of the 3rd International Conference on Trends in Electronics and Health Informatics (TEHI-2023), Dhaka, Bangladesh, 20–21 December 2023; Mahmud, M., Kaiser, M.S., Bandyopadhyay, A., Ray, K., Al Mamun, S., Eds.; Lecture Notes in Networks and Systems 1034. Springer Publishers: Berlin/Heidelberg, Germany; Jahagirnagar University: Savar, Bangladesh, 2024; pp. 211–223. [Google Scholar]
- Ghosh, S.; Kim, S.; Ijaz, M.F.; Singh, P.K.; Mahmud, M. Classification of mental stress from wearable physiological sensors using image-encoding-based deep neural network. Biosensors 2022, 12, 1153. [Google Scholar] [CrossRef] [PubMed]
- Mukherjee, S.K.; Dave, M.; Singh, P.K. A Transfer Learning Approach for the Classification of Human Emotions Using Electroencephalogram Signals. In International Conference on Emerging Applications of Information Technology; Springer Nature: Singapore, 2024; pp. 219–228. [Google Scholar]
- Anjum, M.F.; Dasgupta, S.; Mudumbai, R.; Singh, A.; Cavanagh, J.F.; Narayanan, N.S. Linear predictive coding distinguishes spectral EEG features of Parkinson’s disease. Park. Relat. Disord. 2020, 79, 79–85. [Google Scholar] [CrossRef] [PubMed]
- Lee, S.-B.; Kim, Y.-J.; Hwang, S.; Son, H.; Lee, S.K.; Park, K.-I.; Kim, Y.-G. Predicting Parkinson’s disease using gradient boosting decision tree models with electroencephalography signals. Park. Relat. Disord. 2022, 95, 77–85. [Google Scholar] [CrossRef]
- Lal, U.; Chikkankod, A.V.; Longo, L. Fractal dimensions and machine learning for detection of Parkinson’s disease in resting-state electroencephalography. Neural Comput. Applic. 2024, 36, 8257–8280. [Google Scholar] [CrossRef]
- Latifoğlu, F.; Penekli, S.; Orhanbulucu, F.; Chowdhury, M.E.H. A novel approach for Parkinson’s disease detection using Vold-Kalman order filtering and machine learning algorithms. Neural Comput. Appl. 2024, 36, 9297–9311. [Google Scholar] [CrossRef]
- Loh, H.W.; Ooi, C.P.; Palmer, E.; Barua, P.D.; Dogan, S.; Tuncer, T.; Baygin, M.; Acharya, U.R. GaborPDNet: Gabor transformation and deep neural network for parkinson’s disease detection using EEG signals. Electronics 2021, 10, 1740. [Google Scholar] [CrossRef]
- Qiu, L.; Li, J.; Pan, J. Parkinson’s disease detection based on multi-pattern analysis and multi-scale convolutional neural networks. Front. Neurosci. 2022, 16, 957181. [Google Scholar] [CrossRef]
- Qiu, L.; Li, J.; Zhong, L.; Feng, W.; Zhou, C.; Pan, J. A novel EEG-Based Parkinson’s disease detection model using multiscale convolutional prototype networks. IEEE Trans. Instrum. Meas. 2024, 73, 1–14. [Google Scholar] [CrossRef]
- Chang, H.; Liu, B.; Zong, Y.; Lu, C.; Wang, X. EEG-Based Parkinson’s disease recognition via Attention-Based Sparse Graph Convolutional Neural Network. IEEE J. Biomed. Health Inform. 2023, 27, 5216–5224. [Google Scholar] [CrossRef]
- OpenNeuro. Available online: https://openneuro.org/datasets/ds002778/versions/1.0.5 (accessed on 15 January 2025).
- Gramfort, A. MEG and EEG data analysis with MNE-Python. Front. Neurosci. 2013, 7, 267. [Google Scholar] [CrossRef]
- Ksibi, A.; Zakariah, M.; Menzli, L.J.; Saidani, O.; Almuqren, L.; Hanafieh, R.A.M. Electroencephalography-Based depression detection using multiple machine learning techniques. Diagnostics 2023, 13, 1779. [Google Scholar] [CrossRef] [PubMed]
- Mukherjee, D.; Mondal, R.; Singh, P.K.; Sarkar, R.; Bhattacharjee, D. EnsemConvNet: A deep learning approach for human activity recognition using smartphone sensors for healthcare applications. Multimed. Tools Appl. 2020, 79, 31663–31690. [Google Scholar] [CrossRef]
- Bhattacharya, D.; Sharma, D.; Kim, W.; Ijaz, M.F.; Singh, P.K. Ensem-HAR: An Ensemble Deep Learning Model for Smartphone Sensor-Based Human Activity Recognition for Measurement of Elderly Health Monitoring. Biosensors 2022, 12, 393. [Google Scholar] [CrossRef] [PubMed]
- Banerjee, A.; Sarkar, A.; Roy, S.; Singh, P.K.; Sarkar, R. COVID-19 chest X-ray detection through blending ensemble of CNN snapshots. Biomed. Signal Process. Control 2022, 78, 104000. [Google Scholar] [CrossRef]
- Tanveer, M.; Rashid, A.H.; Kumar, R.; Balasubramanian, R. Parkinson’s disease diagnosis using neural networks: Survey and comprehensive evaluation. Inf. Process. Manag. 2022, 59, 102909. [Google Scholar] [CrossRef]
- Goutte, C.; Gaussier, E. A Probabilistic Interpretation of Precision, Recall and F-Score, with Implication for Evaluation. In European Conference on Information Retrieval; Lecture Notes in Computer Science; Springer: Berlin/Heidelberg, Germany, 2005; pp. 345–359. [Google Scholar] [CrossRef]
- Lee, S.; Liu, A.; Wang, Z.J.; McKeown, M.J. Abnormal phase coupling in parkinson’s disease and normalization effects of subthreshold vestibular stimulation. Front. Hum. Neurosci. 2019, 13, 118. [Google Scholar] [CrossRef]
PD | HC | |
---|---|---|
Number of subjects | 15 | 16 |
Gender | 7 males–8 females | 7 males–9 females |
Age range | 47 to 74 | 57 to 82 |
Mini-mental state score (avg.) | 28.9 | 29.2 |
PD | HC | |
---|---|---|
Number of subjects | 14 | 14 |
Gender | 6 males–8 females | 6 males–8 females |
Age range | 54 to 86 | 54 to 86 |
Montreal cognitive assessment (avg.) | 25.9 | 27.2 |
Case 1 (Dataset-1) | Case 2 (Dataset-1) | Case 3 (Dataset-2) | |||||||
---|---|---|---|---|---|---|---|---|---|
Accuracy | Specificity | Sensitivity | Accuracy | Specificity | Sensitivity | Accuracy | Specificity | Sensitivity | |
theta | 0.7744 | 0.8438 | 0.7004 | 0.7819 | 0.8691 | 0.6889 | 0.8176 | 0.8827 | 0.7524 |
alpha | 0.7900 | 0.8781 | 0.6959 | 0.8002 | 0.8854 | 0.7093 | 0.8429 | 0.8810 | 0.8048 |
beta | 0.8206 | 0.8503 | 0.7889 | 0.8116 | 0.8330 | 0.7889 | 0.9042 | 0.9238 | 0.8845 |
gamma | 0.8228 | 0.7872 | 0.8607 | 0.8066 | 0.8792 | 0.7293 | 0.9414 | 0.9589 | 0.9238 |
delta | 0.6559 | 0.6795 | 0.6307 | 0.6975 | 0.8420 | 0.5433 | 0.6815 | 0.8458 | 0.5173 |
Case 1 (Dataset-1) | Case 2 (Dataset-2) | Case 3 (Dataset-3) | |
---|---|---|---|
Accuracy | |||
theta | 68.09% | 48.48% | 52.96% |
alpha | 61.42% | 64.54% | 56.66% |
beta | 45.71% | 45.15% | 49.63% |
gamma | 48.18% | 51.51% | 46.29% |
delta | 55.23% | 48.48% | 49.62% |
Dataset-1 | Dataset-2 | ||
---|---|---|---|
Case 1 | Case 2 | Case 3 | |
Accuracy | 0.9670 | 0.9581 | 0.9929 |
Precision | 0.9590 | 0.9511 | 0.9924 |
Sensitivity | 0.9738 | 0.9633 | 0.9936 |
F1-Score | 0.9662 | 0.9570 | 0.9929 |
AUC | 0.9929 | 0.9919 | 0.9991 |
Model | EEG Band | Case | Accuracy |
---|---|---|---|
1D-CNN | theta | Case 1 Case 2 Case 3 | 71.78% 69.41% 90.74 |
1D-CNN | alpha | Case 1 Case 2 Case 3 | 64.95% 64.28% 90.05% |
1D-CNN | beta | Case 1 Case 2 Case 3 | 79.37% 75.77% 92.39 |
1D-CNN | gamma | Case 1 Case 2 Case 3 | 98.31% 85.34% 97.22% |
1D-CNN | delta | Case 1 Case 2 Case 3 | 70.51% 58.50% 85.05 |
2D-CNN | All five bands combined | Case 1 Case 2 Case 3 | 96.70% 95.81% 99.29% |
Work | Dataset | Model | Classification | Accuracy |
---|---|---|---|---|
Lal et al., 2024 [18] | Dataset-1 | KNN classifier with the Higuchi fractal dimension | Case 1 Case 2 | 96.46% 94.45% |
Latifoğlu et al., 2024 [19] | Dataset-1 | SVM with LOOCV | Case 1 Case 2 | 100% 99.68% |
KNN with LOOCV | Case 1 Case 2 | 99.68% 100% | ||
Qiu et al., 2022 [21] | Dataset-1 | SVM based on power spectral density features | Case 1 Case 2 Case 3 | 82.33% 78.69% 78.08% |
Dataset-2 | ||||
S.-B. Lee et al., 2022 [17] | Dataset-2 | Decision Tree with gradient boost using the Hjorth parameter | Case 3 | 89.30% |
Our proposed method | Dataset-1 | SVM based on power spectral density features | Case 1 Case 2 Case 3 | 82.28% 81.16% 94.14% |
Dataset-2 |
Work | Dataset | Model | Classification | Accuracy |
---|---|---|---|---|
Qiu et al., 2024 [22] | Dataset-1 | Multi-scale CNN with prototype calibration | Case 1 Case 2 Case 3 | 88.7% 84.5% 83.2% |
Dataset-2 | ||||
Chang et al., 2023 [23] | PD Oddball Data | Attention-Based Graph CNN with the LOOCV method | Case 1 Case 2 | 79.96% 87.67% |
Qiu et al., 2022 [21] | Dataset-1 | CNN (multi-layer perceptron with weight sharing) | Case 1 Case 2 Case 3 | 98.97% 97.15% 99.82% |
Dataset-2 | ||||
Loh et al., 2021 [20] | Dataset-1 | CNN with Gabor transformation of EEG | Case 1 Case 2 | 100% 99.44% |
Our proposed method | Dataset-1 | 2D-CNN | Case 1 Case 2 Case 3 | 96.70% 95.81% 99.29% |
Dataset-2 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Bera, S.; Geem, Z.W.; Cho, Y.-I.; Singh, P.K. A Comparative Study of Machine Learning and Deep Learning Models for Automatic Parkinson’s Disease Detection from Electroencephalogram Signals. Diagnostics 2025, 15, 773. https://doi.org/10.3390/diagnostics15060773
Bera S, Geem ZW, Cho Y-I, Singh PK. A Comparative Study of Machine Learning and Deep Learning Models for Automatic Parkinson’s Disease Detection from Electroencephalogram Signals. Diagnostics. 2025; 15(6):773. https://doi.org/10.3390/diagnostics15060773
Chicago/Turabian StyleBera, Sankhadip, Zong Woo Geem, Young-Im Cho, and Pawan Kumar Singh. 2025. "A Comparative Study of Machine Learning and Deep Learning Models for Automatic Parkinson’s Disease Detection from Electroencephalogram Signals" Diagnostics 15, no. 6: 773. https://doi.org/10.3390/diagnostics15060773
APA StyleBera, S., Geem, Z. W., Cho, Y.-I., & Singh, P. K. (2025). A Comparative Study of Machine Learning and Deep Learning Models for Automatic Parkinson’s Disease Detection from Electroencephalogram Signals. Diagnostics, 15(6), 773. https://doi.org/10.3390/diagnostics15060773