Exploring Voice Acoustic Features Associated with Cognitive Status in Korean Speakers: A Preliminary Machine Learning Study
Abstract
1. Introduction
- To develop and evaluate a machine learning-based classification system for detecting cognitive impairment using speech data from multiple vocal tasks.
- To investigate specific acoustic features that may be associated with different levels of cognitive impairment in Korean speakers.
- To explore the feasibility of developing practical, non-invasive screening tools that could potentially be implemented in various healthcare settings.
2. Materials and Methods
2.1. General Overview
- Collection of speech data from 223 patients with suspected cognitive impairment and extraction of voice acoustic features.
- Grouping of patients based on K-MMSE scores to assess cognitive status.
- Development of machine learning models to classify patients into cognitive status groups.
- Examination of voice characteristics related to cognitive status through explainable AI analysis.
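The grouping step in this overview can be sketched as below. This is a minimal illustration, assuming the three K-MMSE score bands listed in the demographic table (0 to 19, above 19 to 23, above 23 to 30) and the Severe/Mild/Normal labels used in the results tables. The function name `kmmse_group` is a hypothetical name chosen here, not the authors' code.

```python
def kmmse_group(score: float) -> str:
    """Map a K-MMSE total score (0-30) to a cognitive status group.

    Assumed cutoffs, taken from the score bands in the demographic table:
    0 <= score <= 19 -> "severe", 19 < score <= 23 -> "mild",
    23 < score <= 30 -> "normal".
    """
    if not 0 <= score <= 30:
        raise ValueError(f"K-MMSE score out of range: {score}")
    if score <= 19:
        return "severe"
    if score <= 23:
        return "mild"
    return "normal"


if __name__ == "__main__":
    # Boundary scores show which side of each cutoff they fall on.
    for s in (12, 19, 20, 23, 24, 30):
        print(s, kmmse_group(s))
```

Keeping the cutoffs in one function makes it easy to check that boundary scores (19 and 23) are assigned consistently across the whole cohort.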
2.2. Data Preprocessing
2.2.1. Patients
- Korean patients visiting the neurosurgery clinic with suspected cognitive impairment.
- Ability to understand and follow task instructions.
- No history of other neurological conditions affecting speech.
- Provided informed consent for participation in the study.
- Understand and follow task instructions.
- Produce intelligible speech.
- Complete all required speech tasks.
2.2.2. Speech Data Collection
- Vowel Tasks (4 tasks)
  - Sustained vowel production (/α/, /i/, /u/): Participants sustained each vowel for 2–3 s to assess phonatory stability and fundamental vocal control.
  - Vowel prolongation (/α-α-α/): Participants sustained /α/ for as long as possible to evaluate maximum phonation time and respiratory control.
- DDK Tasks (4 tasks)
  - Alternate motion rate (AMR) tasks (/puh-puh-puh/, /tuh-tuh-tuh/, /kuh-kuh-kuh/): Participants rapidly repeated individual syllables to assess speech-motor function and coordination [26].
  - Sequential motion rate (SMR) task (/puh-tuh-kuh/): Participants produced a sequence of different syllables to evaluate motor planning and sequencing abilities [27].
2.2.3. Extracting Voice Acoustic Features
2.3. Model Development Process
2.3.1. Model Selection
2.3.2. Model Construction and Evaluation
2.3.3. Model Explanation
- Quantifying the contribution of each feature to every possible model prediction.
- Considering complex interactions between features.
- Maintaining local accuracy and consistency.
- Providing both global and local feature importance.
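The properties listed above follow from the Shapley value on which SHAP-style explanations are built. The sketch below computes exact Shapley values for a single prediction by enumerating feature coalitions. It is an illustration under stated assumptions, not the study's pipeline: `toy_model`, its coefficients, and the feature names are hypothetical, and absent features are simply replaced by a baseline value. Practical SHAP implementations rely on approximations because exact enumeration grows as 2^n.

```python
from itertools import combinations
from math import factorial


def shapley_values(predict, x, baseline):
    """Exact Shapley values for one prediction by enumerating coalitions.

    Features outside a coalition are replaced by their baseline value.
    Cost grows as 2^n, so this only suits a handful of features.
    """
    n = len(x)

    def value(coalition):
        mixed = [x[i] if i in coalition else baseline[i] for i in range(n)]
        return predict(mixed)

    phi = [0.0] * n
    for i in range(n):
        others = [j for j in range(n) if j != i]
        for size in range(n):
            # Shapley weight for coalitions of this size that exclude feature i.
            weight = factorial(size) * factorial(n - size - 1) / factorial(n)
            for subset in combinations(others, size):
                s = set(subset)
                phi[i] += weight * (value(s | {i}) - value(s))
    return phi


def toy_model(features):
    # Hypothetical scoring function with one interaction term.
    shimmer, stdev_f0, duration = features
    return 2.0 * shimmer + 1.0 * stdev_f0 + 0.5 * shimmer * duration


if __name__ == "__main__":
    x = [1.0, 2.0, 4.0]
    baseline = [0.0, 0.0, 0.0]
    phi = shapley_values(toy_model, x, baseline)
    print("contributions:", phi)
    # Local accuracy: contributions sum to f(x) - f(baseline).
    print(sum(phi), toy_model(x) - toy_model(baseline))
```

In this toy case the interaction term is split equally between the two features involved, which is how interactions enter per-feature attributions. Averaging the absolute contributions over many patients would give a global importance ranking, while the per-patient values are the local explanation.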
3. Results
3.1. Demographic Analysis
3.2. Model Construction
4. Discussion
4.1. Interpretation of Machine Learning Results in the Context of Demographic Factors
4.2. Methodological Considerations and Task-Specific Analysis
4.2.1. Sustained Vowel Tasks and Their Implications
4.2.2. DDK Tasks and Motor Control Assessment
4.2.3. Model Performance Across Tasks
- Severe vs. Normal: PR-AUC = 0.737.
- Mild vs. Normal: PR-AUC = 0.726.
- Severe + Mild vs. Normal: PR-AUC = 0.715.
- Focused Acoustic Analysis: While previous studies often combined multiple feature types or showed modest improvements when adding acoustic features to demographic data, our study demonstrates the potential of using purely acoustic features for cognitive screening. This focused approach could be particularly valuable in situations where collecting linguistic or demographic data is challenging or impractical.
- Task-Feature Relationships: By analyzing specific acoustic features across different speech tasks, we identified which combinations of tasks and features are most indicative of cognitive decline. For example, our findings suggest that DDA shimmer from the /i/ task and stdevF0 from the /puh-tuh-kuh/ task are particularly informative, providing insights that could help optimize future screening protocols.
- Korean Language Context: Most previous studies have focused on English speakers, while our research provides specific insights into how cognitive decline manifests in Korean speech patterns. This contribution is particularly valuable given the potential differences in how cognitive impairment affects speakers of different languages.
- Severity-Specific Analysis: Rather than treating cognitive impairment as a binary condition, our approach differentiated between severe and mild impairment, providing more nuanced insights into how acoustic features might reflect different stages of cognitive decline.
- The performance differences between classification tasks were relatively small.
- The demographic differences between groups may have influenced these results.
- The model’s performance should be compared with existing screening methods to establish practical utility.
4.2.4. Feature Interactions and Task Dependencies
- Different aspects of speech production may be affected at different stages of cognitive decline.
- Certain task-feature combinations may be more sensitive to specific levels of cognitive impairment.
- A multi-task approach might provide more robust assessment capabilities than single-task protocols.
4.2.5. Clinical Feasibility Considerations
- The time required to complete all eight tasks may be burdensome for patients.
- The technical requirements for accurate recording and analysis need to be standardized.
- The relative contribution of each task to overall classification accuracy needs to be evaluated to potentially streamline the protocol.
- The reliability and reproducibility of measurements across different clinical settings need to be established [63].
4.3. Limitations and Future Research Directions
5. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
References
1. Lena, A.; Ashok, K.; Padma, M.; Kamath, V.; Kamath, A. Health and social problems of the elderly: A cross-sectional study in Udupi Taluk, Karnataka. Indian J. Community Med. Off. Publ. Indian Assoc. Prev. Soc. Med. 2009, 34, 131–134.
2. Qiu, C.; Fratiglioni, L. Aging without dementia is achievable: Current evidence from epidemiological research. J. Alzheimer’s Dis. 2018, 62, 933–942.
3. Corrada, M.M.; Brookmeyer, R.; Paganini-Hill, A.; Berlau, D.; Kawas, C.H. Dementia incidence continues to increase with age in the oldest old: The 90+ study. Ann. Neurol. 2010, 67, 114–121.
4. Mecocci, P.; Boccardi, V. The impact of aging in dementia: It is time to refocus attention on the main risk factor of dementia. Ageing Res. Rev. 2021, 65, 101210.
5. Zhao, X.; Ang, C.K.E.; Acharya, U.R.; Cheong, K.H. Application of Artificial Intelligence techniques for the detection of Alzheimer’s disease using structural MRI images. Biocybern. Biomed. Eng. 2021, 41, 456–473.
6. Odusami, M.; Maskeliūnas, R.; Damaševičius, R.; Krilavičius, T. Analysis of features of Alzheimer’s disease: Detection of early stage from functional brain changes in magnetic resonance images using a finetuned ResNet18 network. Diagnostics 2021, 11, 1071.
7. Zhang, Y.; Wang, S.; Phillips, P.; Dong, Z.; Ji, G.; Yang, J. Detection of Alzheimer’s disease and mild cognitive impairment based on structural volumetric MR images using 3D-DWT and WTA-KSVM trained by PSOTVAC. Biomed. Signal Process. Control 2015, 21, 58–73.
8. Billones, C.D.; Demetria, O.J.L.D.; Hostallero, D.E.D.; Naval, P.C. DemNet: A convolutional neural network for the detection of Alzheimer’s disease and mild cognitive impairment. In Proceedings of the 2016 IEEE Region 10 Conference (TENCON), Singapore, 22–25 November 2016.
9. Kumar, L.S.; Hariharasitaraman, S.; Narayanasamy, K.; Thinakaran, K.; Mahalakshmi, J.; Pandimurugan, V. AlexNet approach for early stage Alzheimer’s disease detection from MRI brain images. Mater. Today Proc. 2022, 51, 58–65.
10. Forsberg, A.; Engler, H.; Almkvist, O.; Blomquist, G.; Hagman, G.; Wall, A.; Ringheim, A.; Långström, B.; Nordberg, A. PET imaging of amyloid deposition in patients with mild cognitive impairment. Neurobiol. Aging 2008, 29, 1456–1465.
11. Shimamura, A.P.; Salmon, D.P.; Squire, L.R.; Butters, N. Memory dysfunction and word priming in dementia and amnesia. Behav. Neurosci. 1987, 101, 347.
12. Morris, R.G.; Kopelman, M.D. The memory deficits in Alzheimer-type dementia: A review. Q. J. Exp. Psychol. 1986, 38, 575–602.
13. Quatieri, T.F.; Williamson, J.R.; Lambert, A.C. Noninvasive biomarkers of neurobehavioral performance. Linc. Lab. J. 2020, 24, 28–59.
14. Lin, H.; Karjadi, C.; Ang, T.F.; Prajakta, J.; McManus, C.; Alhanai, T.W.; Au, R. Identification of digital voice biomarkers for cognitive health. Explor. Med. 2020, 1, 406–417.
15. Robin, J.; Harrison, J.E.; Kaufman, L.D.; Rudzicz, F.; Simpson, W.; Yancheva, M. Evaluation of speech-based digital biomarkers: Review and recommendations. Digit. Biomark. 2020, 4, 99–108.
16. Thomas, J.A.; Burkhardt, H.A.; Chaudhry, S.; Ngo, A.D.; Sharma, S.; Zhang, L.; Au, R.; Hosseini Ghomi, R. Assessing the utility of language and voice biomarkers to predict cognitive impairment in the Framingham Heart Study Cognitive Aging Cohort Data. J. Alzheimer’s Dis. 2020, 76, 905–922.
17. Zhao, Q.; Wang, W.Q.; Fan, H.Z.; Li, D.; Li, Y.J.; Zhao, Y.L.; Tan, S.P. Vocal acoustic features may be objective biomarkers of negative symptoms in schizophrenia: A cross-sectional study. Schizophr. Res. 2022, 250, 180–185.
18. Simões-Zenari, M.; Batista, G.K.S.; de Oliveira Pagan-Neves, L.; Nemr, K.; Wertzner, H.F. Acoustic voice and spectrographic measures in children with the phonological process of devoicing. Int. J. Pediatr. Otorhinolaryngol. 2022, 157, 111137.
19. Pierce, D.L. Mismatch Negativity Event Related Potential Elicited by Speech Stimuli in Geriatric Patients; Brigham Young University: Provo, UT, USA, 2019.
20. Pommée, T.; Balaguer, M.; Pinquier, J.; Mauclair, J.; Woisard, V.; Speyer, R. Relationship between phoneme-level spectral acoustics and speech intelligibility in healthy speech: A systematic review. Speech Lang. Hear. 2021, 24, 105–132.
21. Han, C.; Jo, S.A.; Jo, I.; Kim, E.; Park, M.H.; Kang, Y. An adaptation of the Korean mini-mental state examination (K-MMSE) in elderly Koreans: Demographic influence and population-based norms (the AGE study). Arch. Gerontol. Geriatr. 2008, 47, 302–310.
22. Moon, Y.; Lim, J.-S.; Lee, C.-N.; Choi, H. Vulnerable strata to non-adherence and overuse in treatment for patients with cognitive impairment. Dement. Neurocogn. Disord. 2020, 19, 152.
23. Deary, I.J.; Corley, J.; Gow, A.J.; Harris, S.E.; Houlihan, L.M.; Marioni, R.E.; Penke, L.; Rafnsson, S.B.; Starr, J.M. Age-associated cognitive decline. Br. Med. Bull. 2009, 92, 135–152.
24. Ritchie, S.J.; Bates, T.C.; Deary, I.J. Is education associated with improvements in general cognitive ability, or in specific skills? Dev. Psychol. 2015, 51, 573–582.
25. Ziegler, W. Task-related factors in oral motor control: Speech and oral diadochokinesis in dysarthria and apraxia of speech. Brain Lang. 2002, 80, 556–575.
26. Shen, C. Individual Differences in Speech Production and Maximum Speech Performance. Ph.D. Thesis, Radboud University, Nijmegen, The Netherlands, 2022.
27. Tremblay, P.; Poulin, J.; Martel-Sauvageau, V.; Denis, C. Age-related deficits in speech production: From phonological planning to motor implementation. Exp. Gerontol. 2019, 126, 110695.
28. Devadiga, D.N.; Bhat, J.S. Oral diadokokinetic rate-an insight into speech motor control. Int. J. Adv. Res. 2012, 1, 10–14.
29. Kent, R.D.; Kim, Y.; Chen, L.-M. Oral and laryngeal diadochokinesis across the life span: A scoping review of methods, reference data, and clinical applications. J. Speech Lang. Hear. Res. 2022, 65, 574–623.
30. Cutchin, G.M.; Plexico, L.W.; Weaver, A.J.; Sandage, M.J. Data collection methods for the voice range profile: A systematic review. Am. J. Speech-Lang. Pathol. 2020, 29, 1716–1734.
31. Steurer, H.; Gustafsson, J.K.; Franzén, E.; Schalling, E. Using Portable Voice Accumulators to Study Transfer of Speech Outcomes Following Intervention—A Feasibility Study. J. Voice 2021, 38, 965.e1–965.e13.
32. Feinberg, D.R.; Jones, B.C.; Little, A.C.; Burt, D.M.; Perrett, D.I. Manipulations of fundamental and formant frequencies influence the attractiveness of human male voices. Anim. Behav. 2005, 69, 561–568.
33. Vieira, M.l.N.; McInnes, F.R.; Jack, M.A. On the influence of laryngeal pathologies on acoustic and electroglottographic jitter measures. J. Acoust. Soc. Am. 2002, 111, 1045–1055.
34. Teixeira, J.P.; Gonçalves, A. Accuracy of jitter and shimmer measurements. Procedia Technol. 2014, 16, 1190–1199.
35. Upadhya, S.S.; Cheeran, A.; Nirmal, J. Statistical comparison of Jitter and Shimmer voice features for healthy and Parkinson affected persons. In Proceedings of the 2017 Second International Conference on Electrical, Computer and Communication Technologies (ICECCT), Tamil Nadu, India, 22–24 February 2017.
36. Teixeira, J.P.; Gonçalves, A. Algorithm for jitter and shimmer measurement in pathologic voices. Procedia Comput. Sci. 2016, 100, 271–279.
37. Klára, V.; Viktor, I.; Krisztina, M. Voice disorder detection on the basis of continuous speech. In Proceedings of the 5th European Conference of the International Federation for Medical and Biological Engineering, Budapest, Hungary, 14–18 September 2011; Springer: Berlin/Heidelberg, Germany, 2012.
38. Robbins, J.; Fisher, H.B.; Blom, E.C.; Singer, M.I. A comparative acoustic study of normal, esophageal, and tracheoesophageal speech production. J. Speech Hear. Disord. 1984, 49, 202–210.
39. Dehqan, A.; Scherer, R.C.; Dashti, G.; Ansari-Moghaddam, A.; Fanaie, S. The effects of aging on acoustic parameters of voice. Folia Phoniatr. Logop. 2013, 64, 265–270.
40. Eskidere, Ö. A Comparison of feature selection methods for diagnosis of Parkinson’s disease from vocal measurements. Sigma 2012, 30, 402–414.
41. Teixeira, J.P.; Oliveira, C.; Lopes, C. Vocal acoustic analysis–jitter, shimmer and hnr parameters. Procedia Technol. 2013, 9, 1112–1122.
42. Oguz, H.; Tarhan, E.; Korkmaz, M.; Yilmaz, U.; Safak, M.A.; Demirci, M.; Ozluoglu, L.N. Acoustic analysis findings in objective laryngopharyngeal reflux patients. J. Voice 2007, 21, 203–210.
43. Farrús, M.; Hernando, J.; Ejarque, P. Jitter and shimmer measurements for speaker recognition. In Proceedings of the 8th Annual Conference of the International Speech Communication Association, Antwerp, Belgium, 27–31 August 2007; pp. 778–781.
44. Ding, H.; Zhang, Y. Speech prosody in mental disorders. Annu. Rev. Linguist. 2023, 9, 335–355.
45. Gandour, J.; Petty, S.H.; Dardarananda, R. Dysprosody in Broca’s aphasia: A case study. Brain Lang. 1989, 37, 232–257.
46. Al-Qatab, B.A.; Mustafa, M.B. Classification of dysarthric speech according to the severity of impairment: An analysis of acoustic features. IEEE Access 2021, 9, 18183–18194.
47. De Menezes, F.S.; Liska, G.R.; Cirillo, M.A.; Vivanco, M.J. Data classification with binary response through the Boosting algorithm and logistic regression. Expert Syst. Appl. 2017, 69, 62–73.
48. Fukunishi, H.; Nishiyama, M.; Luo, Y.; Kubo, M.; Kobayashi, Y. Alzheimer-type dementia prediction by sparse logistic regression using claim data. Comput. Methods Programs Biomed. 2020, 196, 105582.
49. Yang, H.; Bath, P.A. The use of data mining methods for the prediction of dementia: Evidence from the English longitudinal study of aging. IEEE J. Biomed. Health Inform. 2019, 24, 345–353.
50. Zhu, F.; Li, X.; Mcgonigle, D.; Tang, H.; He, Z.; Zhang, C.; Hung, G.U.; Chiu, P.Y.; Zhou, W. Analyze informant-based questionnaire for the early diagnosis of senile dementia using deep learning. IEEE J. Transl. Eng. Health Med. 2019, 8, 2200106.
51. Nagarajah, T.; Poravi, G. A review on automated machine learning (AutoML) systems. In Proceedings of the 2019 IEEE 5th International Conference for Convergence in Technology (I2CT), Bombay, India, 29–31 March 2019.
52. Kotthoff, L.; Thornton, C.; Hoos, H.H.; Hutter, F.; Leyton-Brown, K. Auto-WEKA: Automatic model selection and hyperparameter optimization in WEKA. In Automated Machine Learning: Methods, Systems, Challenges; Springer: Cham, Switzerland, 2019; pp. 81–95.
53. Feurer, M.; Hutter, F. Hyperparameter optimization. In Automated Machine Learning: Methods, Systems, Challenges; Springer: Cham, Switzerland, 2019; pp. 3–33.
54. Radzi, S.F.M.; Karim, M.K.A.; Saripan, M.I.; Rahman, M.A.A.; Isa, I.N.C.; Ibahim, M.J. Hyperparameter tuning and pipeline optimization via grid search method and tree-based autoML in breast cancer prediction. J. Pers. Med. 2021, 11, 978.
55. Cao, B.; Liu, Y.; Hou, C.; Fan, J.; Zheng, B.; Yin, J. Expediting the accuracy-improving process of svms for class imbalance learning. IEEE Trans. Knowl. Data Eng. 2020, 33, 3550–3567.
56. Lundberg, S.M.; Lee, S.-I. A unified approach to interpreting model predictions. Adv. Neural Inf. Process. Syst. 2017, 30, 1–10.
57. Hart, S. Shapley value. In Game Theory; Springer: Berlin/Heidelberg, Germany, 1989; pp. 210–216.
58. Torre, P., III; Barlow, J.A. Age-related changes in acoustic characteristics of adult speech. J. Commun. Disord. 2009, 42, 324–333.
59. Nishio, M.; Tanaka, Y.; Niimi, S. Analysis of age-related changes in the acoustic characteristics of voices. J. Commun. Res. 2011, 2, 65.
60. Perkell, J.S.; Nelson, W.L. Variability in production of the vowels /i/ and /a/. J. Acoust. Soc. Am. 1985, 77, 1889–1895.
61. Stevens, K.N.; House, A.S. Development of a quantitative description of vowel articulation. J. Acoust. Soc. Am. 1955, 27, 484–493.
62. Barbosa, A.F.; Voos, M.C.; Chen, J.; Francato, D.C.V.; Souza, C.D.O.; Barbosa, E.R.; Chien, H.F.; Mansur, L.L. Cognitive or Cognitive-Motor Executive Function Tasks? Evaluating Verbal Fluency Measures in People with Parkinson’s Disease. BioMed Res. Int. 2017, 2017, 7893975.
63. Baghai-Ravary, L.; Beet, S.W. Automatic Speech Signal Analysis for Clinical Diagnosis and Assessment of Speech Disorders; Springer Science & Business Media: Berlin/Heidelberg, Germany, 2012.
64. Decoster, W.; Debruyne, F. Longitudinal voice changes: Facts and interpretation. J. Voice 2000, 14, 184–193.
65. Ramig, L.O.; Scherer, R.C.; Klasner, E.R.; Titze, I.R.; Horii, Y. Acoustic analysis of voice in amyotrophic lateral sclerosis: A longitudinal case study. J. Speech Hear. Disord. 1990, 55, 2–14.

| Group | Age (Mean) | Age (SD) | Age (Range) | Years of Education (Mean) | Years of Education (SD) | Years of Education (Range) | Male | Female | Total |
|---|---|---|---|---|---|---|---|---|---|
| 0 ≤ MMSE ≤ 19 | 72.44 | 8.57 | 46–87 | 7.10 | 4.49 | 0–16 | 32 | 40 | 72 |
| 19 < MMSE ≤ 23 | 71.11 | 8.36 | 48–84 | 8.24 | 4.05 | 0–16 | 31 | 23 | 54 |
| 23 < MMSE ≤ 30 | 64.80 | 11.30 | 30–85 | 10.93 | 4.13 | 0–16 | 43 | 54 | 97 |

| Category | Feature | Description |
|---|---|---|
| Voice Quality | Jitter (PPQ5) | Period Perturbation Quotient: Average rate of pitch change over 4 contiguous analysis intervals |
| Voice Quality | Jitter (local) | Cycle-to-cycle variation in fundamental frequency; measures frequency instability in voice |
| Voice Quality | Jitter (DDP) | Difference of Differences of Periods: Mean change in pitch period across consecutive intervals |
| Voice Quality | Jitter (RAP) | Relative Average Perturbation: Average change in pitch period over two neighboring intervals |
| Voice Quality | Jitter (absolute) | Absolute change in pitch between adjacent periods |
| Voice Quality | Shimmer (DDA) | Average absolute difference between consecutive amplitude differences |
| Voice Quality | Shimmer (local) | Cycle-to-cycle variation in amplitude; measures amplitude instability in voice |
| Voice Quality | Shimmer (APQ5) | Amplitude Perturbation Quotient for 5 cycles: Average amplitude variability over 5 cycles |
| Voice Quality | Shimmer (localdb) | Mean absolute difference between consecutive dB amplitude values |
| Voice Quality | Shimmer (APQ3) | Amplitude Perturbation Quotient for 3 cycles: Mean amplitude variability over 3 cycles |
| Voice Quality | Shimmer (APQ11) | Amplitude Perturbation Quotient for 11 cycles: Mean amplitude variability over 11 cycles |
| Voice Quality | NHR | Harmonic-to-Noise Ratio: Measure of voice clarity; ratio of periodic to random energy |
| Prosody (Speech Rate) | Duration | Length of speech sounds in controlled task settings |
| Prosody (Pitch) | Avg. Formant | Average of the formant frequencies |
| Prosody (Pitch) | Mean F0 | Average fundamental frequency of voice |
| Prosody (Pitch) | StDev F0 | Variation in fundamental frequency |
| Prosody (Pitch) | Mean F1–F4 | Average of first through fourth formant frequencies |
| Prosody (Pitch) | Median F1–F4 | Median values of first through fourth formant frequencies |

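To make the perturbation measures in the feature table concrete, the sketch below computes local jitter, RAP, and DDP from a sequence of glottal periods. It is an illustration only: the formulas follow the commonly used Praat-style definitions, which is an assumption because the extraction software is not named in this section, and the period values are hypothetical. Applying `local_perturbation` to cycle peak amplitudes instead of periods gives local shimmer in the same way.

```python
def _mean(values):
    return sum(values) / len(values)


def local_perturbation(values):
    """Mean absolute difference between consecutive cycles, relative to the mean.

    With glottal periods this is local jitter; with peak amplitudes, local shimmer.
    """
    diffs = [abs(b - a) for a, b in zip(values, values[1:])]
    return _mean(diffs) / _mean(values)


def rap(periods):
    """Relative average perturbation: each period versus the mean of itself
    and its two neighbours, relative to the mean period."""
    devs = [
        abs(periods[i] - (periods[i - 1] + periods[i] + periods[i + 1]) / 3)
        for i in range(1, len(periods) - 1)
    ]
    return _mean(devs) / _mean(periods)


def ddp(periods):
    """Mean absolute difference between consecutive period differences,
    relative to the mean period (algebraically three times RAP)."""
    second = [
        abs((periods[i + 1] - periods[i]) - (periods[i] - periods[i - 1]))
        for i in range(1, len(periods) - 1)
    ]
    return _mean(second) / _mean(periods)


# Hypothetical glottal periods in seconds (roughly a 100 Hz voice).
periods = [0.0100, 0.0102, 0.0099, 0.0101, 0.0100]

if __name__ == "__main__":
    print(f"jitter (local): {local_perturbation(periods):.4%}")
    print(f"jitter (RAP):   {rap(periods):.4%}")
    print(f"jitter (DDP):   {ddp(periods):.4%}")
```

Because DDP is three times RAP under these definitions, the two features carry the same information, which is worth keeping in mind when reading feature importance rankings that include both.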
| Characteristic | Severe (n = 72) | Mild (n = 54) | Normal (n = 97) | Test Statistic | p-Value |
|---|---|---|---|---|---|
| Age, years | 72.44 ± 8.57 | 71.11 ± 8.36 | 64.80 ± 11.30 | H = 24.07 † | <0.001 |
| Education, years | 7.10 ± 4.49 | 8.24 ± 4.05 | 10.93 ± 4.13 | H = 32.83 † | <0.001 |
| Sex, n (%) | | | | χ² = 2.79 ‡ | 0.248 |
| Male | 32 (44.4%) | 31 (57.4%) | 43 (44.3%) | | |
| Female | 40 (55.6%) | 23 (42.6%) | 54 (55.7%) | | |

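The chi-square statistic reported for sex can be checked directly from the group counts (32/31/43 male and 40/23/54 female across the severe, mild, and normal groups). The sketch below is a worked check rather than the authors' analysis code, and `chi_square_independence` is a hypothetical helper name. It relies on the fact that, for 2 degrees of freedom, the chi-square upper-tail probability is exp(-x/2).

```python
from math import exp


def chi_square_independence(table):
    """Pearson chi-square statistic and degrees of freedom for an r x c table."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    total = sum(row_totals)
    stat = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            # Expected count under independence of sex and group.
            expected = row_totals[i] * col_totals[j] / total
            stat += (observed - expected) ** 2 / expected
    dof = (len(table) - 1) * (len(table[0]) - 1)
    return stat, dof


# Rows: male, female; columns: severe, mild, normal.
sex_counts = [[32, 31, 43], [40, 23, 54]]
stat, dof = chi_square_independence(sex_counts)
# Valid only because dof == 2: the survival function reduces to exp(-x / 2).
p_value = exp(-stat / 2)

if __name__ == "__main__":
    print(f"chi2 = {stat:.2f}, dof = {dof}, p = {p_value:.3f}")
```

Run on these counts, the check gives a statistic of about 2.79 with p of about 0.248, in line with the reported values.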
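| Comparison | p-Value * |
|---|---|
| Age | |
| Severe vs. Mild | 0.956 |
| Severe vs. Normal | <0.001 |
| Mild vs. Normal | 0.003 |
| Education | |
| Severe vs. Mild | 0.532 |
| Severe vs. Normal | <0.001 |
| Mild vs. Normal | <0.001 |
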
| Comparison | Model | PR-AUC | AUC | Accuracy | F1 | Precision | Recall |
|---|---|---|---|---|---|---|---|
| Severe vs. Normal group | DNN | 0.737 | 0.716 | 0.618 | 0.666 | 0.542 | 0.867 |
| Severe vs. Normal group | GBM | 0.716 | 0.730 | 0.706 | 0.688 | 0.647 | 0.733 |
| Severe vs. Normal group | LM | 0.632 | 0.698 | 0.647 | 0.700 | 0.560 | 0.933 |
| Severe vs. Normal group | RF | 0.516 | 0.637 | 0.529 | 0.652 | 0.484 | 1.000 |
| Normal vs. Mild + Severe group | RF | 0.715 | 0.659 | 0.533 | 0.685 | 1.000 | 0.521 |
| Normal vs. Mild + Severe group | GBM | 0.682 | 0.651 | 0.573 | 0.686 | 0.921 | 0.547 |
| Normal vs. Mild + Severe group | LM | 0.680 | 0.637 | 0.560 | 0.692 | 0.974 | 0.536 |
| Normal vs. Mild + Severe group | DNN | 0.659 | 0.633 | 0.560 | 0.692 | 0.974 | 0.536 |
| Mild vs. Normal group | DNN | 0.726 | 0.794 | 0.667 | 0.688 | 0.524 | 1.000 |
| Mild vs. Normal group | RF | 0.630 | 0.785 | 0.800 | 0.727 | 0.727 | 0.727 |
| Mild vs. Normal group | LM | 0.597 | 0.636 | 0.500 | 0.571 | 0.417 | 0.909 |
| Mild vs. Normal group | GBM | 0.583 | 0.785 | 0.800 | 0.750 | 0.692 | 0.818 |

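For readers less familiar with the metrics in the performance table, the sketch below shows one common way to estimate PR-AUC (step-wise average precision) and how F1 follows from precision and recall. It is a minimal illustration: the toy labels and scores are hypothetical, tied scores are not handled, and the study's own software may integrate the precision-recall curve differently.

```python
def average_precision(labels, scores):
    """Step-wise area under the precision-recall curve.

    Averages the precision observed at the rank of each positive case
    when cases are sorted by descending score. Assumes no tied scores.
    """
    n_pos = sum(labels)
    if n_pos == 0:
        raise ValueError("need at least one positive label")
    ranked = sorted(zip(scores, labels), key=lambda pair: pair[0], reverse=True)
    hits = 0
    total = 0.0
    for rank, (_, label) in enumerate(ranked, start=1):
        if label == 1:
            hits += 1
            total += hits / rank
    return total / n_pos


def f1_score(precision, recall):
    """Harmonic mean of precision and recall."""
    return 2 * precision * recall / (precision + recall)


if __name__ == "__main__":
    # Hypothetical toy data: 1 = impaired, 0 = normal.
    labels = [1, 0, 1, 1, 0]
    scores = [0.9, 0.8, 0.7, 0.6, 0.2]
    print(f"PR-AUC (average precision): {average_precision(labels, scores):.3f}")
    # Precision and recall of the RF row for Normal vs. Mild + Severe.
    print(f"F1: {f1_score(1.000, 0.521):.3f}")
```

A random ranking scores roughly the positive-class prevalence on this metric, so PR-AUC values are best read against that baseline rather than against 0.5. Recomputing F1 from the rounded precision and recall in the table can differ from the reported F1 in the last digit.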
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Lee, J.; Kim, N.; Ha, J.-W.; Kang, K.; Park, E.; Yoon, J.; Park, K.-S. Exploring Voice Acoustic Features Associated with Cognitive Status in Korean Speakers: A Preliminary Machine Learning Study. Diagnostics 2024, 14, 2837. https://doi.org/10.3390/diagnostics14242837