Next Article in Journal
Improving Patient Outcomes While Reducing Empirical Treatment with Multiplex-Polymerase-Chain-Reaction/Pooled-Antibiotic-Susceptibility-Testing Assay for Complicated and Recurrent Urinary Tract Infections
Previous Article in Journal
Percutaneous Ablation of T1 Renal Masses: Comparative Local Control and Complications after Radiofrequency and Cryoablation
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Keystroke Biometrics as a Tool for the Early Diagnosis and Clinical Assessment of Parkinson’s Disease

1
Department of Computer Science and Information Engineering, Advanced Institute of Manufacturing with High-Tech Innovations, National Chung Cheng University, Chiayi 621301, Taiwan
2
Department of Physical Medicine and Rehabilitation, Hualien Tzu Chi Hospital, Buddhist Tzu Chi Medical Foundation, Hualien 970473, Taiwan
3
Department of Biomedical Engineering, College of Engineering, National Cheng Kung University, Tainan 701401, Taiwan
4
Department of Medicine, School of Medicine, Tzu Chi University, Hualien 970374, Taiwan
5
Department of Neurology, Hualien Tzu Chi Hospital, Buddhist Tzu Chi Medical Foundation, Tzu Chi University, Hualien 970473, Taiwan
*
Author to whom correspondence should be addressed.
Diagnostics 2023, 13(19), 3061; https://doi.org/10.3390/diagnostics13193061
Submission received: 2 August 2023 / Revised: 22 September 2023 / Accepted: 23 September 2023 / Published: 26 September 2023
(This article belongs to the Section Pathology and Molecular Diagnostics)

Abstract

:
(1) Background: Parkinson’s disease (PD) is the second most common neurodegenerative disease. Early diagnosis and reliable clinical assessments are essential for appropriate therapy and improving patients’ quality of life. Keystroke biometrics, which capture unique typing behavior, have shown potential for early PD diagnosis. This study aimed to evaluate keystroke biometric parameters from two datasets to identify indicators that can effectively distinguish de novo PD patients from healthy controls. (2) Methods: Data from natural typing tasks in Physionet were analyzed to estimate keystroke biometric parameters. The parameters investigated included alternating-finger tapping (afTap) and standard deviations of interkey latencies (ILSD) and release latencies (RLSD). Sensitivity rates were calculated to assess the discriminatory ability of these parameters. (3) Results: Significant differences were observed in three parameters, namely afTap, ILSD, and RLSD, between de novo PD patients and healthy controls. The sensitivity rates were high, with values of 83%, 88%, and 96% for afTap, ILSD, and RLSD, respectively. Correlation analysis revealed a significantly negative correlation between typing speed and number of words typed with the standard motor assessment for PD, UPDRS-III, in patients with early PD. (4) Conclusions: Simple algorithms utilizing keystroke biometric parameters can serve as effective screening tests in distinguishing de novo PD patients from healthy controls. Moreover, typing speed and number of words typed were identified as reliable tools for assessing clinical statuses in PD patients. These findings underscore the potential of keystroke biometrics for early PD diagnosis and clinical severity assessment.

1. Introduction

Parkinson’s disease (PD) is the second most common neurodegenerative disease in developed countries [1]. The disease is characterized by a range of manifestations, including bradykinesia, tremor, rigidity, and loss of postural balance, typically resulting from the progressive degeneration of dopaminergic neurons in the basal ganglia [2]. Current pharmacological therapies mainly aim to restore dopaminergic imbalance by using dopamine precursors, monoamine oxidase inhibitors, dopaminergic agonists, anticholinergic agents, and other drugs [3]. Levodopa effectively improves motor problems in patients with PD. However, the long-term administration of levodopa can lead to multiple unpredictable problems including dyskinesia, fluctuations in motor responses to levodopa for example, on–off or freezing, and impaired cognition. Furthermore, the use of current antiparkinsonian agents may aggravate the progression of PD [4,5]. According to these clinical observations, various preclinical and clinical trials are underway for the development of neuroprotective therapy, cell therapy, and anti-inflammatory treatments [6].
Studies have revealed that non-specific non-motor symptoms, such as constipation, sleep disorders, and depression, may manifest several years or decades before the diagnosis of PD [7]. Early and adequate therapeutic intervention can improve patients’ symptoms and their quality of life [2,8]. Additionally, suboptimal medication adherence, including underuse, overuse, and irregular use, is another common issue in clinical practice for PD. Due to motor complications, the excessive use of levodopa is another prevalent problem among patients with PD. Some patients with PD tend to overuse levodopa as it provides relief from their motor disability [9] Therefore, an objective and quantitative motor assessment for these patients under dopamine replacement therapy is important [10].
Motor dysfunctions in patients with PD are diverse and can be categorized into two major subtypes: tremor-dominant and non-tremor dominant. Patients with tremor-dominant PD typically exhibit more apparent clinical features and tend to have better outcomes compared to those with the non-tremor subtype. The diagnosis of non-tremor dominant PD can be challenging as the symptoms may not be as obvious as those in the tremor-dominant subtype. Additionally, motor dysfunctions such as rigidity and bradykinesia may not become apparent until the disease has already progressed. These symptoms can also be similar to those seen in other neurological disorders, leading to potential misdiagnosis or delayed diagnosis and treatment. The misdiagnosis of early PD has been reported to range from 40% to 70% [11,12]. Early diagnosis and the implementation of appropriate therapeutic strategies can significantly improve patients’ quality of life [13]. Researchers are investigating various biomarkers of PD, such as brain imaging, cerebrospinal fluid analyses, mutant genes, and genetic polymorphisms, in order to develop tools for early diagnosis, to monitor disease progression, and to evaluate the effectiveness of treatments [14].
As PD presents with a range of clinical features, including both motor and non-motor symptoms, a multidimensional assessment tool, the Unified Parkinson’s Disease Rating Scale (UPDRS), has been widely used in clinical evaluations and research [15]. The third part of the UPDRS (UPDRS-III), designed to assess motor functions in PD patients, has been used to determine the severity of PD and track the progression of the disease over time [16,17]. Given that tremor and locomotor dysfunctions, such as bradykinesia, festinating gait, and trunk imbalance, are the predominant motor manifestations of PD, kinesiological biomarkers such as gait and postural analyses, as well as tremography, are more relevant in a clinical setting than biological markers [18]. With the advancement of wearable and detection devices, both motor and non-motor signs of PD can be accurately assessed using several available devices [19,20]. Nevertheless, several non-invasive techniques have been studied. However, weaknesses and drawbacks exist, challenging the future applications of these methods [21]. Keystroke biometrics refers to the distinctive typing pattern and rhythm of an individual’s typing behavior. This type of biometric analysis examines keystroke dynamics, such as the timing between keystrokes, the duration of each keystroke, and the pressure applied to each key, to create a unique typing profile. Keystroke biometrics has been utilized for authentication purposes [22,23]. While keystroke biometrics have primarily found application in the domains of security and user authentication, their utilization in clinical settings is an emerging and relatively novel area of research. As a result, only a limited number of clinical studies have explored this biometric modality for various applications [24]. Some researchers have employed neural networks and sophisticated methods to analyze keystroke biometrics for the early diagnosis of Parkinson’s disease [25,26,27,28]. In 2016, Giancardo and colleagues proposed that keystroke biometrics could provide an early diagnostic indicator for PD [29]. The raw data from their study, including on UPDRS-III scores, typing speeds, the numbers of typed words, and letters typed in a natural typing task are freely available from PhysioNet [30]. In this study, we extracted and assessed these keystroke biometric datasets with the aim of identifying easy-to-use and sensitive indicators for the early diagnosis of PD and relevant parameters for assessing the severity and progression of PD. Our study innovatively applies keystroke biometrics to address critical clinical questions related to the diagnosis, severity assessment, and disease progression tracking of PD. As a result, we aim to introduce a novel, non-invasive, and user-friendly tool for clinicians and researchers in the field of neurodegenerative diseases. Furthermore, following the protocol outlined in the original article, this method can also be effectively applied in telemedicine.

2. Materials and Methods

2.1. Data Source and Subjects

We retrieved two datasets on a natural typing task from PhysioNet [30]. The demographic data, clinical status, experimental protocols, and keystroke biometric parameters assessed, including the neuroQWERTY index (nQi), single-key tapping (sTap), and alternating-finger tapping (afTap), were described in the previous publication [29]. In the “single key tapping” assessment, participants were instructed to rapidly press a single button for a duration of 60 s, initially using their dominant hand and subsequently with their non-dominant hand. The resulting score was defined as the average number of button presses achieved across both hands. In the “alternating finger tapping” test, participants were asked to alternately press two buttons, positioned approximately 25 cm apart, using their index finger. This test was conducted for both hands, and the ultimate score was defined as the average number of button presses performed across both hands. Four controls and five patients with early PD could not complete the afTap test. In the natural typing task, the participants were asked to type a folk tale on a standard word processor. They were required to type as they usually do, and they were allowed to freely correct any mistake. Each subject received a randomly selected folk tale to prevent the repetition of content and mitigate potential learning effects.
Two datasets are available in PhysioNet. The early PD dataset included 18 patients with early PD (Hoehn–Yahr stages I and II) without motor fluctuations and 13 healthy controls. The second dataset recruited 24 patients with de novo PD (i.e., newly diagnosed patients who were not yet taking antiparkinsonian agents) and 30 healthy subjects. Each of the enrolled subjects reported using a laptop or desktop computer for at least 30 min every day prior to the test. The participants performed a typing task twice. To avoid issues related to imprinting and adaptation, we used the dataset from the first typing test for analysis. In the current study, we assessed a total of 43 healthy controls from the two datasets.

2.2. Data Processing

The keystroke data were recorded as the time points when the keys were pressed and released during a natural typing task. Therefore, four time intervals were identified: holding latency (HL), the interval between release and press time points of a key; interkey latency (IL), the interval between the release time point of a keystroke and press time point of the consecutive keystroke; press latency (PL), the interval between two consecutive press time points; and release latency (RL), the interval between two consecutive release time points (Figure 1) [31].

2.3. Management of Outliers

Several data points in the datasets had unusually large values that were likely the result of unknown errors. These outliers were removed from the analysis, as were any time points that resulted in negative values. We also utilized Boxplots to identify outliers, defining extreme outliers as those falling outside the range that lay between Q1 − 3 × IQR and Q3 + 3 × IQR, where IQR = Q3 − Q1 [32]. However, due to the unusual distribution of outliers in the datasets, following this outlier definition would have led to the exclusion of too many potentially meaningful samples. To strike a balance, we conducted a thorough visual examination of the raw data, subject-by-subject, and optimized the 12 × IQR threshold to maximize the retention of meaningful samples.

2.4. Assessments of Fluctuation of the Time Intervals

To estimate fluctuations in time intervals, we employed a modified version of Lan’s method [33]. We calculated the standard deviations (SDs) of the natural log of the quotient of two consecutive time intervals. Our approach differed from the published method, which only included alphanumeric, symbol, and space bar keys. We included time points for all keystrokes. Standard deviations of holding latency, interkey latency, press latency, and release latency were conducted and remarked as HLSD, ILSD, PLSD, and RLSD.

2.5. Statistical Analyses

Data are expressed as means ± standard deviation (SD). The significance of the differences in demographic data, UPDRS-III, and keystroke biometric parameters between the healthy controls, patients with de novo PD, and patients with early PD were examined using a nonparametric ANOVA with Tukey post hoc test. The correlations between UPDRS-III and the keystroke biometric parameters were evaluated using Spearman’s rank correlation coefficients. In receiver operating characteristic curve (ROC curve), the 95% confidence interval (CI) for the area under the curve is an exact binomial CI. Paired comparisons of ROC curves were performed by using DeLong’s test as described previously [34]. All statistical analyses were performed using STATA software (Version 16.0 for Windows; STATA Corp. LLC, College Station, TX, USA). Statistical significance was defined as p < 0.05 for all tests.

3. Results

3.1. Demographic Data of the Participants

The datasets from PhysioNet did not provide demographic raw data on participants. However, the original article reported no significant differences in age, sex, or years of education between healthy controls and patients with de novo and early PD [29]. We made modifications to the grouping method used in the original publication and recruited healthy controls from the two retrieved datasets as a single control group. As shown in Table 1, there were no significant differences in age and average years of education among the three groups: healthy controls (mean age: 60.10 ± 10.20 years, mean education: 15.30 ± 5.20 years), patients with de novo PD (mean age: 61.40 ± 10.50 years, mean education: 15.50 ± 3.80 years), and patients with early PD (mean age: 55.90 ± 8.00 years, mean education: 14.83 ± 4.60 years). We removed some outlier keystroke time points from the datasets of each participant, and the percentages of removed time points were 0.56% ± 0.64% in controls, 0.40% ± 0.53% in patients with de novo PD, and 0.36% ± 0.42% in patients with early PD. The average duration after diagnosis was 1.60 ± 1.22 years for patients with de novo PD and 3.89 ± 1.23 years for patients with early PD. There was only one statistical significance in ‘Average duration after diagnosis’ of PD between the de novo PD and early PD participants, found using a nonparametric ANOVA with a Tukey post hoc test.

3.2. Comparisons of Clinical Severity and Keystroke Parameters in Controls and Patients with De Novo and Early PD

Table 2 shows the comparisons of the UPDRS-III, nQi, and keystroke biometric parameters among the three groups. The healthy control group had significantly lower UPDRS-III, nQi, afTap, SD, ILSD, and RLSD values compared to both the patients with de novo PD and the patients with early PD. There were no significant differences in any of these parameters between the patients with PD and the patients with de novo PD.

3.3. The Value of Keystroke Biometric Parameters for Early Diagnosis of PD

We used Receiver Operating Characteristic (ROC) curves to assess the ability of these typestroke parameters to differentiate de novo PD participants from the healthy controls. Notably, we focused our analysis on the discrimination between patients with de novo PD and healthy controls rather than the discrimination between the healthy controls and patients with either de novo PD or early PD in previous publications [29,33]. Figure 2a shows that nQi, afTap, SD, RLSD, and ILSD are significantly effective in the early detection of PD. The area under the curve (AUC) values of afTap, ILSD, and RLSD (0.751, 0.761, and 0.814, respectively) are higher than the previously reported methods of nQi and SD (0.729 and 0.716, respectively). Conversely, Figure 2b demonstrates that the AUC values of RLSD and ILSD decreased (0.560 and 0.625, respectively) while the AUC values of nQi and SD increased to 0.892 and 0.842. The AUC value of afTap was 0.785. However, there is no statistically significant difference between these groups.
We determined the optimal cut-off point for each parameter by using the maximal Youden indices [35] and evaluated the sensitivity and specificity of each parameter in distinguishing patients with de novo PD from the healthy controls. Table 3 shows the sensitivity and specificity for nQi, SD, afTap, ILSD, and RLSD to differentiate de novo PD and early PD patients from the healthy controls.

3.4. The Correlations between Clinical Severity and Keystroke Biometric Parameters in Patients with PD

Due to the small sample size, we used Spearman’s rank correlation coefficient to assess the correlations between the UPDRS-III and keystroke parameters. As depicted in Table 4, we observed strong negative correlations between typing speed and number of words typed with UPDRS-III in patients with early PD, with correlation coefficients of −0.757 and −0.733, respectively. In patients with de novo PD, the number of words typed and afTP exhibited moderate negative correlations with UPDRS-III, with correlation coefficients of −0.452 and −0.484, respectively. However, none of the other keystroke biometric parameters analyzed in this study displayed a significant correlation with UPDRS-III, either in patients with de novo PD or in patients with early PD.

4. Discussion

In this study, we analyzed the keystroke biometric parameters retrieved from the raw data of two datasets on PhysioNet, comparing healthy controls with patients with de novo or early PD. To ensure the accuracy of our results, we checked the datasets for errors and found a small number of outliers. Outliers can occur in any dataset, and they can significantly impact data analyses [36]. These outliers were not mentioned in previous studies [29,33]. Despite the presence of outliers, we found no significant differences in their occurrence among the three groups. Based on these findings, we concluded that removing these outliers would not affect the results of our data analysis. Our results were comparable to nQi and SD, as reported in the literatures [29,33].
To make an easier and reasonable assessment, we adapted Lan’s method by incorporating all time points for the typed keys. Our findings in Table 2 indicate significant differences between patients with PD and controls in UPDRS-III, nQi, afTap, SD, ILSD, and RLSD. In this study, our objective was to develop an assisting diagnostic tool for the early detection of de novo PD, distinct from previous studies that encompassed patients with either de novo or early PD [29,33]. Our results, shown in Figure 2a, reveal that the nQi, afTap, SD, ILSD, and RLSD parameters had similar AUC values in differentiating de novo PD patients from the healthy controls. We used maximal Youden’s indices to define cut-off points for each parameter [35] and compared their sensitivity and specificity. nQi and SD had lower sensitivity rates (58% and 42%, respectively) than previously reported, possibly due to our strict grouping method. However, the afTap, ILSD, and RLSD parameters showed higher sensitivity rates (83%, 88%, and 96%, respectively), making them more suitable for screening tests (Table 3). These parameters had lower specificity, but high sensitivity is crucial for a screening test as it helps to identify individuals who may require further diagnostic procedures [37]. Conversely, the AUC values of RLSD and ILSD decreased while the AUC values of nQi and afTap increased when differentiating early PD patients from the healthy controls (Figure 2b). Additionally, the sensitivity of nQi, afTap, and SD increased, but RLSD and ILSD decreased significantly when distinguishing early PD patients from the healthy controls. Conversely, the specificity of these two parameters increased. This change could be attributed to the fact that subjects in the early PD group had been undergoing treatment for several years. Based on these findings, we propose that nQi and afTap are valuable indicators for detecting diagnosed PD. The decreased sensitivity of RLSD and ILSD suggests that these two parameters could serve as reliable markers for assessing the efficacy of PD treatment while their high specificity indicates their usefulness in confirming a PD diagnosis. These results highlight the novelty of our grouping methods compared to previous publications [29,33].
Lan and Giancardo suggested that HL could be a viable indicator for the early detection of PD. However, our current study found that the analyzed ILSD and RLSD were more effective than HL in detecting early PD. As per the definitions of keystroke biometric parameters, IL is the time interval between the release of one key and the subsequent press of another key, while RL is the interval between the release time points of two consecutive keystrokes. HL, on the other hand, measures the muscle activity involved in contracting and relaxing during typing. According to Lan’ s speculation, the SD represents the fluctuation of the time intervals [33]. While rigidity and bradykinesia are the primary symptoms of PD, they result in consistently increased muscle tone, such as in cogwheel or lead pipe rigidity, which would not impact the variation within HL. However, HLSD may increase in other neurological conditions that cause inconsistent muscle tone such as spasticity resulting from stroke, dystonia, and dyssynergia in cerebellar disorders [38]. In contrast, IL refers to the time duration between releasing one key and pressing the next key. This motion involves a more intricate motor process including initiation, programming, and executing the movement of the motor machinery [12]. Based on this hypothesis, IL can serve as a more accurate indicator for diagnosing PD compared to HL. Furthermore, in the current typing test, IL can also be influenced by a person’s reading ability and vision, suggesting that IL may be a more sensitive measure than HL in the early diagnosis of PD with cognitive impairment. Additionally, ILSD could potentially serve as a screening tool for cognitive and visual disorders as deviations from normal ILSD values could indicate underlying neurological impairment. The afTap parameter measures consecutive random typing and is simpler than the natural typing task. On the other hand, the afTap parameter still involves a complex motor control system but is less affected by comprehension and vision. Therefore, the afTap parameter may be more relevant for the early detection of PD than the natural typing task.
Several kinesiological parameters, including gait, tremor, voice, and typing behavior, have been utilized to evaluate the severity of PD [39,40,41]. From a clinical perspective, assessments of gait, posture, and keystroke biometric parameters are relevant for monitoring the clinical course of patients with PD. UPDRS-III is a tool used to assess clinical severity, particularly in the context of disease progression and therapeutic efficacy [42,43]. The other aim of this study was to identify reliable keystroke biometric parameters that correlate with UPDRS-III and can be used to monitor the progression of PD. Interestingly, as shown in Table 3, only typing speed and number of words typed exhibited strong negative correlations with UPDRS-III as per Spearman’s rank correlation coefficient analyses. However, there were no significant differences in these two parameters between healthy controls and patients with PD, and they had poor ability to differentiate between de novo patients with PD and healthy controls. These findings suggest that typing speed and number of words typed could serve as reliable indicators for monitoring disease progression but are not suitable as screening assessments for PD diagnosis. The keystroke parameters discovered in this study have the potential to significantly improve diagnostic accuracy for PD. Furthermore, they can be seamlessly integrated into telemedicine protocols as originally proposed in the datasets [29]. Additionally, keystroke biometrics can be readily incorporated into smartphones to enable remote and longitudinal PD assessment, similar to applications in patients with multiple sclerosis [44]. Furthermore, the possibility of merging patient databases in a cloud service opens up avenues for remote diagnosis, assessment, extensive data analysis, and drug development. However, it is crucial to underscore the importance of data privacy and security when implementing keystroke biometrics in clinical practice, especially within the realms of cloud services and telemedicine.

5. Study Limitations

Our study is constrained by the relatively small dataset obtained from PhysioNet, which limits the generalizability of our findings. Additionally, the dataset lacks information on the participants’ drug histories, which could have confounded the results of our keystroke biometric analyses [28]. Meanwhile some risk factors, such as family history, physical activity, occupation hazards, and comorbidity, were not available in the datasets either. Moreover, the dataset encompasses heterogeneity in disease stages and treatment strategies, further impacting the robustness of our analysis. Therefore, this study aimed to straightforwardly assess the applicability of keystroke biometrics for detecting and assessing PD, drawing comparisons with previous studies.

6. Conclusions

This study suggests that the keystroke biometric parameters afTap, ILSD, and RLSD are easy to assess and represent reliable tools for diagnostic screening for PD. Moreover, the typing speed and number of words typed may be useful for monitoring the clinical severity and progression of PD. These simple algorithms could be easily embedded in a laptop, tablet, or smartphone to enable the continuous remote monitoring of the clinical statuses of patients with PD and may help clinicians improve the accuracy of early diagnosis and the monitoring of disease progression.

Author Contributions

W.-M.L. conducted the data analysis, manuscript preparation, and discussion; C.-L.Y. performed data analysis; P.-W.C. and C.-W.L. ummarized the analytic results and discussion. A.-B.L. contributed to the study design and data analyses and reviewed the manuscript. All authors have read and agreed to the published version of the manuscript.

Funding

This study was partly sponsored by research grants from the Hualien Tzu Chi General Hospital and Buddhist Tzuchi Medical Foundation Academic Advancement, Hualien, Taiwan, including grant numbers TCMF-A-108-10 (H.-R.L.), TCMF-P-111-11, TCRD 112-076, and TCRD 112-003 (P.-W.C.). This work was also partly sponsored by a research grant from the Ministry of Science and Technology, Taiwan, R.O.C. (grant number MOST 110-2221-E-194-011).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Data used in this investigation are available under the ODC Public Domain Dedication and from the PhysioNet repository at https://archive.physionet.org/physiobank/database/nqmitcsxpd/?C=S;O=A (accessed on 22 September 2023).

Acknowledgments

We would like to thank Jen-Hung Wang from the Epidemiology and Biostatistics Consulting Center, Hualien Tzu Chi Hospital for help with statistical analyses and interpretation and Hsin-Ru Liu from the Department of Medical Research, Hualien Tzu Chi Hospital for checking the analyzed results and helping prepare this manuscript and the graphs.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Pringsheim, T.; Jette, N.; Frolkis, A.; Steeves, T.D. The prevalence of Parkinson’s disease: A systematic review and meta-analysis. Mov. Disord. 2014, 29, 1583–1590. [Google Scholar] [CrossRef] [PubMed]
  2. Kalia, L.V.; Lang, A.E. Parkinson’s disease. Lancet 2015, 386, 896–912. [Google Scholar] [CrossRef] [PubMed]
  3. Connolly, B.S.; Lang, A.E. Pharmacological treatment of Parkinson disease: A review. JAMA 2014, 311, 1670–1683. [Google Scholar] [CrossRef] [PubMed]
  4. Thanvi, B.; Lo, N.; Robinson, T. Levodopa-induced dyskinesia in Parkinson’s disease: Clinical features, pathogenesis, prevention and treatment. Postgrad. Med. J. 2007, 83, 384–388. [Google Scholar] [CrossRef] [PubMed]
  5. Kim, H.J.; Mason, S.; Foltynie, T.; Winder-Rhodes, S.; Barker, R.A.; Williams-Gray, C.H. Motor complications in Parkinson’s disease: 13-year follow-up of the CamPaIGN cohort. Mov. Disord. 2020, 35, 185–190. [Google Scholar] [CrossRef]
  6. Stoker, T.B.; Barker, R.A. Recent developments in the treatment of Parkinson’s Disease. F1000Research 2020, 9, 862. [Google Scholar] [CrossRef]
  7. Postuma, R.B.; Aarsland, D.; Barone, P.; Burn, D.J.; Hawkes, C.H.; Oertel, W.; Ziemssen, T. Identifying prodromal Parkinson’s disease: Pre-motor disorders in Parkinson’s disease. Mov. Disord. 2012, 27, 617–626. [Google Scholar] [CrossRef]
  8. Akhtar, R.S.; Stern, M.B. New concepts in the early and preclinical detection of Parkinson’s disease: Therapeutic implications. Expert. Rev. Neurother. 2012, 12, 1429–1438. [Google Scholar] [CrossRef]
  9. Lawrence, A.D.; Evans, A.H.; Lees, A.J. Compulsive use of dopamine replacement therapy in Parkinson’s disease: Reward systems gone awry? Lancet Neurol. 2003, 2, 595–604. [Google Scholar] [CrossRef]
  10. Farzanehfar, P.; Woodrow, H.; Braybrook, M.; Osborn, S.; Evans, A.; Nicklason, F.; Horne, M. Objective measurement in routine care of people with Parkinson’s disease improves outcomes. NPJ Park. Dis. 2018, 4, 10. [Google Scholar] [CrossRef]
  11. Adler, C.H.; Beach, T.G.; Hentz, J.G.; Shill, H.A.; Caviness, J.N.; Driver-Dunckley, E.; Sabbagh, M.N.; Sue, L.I.; Jacobson, S.A.; Belden, C.M.; et al. Low clinical diagnostic accuracy of early vs. advanced Parkinson disease: Clinicopathologic study. Neurology 2014, 83, 406–412. [Google Scholar] [CrossRef] [PubMed]
  12. Adams, W.R. High-accuracy detection of early Parkinson’s Disease using multiple characteristics of finger movement while typing. PLoS ONE 2017, 12, e0188226. [Google Scholar] [CrossRef] [PubMed]
  13. Miyasaki, J.M.; Martin, W.; Suchowersky, O.; Weiner, W.J.; Lang, A.E. Practice parameter: Initiation of treatment for Parkinson’s disease: An evidence-based review: Report of the Quality Standards Subcommittee of the American Academy of Neurology. Neurology 2002, 58, 11–17. [Google Scholar] [CrossRef] [PubMed]
  14. Gwinn, K.; David, K.K.; Swanson-Fischer, C.; Albin, R.; Hillaire-Clarke, C.S.; Sieber, B.A.; Lungu, C.; Bowman, F.D.; Alcalay, R.N.; Babcock, D.; et al. Parkinson’s disease biomarkers: Perspective from the NINDS Parkinson’s Disease Biomarkers Program. Biomark. Med. 2017, 11, 451–473. [Google Scholar] [CrossRef]
  15. Movement Disorder Society Task Force on Rating Scales for Parkinson’s Disease. The unified Parkinson’s disease rating scale (UPDRS): Status and recommendations. Mov. Disord. 2003, 18, 738–750. [Google Scholar]
  16. Grill, S.; Weuve, J.; Weisskopf, M.G. Predicting outcomes in Parkinson’s disease: Comparison of simple motor performance measures and The Unified Parkinson’s Disease Rating Scale-III. J. Park. Dis. 2011, 1, 287–298. [Google Scholar] [CrossRef]
  17. Lo, C.; Arora, S.; Lawton, M.; Barber, T.; Quinnell, T.; Dennis, G.J.; Ben-Shlomo, Y.; Hu, M.T.-M. A composite clinical motor score as a comprehensive and sensitive outcome measure for Parkinson’s disease. J. Neurol. Neurosurg. Psychiatry 2022, 93, 617. [Google Scholar] [CrossRef] [PubMed]
  18. Yang, K.; Xiong, W.X.; Liu, F.T.; Sun, Y.M.; Luo, S.; Ding, Z.T.; Wu, J.J.; Wang, J. Objective and quantitative assessment of motor function in Parkinson’s disease-from the perspective of practical applications. Ann. Transl. Med. 2016, 4, 90. [Google Scholar] [CrossRef]
  19. Deb, R.; An, S.; Bhat, G.; Shill, H.; Ogras, U.Y.J.S. A systematic survey of research trends in technology usage for Parkinson’s disease. Sensors 2022, 22, 5491. [Google Scholar] [CrossRef]
  20. Arora, S.; Venkataraman, V.; Zhan, A.; Donohue, S.; Biglan, K.M.; Dorsey, E.R.; Little, M.A. Detecting and monitoring the symptoms of Parkinson’s disease using smartphones: A pilot study. Park. Relat. Disord. 2015, 21, 650–653. [Google Scholar] [CrossRef]
  21. Espay, A.J.; Bonato, P.; Nahab, F.B.; Maetzler, W.; Dean, J.M.; Klucken, J.; Eskofier, B.M.; Merola, A.; Horak, F.; Lang, A.E.; et al. Technology in Parkinson’s disease: Challenges and opportunities. Mov. Disord. 2016, 31, 1272–1282. [Google Scholar] [CrossRef] [PubMed]
  22. Douhou, S.; Magnus, J.R. The reliability of user authentication through keystroke dynamics. Stat. Neerl. 2009, 63, 432–449. [Google Scholar] [CrossRef]
  23. Thomas, P.A.; Preetha Mathew, K. A broad review on non-intrusive active user authentication in biometrics. J. Ambient Intell. Humaniz. Comput. 2023, 14, 339–360. [Google Scholar] [CrossRef] [PubMed]
  24. Acien, A.; Morales, A.; Vera-Rodriguez, R.; Fierrez, J.; Mondesire-Crump, I.; Arroyo-Gallego, T. Detection of mental fatigue in the general population: Feasibility study of keystroke dynamics as a real-world biomarker. JMIR Biomed. Eng. 2022, 7, e41003. [Google Scholar] [CrossRef]
  25. Demir, B.; Ulukaya, S.; Erdem, O. Detection of Parkinson’s disease with keystroke data. Comput. Methods Biomech. Biomed. 2023, 26, 1653–1667. [Google Scholar] [CrossRef]
  26. Bernardo, L.S.; Damaševičius, R.; Ling, S.H.; de Albuquerque, V.H.C.; Tavares, J. Modified SqueezeNet architecture for Parkinson’s disease detection based on keypress data. Biomedicines 2022, 10, 2746. [Google Scholar] [CrossRef]
  27. Tripathi, S.; Arroyo-Gallego, T.; Giancardo, L. Keystroke-Dynamics for Parkinson’s disease signs detection in an at-home uncontrolled population: A new benchmark and method. IEEE. Trans. Biomed. Eng 2023, 70, 182–192. [Google Scholar] [CrossRef]
  28. Roy, S.; Roy, U.; Sinha, D.; Pal, R.K. Imbalanced ensemble learning in determining Parkinson’s disease using Keystroke dynamics. Expert Syst. Appl. 2023, 217, 119522. [Google Scholar] [CrossRef]
  29. Giancardo, L.; Sánchez-Ferro, A.; Arroyo-Gallego, T.; Butterworth, I.; Mendoza, C.S.; Montero, P.; Matarazzo, M.; Obeso, J.A.; Gray, M.L.; Estépar, R.S. Computer keyboard interaction as an indicator of early Parkinson’s disease. Sci. Rep. 2016, 6, 34468. [Google Scholar] [CrossRef]
  30. Goldberger, A.L.; Amaral, L.A.N.; Glass, L.; Hausdorff, J.M.; Ivanov PCh Mark, R.G.; Mietus, J.E.; Moody, G.B.; Peng, C.-K.; Stanley, H.E. PhysioBank, PhysioToolkit, and PhysioNet: Components of a New Research Resource for Complex Physiologic Signals. Circulation 2000, 101, e215–e220. [Google Scholar] [CrossRef]
  31. Acien, A.; Morales, A.; Monaco, J.V.; Vera-Rodriguez, R.; Fierrez, J. TypeNet: Deep learning keystroke biometrics. IEEE Trans. Biom. Behav. 2022, 4, 57–70. [Google Scholar] [CrossRef]
  32. Behrens, J.T. Principles and procedures of exploratory data analysis. Psychol. Methods 1997, 2, 131–160. [Google Scholar] [CrossRef]
  33. Lan, B.L.; Yeo, J.H.W. Comparison of computer-key-hold-time and alternating-finger-tapping tests for early-stage Parkinson’s disease. PLoS ONE 2019, 14, e0219114. [Google Scholar] [CrossRef] [PubMed]
  34. DeLong, E.R.; DeLong, D.M.; Clarke-Pearson, D.L. Comparing the areas under two or more correlated receiver operating characteristic curves: A nonparametric approach. Biometrics 1988, 44, 837–845. [Google Scholar] [CrossRef] [PubMed]
  35. Youden, W.J. Index for rating diagnostic tests. Cancer 1950, 3, 32–35. [Google Scholar] [CrossRef] [PubMed]
  36. Hawkins, D.M. Identification of Outliers; Monographs on Statistics and Applied Probability; Springer: Berlin/Heidelberg, Germany, 1980. [Google Scholar] [CrossRef]
  37. Maxim, L.D.; Niebo, R.; Utell, M.J. Screening tests: A review with examples. Inhal. Toxicol. 2014, 26, 811–828. [Google Scholar] [CrossRef]
  38. Ganguly, J.; Kulshreshtha, D.; Almotiri, M.; Jog, M. Muscle tone physiology and abnormalities. Toxins 2021, 13, 282. [Google Scholar] [CrossRef]
  39. di Biase, L.; Di Santo, A.; Caminiti, M.L.; De Liso, A.; Shah, S.A.; Ricci, L.; Di Lazzaro, V. Gait analysis in Parkinson’s disease: An overview of the most accurate markers for diagnosis and symptoms monitoring. Sensors 2020, 20, 3529. [Google Scholar] [CrossRef]
  40. Meigal, A.Y.; Rissanen, S.M.; Tarvainen, M.P.; Airaksinen, O.; Kankaanpää, M.; Karjalainen, P.A. Non-linear EMG parameters for differential and early diagnostics of Parkinson’s disease. Front. Neurol. 2013, 4, 135. [Google Scholar] [CrossRef]
  41. Amato, F.; Borzì, L.; Olmo, G.; Orozco-Arroyave, J.R. An algorithm for Parkinson’s disease speech classification based on isolated words analysis. Health Inf. Sci. 2021, 9, 32. [Google Scholar] [CrossRef]
  42. Holden, S.K.; Finseth, T.; Sillau, S.H.; Berman, B.D. Progression of MDS-UPDRS Scores over five years in de novo Parkinson Disease from the Parkinson’s Progression Markers Initiative Cohort. Mov. Disord. Clin. Pract. 2018, 5, 47–53. [Google Scholar] [CrossRef] [PubMed]
  43. Evers, L.J.W.; Krijthe, J.H.; Meinders, M.J.; Bloem, B.R.; Heskes, T.M. Measuring Parkinson’s disease over time: The real-world within-subject reliability of the MDS-UPDRS. Mov. Disord. 2019, 34, 1480–1487. [Google Scholar] [CrossRef] [PubMed]
  44. Lam, K.H.; Twose, J.; Lissenberg-Witte, B.; Licitra, G.; Meijer, K.; Uitdehaag, B.; De Groot, V.; Killestein, J. The Use of Smartphone keystroke dynamics to passively monitor upper limb and cognitive function in multiple sclerosis: Longitudinal analysis. J. Med. Internet Res. 2022, 24, e37614. [Google Scholar] [CrossRef] [PubMed]
Figure 1. Definition of the typing intervals assessed in this study. The keystroke biometric parameters (dark: key pressed; grey: key released) assessed in the current study were based on the time points of press (P1, P2, P3, …Pn) and release (R1, R2, R3, …Rn) and defined as hold latency (HL), interkey latency (IL), press latency (PL), and release latency (RL) [31].
Figure 1. Definition of the typing intervals assessed in this study. The keystroke biometric parameters (dark: key pressed; grey: key released) assessed in the current study were based on the time points of press (P1, P2, P3, …Pn) and release (R1, R2, R3, …Rn) and defined as hold latency (HL), interkey latency (IL), press latency (PL), and release latency (RL) [31].
Diagnostics 13 03061 g001
Figure 2. ROC curves showing the performance of nQi and keystroke biometric parameters to differentiate the de novo Parkinson’s disease patients from the healthy controls (a) and the early Parkinson’s disease patients from the healthy controls (b). nQi: neuroQWERTY index [29], SD: standard deviation of holding latency [33], RLSD: standard deviation of release latency, afTap: alternating-finger tapping, ILSD: standard deviation of interkey latencies. There were no statistically significant results as per DeLong’s test [34]. #: Four healthy controls and five patients with early PD did not complete the afTap.
Figure 2. ROC curves showing the performance of nQi and keystroke biometric parameters to differentiate the de novo Parkinson’s disease patients from the healthy controls (a) and the early Parkinson’s disease patients from the healthy controls (b). nQi: neuroQWERTY index [29], SD: standard deviation of holding latency [33], RLSD: standard deviation of release latency, afTap: alternating-finger tapping, ILSD: standard deviation of interkey latencies. There were no statistically significant results as per DeLong’s test [34]. #: Four healthy controls and five patients with early PD did not complete the afTap.
Diagnostics 13 03061 g002
Table 1. Demographic data on the subjects assessed in this study.
Table 1. Demographic data on the subjects assessed in this study.
ParameterHealthy Controls
(n = 43)
De Novo PD
(n = 24)
Early PD
(n = 18)
Age60.10 ± 10.2061.4 ± 10.5055.90 ± 8.00
Sex (male)17 (40%)14 (58%)10 (56%)
Average duration after diagnosis (years)01.60 ± 1.223.89 ± 1.23 *
Average education (years) 15.30 ± 5.2015.50 ± 3.8014.83 ± 4.60
No. of outliers (%)0.56 ± 0.640.40 ± 0.530.36 ± 0.42
Data are expressed as means ± SD. PD: Parkinson’s disease. No. of outliers: the percentage of outliers removed during data processing. * Significances of difference were determined as p < 0.05 by using nonparametric ANOVA with Tukey post hoc test between the groups.
Table 2. Clinical Severity and Keystroke Biometric Parameters.
Table 2. Clinical Severity and Keystroke Biometric Parameters.
ParameterHealthy Controls
(n = 43)
De Novo PD
(n = 24)
Early PD
(n = 18)
UPDRS-III
(range)
1.92 ± 1.79
(0~6)
19.33 ± 6.70 *
(7~36)
22.32 ± 8.69
(11~40)
Typing speed (words/min)112.34 ± 58.7597.20 ± 42.5398.86 ± 45.94
No. samples1634.33 ± 793.041454.21 ± 497.721320.56 ± 581.98
nQi0.06 ± 0.060.12 ± 0.10 *0.14 ± 0.06
sTap (msec)170.85 ± 16.45165.48 ± 24.24159.42 ± 24.13
afTap # (msec)128.99 ± 27.8594.85 ± 23.54 *96.33 ± 19.75
SD0.34 ± 0.080.41 ± 0.11 *0.46 ± 0.14
HLSD0.50 ± 0.160.57 ± 0.150.53 ± 0.15
ILSD1.15 ± 0.151.28 ± 0.11 *1.27 ± 0.16
PLSD0.95 ± 0.131.01 ± 0.161.02 ± 0.22
RLSD1.07 ± 0.121.23 ± 0.14 *1.20 ± 0.19
UPDRS-III: the third part of the Unified Parkinson’s Disease Rating Scale, no. samples: number of words typed, nQi: neuroQWERTY index [29], sTap: single-finger tapping, afTap: alternating-finger tapping, SD: standard deviation of holding latency [33], HLSD: standard deviation of holding latency based on our modified method, ILSD: standard deviation of interkey latency, PLSD: standard deviation of press latency, RLSD: standard deviation of release latency. * p < 0.05 in the comparison between the healthy controls and de novo PD participants; p < 0.05 in the comparison between the healthy controls and early PD participants. Significances in difference were determined as p < 0.05 by nonparametric ANOVA with Tukey post hoc test. #: Four healthy controls and five patients with early PD did not complete the afTap.
Table 3. Sensitivity and specificity of typestroke biometric parameters in de novo PD and early PD patients.
Table 3. Sensitivity and specificity of typestroke biometric parameters in de novo PD and early PD patients.
ParameterDe Novo PDEarly PD
SensitivitySpecificitySensitivitySpecificity
nQi58%86%94%79%
afTap #83%60%92%72%
SD42%95%72%77%
ILSD88%60%44%91%
RLSD96%58%44%100%
nQi: neuroQWERTY index [29], SD: standard deviation of holding latency [33], afTap: alternating-finger tapping, ILSD: standard deviation of interkey latency, SD: standard deviation of holding latency, RLSD: standard deviation of release latency. The cut-off points were defined by the maximal Youden indices [35]. #: Four healthy controls did not complete the afTap.
Table 4. Correlation of Typestroke Parameters with UPDRS-III in Patients with Parkinson’s Disease.
Table 4. Correlation of Typestroke Parameters with UPDRS-III in Patients with Parkinson’s Disease.
ParameterDe Novo PDEarly PD
(n = 24)(n = 18)
Typing speed−0.371−0.757
No. samples−0.452 *−0.733
nQi0.2430.353
sTap−0.1120.077
afTap−0.484 *−0.095
SD0.0890.212
HLSD−0.005−0.019
ILSD−0.09−0.283
PLSD0.2740.347
RLSD−0.085−0.144
No. samples: number of words typed, nQi: neuroQWERTY index [29], sTap: single-finger tapping, afTap: alternating-finger tapping, SD: standard deviation of holding latency [33], HLSD: standard deviation of holding latency based on our modified method, ILSD: standard deviation of interkey latency, PLSD: standard deviation of press latency, RLSD: standard deviation of release latency. * p < 0.05 for the significant difference between the healthy controls and patients with de novo PD, p < 0.05 for the significant difference between the healthy controls and patients with early PD. Significances of difference were determined as p < 0.05 by using nonparametric Spearman rank correlation.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Liu, W.-M.; Yeh, C.-L.; Chen, P.-W.; Lin, C.-W.; Liu, A.-B. Keystroke Biometrics as a Tool for the Early Diagnosis and Clinical Assessment of Parkinson’s Disease. Diagnostics 2023, 13, 3061. https://doi.org/10.3390/diagnostics13193061

AMA Style

Liu W-M, Yeh C-L, Chen P-W, Lin C-W, Liu A-B. Keystroke Biometrics as a Tool for the Early Diagnosis and Clinical Assessment of Parkinson’s Disease. Diagnostics. 2023; 13(19):3061. https://doi.org/10.3390/diagnostics13193061

Chicago/Turabian Style

Liu, Wei-Min, Che-Lun Yeh, Po-Wei Chen, Che-Wei Lin, and An-Bang Liu. 2023. "Keystroke Biometrics as a Tool for the Early Diagnosis and Clinical Assessment of Parkinson’s Disease" Diagnostics 13, no. 19: 3061. https://doi.org/10.3390/diagnostics13193061

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop