Development of an AI-Based Predictive Algorithm for Early Diagnosis of High-Risk Dementia Groups among the Elderly: Utilizing Health Lifelog Data

Ji-Yong Lee; So Yoon Lee

doi:10.3390/healthcare12181872

Abstract

Background/Objectives: This study aimed to develop a predictive algorithm for the early diagnosis of dementia in the high-risk group of older adults using artificial intelligence technologies. The objective is to create an accessible diagnostic method that does not rely on traditional medical equipment, thereby improving the early detection and management of dementia. Methods: Lifelog data from wearable devices targeting this high-risk group were collected from the AI Hub platform. Various indicators from these data were analyzed to develop a dementia diagnostic model. Machine learning techniques such as Logistic Regression, Random Forest, LightGBM, and Support Vector Machine were employed. Data augmentation techniques were applied to address data imbalance, thereby enhancing the model performance. Results: Data augmentation significantly improved the model’s accuracy in classifying dementia cases. Specifically, in gait data, the SVM model performed with an accuracy of 0.879. In sleep data, a Logistic Regression was performed, yielding an accuracy of 0.818. This indicates that the lifelog data can effectively contribute to the early diagnosis of dementia, providing a practical solution that can be easily integrated into healthcare systems. Conclusions: This study demonstrates that lifelog data, which are easily collected in daily life, can significantly enhance the accessibility and efficiency of dementia diagnosis, aiding in the effective use of medical resources and potentially delaying disease progression.

Keywords:

artificial intelligence; early dementia detection; lifelog data; wearable devices; machine learning

1. Introduction

The escalating threat of dementia in aging societies poses a significant social and economic burden. As the global population ages, the number of people with dementia continues to increase. Dementia is a common geriatric condition affecting approximately 10% of individuals aged 65 years and above [1]. It denotes a state characterized not only by a decline in cognitive function but also by impairments in language, intelligence, concentration, and judgment abilities, indicating anomalies in perceptual skills. Once dementia develops, it is not reversible and tends to either remain stable or progressively worsen [2]. The disease places a substantial burden not only on the patients themselves but also on their family members, necessitating proactive national responses [3]. Despite the various types and symptoms of dementia, its underlying mechanisms remain unclear, and no definitive treatments are currently available. Consequently, early diagnosis of dementia is crucial and increasingly emphasized [4,5].

Globally, the number of individuals diagnosed with dementia continues to rise steadily. According to the World Health Organization (WHO), it is projected that by 2050, the number of dementia patients will reach approximately 152.8 million, which is nearly three times the current figure [6]. The economic burden of dementia is significant; in 2019, the global cost for 55.2 million individuals with dementia was estimated at USD 1.313, equating to an average cost of USD 23,796 [7]. This financial burden is expected to increase alongside the aging population, highlighting the critical need for the establishment of early diagnosis systems for dementia.

Early diagnosis is the most effective method for managing dementia, allowing for interventions to delay its progression to severe stages. Traditionally, the diagnosis of dementia relies on the clinical expertise of physicians and often involves expensive neuroimaging assessments [8]. This makes the early diagnosis of dementia difficult and inaccessible for many individuals. National-level initiatives, such as the Mini-Mental State Examination for Dementia Screening conducted at public health centers, aim to address this issue [9]. However, delays in acknowledging symptoms often prevent timely visits to health centers for diagnosis.

If dementia is not diagnosed early, patients may miss the critical treatment window, losing a valuable opportunity to slow the progression of the disease. As previously mentioned, dementia is a progressive disorder that worsens over time. Without appropriate treatment and management during the early stages, the disease can quickly advance to severe stages [2]. Failure to diagnose early means that patients may not receive the necessary treatment until the disease has significantly progressed, leading to accelerated functional decline and an increased risk of losing independence in daily activities. Additionally, in the early stages of dementia, many patients either do not notice the decline in cognitive function or dismiss mild symptoms, resulting in a diagnosis only after the disease has advanced considerably. This situation often increases the psychological and financial burden on both the patients and their families, significantly diminishing the quality of life for those affected by dementia. Therefore, early detection and proactive treatment are essential for maintaining patient functionality and improving quality of life.

Traditional dementia diagnosis methods face two main challenges: the need for individuals to visit diagnostic facilities such as hospitals or health centers, and the reliance on expensive equipment [10]. Which solutions address these issues? A potential solution for early diagnosis is to record data generated from daily life activities utilizing equipment that is readily available, thereby overcoming reliance on expensive diagnostic tools [11].

Lifelog data generated through Internet of Things (IoT) technologies, such as wearable devices and mobile equipment, provides a comprehensive record of an individual’s daily life activities. Although lifelog data includes information recorded on social networking sites, their most noteworthy application is in the healthcare industry [12]. In healthcare, notable lifelog data include activity levels, sleep information, dietary habits, weight fluctuations, body mass index, and muscle mass data collected from smartphones and wearable devices [13]. The healthcare industry aims to leverage these data to address weaknesses in medical management and provide continuously usable services.

To utilize health lifelog data in real time, the application of artificial intelligence (AI) technology, capable of classifying large volumes of data and deriving meaningful results in real time, is essential. AI technology can serve as a critical decision-making tool for the early diagnosis of dementia. Various studies have demonstrated the extensive use of AI in real-time data collection and analysis [14]. These studies highlight the important role of AI technology in real-time data analysis and decision making. In a study utilizing lifelog data to predict diabetes and cardiovascular diseases, machine learning models demonstrated a precision of 97.1% and a recall of 96.2%, thereby validating the effectiveness of early diagnosis through lifelog data analysis [15]. Building on these findings, digital healthcare platforms that leverage lifelog data have continued to evolve, collecting and automatically analyzing individual health data to offer personalized health management. These platforms employ AI-based deep learning modules to perform real-time analyses, making them highly effective tools for managing chronic diseases [16]. Therefore, the application of AI technology to the real-time analysis of health lifelog data is considered a valid approach.

Dementia diagnosis is a complex process that requires specialized knowledge of various conditions and scenarios. However, based on the results of prior research, implementing an AI-based early dementia diagnosis prediction system by integrating health lifelog data with AI technology appears feasible. Therefore, in this study, we aim to develop an AI-based predictive algorithm that enables early dementia diagnosis using health lifelog data, serving as preliminary research for building an AI-based diagnostic system. The results of this study are expected to facilitate early dementia diagnosis, enabling appropriate and timely treatment, thereby slowing disease progression and improving patients’ quality of life.

2. Methodology

2.1. Participants

To achieve the objectives of this study, we utilized the “Wearable Lifelog Data for Dementia High-Risk Groups” provided by AI Hub, accessed on 26 June 2024. (https://www.aihub.or.kr/). AI Hub is an integrated AI platform operated by the National Information Society Agency of Korea, offering training data in six fields, including healthcare, to support AI service development. The wearable lifelog data for dementia high-risk groups were derived from raw data collected using healthcare wearable devices. These data underwent refinement and labeling processes for lifelog big data construction for each stage of dementia progression and included datasets indicating the probability of developing dementia, as assessed by an AI-based early prediction model. The dataset was collected from men and women aged 55 years residing in Gwangju Metropolitan City, based on precise diagnoses by specialists. A total of 300 participants were categorized into Cognitive Normal (CN), Mild Cognitive Impairment (MCI), and Dementia (Dem), and were equipped with ring-shaped wearable devices for data collection. Following data collection, participants with a wearable device usage period of less than 35 days were excluded through data preprocessing. The final distributed dataset included the cognitive function data of 174 participants, with 111 categorized as CN, 51 as MCI, and 12 as Dem.

2.2. Data Preprocessing and Variable Extraction

The Dem group’s relatively small sample size of 12 raises the model bias risk during training. Therefore, instead of classifying MCI and Dem separately, we combined the MCI and Dem data to classify them as a high-risk group for dementia. The training and valid data used in this study are summarized in Table 1.

Table 1. Dataset used in this study.

In this study, lifelog data used to predict high-risk dementia groups were divided into sleep and gait data. The specific datasets, listed in Table 2, span approximately 35–120 days, and various metrics were collected daily. As the collection dates varied for each participant, we calculated the mean, standard deviation, maximum, and minimum values for each variable per participant to incorporate into the model training.

Table 2. Description of gait and sleep lifelog data.

Specifically, unique participant lists were extracted from sleep and gait data using each participant’s email address as an identifier. For each participant, the mean, standard deviation, maximum, and minimum values of the variables were calculated and saved as separate Excel files. The training and valid data were saved separately to maintain distinct datasets for analysis.

Nonquantifiable variables (dates, etc.) in the gait lifelog data such as the five-minute activity log, activity start time, activity end time, and one-minute MET log were excluded from the analysis. Variables (dates, etc.) in the sleep lifelog data such as sleep start time, sleep end time, five-minute heart rate log, sleep state log, and five-minute heart rate variability log were excluded from the analysis.

Finally, StandardScaler was applied to standardize the training data. Data were preprocessed to ensure that the same scaler could be applied to the validation data. This approach was designed to enable early diagnosis of dementia using data standardized consistently with the training data. The scaler was saved along with the model, allowing new data not used in this study to be standardized according to identical criteria.

StandardScaler is one method used in data preprocessing for machine learning. It standardizes data by adjusting each feature to have a mean of 0 and a standard deviation of 1 [17]. This process can enhance the model’s performance and reduce the training time.

2.3. Data Augmentation

In this study, data augmentation was performed solely on the training data to improve the model’s performance. Collecting large amounts of data for machine learning research can be costly, particularly when using human data [18]. Data augmentation can be used effectively to enhance the model performance and prevent overfitting during data prediction [19]. We utilized models both with and without data augmentation. For this study, data augmentation was achieved by randomly adjusting the mean values of each measurement for participants within a ±10% range of the standard deviation, thereby increasing the training data size by 20 times. The Mean Value variable represents the measurement’s average value, and Standard Deviation refers to the measurement’s standard deviation. The Random Factor variable is a random number generated between −1 and 1 that scales the adjustment within the range of ±10% of the standard deviation. This method was applied repeatedly to increase the training dataset size by a factor of 20, ensuring substantial augmentation while maintaining the inherent statistical properties of the original data. Importantly, no augmentation was applied to the validation data in order to avoid potential issues that data augmentation might introduce during validation. The final training data used in this study after augmentation are presented in Table 3.

A u g m e n t e d V a l u e = M e a n V a l u e + (R a n d o m F a c t o r \times 0.1 \times S t a n d a r d D e v i a t i o n)

(1)

Table 3. Dataset utilized in this study after data augmentation.

The final data used in this study after data augmentation are presented in Table 3.

2.4. Learning Model and Hyperparameters

In this study, prediction models were developed using Logistic Regression, Random Forest, LightGBM, and Support Vector Machine Classification. For datasets with defined features or limited data, traditional machine learning techniques are more effective than deep learning [20]. Additionally, to optimize the model’s hyperparameters, GridSearch, a technique to improve model performance in machine learning, was employed. This involves entering a list of hyperparameter values for a machine learning model, evaluating the performance for all possible combinations of these values, and identifying the best set of values [21]. The optimal hyperparameters for each model, determined using the GridSearch method, are listed in Table 4.

Table 4. Optimal hyperparameters for each machine learning model.

2.4.1. Logistic Regression

Logistic Regression is a widely used machine learning technique that has been employed in various research areas. It is particularly effective in addressing binary classification problems by outputting probabilities between 0 and 1, which can then be used to predict the likelihood that a specific condition is true. For instance, in the context of dementia classification, a Logistic Regression model can predict whether a patient has dementia by providing a yes or no answer based on the calculated probability. Logistic Regression has traditionally been extensively utilized in the medical field, including applications such as dementia diagnosis, where it plays a crucial role in predicting patient outcomes. The primary reason for the widespread use of Logistic Regression lies in its ability to perform binary classification with a high degree of interpretability and efficiency. There is prior research demonstrating the effectiveness of Logistic Regression in predicting Alzheimer’s disease, achieving a high accuracy of 0.873 [22]. Additionally, this technique has been employed in various studies to predict disease risks, such as colorectal cancer [23] and diabetes [24]. Based on the results of these studies, it is evident that Logistic Regression is a commonly used method for disease prediction. Therefore, it was also adopted as the machine learning technique in this study. The specific formula for the Logistic Regression model is shown in Equation (2).

- \frac{1}{m} \sum_{i = 1}^{m} [y^{(i)} \log (h (z^{(i)})) + (1 - y^{(i)}) l o g (1 - h (z^{(i)}))

(2)

2.4.2. Random Forest

Random Forest is an ensemble method that enhances predictive performance by combining multiple decision trees. Each tree in the model is trained independently, and the final prediction is determined through majority voting among the individual trees. Owing to this structure, Random Forest exhibits high stability and prediction accuracy, and it is particularly robust against overfitting. A decision tree operates by recursively partitioning the dataset into subsets based on various criteria. This partitioning process continues until no further predictions can be made, or until all data within a subset share the same target variable value. This recursive partitioning method is known as ‘Top-Down Induction of Decision Trees (TDIDT)’, and ultimately, the dependent variable Y is used as the target for classification. The vector v can be expressed by the following equation.

(v, Y) = (x_{1}, x_{2}, \dots, x_{d}, Y)

(3)

When training a Tree model, the process involves optimizing the parameters for each terminal and internal node, as well as the parameters of the node split function, to minimize an objective function defined based on the given data v, training set S0, and actual labels. Random Forest enhances model accuracy by utilizing an ensemble of these decision trees through the Bagging (bootstrap aggregation) method. Bagging involves repeatedly sampling the data (Bootstrap), training each model on these samples, and then aggregating the results (Aggregation) to produce the final prediction. In Random Forest, the combination of results from decision trees, each composed of different nodes, yields an optimized classification outcome.

I_{G} (f) = \sum_{i = 1}^{m} f_{i} (1 - f_{i}) = \sum_{i = 1}^{m} (f_{i} - f_{i}^{2}) = \sum_{i = 1}^{m} f_{i} - \sum_{i = 1}^{m} f_{i}^{2} = 1 - \sum_{i = 1}^{m} f_{i}^{2}

(4)

The use of the Random Forest technique in the field of dementia diagnosis is justified by the complex and multidimensional nature of dementia-related data, which can be effectively handled by the multi-tree structure of this method. Studies have leveraged these advantages, applying the Random Forest technique to classify neuroimaging data related to Alzheimer’s disease [25]. Additionally, this method has been employed in various studies, such as those predicting cardiovascular diseases [26], demonstrating its utility in disease prediction. Given the results of these studies, it is evident that Random Forest is a widely used technique for disease prediction, which is why it was chosen as the machine learning method in this study. The specific formulas for the Random Forest technique are presented in Equations (3) and (4).

2.4.3. LightGBM

LightGBM is a boosting framework that prioritizes high performance and speed, making it particularly useful for handling large-scale or complex high-dimensional data. Unlike traditional boosting algorithms that expand trees level by level, LightGBM grows trees leaf-wise. This approach allows the model to improve accuracy by preferentially splitting the leaf with the highest loss, thereby reducing overfitting. Additionally, LightGBM is well-suited for addressing data imbalance issues. This capability is especially valuable in research focused on disease prediction, such as dementia, where it effectively handles imbalanced datasets and facilitates early diagnosis and classification. For instance, studies have utilized LightGBM to develop models predicting the progression from mild cognitive impairment (MCI) to dementia, demonstrating high accuracy and efficiency [27]. Furthermore, LightGBM has been employed in various studies for disease prediction, including research on predicting thyroid disorders [28]. Given the outcomes of these prior studies, it is evident that LightGBM is a commonly used technique for disease prediction. Consequently, it was also selected as the machine learning method in this study.

2.4.4. Support Vector Machine Classification

Support Vector Machine (SVM) is a model that establishes a decision boundary to classify data into two categories. When new data are input, the model analyzes the features of the data and classifies them into the category that corresponds to the side of the decision boundary with similar attributes. The performance of SVM improves as the margin between the decision boundary and the data increases, with this margin being referred to as the “Margin”. SVM enhances classification accuracy by securing a wider margin and strengthens the model’s reliability by removing outliers within the margin. The SVM algorithm can be described using a p-dimensional hyperplane, as expressed in Equation (5), which represents a line where

f (X) = 0

:

f (X) = β_{0} + β_{1} X_{1} + \dots + β_{p} X_{p}

(5)

f (X) = 0

(6)

The function f(X) classifies data points based on their position relative to the hyperplane: if f(X_i) is greater than 0, the data point is classified as Class 1; if f(X_i) is less than 0, it is classified as Class 2. Specifically, when f(X_i) > 0, the data point belongs to Class 1, and when f(X_i) < 0, it belongs to Class 2. Consequently, each data point is assigned a value of Y_i, which is either −1 or 1 depending on its class. The condition for determining that all data points are correctly classified is when the expression in Equation (7) is positive for all data points.

Y_{i} (β_{0} + β_{1} X_{i 1} + \dots + β_{p} X_{i p}) > 0

(7)

When drawing a hyperplane, data can be divided with various slopes; however, to create the most accurate classification model, it is essential to find the margin that maximizes the distance between the two classes. This process defines the optimal hyperplane. Therefore, it is crucial to find the value that maximizes this margin, as expressed in Equation (10), to optimize the model’s performance.

{}_{β_{0}, β_{1}, \dots, β_{p}, ϵ_{1}, \dots, ϵ_{n} M}^{M a x i m i z e}M

(8)

s u b j e c t t o \sum_{j = 1}^{p} β_{j}^{2} = 1

(9)

Y_{i} (β_{0} + β_{1} X_{i 1} + \dots + β_{p} X_{i p}) \geq M (1 - ϵ_{1})

(10)

ϵ_{1} \geq 0, \sum_{i = 1}^{n} ϵ_{i} \leq C

(11)

The use of SVM in dementia classification is justified for several reasons. First, SVM is highly effective in handling high-dimensional data, making it well-suited for analyzing complex medical images such as MRI scans. Second, SVM has demonstrated high accuracy in predicting early stages of diseases like dementia [29], allowing for the construction of robust predictive models by integrating various clinical and imaging data. Additionally, SVM has been employed in a range of studies for disease prediction, including research on cardiovascular diseases [30]. Based on the outcomes of these previous studies, SVM is a widely used method for disease prediction, which is why it was also chosen as the machine learning technique in this study.

2.5. Model Evaluation Method

The performance of the prediction models was evaluated using six metrics: recall, precision, sensitivity, specificity, accuracy, and AUC. These performance indicators ranged from 0 to 1, with higher values indicating better model performance. The specific formula is as follows.

R e c a l l (S e n s i t i v i t y) = \frac{T r u e P o s i t i v e s (T P)}{T r u e P o s i t i v e s (T P) + F a l s e N e g a t i v e s (F N)}

(12)

P r e c i s i o n = \frac{T r u e P o s i t i v e s (T P)}{T r u e P o s i t i v e s (T P) + F a l s e P o s i t i v e s (F P)}

(13)

S p e c i f i c i t y = \frac{T r u e N e g a t i v e s (T N)}{T r u e N e g a t i v e s (T N) + F a l s e P o s i t i v e s (F P)}

(14)

A c c u r a c y = \frac{T r u e P o s i t i v e s (T P) + T r u e N e g a t i v e s (T N)}{T o t a l n u m b e r o f c a s e s}

(15)

2.6. Analysis Procedure

The research procedure was methodically organized into six stages to achieve the objectives of this study. Initially, wearable lifelog data from a high-risk dementia group was collected via AI Hub. This dataset included various physiological and behavioral indicators such as activity patterns, sleep cycles, and heart rate measurements. In the second stage, data preprocessing was conducted, which involved the removal of extraneous variables, computation of basic statistics, and standardization of each variable. This standardization ensured that the data were adjusted to a uniform scale, and basic statistics were employed to render the data in a format suitable for learning. The third stage involved segregating the preprocessed data into training and validation sets. Specifically, for the CN group, the data were divided in an 8:2 ratio for training and validation purposes, while for the MCI + Dem group, the data were split in a 9:1 ratio. This structured approach ensures a systematic processing of the data, setting a robust foundation for the subsequent analytical stages of the research. In the fourth stage, data augmentation was implemented to address issues of data scarcity and class imbalance. This step was crucial for enhancing the accuracy of the models and improving their performance across diverse conditions, underscoring the efforts to optimize data utilization. The fifth stage involved adjusting the hyperparameters of four predictive models. For this purpose, the GridSearch method was employed to systematically explore all combinations of hyperparameters to identify the configuration that yielded the best performance. Utilizing GridSearch significantly reduces the time researchers spend on manually testing combinations, thereby streamlining the model optimization process. In the final stage, the performance of the predictive models was evaluated using various metrics. Precision, recall, accuracy, and the area under the curve (AUC) were employed to thoroughly assess and analyze the predictive efficacy of each model. The specific research procedures are depicted in Figure 1.

Figure 1. Analysis procedure.

3. Results

3.1. Results of Prediction Model Training Using Original Data (Gait)

This study aimed to predict high-risk dementia groups using lifelog data. As the first research outcome, prediction models were trained using the original gait lifelog data. The performance metrics for each trained model are detailed in Table 5. Although Logistic Regression, LightGBM, and Support Vector Machine performed well in terms of accuracy, Random Forest outperformed them all, with an AUC of 0.734 according to the ROC curve. However, sensitivity, which indicates the model’s ability to correctly identify actual dementia patients, was relatively low at 0.429. Detailed confusion matrices for each model are provided in the Supplementary Materials.

Table 5. Results of prediction model training using original data (gait).

3.2. Results of Prediction Model Training Using Original Data (Sleep)

As the second research outcome, prediction models were trained using the original sleep lifelog data. The performance metrics for each trained model are listed in Table 6. Both the accuracy and AUC scores indicated that the Support Vector Machine model performed the best. However, sensitivity was relatively low at 0.429. Detailed confusion matrices for each model are provided in the Supplementary Materials.

Table 6. Results of prediction model training using original data (sleep).

3.3. Results of Prediction Model Training Using Augmented Data (Gait)

As the third research outcome, the prediction models were trained using augmented gait lifelog data. The performance metrics for each trained model are listed in Table 7. Although the Support Vector Machine model had the highest accuracy, the Random Forest model demonstrated superior performance when considering the AUC. Notably, performance improved compared to the pre-augmentation AUC of 0.734, with sensitivity increasing from 0.429 to 0.571. Detailed confusion matrices for each model are provided in the Supplementary Materials.

Table 7. Results of prediction model training using augmented data (gait).

3.4. Results of Prediction Model Training Using Augmented Data (Sleep)

As the fourth research outcome, prediction models were trained using the augmented sleep lifelog data. The performance metrics for each trained model are listed in Table 8. Although the Support Vector Machine model had the highest accuracy, the Logistic Regression model demonstrated better performance when considering the AUC. Sensitivity also improved from 0.429 to 0.571. Detailed confusion matrices for each model are provided in the Supplementary Materials.

Table 8. Results of prediction model training using augmented data (sleep).

4. Discussion

This study aimed to develop an algorithm for the early diagnosis of high-risk dementia groups among pre-older adult individuals using AI. By leveraging health lifelog data that are easily accessible in everyday life, this study suggests a significant potential to reduce reliance on expensive medical equipment and specialized medical professionals required by traditional diagnostic methods. This approach could be especially applicable in areas with limited access to healthcare or in economically disadvantaged environments, offering the potential to improve public health quality. The key findings of the study are summarized as follows.

First, participants’ cognitive function in this study was categorized into three groups: CN (111), MCI (51), and Dem (12). This led to an imbalance in the data between groups. Although we attempted to address this issue, it did not resolve the imbalance between MCI and Dem or the gender distribution. Consequently, patients with MCI and Dem were combined and labeled as the high-risk dementia group. Data imbalance is a common limitation in dementia prediction studies. For instance, in previous studies, the Synthetic Minority Over-sampling Technique (SMOTE) was proposed to artificially augment data for minority groups in machine learning-based research [31]. This technique helps balance datasets by generating examples of the minority class.

In healthcare-related research, data augmentation has been employed to address the issue of insufficient sample sizes, such as in studies focused on classifying human body types using deep learning techniques [32]. Additionally, efforts to build AI systems aimed at preventing doping among athletes have also utilized data augmentation to compensate for limited sample sizes [33]. Following these examples from various previous studies, this research applied data augmentation to tackle the problem of data imbalance. However, in the long term, future research will need to employ more precise classification techniques and expand data collection across diverse populations.

Second, this study aimed to develop an algorithm for the early diagnosis of high-risk dementia groups using original lifelog data. Although the algorithm achieved a maximum accuracy of 0.879, the sensitivity for correctly classifying actual dementia cases was low at 0.429. This raises questions about whether this algorithm is optimal for classifying dementia, which could be a topic for discussion among researchers. However, the accurate and quick prediction of patients with dementia is crucial for its management and treatment. Precise and rapid diagnosis plays a decisive role in establishing appropriate treatment and management plans, which are essential for maintaining a patient’s quality of life and slowing the progression of dementia.

Using lifelog data collected from daily life for proactive early diagnoses of dementia would considerably aid the initiation of appropriate treatment at an early stage. For example, in previous studies, a machine learning-based system using lifelog data has detected abnormal behaviors in dementia patients. This system demonstrated the potential to monitor patient conditions in real time and identify issues early [34]. This study underscores the importance of early diagnosis and continuous monitoring in dementia management and suggests how technological approaches can improve patient care. The use of lifelog data has expanded in various fields, not only in dementia research. These data are increasingly being applied in the context of early and proactive diagnosis, where speed is often prioritized over precision. In this regard, low-cost wearable devices play a crucial role, serving as important tools for rapid data collection and analysis. The data collection device used in this study aligns with this approach. For instance, in previous research, the authors developed a low-cost, autonomous wearable device designed to track Alzheimer’s patients. The device uses GPS and geofencing technology to monitor the patient’s location in real time and sends alerts when the patient exits a designated safe zone [35]. Such low-cost devices help alleviate the burden on patients and their families and can be effectively utilized in regions with limited access to healthcare.

Third, we addressed the data imbalance issue by performing data augmentation. Numerous studies have proposed data augmentation as a solution to data imbalance problems [32]. The results of this study support previous findings, showing improved performance in sensitivity after data augmentation, indicating that the ability to classify actual dementia cases as dementia improved. An increase in sensitivity implies a better identification of actual dementia cases, potentially leading to a more accurate diagnosis. Therefore, we hope that future studies will continue to explore various techniques to enhance sensitivity. Furthermore, future studies should investigate how these techniques can be applied in clinical settings to contribute to high-quality research capable of early dementia prediction.

5. Conclusions

This study aimed to develop a predictive algorithm using AI technology for the early diagnosis of high-risk dementia groups among pre-older adult individuals. Early diagnostic methods that utilize health lifelog data aim to overcome the limitations of traditional diagnostic methods. The results suggest that effectively utilizing lifelog data, which can be easily collected from daily life, not only enhances the accessibility of dementia diagnosis and enables the efficient use of medical resources but also plays a crucial role in delaying the progression of dementia.

This study focused on improving model accuracy and sensitivity by applying data augmentation techniques to overcome the limitations of previous studies, such as data imbalance issues. The improvement in sensitivity after data augmentation can enhance the reliability of AI-based diagnostic systems. Nevertheless, future research should address the issue of data imbalance, as well as efforts to improve sensitivity.

Finally, the approach used in this study suggests the potential for application not only in dementia diagnosis but also in the early diagnosis of various health conditions. The integration of AI and healthcare technology could lead to more precise and personalized medical services, further improving the quality of public health. Future research should aim to enhance the model’s predictive power using more diverse data and advanced analytical techniques and explore its applicability in real clinical settings.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/healthcare12181872/s1.

Author Contributions

Conceptualization, J.-Y.L. and S.Y.L.; methodology, J.-Y.L.; software, J.-Y.L.; validation, J.-Y.L. and S.Y.L.; formal analysis, J.-Y.L.; investigation, J.-Y.L.; resources, J.-Y.L.; data curation, S.Y.L.; writing—original draft preparation, J.-Y.L.; writing—review and editing, J.-Y.L.; visualization, S.Y.L.; supervision, J.-Y.L.; project administration, J.-Y.L.; funding acquisition, J.-Y.L. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Ethical review and approval were waived for this study as the data used were sourced from AI Hub, accessed on 26 June 2024 (https://www.aihub.or.kr/), a publicly accessible platform providing open data. As the data are publicly available and anonymized, ensuring that there is no risk of identification, this study qualifies for exemption from Institutional Review Board (IRB) review.

Informed Consent Statement

Not applicable.

Data Availability Statement

This research (paper) used datasets from ‘The Open AI Dataset Project (AI-Hub, S. Korea)’. All data information can be accessed through ‘AI Hub, accessed on 26 June 2024 (www.aihub.or.kr)’.

Conflicts of Interest

The authors declare no conflicts of interest.

References

Cho, H.; Ko, Z. Current state of senile dementia and improvement of the long term care insurance for elderly people. J. Korea Acad.-Ind. Coop. Soc. 2012, 13, 5816–5825. [Google Scholar] [CrossRef]
Lee, K.H.; Kim, C.Y.; Kim, S.H. Diagnosis and treatment of dementia. J. Korean Phys. Ther. Sci. 2002, 9, 171–178. [Google Scholar]
Huang, S.S.; Lee, M.C.; Liao, Y.C.; Wang, W.F.; Lai, T.J. Caregiver burden associated with behavioral and psychological symptoms of dementia (BPSD) in Taiwanese elderly. Arch. Gerontol. Geriatr. 2012, 55, 55–59. [Google Scholar] [CrossRef] [PubMed]
Lee, J.H. Treatment of vascular dementia: A comprehensive review. J. Korean Neurol. Assoc. 2003, 21, 445–454. [Google Scholar]
EFNS; Waldemar, G.; Dubois, B.; Emre, M.; Georges, J.; McKeith, I.G.; Rossor, M.; Scheltens, P.; Tariska, P.; Winblad, B.; et al. Recommendations for the diagnosis and management of Alzheimer’s disease and other disorders associated with dementia: EFNS guideline. Eur. J. Neurol. 2007, 14, e1–e26. [Google Scholar] [CrossRef]
Nichols, E.; Steinmetz, J.D.; Vollset, S.E.; Fukutaki, K.; Chalek, J.; Abd-Allah, F.; Abdoli, A.; Abualhasan, A.; Abu-Gharbieh, E.; Akram, T.T.; et al. Estimation of the global prevalence of dementia in 2019 and forecasted prevalence in 2050: An analysis for the Global Burden of Disease Study 2019. Lancet Public Health 2022, 7, e105–e125. [Google Scholar] [CrossRef]
Wimo, A.; Seeher, K.; Cataldi, R.; Cyhlarova, E.; Dielemann, J.L.; Frisell, O.; Guerchet, M.; Jönsson, L.; Malaha, A.K.; Nichols, E.; et al. The worldwide costs of dementia in 2019. Alzheimers Dement. 2023, 19, 2865–2873. [Google Scholar] [CrossRef]
Pemberton, H.G.; Goodkin, O.; Prados, F.; Das, R.K.; Vos, S.B.; Moggridge, J.; Coath, W.; Gordon, E.; Barrett, R.; Schmitt, A.; et al. Automated quantitative MRI volumetry reports support diagnostic interpretation in dementia: A multi-rater, clinical accuracy study. Eur. Radiol. 2021, 31, 5312–5323. [Google Scholar] [CrossRef]
Han, J.W.; Kim, T.H.; Jhoo, J.H.; Park, J.H.; Kim, J.L.; Ryu, S.H.; Moon, S.W.; Choo, I.H.; Lee, H.; Yoon, J.C.; et al. A normative study of the Mini-Mental State Examination for Dementia Screening (MMSE-DS) and its short form (SMMSE-DS) in the Korean elderly. J. Korean Geriatr. Psychiatry 2010, 14, 27–37. [Google Scholar]
Fernandes, B.; Goodarzi, Z.; Holroyd-Leduc, J. Optimizing the diagnosis and management of dementia within primary care: A systematic review of systematic reviews. BMC Fam. Pract. 2021, 22, 1–17. [Google Scholar] [CrossRef]
Ranson, J.M.; Bucholc, M.; Lyall, D.; Newby, D.; Winchester, L.; Oxtoby, N.P.; Veldsman, M.; Rittman, T.; Marzi, S.; Skene, N.; et al. Harnessing the potential of machine learning and artificial intelligence for dementia research. Brain Inform. 2023, 10, 6. [Google Scholar] [CrossRef] [PubMed]
Kim, J.C.; Chung, K. Mining health-risk factors using PHR similarity in a hybrid P2P network. Peer Peer Netw. Appl. 2018, 11, 1278–1287. [Google Scholar] [CrossRef]
Masoumian Hosseini, M.; Masoumian Hosseini, S.T.; Qayumi, K.; Hosseinzadeh, S.; Sajadi Tabar, S.S. Smartwatches in healthcare medicine: Assistance and monitoring: A scoping review. BMC Med. Inform. Decis. Mak. 2023, 23, 248. [Google Scholar] [CrossRef] [PubMed]
Shajari, S.; Kuruvinashetti, K.; Komeili, A.; Sundararaj, U. The emergence of AI-based wearable sensors for digital health technology: A review. Sensors 2023, 23, 9498. [Google Scholar] [CrossRef] [PubMed]
Oikonomou, E.K.; Khera, R. Machine learning in precision diabetes care and cardiovascular risk prediction. Cardiovasc. Diabetol. 2023, 22, 259. [Google Scholar] [CrossRef]
Lee, K.; Lee, J.; Hwang, S.; Kim, Y.; Lee, Y.; Urtnasan, E.; Koh, S.B.; Youk, H. Diffusion of a Lifelog-Based Digital Healthcare Platform for Future Precision Medicine: Data Provision and Verification Study. J. Pers. Med. 2022, 12, 803. [Google Scholar] [CrossRef]
Bhattacharya, S.; Maddikunta, P.K.R.; Hakak, S.; Khan, W.Z.; Bashir, A.K.; Jolfaei, A.; Tariq, U. Ant lion resampling based deep neural network model for classification of imbalanced multimodal stroke dataset. Multimedia Tool. Appl. 2020, 81, 1–25. [Google Scholar]
Xu, M.; Yoon, S.; Fuentes, A.; Park, D.S. A comprehensive survey of image augmentation techniques for deep learning. Pattern Recognit. 2023, 137, 109347. [Google Scholar] [CrossRef]
Wong, S.C.; Gatt, A.; Stamatescu, V.; McDonnell, M.D. Understanding data augmentation for classification: When to warp? In Proceedings of the International Conference on Digital Image Computing: Techniques and Applications (DICTA), Gold Coast, Australia, 30 November–2 December 2016; pp. 1–6. [Google Scholar] [CrossRef]
Bishop, C.M.; Nasrabadi, N.M. Pattern Recognition and Machine Learning; Springer: New York, USA, 2006; Volume 4, No. 4; p. 738. [Google Scholar]
Pedregosa, F.; Varoquaux, G.; Gramfort, A.; Michel, V.; Thirion, B.; Grisel, O.; Blondel, M.; Prettenhofer, P.; Weiss, R.; Dubourg, V.; et al. Scikit-learn: Machine learning in Python. J. Mach. Learn. Res. 2011, 12, 2825–2830. [Google Scholar]
Shigemizu, D.; Akiyama, S.; Asanomi, Y.; Boroevich, K.A.; Sharma, A.; Tsunoda, T.; Matsukuma, K.; Ichikawam, M.; Sudo, H.; Takizawa, S.; et al. Risk prediction models for dementia constructed by supervised principal component analysis using miRNA expression data. Commun. Biol. 2019, 2, 77. [Google Scholar] [CrossRef]
Mo, S.; Zhou, Z.; Li, Y.; Hu, X.; Ma, X.; Zhang, L.; Cai, S.; Peng, J. Establishment and validation of a novel nomogram incorporating clinicopathological parameters into the TNM staging system to predict prognosis for stage II colorectal cancer. Cancer Cell Int. 2020, 20, 1–13. [Google Scholar] [CrossRef] [PubMed]
Joshi, R.D.; Dhakal, C.K. Predicting type 2 diabetes using logistic regression and machine learning approaches. Int. J. Environ. Res. Public Health 2021, 18, 7346. [Google Scholar] [CrossRef] [PubMed]
Sarica, A.; Cerasa, A.; Quattrone, A. Random forest algorithm for the classification of neuroimaging data in Alzheimer’s disease: A systematic review. Front. Aging Neurosci. 2017, 9, 329. [Google Scholar] [CrossRef] [PubMed]
Wongvibulsin, S.; Wu, K.C.; Zeger, S.L. Clinical risk prediction with random forests for survival, longitudinal, and multivariate (RF-SLAM) data analysis. BMC Med. Res. Methodol. 2020, 20, 1–14. [Google Scholar] [CrossRef]
Park, C.; Jang, J.W.; Joo, G.; Kim, Y.; Kim, S.; Byeon, G.; Park, S.W.; Kasani, P.H.; Yum, S.; Pyun, J.; et al. Predicting progression to dementia with “comprehensive visual rating scale” and machine learning algorithms. Front. Neurol. 2022, 13, 906257. [Google Scholar] [CrossRef]
Sinha, B.B.; Ahsan, M.; Dhanalakshmi, R. LightGBM empowered by whale optimization for thyroid disease detection. Int. J. Inf. Technol. 2023, 15, 2053–2062. [Google Scholar] [CrossRef]
Vemuri, P.; Gunter, J.L.; Senjem, M.L.; Whitwell, J.L.; Kantarci, K.; Knopman, D.S.; Boeve, B.F.; Petersen, R.C.; Jack, C.R., Jr. Alzheimer’s disease diagnosis in individual subjects using structural MR images: Validation studies. Neuroimage 2008, 39, 1186–1197. [Google Scholar] [CrossRef]
Sarra, R.R.; Dinar, A.M.; Mohammed, M.A.; Abdulkareem, K.H. Enhanced heart disease prediction based on machine learning and χ² statistical optimal feature selection model. Designs 2022, 6, 87. [Google Scholar] [CrossRef]
Chawla, N.V.; Bowyer, K.W.; Hall, L.O.; Kegelmeyer, W.P. SMOTE: Synthetic minority over-sampling technique. J. Artif. Intell. Res. 2002, 16, 321–357. [Google Scholar] [CrossRef]
Yoon, J.; Lee, S.Y.; Lee, J.Y. AI somatotype system using 3D body images: Based on Deep-Learning and transfer learning. Appl. Sci. 2024, 14, 2608. [Google Scholar] [CrossRef]
Lee, S.Y.; Park, J.H.; Yoon, J.; Lee, J.Y. A validation study of a deep learning-based doping drug text recognition system to ensure safe drug use among athletes. Healthcare 2023, 11, 1769. [Google Scholar] [CrossRef] [PubMed]
Kim, K.; Jang, J.; Park, H.; Jeong, J.; Shin, D.; Shin, D. Detecting abnormal behaviors in dementia patients using lifelog data: A machine learning approach. Information 2023, 14, 433. [Google Scholar] [CrossRef]
Hegde, N.; Muralidhara, S.; Ashoka, D.V. A low-cost and autonomous tracking device for Alzheimer’s patients. J. Enabling Technol. 2019, 13, 201–211. [Google Scholar] [CrossRef]

Figure 1. Analysis procedure.

Table 1. Dataset used in this study.

Classification	CN	High-Risk Group for Dementia (MCI + Dem)
Train	85	56
Valid	26	7

Table 2. Description of gait and sleep lifelog data.

Classification	Gait Data		Sleep Data
Classification	Variable	Description	Variable	Description
1	activity_average_met	Average Daily Physical Activity Intensity	sleep_awake	Wake Time
2	activity_cal_active	Daily Activity Calories	sleep_breath_average	Average Respiratory Rate per Minute
3	activity_cal_total	Total Daily Calorie Expenditure	sleep_deep	Deep Sleep Time
4	activity_daily_movement	Daily Distance Moved	sleep_duration	Total Sleep Time
5	activity_high	High-Intensity Activity Duration	sleep_efficiency	Sleep Efficiency
6	activity_inactive	Inactivity Duration	sleep_hr_average	Average Heart Rate per Minute
7	activity_inactivity_alerts	Inactivity Alarm Frequency	sleep_hr_lowest	Low Heart Rate per Minute
8	activity_low	Low-Intensity Activity Duration	sleep_is_longest	Confirmed Sleep Presence
9	activity_medium	Moderate-Intensity Activity Duration	sleep_light	Light Sleep Time
10	activity_met_min_high	Daily High-Intensity Physical Activity Intensity	sleep_midpoint_at_delta	Sleep Midpoint Time (Delta)
11	activity_met_min_inactive	Daily Inactivity Physical Activity Intensity	sleep_midpoint_time	Sleep Midpoint Time
12	activity_met_min_low	Daily Low-Intensity Physical Activity Intensity	sleep_onset_latency	Sleep Latency
13	activity_met_min_medium	Daily Moderate-Intensity Physical Activity Intensity	sleep_period_id	Sleep Identification ID
14	activity_non_wear	Non-wear Duration	sleep_rem	REM Sleep Duration
15	activity_rest	Rest Duration	sleep_restless	Toss and Turn Rate
16	activity_score	Activity Score	sleep_rmssd	Average Heart Rate Variability
17	activity_score_meet_daily_targets	Activity Goal Achievement Score	sleep_score	Overall Sleep Score
18	activity_score_move_every_hour	Hourly Activity Maintenance Score	sleep_score_alignment	Sleep Timing Score
19	activity_score_recovery_time	Recovery Time Score	sleep_score_deep	Deep Sleep Score
20	activity_score_stay_active	Activity Maintenance Score	sleep_score_disturbances	Sleep Disturbance Score
21	activity_score_training_frequency	Exercise Frequency Score	sleep_score_efficiency	Sleep Efficiency Score
22	activity_score_training_volume	Exercise Score Training Volume	sleep_score_latency	Sleep Latency Score
23	activity_steps	Daily Step Count	sleep_score_rem	REM Sleep Score
24	activity_total	Total Activity Duration (minutes)	sleep_score_total	Sleep Duration Contribution Score
25	-	-	sleep_temperature_delta	Skin Temperature Deviation (Delta)
26	-	-	sleep_temperature_deviation	Skin Temperature Deviation
27	-	-	sleep_total	Sleep Time

Table 3. Dataset utilized in this study after data augmentation.

Classification	CN	High-Risk Group for Dementia (MCI + Dem)
Train	1700	1120
Valid	26	7

Table 4. Optimal hyperparameters for each machine learning model.

Classification	Model	Hyperparameters
Classification	Model	Gait Data	Sleep Data
1	Logistic Regression	c = 0.01	c = 0.01
2	Random Forest	Max_depth = 15	Max_depth = 15
		n_estimators = 200	n_estimators = 50
		random_state = 2024	random_state = 2024
3	LightGBM	learning_rate = 0.01	learning_rate = 0.01
		n_estimators = 50	num_leaves = 15
		num_leaves = 15
4	Support Vector Machine Classification	c = 1	c = 1
		gamma = 0.01	gamma = 1
		probability = True	probability = True

Table 5. Results of prediction model training using original data (gait).

Model	Recall/Sensitivity	Precision	Specificity	Accuracy	AUC
Logistic Regression	0.429	1.000	1.000	0.879	0.621
Random Forest	0.429	0.429	0.846	0.758	0.734
LightGBM	0.429	1.000	1.000	0.879	0.643
Support Vector Machine Classification	0.429	1.000	1.000	0.879	0.681

Table 6. Results of prediction model training using original data (sleep).

Model	Recall/Sensitivity	Precision	Specificity	Accuracy	AUC
Logistic Regression	0.429	1.000	1.000	0.879	0.780
Random Forest	0.429	0.429	0.846	0.758	0.745
LightGBM	0.429	0.750	0.962	0.848	0.769
Support Vector Machine Classification	0.429	1.000	1.000	0.879	0.786

Table 7. Results of prediction model training using augmented data (gait).

Model	Recall/Sensitivity	Precision	Specificity	Accuracy	AUC
Logistic Regression	0.429	0.375	0.808	0.727	0.604
Random Forest	0.571	0.500	0.846	0.788	0.808
LightGBM	0.429	0.600	0.923	0.818	0.736
Support Vector Machine Classification	0.429	1.000	1.000	0.879	0.764

Table 8. Results of prediction model training using augmented data (sleep).

Model	Recall/Sensitivity	Precision	Specificity	Accuracy	AUC
Logistic Regression	0.571	0.571	0.885	0.818	0.802
Random Forest	0.571	0.500	0.846	0.788	0.734
LightGBM	0.000	0.000	1.000	0.788	0.544
Support Vector Machine Classification	0.429	1.000	1.000	0.879	0.786

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Article Metrics

Citations

Article Access Statistics

Journal Statistics

Multiple requests from the same IP address are counted as one view.