Next Article in Journal
A Label-Free Colorimetric Aptasensor for Flavokavain B Detection
Previous Article in Journal
Research on Fire Detection of Cotton Picker Based on Improved Algorithm
Previous Article in Special Issue
Toward Concurrent Identification of Human Activities with a Single Unifying Neural Network Classification: First Step
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Communication

Depression Recognition Using Daily Wearable-Derived Physiological Data

1
Department of Psychological and Cognitive Sciences, Tsinghua University, Beijing 100084, China
2
Beijing Huilongguan Hospital, Peking University Huilongguan Clinical Medical School, Beijing 100096, China
*
Authors to whom correspondence should be addressed.
Sensors 2025, 25(2), 567; https://doi.org/10.3390/s25020567
Submission received: 11 December 2024 / Revised: 15 January 2025 / Accepted: 17 January 2025 / Published: 19 January 2025
(This article belongs to the Special Issue Wearable Technologies and Sensors for Healthcare and Wellbeing)

Abstract

:
The objective identification of depression using physiological data has emerged as a significant research focus within the field of psychiatry. The advancement of wearable physiological measurement devices has opened new avenues for the identification of individuals with depression in everyday-life contexts. Compared to other objective measurement methods, wearables offer the potential for continuous, unobtrusive monitoring, which can capture subtle physiological changes indicative of depressive states. The present study leverages multimodal wristband devices to collect data from fifty-eight participants clinically diagnosed with depression during their normal daytime activities over six hours. Data collected include pulse wave, skin conductance, and triaxial acceleration. For comparison, we also utilized data from fifty-eight matched healthy controls from a publicly available dataset, collected using the same devices over equivalent durations. Our aim was to identify depressive individuals through the analysis of multimodal physiological measurements derived from wearable devices in daily life scenarios. We extracted static features such as the mean, variance, skewness, and kurtosis of physiological indicators like heart rate, skin conductance, and acceleration, as well as autoregressive coefficients of these signals reflecting the temporal dynamics. Utilizing a Random Forest algorithm, we distinguished depressive and non-depressive individuals with varying classification accuracies on data aggregated over 6 h, 2 h, 30 min, and 5 min segments, as 90.0%, 84.7%, 80.1%, and 76.0%, respectively. Our results demonstrate the feasibility of using daily wearable-derived physiological data for depression recognition. The achieved classification accuracies suggest that this approach could be integrated into clinical settings for the early detection and monitoring of depressive symptoms. Future work will explore the potential of these methods for personalized interventions and real-time monitoring, offering a promising avenue for enhancing mental health care through the integration of wearable technology.

1. Introduction

Depression is a common and serious mental disorder leading to continuous effects on patients’ emotional states and daily activities. According to the World Health Organization, over 300 million people are suffering from major depression disorder [1], making it a leading cause of disability and largely aggravating the global disease burden. To handle this worldwide issue, global researchers have introduced technological innovations in the assessment, diagnosis, and management of depression disorders for better mental healthcare. Specifically, providing objective and quantitative information from depression patients was important for improving the accuracy of diagnosis [2,3].
The objective identification of depression using physiological data has emerged as a promising research focus within the field of psychiatry. Researchers have started using brain imaging techniques like fMRI and EEG in resting-state studies and have found the possible neural signatures of depression [4,5]. Task-based studies further extend the paradigm by regarding lab-based tasks, such as video watching, audio listening, etc., as a simplified daily life scenario [6,7,8]. These advancements highlight the significance of exploring the underlying depression-related biomarkers for a more accurate and reliable depression diagnosis. However, the relatively expensive cost and complex settings largely limited the application of current methods. Therefore, a more convenient and low-cost alternative is needed to promote personalized interventions and real-time monitoring in large populations.
Recently, the wide use of wearable devices has opened new possibilities for objective depression diagnosis. About 21% of Americans reported using a smartwatch or fitness tracker in daily life, and the number continues to grow [9]. Usually designed to be low-burden for users, wearable devices can non-invasively measure long-period, real-life data and, therefore, achieve higher ecological validity to monitor mental states compared with lab-based brain imaging techniques [10,11,12]. Using commercial smartwatches, previous research has explored the feasibility of individual trait assessment based on daily heart rate recordings, including personality, mental stress, etc. [13,14]. Furthermore, measurements from wearable devices can provide insights into individuals’ daily states, offering a more comprehensive view of the daily states of depression patients [15]. The latest studies have extended the wearable computing method to clinical use to distinguish patients with depression from those with other diseases [16]. These research studies usually focus on one major physiological indicator such as daily steps [17,18], activity at night (from acceleration) [19], pulse rate variability (from photoplethysmogram, PPG) [20], etc.
Integrating multimodal physiological signals could use the differential information of signals and improve the result of depression assessment [21]. Specifically, multimodal studies bring extra dimensions for insight-enhancing analysis and diversity in proportion to the increase in unique modalities because they are believed to provide related but differential information about mental health [22]. The latest research further highlights the combination of heart rate, skin conductance, and acceleration in depression classification analysis based on daily wearable data [22]. The cardiac responses such as heart rate, as well as skin conductance, are influenced by the autonomic nervous system (ANS) activity stimulated by depression-related emotions, acute stress, etc. [23,24]. The empirical study describes how bodily balance and posture quality degrade with depressive symptoms getting worse and shows the relation between mental health and motion during daily activities [25].
The present study aims to explore the feasibility of depression recognition by extracting multimodal physiological signals derived from wearable devices in daily life scenarios. We hypothesize that individuals’ depression disorder is correlated with their multimodal physiological features. Three major objectives of the present study are summarized as follows: first, to promote the clinical application of objective depression detection, we included clinically diagnosed depressed participants, which was one important step further as compared with previous research mainly with the healthy population [22]; second, to evaluate the feasibility of rapid detection based on low-burden wrist-worn devices, we explored classifications using data with varied durations. Considering the constraints of measurement duration and complexity in settings such as outpatient clinics, this is expected to enhance the potential applicability of the proposed method in clinical scenarios [26,27]. Last, to fully exploit the temporal dynamics of physiological features in depression detection [28], we investigated the effectiveness of dynamic features for classification and compared it to classical static features.
To this end, a daily wearable-derived experiment via a multimodal wristband was conducted to investigate the physiological representation of depression. A custom-designed wristband was used to record physiological data over six-hour daily activities from fifty-eight depression patients. Fifty-eight healthy individuals from a published dataset were selected as healthy controls, with a matched recording length, modality, and gender ratio. To compare the physiological changes between the two groups, we extracted static features such as the mean, variance, skewness, and kurtosis of physiological indicators including heart rate, skin conductance, and acceleration, as well as dynamic features including the autoregressive coefficients reflecting the temporal dynamic of these signals [28]. We adopted representative pattern recognition algorithms frequently used in the previous literature to construct discriminative models for identifying depressed individuals. Furthermore, to demonstrate the application potential of our method in clinical assessment scenarios, we explored shorter data segments by distinguishing depressive individuals on data aggregated over 6 h, 2 h, 30 min, and 5 min segments.

2. Materials and Methods

2.1. Data Preparation

Our data were composed of 58 depression patients and 58 healthy controls. The patients in our experiment were from the Beijing Huilongguan Hospital’s outpatient department between 2021 and 2022. These patients, aged from 18 to 50 years, were diagnosed by professional psychiatrists at Beijing Huilongguan Hospital based on comprehensive clinical assessments. The physiological data of healthy controls were from a published DAPPER dataset [29], with the same wearable device, group size, and gender ratio matched to the patient group. Specifically, the DAPPER dataset included a five-day recording, and we adopted data from the first day to avoid confounding factors such as device familiarity differing from the patient group.
Each participant was instructed to conduct multimodal physiological measurements during their normal daytime activities from 9:00 to 15:00 for one day by wearing a custom-designed wristband (Psychorus, HuiXin, Beijing, China), as shown in Figure 1a. The wristband, which has been used in several previous studies with daily contexts [29,30,31,32,33], was able to record the signals of acceleration (ACC), skin conductance (SC), and PPG at sampling rates of 20 Hz, 40 Hz, and 20 Hz, respectively, as determined by the device’s built-in hardware. The recorded PPGs were then computed to obtain heart rate (HR) at a 1 Hz sampling rate by an implemented HuiXin software package (version 201708) based on a joint sparse spectrum reconstruction algorithm, which has robust performance against daily activity artifacts [32,33]. Missing data (e.g., unreliable contact) were recognized by the wristband built-in function and marked as a low HR value (HR = 40) [29].

2.2. Feature Extraction

Four static features—mean, variance, skewness, and kurtosis—were extracted from each participant to characterize the statistical properties of the physiological indicators, including HR, SC, and ACC across a given data segment. We did not further decompose the SC signal into its tonic and phasic components, mainly for the consideration of the reliability of the decomposition of noisy wearable SC signals in daily contexts [34]. The mean and variance provided insights into the average level and variability of physiological data segments. Skewness quantified the asymmetry of the probability distribution of these features, while kurtosis described the “tailedness” of the distribution or the degree of outliers relative to the central tendency [35]. The inclusion of these higher-order statistical features aligns with previous studies on human emotion recognition and disease diagnosis [36,37]. Prior to the static feature extraction, to ensure relative uniformity among different signal modalities, the SC and ACC signals were down-sampled to match the 1 Hz sampling rate of the HR signal from the wristband [38]. Feature extraction was applied to each data segment, such as the whole 6 h experiment or shorter durations (see Section 2.3). The data segments with more than 50% missing data were excluded in feature extraction and further analysis.
Dynamic features were further extracted to characterize the temporal dynamics of the data segments. To this end, the autoregressive (AR) coefficients of the signals were employed [34]. Specifically, the coefficients were derived from AR models, as depicted in (1), where Xt denotes the signal at time t, ai denotes the coefficient at time lag i, and εt denotes the random error.
X t = i = 1 k a i X t i + ε t
The order k of the AR models represented the number of immediately preceding signals used to predict the present physiological signals. We determined an optimal order by calculating the Akaike Information Criterion (AIC) via “aic” function in MATLAB R2022a, which could minimize predicting errors. Specifically, a lower AIC value represented a better fit for the AR model [39]. The AR models were separately computed for ACC, SC, and HR data, as well as for segments of different durations. The AIC values with model orders from 2 to 8 (corresponding to 2 to 8 s at 1 Hz sampling rate) were calculated separately for each participant and averaged across participants to explore the optimal model while controlling the number of features. The model coefficients ai from the optimal k-order per AR model were extracted as the dynamic features.

2.3. Statistical Analysis and Classification

The statistical differences of physiological features between the two groups were first computed. Specifically, independent sample t-tests were conducted to assess the gender differences for each physiological dimension. The p-values in these multiple comparisons were then re-calculated by the false discovery rate (FDR) method [40], and the corrected p-values (pFDR) were used as the indicator of statistical significance in t-tests. Then, we analyzed the statistical differences in physiological features across demographic variables. Independent sample t-tests were also conducted between two gender groups (FDR corrected). Additionally, Spearman correlation analyses were performed to assess the relationship between the age of patients and their physiological features (FDR corrected).
Representative classification models were employed to discriminate the participants’ depression results based on these features. Specifically, the following classification models were used according to their promising performance in previous studies focusing on depression recognition, as reported in the existing literature [41], including Random Forest (RF), Support Vector Machine (SVM), K-Nearest Neighbors (KNN), and Linear Discriminant Analysis (LDA). The Random Forest method was employed with the tree number set to 100; the Support Vector Machine method was used with a Gaussian kernel; the K-Nearest Neighbors method was used with the number of nearest neighbors set to 1; the standard Linear Discriminant Analysis was employed with no required hyperparameters. A five-fold cross-participant validation procedure was conducted by circularly splitting the training and testing set, with an equal number of patients and normal people in each set. Each model was trained based on a random 4 out of 5 selection of all participants (92 participants) and tested on the remaining 20% (24 participants) for five-time iterations. The training and test processes were conducted 1000 times. To obtain the final classification accuracies, the predicted accuracies of all iterations were averaged. All the classification models were implemented by using the Statistic and Machine Learning Toolbox in MATLAB R2022a.
To explore the feature contributions in classification models, we adopted different combinations of physiological features in Section 2.2, including (1) all features, including static and dynamic features of three modalities (denoted as ALL); (2) the non-acceleration signal, including static and dynamic features of HR and SC signals (Non-ACC); (3) static features of three modalities (Static); (4) dynamic features of three modalities (Dynamic); (5) dynamic features of non-acceleration signal (Non-ACC Dynamic). To quantificationally explore the statistical significance of the classification results, we compared their classification accuracies with those from random models. The random model was trained with the same feature sets but tested with randomly shuffled depression labels for 1000 times.
To further explore the model stability with shorter data lengths for better clinical applications, we explored the classification models on data aggregated over 6 h, 2 h, 30 min, and 5 min segments, respectively. For the latter three conditions, segments of 2 h, 30 min, and 5 min were randomly selected from the 6 h recording data of each participant in order to avoid potential biases that might be introduced by selecting specific moments. The randomized selections were conducted 1000 times to obtain a distribution of accuracies. We performed feature extraction on all the time segments, including the calculation of both the static and dynamic features. The participants without valid data segments for a specific data duration were excluded from the corresponding classification analysis. For instance, if one participant did not have a valid 2 h data segment after the missing data operation, this participant was excluded from the 2 h classification; however, the same participant may be included for the 5 min classification if he or she had valid 5 min segments.

3. Results

In this study, we conducted 6 h recordings from 116 participants, adding up to 696 h recordings (116 participants × 6 h per participant). There were 488.4 h of valid data retained after excluding missing data, with each participant’s valid data proportion ranging from 2.5 (41.7%) to 6.0 (100%) hours. After excluding participants without enough valid data, there were 116, 116, 116, and 112 participants retained in 5 min, 30 min, 2 h, and 6 h conditions in the subsequent individual-level classification analysis.
The across-participant-averaged AIC values for fitting physiological signals with AR models are presented in Figure 2. Based on a comprehensive evaluation of the results from all AR models, order 3 has been empirically selected for feature extraction as it represents a balanced trade-off between the number of features and model performance. Therefore, the following analysis adopts a three-order autoregressive model in the AR feature extractions, and the three-dimensional AR features are denoted as AR1, AR2, and AR3, respectively.
By conducting a Spearman correlation analysis between the age of patients and all physiological features, only one feature (the standard deviation of heart rate) shows a significant correlation (r = 0.41, p = 0.016) after FDR correction. The independent sample t-test of each physiological feature between gender groups is not significant (lowest p = 0.39, FDR corrected).
As shown in Table 1, the recording samples from depression patients have significantly lower AR1 (t(115) = −2.89, p = 0.022, FDR corrected) and higher AR2 (t(115) = 2.78, p = 0.030) out of HR features than healthy controls. This was identified by conducting independent sample t-test, as well as lower kurtosis (t(115) = −3.17, p = 0.001) and lower AR1 (t(115) = −3.69, p < 0.001), of SC features. The depression samples also showed significantly lower mean value (t(115) = −3.10, p = 0.007), higher skewness value (t(115) = 2.75, p = 0.021), and lower AR coefficients (AR1: t(115) = −5.82, p = 0.001; AR2: t(115) = −9.15, p < 0.001; AR3: t(115) = −5.99, p = 0.001) of the acceleration signal.
Table 2 shows the binary classification accuracies from common pattern recognition models. The RF model finally reports the highest accuracy (90.0 ± 1.7%) with the input of 6 h physiological data, significantly higher than the Random condition (49.7 ± 5.1%). As the input data length decreases from 6 h to 5 min, the classification accuracy declines from 90.0 ± 1.7% to 76.0 ± 3.5%. The other three models also obtain significantly higher results (SVM: 74.1 ± 2.2%; LDA: 81.5 ± 2.3%; KNN: 76.1 ± 2.0%) than corresponding Random conditions.
Table 3 depicts the classification accuracies of the RF model when adopting different input features. The result of the Dynamic feature condition with the 6 h data input is close to the All-feature (ALL) condition, which combines Dynamic and Static features (90.0 ± 1.7% vs. 89.3 ± 1.2%, p > 0.05), and is significantly higher than the Static condition (p < 0.05). The prediction results of Non-ACC models are lower than ALL condition but significantly higher than Random condition (Non-ACC: 80.5 ± 2.4%; Non-ACC Dynamic: 78.1 ± 2.3%). Figure 3 depicts the distribution of classification accuracies in the ALL-feature condition. While the models with 6 h data achieved the highest accuracy (p < 0.05), the models based on 5 min data could also significantly distinguish depressive individuals from healthy controls.

4. Discussion

In the present study, we conducted a single-day wearable-derived recording via a multimodal wristband to investigate the physiological representation of depression for objective depression recognition. Our statistical analysis obtained significant correlations between depression and multiple physiological features. By extracting static and dynamic features from three modalities (ACC, SC, and HR), our classification results reported significant accuracy (90.0 ± 1.7%) based on machine learning models in the prediction of whether individuals were depression patients or healthy individuals, demonstrating the feasibility of introducing daily wearable recording to assist clinical depression diagnosis.
The findings that dynamic features obtained superior performance over static features extended the application values of conducting temporal feature extraction in daily recordings. While existing studies have explored the association between mental health and the temporal dynamic of human emotion states [42], our results further demonstrated that the dynamic feature of physiological signals can also make sense in mental health studies, which was not only supported by more significant between-group statistical differences but also by the higher accuracy of dynamic feature-based models in the classification analysis. Additionally, the effect of demographic factors such as age and gender was preliminarily investigated because older adults may exhibit distinct clinical characteristics and biological manifestations of depression from younger individuals [43,44]. However, our statistical analysis did not observe significant variations in physiological features across age and gender variables; therefore, we adopted the whole group into classification models. Nevertheless, considering the limited sample size, future research should conduct quantitative analyses with larger-scale samples to address these potential confounding factors.
The classification results based on distinct combinations of wearable physiological features further indicate that using multimodal measurement data could better reflect the daily representation of depression. Firstly, the subset of physiological modalities was also correlated with depression. While previous research studies have reported that individuals’ daily activity patterns represented by the fluctuation of acceleration were associated with depression symptoms [45,46], the statistical test reports significant differences not only in the acceleration modal but also in the Non-ACC signals (including PPG and SC), corresponding with the fact that Non-ACC models could also obtain significant results in the binary classification. This is possibly due to the less effective control of cognitive function in the depression condition, which reflects the top-down effect from the pre-frontal cortex to the ANS system [47], while it is also possible that depressed states are associated with distinct emotional experience (such as negative emotionality and lower emotional arousal) and corresponding physiological changes in the cardiovascular system [48]. Secondly, as proven by previous research [22], using multimodal data could significantly improve the performance of depression recognition, supported by the highest accuracy obtained in the ALL condition. This result may inspire further exploration of more advanced devices with more physiological modalities in depression recognition studies [49].
Our results across machine learning models and data durations further demonstrated that our approach is promising to be extended to clinical application scenarios. Firstly, while four machine learning models have obtained significant predictive results, the RF model achieves the highest accuracy in our results, which is higher than other EEG-based recognition results [50,51] and peripheral physiological studies in laboratory settings [52]. Considering the widespread use and predictive performance of RF in existing research [53], future studies could explore more advanced forms of RF models to achieve better recognition accuracy, such as applying Weighed Random Forest, Random Forest Artificial Neural Network (RF-ANN), etc. [41,54]. Moreover, while existing studies embracing machine learning algorithm ensembles have reported superior results than single algorithms [55,56], it is preferable to investigate the combination of multiple models in classification analysis. Secondly, the classification accuracy of 76.0% based on a 5 min short-duration measurement showed great clinical potential for fast and reliable depression diagnosis, significantly reducing the burden on both patients and healthcare providers. This finding highlights the feasibility of using short recordings for objective assessments. One possible explanation is the long-term influence of a depressed mood, which may create lasting physiological signatures detectable within a brief measurement. Another explanation is that individuals with depression exhibit distinct activity patterns compared to healthy controls. Future research should delve into these underlying factors to further advance wearable technologies for depression recognition. This result could be due to the long-term influence of the depressed mood; another explanation is that individuals diagnosed with depression share distinct activity patterns from normal healthy people. Further studies should explore the underlying factors of the significant result from short-duration recordings to better promote wearable depression recognition.
There are some limitations that should be noted for further studies besides the sample size and machine learning models. Firstly, the generalizability of this method may be limited by one-day measurement. Since individuals’ depression states could fluctuate in daily life [57], it is more appropriate to collect longitudinal wearable data to investigate the temporal association between mental health and physiological indicators that contributes to daily mental health monitoring. Secondly, individuals’ physiological signals have been known to vary substantially across time [58], which might influence the accuracy of classification models. While our findings have demonstrated the robustness of the proposed method for the classification of randomly selected time segments, it is necessary to conduct a multi-day recording to further evaluate the long-term stability of the proposed method across days, weeks, or longer. Thirdly, the feature extraction strategy in this study was relatively conservative, focusing on classical features from these physiological signals. A further exploration of feature extraction could be beneficial for depression recognition, including the analysis of the raw PPG signals [59], the decomposition of skin conductance components [60], and more advanced neural-network-based analyses [61,62]. Another limitation is that the result was restricted to the binary classification of depression or health. Considering the current experience of digital phenotype in depression studies [63], this method could be similarly extended further and output more detailed information, such as the degree of depression.
In sum, our results demonstrate the feasibility of using wearable devices for depression recognition through a lightweight wristband that is representative of the capability of state-of-the-art lightweight devices. The continuous recording of physiological data throughout the daily events allows for a more comprehensive understanding of individual variations in depression-specific representations [57], which could further contribute to more suitable mental health care through the integration of wearable technology, such as personalized intervention, real-time monitoring, etc.

Author Contributions

Methodology, H.X.; formal analysis, X.S.; writing—original draft prepara-tion, X.S.; writing—review and editing, D.Z. and S.T. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the National Natural Science Foundation of China (T2341003, 61977041, and 62061136001), the National Key Research and Development Plan (2024QY2401), the Medical and Health Technology Innovation Project of Chinese Academy of Medical Sciences (2020-I2M-2-009 and 2021-I2M-1-023), the Capital’s Funds for Health Improvement and Research (grant number 2022-1G-2132), the Beijing Municipal Science and Technology Commission Grant (Z191100006619104), and Beijing Natural Science Foundation (7202072).

Institutional Review Board Statement

The study was conducted in accordance with the Decla-ration of Helsinki, and approved by the Beijing Huilongguan Hospital Ethics Committee (protocol code 2020-35-Ke, date 23 October 2020).

Informed Consent Statement

Informed consent was obtained from all subjects involved in the study.

Data Availability Statement

The raw data supporting the conclusions of this article will be made available by the authors on request.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Chodavadia, P.; Teo, I.; Poremski, D.; Fung, D.S.S.; Finkelstein, E.A. Prevalence and economic burden of depression and anxiety symptoms among Singaporean adults: Results from a 2022 web panel. BMC Psychiatry 2023, 23, 104. [Google Scholar] [CrossRef] [PubMed]
  2. Solomon, C.; Valstar, M.F.; Morriss, R.K.; Crowe, J. Objective methods for reliable detection of concealed depression. Front. ICT 2015, 2, 5. [Google Scholar] [CrossRef]
  3. Barua, P.D.; Vicnesh, J.; Lih, O.S.; Palmer, E.E.; Yamakawa, T.; Kobayashi, M.; Acharya, U.R. Artificial intelligence assisted tools for the detection of anxiety and depression leading to suicidal ideation in adolescents: A review. Cogn. Neurodyn. 2024, 18, 1–22. [Google Scholar] [CrossRef]
  4. Abdullah, S.; Abosuliman, S.S. Analyzing of optimal classifier selection for EEG signals of depression patients based on intelligent fuzzy decision support systems. Sci. Rep. 2023, 13, 11425. [Google Scholar] [CrossRef]
  5. Acharya, U.R.; Sudarshan, V.K.; Adeli, H.; Santhosh, J.; Koh, J.E.; Puthankatti, S.D.; Adeli, A. A Novel Depression Diagnosis Index Using Nonlinear Features in EEG Signals. Eur. Neurol. 2015, 74, 79–83. [Google Scholar] [CrossRef] [PubMed]
  6. Li, W.; Wu, C.; Hu, X.; Chen, J.; Fu, S.; Wang, F.; Zhang, D. Quantitative personality predictions from a brief EEG recording. IEEE Trans. Affect. Comput. 2020, 13, 1514–1527. [Google Scholar] [CrossRef]
  7. Li, W.; Hu, X.; Long, X.; Tang, L.; Chen, J.; Wang, F.; Zhang, D. EEG responses to emotional videos can quantitatively predict big-five personality traits. Neurocomputing 2020, 415, 368–381. [Google Scholar] [CrossRef]
  8. Min, K.; Yoon, J.; Kang, M.; Lee, D.; Park, E.; Han, J. Detecting depression on video logs using audiovisual features. Humanit. Soc. Sci. Commun. 2023, 10, 788. [Google Scholar] [CrossRef]
  9. Holko, M.; Litwin, T.R.; Munoz, F.; Theisz, K.I.; Salgin, L.; Jenks, N.P.; Holmes, B.W.; Watson-McGee, P.; Winford, E.; Sharma, Y. Wearable fitness tracker use in federally qualified health center patients: Strategies to improve the health of all of us using digital health devices. NPJ Digit. Med. 2022, 5, 53. [Google Scholar] [CrossRef] [PubMed]
  10. Nelson, B.W.; Low, C.A.; Jacobson, N.; Areán, P.; Torous, J.; Allen, N.B. Guidelines for wrist-worn consumer wearable assessment of heart rate in biobehavioral research. NPJ Digit. Med. 2020, 3, 90. [Google Scholar] [CrossRef] [PubMed]
  11. Morris, M.E.; Aguilera, A. Mobile, social, and wearable computing and the evolution of psychological practice. Prof. Psychol. Res. Pract. 2012, 43, 622–626. [Google Scholar] [CrossRef]
  12. Morales, A.; Barbosa, M.; Morás, L.; Cazella, S.C.; Sgobbi, L.F.; Sene, I.; Marques, G. Occupational stress monitoring using biomarkers and smartwatches: A systematic review. Sensors 2022, 22, 6633. [Google Scholar] [CrossRef] [PubMed]
  13. Shui, X.; Chen, Y.; Hu, X.; Wang, F.; Zhang, D. Personality in daily life: Multi-situational physiological signals reflect big-five personality traits. IEEE J. Biomed. Health Inform. 2023, 27, 2853–2863. [Google Scholar] [CrossRef] [PubMed]
  14. Lui, G.Y.; Loughnane, D.; Polley, C.; Jayarathna, T.; Breen, P.P. The apple watch for monitoring mental health–related physiological symptoms: Literature review. JMIR Ment. Health 2022, 9, e37354. [Google Scholar] [CrossRef]
  15. Price, G.D.; Heinz, M.V.; Song, S.H.; Nemesure, M.D.; Jacobson, N.C. Using digital phenotyping to capture depression symptom variability: Detecting naturalistic variability in depression symptoms across one year using passively collected wearable movement and sleep data. Transl. Psychiatry 2023, 13, 381. [Google Scholar] [CrossRef] [PubMed]
  16. Abd-Alrazaq, A.; AlSaad, R.; Shuweihdi, F.; Ahmed, A.; Aziz, S.; Sheikh, J. Systematic review and meta-analysis of performance of wearable artificial intelligence in detecting and predicting depression. NPJ Digit. Med. 2023, 6, 84. [Google Scholar] [CrossRef]
  17. Jung, D.; Kim, J.; Mun, K.-R. Identifying depression in the elderly using gait accelerometry. In Proceedings of the 2022 44th Annual International Conference of the IEEE Engineering in Medicine & Biology Society (EMBC), Glasgow, UK, 11–15 July 2022. [Google Scholar]
  18. Choi, J.-G.; Ko, I.; Han, S. Depression level classification using machine learning classifiers based on actigraphy data. IEEE Access 2021, 9, 116622–116646. [Google Scholar] [CrossRef]
  19. Rodríguez-Ruiz, J.G.; Galván-Tejada, C.E.; Zanella-Calzada, L.A.; Celaya-Padilla, J.M.; Galván-Tejada, J.I.; Gamboa-Rosales, H.; Luna-García, H.; Magallanes-Quintanar, R.; Soto-Murillo, M.A. Comparison of night, day and 24 h motor activity data for the classification of depressive episodes. Diagnostics 2020, 10, 162. [Google Scholar] [CrossRef] [PubMed]
  20. Valenza, G.; Citi, L.; Gentili, C.; Lanata, A.; Scilingo, E.P.; Barbieri, R. Characterization of depressive states in bipolar patients using wearable textile technology and instantaneous heart rate variability assessment. IEEE J. Biomed. Health Inform. 2014, 19, 263–274. [Google Scholar] [CrossRef] [PubMed]
  21. Khaleghi, A.; Shahi, K.; Saidi, M.; Babaee, N.; Kaveh, R.; Mohammadian, A. Linear and nonlinear analysis of multimodal physiological data for affective arousal recognition. Cogn. Neurodyn. 2024, 18, 2277–2288. [Google Scholar] [CrossRef]
  22. Ahmed, A.; Ramesh, J.; Ganguly, S.; Aburukba, R.; Sagahyroon, A.; Aloul, F. Evaluating multimodal wearable sensors for quantifying affective states and depression with neural networks. IEEE Sens. J. 2023, 23, 22788–22802. [Google Scholar] [CrossRef]
  23. Lin, H.P.; Lin, H.Y.; Lin, W.L.; Huang, A.C.W. Effects of stress, depression, and their interaction on heart rate, skin conductance, finger temperature, and respiratory rate: Sympathetic-parasympathetic hypothesis of stress and depression. J. Clin. Psychol. 2011, 67, 1080–1091. [Google Scholar] [CrossRef]
  24. Shui, X.; Lin, R.; Luo, Z.; Lin, B.; Mao, X.; Li, H.; Liu, R.; Zhang, D. Bodily electrodermal representations for affective computing. IEEE Trans. Affect. Comput. 2024, 15, 1018–1025. [Google Scholar] [CrossRef]
  25. Mitsue, S.; Yamamoto, T. Relationship between depression and movement quality in normal young adults. J. Phys. Ther. Sci. 2019, 31, 819–822. [Google Scholar] [CrossRef] [PubMed]
  26. Muzammel, M.; Salam, H.; Othmani, A. End-to-end multimodal clinical depression recognition using deep neural networks: A comparative analysis. Comput. Methods Programs Biomed. 2021, 211, 106433. [Google Scholar] [CrossRef] [PubMed]
  27. Ahmed, M.S.; Ahmed, N. A fast and minimal system to identify depression using smartphones: Explainable machine learning–based approach. JMIR Form. Res. 2023, 7, e28848. [Google Scholar] [CrossRef]
  28. Haderlein, J.F.; Peterson, A.D.; Burkitt, A.N.; Mareels, I.M.; Grayden, D.B. Autoregressive models for biomedical signal processing. In Proceedings of the 2023 45th Annual International Conference of the IEEE Engineering in Medicine & Biology Society (EMBC), Sydney, Australia, 24–27 July 2023. [Google Scholar]
  29. Shui, X.; Zhang, M.; Li, Z.; Hu, X.; Wang, F.; Zhang, D. A dataset of daily ambulatory psychological and physiological recording for emotion research. Sci. Data 2021, 8, 161. [Google Scholar] [CrossRef]
  30. He, C.; Xu, P.; Pei, X.; Wang, Q.; Yue, Y.; Han, C. Fatigue at the wheel: A non-visual approach to truck driver fatigue detection by multi-feature fusion. Accid. Anal. Prev. 2024, 199, 107511. [Google Scholar] [CrossRef] [PubMed]
  31. Yu, H.; Xu, M.; Xiao, X.; Xu, F.; Ming, D. Detection of dynamic changes of electrodermal activity to predict the classroom performance of college students. Cogn. Neurodyn. 2024, 18, 173–184. [Google Scholar] [CrossRef]
  32. Qu, Z.; Chen, J.; Li, B.; Tan, J.; Zhang, D.; Zhang, Y. Measurement of high-school students’ trait math anxiety using neurophysiological recordings during math exam. IEEE Access 2020, 8, 57460–57471. [Google Scholar] [CrossRef]
  33. Zhang, Y.; Qin, F.; Liu, B.; Qi, X.; Zhao, Y.; Zhang, D. Wearable neurophysiological recordings in middle-school classroom correlate with students’ academic performance. Front. Hum. Neurosci. 2018, 12, 457. [Google Scholar] [CrossRef]
  34. Doberenz, S.; Roth, W.T.; Wollburg, E.; Maslowski, N.I.; Kim, S. Methodological considerations in ambulatory skin conductance monitoring. Int. J. Psychophysiol. 2011, 80, 87–95. [Google Scholar] [CrossRef] [PubMed]
  35. Chua, K.C.; Chandran, V.; Acharya, U.R.; Lim, C.M. Application of higher order statistics/spectra in biomedical signals—A review. Med. Eng. Phys. 2010, 32, 679–689. [Google Scholar] [CrossRef] [PubMed]
  36. Shapiro, A.; Bradshaw, B.; Landes, S.; Kammann, P.; Bois De Fer, B.; Lee, W.-N.; Lange, R. A novel digital approach to describe real world outcomes among patients with constipation. NPJ Digit. Med. 2021, 4, 27. [Google Scholar] [CrossRef]
  37. Zhao, S.; Gholaminejad, A.; Ding, G.; Gao, Y.; Han, J.; Keutzer, K. Personalized emotion recognition by personality-aware high-order learning of physiological signals. ACM Trans. Multimed. Comput. Commun. Appl. 2019, 15, 14. [Google Scholar] [CrossRef]
  38. Pasquini, L.; Noohi, F.; Veziris, C.R.; Kosik, E.L.; Holley, S.R.; Lee, A.; Brown, J.A.; Roy, A.R.; Chow, T.E.; Allen, I. Dynamic autonomic nervous system states arise during emotions and manifest in basal physiology. Psychophysiology 2023, 60, e14218. [Google Scholar] [CrossRef] [PubMed]
  39. Portet, S. A primer on model selection using the Akaike Information Criterion. Infect. Dis. Model. 2020, 5, 111–128. [Google Scholar] [CrossRef] [PubMed]
  40. Benjamini, Y.; Hochberg, Y. Controlling the false discovery rate: A practical and powerful approach to multiple testing. J. R. Stat. Soc. Ser. B (Methodol.) 1995, 57, 289–300. [Google Scholar] [CrossRef]
  41. Bader, M.; Abdelwanis, M.; Maalouf, M.; Jelinek, H.F. Detecting depression severity using weighted random forest and oxidative stress biomarkers. Sci. Rep. 2024, 14, 16328. [Google Scholar] [CrossRef] [PubMed]
  42. van de Leemput, I.A.; Wichers, M.; Cramer, A.O.; Borsboom, D.; Tuerlinckx, F.; Kuppens, P.; Van Nes, E.H.; Viechtbauer, W.; Giltay, E.J.; Aggen, S.H. Critical slowing down as early warning for the onset and termination of depression. Proc. Natl. Acad. Sci. USA 2014, 111, 87–92. [Google Scholar] [CrossRef] [PubMed]
  43. Szymkowicz, S.M.; Gerlach, A.R.; Homiack, D.; Taylor, W.D. Biological factors influencing depression in later life: Role of aging processes and treatment implications. Transl. Psychiatry 2023, 13, 160. [Google Scholar] [CrossRef] [PubMed]
  44. Wagner, S.; Wollschlaeger, D.; Dreimueller, N.; Engelmann, J.; Herzog, D.P.; Roll, S.C.; Tadić, A.; Lieb, K. Effects of age on depressive symptomatology and response to antidepressant treatment in patients with major depressive disorder aged 18 to 65 years. Compr. Psychiatry 2020, 99, 152170. [Google Scholar] [CrossRef] [PubMed]
  45. Smagula, S.F.; Zhang, G.; Gujral, S.; Covassin, N.; Li, J.; Taylor, W.D.; Reynolds, C.F.; Krafty, R.T. Association of 24-hour activity pattern phenotypes with depression symptoms and cognitive performance in aging. JAMA Psychiatry 2022, 79, 1023–1031. [Google Scholar] [CrossRef] [PubMed]
  46. Tian, Y.E.; Di Biase, M.A.; Mosley, P.E.; Lupton, M.K.; Xia, Y.; Fripp, J.; Breakspear, M.; Cropley, V.; Zalesky, A. Evaluation of brain-body health in individuals with common neuropsychiatric disorders. JAMA Psychiatry 2023, 80, 567–576. [Google Scholar] [CrossRef]
  47. Pizzagalli, D.A.; Roberts, A.C. Prefrontal cortex and depression. Neuropsychopharmacology 2022, 47, 225–246. [Google Scholar] [CrossRef] [PubMed]
  48. Singla, S.; Jhamb, S.; Singh, K.D.; Kumar, A. Depression affects autonomic system of the body? Yes, it does! J. Educ. Health Promot. 2020, 9, 217. [Google Scholar] [PubMed]
  49. Perez, A.J.; Zeadally, S. Recent advances in wearable sensing technologies. Sensors 2021, 21, 6828. [Google Scholar] [CrossRef] [PubMed]
  50. Shen, J.; Zhang, Y.; Liang, H.; Zhao, Z.; Zhu, K.; Qian, K.; Dong, Q.; Zhang, X.; Hu, B. Depression recognition from EEG signals using an adaptive channel fusion method via improved focal loss. IEEE J. Biomed. Health Inform. 2023, 27, 3234–3245. [Google Scholar] [CrossRef]
  51. Li, L.; Wang, P.; Li, S.; Zhao, Q.; Yin, Z.; Guan, W.; Chen, S.; Wang, X.; Liao, J. Construction of a resting EEG-based depression recognition model for college students and possible mechanisms of action of different types of exercise. BMC Psychiatry 2023, 23, 849. [Google Scholar] [CrossRef] [PubMed]
  52. Zitouni, M.S.; Lih Oh, S.; Vicnesh, J.; Khandoker, A.; Acharya, U.R. Automated recognition of major depressive disorder from cardiovascular and respiratory physiological signals. Front. Psychiatry 2022, 13, 970993. [Google Scholar] [CrossRef] [PubMed]
  53. Gohari, M.R.; Doggett, A.; Patte, K.A.; Ferro, M.A.; Dubin, J.A.; Hilario, C.; Leatherdale, S.T. Using random forest to identify correlates of depression symptoms among adolescents. Soc. Psychiatry Psychiatr. Epidemiol. 2024, 59, 2063–2071. [Google Scholar] [CrossRef]
  54. Adeniji, O.D.; Adeyemi, S.O.; Ajagbe, S.A. An improved bagging ensemble in predicting mental disorder using hybridized random forest-artificial neural network model. Informatica 2022, 46, 543–550. [Google Scholar] [CrossRef]
  55. Saha, D.K.; Hossain, T.; Safran, M.; Alfarhood, S.; Mridha, M.; Che, D. Ensemble of hybrid model based technique for early detecting of depression based on SVM and neural networks. Sci. Rep. 2024, 14, 25470. [Google Scholar] [CrossRef] [PubMed]
  56. Hu, Y.; Chen, J.; Chen, J.; Wang, W.; Zhao, S.; Hu, X. An ensemble classification model for depression based on wearable device sleep data. IEEE J. Biomed. Health Inform. 2023, 28, 2602–2612. [Google Scholar] [CrossRef] [PubMed]
  57. Fedor, S.; Lewis, R.; Pedrelli, P.; Mischoulon, D.; Curtiss, J.; Picard, R.W. Wearable technology in clinical practice for depressive disorder. N. Engl. J. Med. 2023, 389, 2457–2466. [Google Scholar] [CrossRef]
  58. Christensen, J.C.; Estepp, J.R.; Wilson, G.F.; Russell, C.A. The effects of day-to-day variability of physiological data on operator functional state classification. NeuroImage 2012, 59, 57–63. [Google Scholar] [CrossRef]
  59. Wu, J.; Luo, X.; Chen, S.; Lyu, Y.; Ji, X.; Meng, Y.; Wang, D. Continuous Authentication via Wrist Photoplethysmogram: An Extensive Study. IEEE Trans. Mob. Comput. 2024, 24, 525–538. [Google Scholar] [CrossRef]
  60. Kim, A.Y.; Jang, E.H.; Choi, K.W.; Jeon, H.J.; Byun, S.; Sim, J.Y.; Choi, J.H.; Yu, H.Y. Skin conductance responses in Major Depressive Disorder (MDD) under mental arithmetic stress. PLoS ONE 2019, 14, e0213140. [Google Scholar] [CrossRef] [PubMed]
  61. Li, X.; Dong, Y.; Yi, Y.; Liang, Z.; Yan, S. Hypergraph Neural Network for Multimodal Depression Recognition. Electronics 2024, 13, 4544. [Google Scholar] [CrossRef]
  62. Zang, X.; Li, B.; Zhao, L.; Yan, D.; Yang, L. End-to-end depression recognition based on a one-dimensional convolution neural network model using two-lead ECG signal. J. Med. Biol. Eng. 2022, 42, 225–233. [Google Scholar] [CrossRef] [PubMed]
  63. Leaning, I.E.; Ikani, N.; Savage, H.S.; Leow, A.; Beckmann, C.; Ruhé, H.G.; Marquand, A.F. From smartphone data to clinically relevant predictions: A systematic review of digital phenotyping methods in depression. Neurosci. Biobehav. Rev. 2024, 158, 105541. [Google Scholar] [CrossRef]
Figure 1. (a) The wristband device; (b) the flowchart of data analysis. The classification models include Random Forest (RF), Support Vector Machine (SVM), K-Nearest Neighbors (KNN), and Linear Discriminant Analysis (LDA).
Figure 1. (a) The wristband device; (b) the flowchart of data analysis. The classification models include Random Forest (RF), Support Vector Machine (SVM), K-Nearest Neighbors (KNN), and Linear Discriminant Analysis (LDA).
Sensors 25 00567 g001
Figure 2. The overall AIC criterion of all participants across AR models with different indexes. Three subplots, respectively, represented the AIC in AR models based on HR, SC, and ACC signals.
Figure 2. The overall AIC criterion of all participants across AR models with different indexes. Three subplots, respectively, represented the AIC in AR models based on HR, SC, and ACC signals.
Sensors 25 00567 g002
Figure 3. The violin plot shows the distribution of classification accuracies based on bootstrapped RF models in the ALL-feature condition. In each bootstrap process, data segments were randomly selected and split (of training and test sets) in the 5 min, 30 min, 2 h, and 6 h conditions. Each dot showed the averaged 5-fold accuracy from one out of 1000 bootstrap steps. The dotted line represented the 95% interval of the Random condition results.
Figure 3. The violin plot shows the distribution of classification accuracies based on bootstrapped RF models in the ALL-feature condition. In each bootstrap process, data segments were randomly selected and split (of training and test sets) in the 5 min, 30 min, 2 h, and 6 h conditions. Each dot showed the averaged 5-fold accuracy from one out of 1000 bootstrap steps. The dotted line represented the 95% interval of the Random condition results.
Sensors 25 00567 g003
Table 1. Independent t-test results between depression patients and healthy controls.
Table 1. Independent t-test results between depression patients and healthy controls.
FeaturesHeart RateSkin ConductanceAcceleration
t(115)pFDRt(115)pFDRt(115)pFDR
StaticMean−0.050.963−0.590.673−3.100.007
SD−1.430.298−1.110.4080.140.932
Skew−0.680.6550.560.6732.750.021
Kurt−0.680.655−3.170.001−1.100.408
DynamicAR1−2.890.022−3.69<0.001−5.820.001
AR22.780.030−1.240.383−9.15<0.001
AR3−2.090.082−0.160.932−5.990.001
The p-value has been corrected by the FDR method.
Table 2. Classification accuracy based on different models across distinct data segments.
Table 2. Classification accuracy based on different models across distinct data segments.
Models5 min30 min2 h6 hRandom
RF76.0 (3.5)80.1 (3.2)84.7 (2.5)90.0 (1.7)49.7 (5.1)
SVM67.4 (4.4)67.0 (4.0)73.2 (3.5)74.1 (2.2)49.2 (4.5)
LDA64.6 (5.1)69.2 (4.7)77.0 (3.4)81.5 (2.3)52.1 (3.7)
KNN60.8 (4.9)64.6 (5.5)71.6 (3.7)76.1 (2.0)53.4 (2.6)
Each column showed the classification accuracy denoted by mean (SD) % of models based on specific data lengths, such as 5 min, 30 min, 2 h, and 6 h physiological data segments. The “Random” column showed a random accuracy level obtained by randomly shuffling participants’ depression labels based on corresponding 6 h models. The classification models include Random Forest (RF), Support Vector Machine (SVM), K-Nearest Neighbors (KNN), and Linear Discriminant Analysis (LDA).
Table 3. Classification accuracies based on Random Forest models across distinct feature sets.
Table 3. Classification accuracies based on Random Forest models across distinct feature sets.
Selected Features5-min30-min2-h6-hRandom
ALL76.0 (3.5)80.1 (3.2)84.7 (2.5)90.0 (1.7)49.7 (5.1)
Non-ACC70.4 (4.8)74.6 (4.4)77.0 (3.8)80.5 (2.4)50.4 (2.0)
Static75.8 (3.2)78.1 (3.1)80.6 (3.3)85.7 (1.5)49.9 (2.4)
Dynamic74.5 (3.7)77.3 (3.5)82.5 (2.9)89.3 (1.2)51.2 (2.3)
Non-ACC Dynamic68.4 (4.6)71.5 (4.3)74.5 (3.8)78.1 (2.3)52.3 (4.0)
Each column showed the classification accuracy denoted by mean (SD) % of models based on specific data lengths, such as 5 min, 30 min, 2 h, and 6 h physiological data segments. “Random” column showed a random level of accuracy by shuffling participants’ depression labels. The selected features from row 1 to row 5 represent (1) all features (ALL); (2) the non-acceleration signal, including HR and SC signals (Non-ACC); (3) static features (Static); (4) dynamic features (Dynamic); (5) dynamic features of non-acceleration signals (Non-ACC Dynamic).
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Shui, X.; Xu, H.; Tan, S.; Zhang, D. Depression Recognition Using Daily Wearable-Derived Physiological Data. Sensors 2025, 25, 567. https://doi.org/10.3390/s25020567

AMA Style

Shui X, Xu H, Tan S, Zhang D. Depression Recognition Using Daily Wearable-Derived Physiological Data. Sensors. 2025; 25(2):567. https://doi.org/10.3390/s25020567

Chicago/Turabian Style

Shui, Xinyu, Hao Xu, Shuping Tan, and Dan Zhang. 2025. "Depression Recognition Using Daily Wearable-Derived Physiological Data" Sensors 25, no. 2: 567. https://doi.org/10.3390/s25020567

APA Style

Shui, X., Xu, H., Tan, S., & Zhang, D. (2025). Depression Recognition Using Daily Wearable-Derived Physiological Data. Sensors, 25(2), 567. https://doi.org/10.3390/s25020567

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop