Next Article in Journal
A Wearable Breath Sensor Based on Fiber-Tip Microcantilever
Next Article in Special Issue
Proof of Concept in Artificial-Intelligence-Based Wearable Gait Monitoring for Parkinson’s Disease Management Optimization
Previous Article in Journal
A Single Wavelength Mid-Infrared Photoacoustic Spectroscopy for Noninvasive Glucose Detection Using Machine Learning
Previous Article in Special Issue
A Study on Dictionary Selection in Compressive Sensing for ECG Signals Compression and Classification
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Explainable Artificial Intelligence and Wearable Sensor-Based Gait Analysis to Identify Patients with Osteopenia and Sarcopenia in Daily Life

1
Department of Computer Software, University of Science and Technology, Daejeon 34113, Korea
2
Intelligent Convergence Research Laboratory, Electronics and Telecommunications Research Institute, Daejeon 34129, Korea
*
Author to whom correspondence should be addressed.
Biosensors 2022, 12(3), 167; https://doi.org/10.3390/bios12030167
Submission received: 28 December 2021 / Revised: 23 February 2022 / Accepted: 28 February 2022 / Published: 7 March 2022
(This article belongs to the Special Issue Intelligent Biosignal Processing in Wearable and Implantable Sensors)

Abstract

:
Osteopenia and sarcopenia can cause various senile diseases and are key factors related to the quality of life in old age. There is need for portable tools and methods that can analyze osteopenia and sarcopenia risks during daily life, rather than requiring a specialized hospital setting. Gait is a suitable indicator of musculoskeletal diseases; therefore, we analyzed the gait signal obtained from an inertial-sensor-based wearable gait device as a tool to manage bone loss and muscle loss in daily life. To analyze the inertial-sensor-based gait, the inertial signal was classified into seven gait phases, and descriptive statistical parameters were obtained for each gait phase. Subsequently, explainable artificial intelligence was utilized to analyze the contribution and importance of descriptive statistical parameters on osteopenia and sarcopenia. It was found that XGBoost yielded a high accuracy of 88.69% for osteopenia, whereas the random forest approach showed a high accuracy of 93.75% for sarcopenia. Transfer learning with a ResNet backbone exhibited appropriate performance but showed lower accuracy than the descriptive statistical parameter-based identification result. The proposed gait analysis method confirmed high classification accuracy and the statistical significance of gait factors that can be used for osteopenia and sarcopenia management.

1. Introduction

Osteopenia and sarcopenia can cause various senile disorders and are key factors related to the quality of life in old age [1,2,3]. Gait is a suitable indicator of musculo-skeletal diseases [4]. With the miniaturization of sensors and the development of intelligent monitoring technology, interest in wearable-sensor-based daily health management solutions is increasing [4,5]. Therefore, portable tools and methods that can analyze osteopenia and sarcopenia risks in our daily lives, rather than requiring a specialized hospital setting, can be considered.
Musculoskeletal disorders are increasingly being recognized as conditions that are associated with significant morbidity, mortality, and healthcare costs [1,2]. Osteopenia is a cause of fracture and increases the risk of complications, in addition to pain caused by fractures. Osteoporotic fractures generate costs that reach USD 25 billion, and sarcopenia generates costs of approximately USD 18 billion [2,6]. Patients with sarcopenia have a slow gait, reduced muscular endurance, face difficulty in daily living, and frequently need help from others. Osteoporosis, falls, and fractures can occur easily, whereas the blood and hormonal buffering action of the muscle are moderated, reducing the basal metabolic rate, making chronic diseases unmanageable, and increasing the likelihood of aggravating diabetes and cardiovascular disease [3].
Osteoporosis is defined by the World Health Organization (WHO) as a medical condition in which the bone mineral density (BMD) is less than −2.5 standard deviation (SD) below the mean level for young adults, and for osteopenia it is between −2.5 and −1.0 [7]. Sarcopenia is defined by the European Working Group on Sarcopenia in Older People (EWGSOP) as the presence of low muscle mass, reduced muscle strength, and physical performance [8]. BMD and muscle mass are diagnosed via dual-energy X-ray absorptiometry (DEXA) [1], although cannot be measured without expert assistance. Therefore, a system that can easily manage musculoskeletal diseases in daily life is required.
Human gait involves interactions between the musculoskeletal system and the nervous system. Thus, gait analysis is effective in identifying neuromusculoskeletal disorders such as Parkinson’s diseases (PD) [9,10], fall risk [11,12], total hip arthroplasty (THA) [5,13], and sarcopenia [4]. Traditionally, cameras and force plates have been used as clinical gait assessment tools; however, these tools are used only in large institutions such as university hospitals and are difficult to apply in daily life or complex environments because of their high cost and large space requirements [14]. Given the recent miniaturization and increased accuracy of sensor technology, inertial measurement units (IMU) are increasingly being used for gait analysis [4].
Gait analysis methods include statistical comparisons of gait parameters obtained from control and target groups and a method of analysis of the classification results of the groups using machine learning. In the analysis of gait for osteopenia, osteoporosis, sarcopenia, and osteosarcopenia conducted by Intriago [2], the slowest walking speed was observed in osteosarcopenia: 0.9 m/s in osteopenia–osteoporosis, 0.893 in sarcopenia, and 0.7 in osteosarcopenia. Choi [15] investigated the correlation between kinetic gait parameters and femoral BMD of the femoral neck, trochanter, shaft, and total proximal femur. The highest correlation (r = 0.153, p = 0.014) was observed between the walking speed and femoral neck BMD among the older female participants. ElDeeb [16] aimed to investigate the gait characteristics of postmenopausal women with low BMD (n = 17) and to determine the predictive parameters of BMD. When the normal BMD group and women with low BMD were compared, the ankle joint showed less push-off (p = 0.000), which seemed to be used to obtain gait stability. Sung [17] divided 77 older participants (n = 48 female + 29 male) into normal BMD and low BMD groups using DEXA. The spatial–temporal gait parameters (speed, stride length, and support times) of both groups were subsequently investigated. The support times included those of the initial double support, single support, and terminal double support in the stance phase. The support time was confirmed to have a high ratio of the main foot (the foot mainly used), the stride length was found to be longer on the main foot side than on the other side, and the stride length was positively associated with the single support time on the dominant limb.
Although there are many studies on gait speed for osteopenia and sarcopenia, only a few studies have analyzed gait parameters such as PD, fall risk, and THA. Recently, explainable artificial intelligence (XAI) has received considerable attention as a method to analyze the importance and contribution of parameters. XAI presents predictive results for machine learning in a human-understandable form [18]. It is primarily used to enhance the reliability of machine learning results. Low machine learning accuracy results in the misinterpretation of XAI. The XAI technique detects feature importance and explains the influence of features on model decisions [19]. Therefore, for the management of osteopenia and sarcopenia in daily life, this study proposes an algorithm for detecting gait parameters and identifying patients based on inertia signals and interpreting the results using XAI.

2. Related Studies

This section describes research related to gait parameter detection techniques, wearable-sensor-based patient identification, and XAI. This information should help with the understanding of the proposed wearable-based gait analysis method for the management of osteopenia and sarcopenia in daily life. This study proposes an algorithm for detecting gait parameters based on inertial signals, identifying patients, and interpreting the results using XAI for gait analysis.

2.1. Gait Parameter

IMU-based gait analysis is used to identify PD, fall risk, THA, and sarcopenia. Gait parameters for analysis based on IMU include spatial–temporal parameters (e.g., step length, stance phase, swing phase, single support, double support, step time, cadence, and speed), kinematic parameters (the rotational angles of the sagittal, coronal, and transverse plane of the pelvis, hip, knee, and ankle), and descriptive statistical parameters (such as the maximum, mean, and standard deviation) of inertial signals for each gait phase. The results of gait analysis using spatial–temporal parameters can be compared with the results of other gait analysis tools such as cameras and force plates. However, the disadvantage is that the motion information acquired by the inertial sensor is reduced, resulting in a low classification result [4].
Gait events and phases are detected to extract the gait parameters. Taborri [20] classified gait into two to eight phases. Whittle [21] classified gait into seven phases, and this is the most widely used classification method. One stride is from a heel strike to the next heel strike. Gait is broadly classified into a stance phase from heel strike (HS) to toe off (TO) and a swing phase from TO to the next HS. The stance phase is classified into the loading response, mid stance, terminal stance, and pre swing phases, and the swing phase is classified into the initial swing, mid swing, and terminal swing phases. The spatial–temporal parameters can be obtained by extracting the HS, TO, opposite HS, opposite TO, and walking distance. HS and TO are detected by the time and frequency signal processing of the inertial signal. Kim [22] obtained HS and TO with high accuracy within 0.03 s through time–frequency analysis.
The inertial-sensor-based distance measurement algorithm is widely used as the basis for the distance measurement algorithm in indoor navigation research, and it is difficult to accurately measure the distance using only the inertial sensor [23,24]. Kinematic parameters are obtained by attaching inertial sensors to locations such as the pelvis, hip, knee, and ankle, but they are difficult to use in daily life due to the number of sensors required. Descriptive statistics and frequency analysis have been employed to analyze the signal obtained from the inertial sensor.

2.2. Identifying Patients Based on Inertial Signals

IMU-based gait analysis is used to identify PD, fall risk, THA, and sarcopenia. With gait-parameter-based disease identification, Caramia [9] classified PD using linear discriminant analysis (LDA), naïve Bayes (NB), k-nearest neighbor (k-NN), support vector machine (SVM), SVM radial basis function (RBF), decision tree (DT), and the majority of votes. The performance of the machine learning technique—SVM with a nonlinear kernel—was the best. Eskofier [10] analyzed the gait of PD using descriptive statistical parameters such as the energy maximum, minimum, mean, variance, skewness, and kurtosis of the signal measured by the inertial sensor and the fast Fourier transform, a frequency analysis method. Howcroft [11] predicted the risk of falls using accelerometer data and used temporal (cadence and stride time) and descriptive statistics (maximum, mean, and SD of acceleration). NB, SVM, and neural network (NN) were used as classification methods, and the best single-sensor model was NN. An advantage of deep learning is that it can detect features within the algorithm from a raw signal, although Tunca [12] achieved higher accuracy in long short-term memory (LSTM) when certain parameters (e.g., speed, stride length, cycle time, stance time, swing time, clearance, stance ratio, and cadence) were used as the inputs, compared with raw signals.
Teufl [5] classified THA patients using stride length, stride time, cadence, speed, hip, and pelvis range of motion (ROM) as features of the SVM, and obtained an accuracy of 97%. Dindorf [13] used local interpretable model-agnostic explanations (LIME) to understand the features for identifying THA, and found that the sagittal movement of the hip, knee, and pelvis, as well as transversal movement of the ankle, were particularly important for this specific classification task. Kim [4] obtained feature importance using the Shapley additive explanations (SHAP) approach for spatial–temporal parameters and descriptive statistical parameters detected from signals measured by inertial sensors on both feet of ten sarcopenia and control participants. Twenty descriptive statistical parameters of high importance were used as inputs to classification models such as SVM, RF, and multi –layer perceptron (MLP); the highest accuracy (95%) was achieved using the SVM model, as shown in Table 1.

2.3. Explainable Artificial Intelligence

XAI is a method that allows humans to understand the basis of decisions made by artificial intelligence models [25]. It is primarily used to enhance the reliability of machine learning results. Low machine learning accuracy results in a misinterpretation of XAI. The XAI technique detects feature importance and explains the influence of features on model decisions [19]. LIME and SHAP are often used to explain existing handcraft feature-based classification algorithms, and layer-wise relevance propagation (LRP) and class activation mapping (CAM) are used as algorithms to interpret deep learning.
LIME is effective for tabular data, text, and images. However, it is difficult to set the kernel width in tabular data, and different results are obtained during repeated execution because the sampling process is performed randomly. SHAP was proposed to consider the dependency between features. When the dependence between features is high, the SHAP feature importance is judged to be better than the permutation importance [19]. SHAP is based on Shaply values from game theory. The main advantages of the SHAP method are local explanations and consistency in the global model structure. SHAP is used in many machine learning models as a model-agnostic method.
LRP outputs a heatmap to the input image by tracing back the results of the deep learning model. Unlike LIME and SHAP, which interpret the model using the sensitivity analysis technique, LRP [25] is a mixture of relevance propagation and decomposition. Relevance propagation is a method for calculating the relevance of the contribution of the hidden layer to the output after the decomposition process. CAM visualizes model decisions by computing a weighted linear summation of the last convolutional feature map. CAM is limited to model architectures where the model must consist of one fully connected (FC) layer with global average pooling (GAP) [26]. Grad-CAM describes models without constraints on the model architecture. Gradient-based CAM methods share the problem of shattered gradients, causing noise saliency maps in the intermediate layer. An LRP-based Relevance-CAM has been proposed to solve the gradient problem [27].

3. Methods

To analyze the gait of the osteopenia and sarcopenia groups, the patients were identified using machine learning, and the machine learning model was interpreted using XAI. The inertial sensor signals and spatial–temporal and descriptive statistical parameters detected in the proposed algorithm were used as machine learning inputs. By analyzing the model that obtained high-accuracy identification results, the inertial signal and gait parameters of the osteopenia and sarcopenia groups were analyzed. The flowchart of patient identification and gait analysis for osteopenia and sarcopenia is shown in Figure 1.

3.1. Patient Data Collection

Gait signals of 42 women over 65 years of age were obtained to analyze the gait characteristics for osteopenia and sarcopenia. Among the 42 subjects, there were 21 patients with osteopenia and 21 patients without osteopenia. The BMD obtained by measuring DEXA was compared with that of a healthy young person: when the T-score was −1 SD or higher, the data were assigned to the control group; when it was lower than −1, the data were assigned to the osteopenia group. Additionally, 10 sarcopenia and 10 non-sarcopenia patients were selected among the 42 subjects. Sarcopenia was diagnosed using the skeletal muscle mass index (SMI, appendicular skeletal muscle mass in kg/height in m2) that was less than 5.4 kg/m2 (as obtained through DEXA), whereas the grasp strength was less than 18 kg. The group without sarcopenia included participants with SMI of 5.5 or more and a grasp strength of 19 kg or more. Relevant statistics, including age, height, weight, foot size, Mini-Mental State Examination (MMSE) [28], the Mores Fall Scale (MFS) [29], SARC-F questionnaire [30], Berg Balance Scale (BBS) [31] and Timed Up and Go (TUG) scores [32], grasp power, T-score for DEXA, and SMI, are shown in Table 2.
The limitations of this study were that the physiological and psychological variables of the participants could not be controlled, the age range of the participants could not be expanded, the study was conducted on women only, and the treadmill gait experiment with fall risk factors for older adults was excluded, and only the preferred speed through walking on flat ground was measured.
All participants wore the same sneaker model and walked the 27 m corridor four times in a straight line. The gait data were acquired from the right and left insoles using IMU, as shown in Figure 2. The IMU settings included an acceleration sensitivity of 8G, a gyro sensitivity of 1000°/s, and a sampling frequency of 100 Hz [4].
Additionally, 20 participants measured 9 m gait simultaneously with the clinical standard system and the proposed inertial system to verify the proposed device and algorithm. The clinical system consisted of ten cameras (Vicon, Oxford Metrics, Oxford, UK) and four force plates (Advanced Mechanical Technology, MA, USA). Data analysis was performed using the Vicon Polygon 3.5.2. Ethics approval was obtained from the Chungnam National University Hospital Institutional Review Board before conducting this study (File No: CNUH 2019-06-042).

3.2. Gait Signals and Parameters

Gait is a motion in which both feet alternately repeat the stance and swing phases, and the event points of gait that separate the stance and swing phases are called HS and TO. HS is at the start of the stance phase, and TO is at the start of the swing phase. The gait data obtained from the IMU sensor were 6-axis signals that included the xyz-axis acceleration and angular velocity signal. When the measured sensor data were separated based on HS and normalized to 100 samples, they exhibited periodic characteristics, as shown in Figure 3. The characteristics of the gait signal differ from person to person, and for IMU gait analysis, the spatial–temporal parameter was detected from the gait signal, and the gait signals were expressed as descriptive statistical parameters and analyzed. Additionally, patients were classified using raw data as inputs for deep learning without detecting the parameters; then, the gait signals were analyzed by interpreting the deep learning results.
The spatial–temporal parameters were extracted from the inertial signals using the proposed algorithm [4]. Twenty-four spatial–temporal parameters were detected: stance phase time right, stance phase time left, swing phase time right, swing phase time left, stance phase percent right, stance phase percent left, double support first phase time right, double support first phase time left, double support second phase time right, double support second phase time left, single support phase time right, single support phase time left, double support first phase percent right, double support first phase percent left, double support second phase percent right, double support second phase percent left, single support phase percent right, single support phase percent left, stride length right, stride length left, stance phase time SI, swing phase time SI, stance phase percent SI, and cadence. The definitions are summarized in Table 3.
After detecting HS and TO, the opposite HS, opposite TO, cadence, stance phase (time), swing phase (time), single support phase (time), and double support phase (time) could be obtained by arithmetic calculations. Secondary parameters, such as balance of difference between the right and left foot, were also collected through comparative analysis of both feet. Stride was detected through a distance estimation algorithm based on zero-velocity detection (zero-velocity update) using an extended Kalman filter [23,24].
To obtain descriptive statistical parameters, the six-axis gait signal was classified into seven phases, as proposed by Whittle. The detection of HS, TO, heel rise (HR), feet adjacent (FA), and tibia vertical (TV) is required to classify seven phases; it was detected using the method proposed in a previous study [4]. Ten descriptive statistical parameters were obtained from signals classified into seven phases, and the descriptive statistical parameters were max., min., SD, AbSum, root-mean-square (RMS), kurtosis, skewness, MMgr, DMM, and Mdif. A total of 840 descriptive statistical parameters (both feet (2) × sensor signal (6) × gait phase (7) × (10 parameters)) were detected.

3.3. Patient Identification

To identify patients using the inertial gait signal and the proposed gait parameters, osteopenia and sarcopenia groups were classified through various models such as RF, XGBoost (Extreme Gradient Boosting), SVM, and deep learning models.
RF is a decision tree ensemble classifier that combines multiple single classifiers to obtain the result of each classification model either through majority vote or weighted average [33]. RF lowers the risk of overfitting by using some data and features from the training data. XGBoost is a decision tree ensemble model and improves the performance of the gradient boosting machine in terms of speed. Boosting models increase accuracy by iteratively updating the parameters of the previous classifier to reduce the slope of the loss function, thereby generating a robust classifier [33]. SVM is a binary classifier that aims to determine the optimal separation hyperplane that maximizes the margin between two classes. Kernel functions are used to map data to a higher-dimensional space; thus, an SVM can compute nonlinear decision boundaries [4].
The representative deep-learning-based models were convolutional neural network (CNN) and LSTM. A CNN is composed of one or more convolutional, pooling, FC, and dense layers. CNNs exhibit high performance in detecting and classifying features in images. Unlike LSTM, which only has forward hidden layers, BiLSTM has both forward and backward hidden layers. Therefore, it learns both before and after information and demonstrates high performance in time-series data. As a CNN backbone, ResNet exhibits excellent classification accuracy [34]. ResNet uses skip connections (or short connections) to pass the input from the previous layer to the next layer. This skip connection solves the gradient loss/burst problem, enabling deep neural networks. ResNet uses 18, 34, 50, 101, and 152 layers depending on the depth, and there are structural differences in approximately 50 layers. In particular, ResNet is a popular architecture despite the existence of other models that have improved performance in various fields. Moreover, it is a representative CNN architecture for which many supporting materials are available [34,35].
Transfer learning is applied as a solution to address the difficulty of training a model based on small datasets. In transfer learning, data similar to the target data are learned in advance and a specific layer is frozen, such that only the layer which is not frozen when learning the target data is learned [36]. Specific data characteristics can be overfitted, because patient identification is a binary classification. Therefore, person identification is pre-trained because high-resolution features can be detected by comparing and analyzing the gaits of multiple people.

3.4. Gait Analysis

The gait signals and parameters were analyzed using statistical methods and XAI techniques that interpret machine learning results. The independent t-test was used as a statistical method to compare the spatial–temporal parameters and descriptive statistical parameters. To improve the reliability of the machine-learning-based analysis method, a higher accuracy should first be obtained. Therefore, the osteopenia and sarcopenia groups were classified through various models, such as RF, XGBoost, SVM, and deep learning models. The CNN and LSTM models were used as the deep learning models.
Spatial–temporal parameters and 100 descriptive statistical parameters with low p-values of the t-test were used as inputs for RF, XGBoost, and SVM. The following RF parameters were used: number of trees = 50, max_depth = 30, and number of features = square root of the gait parameters. The XGBoost parameters were booster = gbtree, objective = binary:logistic, eta = 0.018, max_depth = 15, gamma = 0.009, subsample = 0.98, and colsample_bytree = 0.86. SVM explored the linear and RBF kernels, and the parameters were gamma = 1.0 and C = 5.0.
The 12 axes of acceleration and angular velocity signals obtained from both feet were applied to the deep learning models. We proposed a low-layer-based CNN and BiLSTM model and applied ResNet50. As the input of the deep learning model, a stride based on HS was detected and normalized to 100 samples using spline interpolation because the signal was collected at 100 Hz [4]. ResNet50 is reduced in size by pooling as the layers progress. The ResNet50 backbone cannot be used with an input of shape 12, and removing pooling lowers the accuracy. Therefore, the input shape (36,100) was generated by amplifying the signal of 12 axes threefold because the input size of ResNet50 must be 12 or more, and the kernel size was 3. The layers of each model are shown in Table 4. The parameters of the deep learning model were as follows: learning rate = 0.0005, training epoch = 100, batch size = 16, loss = CrossEntropyLoss, optimizer = Adam, and activation function = Rectified Linear Unit.
XGBoost can calculate the built-in importance (Gini importance) and permutation importance using the learned model. Permutation importance measures the increase or decrease in prediction error compared with the original data when the feature data are transformed [19]. Permutation importance does not consider the correlation between features; therefore, SHAP was proposed as a method to consider the dependency between features. In particular, the SHAP feature importance is considered to be better than the permutation importance because gait parameters have a high dependence on the features. The Gini, permutation, and SHAP importance of the spatial–temporal and descriptive statistical parameters were calculated to obtain important parameters of osteopenia and sarcopenia, and the results of deep learning were analyzed using LRP, Grad-CAM, and Relevance-CAM.

4. Results

4.1. Patient Identification

The identification results of 21 osteopenia and 21 non-osteopenia subjects showed the highest accuracy in SVM when 24 spatial–temporal parameters were used as inputs, but showed an accuracy of less than 65%. The descriptive statistics parameter obtained the highest accuracy of 68.45% in XGBoost by using 100 parameters with a low p-value as an input, as a result of an independent t-test. For training and testing, 21 cross-validations were performed on 21 subjects, and the average was obtained. Using an inertial sensor as an input for deep learning, ResNet showed the highest accuracy among CNN, BiLSTM, and ResNet. The results of applying transfer learning to the ResNet model showed lower accuracy than when no transfer learning was applied. However, when performing transfer learning, it was shown that the accuracy increased when features were extracted, including the test subject. This implies that ResNet was pre-trained for human identification using the data of 42 patients, and the patient identification was cross-validated for 21 patients. The osteopenia group obtained the highest recognition result in the transfer learning ResNet.
The identification results of 10 sarcopenia and 10 non-sarcopenia cases were over 70% accurate in terms of the spatial–temporal parameters, and the accuracy in case of sarcopenia was better than that in osteopenia. When the descriptive statistics parameter was used as the RF input, the highest accuracy was obtained, and the deep learning method of the inertial sensor input did not yield satisfactory identification results. Therefore, analysis of the results of XAI based on parameters is more reliable than the analysis of results based on deep learning. The patient identification results of machine learning are presented in Table 5 and Table 6.

4.2. Importance of Descriptive Statistical Parameter

The order of Gini, permutation, and SHAP importance was obtained for the descriptive statistical parameters of osteopenia and sarcopenia. When using highly important parameters such as RF, XGBoost, and SVM inputs, SHAP obtained the highest identification rate; however, when using the inner 20 important parameters as inputs, more identification results than the 100 descriptive statistical parameters were obtained. Table 7 and Table 8 showed the classification results as the number of parameters increased, and the average accuracy was obtained by performing 21 cross-validations for osteopenia and 10 cross-validations for sarcopenia. According to the result of each cross-validation, SHAP-based feature importance has different values. For example, osteopenia was trained with 40 datasets (20 osteopenia datasets and 20 non-osteopenia datasets) and tested with two datasets (1 osteopenia dataset and 1 non-osteopenia dataset) during 21 cross-validations. As a result of the training, the Shapley values were obtained based on the training data of 40 people, and the Shapley values were obtained in different orders. Table 9 shows the average results for 20 high-order Shapley values generated during 21 cross-validations. In osteopenia, the Shapley value is relatively high in the upper parameter and less than 0.1 from the 10th parameter. In sarcopenia, the difference in the Shapley value between the parameters is small. The parameter numbers of descriptive statistical parameters are shown in Table 10. The results of learning RF, XGBoost, and SVM with 20 parameters with high importance in Table 9 are shown in Table 11. Osteopenia obtained an accuracy of 88.69% in XGBoost using the top 4 parameters as inputs, and sarcopenia obtained an accuracy of 93.75% in RF using the top 18 parameters as inputs.

4.3. Gait Analysis

In the spatial–temporal parameters of the osteopenia group, the stance phase percentage decreased, double support percentage (time) decreased, and single support percentage increased. The sarcopenia group showed an increase in the value of the SI parameter compared with the non-sarcopenia group, implying that the difference between both feet was large. Except for the SI parameter, the p-value did not have a statistical significance of less than 0.001. Table 12 shows the mean and Shapley values of the spatial–temporal parameters; * indicates that the p-value is less than 0.025, and ** indicates that the p-value is less than 0.001.
From the result of the SHAP plot of osteopenia, as the value of the single support phase percent left (parameter 18) increased, the risk of osteopenia increased, as indicated by the positive SHAP value. As the value decreased, the risk also decreased, with the SHAP value being negative. When the single support phase percent left value increased, the risk increased linearly, and the osteopenia risk was low, at 39 or lower, and the risk increased at 42 or higher. A low double support first phase percent (parameter 13 and 14) increased the risk of osteopenia, with a decreased risk above 9 and an increased risk below 9. A low value of double support first phase time left (parameter 8) increased the risk, a high value decreased the risk of osteopenia, and a double support first phase time left lower than 0.075 led to an increase in risk. A low value of the stance phase percent right (parameter 5) increased the risk.
As a result of the SHAP plot of sarcopenia, the risk of sarcopenia increased when the double support first phase time left (parameter 8) had a very low value (less than 0.07). Stance phase percent left (parameter 6) increased the risk above 60 and decreased below 60, but did not show linearity. The risk increased when the value of the stance phase time SI (parameter 21) increased, and the risk was high at 0.35 or higher, although it was low at less than 0.35. SHAP plots of the spatial–temporal parameters of osteopenia and sarcopenia are shown in Figure 4.
The parameter with the highest SHAP value within the descriptive statistical parameters of osteopenia is the skewness of the x-axis of the accelerometer in the initial swing phase (parameter number 247). Initial swing refers to the FA after TO. When the skewness is negative, the probability density function has a long tail on the left side, and the data, including the median, are more distributed on the right side. When the skewness is positive, there is a long tail on the right side of the probability density function, indicating that the data are more distributed on the left side. Skewness has a positive value when the mean is smaller than the median, negative when the mean is larger, and has a larger value as the difference between the median and the mean becomes larger. When there is a negative value, the right part of Figure 5a has a long tail, but the average decreases and the skewness value becomes small or negative.
The inertial signals and SHAP dependence plots of the descriptive statistical parameters 247 and 114 of osteopenia cases are shown in Figure 5. The blue signal represents osteopenia, and the red signal represents non-osteopenia. A skewness of 0.5 or higher shows a low risk of osteopenia, whereas a negative value shows an increased risk. The absolute sum of gyro z values in the mid stance (parameter number 114) decreased in osteopenia. This implies that there is no rotation of the z-axis in the mid stance. When the absolute sum of values was 2.6 or higher, the risk decreased, and when the absolute sum of values was 1.89 or lower, the risk increased, as shown in Table 13.
The inertial signals and SHAP dependence plots of the descriptive statistical parameters 430 and 524 of sarcopenia are shown in Figure 6. Here, the blue signal represents sarcopenia, whereas the red signal represents non-sarcopenia. The maximum difference between two successive values of accelerometer x in the loading response (parameter number 430) was lower in the sarcopenia group than in the non-sarcopenia group. When the maximum value was less than 2.74, the risk of sarcopenia increased; however, when the maximum value was 3.79 or more, the risk of sarcopenia decreased. In the sarcopenia group, the change in the acceleration was smooth. The absolute sum of gyro y values in the mid stance (parameter number 524) increased in the sarcopenia group. As the absolute sum value increased, the risk of sarcopenia increased.
The output for layer2, when the deep-learning-based XAI technique, LRP, Grad-CAM, and Relevance-CAM were applied to ResNet50, is shown in Figure 7. In ResNet50, the CAM technique shows low resolution in small-sized images because the feature map is reduced in layer2. It is difficult to interpret the CAM results for ResNet50 with input sizes of 100 horizontal and 36 vertical. Therefore, it is desirable to interpret ResNet results as LRP. Figure 8 shows the analysis results of LRP for the ResNet of osteopenia and sarcopenia. The LRP attention map of the osteopenia group had high values in 64~67 samples of the acceleration x-axis of the right foot. Its position is the section where the acceleration value rises after TO, and it is the same as the position in Figure 5a, the parameter-based SHAP result. Osteopenia pays attention to changes in acceleration after TO in SHAP and LRP. The LRP attention map of sarcopenia group has a high value at 99~100 positions of the right acceleration x. The position is the section where HS occurs, and it is the same section as Figure 6a. The result of paying attention to the various sections of the acceleration left is similar to having a high identification result when used as various parameter inputs in SHAP. The sarcopenia group pays attention to the HS section and the various sections of the signal.

5. Discussion

The objective of this study was to propose and evaluate a method that can utilize the gait parameters obtained from a wearable device with an inertial sensor in the health management of patients suffering from sarcopenia and osteopenia in daily life.
In the proposed method, the patient was identified using gait phase description-based descriptive statistical parameters as the handcrafted feature-based machine learning input and the original signal of the inertial sensor as the input for the deep learning algorithm. For gait analysis, the identification results were analyzed using XAI tools, such as SHAP and LRP. To verify the proposed gait analysis method, the results of functional tests and questionnaires obtained at the hospital for participants, the results using the existing gait parameters, and the results of the proposed method are discussed.
To identify osteopenia and sarcopenia, a decrease in walking speed and poor body balance has been reported in previous studies. It has been reported that patients with sarcopenia have a slower walking speed than those with osteopenia. The result of the 3 m TUG was 11.71 s in the sarcopenia group and 10.96 s in the osteopenia group, indicating that the walking speed was slower in the patients with sarcopenia. Except for TUG, statistical significance was not obtained for the MMSE, MFS, SARC-F questionnaire, or BBS.
In gait analysis using inertial sensors, spatial–temporal parameters have traditionally been used as tools to conveniently identify diseases such as Faller, PD, and THA in everyday life. In this study, to identify patients with osteopenia and sarcopenia, 24 spatial–temporal parameters used for conventional disease identification were detected, and descriptive statistical parameters were detected to analyze the inertial sensor signals according to the gait phase. Statistical significance was obtained for the stance phase, double support phase, and single support phase percent in osteopenia, and SI in the stance and swing phase in sarcopenia.
With gait analysis using XAI, SHAP demonstrates the importance of parameters and the positive/negative contribution of parameters to the classification results of machine learning. To apply SHAP to machine learning classifiers, it is necessary to obtain a high machine learning accuracy. Good classification results were obtained for osteopenia in XGBoost and sarcopenia in RF. It has been reported that XGBoost has the advantage of being the most accurate among tree-based classifiers, and RF has a strong advantage in terms of overfitting. Comparing various machine learning results, osteopenia showed an accuracy of lower than 70% and sarcopenia showed overfitting. Overfitting in sarcopenia was inferred from the results of the deep learning model.
From the SHAP results of spatial–temporal parameters, single support phase percent left, double support first phase percent left, double support first phase percent right, double support first phase time left, and stance phase percent right were highly important for the osteopenia group. In the sarcopenia group, double support first phase time left, stance phase percent left, stance phase time SI, double support first phase percent left, and stance phase percent right were found to be of high importance. The important parameters obtained similar results to the statistical analysis; in osteopenia, the phase had a high contribution, whereas in sarcopenia, SI had a high contribution. Double support first phase time left and stance phase percent right showed a high contribution in both groups. The double support first phase time decreased in the osteopenia group compared with that in the sarcopenia group, and the stance phase percentage decreased. An increase in the double support first phase time and an increase in the stance phase percentage indicate a decrease in the walking speed. The time-related parameters were lower in the osteopenia group than in the non-osteopenia group; therefore, it is difficult to identify the osteopenia group based on walking speed. Sarcopenia significantly contributed to the reductions in walking speed and balance parameters, as in the previous study results [1,2,3].
The accuracy of identification of osteopenia patients with spatial–temporal parameters, which is the existing gait analysis parameter, was lower than 70%; thus, it was difficult to analyze the results in SHAP. The inertial sensor had a high temporal resolution; therefore, it was possible to obtain differences between groups by segmenting and analyzing the gait. As a result of SHAP for 840 descriptive statistical parameters, a high contribution from the skewness of the x-axis of the accelerometer in the initial swing phase (247) and the absolute sum of values of gyro z in the mid stance (114) were observed for osteopenia. For sarcopenia, the maximum difference between two successive values of accelerometer x in loading response (430) and the absolute sum of values of gyro y in the mid stance (524) showed a high contribution.
The descriptive statistical parameter 247 for osteopenia was smaller than that for sarcopenia, and the low skewness of osteopenia is due to the rapid occurrence of the maximum value of the acceleration x-axis after TO and a large negative value. This result is related to the increase in the swing phase time. The descriptive statistical parameter 430 represents the change in acceleration, and its value of osteopenia is larger than that of the sarcopenia group, implying that the gait speed of the sarcopenia group is slow because the acceleration x is the walking direction. Parameters 114 and 524 are the absolute sum of the gyro values in the mid stance. The absolute sum decreases in osteopenia indicating less foot movement in the mid stance, as shown in Table 14.
The interpretation of results for LRP-based deep learning was similar to the results of descriptive statistical parameter analysis based on SHAP. However, the reliability was low due to the change in the attention map of the inertial sensor according to the learning results of deep learning as a result of repeated experiments and low identification accuracy. If high-accuracy identification results are obtained, it is expected that the inertial signal characteristics of osteopenia and sarcopenia can be obtained using deep learning.
Functional tests and questionnaires conducted in the hospital were not statistically significant, except for TUG in the sarcopenia group. Spatial–temporal parameters, which have previously been used as gait parameters, showed statistical significance in the sarcopenia group and the osteopenia group, but showed a low identification accuracy of 63% in the osteopenia group. The proposed descriptive statistical parameters obtained an accuracy of 76% or more, and the descriptive statistical parameters attributed similar meanings to the results of the spatial–temporal parameters, had high statistical significance, and can be used as a new clinical tool because the difference in parameter values between the osteopenia and sarcopenia groups is remarkable. Descriptive statistical parameters can be used as useful tools for patient identification and risk detection.

6. Conclusions

The inertial-sensor-based gait signal was acquired and analyzed for patients with osteopenia and sarcopenia. Spatial–temporal parameters used in conventional clinical evaluation and diagnosis are effective tools for understanding gait. However, they have poor temporal resolution and do not include the function of kinematic signals during the gait cycle. Therefore, the inertial sensor data can obtain descriptive statistical parameters for each gait phase.
For analyzing the patients and control groups, parameters can be statistically analyzed or analyzed through machine-learning-based XAI. To apply XAI, high-accuracy machine learning is required; thus, useful parameters obtained from parameter analysis are used to increase the accuracy of machine learning. Therefore, parameter interpretation is important for patient identification and risk estimation. As a machine learning algorithm, XGBoost for osteopenia and RF for sarcopenia showed high performance, whereas for deep learning, ResNet50, which transfer-learned a human identification model, achieved high accuracy. For the analysis of gait parameters, SHAP was applied to the machine learning model to detect the importance and contribution of the parameters. Unlike Gini and permutation importance, SHAP has advantages of lowering the importance of a parameter when there are similar characteristics between the high-importance parameters. When deep learning identifies patient, the attention map of the inertial sensor signal was analyzed using LRP.
Analyzing the signal of the inertial sensor through XAI, we can diagnose and manage osteopenia and sarcopenia in daily life using a smart insole rather than an expensive clinical tool because the inertial sensor signal contains abundant information on gait. Although the number of participants in this study was extremely small to enable fully understanding the gait characteristics of osteopenia and sarcopenia, the proposed method is effective in analyzing osteopenia and sarcopenia. Therefore, in future studies, additional clinical evaluations will be performed to obtain and analyze many patients and segment data according to sex, age, and dominant leg.

Author Contributions

Conceptualization, J.-K.K., M.-N.B., K.L., J.-C.K. and S.G.H.; Data curation, J.-K.K. and M.-N.B.; Formal analysis, J.-K.K.; Funding acquisition, K.L. and J.-C.K.; Investigation, J.-K.K.; Methodology, J.-K.K. and S.G.H.; Project administration, M.-N.B., K.L. and J.-C.K.; Software, J.-K.K.; Supervision, K.L. and J.-C.K.; Validation, J.-K.K. and S.G.H.; Visualization, J.-K.K.; Writing—original draft, J.-K.K.; Writing—review and editing, J.-K.K., M.-N.B. and S.G.H. All authors have read and agreed to the published version of the manuscript.

Funding

This study was supported by a National Research Foundation of Korea (NRF) grant funded by the Korea government (MSIT) (No. 2021M3I2A1077405) and by a grant (22DRMS-B146826-05) from the Development of Customized Contents Provision Technology for Realistic Disaster Management Based on Spatial Information Program funded by the Ministry of the Interior and Safety of the Korean government.

Institutional Review Board Statement

The study was conducted according to the guidelines of the Declaration of Helsinki, and approved by the Institutional Review Board of Chungnam National University Hospital Institutional (File No: CNUH 2019-06-042).

Informed Consent Statement

Informed consent was obtained from all subjects involved in the study.

Data Availability Statement

The data are not publicly available due to company security policy and personal protection of subjects. Data are available from the authors upon reasonable request.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A

Table A1. Abbreviations.
Table A1. Abbreviations.
AbbreviationsRawAbbreviationsRaw
XAIeXplainable Artificial IntelligenceLDALinear Discriminant Analysis
BMDBone Mineral DensityNBNaïve Bayes
SDStandard Deviationk-NNk-Nearest Neighbor
DEXADual-Energy X-ray AbsorptiometrySVMSupport Vector Machines
PDParkinson’s DiseasesRBFRadial Basis Function
THATotal Hip ArthroplastyDTDecision Tree
IMUInertial Measurement UnitXGBoostExtreme Gradient Boosting
HSHeel StrikeHMMHidden Markov Model
TOToe OffRFRandom Forest
LIMELocal Interpretable Model-agnostic ExplanationsANNArtificial Neural Network
SHAPSHapley Additive exPlanationsCNNConvolutional Neural Network
SMISkeletal Muscle mass IndexLSTMLong Short-Term Memory
MMSEMini-Mental State ExaminationResNetResidual neural Network
MFSMores Fall ScaleGAPGlobal Average Pooling
TUGTimed Up and GoFCFully Connected
BBSBerg Balance ScaleLRPLayer-wise Relevance Propagation
ROMRange of MotionCAMClass Activation Mapping

References

  1. Coll, P.P.; Phu, S.; Hajjar, S.H.; Kirk, B.; Duque, G.; Taxel, P. The prevention of osteoporosis and sarcopenia in older adults. J. Am. Geriatr. Soc. 2021, 69, 1388–1398. [Google Scholar] [CrossRef] [PubMed]
  2. Intriago, M.; Maldonado, G.; Guerrero, R.; Messina, O.D.; Rios, C. Bone Mass Loss and Sarcopenia in Ecuadorian Patients. J. Aging Res. 2020, 2020, 1072675. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  3. Stephen, W.C.; Janssen, I. Sarcopenic-obesity and cardiovascular disease risk in the elderly. J. Nutr. Health Aging 2009, 13, 460–466. [Google Scholar] [CrossRef] [PubMed]
  4. Kim, J.-K.; Bae, M.-N.; Lee, K.; Hong, S. Identification of Patients with Sarcopenia Using Gait Parameters Based on Inertial Sensors. Sensors 2021, 21, 1786. [Google Scholar] [CrossRef]
  5. Teufl, W.; Taetz, B.; Miezal, M.; Lorenz, M.; Pietschmann, J.; Jöllenbeck, T.; Fröhlich, M.; Bleser, G. Towards an Inertial Sensor-Based Wearable Feedback System for Patients after Total Hip Arthroplasty: Validity and Applicability for Gait Classification with Gait Kinematics-Based Features. Sensors 2019, 19, 5006. [Google Scholar] [CrossRef] [Green Version]
  6. Tatangelo, G.; Watts, J.; Lim, K.; Connaughton, C.; Abimanyi-Ochom, J.; Borgström, F.; Nicholson, G.C.; Shore-Lorenti, C.; Stuart, A.L.; Iuliano-Burns, S.; et al. The cost of osteoporosis, osteopenia, and associated fractures in Australia in 2017. J. Bone Miner. Res. 2019, 34, 616–625. [Google Scholar] [CrossRef]
  7. Kanis, J.A. Assessment of fracture risk and its application to screening for postmenopausal osteoporosis: Synopsis of a WHO report. Osteoporos. Int. 1994, 4, 368–381. [Google Scholar] [CrossRef]
  8. Cruz-Jentoft, A.J.; Baeyens, J.P.; Bauer, J.M.; Boirie, Y.; Cederholm, T.; Landi, F.; Martin, F.C.; Michel, J.-P.; Rolland, Y.; Schneider, S.M. Sarcopenia: European consensus on definition and diagnosisReport of the European Working Group on Sarcopenia in Older People. Age Ageing 2010, 39, 412–423. [Google Scholar] [CrossRef] [Green Version]
  9. Caramia, C.; Torricelli, D.; Schmid, M.; Muñoz-Gonzalez, A.; Gonzalez-Vargas, J.; Grandas, F.; Pons, J.L. IMU-based classification of Parkinson’s disease from gait: A sensitivity analysis on sensor location and feature selection. IEEE J. Biomed. Health Inform. 2018, 22, 1765–1774. [Google Scholar] [CrossRef]
  10. Eskofier, B.M.; Lee, S.I.; Daneault, J.F.; Golabchi, F.N.; Ferreira-Carvalho, G.; Vergara-Diaz, G.; Sapienza, S.; Costante, G.; Klucken, J.; Kautz, T.; et al. Recent machine learning advancements in sensor-based mobility analysis: Deep learning for Parkinson’s disease assessment. In Proceedings of the 2016 38th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC), Orlando, FL, USA, 16–20 August 2016; pp. 655–658. [Google Scholar]
  11. Howcroft, J.; Kofman, J.; Lemaire, E. Prospective Fall-Risk Prediction Models for Older Adults Based on Wearable Sensors. IEEE Trans. Neural Syst. Rehabil. Eng. 2017, 25, 1812–1820. [Google Scholar] [CrossRef]
  12. Tunca, C.; Salur, G.; Ersoy, C. Deep Learning for Fall Risk Assessment with Inertial Sensors: Utilizing Domain Knowledge in Spatio-Temporal Gait Parameters. IEEE J. Biomed. Health Inform. 2020, 24, 1994–2005. [Google Scholar] [CrossRef] [PubMed]
  13. Dindorf, C.; Teufl, W.; Taetz, B.; Bleser, G.; Fröhlich, M. Interpretability of Input Representations for Gait Classification in Patients after Total Hip Arthroplasty. Sensors 2020, 20, 4385. [Google Scholar] [CrossRef] [PubMed]
  14. Ren, L.; Peng, Y. Research of Fall Detection and Fall Prevention Technologies: A Systematic Review. IEEE Access 2019, 7, 77702–77722. [Google Scholar] [CrossRef]
  15. Choi, W.; Choi, J.H.; Chung, C.Y.; Sung, K.H.; Lee, K.M. Can gait kinetic data predict femoral bone mineral density in elderly men and women aged 50 years and older? J. Biomech. 2021, 123, 110520. [Google Scholar] [CrossRef] [PubMed]
  16. ElDeeb, A.M.; Khodair, A.S. Three-dimensional analysis of gait in postmenopausal women with low bone mineral density. J. Neuroeng. Rehabil. 2014, 11, 55. [Google Scholar] [CrossRef] [Green Version]
  17. Sung, P.S. Increased double limb support times during walking in right limb dominant healthy older adults with low bone density. Gait Posture 2018, 63, 145–149. [Google Scholar] [CrossRef]
  18. Gunning, D.; Aha, D. DARPA’s Explainable Artificial Intelligence (XAI) Program. AI Mag. 2019, 40, 44–58. [Google Scholar] [CrossRef]
  19. Parsa, A.B.; Movahedi, A.; Taghipour, H.; Derrible, S.; Mohammadian, A. (Kouros) toward safer highways, application of XGBoost and SHAP for real-time accident detection and feature analysis. Accid. Anal. Prev. 2020, 136, 105405. [Google Scholar] [CrossRef]
  20. Taborri, J.; Palermo, E.; Rossi, S.; Cappa, P. Gait Partitioning Methods: A Systematic Review. Sensors 2016, 16, 66. [Google Scholar] [CrossRef] [Green Version]
  21. Whittle, M.W. Gait Analysis: An Introduction; Butterworth-Heinemann: Oxford, UK, 2014. [Google Scholar]
  22. Kim, J.; Bae, M.-N.; Lee, K.B.; Hong, S.G. Gait event detection algorithm based on smart insoles. ETRI J. 2019, 42, 46–53. [Google Scholar] [CrossRef]
  23. Nilsson, J.-O.; Gupta, A.K.; Handel, P. Foot-mounted inertial navigation made easy. In Proceedings of the 2014 International Conference on Indoor Positioning and Indoor Navigation (IPIN), Busan, Korea, 27–30 October 2014; pp. 24–29. [Google Scholar]
  24. Skog, I.; Nilsson, J.-O.; Handel, P. Pedestrian tracking using an IMU array. In Proceedings of the 2014 IEEE International Conference on Electronics, Computing and Communication Technologies (CONECCT), Bangalore, India, 6–7 January 2014; pp. 1–4. [Google Scholar]
  25. Bach, S.; Binder, A.; Montavon, G.; Klauschen, F.; Müller, K.-R.; Samek, W. On Pixel-Wise Explanations for Non-Linear Classifier Decisions by Layer-Wise Relevance Propagation. PLoS ONE 2015, 10, e0130140. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  26. Zhou, B.; Khosla, A.; Lapedriza, A.; Oliva, A.; Torralba, A. Learning deep features for discriminative localization. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA, 27–30 June 2016; pp. 2921–2929. [Google Scholar]
  27. Lee, J.R.; Kim, S.; Park, I.; Eo, T.; Hwang, D. Relevance-CAM: Your Model Already Knows Where to Look. In Proceedings of the 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Nashville, TN, USA, 20–25 June 2021; pp. 14939–14948. [Google Scholar]
  28. Kang, Y.; Na, D.L.; Hahn, S. A validity study on the Korean Mini-Mental State Examination (K-MMSE) in dementia patients. J. Korean Neurol. Assoc. 1997, 15, 300–308. [Google Scholar]
  29. Baek, S.; Piao, J.; Jin, Y.J.; Lee, S.-M. Validity of the Morse Fall Scale implemented in an electronic medical record system. J. Clin. Nurs. 2014, 23, 2434–2441. [Google Scholar] [CrossRef] [PubMed]
  30. Kim, S.; Kim, M.; Won, C.W. Validation of the Korean version of the SARC-F questionnaire to assess sarcopenia: Korean frailty and aging cohort study. J. Am. Med. Dir. Assoc. 2018, 19, 40–45. [Google Scholar] [CrossRef]
  31. Berg, K.; Wood-Dauphine, S.; Williams, J.I.; Gayton, D. Measuring balance in the elderly: Preliminary development of an instrument. Physiother. Can. 1989, 41, 304–311. [Google Scholar] [CrossRef]
  32. Podsiadlo, D.; Richardson, S. The timed “Up & Go”: A test of basic functional mobility for frail elderly persons. J. Am. Geriatr. Soc. 1991, 39, 142–148. [Google Scholar]
  33. Zhang, D.; Qian, L.; Mao, B.; Huang, C.; Huang, B.; Si, Y. A Data-Driven Design for Fault Detection of Wind Turbines Using Random Forests and XGboost. IEEE Access 2018, 6, 21020–21031. [Google Scholar] [CrossRef]
  34. Duan, H.; Zhao, Y.; Chen, K.; Shao, D.; Lin, D.; Dai, B. Revisiting Skeleton-based Action Recognition. arXiv 2021, arXiv:2104.13586. [Google Scholar]
  35. Wu, Z.; Shen, C.; Van Den Hengel, A. Wider or deeper: Revisiting the resnet model for visual recognition. Pattern Recog. 2019, 90, 119–133. [Google Scholar] [CrossRef] [Green Version]
  36. Wan, Z.; Yang, R.; Huang, M.; Zeng, N.; Liu, X. A review on transfer learning in EEG signal analysis. Neurocomputing 2021, 421, 1–14. [Google Scholar] [CrossRef]
Figure 1. Gait analysis flowchart.
Figure 1. Gait analysis flowchart.
Biosensors 12 00167 g001
Figure 2. Sensor attachments to the insoles.
Figure 2. Sensor attachments to the insoles.
Biosensors 12 00167 g002
Figure 3. Acceleration and angular velocity signals.
Figure 3. Acceleration and angular velocity signals.
Biosensors 12 00167 g003
Figure 4. SHAP plots of the spatial–temporal parameters of osteopenia (a) and sarcopenia (b).
Figure 4. SHAP plots of the spatial–temporal parameters of osteopenia (a) and sarcopenia (b).
Biosensors 12 00167 g004aBiosensors 12 00167 g004b
Figure 5. Inertial signals and SHAP dependence plots of descriptive statistical parameters 247 and 114 of osteopenia. (a) Inertial signal 247. (b) Inertial signal 114. (c) SHAP dependence plot 247. (d) SHAP dependence plot 114.
Figure 5. Inertial signals and SHAP dependence plots of descriptive statistical parameters 247 and 114 of osteopenia. (a) Inertial signal 247. (b) Inertial signal 114. (c) SHAP dependence plot 247. (d) SHAP dependence plot 114.
Biosensors 12 00167 g005
Figure 6. Inertial signals and SHAP dependence plots of descriptive statistical parameters 430 and 524 of sarcopenia. (a) Inertial signal 430. (b) Inertial signal 524. (c) SHAP dependence plot 430. (d) SHAP dependence plot 524.
Figure 6. Inertial signals and SHAP dependence plots of descriptive statistical parameters 430 and 524 of sarcopenia. (a) Inertial signal 430. (b) Inertial signal 524. (c) SHAP dependence plot 430. (d) SHAP dependence plot 524.
Biosensors 12 00167 g006
Figure 7. Layer2 result of applying LRP, Grad-CAM, and Relevance-CAM to ResNet50.
Figure 7. Layer2 result of applying LRP, Grad-CAM, and Relevance-CAM to ResNet50.
Biosensors 12 00167 g007
Figure 8. Osteopenia and sarcopenia result of applying LRP to ResNet50. (a) LRP result of osteopenia. (b) LRP result of sarcopenia.
Figure 8. Osteopenia and sarcopenia result of applying LRP to ResNet50. (a) LRP result of osteopenia. (b) LRP result of sarcopenia.
Biosensors 12 00167 g008
Table 1. Existing studies on disease identification using gait parameters. Abbreviations are as shown in Table A1.
Table 1. Existing studies on disease identification using gait parameters. Abbreviations are as shown in Table A1.
ReferenceParameterDiseasePositionClassificationAccuracy
Caramia 2018 [9]Step length, step time, stride time, speed, hip, knee, and ankle ROMPD R and L ankle, knee, hip, chestLDA, NB, k-NN, SVM, SVM RBF, DT, majority of votes96.88%
Eskofier 2016 [10]Energy maximum, minimum, mean, variance, skewness, kurtosis, fast Fourier transformPDUpper limbsAdaBoost, PART, k-NN, SVM, CNN90.9%
Howcroft 2017 [11]Cadence, stride time maximum, mean, and SD of accelerationFallerHead, pelvis, R and L shankNB, SVM, NN57%
Tunca 2019 [12]Stride length, cycle time, stance time, swing time, clearance, stance ratio, cadence, speedFallerBoth feetSVM, RF, MLP, HMM, LSTM94.30%
Teufl 2019 [5]Stride length, stride time, cadence, speed, hip and pelvis ROMTHAHip, thigh,
shank, foot
SVM97%
Dindorf 2020 [13]Various parametersTHAHip, knee,
pelvis, ankle
RF, SVM, SVM RBF, MLP100%
Kim 2021 [4]Various parametersSarcopeniaBoth feetRF, SVM, MLP, CNN, BiLSTM95%
OursVarious parametersOsteopenia
Sarcopenia
Both feetRF, SVM, XGBoost, CNN, BiLSTM, ResNet88.69%
93.75%
Table 2. Group population statistics for osteopenia and sarcopenia groups.
Table 2. Group population statistics for osteopenia and sarcopenia groups.
ParameterOsteopeniaNon-OsteopeniaOsteopenia
p-Value
SarcopeniaNon-SarcopeniaSarcopenia
p-Value
Age (years)70.48 ± 2.3670.33 ± 2.560.85271.10 ± 2.1369.50 ± 3.140.199
Height (cm)153.65 ± 4.83152.80 ± 5.930.614150.87 ± 4.66153.10 ± 4.360.283
Weight (kg)57.75 ± 6.1259.57 ± 7.120.37953.55 ± 5.6261.20 ± 5.070.005
Feet_size (mm)236.91 ± 7.66238.57 ± 6.550.453232.00 ± 5.87239.50 ± 6.430.014
MMSE27.62 ± 1.7728.19 ± 1.780.30327.80 ± 1.4027.30 ± 2.160.547
SARC-F3.19 ± 2.403.86 ± 2.150.3492.90 ± 1.522.90 ± 2.851.000
MFS23.10±17.9226.43 ± 16.590.53513.50 ± 12.9223.50 ± 12.700.098
BBS42.38 ± 8.4842.19 ± 6.850.93743.10 ± 6.2641.90 ± 9.470.742
3m TUG10.96 ± 1.6411.50 ± 2.870.46411.71 ± 1.629.85 ± 1.920.031
Grasp_right (kg)17.29 ± 5.4218.77 ± 4.710.35114.42 ± 3.6522.57 ± 2.730.000
Grasp_left (kg)17.61 ± 4.6718.04 ± 4.400.76114.15 ± 3.9722.17 ± 3.020.000
T_score (DEXA)−1.85 ± 0.740.69 ± 1.490.000−0.49 ± 2.08−0.64 ± 2.030.872
SMI(ASM/height)5.37 ± 0.555.38 ± 0.650.9614.58 ± 0.325.93 ± 0.350.000
Table 3. Definition of gait parameters.
Table 3. Definition of gait parameters.
Gait ParametersDefinition
Spatial–temporal parameters
Cadence Number of steps acquired per minute
Stance phase (time) Percent (time) starting with HS and ending with TO of the same foot
Swing phase (time) Percent (time) starting with TO and ending with HS of the same foot
Single support phase (time)Percent (time) when only one foot is on the ground
Double support phase (time)Percent (time) when both feet are on the ground
Stride length Distance starting with HS and ending with next HS of the same foot
Symmetry indices (SI)Absolute values of (right—left)/(0.5 × ( right + left )
Descriptive statistical parameters
Max Greatest values
Min Least or smallest values
SDStandard deviation of values
AbSumAbsolute sum of values
Root-mean-square (RMS)Arithmetic mean of the squares of a set of values
KurtosisAssesses whether the tails of a given distribution contain extreme values
SkewnessA measure of the asymmetry of the probability distribution of a real-valued random variable about its mean
MMgrGradient from maximum value to minimum value
DMMDifference between maximum value and minimum value
MdifMaximum for the difference between two successive values
Table 4. Instantiation of deep learning model.
Table 4. Instantiation of deep learning model.
CNNBiLSTMResNet50
InputNone, 100, 36, 1InputNone, 100, 36, 1InputNone, 100, 36, 1
Conv1 3 × 3 , 5
2 × 1 max pooling,
BiLSTM15Conv1 7 × 7 , 64
stride 2
3 × 3 max pooling, stride 2
Conv2 3 × 3 , 5
2 × 1 max pooling,
BiLSTM210Conv2 [ 1 × 1 ,   64 3 × 3 ,   64 1 × 1 ,   256 ] × 3
Conv3 3 × 3, 20Dropout0.5Conv3 [ 1 × 1 ,   128 3 × 3 ,   128 1 × 1 ,   512 ] × 4
Dropout0.5 FC, DenseConv4 [ 1 × 1 ,   256 3 × 3 ,   256 1 × 1 ,   1024 ] × 6
FC, Dense Conv5 [ 1 × 1 ,   512 3 × 3 ,   512 1 × 1 ,   2048 ] × 3
GAP, FC
Table 5. Identification result of RF, XGBoost, and SVM (accuracy, precision, recall and F1-score).
Table 5. Identification result of RF, XGBoost, and SVM (accuracy, precision, recall and F1-score).
GroupsParametersModelsAccuracyPrecisionRecallF1-Score
OsteopeniaSpatial–temporal (24)RF 0.4940.4760.3700.393
XGBoost0.4760.4760.3760.406
SVM 0.6370.6190.5110.544
Descriptive statistical (100)RF 0.6490.6550.6120.607
XGBoost0.6840.6900.6800.650
SVM 0.6070.6780.5900.604
SarcopeniaSpatial–temporal (24)RF 0.8020.8250.7750.775
XGBoost0.7520.7250.6670.677
SVM 0.7750.6030.7750.658
Descriptive statistical (100)RF 0.6750.6750.6320.631
XGBoost0.6030.6750.5570.591
SVM 0.6370.7040.6570.644
Table 6. Identification result of CNN, BiLSTM, and ResNet (accuracy, precision, recall and F1-score).
Table 6. Identification result of CNN, BiLSTM, and ResNet (accuracy, precision, recall and F1-score).
GroupsModelsAccuracyPrecisionRecallF1-Score
OsteopeniaCNN0.6960.6900.7350.670
BiLSTM0.6190.5700.6100.571
ResNet0.7670.6720.7260.676
ResNet(transfer)0.7860.8690.7470.787
SarcopeniaCNN0.6000.4370.5250.447
BiLSTM0.4250.3000.3500.299
ResNet0.6120.3370.5000.394
ResNet(transfer)0.7000.6120.6360.606
Table 7. Osteopenia identification results according to the number of important parameters (accuracy, %).
Table 7. Osteopenia identification results according to the number of important parameters (accuracy, %).
ClassMLNumber of Parameters
234567891020100
GiniRF70.8370.2364.8872.0268.4563.6961.3060.1160.7161.3064.88
XGBoost66.6667.8564.8871.4268.4564.2865.4761.3065.4767.2668.45
SVM64.2864.8864.8864.2861.3061.3059.5255.3557.7358.3360.71
PermutationRF73.2170.8369.6467.2664.2868.4570.2369.0467.2667.2664.88
XGBoost69.6470.8370.2368.4264.8865.4767.2666.7067.2670.2368.45
SVM65.4768.4566.0764.2866.6666.6664.2864.8864.8860.7160.71
SHAPRF73.8076.1970.2363.6963.0963.6963.0963.6957.7360.1164.88
XGBoost70.237574.4073.2166.6667.8563.6959.5256.5468.4568.45
SVM71.4271.4267.2661.3058.3358.3357.1455.9557.1462.560.71
Table 8. Sarcopenia identification results according to the number of important parameters (accuracy, %).
Table 8. Sarcopenia identification results according to the number of important parameters (accuracy, %).
ClassMLNumber of Parameters
2345101516171820100
GiniRF5058.7562.56568.7567.568.7571.2571.2562.567.5
XGBoost52.557.56566.2562.558.7558.7558.7558.7558.7560
SVM52.558.7566.256572.557.556.2556.2558.756063.75
PermutationRF62.56056.2553.7557.567.555607062.567.5
XGBoost60605558.756563.7568.756566.2567.560
SVM61.256060556566.2566.2568.7563.756063.75
SHAPRF56.256057.56567.562.572.573.7568.7567.567.5
XGBoost46.2563.7562.5656563.7563.756566.2563.7560
SVM58.7567.56061.2567566.2568.7562.56058.7563.75
Table 9. Feature importance and Shapley values of descriptive statistical parameters.
Table 9. Feature importance and Shapley values of descriptive statistical parameters.
ClassImportant Parameter12345678910
OsteopeniaParameters2471148721881620629121169667
Shapley value0.970.280.270.20.180.170.160.130.10.09
SarcopeniaParameters430524519270457231387397
Shapley value0.660.280.250.220.170.160.150.130.130.13
ClassImportant Parameter11121314151617181920
OsteopeniaParameters7741174580231223542242554422
Shapley value0.090.080.080.070.070.070.070.060.060.06
SarcopeniaParameters5675216906077043804698257
Shapley value0.130.120.110.090.090.080.080.080.080.07
Table 10. Seven-phase descriptive statistical parameters.
Table 10. Seven-phase descriptive statistical parameters.
RightLeft
ParameterMaxMinSDAbSumRMSKuSkeMMgrDMMMdifMaxMinSDAbSumRMSKuSkeMMgrDMMMdif
Loading responseAccX12345678910421422423424425426427428429430
AccY11121314151617181920431432433434435436437438439440
AccZ21222324252627282930441442443444445446447448449450
GyroX31323334353637383940451452453454455456457458459460
GyroY41424344454647484950461462463464465466467468469470
GyroZ51525354555657585960471472473474475476477478479480
Mid stanceAccX61626364656667686970481482483484485486487488489490
AccY71727374757677787980491492493494495496497498499500
AccZ81828384858687888990501502503504505506507508509510
GyroX919293949596979899100511512513514515516517518519520
GyroY101102103104105106107108109110521522523524525526527528529530
GyroZ111112113114115116117118119120531532533534535536537538539540
Terminal stanceAccX121122123124125126127128129130541542543544545546547548549550
AccY131132133134135136137138139140551552553554555556557558559560
AccZ141142143144145146147148149150561562563564565566567568569570
GyroX151152153154155156157158159160571572573574575576577578579580
GyroY161162163164165166167168169170581582583584585586587588589590
GyroZ171172173174175176177178179180591592593594595596597598599600
Pre swingAccX181182183184185186187188189190601602603604605606607608609610
AccY191192193194195196197198199200611612613614615616617618619620
AccZ201202203204205206207208209210621622623624625626627628629630
GyroX211212213214215216217218219220631632633634635636637638639640
GyroY221222223224225226227228229230641642643644645646647648649650
GyroZ231232233234235236237238239240651652653654655656657658659660
Initial swingAccX241242243244245246247248249250661662663664665666667668669670
AccY251252253254255256257258259260671672673674675676677678679680
AccZ261262263264265266267268269270681682683684685686687688689690
GyroX271272273274275276277278279280691692693694695696697698699700
GyroY281282283284285286287288289290701702703704705706707708709710
GyroZ291292293294295296297298299300711712713714715716717718719720
Mid swingAccX30130303304305306307308309310721722723724725726727728729730
AccY311312313314315316317318319320731732733734735736737738739740
AccZ321322323324325326327328329330741742743744745746747748749750
GyroX331332333334335336337338339340751752753754755756757758759760
GyroY341342343344345346347348349350761762763764765766767768769770
GyroZ351352353354355356357358359360771772773774775776777778779780
Terminal swingAccX361362363364365366367368369370781782783784785786787788789790
AccY371372373374375376377378379380791792793794795796797798799800
AccZ381382383384385386387388389390801802803804805806807808809810
GyroX391392393394395396397398399400811812813814815816817818819820
GyroY401402403404405406407408409410821822823824825826827828829830
GyroZ411412413414415416417418419420831832833834835836837838839840
Table 11. Osteopenia and sarcopenia identification results with the 20 parameters from Table 9 (accuracy, %).
Table 11. Osteopenia and sarcopenia identification results with the 20 parameters from Table 9 (accuracy, %).
ClassImportant Parameter12345678910
OsteopeniaRFx7585.1185.7178.5782.1480.9581.5477.9776.78
XGBoostx72.0280.9588.6987.6987.585.1182.7381.5483.33
SVMx74.407575.5983.9282.7380.9581.5480.3578.57
SarcopeniaRFx8582.583.75858586.2582.5882.5
XGBoostx8072.578.7576.2573.757571.2573.7571.25
SVMx81.258082.581.2582.586.2586.2587.581.25
ClassImportant Parameter11121314151617181920
OsteopeniaRF77.3872.6178.5774.4079.1682.1479.7673.8082.1482.73
XGBoost76.7876.7876.1977.9780.9581.5477.3874.4073.8074.40
SVM76.1977.9774.4072.6175.5976.7876.1977.9779.1674.40
SarcopeniaRF81.2583.7586.2588.7586.2587.591.2593.7586.2592.5
XGBoost71.527571.257071.2575.72.572.571.2572.5
SVM8083.7586.2583.7586.2581.2583.7578.7578.7578.75
Table 12. Spatial–temporal parameters of osteopenia and sarcopenia.
Table 12. Spatial–temporal parameters of osteopenia and sarcopenia.
ParameterOsteopeniaNon-OsteopeniaShapley ValueSarcopeniaNon-SarcopeniaShapley Value
1Stance phase time right (s)0.610.6450.034 **0.6140.6080.014
2Stance phase time left (s)0.6120.6410.084 *0.6170.6040.18
3Swing phase time right (s)0.4270.4190.1560.4160.4140.143
4Swing phase time left (s)0.4240.4220.040.4120.4170.039
5Stance phase percent right (%)58.7760.4420.196 **59.46859.4450.235
6Stance phase percent left (%)59.0560.1240.035 **59.85359.1140.345
7Double support first phase time right (s)0.10.1150.074 **0.1120.0990.005
8Double support first phase time left (s)0.0850.1060.197 **0.090.0900.551
9Double support second phase time right (s)0.0850.1060.031 **0.090.0900.097
10Double support second phase time left (s)0.10.1150.072 **0.1110.0990.007
11Single support phase time right (s)0.4240.4220.0780.4120.4180.007
12Single support phase time left (s)0.4270.4190.0170.4160.4140.018
13Double support first phase percent right (%)9.6610.7110.22410.8029.6920
14Double support first phase percent left (%)8.189.8570.311 **8.5638.8580.248
15Double support second phase percent right (%)8.179.8460.046 **8.5568.8550.072
16Double support second phase percent left (%)9.60610.6860.017 **10.7279.6770.001
17Single support phase percent right (%)40.93939.8840.077 **40.1140.8970.035
18Single support phase percent left (%)41.26239.580.416 **40.56240.5780.02
19Stride length right (m)0.950.930.0650.940.9790.022
20Stride length left (m)0.9180.8920.0150.8960.9420.011
21Stance phase time SI0.0310.0320.0180.0360.0250.250 **
22Swing phase time SI0.0410.0460.0730.0530.0340.049 **
23Stance phase percent SI0.0260.0280.0130.03250.0210.007 **
24Cadence (steps/min)115.781113.8590.047116.21117.4690
Table 13. Descriptive statistical parameters of osteopenia and sarcopenia. * indicates that the p-value is less than 0.025, and ** indicates that the p-value is less than 0.001.
Table 13. Descriptive statistical parameters of osteopenia and sarcopenia. * indicates that the p-value is less than 0.025, and ** indicates that the p-value is less than 0.001.
OsteopeniaSarcopenia
ParameterOsteopeniaNon-OsteopeniaShapley ValueParameterSarcopeniaNon-SarcopeniaShapley Value
12470.1260.5481.033 **4302.7483.7970.921 **
21141.8922.6130.312 **5244.9252.4030.113 **
3870.3571.2010.247 **510.8130.4630.189 **
42185.6717.0650.200 **98.12111.8130.142 **
58163.0912.5020.055 **27016.41713.0790.304 **
62061.9262.0890.119 *457−0.3520.0470.003 **
72913.7743.1290.020 **2311.5320.8910.002 **
82135.17529.3130.023 **387−0.170.0420.002 **
91693.5632.8230.032 **32.2673.440.129 **
106670.1350.4810.153 **97−0.4250.2740.021 **
Table 14. Top 2 descriptive statistical parameters of osteopenia and sarcopenia.
Table 14. Top 2 descriptive statistical parameters of osteopenia and sarcopenia.
ParameterOsteopeniaNon-OsteopeniaSarcopeniaNon-Sarcopenia
2470.126 + 0.4250.548 + 0.3820.364 + 0.4830.327 + 0.534
1141.892 + 0.862.613 + 0.9382.078 + 1.0882.217 + 0.591
4303.292 + 1.053.285 + 0.8182.748 + 0.8333.797 + 0.813
5243.317 + 2.0984.297 + 4.8734.925 + 3.4792.403 + 0.473
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Kim, J.-K.; Bae, M.-N.; Lee, K.; Kim, J.-C.; Hong, S.G. Explainable Artificial Intelligence and Wearable Sensor-Based Gait Analysis to Identify Patients with Osteopenia and Sarcopenia in Daily Life. Biosensors 2022, 12, 167. https://doi.org/10.3390/bios12030167

AMA Style

Kim J-K, Bae M-N, Lee K, Kim J-C, Hong SG. Explainable Artificial Intelligence and Wearable Sensor-Based Gait Analysis to Identify Patients with Osteopenia and Sarcopenia in Daily Life. Biosensors. 2022; 12(3):167. https://doi.org/10.3390/bios12030167

Chicago/Turabian Style

Kim, Jeong-Kyun, Myung-Nam Bae, Kangbok Lee, Jae-Chul Kim, and Sang Gi Hong. 2022. "Explainable Artificial Intelligence and Wearable Sensor-Based Gait Analysis to Identify Patients with Osteopenia and Sarcopenia in Daily Life" Biosensors 12, no. 3: 167. https://doi.org/10.3390/bios12030167

APA Style

Kim, J. -K., Bae, M. -N., Lee, K., Kim, J. -C., & Hong, S. G. (2022). Explainable Artificial Intelligence and Wearable Sensor-Based Gait Analysis to Identify Patients with Osteopenia and Sarcopenia in Daily Life. Biosensors, 12(3), 167. https://doi.org/10.3390/bios12030167

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop