Next Article in Journal
Intra-Pulse Modulation Recognition of Radar Signals Based on Efficient Cross-Scale Aware Network
Previous Article in Journal
An Anti-Forensics Video Forgery Detection Method Based on Noise Transfer Matrix Analysis
Previous Article in Special Issue
L Test Subtask Segmentation for Lower-Limb Amputees Using a Random Forest Algorithm
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Phase-Based Gait Prediction after Botulinum Toxin Treatment Using Deep Learning

1
Informatique, Bio-Informatique et Systèmes Complexes (IBISC) EA 4526, Univ Evry, Université Paris-Saclay, 91020 Evry, France
2
Department of Computer Science, Sukkur IBA University, Sukkur 65200, Sindh, Pakistan
3
UGECAM Ile-de-France, Movement Analysis Laboratory, 77170 Coubert, France
4
SAMOVAR, Télécom SudParis, Institut Polytechnique de Paris, 91120 Palaiseau, France
*
Authors to whom correspondence should be addressed.
Sensors 2024, 24(16), 5343; https://doi.org/10.3390/s24165343
Submission received: 28 June 2024 / Revised: 14 August 2024 / Accepted: 17 August 2024 / Published: 18 August 2024

Abstract

:
Gait disorders in neurological diseases are frequently associated with spasticity. Intramuscular injection of Botulinum Toxin Type A (BTX-A) can be used to treat spasticity. Providing optimal treatment with the highest possible benefit–risk ratio is a crucial consideration. This paper presents a novel approach for predicting knee and ankle kinematics after BTX-A treatment based on pre-treatment kinematics and treatment information. The proposed method is based on a Bidirectional Long Short-Term Memory (Bi-LSTM) deep learning architecture. Our study’s objective is to investigate this approach’s effectiveness in accurately predicting the kinematics of each phase of the gait cycle separately after BTX-A treatment. Two deep learning models are designed to incorporate categorical medical treatment data corresponding to the injected muscles: (1) within the hidden layers of the Bi-LSTM network, (2) through a gating mechanism. Since several muscles can be injected during the same session, the proposed architectures aim to model the interactions between the different treatment combinations. In this study, we conduct a comparative analysis of our prediction results with the current state of the art. The best results are obtained with the incorporation of the gating mechanism. The average prediction root mean squared error is 2.99° ( R 2 = 0.85) and 2.21° ( R 2 = 0.84) for the knee and the ankle kinematics, respectively. Our findings indicate that our approach outperforms the existing methods, yielding a significantly improved prediction accuracy.

1. Introduction

Musculoskeletal and neurological diseases lead to diminished quality of life [1]. These diseases may be associated with spasticity, a motor disorder that causes excessive tendon jerks due to hyper-excitability of the stretch reflexes [2]. Medical doctors recommend rehabilitation for such deficiencies, in addition to pharmacologic treatment. In particular, spasticity is usually treated with intramuscular Botulinum Toxin Type A (BTX-A) injections that enhance lower and upper limb function [3,4]. It is essential to ensure that the total dose of BTX-A administered to the patient and the doses on each treated muscle do not exceed the recommended maximal doses [5]. BTX-A injections should be separated at least three months apart due to their dangers and reversible effects on muscular function. A good BTX-A outcome can enhance and accelerate the rehabilitation process, but a bad outcome could slow down and/or limit the patient’s recovery. Therefore, optimizing BTX-A treatment by choosing the correct muscles to be treated and the dose distribution is a complex and crucial task that requires careful patient assessment. Anticipating the most likely outcome of a specific BTX-A treatment could help clinicians find the most adapted treatment combination (muscle and doses) for each patient. This could also potentially enhance the patient’s participation in the treatment and rehabilitation process.
Clinical Gait Analysis (CGA) is considered for treatment decisions, along with a medical history and physical examination. Based on a biomechanical interpretation of instrumental measures, CGA examines walking issues and suggests causes [6]. CGA data are clinically reliable if quality standards are met [7]. Scientific research has shown that CGA helps assess and treat neurological diseases such as Cerebral Palsy (CP) [8], post-stroke hemiparesis [4] and Multiple Sclerosis (MS) [9].
Deep Neural Networks (DNN) have excelled in clinical decision-making [10,11]. Deep Learning (DL) has been used in several CGA works to predict gait trajectories, primarily for healthy gaits [12,13,14,15]. Most works on pathological gaits tackle classification [16,17,18,19]. Among them, a few studies predict the gait trajectory some timestamps ahead [20,21], but they do not attempt to predict the post-treatment gait trajectory. Kolaghassi et al. [22] studied the abnormal walking patterns of children with neurological disorders. They used Long Short-Term Memory (LSTM) and a Convolutional Neural Network (CNN) to predict future hip, knee, and ankle trajectories up to 200 ms. Su et al. [20] used LSTM to predict gait trajectories and the five gait phases (loading response, mid-stance, terminal stance, pre-swing, and swing) to help design exoskeletons. Karakish et al. [12] implemented more reliable and simple Artificial Neural Networks (ANNs) to predict the gait trajectories using shank and foot IMU data. Four subjects were used for training, and a fifth was used for testing (200 ms future trajectories). Ding et al. [13] proposed a model for motion prediction of a subject using the motion of complementary limbs. They implemented LSTM with an attention mechanism.
The goal of this research is to enhance the prediction of the post-treatment (with BTX-A) gait signals using DL models to help clinicians with therapeutic decision-making. Our contribution aims to predict the gait trajectory after BTX-A treatment and determine how different treatments might be combined. It is based on an architecture that can handle sparse inputs (treatment data) and construct a more robust model by sharing information between sub-models representing different treatments [21]. In our past study [23], we compared serial and parallel architectures. The best one was made up of parallel Bi-LSTM-shaped sub-models. Each sub-model was paired with a treatment (injected muscle). Those models learn to map gait sequences well before and after treatment [23].
In this work, we split the learning process into stance and swing phases for the parallel architectures in [23]. Indeed, as we work on pathological gaits, the signals show more variance than normal gaits [24]. We note that the duration of the stance phase is often longer for patients than for healthy subjects; for example, in post-stroke hemiparesis [25]. The stance and swing phases obey different biomechanical constraints; therefore, we train separate models on each phase to enhance post-treatment CGA prediction. This strategy increases the global model’s ability to accurately predict the complete post-treatment gait cycle. We show significant improvements in the prediction quality of pathological gait cycles [23].

2. Materials and Methods

2.1. Dataset Description

This study gathered data at the Movement Analysis Laboratory at the Rehabilitation Center of UGECAM Coubert (France) using a four-camera CX1 optoelectronic Codamotion system operating at a frequency of 100 Hz. The participants were adults with varying gait abnormalities: MS, stroke, CP, Spinal Cord Injury (SCI), and Traumatic Brain Injury (TBI). The N p a t = 43 patients (26 males and 17 females) received BTX-A injections for spasticity treatment. In this retrospective analysis, the patients participated in clinical activities. The institution’s research ethics committee approved the use of these data. The patients were informed about the research and did not object to the use of their data. The patients had undergone treatment via injections into their lower limbs, with N u n i = 19 patients (44.18%) affected unilaterally (10 right limbs and 9 left limbs) and N b i l = 24 patients (55.82%) affected bilaterally. Their ages ranged from 21 to 75 years old. The time lag between pre- and post-treatment CGA was between 3 and 6 weeks. A total of nineteen muscles were injected, but we selected four frequently injected muscles: soleus, gastrocnemius (medial and lateral), semitendinosus, and rectus femoris. A fifth category, “other muscles”, grouped all the other treated muscles (see Table 1). There were 28 combinations of BTX-A injections into these four muscles. A treatment binary code vector was attributed to each lower limb:
s j = ( s 1 j , , s c j ) T , s i j { 0 , 1 } , i = 1 c ( c = 5 as shown in Table 1 )
where s i j = 1 if muscle i was injected in limb j, 0 otherwise, and d j = ( d 1 j , , d 5 j ) T , d i j { 0 , 1 } is a binary vector for the disease of the patient’s limb j. This study included five diseases: CP, MS, TBI, SCI, and stroke. T is the transpose operator.

2.2. Dataset Preparation

The participants were recorded walking in a straight line, with or without technical aids (i.e., cane, rollator, tripod, etc.), through a 10 m-long laboratory room. The patients wore anatomical markers, whose coordinates were tracked in 3D using four sensors. The patients walked back and forth throughout the gait hallway (trials). Depending on the patient’s capability, multiple trials of each patient were recorded at 100 Hz. The 3D gait kinematics were computed following the recommendations of the International Society of Biomechanics [26] based on the marker data. Each trial was divided into cycles and then segmented into the stance phase, from initial contact to toe-off, and the swing phase, from toe-off to subsequent initial contact (see Figure 1). Gait events (initial contacts and toe-offs) were detected from force platform data and automatically extracted by the HPA algorithm [27]. A human expert validated and modified all the gait events (when needed). The process of extracting cycles from trials to normalized phases is shown in Figure 2.
We considered a person’s right and left cycles as different samples. Each pre-treatment cycle phase was associated with a target post-treatment cycle’s corresponding phase of the same patient, leading to n = 2518 samples in total. Note that the number of cycles per patient varied from one patient to another.
Patient data were recorded for the following five joints in 3D: pelvis, hip, knee, ankle, and foot. This study only considered the knee and ankle kinematics in the sagittal plane.
The kinematic data of each phase were resampled and normalized to 51 points following standard procedures in CGA [28]. Therefore, DL models could be trained on sequences of the same length. For any patient’s limb j, the input vector is an angular time series x j = ( x 1 j , , x m j ) T [ 180 , + 180 ] m , and the target vector is y j = ( y 1 j , , y m j ) T , with m = 51 × 2 = 102 . Let D = { x j , y j , d j , s j } j = 1 n be the input–target training set.
The data were centered and reduced by the standard deviation for normalization purposes. The goal was to use g ( x ) to make a model that maps Y ^ = g ( x ) , where Y ^ is an estimation of y .

2.3. Description of Models

In our previous study [23], seven models were developed for prediction. Four were Bi-LSTM-based parallel models, and the others were serial models. The parallel models achieved better prediction results in 38 patients. LSTM [29] is good at predicting time series and can retain information for a long period of time. It has a hidden state h t and a cell state c t of the same size as the input series x t . The cell state c t is the model’s memory. The hidden state h t is the model’s prediction of x t .
We use the best models from our previous study [23], which are based on a Bi-LSTM. Bi-LSTM combines two LSTM models trained simultaneously: one on the forward input series and the other on the reverse input series, starting with the last input and moving on to the next-to-last, and so forth.
In both models (see Figure 3), we use five parallel layers of Bi-LSTM. Each layer is responsible for a treatment, as reported in Table 1. The five treatments correspond to the five categories of injected muscles. Each Bi-LSTM layer had 51 units. Note that each unit receives a pair of inputs for the knee and ankle, respectively.
We train separate models for the stance and swing phases. For comparison with our previous work [23], we combine both predictions using the proportion of stance and swing phases of the pre-treatment gait cycle. We retrieve the prediction for a complete cycle of the knee and ankle joints.
MTD-driven model (MTD-DM): The input vectors x and s are sent to the five Bi-LSTM sub-models in this architecture. The pre-treatment knee and ankle kinematic signals are represented by vector x . The Medical Treatment Data (MTD) are represented by vector s . We initialize the cell states of the LSTM as 0. The MTD (vector s ) is handled by the treatment supervisor and was used to set the values of the hidden states h. For example, if a patient had injections in muscles 1 and 3 (Table 1), then the states h 1 , t , h 2 , t in Bi-LSTM, layers 1 and 3 are set to 1, and the other layers’ (2, 4, and 5) hidden states are set to 0. The results of the five Bi-LSTM sub-models are concatenated to form a single tensor ‘output’, a one-dimensional vector, and reshaped. The reshaped output is then processed through two subsequent fully connected layers (FC1 and FC2) to predict the post-treatment kinematics (see Figure 3a).
MTD-gated model (MTD-GM): This architecture uses a gating mechanism to handle MTD. Instead of passing the MTD as a hidden state of each sub-model, the treatment supervisor is exploited downstream, multiplying each sub-model’s output by the associated binary value of the MTD. Figure 3b shows that if there is a treatment, it will be used in the model, but if there is none, it will be neglected (multiplied by 0). The results of the remaining Bi-LSTM sub-models are concatenated and reshaped. The reshaped output is then processed through two subsequent fully connected layers (FC1 and FC2) to predict the post-treatment kinematics (see Figure 3b). Leave-one-out cross-validation was used to assess model performance. For each iteration, N t r a i n = N p a t 1 patients were used to train the model, and one was used to test it. During the training process, mini-batches with 16 samples were used. DL models were trained based on the mean square error (MSE) loss function, controlled by the Adam optimizer [30]. RMSE [31], Standard Error (SE) [32], and the coefficient of determination ( R 2 ) [33] were used to evaluate the performance of the proposed models.

3. Results

A total of 43 participants were included in this study, whereas 38 were included in our previous work [23], and five new ones were added. We evaluated the two MTL models on the new dataset using the above-mentioned evaluation metrics.

3.1. Analysis of Stance Phase and Swing Phase

Table 2, Table 3 and Table 4 report the performance of both models in predicting the post-treatment gait kinematics of the stance phase, swing phase, and complete gait cycle concerning the disease. The bold entries in the following tables represent the best predictions (lowest RMSE and highest R 2 scores). We notice that the performances of both models are equivalent in the stance phase for the knee and ankle joints (average RMSE between 1.79° and 3.17°). On the contrary, for the swing phase, MTD-GM is much better than MTD-DM, decreasing the RMSE from 3.08° and 7.01° (knee) for MTD-DM to 2.43° and 3.89° (ankle) for MTD-GM. With the MTD-GM, in the swing phase, the R 2 score is better for the knee than for the ankle. It is the opposite of the ankle for the stance phase.

3.2. Analysis of Complete Cycle

As reported in Table 4, we combine the stance and swing phase predictions to obtain the prediction for each complete cycle. We note that the MTD-GM outperforms MTD-DM, showing much better R 2 scores for the knee (higher than 0.8) and ankle (higher than 0.7). Table 5 reports the overall prediction results on both joints for MTD-DM and MTD-GM. Our results improve the R 2 scores, which become higher than 0.9 in all cases. We also note that the R 2 score is increased for all groups of diseases, because combining both series changes the variance of the whole series.
Finally, the detailed results of MTD-GM for each patient per phase and joint and on both joints together are available in the Appendix A (show Figure A1). An average RMSE predicted by MTD-GM (all patients) for the stance phase of the knee, stance phase of the ankle, swing phase of the knee, swing phase of the ankle, complete knee, and complete ankle are 2.61°, 1.98°, 3.84°, 2.43°, 2.99°, and 2.21°, respectively.
Table 2. Performance of both models in predicting the post-treatment gait trajectories of the stance phase with respect to different diseases. Bold entries denote the best predictions: lowest average RMSE and maximum R 2 over all limbs for a given disease.
Table 2. Performance of both models in predicting the post-treatment gait trajectories of the stance phase with respect to different diseases. Bold entries denote the best predictions: lowest average RMSE and maximum R 2 over all limbs for a given disease.
ModelModel TypeBody JointSpinal Cord Injury (SCI)Multiple Sclerosis (MS)StrokeCerebral Palsy (CP)Traumatic Brain Injury (TBI)
No. of Patients
13121143
No. of Cycles
40035126013874
RMSE Mean (°) ± Standard Error
R 2 Score
MTD-DMMTL, 5 Bi-LSTMsKnee2.6 ± 0.762.48 ± 0.732.65 ± 0.782.62 ± 1.083.17 ± 1.23
0.540.790.640.620.18
Ankle2.09 ± 0.791.83 ± 0.642.02 ± 0.51.99 ± 0.82.44 ± 0.78
0.870.930.670.850.84
MTD-GMMTL, 5 gated Bi-LSTMsKnee2.59 ± 0.842.48 ± 0.72.7 ± 0.782.65 ± 1.082.89 ± 1.03
0.520.800.620.460.32
Ankle2.09 ± 0.751.79 ± 0.612.01 ± 0.491.96 ± 0.822.25 ± 0.75
0.850.930.680.850.86
Table 3. Performance of both models in predicting post-treatment gait trajectories of the swing phase with respect to different diseases. Bold entries denote the best predictions: lowest average RMSE and maximum R 2 over all limbs for a given disease.
Table 3. Performance of both models in predicting post-treatment gait trajectories of the swing phase with respect to different diseases. Bold entries denote the best predictions: lowest average RMSE and maximum R 2 over all limbs for a given disease.
ModelModel TypeBody JointSpinal Cord Injury (SCI)Multiple Sclerosis (MS)StrokeCerebral Palsy (CP)Traumatic Brain Injury (TBI)
No. of Patients
13121143
No. of Cycles
40035126013874
RMSE Mean (°) ± Standard Error
R 2 Score
MTD-DMMTL, 5 Bi-LSTMsKnee6.23 ± 2.057.01 ± 2.27.0 ± 1.736.72 ± 2.296.8 ± 1.17
0.310.59−1.56−3.860.26
Ankle5.15 ± 2.055.15 ± 2.24.36 ± 1.735.3 ± 2.293.08 ± 1.17
−3.35−1.18−5.16−4.36−1.34
MTD-GMMTL, 5 gated Bi-LSTMsKnee3.52 ± 1.23.89 ± 1.242.85 ± 1.042.88 ± 1.494.86 ± 1.28
0.720.840.520.630.57
Ankle2.62 ± 1.22.29 ± 1.242.63 ± 1.041.71 ± 1.492.43 ± 1.28
0.060.53−1.450.15−0.25

3.3. Visualizing Predictions of MTD-GM

Figure 4 represents the average predictions by the MTD-GM model across all cycles of five patients, each suffering from a different disease. The ankle dorsiflexion or knee flexion is shown on the Y-axis, and the patient’s gait cycle is shown on the X-axis. Figure 4a,b represent the average predictions of the CP patient, Figure 4c,d represent the average predictions of the MS patient, Figure 4e,f represent the average predictions of the stoke patient, Figure 4g,h represent the average predictions of the SCI patient, and Figure 4i,j represent the average predictions of the TBI patient. In all these graphs, we observe that each patient’s average predicted gait trajectory is very close to the average actual trajectory. Hence, it shows that our model performs well in predicting the post-treatment gait trajectory of patients with disabilities, as reported in Table 2, Table 3 and Table 4.
Table 4. Performance of both models in predicting the post-treatment gait trajectories of a complete cycle with respect to different diseases. Bold entries denote the best predictions: lowest average RMSE and maximum R 2 over all limbs for a given disease.
Table 4. Performance of both models in predicting the post-treatment gait trajectories of a complete cycle with respect to different diseases. Bold entries denote the best predictions: lowest average RMSE and maximum R 2 over all limbs for a given disease.
ModelModel TypeBody JointSpinal Cord Injury (SCI)Multiple Sclerosis (MS)StrokeCerebral Palsy (CP)Traumatic Brain Injury (TBI)
No. of Patients
13121143
No. of Cycles
40035126013874
RMSE Mean (°) ± Standard Error
R 2 Score
MTD-DMMTL, 5 Bi-LSTMsKnee4.18 ± 0.974.58 ± 1.114.58 ± 1.094.82 ± 1.234.26 ± 0.77
0.700.850.69−0.170.72
Ankle3.61 ± 0.993.35 ± 0.82.92 ± 0.853.6 ± 1.022.86 ± 0.73
0.550.810.470.440.75
MTD-GMMTL, 5 gated Bi-LSTMsKnee3.05 ± 0.73.08 ± 0.692.9 ± 0.792.88 ± 0.993.28 ± 0.78
0.800.930.880.800.80
Ankle2.37 ± 0.82.02 ± 0.542.3 ± 0.441.92 ± 0.722.44 ± 0.49
0.830.940.700.860.82
Table 5. Performance of both models for both joints in predicting the post-treatment gait trajectories with respect to different diseases. Bold entries denote the best predictions: lowest average RMSE and maximum R 2 over all limbs for a given disease.
Table 5. Performance of both models for both joints in predicting the post-treatment gait trajectories with respect to different diseases. Bold entries denote the best predictions: lowest average RMSE and maximum R 2 over all limbs for a given disease.
ModelModel TypeBody JointSpinal Cord Injury (SCI)Multiple Sclerosis (MS)StrokeCerebral Palsy (CP)Traumatic Brain Injury (TBI)
No. of Patients
13121143
No. of Cycles
40035126013874
RMSE Mean (°) ± Standard Error
R 2 Score
MTD-DMMTL, 5 Bi-LSTMsKnee and Ankle4.02 ± 0.764.12 ± 0.793.91 ± 0.794.33 ± 1.023.76 ± 0.61
0.870.910.890.770.86
MTD-GMMTL, 5 gated Bi-LSTMs2.78 ± 0.612.65 ± 0.532.65 ± 0.572.48 ± 0.752.97 ± 0.57
0.940.960.950.940.90
However, the CGA kinematics were difficult to predict for a few patients’ post-treatment. In Figure 5a–d, we report that the post-treatment cycles for these patients show more variability between cycles than before treatment. This is not the case for most of the other patients. We investigate this phenomenon below.
Figure 5a shows the trajectories of the right limb of a patient with SCI. The knee kinematics of this patient show more inter-cycle variability than most patients. The MTD-GM model predicted this patient’s post-treatment kinematics with an average RMSE of 6.06° and an R 2 value of −0.16 for the knee joint. This is worse than the average RMSE of 3.05°. We also notice that the stance phase’s prediction error is higher than the swing phase. Figure 5b shows the trajectories of a TBI patient with an issue in his left limb. The post-treatment gait kinematics of the knee joint of this patient were more intra-variant. The prediction for this patient was also not good, with an RMSE of 6.03° and an R 2 = 0.41 for the knee joint. Figure 5c shows the trajectories of a patient with CP. Both the pre- and post-treatment gait kinematics of this patient showed high variance between cycles. The prediction of this patient shows an average RMSE of 3.02° and an R 2 score of 0.27 for the knee joint. It is difficult for our DL models to predict the post-treatment gait trajectories, which show significant variance. Figure 5d shows the trajectories of a stroke patient with an issue in his lower limbs. His kinematics were also more inter- and intra-variant, with a prediction of an average RMSE of 2.98° and R 2 = −0.04 for the knee joint.

4. Comparison between Cycle-Based Prediction, Phased-Based Prediction, and Related Works

It is difficult to compare our results to other research, because this is the first time that the entire knee and ankle kinematic signals in the sagittal plane have been predicted for BTX-A treatment, except for in ref. [23]. In this study, we worked on the kinematic prediction per phase (stance and swing) of each gait cycle, while in [23], the complete cycles were considered. The results of the total cycles of this study and the previous study are compared in Table 6. It is important to note that five additional patients are considered in this work; thus, the dataset is similar but nonidentical. For MS and TBI diseases, we recruited the same patients in both studies, but the difference in the prediction performance is evident. We can see that the R 2 scores for stroke, TBI, and CP were negative in [23]. For our new approach, there was an improvement, achieving more than 0.7 in the R 2 score. Furthermore, we compare our results with those described by [4] for forecasts of peak knee and ankle flexion in the sagittal plane for rectus femoris BTX-A injection in stroke patients with see [Table 6]. In this case, our proposed method (MTD-GM) for stroke had a higher R 2 score for knee flexion and ankle dorsiflexion. However, since the models were not trained and tested using the same databases, this comparison should be interpreted with care. In addition, we also compare how well our model predicted the post-treatment kinematics for patients with CP. Even though the proposed methods were not tested on the same databases, their results were better than the postoperative predictions for CP made by Galarraga et al. [34], Niiler et al. [35], and Niiler [36], as shown in Table 6. MTD-GM predicted knee flexion and ankle dorsiflexion with an average RMSE of 2.9° and 1.92°, respectively.

5. Discussion and Conclusions

In this study, we used two DL models using Bi-LSTM modules to predict the effects of BTX-A injections on gait kinematics. As far as we know, except for our previous work [23], no other study in the literature has used DL to look at this specific prediction task. The prediction task we assess in this work is more challenging than predicting normal gait due to the high variability between and within subjects in pathological gait.
DL architectures allow information about medical treatments to be added to the model. Instead of performing a simple pre-to-post regression task, previous results show that adding information about the treatment (i.e., which muscles were treated using BTX-A) improves performance [23].
In this work, we predict knee and ankle kinematics in the sagittal plane, but we perform this prediction per gait phase separately and only by exploiting the best DL architectures proposed in [23]. These best architectures are based on Bi-LSTM sub-models. The best DL model incorporates the presence or absence of treatments based on a gating mechanism (MTD-GM model). This model predicts post-treatment gait kinematics with a lower RMSE and higher R 2 score than the MTD-DM model and for all the considered pathologies. MTD-DM introduces treatments by initializing the hidden units of each Bi-LSTM sub-model. Instead, we randomly initialize hidden units in MTD-GM and apply the gating mechanism on the outputs of the Bi-LSTM sub-models, each devoted to a unique muscle treatment.
In our previous work [23], this model (gated model) was not the best in all situations. This improvement is due to the approach proposed in this work: splitting the learning process of each DL architecture between the stance and swing phases. This proposal allows the model to cope with the high variance present in pre-treatment and post-treatment gait between individuals and within each individual. We notice the effectiveness of our approach by exploiting the advantages of DL architectures on each gait phase separately. We note that the gated model has a relative improvement of around 55% on both joints. We have thus proven the increased capability of the global model (after combining the predictions of both phases) in accurately predicting the complete gait cycles after treatment.
If we examine the results per phase, for the stance phase, both models’ performances (MTD-DM and MTD-GM) are equivalent. For the swing phase, the difference between both models is essential. The gated model (MTD-GM) is much better: it often divides the RMSE by a factor of 2. The R 2 score is generally good for the knee (higher than 0.7) but close to 0 for the ankle. Moreover, no proposed model adequately explained the variance of the swing phase of the ankle joint (note that MTD-DM performed badly, as shown by negative R 2 scores).
We conclude that using treatment data to initialize hidden variables in each Bi-LSTM sub-model is not enough to cope with the high variance encountered in the data, especially in the swing phase. Indeed, for this phase, we remark that a more drastic treatment supervisor for the outputs of the Bi-LSTM sub-models is required. In this work, we do not address the segmentation between phases of gait cycles; instead, we focus on the prediction quality of our models by exploiting the segmentation of the pre-treatment gait cycles on the post-treatment gait cycles. In future work, we will study the impact of considering treatment dosage information. We expect to refine our predictions of post-treatment kinematics by including this information in our DL architecture.

Author Contributions

Conceptualization, A.K., O.G., S.G.-S. and V.V.; methodology, A.K.; software, A.K.; validation, A.K.; formal analysis, A.K., O.G., S.G.-S. and V.V.; investigation, A.K., O.G., S.G.-S. and V.V.; resources, O.G., S.G.-S. and V.V.; data curation, A.K. and O.G; writing—original draft preparation, A.K.; writing—review and editing, A.K., O.G., S.G.-S. and V.V.; visualization, A.K.; supervision, S.G.-S. and V.V.; project administration, S.G.-S. and V.V.; funding acquisition, O.G. and V.V. All authors have read and agreed to the published version of the manuscript.

Funding

This work was part of a a Ph.D. project funded by HEC Pakistan.

Institutional Review Board Statement

This retrospective study was approved by the local research ethics committee.

Informed Consent Statement

Since this is a retrospective study, no informed consent was needed. The patients were informed about the research and did not oppose to the utilization of their data.

Data Availability Statement

Due to the nature of this research, the participants of this study did not agree for their data to be shared publicly, so the supporting data are not available.

Acknowledgments

The authors would like to thank HEC Pakistan for funding part of this research. We thank the staff of the Movement Analysis Laboratory of UGECAM Coubert, who acquired all the data used in this work. We would also like to acknowledge the anonymous reviewers for their valuable observations and suggestions.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:
AIArtificial intelligence
MLMachine learning
DLDeep learning
MTLMulti-task learning
CPCerebral palsy
CGAClinical gait analysis
LSTMLong short-term memory
CNNConvolutional neural network
MAEMean absolute error
RMSERoot mean squared error
SEStandard error
BTX-ABotulinum toxin type A
TBITraumatic brain injury
SCISpinal cord injury
MSMultiple sclerosis
CMAClinical movement analysis
DNNDeep neural network
RNNRecurrent neural network
IMUInertial measurement unit
HPAHigh-pass algorithm
Bi-LSTMBi-directional LSTM
MTDMedical treatment data

Appendix A

The detailed results of all the patients using the MTD-GM model are listed in the following figure.
Figure A1. Detailed results of all patients using the MTD-GM model.
Figure A1. Detailed results of all patients using the MTD-GM model.
Sensors 24 05343 g0a1

References

  1. McLoughlin, J.; Barr, C.; Crotty, M.; Lord, S.; Sturnieks, D. Association of postural sway with disability status and cerebellar dysfunction in people with multiple sclerosis: A preliminary study. Int. J. MS Care 2015, 17, 146–151. [Google Scholar] [CrossRef]
  2. Blumhardt, L. Multiple Sclerosis Dictionary; Taylor & Francis: Abingdon, UK, 2004. [Google Scholar]
  3. Sun, L.C.; Chen, R.; Fu, C.; Chen, Y.; Wu, Q.; Chen, R.; Lin, X.; Luo, S. Efficacy and Safety of Botulinum Toxin Type A for Limb Spasticity after Stroke: A Meta-Analysis of Randomized Controlled Trials. BioMed Res. Int. 2019, 2019, 8329306. [Google Scholar] [CrossRef] [PubMed]
  4. Roche, N.; Boudarham, J.; Hardy, A.; Bonnyaud, C.; Bensmail, D. Use of gait parameters to predict the effectiveness of botulinum toxin injection in the spastic rectus femoris muscle of stroke patients with stiff knee gait. Eur. J. Phys. Rehabil. Med. 2015, 51, 361–370. [Google Scholar]
  5. Notice Patient—DYSPORT 500 UNITES SPEYWOOD, Poudre Pour Solution Injectable—Base de DonnéEs Publique des Médicaments. Available online: https://base-donnees-publique.medicaments.gouv.fr/affichageDoc.php?specid=60242321&typedoc=N (accessed on 15 June 2022).
  6. Baker, R.W. Measuring Walking: A Handbook of Clinical Gait Analysis, 1st ed.; MacKeith Press: London, UK, 2013. [Google Scholar]
  7. McGinley, J.L.; Baker, R.; Wolfe, R.; Morris, M.E. The reliability of three-dimensional kinematic gait measurements: A systematic review. Gait Posture 2009, 29, 360–369. [Google Scholar] [CrossRef]
  8. Lerner, Z.F.; Damiano, D.L.; Bulea, T.C. A lower-extremity exoskeleton improves knee extension in children with crouch gait from cerebral palsy. Sci. Transl. Med. 2017, 9, eaam9145. [Google Scholar] [CrossRef] [PubMed]
  9. Di Russo, F.; Berchicci, M.; Perri, R.L.; Ripani, F.R.; Ripani, M. A passive exoskeleton can push your life up: Application on multiple sclerosis patients. PLoS ONE 2013, 8, e77348. [Google Scholar] [CrossRef] [PubMed]
  10. Goodfellow, I.; Bengio, Y.; Courville, A. Deep Learning; MIT Press: Cambridge, MA, USA, 2016. [Google Scholar]
  11. Topol, E.J. High-performance medicine: The convergence of human and artificial intelligence. Nat. Med. 2019, 25, 44–56. [Google Scholar] [CrossRef] [PubMed]
  12. Karakish, M.; Fouz, M.A.; ELsawaf, A. Gait trajectory prediction on an embedded microcontroller using deep learning. Sensors 2022, 22, 8441. [Google Scholar] [CrossRef] [PubMed]
  13. Ding, G.; Plummer, A.; Georgilas, I. Deep learning with an attention mechanism for continuous biomechanical motion estimation across varied activities. Front. Bioeng. Biotechnol. 2022, 10, 1021505. [Google Scholar] [CrossRef] [PubMed]
  14. Morbidoni, C.; Cucchiarelli, A.; Fioretti, S.; Di Nardo, F. A deep learning approach to EMG-based classification of gait phases during level ground walking. Electronics 2019, 8, 894. [Google Scholar] [CrossRef]
  15. Bauman, V.V.; Brandon, S.C. Gait phase detection in walking and stairs using machine learning. J. Biomech. Eng. 2022, 144, 121007. [Google Scholar] [CrossRef]
  16. Lempereur, M.; Rousseau, F.; Rémy-Néris, O.; Pons, C.; Houx, L.; Quellec, G.; Brochard, S. A new deep learning-based method for the detection of gait events in children with gait disorders: Proof-of-concept and concurrent validity. J. Biomech. 2020, 98, 109490. [Google Scholar] [CrossRef] [PubMed]
  17. Tunca, C.; Salur, G.; Ersoy, C. Deep learning for fall risk assessment with inertial sensors: Utilizing domain knowledge in spatio-temporal gait parameters. IEEE J. Biomed. Health Inform. 2019, 24, 1994–2005. [Google Scholar] [CrossRef] [PubMed]
  18. Sadeghzadehyazdi, N.; Batabyal, T.; Acton, S.T. Modeling spatiotemporal patterns of gait anomaly with a CNN-LSTM deep neural network. Expert Syst. Appl. 2021, 185, 115582. [Google Scholar] [CrossRef]
  19. Chen, I.; Yeh, P.Y.; Chang, T.C.; Hsieh, Y.C.; Chin, C.L. Sarcopenia Recognition System Combined with Electromyography and Gait Obtained by the Multiple Sensor Module and Deep Learning Algorithm. Sensors Mater. 2022, 34, 2403–2425. [Google Scholar] [CrossRef]
  20. Su, B.; Gutierrez-Farewik, E.M. Gait trajectory and gait phase prediction based on an LSTM network. Sensors 2020, 20, 7127. [Google Scholar] [CrossRef] [PubMed]
  21. Zhang, Y.; Yang, Q. A survey on multi-task learning. IEEE Trans. Knowl. Data Eng. 2021, 34, 5586–5609. [Google Scholar] [CrossRef]
  22. Kolaghassi, R.; Al-Hares, M.K.; Marcelli, G.; Sirlantzis, K. Performance of Deep Learning Models in Forecasting Gait Trajectories of Children with Neurological Disorders. Sensors 2022, 22, 2969. [Google Scholar] [CrossRef] [PubMed]
  23. Khan, A.; Hazart, A.; Galarraga, O.; Garcia-Salicetti, S.; Vigneron, V. Treatment Outcome Prediction Using Multi-Task Learning: Application to Botulinum Toxin in Gait Rehabilitation. Sensors 2022, 22, 8452. [Google Scholar] [CrossRef] [PubMed]
  24. Klejman, S.; Andrysek, J.; Dupuis, A.; Wright, V. Test-Retest Reliability of Discrete Gait Parameters in Children With Cerebral Palsy. Arch. Phys. Med. Rehabil. 2010, 91, 781–787. [Google Scholar] [CrossRef]
  25. Galarraga, O.; Le Saux, A.; Hareb, F. O 083-A new method for computing gait deviation scores in hemiparesis. Gait Posture 2018, 65, 170–171. [Google Scholar] [CrossRef]
  26. Wu, X.; Yuan, Y.; Zhang, X.; Wang, C.; Xu, T.; Tao, D. Gait phase classification for a lower limb exoskeleton system based on a graph convolutional network model. IEEE Trans. Ind. Electron. 2021, 69, 4999–5008. [Google Scholar] [CrossRef]
  27. Desailly, E.; Daniel, Y.; Sardain, P.; Lacouture, P. Foot contact event detection using kinematic data in cerebral palsy children and normal adults gait. Gait Posture 2009, 29, 76–80. [Google Scholar] [CrossRef]
  28. Schwartz, M.H.; Rozumalski, A. The Gait Deviation Index: A new comprehensive index of gait pathology. Gait Posture 2008, 28, 351–357. [Google Scholar] [CrossRef] [PubMed]
  29. Hochreiter, S.; Schmidhuber, J. Long short-term memory. Neural Comput. 1997, 9, 1735–1780. [Google Scholar] [CrossRef] [PubMed]
  30. Kinga, D.; Adam, J.B. A method for stochastic optimization. In Proceedings of the International Conference on Learning Representations (ICLR), San Diego, CA, USA, 7–9 May 2015; Volume 5, p. 6. [Google Scholar]
  31. Bishop, C.M. Pattern Recognition And Machine Learning, 2nd ed.; Springer: New York, NY, USA, 2006. [Google Scholar]
  32. Altman, D.G.; Bland, J.M. Standard deviations and standard errors. BMJ 2005, 331, 903. [Google Scholar] [CrossRef]
  33. Hastie, T.; Tibshirani, R.; Friedman, J. An Introduction to Statistical Learning. 2009. Available online: https://www.stat.berkeley.edu/users/rabbee/s154/ISLR_First_Printing.pdf (accessed on 13 August 2024).
  34. Galarraga, C.O.A.; Vigneron, V.; Dorizzi, B.; Khouri, N.; Desailly, E. Predicting postoperative gait in cerebral palsy. Gait Posture 2017, 52, 45–51. [Google Scholar] [CrossRef]
  35. Niiler, T.A.; Richards, J.G.; Miller, F.; Sun, J.Q.; Castagno, P. Reliability of predictions of post-operative gait in rectus transfer patients using FFT neural networks. Gait Posture 1999, 9. [Google Scholar]
  36. Niiler, T.A. Efficacy of Predictions of Post-Operative Gait in Rectus Transfer Patients Using Neural Networks. Ph.D. Thesis, University of Delaware, Newark, DE, USA, 2001. [Google Scholar]
Figure 1. Process of converting each patient’s trial into normalized gait phases (stance and swing). (a,b) knee and ankle trials of a random Patient i, respectively; (c,d) knee and ankle cycles of Patient i extracted from their trials; (e,f) One cycle of the knee and ankle joints, respectively; (g,i) normalized stance phase of knee and ankle joints from given cycles, respectively; (h,j) normalized swing phase of knee and ankle joints, respectively.
Figure 1. Process of converting each patient’s trial into normalized gait phases (stance and swing). (a,b) knee and ankle trials of a random Patient i, respectively; (c,d) knee and ankle cycles of Patient i extracted from their trials; (e,f) One cycle of the knee and ankle joints, respectively; (g,i) normalized stance phase of knee and ankle joints from given cycles, respectively; (h,j) normalized swing phase of knee and ankle joints, respectively.
Sensors 24 05343 g001
Figure 2. Flowchart of data preprocessing. We acquired the trajectories of patient trials from the database and extracted all cycles from the trials for the knee and ankle joints. After that, we selected each cycle and segmented them into stance and swing phases. Finally, we normalized the phases.
Figure 2. Flowchart of data preprocessing. We acquired the trajectories of patient trials from the database and extracted all cycles from the trials for the knee and ankle joints. After that, we selected each cycle and segmented them into stance and swing phases. Finally, we normalized the phases.
Sensors 24 05343 g002
Figure 3. (a) Multi-task learning architecture with Bi-LSTM sub-models: processing of MTD in each sub-model. (b) Multi-task learning architecture with Bi-LSTM sub-models: incorporating MTD through a gating mechanism.
Figure 3. (a) Multi-task learning architecture with Bi-LSTM sub-models: processing of MTD in each sub-model. (b) Multi-task learning architecture with Bi-LSTM sub-models: incorporating MTD through a gating mechanism.
Sensors 24 05343 g003
Figure 4. Comparison of MTD-GM prediction and post-treatment kinematics for a patient with a given disease.
Figure 4. Comparison of MTD-GM prediction and post-treatment kinematics for a patient with a given disease.
Sensors 24 05343 g004
Figure 5. Outliers.
Figure 5. Outliers.
Sensors 24 05343 g005
Table 1. Considered injected muscles and their frequencies in the database.
Table 1. Considered injected muscles and their frequencies in the database.
Muscle NumberMuscle/CategoryInjections in Patient
NumberProportion
1Soleus5328.3%
2Gastrocnemius (medialis and/or lateralis)5127.2%
3Rectus femoris2211.7%
4Semitendinosus147.4%
5Other muscle4725.1%
Table 6. Performance comparison to related works. Bold entries denote the best predictions: lowest average RMSE and maximum R 2 over all limbs for a given disease. * Only peak flexion and not over the whole time series.
Table 6. Performance comparison to related works. Bold entries denote the best predictions: lowest average RMSE and maximum R 2 over all limbs for a given disease. * Only peak flexion and not over the whole time series.
ModelModel TypeBody JointSpinal Cord Injury (SCI)Multiple Sclerosis (MS)StrokeCerebral Palsy (CP)Traumatic Brain Injury (TBI)
RMSE Mean (°) ± Standard Error
R 2 Score
Model 1 [23]MTL, 5 Bi-LSTMsKnee7.51 ± 1.677.23 ± 1.697.14 ± 1.086.75 ± 1.735.81 ± 1.33
0.630.630.010.70.54
Ankle5.01 ± 0.994.38 ± 1.156.85 ± 1.616.4 ± 1.194.68 ± 0.82
0.380.55−3.58−0.010.4
Model 2 [23]MTL, 5 gated Bi-LSTMsKnee7.62 ± 1.897.23 ± 2.018.02 ± 1.097.00 ± 2.425.60 ± 1.4
0.530.480.210.680.65
Ankle4.56 ± 1.055.39 ± 1.377.14 ± 1.443.77 ± 1.414.93 ± 1.0
0.270.28−3.65−0.16−0.5
MTD-DMMTL, 5 Bi-LSTMsKnee4.18 ± 0.974.58 ± 1.114.58 ± 1.094.82 ± 1.234.26 ± 0.77
0.700.850.69−0.170.72
Ankle3.61 ± 0.993.35 ± 0.82.92 ± 0.853.6 ± 1.022.86 ± 0.73
0.550.810.470.440.75
MTD-GMMTL, 5 gated Bi-LSTMsKnee3.05 ± 0.73.08 ± 0.692.9 ± 0.792.88 ± 0.993.28 ± 0.78
0.800.930.880.800.80
Ankle2.37 ± 0.82.02 ± 0.542.3 ± 0.441.92 ± 0.722.44 ± 0.49
0.830.940.700.860.82
LigReg Stroke * [4] ( R 2 Score)Linear RegressionKnee--0.24--
Ankle--0.43--
MLinReg CP [34] ( RMSE (°))Multiple Linear RegressionKnee---9.0-
Ankle---7.5-
NN99 CP [35] ( RMSE (°))NNKnee---9.7-
Ankle---6.7-
NN01 CP [36] ( RMSE (°))NNKnee---9.2-
Ankle-----
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Khan, A.; Galarraga, O.; Garcia-Salicetti, S.; Vigneron, V. Phase-Based Gait Prediction after Botulinum Toxin Treatment Using Deep Learning. Sensors 2024, 24, 5343. https://doi.org/10.3390/s24165343

AMA Style

Khan A, Galarraga O, Garcia-Salicetti S, Vigneron V. Phase-Based Gait Prediction after Botulinum Toxin Treatment Using Deep Learning. Sensors. 2024; 24(16):5343. https://doi.org/10.3390/s24165343

Chicago/Turabian Style

Khan, Adil, Omar Galarraga, Sonia Garcia-Salicetti, and Vincent Vigneron. 2024. "Phase-Based Gait Prediction after Botulinum Toxin Treatment Using Deep Learning" Sensors 24, no. 16: 5343. https://doi.org/10.3390/s24165343

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop