Phase-Based Gait Prediction after Botulinum Toxin Treatment Using Deep Learning

Khan, Adil; Galarraga, Omar; Garcia-Salicetti, Sonia; Vigneron, Vincent

doi:10.3390/s24165343

Open AccessArticle

Phase-Based Gait Prediction after Botulinum Toxin Treatment Using Deep Learning

¹

Informatique, Bio-Informatique et Systèmes Complexes (IBISC) EA 4526, Univ Evry, Université Paris-Saclay, 91020 Evry, France

²

Department of Computer Science, Sukkur IBA University, Sukkur 65200, Sindh, Pakistan

³

UGECAM Ile-de-France, Movement Analysis Laboratory, 77170 Coubert, France

⁴

SAMOVAR, Télécom SudParis, Institut Polytechnique de Paris, 91120 Palaiseau, France

^*

Authors to whom correspondence should be addressed.

Sensors 2024, 24(16), 5343; https://doi.org/10.3390/s24165343

Submission received: 28 June 2024 / Revised: 14 August 2024 / Accepted: 17 August 2024 / Published: 18 August 2024

(This article belongs to the Special Issue Recent Advance and Application of Wearable Inertial Sensors in Motion Analysis)

Download

Browse Figures

Versions Notes

Abstract

Gait disorders in neurological diseases are frequently associated with spasticity. Intramuscular injection of Botulinum Toxin Type A (BTX-A) can be used to treat spasticity. Providing optimal treatment with the highest possible benefit–risk ratio is a crucial consideration. This paper presents a novel approach for predicting knee and ankle kinematics after BTX-A treatment based on pre-treatment kinematics and treatment information. The proposed method is based on a Bidirectional Long Short-Term Memory (Bi-LSTM) deep learning architecture. Our study’s objective is to investigate this approach’s effectiveness in accurately predicting the kinematics of each phase of the gait cycle separately after BTX-A treatment. Two deep learning models are designed to incorporate categorical medical treatment data corresponding to the injected muscles: (1) within the hidden layers of the Bi-LSTM network, (2) through a gating mechanism. Since several muscles can be injected during the same session, the proposed architectures aim to model the interactions between the different treatment combinations. In this study, we conduct a comparative analysis of our prediction results with the current state of the art. The best results are obtained with the incorporation of the gating mechanism. The average prediction root mean squared error is 2.99° (

R^{2}

= 0.85) and 2.21° (

R^{2}

= 0.84) for the knee and the ankle kinematics, respectively. Our findings indicate that our approach outperforms the existing methods, yielding a significantly improved prediction accuracy.

Keywords:

botulinum toxin; clinical gait analysis; deep learning; gait rehabilitation; long short-term memory; multi-task learning

1. Introduction

Musculoskeletal and neurological diseases lead to diminished quality of life [1]. These diseases may be associated with spasticity, a motor disorder that causes excessive tendon jerks due to hyper-excitability of the stretch reflexes [2]. Medical doctors recommend rehabilitation for such deficiencies, in addition to pharmacologic treatment. In particular, spasticity is usually treated with intramuscular Botulinum Toxin Type A (BTX-A) injections that enhance lower and upper limb function [3,4]. It is essential to ensure that the total dose of BTX-A administered to the patient and the doses on each treated muscle do not exceed the recommended maximal doses [5]. BTX-A injections should be separated at least three months apart due to their dangers and reversible effects on muscular function. A good BTX-A outcome can enhance and accelerate the rehabilitation process, but a bad outcome could slow down and/or limit the patient’s recovery. Therefore, optimizing BTX-A treatment by choosing the correct muscles to be treated and the dose distribution is a complex and crucial task that requires careful patient assessment. Anticipating the most likely outcome of a specific BTX-A treatment could help clinicians find the most adapted treatment combination (muscle and doses) for each patient. This could also potentially enhance the patient’s participation in the treatment and rehabilitation process.

Clinical Gait Analysis (CGA) is considered for treatment decisions, along with a medical history and physical examination. Based on a biomechanical interpretation of instrumental measures, CGA examines walking issues and suggests causes [6]. CGA data are clinically reliable if quality standards are met [7]. Scientific research has shown that CGA helps assess and treat neurological diseases such as Cerebral Palsy (CP) [8], post-stroke hemiparesis [4] and Multiple Sclerosis (MS) [9].

Deep Neural Networks (DNN) have excelled in clinical decision-making [10,11]. Deep Learning (DL) has been used in several CGA works to predict gait trajectories, primarily for healthy gaits [12,13,14,15]. Most works on pathological gaits tackle classification [16,17,18,19]. Among them, a few studies predict the gait trajectory some timestamps ahead [20,21], but they do not attempt to predict the post-treatment gait trajectory. Kolaghassi et al. [22] studied the abnormal walking patterns of children with neurological disorders. They used Long Short-Term Memory (LSTM) and a Convolutional Neural Network (CNN) to predict future hip, knee, and ankle trajectories up to 200 ms. Su et al. [20] used LSTM to predict gait trajectories and the five gait phases (loading response, mid-stance, terminal stance, pre-swing, and swing) to help design exoskeletons. Karakish et al. [12] implemented more reliable and simple Artificial Neural Networks (ANNs) to predict the gait trajectories using shank and foot IMU data. Four subjects were used for training, and a fifth was used for testing (200 ms future trajectories). Ding et al. [13] proposed a model for motion prediction of a subject using the motion of complementary limbs. They implemented LSTM with an attention mechanism.

The goal of this research is to enhance the prediction of the post-treatment (with BTX-A) gait signals using DL models to help clinicians with therapeutic decision-making. Our contribution aims to predict the gait trajectory after BTX-A treatment and determine how different treatments might be combined. It is based on an architecture that can handle sparse inputs (treatment data) and construct a more robust model by sharing information between sub-models representing different treatments [21]. In our past study [23], we compared serial and parallel architectures. The best one was made up of parallel Bi-LSTM-shaped sub-models. Each sub-model was paired with a treatment (injected muscle). Those models learn to map gait sequences well before and after treatment [23].

In this work, we split the learning process into stance and swing phases for the parallel architectures in [23]. Indeed, as we work on pathological gaits, the signals show more variance than normal gaits [24]. We note that the duration of the stance phase is often longer for patients than for healthy subjects; for example, in post-stroke hemiparesis [25]. The stance and swing phases obey different biomechanical constraints; therefore, we train separate models on each phase to enhance post-treatment CGA prediction. This strategy increases the global model’s ability to accurately predict the complete post-treatment gait cycle. We show significant improvements in the prediction quality of pathological gait cycles [23].

2. Materials and Methods

2.1. Dataset Description

This study gathered data at the Movement Analysis Laboratory at the Rehabilitation Center of UGECAM Coubert (France) using a four-camera CX1 optoelectronic Codamotion system operating at a frequency of 100 Hz. The participants were adults with varying gait abnormalities: MS, stroke, CP, Spinal Cord Injury (SCI), and Traumatic Brain Injury (TBI). The

N_{p a t}

= 43 patients (26 males and 17 females) received BTX-A injections for spasticity treatment. In this retrospective analysis, the patients participated in clinical activities. The institution’s research ethics committee approved the use of these data. The patients were informed about the research and did not object to the use of their data. The patients had undergone treatment via injections into their lower limbs, with

N_{u n i}

= 19 patients (44.18%) affected unilaterally (10 right limbs and 9 left limbs) and

N_{b i l}

= 24 patients (55.82%) affected bilaterally. Their ages ranged from 21 to 75 years old. The time lag between pre- and post-treatment CGA was between 3 and 6 weeks. A total of nineteen muscles were injected, but we selected four frequently injected muscles: soleus, gastrocnemius (medial and lateral), semitendinosus, and rectus femoris. A fifth category, “other muscles”, grouped all the other treated muscles (see Table 1). There were 28 combinations of BTX-A injections into these four muscles. A treatment binary code vector was attributed to each lower limb:

s^{j} = {(s_{1}^{j}, \dots, s_{c}^{j})}^{T}, s_{i}^{j} \in {0, 1}, i = 1 \dots c (c = 5 as shown in Table 1)

where

s_{i}^{j} = 1

if muscle i was injected in limb j, 0 otherwise, and

d^{j} = {(d_{1}^{j}, \dots, d_{5}^{j})}^{T}, d_{i}^{j} \in {0, 1}

is a binary vector for the disease of the patient’s limb j. This study included five diseases: CP, MS, TBI, SCI, and stroke. ^T is the transpose operator.

2.2. Dataset Preparation

The participants were recorded walking in a straight line, with or without technical aids (i.e., cane, rollator, tripod, etc.), through a 10 m-long laboratory room. The patients wore anatomical markers, whose coordinates were tracked in 3D using four sensors. The patients walked back and forth throughout the gait hallway (trials). Depending on the patient’s capability, multiple trials of each patient were recorded at 100 Hz. The 3D gait kinematics were computed following the recommendations of the International Society of Biomechanics [26] based on the marker data. Each trial was divided into cycles and then segmented into the stance phase, from initial contact to toe-off, and the swing phase, from toe-off to subsequent initial contact (see Figure 1). Gait events (initial contacts and toe-offs) were detected from force platform data and automatically extracted by the HPA algorithm [27]. A human expert validated and modified all the gait events (when needed). The process of extracting cycles from trials to normalized phases is shown in Figure 2.

We considered a person’s right and left cycles as different samples. Each pre-treatment cycle phase was associated with a target post-treatment cycle’s corresponding phase of the same patient, leading to n = 2518 samples in total. Note that the number of cycles per patient varied from one patient to another.

Patient data were recorded for the following five joints in 3D: pelvis, hip, knee, ankle, and foot. This study only considered the knee and ankle kinematics in the sagittal plane.

The kinematic data of each phase were resampled and normalized to 51 points following standard procedures in CGA [28]. Therefore, DL models could be trained on sequences of the same length. For any patient’s limb j, the input vector is an angular time series

x^{j} = {(x_{1}^{j}, \dots, x_{m}^{j})}^{T} \in {[- 180, + 180]}^{m}

, and the target vector is

y^{j} = {(y_{1}^{j}, \dots, y_{m}^{j})}^{T}

, with

m = 51 \times 2 = 102

. Let

D = {x^{j}, y^{j}, d^{j}, s^{j}}_{j = 1}^{n}

be the input–target training set.

The data were centered and reduced by the standard deviation for normalization purposes. The goal was to use

g (x)

to make a model that maps

\hat{Y} = g (x)

, where

\hat{Y}

is an estimation of

y

.

2.3. Description of Models

In our previous study [23], seven models were developed for prediction. Four were Bi-LSTM-based parallel models, and the others were serial models. The parallel models achieved better prediction results in 38 patients. LSTM [29] is good at predicting time series and can retain information for a long period of time. It has a hidden state

h_{t}

and a cell state

c_{t}

of the same size as the input series

x_{t}

. The cell state

c_{t}

is the model’s memory. The hidden state

h_{t}

is the model’s prediction of

x_{t}

.

We use the best models from our previous study [23], which are based on a Bi-LSTM. Bi-LSTM combines two LSTM models trained simultaneously: one on the forward input series and the other on the reverse input series, starting with the last input and moving on to the next-to-last, and so forth.

In both models (see Figure 3), we use five parallel layers of Bi-LSTM. Each layer is responsible for a treatment, as reported in Table 1. The five treatments correspond to the five categories of injected muscles. Each Bi-LSTM layer had 51 units. Note that each unit receives a pair of inputs for the knee and ankle, respectively.

We train separate models for the stance and swing phases. For comparison with our previous work [23], we combine both predictions using the proportion of stance and swing phases of the pre-treatment gait cycle. We retrieve the prediction for a complete cycle of the knee and ankle joints.

MTD-driven model (MTD-DM): The input vectors

x

and

s

are sent to the five Bi-LSTM sub-models in this architecture. The pre-treatment knee and ankle kinematic signals are represented by vector

x

. The Medical Treatment Data (MTD) are represented by vector

s

. We initialize the cell states of the LSTM as 0. The MTD (vector

s

) is handled by the treatment supervisor and was used to set the values of the hidden states h. For example, if a patient had injections in muscles 1 and 3 (Table 1), then the states

h_{1, t}

,

h_{2, t}

in Bi-LSTM, layers 1 and 3 are set to 1, and the other layers’ (2, 4, and 5) hidden states are set to 0. The results of the five Bi-LSTM sub-models are concatenated to form a single tensor ‘output’, a one-dimensional vector, and reshaped. The reshaped output is then processed through two subsequent fully connected layers (FC1 and FC2) to predict the post-treatment kinematics (see Figure 3a).

MTD-gated model (MTD-GM): This architecture uses a gating mechanism to handle MTD. Instead of passing the MTD as a hidden state of each sub-model, the treatment supervisor is exploited downstream, multiplying each sub-model’s output by the associated binary value of the MTD. Figure 3b shows that if there is a treatment, it will be used in the model, but if there is none, it will be neglected (multiplied by 0). The results of the remaining Bi-LSTM sub-models are concatenated and reshaped. The reshaped output is then processed through two subsequent fully connected layers (FC1 and FC2) to predict the post-treatment kinematics (see Figure 3b). Leave-one-out cross-validation was used to assess model performance. For each iteration,

N_{t r a i n}

=

N_{p a t} - 1

patients were used to train the model, and one was used to test it. During the training process, mini-batches with 16 samples were used. DL models were trained based on the mean square error (MSE) loss function, controlled by the Adam optimizer [30]. RMSE [31], Standard Error (SE) [32], and the coefficient of determination (

R^{2}

) [33] were used to evaluate the performance of the proposed models.

3. Results

A total of 43 participants were included in this study, whereas 38 were included in our previous work [23], and five new ones were added. We evaluated the two MTL models on the new dataset using the above-mentioned evaluation metrics.

3.1. Analysis of Stance Phase and Swing Phase

Table 2, Table 3 and Table 4 report the performance of both models in predicting the post-treatment gait kinematics of the stance phase, swing phase, and complete gait cycle concerning the disease. The bold entries in the following tables represent the best predictions (lowest RMSE and highest

R^{2}

scores). We notice that the performances of both models are equivalent in the stance phase for the knee and ankle joints (average RMSE between 1.79° and 3.17°). On the contrary, for the swing phase, MTD-GM is much better than MTD-DM, decreasing the RMSE from 3.08° and 7.01° (knee) for MTD-DM to 2.43° and 3.89° (ankle) for MTD-GM. With the MTD-GM, in the swing phase, the

R^{2}

score is better for the knee than for the ankle. It is the opposite of the ankle for the stance phase.

3.2. Analysis of Complete Cycle

As reported in Table 4, we combine the stance and swing phase predictions to obtain the prediction for each complete cycle. We note that the MTD-GM outperforms MTD-DM, showing much better

R^{2}

scores for the knee (higher than 0.8) and ankle (higher than 0.7). Table 5 reports the overall prediction results on both joints for MTD-DM and MTD-GM. Our results improve the

R^{2}

scores, which become higher than 0.9 in all cases. We also note that the

R^{2}

score is increased for all groups of diseases, because combining both series changes the variance of the whole series.

Finally, the detailed results of MTD-GM for each patient per phase and joint and on both joints together are available in the Appendix A (show Figure A1). An average RMSE predicted by MTD-GM (all patients) for the stance phase of the knee, stance phase of the ankle, swing phase of the knee, swing phase of the ankle, complete knee, and complete ankle are 2.61°, 1.98°, 3.84°, 2.43°, 2.99°, and 2.21°, respectively.

Table 2. Performance of both models in predicting the post-treatment gait trajectories of the stance phase with respect to different diseases. Bold entries denote the best predictions: lowest average RMSE and maximum

R^{2}