1. Introduction
In the past few decades, lower limb exoskeleton robots have been continuously researched. They have shown great application potential in numerous military, medical and industrial fields [
1]. According to different expected functions, lower limb exoskeleton robots can be divided into power-assisted and rehabilitation types. Power-assisted lower limb exoskeletons are mainly used to enhance the speed, strength and endurance of normal humans, such as BLEEX [
2], HULC [
3] and HAL [
4]. Rehabilitation lower limb exoskeletons are mainly used to assist disabled people to walk and help patients perform rehabilitation training, such as Indego [
5], ReWalk [
6] and ALEX [
7]. With the continuous expansion to different application scenarios, lower limb exoskeleton robots need to adapt to more and more complex environments and the active movement of the wearer. The lower limb exoskeleton is a highly coupled human-machine system, and its walking gait is highly consistent with human gait. Human joints and exoskeleton joints have similar movements under the same gait phase. Gait phase recognition is of great significance for the human-machine coordination control of lower limb exoskeleton robots. First of all, gait phase is widely used in the lower limb exoskeleton gait trajectory generation, gait control and gait evaluation tasks, which can be used as a starting point for better human-machine coordination control. Secondly, accurate recognition of gait phase is the key to achieve efficient metabolic control of the lower limb exoskeleton. Inaccurate gait phase information often increases the user’s effort or causes the motor to impose inappropriate joint torque. Moreover, in some lower limb exoskeleton devices based on phase control, gait phase often determines the control mode required by the control system at present, which is essential information for realizing the control of lower limb exoskeleton robot. For example, Kazerooni et al. [
2] designed the lower limb exoskeleton BLEEX and divided the human walking cycle into three gait phases: single support, double support and double support with one redundancy, and built corresponding motion models and control algorithms for different gait phases, thus realizing the control of lower limb exoskeleton.
In order to recognize the characteristics of human gait phase, scholars at home and abroad have carried out a significant amount of research, mainly on machine vision recognition and sensor recognition. The requirements of recognition based on machine vision are much higher than those of sensor recognition, and the process is complex, so it is limited to experimental scenes. The sensor-based recognition not only has a low cost, small size, simple and convenient installation, but is also not easily affected by the external environment. Therefore, most human motion recognition based on lower limb exoskeleton uses sensors. At present, the mainstream sensors include the plantar pressure sensor, inertial measurement unit (IMU), surface electromyogram (SEMG) and electroencephalogram (EEG). Lim et al. [
8] developed an algorithm to detect gait phase with a small number of sensors by using center of pressure (COP). Liu et al. [
9] used inertial sensors to collect real-time motion data, calculated the group correlation coefficient between motion data and template data, and used the hidden Markov model (HMM) to identify the final motion state. Nguyen et al. [
10] adopted distributed plantar pressure sensors to obtain human motion information, and used K-nearest neighbor (KNN) classification method to realize recognition of five motion patterns, including flat walking, upstairs, downstairs, uphill and downhill. Hsu et al. [
11] used a non-parametric weighted feature extraction (NWFE) algorithm and principal component analysis (PCA) method to reduce the feature size of acceleration signals, and used a least square support vector machine (LS- BiLSTM) motion pattern based on one-to-one strategy for classification and recognition. Wu et al. [
12] designed a continuous gait phase estimator based on an adaptive oscillator network, including a gait task classifier, a gait target reset, a peak detector, and a model-based transitional gait phase estimator to improve the performance of the gait target network. Qian et al. [
13] proposed a gait phase estimation method based on an adaptive oscillator, which can accurately estimate the gait phase of users when they move on various terrain. Wei et al. [
14] used SEMG and EEG to compare the performance of linear discriminant analysis (LDA), KNN, and a kernel support vector machine (KSVM). Qin et al. [
15] proposed a human gait phase recognition algorithm based on fuzzy theory to identify the gait phase at the next moment of human lower limb movement.
Based on a large amount of data, machine learning methods are not affected by biomechanical models and cost functions [
16], and gradually become a feasible method for gait phase classification. Zeng et al. [
17] proposed a back propagation (BP) neural network algorithm model based on support vector machine (SVM), which improved the average recognition accuracy by 9.3552% compared with the SVM algorithm. Lee et al. [
18] estimated the knee angle as well as the angles of the talocrural and subtalar joints of the ankle by applying neural networks to two IMUs attached to the thigh and shank. Qiu et al. [
19] used intrinsic mode functions (IMFs) extracted from the original gait signals by ensemble empirical mode decomposition (EEMD) as the input of the classification algorithm to give 14 experimental results of normal human gait phase recognition. Lu et al. [
20] used a machine learning classifier to select feature subsets of EMG, IMU and foot switch signals from a series of time-domain features of window parameters, and realized the classification of different stages of human jumping. This team [
21] also used SEMG as an input to improve the recognition accuracy of the human lower limb jumping stage by bidirectional long short-term memory (BiLSTM) and convolution long short-term memory (ConvLSTM). Huang et al. [
22] proposed an online gait detection method based on distance and multi-sensor information fusion to solve the problem of overfitting in the case of limited data. Yunas et al. [
23] used the deep learning model to automatically extract the ground reaction force characteristics and the body movements attached to three different positions of the lower body, making up for the decline in the spatiotemporal accuracy of the individual model. Wang et al. [
24] proposed a deep learning-based approach to map multi-channel SEMG signals to human lower limb movements involving four different gait phases and three flexion and extension joint angles. Kang et al. [
25] utilized a gait phase estimator based on convolutional neural network (CNN) that can adapt to different locomotion mode settings to modulate the exoskeleton assistance. Wu et al. [
26] also proposed a graph convolution network model (GCNM) for the gait phase classification of lower extremity exoskeletons, which can solve the problem of gait phase classification in non-Euclidean domains based on exoskeleton diagram mechanism.
Many existing studies based on deep learning place too much faith in the processing power of models, and the feature extraction of limited data is not comprehensive enough. The above research [
25] uses the CNN-based model to inherently extract the feature information in the sensor data in the neural network architecture. However, lower limb walking is a continuous process, and its characteristics in the time series are also important. In the above study [
21], BiLSTM and ConvLSTM were used to identify the jump stages, and SEMG and IMU data were used as nodes. However, this method only considers the adjacency relation between nodes, ignoring the physical significance of nodes themselves. These studies can obtain a better recognition rate under normal conditions, but may reduce the accuracy of gait phase recognition under complex conditions. In order to make the exoskeleton adapt continuously to varying walking speeds, we designed the gait acquisition system and established the movement pattern data set. We recruited seven healthy subjects and collected lower limb movement data and plantar pressure data at continuously varying walking speed. In this paper, based on the analysis of the motion pattern of the human body under the continuously varying walking speed, a gait phase recognition method based on the CNN-BiLSTM network model is proposed to classify the seven gait phases of both feet. Recently, the CNN-BiLSTM model has shown excellent performance in tasks such as text classification. This model combines the CNN model and BiLSTM model, which can not only extract local features in data, but also extract forward and backward temporal features of data. We compared CNN-BiLSTM network with long short-term memory (LSTM) network and gated recurrent unit (GRU) network. The experimental results show that the average accuracy of the CNN-BiLSTM network in seven subjects is 0.417% higher than that of the LSTM network and 0.596% higher than that of the GRU network. Therefore, the proposed network can better distinguish the gait phase of human lower limbs under the continuous change of walking speed.
3. Results
Gait phases of subjects were classified by performing three experiments using machine learning methods: CNN-BiLSTM, LSTM and GRU. During the walking process, the gait acquisition system recorded the movement data of thigh and calf and the pressure data of plantar in real time. The input of the above model is shown in
Table 3. Seven subjects were asked to wear the gait acquisition system and walk continuously at 4 km/h for 4 min, 6 km/h for 3 min and 2 km/h for 3 min. After processing, the data refresh frequency of each sensor is 100 Hz. Theoretically, the total number of gait data samples in each group is 60,000, but there will be a small deviation in the actual number of samples collected.
To evaluate the accuracy of gait stage classification, we defined a criterion, namely classification success rate (CSR), as shown in Equation (12).
represents the number of correct classifications and
represents the total number of test data points. We use confusion matrices to illustrate classification performance and quantify error distributions. The definition is shown in Equation (13). Each element is defined as shown in Equation (14).
where
is the number of test data points in phase pattern
classified as pattern
;
is the total number of test data points in phase pattern
. It is clear that the diagonal elements of the confusion matrix represent the classification success rate and the off-diagonal elements represent the error rate.
In order to verify the robustness and superiority of the CNN-BiLSTM used, the subject samples were randomly shuffled and divided into training-test sets. The training-test set percentages were 80% and 20% of the gait cycle samples of the subjects, respectively.
In the experiment, the model optimization algorithm of CNN-BiLSTM network was set as Adam, the learning rate was 0.005, and each batch had 32 samples. ReLU was used as the activation function in the model construction process, and the number of training rounds is 100. The number of BiLSTM units was set to 128 and the dropout ratio was set to 0.25. Based on the data of the gait acquisition system, the 30-dimensional features are extracted from the gait data. Each part of CNN and BiLSTM consists of two layers of neural networks. Because the output of the previous layer is the input of the next layer, Connection is used as the connection layer to avoid the repeated representation of the input and output layers. Based on the CNN-BiLSTM network, the average accuracy of the gait phase classification of seven subjects was 92.989%. Among them, the gait phase classification accuracy of the subject with the highest accuracy can reach 95.09%, and the confusion matrix is shown in
Figure 8. The prediction accuracy of LH, LS, HL, HS, SL, SH and SS phases were 69%, 96%, 82%, 97%, 94%, 98% and 81%, respectively.
In order to verify the effectiveness of the proposed network, we compared its experimental results with the LSTM network and GRU network. The LSTM network is a special RNN network, mainly to solve the problem of gradient disappearance and gradient explosion in the process of long sequence training. The two-layer LSTM network with a softmax layer was selected for gait phase classification. The hidden layers include 128 memory blocks for each LSTM network to process the input gait phase feature matrix. Then, the corresponding gait phase is obtained by applying softmax layer. Based on LSTM network, the average accuracy of gait phase classification of seven subjects was 92.571%. Among them, the gait phase classification accuracy of the subject with the highest accuracy can reach 94.13%, and the confusion matrix is shown in
Figure 9. The prediction accuracies of LH, LS, HL, HS, SL, SH and SS phase were 68%, 94%, 76%, 96%, 94%, 98% and 23%, respectively.
GRU network has fewer parameters than LSTM network, so it is easy to converge. The two-layer GRU network with a softmax layer was selected for gait phase classification. To facilitate comparison, the hidden layers also include 128 memory blocks for each GRU network to process the input gait phase feature matrix. Then, the corresponding gait phase is obtained by applying the softmax layer. Based on the GRU network, the average accuracy of gait phase classification of seven subjects was 92.393%. Among them, the gait phase classification accuracy of the subject with the highest accuracy can reach 94.31%, and the confusion matrix is shown in
Figure 10. The prediction accuracies of LH, LS, HL, HS, SL, SH and SS phases were 70%, 95%, 82%, 96%, 92%, 98% and 17%, respectively.
4. Discussion
The main finding of our study is that the proposed CNN-BiLSTM network can reliably classify the seven phases of bipedal gait. In seven subjects, the average classification accuracy of each phase can reach 95.09%. We compared it with the classification results of the LSTM network and the GRU network. In the experimental results of subject No. 1, the average accuracy of the CNN-BiLSTM network was 0.16% higher than that of the LSTM network and 0.3% higher than that of the GRU network. In the experimental results of subject No. 2, the average accuracy of the CNN-BiLSTM network was 0.76% higher than that of the LSTM network and 0.58% higher than that of the GRU network. In the experimental results of subject No. 3, the average accuracy of the CNN-BiLSTM network was 0.26% higher than that of the LSTM network and 0.74% higher than that of the GRU network. In the experimental results of subject No. 4, the average accuracy of the CNN-BiLSTM network was 0.13% higher than that of the LSTM network and 0.44% higher than that of the GRU network. In the experimental results of subject No. 5, the average accuracy of the CNN-BiLSTM network was 0.96% higher than that of the LSTM network and 0.78% higher than that of the GRU network. In the experimental results of subject No. 6, the average accuracy of the CNN-BiLSTM network was 0.28% higher than that of the LSTM network and 0.33% higher than that of the GRU network. In the experimental results of subject No. 7, the average accuracy of the CNN-BiLSTM network was 0.37% higher than that of the LSTM network and 1% higher than that of the GRU network. The average classification accuracy of each phase of all subjects is shown in
Table 4. As can be seen from
Table 4, under the same experimental environment, compared with the LSTM network and GRU network, the proposed CNN-BiLSTM network model has significantly higher prediction accuracy and better robustness for gait phase classification of lower limb exoskeleton system.
Under different network models, the recognition accuracy of each gait phase of subjects with the highest accuracy is shown in
Table 5. As can be seen from
Table 5, the recognition accuracy of LH, HL and SS is low, which may be related to the short time of these three phases in the gait cycle and the small amount of data. At the same time, due to the small amount of data, these three phases have little influence on the average recognition accuracy. Taking subject No. 5 as an example, the sample sizes of each gait phase are shown in
Figure 11. To verify this idea, data of subject No. 5 were uniformly sampled to reduce the frequency from 100 Hz to 50 Hz. Its classification accuracy under the CNN-BiLSTM network is shown in
Figure 12. It can be seen from
Figure 12 that the recognition accuracy of all gait phases decreases after the data frequency decreases. LS, SL; HS and SH are two states with symmetrical left and right feet, respectively, so the recognition accuracy is similar. These phases have a large amount of data. In the results of the CNN-BiLSTM network, the recognition accuracy is no less than 94%, and the highest is 98%. In most gait phases, the recognition accuracy of the CNN-BiLSTM network is higher than that of the LSTM network and GRU network.
There are some limitations to the current study. The experiment was limited to seven healthy young subjects and was limited to walking on a treadmill. In the future, experiments should include more participants, such as the elderly and people with walking disabilities, to verify the applicability of this approach in a wider context. Walking in different conditions, such as uphill or downhill or on uneven surfaces can also test the robustness of the method.
5. Conclusions
This paper presents a gait phase classification method for lower limb exoskeleton control. We designed a gait acquisition system and collected IMU data of legs and plantar pressure data for model training. We used the passive lower limb exoskeleton robot as the experimental platform, and labeled each group of data through the images taken by the camera. Finally, this paper proposes a network model based on CNN-BiLSTM. It combines the CNN model with the BiLSTM model to classify and recognize the multi-type gait data sets collected in this study. Firstly, CNN is used to obtain the key local feature segments in the global gait data, then the BiLSTM network is used to obtain the context information of gait features from the key feature segments, and softmax layer is used as the output layer to determine the gait phase classification at the current moment. The experimental results show that:
(1) The gait acquisition system designed in this paper can effectively complete the collection of four IMU signals and plantar pressure signal of both feet. Through the manual analysis of the motion image and the position of the optical marker, the seven gait phases of both feet can be accurately distinguished.
(2) The CNN-BiLSTM classification method proposed in this paper can extract features from gait data more adequately than a single network. Compared with the LSTM network and GRU network, the average accuracy of the proposed CNN-BiLSTM network model is increased by 0.417% and 0.596%, respectively. For each subject, the recognition accuracy of the proposed model was higher than that of the other two models. Therefore, the proposed CNN-BiLSTM network model has higher prediction accuracy and better generalization performance. It can adapt to the complex walking gait of exoskeleton wearers. In our future work, we will focus on assisting the wearer through a powered exoskeleton using the proposed gait stage classification method.