Article

Real-Time sEMG Pattern Recognition of Multiple-Mode Movements for Artificial Limbs Based on CNN-RNN Algorithm

1 Institute of Rehabilitation Engineering and Technology, University of Shanghai for Science and Technology, Shanghai 200093, China
2 Shanghai Engineering Research Center of Assistive Devices, Shanghai 200093, China
3 Key Laboratory of Neural-Functional Information and Rehabilitation Engineering of the Ministry of Civil Affairs, Shanghai 200093, China
* Author to whom correspondence should be addressed.
Electronics 2023, 12(11), 2444; https://doi.org/10.3390/electronics12112444
Submission received: 24 February 2023 / Revised: 18 May 2023 / Accepted: 22 May 2023 / Published: 28 May 2023
(This article belongs to the Special Issue Advanced Wearable/Flexible Devices and Systems in Bioelectronics)

Abstract: Currently, sEMG-based pattern recognition is a crucial and promising control method for prosthetic limbs. A 1D convolutional recurrent neural network (1D-CNN-RNN) classification model for recognizing online finger and wrist movements in real time was proposed to address the issue that a high classification rate and a short time delay are difficult to achieve simultaneously. This model effectively combines the advantages of the convolutional neural network and the recurrent neural network. Offline experiments were used to verify the recognition performance for 20 movements, and a comparative analysis was conducted with CNN and LSTM classification models. Online experiments via the self-developed sEMG pattern recognition system were conducted to examine real-time recognition performance and time delay. The results demonstrated that the average recognition accuracy of the 1D-CNN-RNN model reached 98.96% in offline recognition, significantly higher than that of the CNN and LSTM (85.43% and 96.88%, respectively, p < 0.01). In the online experiments, the average real-time recognition accuracy of the 1D-CNN-RNN reached 91% ± 5%, and the average delay was 153 ms. The proposed 1D-CNN-RNN classification model thus offers higher real-time recognition accuracy and a shorter time delay, with no obvious sense of delay for the human body, and is expected to provide efficient control for dexterous prostheses.

1. Introduction

A myoelectric prosthesis is a type of bionic prosthesis that uses surface electromyography (sEMG) signals from the body to direct mechanical components to make appropriate movements [1]. Control using sEMG as the signal source best matches people’s perception of the prosthesis because sEMG signals reflect neural activity and contain information about the muscle activities related to limb movements. Most commercial upper-limb prostheses with two or more degrees of freedom are controlled by a “mode switch” [2]. As the number of degrees of freedom increases, this method significantly raises the complexity of prosthesis control and the switching burden on amputees while lowering the directness and real-time responsiveness of prosthesis control.
A prosthetic control method based on sEMG pattern recognition has been proposed to achieve quick and intuitive prosthetic control, enabled by the further development of artificial intelligence algorithms and high-performance microprocessors. Pattern recognition, a subset of artificial intelligence, assigns classifications to data based on their features or patterns [3]. Because pattern recognition makes extensive use of bioelectrical signals, which encode information such as the duration and intensity of muscle contractions, multi-degree-of-freedom prosthesis motion control becomes possible. Achieving such control not only improves the quality of life of persons with physical disabilities but also encourages their reintegration into society. The stability and viability of the pattern recognition algorithm, as the core of the EMG prosthesis control method, significantly affect how well the prosthesis works.
Existing sEMG-based pattern recognition methods fall into two categories: algorithms based on traditional classifiers and algorithms based on deep learning. “Traditional classifiers” here means classical machine-learning classification methods, as distinguished from deep-learning-based methods. sEMG control based on pattern recognition was initially proposed by Finley’s group in 1967 [4]. They claimed that sEMG signals could be mapped to specific motion modes using a classifier created in advance, resulting in the generation of prosthetic motion control commands. In 2002, Hudgins et al. [5] developed an intelligent control strategy for regulating multifunctional prostheses based on sEMG signals and demonstrated that sEMG signals exhibit deterministic structure at the beginning of a muscle contraction and can be used to differentiate between different forms of limb movement. The proposed method attracted the attention of a large number of researchers and is considered a promising intelligent prosthetic control strategy. By applying a sliding window to the sEMG signal in 2003, Englehart et al. [6] divided the continuous signal into independent time windows and applied a linear discriminant analysis (LDA) classifier to the subsequent motions. Since then, motion pattern recognition based on the sEMG signal, which guarantees continual action directives from the controller, has been widely used in the rehabilitation of patients with upper-limb dysfunction due to amputation. To categorize 12 different forearm movements in five healthy participants, Jianwei Liu et al. [7] employed an autoregressive power spectrum (ARPS) as the feature set and LDA as the classifier, with an average error rate of 5.00%, superior to other feature sets. Classifiers other than LDA have also been studied extensively as action classification techniques.
Support vector machines (SVMs) were used by Futamat et al. to categorize forearm movement patterns [8], attaining an accuracy rate of 95.59%; they gathered sEMG signals for 10 different forearm movements from four forearm muscles of each patient. On 15 different finger movements performed by 15 people, Purushothaman and Vikas [9] examined the classification performance of SVM, LDA, and naive Bayes (NB) classifiers. Geethanjali [10] compared the classification performance of an SVM, LDA, and an artificial neural network on six different hand movements, with the mean absolute value (MAV), waveform length (WL), zero crossing (ZC), slope sign change (SSC), and fourth-order autoregression (AR) coefficients extracted as feature sets; the linear SVM obtained the best classification results, reaching 92.8% accuracy. Amirabdollahian et al. examined the changes in classification accuracy under various combinations of SVM kernels and electrodes while collecting the sEMG signals of 26 people wearing Myo armbands during a series of gestures, finding that the linear kernel has an advantage over the polynomial and radial basis function kernels and that the combination of a linear kernel and eight-channel electrodes achieves the best accuracy of 94.9% [11]. Caesarendra et al. used ANFIS-based learning to classify the reduced features of five finger gestures [12]. Additionally, several studies applied unsupervised fuzzy clustering, K-nearest neighbors (KNN), and other pattern recognition techniques to achieve multi-motion pattern identification [13,14,15].
Manually extracted features are crucial to the classification performance of sEMG pattern recognition based on standard machine learning. Traditional machine learning techniques cannot efficiently train on and categorize abstract, noisy, high-dimensional data, and achieving high classification accuracy on unprocessed raw sEMG signals is a great challenge [16]. With the advancement of artificial intelligence technologies, deep learning has made significant progress in image classification and motion pattern recognition and possesses powerful representation learning abilities. End-to-end sEMG pattern recognition can be accomplished without the laborious feature extraction stage of classical machine learning owing to deep learning’s capacity to autonomously learn features at various degrees of abstraction from input samples.
The convolutional neural network (CNN) and recurrent neural network (RNN), two currently popular deep learning models, have been employed extensively in sEMG pattern recognition. Network models based on the CNN architecture were adopted by several teams [17,18,19,20,21] in an effort to boost the CNN’s classification performance. These include the creation of an instantaneous EMG image from the original sEMG signal [18], the use of the delayed sEMG spectrum as an input [19], multi-stream decomposition and fusion stages to train the CNN model [20], a feature extraction method based on the CNN (CNNFeat) [21], and so on. An RNN uses dynamic internal state changes to interpret temporal information [22]. Classification models based on the RNN have been proposed by several teams [23,24,25,26,27], and the research findings demonstrated that the RNN model performs consistently with respect to delay. Additionally, the LSTM, as an upgraded RNN model, can address the issues of gradient vanishing and long-term dependency [26]. A CNN-RNN composite neural network structure was presented by other teams [28,29,30,31,32] that can concurrently capture spatial and temporal information from sEMG grayscale images. The results show that this deep design can improve the classification’s accuracy and robustness.
To sum up, most of the current research is still focused on the accuracy of pattern recognition in complex situations. Up to now, only a few teams [33,34,35] have conducted experiments to verify the real-time performance of a gesture recognition model, and the delay is too long to meet the needs of practical use. Although accuracy is the main research direction of pattern recognition algorithms, real-time performance is also one of the important factors in whether pattern recognition algorithms can be applied to upper-limb prostheses. The CNN combines feature extraction and classification and possesses the ability to achieve the optimal features and classifier parameters based on the original data. A 1D-CNN can effectively train the limited one-dimensional data in the data set and has better real-time performance, while the RNN has reliable performance in dealing with timing problems. Both of them could improve the real-time performance of the model on the basis of maintaining considerable accuracy. Therefore, we combined the advantages of a 1D-CNN and an RNN to propose a one-dimensional convolutional recurrent network classification model (1D-CNN-RNN) suitable for motion pattern recognition based on sEMG signals. Since there is no consistent conclusion on whether the accuracy of offline action recognition can directly reflect the real-time performance of a pattern recognition system, we also designed a real-time experiment to verify the performance of the model.

2. Materials and Methods

2.1. Signal Acquisition

The multi-channel sEMG signals were collected using the commercial wearable gForcePro+ EMG armband (OYMotion Technologies Co., Ltd., Shanghai, China), which consists of eight dry electrodes with a sampling frequency of 1000 Hz and connects wirelessly to a recording computer via Bluetooth. The armband was worn on the forearm, 2~3 cm from the elbow crease and the distal end of the olecranon process of the ulna, covering the extensor carpi radialis, extensor digitorum, extensor carpi ulnaris, and flexor digitorum superficialis of the subject’s forearm, as shown in Figure 1a,b. During the experiments, participants moved their upper limbs as directed by the guiding video while maintaining a modest level of muscular contraction. When the guiding video indicated a return to the normal condition, the subjects relaxed their upper limbs into the rest position. Figure 1c,d depicts the 20 movement patterns designed for this study, which include typical fundamental finger and wrist movements.
Figure 2 reports an example of the raw 8-channel sEMG signal. The signal was recorded while the experimenter was doing the test motion (fisting).

2.1.1. Offline Experiment

This study recruited 23 healthy subjects (15 men and 8 women; 21 right-handed and 2 left-handed) with an average age of 22.78 ± 1.70 years. Exclusion criteria were any neurological pathologies or musculoskeletal complaints that would interfere with the study outcomes. Each subject provided written informed consent before the experiment. Each participant conducted the offline experiments using their right hand. Each movement was held for three seconds and repeated ten times, with 3 s of rest between repetitions and a 5 min break between different movements.

2.1.2. Online Experiment

The online sEMG signal acquisition and assessment experiments were implemented through our self-developed sEMG-specific movement recognition system, which is capable of performing sEMG signal acquisition, display and processing, and offline and online multi-movement performance recognition. Ten other healthy subjects, with an average age of 24.4 ± 1.51 years old, including 6 males and 4 females, 6 right-handed and 4 left-handed, were recruited to participate in this experiment to demonstrate the practicality of online recognition. Before the experiment, all participants were informed and signed the consent form.
The subjects conducted two experiments. The offline and online pattern recognition protocols and the schematic diagram of the overall experiments are displayed in Figure 3. The 1D convolutional recurrent neural network classification model for recognizing online finger and wrist movements in real time, described in Section 2.2, effectively combines the advantages of a convolutional neural network and a recurrent neural network. The 1D-CNN-RNN, as the initial model, was trained offline before being applied to the online recognition experiments. To optimize classification, the models used in the online experiment were all based on the previous-generation models, and the data from the offline experiment were used for further iterative training.
During this experiment, subjects performed the corresponding exercises according to the guidance video displayed by the system. Each exercise lasted 3 s and was repeated 10 times, with the next repetition beginning after 3 s of relaxation and a 5 min break between different exercises. Meanwhile, the experiment operator recorded the corresponding results displayed by the system. At the end of the experiment, the ratio of correct results to total results and the average delay of real-time pattern recognition were computed.

2.2. Data Processing

2.2.1. Preprocessing

Data preprocessing mainly includes filtering and noise reduction, standardization, and active segment extraction. The motion artifact and electrical interference brought by cables were removed using a third-order 10 Hz Butterworth high-pass filter in accordance with the spectrum energy distribution of the sEMG signal.
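As a concrete illustration, the high-pass stage described above can be sketched with SciPy. This is a sketch, not the authors' code: the zero-phase `filtfilt` call is an assumption, and the 1000 Hz sampling rate is taken from Section 2.1.

```python
import numpy as np
from scipy.signal import butter, filtfilt

def highpass_semg(x, fs=1000, fc=10, order=3):
    """Third-order Butterworth high-pass at fc Hz, applied zero-phase.

    x: array of samples (1D) or samples x channels (2D), sampled at fs Hz.
    """
    b, a = butter(order, fc / (fs / 2), btype="high")
    # filtfilt runs the filter forward and backward, cancelling phase lag
    return filtfilt(b, a, x, axis=0)
```

Applied to an 8-channel recording of shape (samples, 8), this removes the DC offset and slow motion artifacts while leaving the sEMG band above 10 Hz essentially untouched.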
Due to factors such as differing anatomical tissue and physiological conditions among subjects, the multi-channel sEMG signals showed obvious inter-subject differences. Therefore, standardization was adopted to transform the sEMG signals from different subjects and reduce the impact of individual variation on pattern identification and classification. The method used in this article is Z-score standardization, a frequently used technique with the transformation formula z = (x − μ)/σ, where μ is the mean and σ is the standard deviation; it converts data of different orders of magnitude into unitless values.
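The Z-score step can be stated directly as a minimal sketch; computing μ and σ per channel (column) is an assumption about the axis convention:

```python
import numpy as np

def zscore(x):
    """Z-score standardization: (x - mu) / sigma, computed per channel.

    x: samples x channels array; returns a unitless array with zero mean
    and unit standard deviation in each channel.
    """
    mu = x.mean(axis=0)
    sigma = x.std(axis=0)
    return (x - mu) / sigma
```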
The active segment is often extracted by a threshold recognition approach, which records any instantaneous values in the smoothed signal that are higher than a threshold. Nevertheless, using a single fixed threshold as the detection criterion yields significant errors for many sEMG signals. To identify the active portion of the EMG signal, this study therefore used an adaptive double-threshold approach, calculated as follows:
(1) The single-channel EMG data s_k^(i) were differentially processed, and the instantaneous average energy sequence E was derived from the mean-square energy of the eight-channel EMG data (N = 8):

E(k) = (1/N) Σ_{i=1}^{N} [s_{k+1}^(i) − s_k^(i)]²

(2) The sliding-window mean energy S of sequence E, with a window length L of 64 ms, was determined:

S(j) = (1/L) Σ_{k=j}^{j+L−1} E(k)

(3) The double thresholds Th1 and Th2 were selected adaptively according to the median and variance of S:

c = S − Median(S)

Th1 = S, Th2 unchanged, if 0 < c < Var(S)
Th2 = Median(S), Th1 unchanged, if c < 0
(4) Any data segment satisfying Th2 < S < Th1 was recorded as an active segment, and its beginning and ending locations were established.
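The adaptive update rule for Th1 and Th2 is only loosely specified above, so the sketch below substitutes a simplified single threshold derived from the median and standard deviation of the windowed energy S. The differencing, eight-channel mean-square energy, and 64 ms window follow steps (1)–(2); the threshold form and the `k` factor are assumptions for illustration.

```python
import numpy as np

def active_segments(emg, win=64, k=1.0):
    """Return (start, end) sample indices of active segments in emg.

    emg: samples x channels array. Implements the energy of the first
    difference (step 1) and a win-sample sliding mean (step 2), with a
    simplified median/std-based threshold standing in for step 3.
    """
    d = np.diff(emg, axis=0)                      # differential processing
    E = (d ** 2).mean(axis=1)                     # mean-square energy over N channels
    S = np.convolve(E, np.ones(win) / win, mode="same")  # sliding-window mean
    th = np.median(S) + k * S.std()               # assumed adaptive threshold
    active = S > th
    edges = np.flatnonzero(np.diff(np.r_[0, active.astype(int), 0]))
    return list(zip(edges[::2], edges[1::2]))     # rising/falling edge pairs
```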

2.2.2. Segmentation

In this study, active data were extracted using a sliding window with a 64 ms window length and a step size of 64 ms, and the segmented data were labeled. It is worth mentioning that the idle segment data were designated as “resting”.
The data set was randomly partitioned into a training set and a test set at a ratio of 6:4. The classification model was trained using the training set as input, and its effectiveness was then confirmed on the test set.
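The windowing and labeling described above can be sketched as follows; 64-sample windows at 1000 Hz correspond to the 64 ms window, while the exact array layout is an assumption:

```python
import numpy as np

def segment(emg, label, win=64, step=64):
    """Slice an active segment into labeled windows.

    emg: samples x channels array; win == step gives the non-overlapping
    64 ms windows used in this study. Returns (windows, labels) with
    windows shaped (n_windows, win, channels).
    """
    xs = [emg[s:s + win] for s in range(0, len(emg) - win + 1, step)]
    return np.stack(xs), np.full(len(xs), label)
```

Windows cut from idle segments would be labeled with the "resting" class in the same way, and the pooled windows then randomly split 6:4 into training and test sets.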

2.3. One-Dimensional CNN-RNN

A CNN is specialized for processing data with a grid-like structure, such as multidimensional time series and image data. It is a groundbreaking model in the field of deep learning and excels in a wide range of applications; CNN models have been the de facto standard for image classification for the past decade. Rather than relying on hand-engineered features, a CNN learns features directly from its input, a process known as feature learning. The model typically accepts two-dimensional input data, and similar steps can be taken for one-dimensional data sequences: a one-dimensional CNN applies feature learning to the raw data itself. Figure 4 depicts the fundamental 1D-CNN design.
An RNN is a type of neural network for processing sequence data that can update its internal state via recurrent connections. This property allows the RNN to perform effectively on time-dependent signals such as speech and text. Figure 5 depicts the unfolding of the RNN structural unit over time,
where x is the input vector,
s is the value of the hidden layer,
o is the value of the output layer,
U is the weight matrix from the input layer to the hidden layer,
V is the weight matrix from the hidden layer to the output layer,
W is the weight matrix that feeds the hidden-layer value of the previous moment back in as input at the current moment.
As the RNN structure unit unfolds over time, the value of s_t is related to the input at the current moment and the hidden-layer value of the preceding moment:

s_t = f(U · x_t + W · s_{t−1})
When long-term memory is required, the solution of the RNN depends on the first n steps:

s_t = f(U · x_t + W · s_{t−1} + W · s_{t−2} + … + W · s_{t−n})
As n increases, the model computation grows rapidly, making model training much longer. In addition, when dealing with a long-term problem, the data lose some information at each step of the RNN traversal; because the gradient vanishes, distant information has very little impact on the current instant, so the RNN state retains little trace of the initial input. As a result, the standard RNN model is unsuitable for modeling long-term memory. However, as a variant of the RNN, the LSTM has a significant advantage in handling this problem.
Figure 6 depicts the fundamental LSTM design. The main components of the LSTM unit are the primary layer and three gate controllers (the input gate, forget gate, and output gate).
(i, f, o, g) = (σ, σ, σ, tanh)(W · [h_{t−1}, x_t] + b)

c_t = f ⊙ c_{t−1} + i ⊙ g

h_t = o ⊙ tanh(c_t)
where h_t is the short-term state,
c_t is the long-term state,
x_t is the input,
i, g, f, and o are the input gate, primary layer, forget gate, and output gate, respectively,
W is the stacked weight matrix applied to [h_{t−1}, x_t],
b is the bias vector.
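To make the gate equations concrete, here is a single LSTM step in NumPy with the four gate blocks stacked into one weight matrix W and bias b; the stacking order (i, f, o, g) and the dimensions are assumptions for illustration:

```python
import numpy as np

def lstm_step(x_t, h_prev, c_prev, W, b):
    """One LSTM time step: i, f, o = sigmoid(.), g = tanh(.).

    W: (4*H, H+D) stacked gate weights; b: (4*H,) stacked biases.
    Returns the new short-term state h_t and long-term state c_t.
    """
    H = h_prev.shape[0]
    z = W @ np.concatenate([h_prev, x_t]) + b   # W . [h_{t-1}, x_t] + b
    sig = lambda v: 1.0 / (1.0 + np.exp(-v))
    i, f, o = sig(z[:H]), sig(z[H:2 * H]), sig(z[2 * H:3 * H])
    g = np.tanh(z[3 * H:])
    c_t = f * c_prev + i * g                    # c_t = f (.) c_{t-1} + i (.) g
    h_t = o * np.tanh(c_t)                      # h_t = o (.) tanh(c_t)
    return h_t, c_t
```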
Figure 7 depicts the one-dimensional convolutional recurrent neural network model (1D-CNN-RNN) created for this investigation. The neural network consists of four modules. The first module consists of two one-dimensional convolution layers with 64 units each and ReLU as the activation function, a batch normalization layer, and a maximum pooling layer with a window length of two. Similarly, the second module has two one-dimensional convolution layers with 128 units each and ReLU activation, a batch normalization layer, a maximum pooling layer with a window length of two, and a dropout layer with a dropout rate of 0.2. The third module is composed of two LSTM cells with tanh as the activation function and two dropout layers with dropout rates of 0.2 and 0.5. The final module contains the Dense, Flatten, and Softmax layers. The fundamental idea behind this architecture is to merge the CNN and RNN: take full advantage of the CNN's benefits in feature extraction and multidimensional timing-signal processing, and add an LSTM structure for temporal memory to overcome the CNN's shortcomings with time delay.
The classification results of the model were contrasted with those of the CNN and LSTM in order to examine the viability and advantages of the model in sEMG-based hand motion recognition. The same data set was used to train all three models. Each model was trained for 30 epochs with a batch size of 128, Adam's initial learning rate was set to 0.001, and the cross-entropy loss function served as the model's loss function.
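A Keras sketch of the four modules and the training configuration described above is given below. The text specifies the filter counts, pooling windows, dropout rates, optimizer, and batch size, but the kernel size of 3, "same" padding, LSTM width of 128, and an input window of 64 samples by 8 channels are all illustrative assumptions:

```python
from tensorflow.keras import layers, models

def build_1d_cnn_rnn(win=64, ch=8, n_classes=20):
    """Sketch of the 1D-CNN-RNN; hyperparameters not stated in the text
    (kernel size, padding, LSTM units) are illustrative assumptions."""
    m = models.Sequential([
        # Module 1: two Conv1D(64) + BatchNorm + MaxPool(2)
        layers.Conv1D(64, 3, padding="same", activation="relu",
                      input_shape=(win, ch)),
        layers.Conv1D(64, 3, padding="same", activation="relu"),
        layers.BatchNormalization(),
        layers.MaxPooling1D(2),
        # Module 2: two Conv1D(128) + BatchNorm + MaxPool(2) + Dropout(0.2)
        layers.Conv1D(128, 3, padding="same", activation="relu"),
        layers.Conv1D(128, 3, padding="same", activation="relu"),
        layers.BatchNormalization(),
        layers.MaxPooling1D(2),
        layers.Dropout(0.2),
        # Module 3: two LSTM cells (tanh) with Dropout(0.2) and Dropout(0.5)
        layers.LSTM(128, return_sequences=True),
        layers.Dropout(0.2),
        layers.LSTM(128, return_sequences=True),
        layers.Dropout(0.5),
        # Module 4: Flatten -> Dense softmax over the movement classes
        layers.Flatten(),
        layers.Dense(n_classes, activation="softmax"),
    ])
    m.compile(optimizer="adam",  # Adam, initial learning rate 0.001
              loss="categorical_crossentropy", metrics=["accuracy"])
    return m
```

Training would then call `m.fit(X_train, y_train, epochs=30, batch_size=128)`, matching the schedule above.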

2.4. Evaluation Metrics

Recall, accuracy, precision, and F1 score were the quantitative evaluation indices employed in this study to confirm the model's performance (the F1 score considers precision and recall at the same time, balancing and maximizing both).
Recall = TP / (TP + FN) × 100%

Accuracy = (TN + TP) / (TN + TP + FN + FP) × 100%

Precision = TP / (TP + FP) × 100%

F1 = 2 × (Recall × Precision) / (Recall + Precision)
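For a single class treated one-vs-rest, these four indices follow directly from the confusion-matrix counts, as in this minimal sketch:

```python
def classification_metrics(tp, fp, fn, tn):
    """Recall, accuracy, precision, and F1 from confusion-matrix counts."""
    recall = tp / (tp + fn)
    accuracy = (tn + tp) / (tn + tp + fn + fp)
    precision = tp / (tp + fp)
    f1 = 2 * recall * precision / (recall + precision)
    return recall, accuracy, precision, f1
```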

3. Results and Analysis

3.1. Offline Result Analysis

The CNN, LSTM, and 1D-CNN-RNN were applied to compare the performance of the classification methods. The comparative evaluation indices of the training and test from the 23 subjects based on the three models are provided in Table 1. Although the training process of the CNN took less time than the LSTM and 1D-CNN-RNN models, the test set's classification accuracy (85.43%) and loss value (0.4446) fell significantly short of the pattern recognition system's accuracy requirements. The classification accuracy of the LSTM model was significantly improved compared to the CNN model, but it consumed the longest training time, as much as 3.33 times that of the CNN. The test-set accuracy of the 1D-CNN-RNN for the pattern recognition of 20 motion modes was 98.88%, outperforming the separate CNN and LSTM models; moreover, its training time was only 41% of that of the LSTM model. Consequently, the 1D-CNN-RNN performed best in recall rate, accuracy, and F1 score, with 98.88%, 98.96%, and 0.9896, respectively.
As can be seen in Figure 8, compared with the CNN and LSTM models, the 1D-CNN-RNN model presented a faster convergence rate and remained stable after a shorter training period. The loss curves of this model's training and test sets also showed a good overall fit. The CNN model, however, did not reach convergence within the same number of training rounds as the 1D-CNN-RNN. For the LSTM model, the loss curve displayed an overall declining tendency; however, the loss value initially increased because of challenging samples encountered during training, and after numerous training rounds it settled into a specific range and oscillated there.
The 1D-CNN-RNN model also performs admirably in recognizing similar movements, and the recognition accuracy for all 20 motion patterns reaches 97% or higher; the confusion matrix is depicted in Figure 9. As the above comparison shows, the 1D-CNN-RNN model achieves the best pattern recognition performance under the same pretreatment and the same neural network hyperparameter configuration, and the CNN model performs the worst. The advantage of the one-dimensional CNN-RNN model over the pure LSTM approach is that the convolution layers placed before the LSTM units reduce the input's dimension, minimizing computation and boosting efficiency. Feature extraction in the convolutional layers benefits from the batch normalization implemented in the model, and the additional dropout layers prevent overfitting, making the model structure more robust.

3.2. Online Result Analysis

Before the online classification experiments with the 1D-CNN-RNN model, we trained the model offline on data from the 10 new subjects. According to Table 2, the offline recognition accuracy of the 1D-CNN-RNN model exceeds 98%, and the loss value, recall rate, and precision all reach good performance. The results are in line with the 1D-CNN-RNN training results obtained from the offline experiments, further demonstrating the practicability of using data from newly added subjects to train the original neural network model, which cuts down offline training time and increases the efficiency of online recognition.
The outcomes of the real-time recognition of the 20 motion patterns by the 10 participants are displayed in the histograms of Figures 10 and 11. The average recognition accuracy of the ten subjects is 91% ± 5%. The real-time recognition accuracy of the following 12 movements is above 90%: rest, index and middle finger extension, five-finger grasp, four-finger pinch, four-finger stretch, fist clench, five-finger stretch, five-finger pinch, external wrist rotation, wrist flexion, and wrist extension. We speculate that these movements had better online recognition accuracy because their properties are distinct and the muscular force applied is not easily misinterpreted. By contrast, the thumb-related movements (thumb lateral adduction, thumb extension, index finger pinching, and three-finger pinching) did not reach this level. This is probably because these thumb-related movements have a quite high degree of similarity: when the useful information in the sEMG signal was insufficient, motion pattern identification mistakes were more likely.

4. Discussion

The intelligent bionic manipulator based on sEMG signal regulation currently offers a wide range of potential applications. The focus of research is on how to accurately extract motion information from sEMG and perform motion recognition. This study presented a 1D-CNN-RNN model to perform quick and precise multi-motion pattern recognition based on the “end-to-end” features of deep learning models. Pattern recognition was performed on 23 subjects’ sEMG signals in 20 forearm motion modes. The final evaluation results showed that the designed 1D-CNN-RNN model presented excellent performances, its accuracy reached 98.96%, and the recognition effect of similar actions was better compared with other common neural networks. The outcomes of the online test demonstrated that the 1D-CNN-RNN model put forth in this paper performs admirably in the recognition accuracy of the majority of activities, with an average recognition accuracy of 91%.
In terms of time delay in online recognition, this model performed quite well. Table 3 shows the model performances reported in different studies. Among the models with stated real-time performance, the sliding windows of [33,34] are too long, which may significantly reduce real-time classification performance. According to previous reports [36], the human body can hardly perceive a time delay of 0–300 ms, while the typical latency of this model is 153 ms. Hence, the 1D-CNN-RNN proposed in this study offers better real-time performance together with high accuracy compared to other studies. Additionally, the sliding-window length only partly reflects the real-time performance of [33,34,35]; the equipment used to collect the sEMG signals is also one of the factors affecting the delay of real-time recognition. Therefore, more research and discussion are required to determine whether the delay of real-time recognition may be decreased by enhancing the hardware quality.
Additionally, different handedness tends to produce dissimilar muscular force patterns in the upper limb. To investigate how handedness impacts recognition accuracy in the online movement classification, we compared the recognition results under different handedness. As shown in Figure 11, the first subject (S1), a left-hander, has the lowest average recognition accuracy at just 79%. This is probably because the offline recognition model was trained on data from right-handers, while the left-hand data from the first subject were used to train the online experimental model; therefore, the recognition accuracy of the first subject was less than 85%. However, the accuracy of real-time recognition tended to rise with the number of subjects, and the accuracy of subsequent left-handed individuals, such as the second, third, and seventh subjects (S2, S3, S7), is much higher than that of the initial subject. Their accuracies gradually improved, and the accuracies of S3 and S7 showed no significant difference from those of the right-handed subjects. The results indicate that the accuracy of neural network pattern recognition is higher with more training data.
The multi-movement classification outcomes in this study revealed that the identification accuracies of thumb upward, index finger pinching, three-finger pinching, and lateral thumb adduction movements were generally poor. This may be because these actions all involve the thumb and the corresponding hand muscles, producing similar sEMG signals that are easily drowned out by the signals of the remaining fingers. The next stage will involve paying more attention to the specifics of the sEMG signals and further separating some unique activities. To address the shortcomings of using only sEMG signals, we will also consider incorporating other signals, given the advantages of joint angle, acceleration, signal acquisition location [38], and other information [39] in motion pattern recognition. Moreover, the proposed 1D-CNN-RNN model is intended for the motion control of intelligent EMG prostheses, whereas the sEMG signals used for pattern recognition in this research were derived from healthy subjects. sEMG motion control and intensity in amputees differ from those of healthy individuals, which creates a challenge: the present models are based on data from healthy people and do not always match amputees. To further enhance the pattern recognition framework and perform adaptive adjustments according to the actual circumstances of amputees, sEMG signals from amputees will be used in a subsequent study.

5. Conclusions

This study demonstrates that the proposed 1D-CNN-RNN model for motion pattern recognition achieves good classification performance based on multi-channel sEMG across 20 independent and combined finger and wrist movements. The average offline recognition accuracy of the 1D-CNN-RNN model reaches 98.96%, significantly higher than that of the CNN and LSTM models (85.43% and 96.88%, respectively, p < 0.01), and the model achieves better overall performance. A key finding of this study is that the 1D-CNN-RNN model also performs better in real-time recognition. This may be because the 1D-CNN and LSTM components are both well suited to processing time-series data, while the added batch normalization and dropout layers promote fast convergence and prevent overfitting.
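To illustrate how such a hybrid model processes a single sEMG window, the following pure-NumPy forward pass chains a 1D convolution over time, a simple recurrent summary of the resulting feature sequence, and a softmax classifier. All layer sizes are hypothetical, the weights are random, and training-time components such as batch normalization and dropout are omitted; this is a conceptual sketch, not the architecture used in the study.

```python
import numpy as np

rng = np.random.default_rng(0)

def conv1d(x, w, b):
    """Valid 1D convolution with ReLU: x (T, C_in), w (K, C_in, C_out)."""
    T, _ = x.shape
    K, _, C_out = w.shape
    out = np.empty((T - K + 1, C_out))
    for t in range(T - K + 1):
        out[t] = np.tensordot(x[t:t+K], w, axes=([0, 1], [0, 1])) + b
    return np.maximum(out, 0.0)

def rnn_last(x, wx, wh, bh):
    """Simple tanh RNN; returns the final hidden state."""
    h = np.zeros(wh.shape[0])
    for t in range(x.shape[0]):
        h = np.tanh(x[t] @ wx + h @ wh + bh)
    return h

def softmax(z):
    e = np.exp(z - z.max())
    return e / e.sum()

# Hypothetical sizes: 8 sEMG channels, 200-sample window, 20 classes.
T, C_in, K, C_conv, H, n_classes = 200, 8, 5, 16, 32, 20
w_c = rng.normal(0, 0.1, (K, C_in, C_conv)); b_c = np.zeros(C_conv)
wx = rng.normal(0, 0.1, (C_conv, H)); wh = rng.normal(0, 0.1, (H, H))
bh = np.zeros(H)
w_o = rng.normal(0, 0.1, (H, n_classes)); b_o = np.zeros(n_classes)

window = rng.normal(size=(T, C_in))   # one raw sEMG window
feat = conv1d(window, w_c, b_c)       # local temporal features
h = rnn_last(feat, wx, wh, bh)        # sequence summary
probs = softmax(h @ w_o + b_o)        # class probabilities
print(probs.shape)
```

The division of labor shown here mirrors the intuition in the text: the convolution extracts local temporal patterns, while the recurrent layer integrates them across the whole window before classification.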
Real-time pattern recognition experiments were carried out with ten subjects to examine the real-time performance of the developed 1D-CNN-RNN model. The average real-time recognition rate is 91% ± 5%, and the average delay is 153 ms, which meets the requirements of the real-time control of intelligent prosthetics. The proposed 1D-CNN-RNN classification model showed significant advantages in real-time recognition accuracy, and since the average time delay falls within the range that produces no perceptible lag for the user [36], it is expected to provide an efficient control method for EMG prosthetic hands. The stability and accuracy of real-time recognition will be further investigated in future studies.
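A real-time pipeline of this kind is typically organized as a sliding-window loop with decision smoothing, as sketched below. The sampling rate, window length, increment, vote length, and the placeholder classifier are illustrative assumptions, not the parameters used in this study.

```python
import numpy as np
from collections import deque, Counter

FS = 1000    # assumed sampling rate (Hz)
WIN = 150    # assumed window length in samples (~150 ms at FS)
STEP = 50    # assumed window increment (~50 ms at FS)

def classify(window):
    # Placeholder for a trained classifier; here: sign of the window mean.
    return int(window.mean() > 0)

def stream_decisions(signal, vote_len=3):
    """Slide a window over the stream, classify each window,
    and smooth the output with a majority vote over recent decisions."""
    votes = deque(maxlen=vote_len)
    out = []
    for start in range(0, len(signal) - WIN + 1, STEP):
        votes.append(classify(signal[start:start + WIN]))
        out.append(Counter(votes).most_common(1)[0][0])
    return out

# Synthetic stream: class 0 activity followed by class 1 activity.
sig = np.concatenate([np.full(500, -1.0), np.full(500, 1.0)])
decisions = stream_decisions(sig)
print(len(decisions), decisions[0], decisions[-1])
```

The window length and increment set a floor on the controller delay, which is the trade-off analyzed in [36]: longer windows improve accuracy but increase the lag perceived by the user.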

Author Contributions

Conceptualization, S.L.; methodology, S.L.; software, Y.T. and W.L.; validation, Y.Z., Y.T. and W.S.; formal analysis, Y.Z. and Y.T.; investigation, Y.Z., Y.T. and W.S.; resources, S.L.; data curation, Y.Z. and Y.T.; writing—original draft preparation, Y.Z. and Y.T.; writing—review and editing, S.L., Y.Z., W.L. and H.Y.; visualization, Y.Z.; supervision, S.L.; project administration, S.L.; funding acquisition, S.L. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the National Key Research and Development Program of China, grant numbers 2020YFC2007902 and 2018YFC2002601.

Institutional Review Board Statement

The study was conducted in accordance with the Declaration of Helsinki, and the protocol was approved by the Ethics Committee of the Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences (SIAT-IRB-221115-H0626).

Informed Consent Statement

All subjects gave their informed consent for inclusion before they participated in the study. Written informed consent has been obtained from the patient(s) to publish this paper.

Data Availability Statement

No new data were created or analyzed in this study. Data sharing is not applicable to this article.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Scheme, E.; Englehart, K. Electromyogram Pattern Recognition for Control of Powered Upper-limb Prostheses: State of the Art and Challenges for Clinical Use. J. Rehabil. Res. Dev. 2011, 48, 643–659. [Google Scholar] [CrossRef]
  2. Farina, D.; Jiang, N.; Rehbaum, H.; Holobar, A.; Graimann, B.; Dietl, H.; Aszmann, O.C. The Extraction of Neural Information from the Surface EMG for the Control of Upper-Limb Prostheses: Emerging Avenues and Challenges. IEEE Trans. Neural Syst. Rehabil. Eng. 2014, 22, 797–809. [Google Scholar] [CrossRef]
  3. Englehart, K.; Hudgins, B.; Parker, P.A.; Stevenson, M. Classification of the Myoelectric Signal Using Time-frequency Based Representations. Med. Eng. Phys. 1999, 21, 431. [Google Scholar] [CrossRef] [PubMed]
  4. Finley, F.R.; Wirta, R.W. Myocoder Studies of Multiple Myopotential Response. Arch. Phys. Med. Rehabil. 1967, 48, 598. [Google Scholar] [PubMed]
  5. Hudgins, B.; Parker, P.; Scott, R.N. A New Strategy for Multifunction Myoelectric Control. IEEE Trans. Biomed. Eng. 1993, 40, 82–94. [Google Scholar] [CrossRef]
  6. Englehart, K.; Hudgins, B. A Robust, Real-time Control Scheme for Multifunction Myoelectric Control. IEEE Trans. Biomed. Eng. 2003, 50, 848–854. [Google Scholar] [CrossRef]
  7. Liu, J.; He, J.; Sheng, X.; Zhang, D.; Zhu, X. A New Feature Extraction Method Based on Autoregressive Power Spectrum for Improving sEMG Classification. Eng. Med. Biol. Soc. 2013, 2013, 5746–5749. [Google Scholar]
  8. Futamata, M.; Nagata, K.; Magatani, K. The Evaluation of The Discriminant Ability of Multiclass SVM in a Study of Hand Motion Recognition by Using SEMG. Int. Conf. IEEE Eng. Med. Biol. Soc. 2012, 2012, 5246–5249. [Google Scholar]
  9. Purushothaman, G.; Vikas, R. Identification of a Feature Selection Based Pattern Recognition Scheme for Finger Movement Recognition from Multichannel EMG Signals. Australas. Phys. Eng. Sci. Med. 2018, 41, 549–559. [Google Scholar] [CrossRef]
  10. Geethanjali, P. A Mechatronics Platform to Study Prosthetic Hand Control Using EMG Signals. Australas. Phys. Eng. Sci. Med. 2016, 39, 765–771. [Google Scholar] [CrossRef]
  11. Amirabdollahian, F.; Walters, M.L. Application of Support Vector Machines to Detect Hand and Wrist Gestures Using a Myoelectric Armband. In Proceedings of the IEEE International Conference on Rehabilitation Robotics, London, UK, 20 February 2017; pp. 111–115. [Google Scholar]
  12. Caesarendra, W.; Tjahjowidodo, T.; Nico, Y.; Wahyudati, S.; Nurhasanah, L. EMG Finger Movement Classification Based on ANFIS. In Proceedings of the International Conference on Mechanical, Electronics, Computer, and Industrial Technology, Prima, Indonesia, 6–8 December 2018; Volume 1007. [Google Scholar]
  13. Paul, Y.; Goyal, V.; Jaswal, R.A. Comparative Analysis Between SVM & KNN Classifier for EMG Signal Classification on Elementary Time Domain Features. In Proceedings of the 2017 4th International Conference on Signal Processing, Computing and Control (ISPCC), Solan, India, 21–23 September 2017. [Google Scholar]
  14. Chan, F.H.Y.; Yang, Y.S. Fuzzy EMG Classification for Prosthesis Control. IEEE Trans. Rehabil. Eng. 2000, 8, 305–311. [Google Scholar] [CrossRef] [PubMed]
  15. Ajiboye, A.B.; Weir, R.F. A Heuristic Fuzzy Logic Approach to EMG Pattern Recognition for Multifunctional Prosthesis Control. IEEE Trans. Neural Syst. Rehabil. Eng. 2005, 13, 280–291. [Google Scholar] [CrossRef]
  16. Zhou, X.; Li, Y.; Liang, W. CNN-RNN Based Intelligent Recommendation for Online Medical Pre-Diagnosis Support. IEEE-ACM Trans. Comput. Biol. Bioinf. 2021, 18, 912–921. [Google Scholar] [CrossRef] [PubMed]
  17. Atzori, M.; Cognolato, M.; Müller, H. Deep Learning with Convolutional Neural Networks Applied to Electromyography Data: A Resource for the Classification of Movements for Prosthetic Hands. Front. Neurorobot. 2016, 10, 9. [Google Scholar]
  18. Geng, W.; Du, Y.; Jiang, W.; Wei, W.; Hu, Y.; Li, J. Gesture Recognition by Instantaneous Surface EMG Images. Sci. Rep. 2016, 6, 36571–36579. [Google Scholar]
  19. Zhai, X.; Beth, J.; Chan, R.H.M.; Tin, C. Self-Recalibrating Surface EMG Pattern Recognition for Neuroprosthesis Control Based on Convolutional Neural Network. Front. Neurosci. 2017, 11, 379. [Google Scholar] [CrossRef]
  20. Wei, W.; Wong, Y.; Du, Y.; Hu, Y.; Kankanhalli, M.; Geng, W. A Multi-stream Convolutional Neural Network for sEMG-based Gesture Recognition in Muscle-Computer Interface. Pattern Recognit. Lett. 2017, 119, 131–138. [Google Scholar] [CrossRef]
  21. Chen, H.; Zhang, Y.; Li, G.; Fang, Y.; Liu, H. Surface Electromyography Feature Extraction via Convolutional Neural Network. Int. J. Mach. Learn. Cybern. 2020, 11, 185–196. [Google Scholar] [CrossRef]
  22. Husken, M.; Stagge, P. Recurrent Neural Networks for Time Series Classification. Neurocomputing 2003, 50, 223–235. [Google Scholar] [CrossRef]
  23. Barron, O.; Raison, M.; Gaudet, G.; Achiche, S. Recurrent Neural Network for electromyographic gesture recognition in transhumeral amputees. Appl. Soft Comput. 2020, 96, 106616. [Google Scholar] [CrossRef]
  24. Teban, T.A.; Precup, R.E.; Voisan, E.L.; de Oliveira, T.E.A.; Petriu, E.M. Recurrent Dynamic Neural Network Model for Myoelectric-based Control of a Prosthetic Hand. In Proceedings of the 2016 Annual IEEE Systems Conference (SysCon), Orlando, FL, USA, 18–21 April 2016; pp. 1–6. [Google Scholar]
  25. Koch, P.; Phan, H.; Maass, M.; Katzberg, F.; Mertins, A. Recurrent Neural Network Based Early Prediction of Future Hand Movements. In Proceedings of the 2018 40th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC), Honolulu, HI, USA, 18–21 July 2018; pp. 4710–4713. [Google Scholar]
  26. Hochreiter, S.; Schmidhuber, J. Long Short-Term Memory. Neural Comput. 1997, 9, 1735–1780. [Google Scholar] [CrossRef]
  27. Quivira, F.; Koike-Akino, T.; Ye, W.; Erdogmus, D. Translating sEMG Signals to Continuous Hand Poses using Recurrent Neural Networks. In Proceedings of the IEEE Conference on Biomedical and Health Informatics, Las Vegas, NV, USA, 4–7 March 2018; pp. 166–169. [Google Scholar]
  28. Hu, Y.; Wong, Y.; Wei, W.; Du, Y.; Kankanhalli, M.; Geng, W. A Novel Attention-based Hybrid CNN-RNN Architecture for sEMG-based Gesture Recognition. PLoS ONE 2018, 13, e0206049. [Google Scholar] [CrossRef]
  29. Xia, P.; Hu, J.; Peng, Y. EMG-Based Estimation of Limb Movement Using Deep Learning With Recurrent Convolutional Neural Networks. Artif. Organs 2017, 42, E67–E77. [Google Scholar] [CrossRef]
  30. Jiang, Y.; Song, L.; Zhang, J.; Song, Y.; Yan, M. Multi-Category Gesture Recognition Modeling Based on sEMG and IMU Signals. Sensors 2022, 22, 5855. [Google Scholar] [CrossRef]
  31. Chen, Y.; Dai, C.; Chen, W. Cross-Comparison of EMG-to-Force Methods for Multi-DoF Finger Force Prediction Using One-DoF Training; IEEE: New York, NY, USA, 2020; Volume 8, pp. 13958–13968. [Google Scholar]
  32. Azizjon, M.; Jumabek, A.; Kim, W. 1D CNN Based Network Intrusion Detection with Normalization on Imbalanced Data. In Proceedings of the 2020 International Conference on Artificial Intelligence in Information and Communication (ICAIIC), Fukuoka, Japan, 19–21 February 2020; pp. 218–224. [Google Scholar]
  33. Nasri, N.; Orts-Escolano, S.; Gomez-Donoso, F.; Cazorla, M. Inferring Static Hand Poses from a Low-Cost Non-Intrusive sEMG Sensor. Sensors 2019, 19, 371. [Google Scholar] [CrossRef] [PubMed]
  34. He, Y.; Fukuda, O.; Bu, N.; Okumura, H.; Yamaguchi, N. Surface EMG Pattern Recognition Using Long Short-Term Memory Combined with Multilayer Perceptron. In Proceedings of the 2018 40th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC), Honolulu, HI, USA, 18–21 July 2018; pp. 5636–5639. [Google Scholar]
  35. Zhang, Z.; He, C.; Yang, K. A Novel Surface Electromyographic Signal-Based Hand Gesture Prediction Using a Recurrent Neural Network. Sensors 2020, 20, 3994. [Google Scholar] [CrossRef] [PubMed]
  36. Smith, L.H.; Hargrove, L.J.; Lock, B.A.; Kuiken, T.A. Determining the Optimal Window Length for Pattern Recognition-based Myoelectric Control: Balancing the Competing Effects of Classification Error and Controller Delay. IEEE Trans. Neural Syst. Rehabil. Eng. 2011, 19, 186–192. [Google Scholar] [CrossRef] [PubMed]
  37. Simo, M.; Neto, P.; Gibaru, O. EMG-based Online Classification of Gestures with Recurrent Neural Networks. Pattern Recognit. Lett. 2019, 128, 45–51. [Google Scholar] [CrossRef]
  38. Botros, F.S.; Phinyomark, A.; Scheme, E.J. Electromyography-Based Gesture Recognition: Is It Time to Change Focus from the Forearm to the Wrist? IEEE Trans. Ind. Inf. 2022, 18, 174–184. [Google Scholar] [CrossRef]
  39. Zhou, H.; Zhang, Q.; Zhang, M.; Shahnewaz, S.; Wei, S.; Ruan, J.; Zhang, X.; Zhang, L. Toward Hand Pattern Recognition in Assistive and Rehabilitation Robotics Using EMG and Kinematics. Front. Neurorobot. 2021, 15, 659876. [Google Scholar] [CrossRef] [PubMed]
Figure 1. Schematic diagram of experimental data acquisition.
Figure 2. Example of the raw 8-channel sEMG signal.
Figure 3. Schematic diagram of the overall design of the experiment.
Figure 4. Diagram of the 1D-CNN base architecture.
Figure 5. The RNN structural units unfolding over time.
Figure 6. Diagram of the LSTM base architecture.
Figure 7. Diagram of 1D-CNN-RNN model.
Figure 8. The loss function and accuracy curve of training and validation sets of three models.
Figure 9. CNN-RNN model pattern recognition confusion matrix.
Figure 10. The recognition accuracy of each action in online recognition (the error bar represents the standard deviation).
Figure 11. The identification accuracy of each subject in online identification (S1–S10 are subject numbers; the error bar represents the standard deviation).
Table 1. Classification results under different pattern recognition models.

| Model | Data Set | Loss Value | Recall | Accuracy | Precision | F1 Score | Training Time |
| --- | --- | --- | --- | --- | --- | --- | --- |
| CNN | Training | 0.6431 | 73.14% | 77.99% | 83.34% | 0.7791 | 493 min |
| CNN | Test | 0.4446 | 79.93% | 85.43% | 90.55% | 0.8491 | / |
| LSTM | Training | 0.1497 | 94.75% | 95.39% | 96.17% | 0.9545 | 1642 min |
| LSTM | Test | 0.0952 | 96.58% | 96.88% | 93.77% | 0.9695 | / |
| 1D-CNN-RNN | Training | 0.0607 | 98.01% | 98.19% | 98.42% | 0.9821 | 672 min |
| 1D-CNN-RNN | Test | 0.0340 | 98.88% | 98.96% | 99.04% | 0.9896 | / |
Table 2. Classification results based on offline training set evaluation.

| Serial Number | Loss | Recall | Accuracy | Precision | F1 Score |
| --- | --- | --- | --- | --- | --- |
| 1 | 0.0065 | 99.80% | 99.82% | 99.85% | 0.9983 |
| 2 | 0.0274 | 99.28% | 99.34% | 99.39% | 0.9934 |
| 3 | 0.0407 | 98.60% | 98.73% | 98.81% | 0.9870 |
| 4 | 0.0328 | 98.98% | 99.04% | 99.20% | 0.9909 |
| 5 | 0.0580 | 97.98% | 98.15% | 98.34% | 0.9816 |
| 6 | 0.0303 | 99.06% | 99.15% | 99.25% | 0.9915 |
| 7 | 0.0539 | 97.98% | 98.18% | 98.42% | 0.9820 |
| 8 | 0.0265 | 99.31% | 99.33% | 99.43% | 0.9937 |
| 9 | 0.0547 | 98.19% | 98.30% | 98.35% | 0.9827 |
| 10 | 0.0259 | 99.20% | 99.27% | 99.35% | 0.9928 |
| Mean | 0.0357 ± 0.0162 | 98.84 ± 0.62% | 98.93 ± 0.57% | 98.84 ± 0.62% | 0.9894 ± 0.0057 |
Table 3. Performance comparison of different research models.

| Research | Channel Number | Number of Moves | Number of Repetitions | Number of Subjects | Classifier | Accuracy | Time Delay |
| --- | --- | --- | --- | --- | --- | --- | --- |
| [37] | 16 | 8 | / | / | LSTM | 95.0% | / |
| [33] | 8 | 6 | 195 | 35 | RNN | 99.8% | 940 ms |
| [35] | 8 | 21 | 30 | 13 | RNN | 89.6% | 200 ms |
| [34] | 12 | 52 | 10 | 27 | LSTM | 75.5% | 400 ms |
| This research | 8 | 20 | 20 | 10 | CNN-RNN | 91.0% | 153 ms |

Share and Cite

MDPI and ACS Style

Li, S.; Zhang, Y.; Tang, Y.; Li, W.; Sun, W.; Yu, H. Real-Time sEMG Pattern Recognition of Multiple-Mode Movements for Artificial Limbs Based on CNN-RNN Algorithm. Electronics 2023, 12, 2444. https://doi.org/10.3390/electronics12112444

