Next Article in Journal
A Multi-User Game-Theoretical Multipath Routing Protocol to Send Video-Warning Messages over Mobile Ad Hoc Networks
Next Article in Special Issue
Gait Measurement System for the Multi-Target Stepping Task Using a Laser Range Sensor
Previous Article in Journal
An Ultrasonic Sensor System Based on a Two-Dimensional State Method for Highway Vehicle Violation Detection Applications
Previous Article in Special Issue
Stride Segmentation during Free Walk Movements Using Multi-Dimensional Subsequence Dynamic Time Warping on Inertial Sensor Data
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Comparison of sEMG-Based Feature Extraction and Motion Classification Methods for Upper-Limb Movement

1
The Institute of Advanced Biomedical Engineering System, School of Life Science and Technology, Beijing Institute of Technology, Haidian District, Beijing 100081, China
2
Key Laboratory of Convergence Medical Engineering System and Healthcare Technology, The Ministry of Industry and Information Technology, School of Life Science and Technology, Beijing Institute of Technology, Haidian District, Beijing 100081, China
3
Faculty of Engineering, Kagawa University, Hayashi-cho, Takamatsu 761-0369, Japan
*
Author to whom correspondence should be addressed.
Sensors 2015, 15(4), 9022-9038; https://doi.org/10.3390/s150409022
Submission received: 23 December 2014 / Revised: 8 April 2015 / Accepted: 10 April 2015 / Published: 16 April 2015
(This article belongs to the Special Issue Sensor Systems for Motion Capture and Interpretation)

Abstract

:
The surface electromyography (sEMG) technique is proposed for muscle activation detection and intuitive control of prostheses or robot arms. Motion recognition is widely used to map sEMG signals to the target motions. One of the main factors preventing the implementation of this kind of method for real-time applications is the unsatisfactory motion recognition rate and time consumption. The purpose of this paper is to compare eight combinations of four feature extraction methods (Root Mean Square (RMS), Detrended Fluctuation Analysis (DFA), Weight Peaks (WP), and Muscular Model (MM)) and two classifiers (Neural Networks (NN) and Support Vector Machine (SVM)), for the task of mapping sEMG signals to eight upper-limb motions, to find out the relation between these methods and propose a proper combination to solve this issue. Seven subjects participated in the experiment and six muscles of the upper-limb were selected to record sEMG signals. The experimental results showed that NN classifier obtained the highest recognition accuracy rate (88.7%) during the training process while SVM performed better in real-time experiments (85.9%). For time consumption, SVM took less time than NN during the training process but needed more time for real-time computation. Among the four feature extraction methods, WP had the highest recognition rate for the training process (97.7%) while MM performed the best during real-time tests (94.3%). The combination of MM and NN is recommended for strict real-time applications while a combination of MM and SVM will be more suitable when time consumption is not a key requirement.

1. Introduction

With the development of electromyography (EMG) techniques, the surface EMG (sEMG) signal, which reflects the activation level of skeletal muscles, has become an advanced tool for intuitive control of prostheses or robot arms. Fukuda et al. [1] used sEMG signals to control a manipulator. They adopted a statistical neural network, named the log-linearized Gaussian mixture network, to achieve robust discrimination between differences among individuals, electrode locations, and time variations caused by fatigue or sweat. They reported that the method can provide smooth control for the manipulator and it might allow a physically handicapped person to sense a feeling of prosthetic control similar to that of the original limb. Liarokapis et al. [2] used sEMG signals from sixteen muscles of the upper limb to study the muscular co-activation patterns during a variety of reach-to-grasp motions. Chen et al. [3] developed a multi-kernel learning support vector machine method to classify multiple finger movements. In our previous study, an AR model-based multi-motion recognition method was proposed to classify different upper-limb motions to control an upper-limb exoskeleton rehabilitation device [4,5].
One of the main factors preventing the implementation on EMG signals outside a laboratory environment is the unsatisfactory motion recognition accuracy rate based on EMG signals. Many efforts have been made in order to improve the recognition accuracy rate. Ju et al. [6] designed a fuzzy Gaussian mixture model with a non-linear feature extraction method to classify different hand grasps and in-hand manipulations. They reported that by using their proposed non-linear method a recognition rate of 96.7% could be achieved. Artemiadis et al. [7] developed a switching regime model to decode the EMG activity from 11 muscles into a continuous representation of arm motion in three-dimensional space. They reported that this switching regime model could overcome some of the main difficulties of EMG-based control systems, such as the nonlinearity between the EMG recordings and the arm motion, as well as the non-stationarity of EMG signals with respect to time. Balbinot and Favieiro [8] designed a neuro-fuzzy system to classify human upper-limb movements. Alkan and Gunay [9] used SVM to classify four upper limb movements. Moreover, Young et al. [10] provided a Bayesian theory based conditional parallel linear discriminant analysis (LDA) classifiers for simultaneous movement recognition. Different from regarding the combined movements (with multi-DoF) as independent motions, the proposed classifier could recognize the simultaneous movements with a single-DoF training set.
One of the problems associated with misclassification is the cross-talk, which results from the complex musculoskeletal structure. Shibanoki et al. [11] designed a quasi-optimal channel selection method to select the effective channels for recognition of 16 hand gestures. Naik and Kumar [12,13] applied blind source separation algorithms to separate the more useful information from cross-talk signals to improve the finger-movement recognition accuracy rate. In later research, Naik and Nguyen [14] applied non-negative matrix factorization to identify finger movements and also used independent component analysis to evaluate myoelectric sensor placement [15]. Zhang et al. [16] developed a hand gesture recognition system using not only EMG signals, but also accelerometer information, to identify 72 Chinese sign language words. Tang et al. [17] applied a multi-channel energy ratio feature extraction method to overcome the influence of various forces for a given gesture. They used the proposed feature extraction method and a Cascaded-Structured classifier to recognize eleven hand gestures. Rafiee et al. [18] proposed a Mother Wavelet Matrix (MWM), which has the potential to differentiate the feature vectors to extract features from sixteen locations of EMG signals to classify ten hand motions. For these recognition solutions, many elegant feature extraction or motion classification methods have been proposed based on mathematics or signal processing points of views. As human motion is the consequence of the complex human musculoskeletal system, depending on the generalization of machine learning algorithm seems insufficient.
Besides motion recognition, EMG signals are also used for musculotendon force prediction. Two physiological models are widely used for musculotendon force predictions: Huxley- [19,20] and Hill-type models [21]. Compared with the complexity of Huxley-type models, Hill-type models are more computationally viable. Cavallaro et al. [22] developed a Hill-typemodel-based myoprocessor to predict joint torque. Seven muscles around the upper limb were recorded and a genetic algorithm was implemented to tune the parameters of the model. We also proposed a continuous elbow angle prediction method based on the Hill-type muscular model and a simplified musculoskeletal model [23]. This method can predict elbow joint angle continuously using only EMG signals recorded from the biceps brachii muscle. The motion was constrained in the sagittal plane. Nevertheless, it is relevant difficult to predict motions based on musculotendon forces because of the redundancy of the human musculoskeletal system. In order to simplify the problem, a constrained movement was required in most of the cases [24]. However, the muscular model provides a natural way with sufficient biological meaning for feature extraction for the EMG based motion recognition.
For intuitive control of robot arms, movements of upper limb in daily life are often used. The method applied based on sEMG signals should have the ability to recognize these motions. Furthermore, a real-time response for controlling is often requested. The purpose of this paper is to compare the performance of different recognition methods for eight upper limb motions (elbow flexion and extension, forearm pronation and supination, wrist flexion and extension, wrist abduction and adduction), with various combinations of four feature extraction (RMS, DFA, WP, MM) and two classification methods (NN and SVM), to determine their correlation and a proper way for motion recognition with sEMG signals. The time consumption for training is also discussed, as it reflects the performance of methods.

2. Methods

As introduced previously, four feature extraction methods and two classifiers were chosen for study in this paper. The description for these methods and classifiers are introduced in this Section. However, these methods were applied on pre-processed sEMG signals, which will be explained in the next Section.

2.1. The WPT-Based RMS Feature Extraction Method

The purpose of applying Wavelet Packet Transforms (WPT) in this study is to reduce the influence from extrinsic noises. Although a commercial filter box was used, noise during sampling is inevitable. Such noise can be introduced by relative movement between electrodes and surface skin, or sweat from the skin.
Assuming that U00 is the scaling space of an EMG signal E(t), WPT can decompose U00 into small subspaces in dichotomous way with the algorithm defined in Equation (1):
U n j 1 = U 2 n j U 2 n + 1 j , j Z ; n Z +
where j is the resolution level and ⊕ is the orthogonal decomposition. Unj1, U2nj and U2n+1j are three close spaces corresponding to un(t), u2n(t) and u2n+1(t). The relationship between un(t), u2n(t) and u2n+1(t) is defined as following:
u 2 n ( t ) = 2 k Z h ( k ) u n ( 2 t k )
u 2 n + 1 ( t ) = 2 k Z g ( k ) u n ( 2 t k )
where the function u0(t) can be identified with the scaling function φ and u1(t) can be identified with a mother wavelet Ψ. h(k) and g(k) are the coefficients of the low-pass and the high-pass filters respectively. The sub-signal at Unj1 on the jth level can be reconstructed by Equation (4):
s n j ( t ) = k D j , n k ψ j , k ( t ) , k Z
where Ψj,k(t) is the wavelet function, Dj,nk is the wavelet packet coefficients at Unj1, which can be calculated by Equation (5).
D j , n k = s ( t ) ψ j , k ( t ) d t
In this paper, we chose Daubechies 2 and decomposition raw sEMG signal to the fourth level. The reconstructed wavelet signals calculated by Equation (5) were used to extract features for motion recognition. The Root Mean Square (RMS) method was then applied to extract features from the processed sEMG signals with WPT. RMS values were calculated using an overlapping window of 100 samples length with a 25 samples increment between windows.

2.2. WPT Based Weight Peaks Feature Extraction Method

Although the RMS method can represent the energy of the original signals, in our particular case the trends that reflect changes of muscle contraction are more suitable to represent the features for motion recognition. The purpose of the WP method is to capture the trend of the original sEMG signals [25]. The reconstructed sEMG signals processed by WPT have different frequencies in different nodes. Therefore, the amount of peaks obtained in different nodes is different. Zero crossing which is defined as Equation (6) is used to find where the peak exists:
Z C = n = 1 N 1 ( sgn ( s n × s n + 1 ) | s n s n + 1 | t h r e s h o l d )
All the reconstructed sEMG signals of zero crossing are saved to obtain peaks and valleys among them.
The procedure of the WP method is described as follows:
If max(sZC(i): sZC(i + 1)) + min(sZC(i): sZC(i + 1)) 0
P(i) = max(sZC(i): sZC(i + 1))
Else if max(sZC(i): sZC(i + 1)) + min(sZC(i): sZC(i + 1)) < 0
P(i) = (1) × min(sZC(i): sZC(i + 1))
where sZC(i) is the reconstructed sEMG signal of zero crossing. P(i) is the peak or valley between the data of zero crossing and valleys are transformed into positive numbers.
During experiments, we found that the higher peaks reflect the trend of motion more than the lower peaks. Therefore the next step of weighted peaks is to increase the higher peak component and decrease the lower peak component to obtain a feature near to the trend of a subject’s motion. The algorithm is defined in Equation (7), where n is set experientially:
P ( i + 1 ) = n 1 n P ( i ) + 1 n P ( i + 1 )
The overlapping window for WP (N in Equation (6)) dependeds on the threshold set to detect the zero crossing. According to the experimental results, the calculated features took about 90% of the total samples.

2.3. Detrended Fluctuation Analysis

After the introduction of DFA by Peng et al. [26], this method attracted a lot of attention for analyzing time series. It is believed that DFA is suitable for application to signals whose underlying statistics or dynamics are non-stationary; EMG signals are typical non-stationary.
The first step in DFA is to convert a bounded time series xt into an unbounded process Xt using the following Equation (8):
X t = i = 1 t ( x i x i ¯ )
where x i ¯ is the average value of xi.
Xt is then divided into time windows Yj of L samples of length and local least squares straight-line fitting is calculated by minimizing the squared error with respect to slope and intercept parameters a and b:
E 2 = j = 1 L ( Y i j a b ) 2
Then fluctuation or RMS deviation from the trend is calculated over every window at every time scale:
F ( L ) = ( 1 L j = 1 L ( Y j j a b ) 2 ) 1 / 2
This detrending followed by fluctuation measurement process is repeated over the complete signals set. The disjoint window for DFA was 100 samples.

2.4. Muscular Model Based Feature Extraction Method

The Hill-type based muscular model has already been applied successfully in our previous study to predict elbow joint angles [23]. The advantage of this method is that it can provide quantitative relationships between sEMG signals and elbow joint angles based on a biological point of view, not just considering the issue as a single processing problem. The Equations used to describe Hill-type model are:
F P E , S E = F max ( exp ( S Δ L / Δ L max ) 1 ) / ( exp ( S ) 1 )
F C E = F max u f l f
f v = 0.1433 ( 0.1074 + exp ( 1.3 sinh ( 2.8 V C E / V C E 0 + 1.64 ) ) ) 1
f v = 0.1433 ( 0.1074 + exp ( 1.3 sinh ( 2.8 V C E / V C E 0 + 1.64 ) ) ) 1
V C E 0 = 0.5 ( u + 1 ) V C E max
F T = F C E + F P E , S E
where SE denotes the passive serial element, CE denotes the active contractile element and PE denotes the passive element arranged in parallel to the previous two. ΔL is the change in length of the element with respect to the slack length, fl is the factor of force introduced by the changes of muscle length and fv is another factor of force introduced by changes of muscle change velocity. S is a shape parameter, Fmax is the maximum force exerted by the element for the maximum change in length ΔLmax, and FPE,SE is the passive force generated by the PE or SE depending on the set of parameters used. FT is the total force exerted by the muscle. u is the activation level of a muscle. For simplicity, ΔL was considered as a constant and VCE is zero. The effect of SE and PE was also ignored. Therefore, the musculotendon force (FT) has a relative simple relationship with the muscle activation level (u) under these certain conditions.
Furthermore, raw sEMG signals should be filtered by a high-pass filter to remove any DC offsets or low-frequency noise and then rectified. Sometimes, these rectified signals are directly transformed into muscle activation levels by dividing them by the peak rectified EMG value obtained during the MVC test. Some researchers [27] suggest that a more detailed model of muscle activation dynamics is warranted in order to characterize the time-varying features of the sEMG signal. In this paper, a discretized recursive filter was used.
A discretized recursive filter with a continuous form of a second-order differential equation was implemented:
u ( t ) = M d 2 e ( t ) / d 2 t + B d e ( t ) / d t + K e ( t )
where M, B, and K are the constants that define the dynamics of Equation (17) and e(t) is the processed sEMG signal. This equation can be expressed in discrete form using backward differences:
u ( t ) = α e ( t d ) β 1 u ( t 1 ) β 2 u ( t 2 )
where d is the electromechanical delay and α, β1, and β2 are the coefficients that define the second-order dynamics. Selection of the values for α, β1, and β2 should follow the following restrictions:
β 1 = γ 1 + γ 2
β 2 = γ 1 × γ 2
| γ 1 | < 1
| γ 2 | < 1
α β 1 β 2 = 1
In order to guarantee the stability of the equation and that neural activation does not exceed 1, the calculation results should be filtered by a low-pass filter (with a cut-off frequency of 3–10 Hz) because the muscle naturally acts as a filter. The resulting force changing frequency is much lower than the amplitude changing frequency of sEMG signals. The MM feature extraction method was applied on every recorded sample in real-time. To summarize, the numbers of features generated by the four feature extraction methods in 1 s, given the condition of 1000 Hz sampling frequency, are listed in Table 1.
Table 1. Feature numbers for the four extraction methods.
Table 1. Feature numbers for the four extraction methods.
Feature Extraction Method
Feature NumbersRMSWPDFAMM
40≈900101000
The four feature extraction methods were applied on every sEMG recording channel and the calculated features from all the channels were used as inputs simultaneously for the classifiers.

2.5. Classification Method

The Neural Network (NN) classifier and Support Vector Machine (SVM) classifier were used to recognize the motions. The activation function of the NN is described as:
f ( s ) = 1 / ( 1 + e μ s )
where μ is a constant coefficient and s is the summation of the input defined as:
s = w i x i
where wi is a weighted parameter for each input to the neural node. There is one hidden layer with eight neurons in the neural network. The hyperbolic tangent sigmoid transfer function and scaled conjugate gradient backpropagation training algorithm are adopted for hidden layer combination and learning process, respectively. The dimension of the input vector equals the number of muscles used for sEMG signals recording. The output are vectors structured by one-of-k coding scheme where k is the number of patterns (in this paper it is 9, including the relax motion).
SVM is proposed as a classification technique based on maximizing the margin between a data set and using an optimal hyper plane to separate different data sets [28]. From a mathematial point of view, SVM requires solving the following optimization problem to find the minimum of Equation (26):
φ ( ω , ξ ) = 0.5 ω 2 + c i = 1 l ξ i
subject to:
y i [ ( x i ω ) + b ] 1 ξ i , i = 1 , 2 , ... , l
where x is an n-dimensional vector (in this paper it is 6-dimensional as there are six independent inputs) and b is a scalar. c is the independent variable. l is the number of data points. y is the model to be learned. Equations (26) and (27) can be rewritten in the dual Lagrangian form:
L ˜ ( a ) = n = 1 l a n 0.5 n = 1 l m = 1 l a n a m t n t m k ( x n , x m )
where k(xn,xm) denotes the kernel function which plays the soul role in SVM. Compared with neural network, the objective function of SVM is convex, giving rise in the relatively straightforward solution of the optimization problem.
To classify new data using the trained model, Equation (29) is used based on the conception of kernel function:
y ( x ) = n = 1 N a n t n k ( x , x n ) + b
where N denotes the number of support vectors. The value of target y is correspond to the target motions. In this paper it is integer ranged from 1 to 9.
One of the most popular approaches to training SVM is the Sequential Minimal Optimization (SMO) algorithm [29] which solves the quadratic programming problem of training a SVM. The details for SVM can be found in [30]. As there were four feature extraction methods and two classifiers, eight combinations (4 × 2) were tested in the following experiments, which were RMS + NN, RMS + SVM, WP + NN, WP + SVM, DFA + NN, DFA + SVM, MM + NN and MM + SVM. The classification was performed on data recorded continuously during the tasks.

3. Experimental Results

3.1. Experimental Setup

Seven healthy volunteers (aged from 22 to 28, median 25.70 ± 1.61 years, all male, two left handed and five right handed, height: 1.71 ± 0.17 m, weight: 65.66 ± 8.14) participated in the experiments. In the elbow flexion and extension experiments, subjects were asked to sit on a chair starting with their upper limbs relaxed vertically in the sagittal plane and then flex their forearms to 90°. After having maintained their forearms in a horizontal position for 2 s, the subjects were asked to extend their forearms to the initial vertical position (as shown in Figure 1a,b). In the forearm pronation and supination experiments, subjects were asked to relax their upper limbs vertically and only pronate and supinate forearms to the maximum degrees as they felt comfortable, keeping the upper arm still (as shown in Figure 1c,d). In the experiments on wrist flexion and extension, subjects relaxed their upper limbs vertically and flexed and extended their wrists to the maximum degree as they felt comfortable (as shown in Figure 2a,b). In the wrist adduction and abduction experiments, subjects again relaxed their upper limbs vertically as they did in the previous two experiments, and performed the abduction and abduction movement as shown in Figure 2c,d).
Figure 1. (a) shows the gesture of relaxing where A is the MTx sensor and B is the electrode; (b) shows the gesture of elbow extension; (c) shows the gesture of forearm pronation; (d) shows the gesture of forearm supination.
Figure 1. (a) shows the gesture of relaxing where A is the MTx sensor and B is the electrode; (b) shows the gesture of elbow extension; (c) shows the gesture of forearm pronation; (d) shows the gesture of forearm supination.
Sensors 15 09022 g001
Figure 2. (a) shows the gesture of wrist flexion; (b) shows the gesture of wrist extension; (c) shows the gesture of wrist abduction; (d) shows the gesture of wrist adduction.
Figure 2. (a) shows the gesture of wrist flexion; (b) shows the gesture of wrist extension; (c) shows the gesture of wrist abduction; (d) shows the gesture of wrist adduction.
Sensors 15 09022 g002
Each subject repeated the four experiments five times to acquire enough data to train the classifiers. After classifier training, another five trials were conducted for each of the four experiments to test the on-line performance of the recognition methods. All subjects practiced several times following a standard video before experiments.
Flexor carpi radialis (FCR), flexor carpi ulnaris (FCU), extensor carpi radialis (ECR), extensor carpi ulnaris (ECU), biceps brachii (BB) and triceps brachii (TB) were selected to record sEMG signals. All six s of these sEMG data channelwere used simultaneously for the recognition of the eight motions. Prior to data collection, the skin was shaved and wiped with alcohol. Dry rectangle electrodes (Ag/AgCl, size: 26 × 14 mm), with a skin contact surface of 20 mm2 and inter-electrode distance of 15 mm, were placed parallel to the muscle fibers, according to SENIAM references [31]. Electrode placements were confirmed by voluntary muscle contraction and followed the recommendation of [32]. For FCR, the electrode placement is one-third the distance from the medial epicondyle to the radial styloid. For FCU, it is one-third of the forearm from the medial prominence. For ECR, it is one-third the distance from the lateral epicondyle to the wrist. For ECU, it is one-third of the forearm along the ulna. For BB, it is at the middle of superficial muscle belly. For TB, it is at the long head position. The sampling rate for sEMG signals was 1000 Hz with differential amplification (gain: 1000) and common mode rejection (104 dB) by the commercial filter box (10–450 Hz band-pass, 60 Hz notch filtered, Personal EMG, Oisaka Development Ltd., Fukuyama, Japan). The filtered sEMG signals were then rectified and normalized by MVC values. Two MTx sensors (Xsens Technologies B.V., Enschede, The Netherlands) were attached to the subject’s hand and forearm to record the joint and wrist angles, which were used for labeling the training data. NN and SVM were trained by MATLAB Neural Networks Recognition Tools (MATLAB NRT, MathWorks Co., Natick, MA, USA) and LIBSVM [33], respectively. The on-line tests were conducted using self-designed software developed by Microsoft Visual Studio 2010 (Microsoft Co., Redmond, WA, USA) based on MATLAB NRT and LIBSVM.

3.2. Experimental Results

Different combinations of the four feature extraction methods and two classifiers during the training process are listed in Table 2. For the performance of recognition accuracy rate, the results show that the combination of WP and NN obtained the highest rate (with an average of 97.6%) and the combination of RMS and SVM obtained the lowest rate (with an average of 73.1%). Among the feature extraction methods, the performance of WP and MM is better than that of RMS and DFA while WP is a little better than MM (97.6% vs. 95.1%). Between the two classifiers, NN performs better than SVM during training process for all four features.
Table 2. Performance of off-line training. The value is the recognition accuracy rate defined by number of corrected recognized results dividing total results, with unit of %.
Table 2. Performance of off-line training. The value is the recognition accuracy rate defined by number of corrected recognized results dividing total results, with unit of %.
Subject (NN/SVM)
FeaturesABCDEFGAverage
RMS75.2/74.270.7/66.882.2/80.581.2/78.976.1/71.570.1/69.773.3/70.175.5/73.1
RMSF87.0/82.084.1/83.590.6/89.589.6/86.886.5/85.482.1/80.285.1/81.286.4/84.0
WP98.4/97.697.2/94.197.7/94.598.9/96.897.5/95.297.5/94.596.5/92.597.7/95.0
MM98.8/98.095.7/90.994.1/91.891.4/88.393.1/90.1495.3/93.497.3/95.495.1/92.6
Experimental results of on-line performance are listed in Table 3. It can still be observed that WP and MM obtain higher recognition accuracy rates than RMS and DFA, while MM is a little higher than WP (94.3% vs. 92.0%). However, the performance of NN is worse than that of SVM during on-line testing experiments (except for subject D). The recognition accuracy rates of both classifiers decrease compared with the performance during training period. The magnitude of this decrease for NN is larger than that of SVM.
Among the eight motions, forearm pronation and supination are different from the others as the muscles used to recognize these two motions are not directly involved. The recognition results are listed in Table 4. Although some of these recognition accuracy rates were relative low, all the recognition accuracy rates were above 90% (with the combination of WP + SVM).
Table 3. Performance of on-line testing. The value is the recognition accuracy rate defined by number of corrected recognized results dividing total results, with unit of %.
Table 3. Performance of on-line testing. The value is the recognition accuracy rate defined by number of corrected recognized results dividing total results, with unit of %.
Subject (NN/SVM)
FeaturesABCDEFGAverage
RMS70.2/74.168.7/68.880.1/81.577.1/78.072.1/73.570.0/71.169.1/71.272.5/74.0
DFA79.3/83.080.1/84.382.3/87.181.1/85.379.1/81.375.1/81.177.7/80.279.2/83.2
WP93.4/95.692.2/93.193.1/94.191.1/93.291.3/94.591.5/94.790.5/93.190.1/92.0
MM94.8/97.091.1/93.092.1/95.893.3/90.389.1/92.189.3/92.492.3/96.492.1/94.3
Table 4. Recognition accuracy rate for forearm pronation and supination.
Table 4. Recognition accuracy rate for forearm pronation and supination.
Subject (WP + SVM)
MotionABCDEFG
P/S91.3/97.898.7/94.496.6/97.194.1/97.393.2/95.596.4/92.198.3/90.1
Figure 3. On-line testing performance with MM feature extraction method. (a) shows the classification results by NN; (b) shows the classified results by SVM. The red circles mark the misclassifications using the two classifiers.
Figure 3. On-line testing performance with MM feature extraction method. (a) shows the classification results by NN; (b) shows the classified results by SVM. The red circles mark the misclassifications using the two classifiers.
Sensors 15 09022 g003
One set of on-line testing performance data is depicted in Figure 3, where the blue lines show the classification results and the dashed green lines show the labeled results. The dotted red lines mark the misclassification results. It can be noted that the misclassification places between NN and SVM are similar.
Another important issue for motion recognition is the time consumption for training and on-line computation, while the latter is more important than the former in our particular circumstance. Experimental results for the off-line training process are depicted in Figure 4 which are the average values of off-line training times of the seven subjects on the same feature extraction method and classifier. The light blue bars denote the average training time of NN while the green bars denote the ones for SVM. It can be noticed that NN takes much more time than SVM (469.6 s vs. 73.5 s) to obtain a good performance. For NN, MM and WP take more time than RMS and DFA, while the former pair obtains a higher recognition accuracy rate than the latter pair. For SVM, MM and WP take less time than RMS and DFA, while the former pair still obtains a higher recognition accuracy rate than the latter pair. Experimental results for on-line calculation are depicted in Figure 5 and the average number of support vectors for different feature extraction methods are depicted in Figure 6. The blue bars denote the time consumption for one trial with NN using different feature extraction methods. There is almost no difference because the structure of the NN is the same. However, it is distinctly visible that it takes much more time for SVM to complete one recognition loop than NN (1.0918 ms vs. 0.0034 ms). The sampling frequency in our experiments is 1000 Hz and this is also important for sEMG sampling; a time consumption longer than 1 ms will give rise to a time delay in real-time experiments. Given this condition, RMS and DFA, with time consumptions of 1.7 ms and 1.2 ms, respectively, are not suitable for real-time purposes.
Figure 4. Time consumption of off-line training process. Blue bars show the average training time using NN with different features on all the subjects. Green bars show the results with SVM.
Figure 4. Time consumption of off-line training process. Blue bars show the average training time using NN with different features on all the subjects. Green bars show the results with SVM.
Sensors 15 09022 g004
Figure 5. Time consumption of on-line testing process. Blue bars show the average computing time using NN with different features on all the subjects. Green bars show the results with SVM.
Figure 5. Time consumption of on-line testing process. Blue bars show the average computing time using NN with different features on all the subjects. Green bars show the results with SVM.
Sensors 15 09022 g005
Figure 6. Number of support vectors for different feature extraction method.
Figure 6. Number of support vectors for different feature extraction method.
Sensors 15 09022 g006

4. Discussion

Seven healthy subjects participated in the experiments. Although in some clinical studies a large number of participants are often used, the proposed method is going to be applied on healthy subjects and the purpose of this paper is to find the differences between these methods, not for a commercial implementation on human beings, such as in [6,7,10,11,16], so no more than nine subjects participated. Furthermore, some of these methods were widely studied for their ability of motion recognition. Given the previous consideration, a number of seven subjects was deemed sufficient for this study.
In our special circumstances of pattern recognition using sEMG signals, the experimental results of off-line training performance indicate that NN performed better than SVM on the training set of data at a cost of longer time consumption for training. However, the experimental results of real-time testing indicated that SVM is more generalized than NN on all the data combined (off-line training data and on-line testing data). As non-stationarity and time-variable properties of EMG signals, the generalization of the classifier is important for the issue of pattern recognition using EMG signals. From the experimental results of off-line training and on-line testing, it seems that NN tends to be more over-fitted to local training data while SVM tends to be relatively high in generalization abilities, which matches the respective properties of NN and SVM [9,17,25,28].
As time consumption of NN training depends on the initial values of parameters [25], the time varies from 3 min to 15 min using the MATLAB Neural Pattern Recognition Tool. In this study, the average training time for obtaining the highest recognition rate (with 1% offset) was used to represent the time consumption. Although the fastest time of NN training may be shorter than that of SVM, the performance was worse in most of the cases. During the NN training process, we found that a better training performance usually takes a longer time to train in our special cases. Also several training times are needed to obtain a relative high recognition rate because the initial parameters’ values also involve the training performance. Considering the multiple times needed for acquiring suitable training performance, the time consumption on training a NN classifier is longer than that of SVM. On the contrary, as a consequence of convex subject function, training performance of SVM is much more “stable” than NN. Using the LIBSVM, just one-time-training can obtain a good result, with shorter time consumption for training.
Nevertheless, time consumption on real-time computation of SVM is much longer than that of NN. The prediction using SVM is calculated through Equation (25) and the number of support vectors is the main factor involved in on-line time consumption. As shown in Figure 6, the number exceeds 10,000 for all four feature-extraction methods where RMS had the largest one (29,694) and WP had the smallest one (12,525). Even with the WP method, it takes 12,525 kernel function calculations to obtain one prediction result. As mentioned in Section 3.2, it is not suitable for real-time applications if the time consumption is larger than 1 ms for one prediction loop. Given this condition, a SVM, with support vector larger than 18,400, will exceed the limitations of our real-time requirement. Compared with the strategy of setting adaptive parameters, the SVM selects a large number of support vectors to guarantee the generalization on sEMG signals. Another issue for SVM in our special circumstance is that the support vector number increases with the increasing number of input training samples. As a consequence, the number of training samples cannot be too large if the SVM is applied in a real-time environment. According to our experimental results, a maximum of about 50,000 training samples will fit the requirement. This issue may be addressed by the nonlinear feature extraction method as motioned in the work of Ju et al. [6]. The nonlinear method has the ability to catch the dynamic information relating EMG signals and motions. Although only recognition accuracy rate was discussed in their work, it could be believed that the nonlinear method will extract the more generalized features from the data and reduce the number of support vectors.
Among the four feature extraction methods, WP obtains the highest recognition accuracy rate (97.6%) compared to 95.0% for NN and SVM, respectively, while the second one is MM with 95.1% and 92.6% for NN and SVM, respectively. RMS and DFA cannot achieve a recognition accuracy rate above 90%. One important difference between the four feature extraction methods is the adoption of a low-pass filter or low-pass-filter like algorithm. Among the feature extraction methods, MM applies a low-pass Butterworth filter directly in the processing [23]. The changing muscle force frequency is much lower than that of EMG signals due to the structure of muscle and surrounding tissue. The adoption of a low-pass filter in MM is to mimic the low-pass-filter like function of muscles. The musculotendon force gives rise to human motion while EMG signals are more like reference commands. As a consequence, it is reasonable to process features with a low-pass filter to make them more like musculotendon-force-like signals. Although no low-pass filter is used directly in WP, the algorithm of WP itself is low-pass-filter like, which tends to extract the trend of sEMG signals [25]. RMS can also be considered as a low-pass filter but the ‘cutoff’ frequency is higher than that of MM or WP and it depends on the time interval, which also involves the extraction of detail information in sEMG signals and real-time ability during application. A long time interval will eliminate useful information and cause time delays in real-time applications. As a limitation of the algorithm, DFA is hard to apply in a real-time environment.
As a consequence, for feature extraction, WP and MM performed better than the other two in our case, while MM was more effective than WP in real-time experiments. Although NN took much more training time than SVM, it took much less in real-time performance. If low time consumption is an important requirement, the combination of MM and NN is recommended. Otherwise, the combination of MM and SVM would be a good choice.
Another interesting point is that the motion of forearm pronation and supination was recognized in an indirect way, i.e., the features were not extracted from the muscles responding directly to the motion. Pronator teres, pronator quadratus, supinator and biceps brachii are the four muscles involved in the forearm pronation and supination motion. They are in the deep layer of the forearm, making them impossible to detect by surface electrodes. As a consequence, the motion was recognized via the synergetic behavior of muscles. Similar studies about recognizing motion using muscles around the forearm muscles can be found in [12,13,14,15,16]. In this paper, activations of the selected six muscles are quite different among individuals during the motion of forearm pronation and supination. Recognition of accuracy rates of this motion are thus quite different among individuals. However, all the recognition accuracy rates are above 90%, indicating that the applied method has the ability to perform motion recognition with the indirectly involved muscles.

5. Conclusions

In this paper, eight combinations of four feature extraction methods and two classifiers were tested to recognize eight human upper-limb motions, using features extracted from sEMG signals as inputs. The comparative experimental results show that features extracted by WP and MM can lead to the highest recognition accuracy rate among the four feature extraction methods, while MM is more effective in real-time applications with a trade-off of about a 3% lower recognition accuracy rate than WP. For classifiers, although NN can obtain higher recognition accuracy rate than SVM in the training process, SVM is more accurate than NN in on-line tests. However the time consumption for SVM requires the user to choose the training samples carefully and restrict the number of support vectors or it will cause time-delays in real-time applications. As a consequence, the combination of MM and NN is recommended when strict real-time performance is required. Otherwise, the combination of MM and SVM may provide higher and more stable recognition accuracy rates. The experimental results also indicate that a low-pass filter with cut-off frequency around 1–4 Hz applied in feature extraction will be helpful for increasing recognition accuracy rates.
In the future, MM will be improved to enhance the ability of resistance to the non-stationarity and time-variable properties of sEMG signals, in order to reduce the number of support vectors in SVM and improve the robustness for practical applications. These methods will be applied for robot control and rehabilitation training in the future.

Acknowledgments

This research is partly supported by National Natural Science Foundation of China (61375094), Key Research Program of the Natural Science Foundation of Tianjin (13JCZDJC26200), National High Tech. Research and Development Program of China (No.2015AA043202), and SPS KAKENHI Grant Number 15K2120.

Author Contributions

Shuxiang Guo conceived and designed the experiments; Muye Pang analyzed the data; Muye Pang wrote the paper; Shuxiang Guo and Baofeng Gao checked the results, Hideyuki Hirata and Hidenori Ishihara checked the manuscript.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Fukuda, O.; Tsuji, T.; Otsuka, A. A Human-Assisting Manipulator Teleoperated by EMG Signals and Arm Motions. IEEE Trans. Robot. Autom. 2003, 19, 210–222. [Google Scholar] [CrossRef]
  2. Liarokapis, M.V.; Artemiadis, P.K.; Katsiaris, P.T.; Kyriakopoulos, K.J.; Manolakos, E.S. Learning Human Reach-to-Grasp Strategies: Towards EMG-Based Control of Robotic Arm-Hand Systems. In Proceedings of the 2012 IEEE International Conference on Robotics and Automation, Saint Paul, MN, USA, 14–18 May 2012; pp. 2287–2292.
  3. Chen, X.; Wang, Z.J. Pattern Recognition of Number Gestures Based on a Wireless Surface EMG System. Biomed. Signal Process. Control. 2013, 8, 184–192. [Google Scholar] [CrossRef]
  4. Pang, M.; Guo, S.; Song, Z. Study on the sEMG Driven Upper Limb Exoskeleton Rehabilitation Device in Bilateral Rehabilitation. J. Robot. Mechatron. 2012, 24, 585–594. [Google Scholar]
  5. Pang, M.; Guo, S.; Song, Z.; Zhang, S. A Surface EMG Signals-Based Real-time Continuous Recognition for the Upper Limb Multi-motion. In Proceedings of the 2012 IEEE International Conference on Mechatronics and Automation, Chengdu, China, 5–8 August 2012; pp. 1984–1989.
  6. Ju, Z.; Ouyang, G.; Wilamowska-Korsak, M.; Liu, H. Surface EMG Based Hand Manipulation Identification via Nonlinear Feature Extraction and Classification. IEEE Sens. J. 2013, 13, 3302–3311. [Google Scholar] [CrossRef]
  7. Artemiadis, P.K.; Kyriakopoulos, K.J. A Switching Regime Model for the EMG-Based Control of a Robot Arm. IEEE Trans. Syst. Man Cybern. B Cybern. 2010, 41, 53–63. [Google Scholar] [CrossRef]
  8. Balbinot, A.; Favieiro, G. A Neuro-Fuzzy System for Characterization of Arm Movements. Sensors 2013, 13, 2613–2630. [Google Scholar] [CrossRef] [PubMed]
  9. Alkan, A.; Günay, M. Identification of EMG Signals Using Discriminant Analysis and SVM Classifier. Expert Syst. Appl. 2012, 39, 44–47. [Google Scholar] [CrossRef]
  10. Young, A.J.; Smith, L.H.; Rouse, E.J.; Hargrove, L.J. Classification of Simultaneous Movements Using Surface EMG Pattern Recognition. IEEE Trans. Biomed. Eng. 2013, 60, 1250–1258. [Google Scholar] [CrossRef] [PubMed]
  11. Shibanoki, T.; Shima, K.; Tsuji, T.; Otsuka, A.; Chin, T. A Quasi-Optimal Channel Selection Method for Bioelectric Signal Classification Using a Partial Kullback-Leibler Information Measure. IEEE Trans. Biomed. Eng. 2013, 60, 853–861. [Google Scholar] [CrossRef] [PubMed]
  12. Naik, G.R.; Kumar, D.K. Subtle Electromyographic Pattern Recognition for Finger Movements: A Pilot Study Using BSS Techniques. J. Mech. Med. Biol. 2012, 12, 1–19. [Google Scholar] [CrossRef]
  13. Naik, G.R.; Kumar, D.K. Identification of Hand and Finger Movements Using Multi Run ICA of Surface Electromyogram. J. Med. Syst. 2012, 36, 841–851. [Google Scholar] [CrossRef] [PubMed]
  14. Naik, G.R.; Nguyen, H.T. Non Negative Matrix Factorisation for the Identification of EMG Finger Movements: Evaluation Using Matrix Analysis. IEEE J. Biomed. Health Inf. 2015, 19, 478–485. [Google Scholar] [CrossRef]
  15. Naik, G.R.; Nguyen, H.T.; Palaniswami, M. Signal Processing Evaluation of Myoelectric Sensor Placement in Low-Level Gestures: Sensitivity Analysis Using Independent Component Analysis. Expert Syst. 2014, 31, 91–99. [Google Scholar] [CrossRef]
  16. Zhang, X.; Chen, X.; Li, Y.; Lantz, V.; Wang, K.; Yang, J. A Framework for Hand Gesture Recognition Based on Accelerometer and EMG Sensors. IEEE Trans. Syst. Man Cybern. A Syst. Hum. 2011, 41, 1064–1076. [Google Scholar] [CrossRef]
  17. Tang, X.; Liu, Y.; Lv, C.; Sun, D. Hand Motion Classification Using a Multi-Channel Surface Electromyography Sensor. Sensors 2012, 12, 1130–1147. [Google Scholar] [CrossRef]
  18. Rafiee, J.; Rafiee, M.A.; Yavari, F.; Schoen, M.P. Feature Extraction of Forearm EMG Signals for Prosthetics. Expert Syst. Appl. 2011, 38, 4058–4067. [Google Scholar] [CrossRef]
  19. Huxley, A.F. Muscle Structure and Theories of Contraction. Prog. Biophys. Biophys. Chem. 1957, 7, 255–318. [Google Scholar] [PubMed]
  20. Huxley, A.F.; Simmons, R.M. Proposed Mechanism of Force Generation in Striated Muscle. Nature 1971, 233, 533–538. [Google Scholar] [CrossRef] [PubMed]
  21. Hill, A.V. The Heat of Shortening and the Dynamic Constants of Muscle. Proc. R. Soc. Lond. Ser. B Biol. Sci. 1938, 126, 136–195. [Google Scholar] [CrossRef]
  22. Cavallaro, E.; Rosen, J.; Perry, J.C.; Burns, S.; Hannaford, B. Hill-Based Model as a Myoprocessor for a Neural Controlled Powered Exoskeleton Arm-Parameters Optimization. In Proceedings of the 2005 IEEE International Conference on Robotics and Automation, Barcelona, Spain, 18–22 April 2005; pp. 4525–4530.
  23. Pang, M.; Guo, S.; Ishihara, H.; Hirata, H. Electromyography-Based Quantitative Representation Method for Upper-Limb Elbow Joint Angle in Sagittal Plane. J. Med. Biol. Eng. 2015, in press. [Google Scholar]
  24. Manal, K.; Buchanan, T.S. A One-Parameter Neural Activation to Muscle Activation Model: Estimating Isometric Joint Moments from Electromyograms. J. Biomech. 2003, 36, 1197–1202. [Google Scholar] [CrossRef] [PubMed]
  25. Song, Z.; Guo, S.; Pang, M.; Zhang, S. Study on Recognition of Upper Limb Motion Pattern Using Surface EMG Signals for Bilateral Rehabilitation. In Proceedings of the 23rd 2012 International Symposium on Micro-Nano Mechatronics and Human Science, Nagoya, Japan, 4–7 November 2012; pp. 425–430.
  26. Peng, C.K.; Havlin, S.; Stanley, H.E.; Goldberger, A.L. Quantification of Scaling Exponents and Crossover Phenomena in Nonstationary Heartbeat Time Series. Chaos 1995, 5, 82–87. [Google Scholar] [CrossRef] [PubMed]
  27. Buchanan, T.S.; Lloyd, D.G.; Manal, K.; Besier, T.F. Neuromusculoskeletal Modeling: Estimation of Muscle Forces and Joint Moments and Movements from Measurements of Neural Command. J. Appl. Biomech. 2004, 20, 367–395. [Google Scholar] [PubMed]
  28. Cortes, C.; Vapnik, V. Support-Vector Networks. Mach. Learn. 1995, 20, 273–297. [Google Scholar]
  29. Platt, J.C. Fast Training of Support Vector Machines Using Sequential Minimal Optimization. In Advances in Kernel Methods-Support Vector Learning; Scholkopf, B., Burges, C.J.C., Smola, A.J., Eds.; The MIT Press: Cambridge, MA, USA, 1998; pp. 185–208. [Google Scholar]
  30. Bishop, C.M. Pattern Recognition and Machine Learning; Springer: New York, NY, USA, 2006; pp. 325–356. [Google Scholar]
  31. SENIAM Project. Available online: http://www.seniam.org/ (accessed on 8 December 2014).
  32. Lew, H.L.; Tsai, S.J. Johnson’s Practical Electromyography, 4th ed.; Lippincott Williams & Wilkins: Philadelphia, PA, USA, 2007; pp. 156–180. [Google Scholar]
  33. Chang, C.C.; Lin, C.J. LIBSVM: A Library for Support Vector Machines. ACM Trans. Intell. Syst. Technol. 2011, 2. [Google Scholar] [CrossRef]

Share and Cite

MDPI and ACS Style

Guo, S.; Pang, M.; Gao, B.; Hirata, H.; Ishihara, H. Comparison of sEMG-Based Feature Extraction and Motion Classification Methods for Upper-Limb Movement. Sensors 2015, 15, 9022-9038. https://doi.org/10.3390/s150409022

AMA Style

Guo S, Pang M, Gao B, Hirata H, Ishihara H. Comparison of sEMG-Based Feature Extraction and Motion Classification Methods for Upper-Limb Movement. Sensors. 2015; 15(4):9022-9038. https://doi.org/10.3390/s150409022

Chicago/Turabian Style

Guo, Shuxiang, Muye Pang, Baofeng Gao, Hideyuki Hirata, and Hidenori Ishihara. 2015. "Comparison of sEMG-Based Feature Extraction and Motion Classification Methods for Upper-Limb Movement" Sensors 15, no. 4: 9022-9038. https://doi.org/10.3390/s150409022

APA Style

Guo, S., Pang, M., Gao, B., Hirata, H., & Ishihara, H. (2015). Comparison of sEMG-Based Feature Extraction and Motion Classification Methods for Upper-Limb Movement. Sensors, 15(4), 9022-9038. https://doi.org/10.3390/s150409022

Article Metrics

Back to TopTop