1. Introduction
Facial muscle movements are a means of nonverbal communication that can convey emotions, intentions and messages through the movements of the eyes, eyebrows, lips and other facial muscles. They have essential applications in emotion recognition [
1,
2], human–computer interaction [
3,
4,
5], psychotherapy [
6], acting [
7] and socialization [
8]. For example, in acting, actors can use facial movements to express the emotions and characteristics of their roles; in psychotherapy, facial action can be used as a cue to help psychologists understand patients’ inner feelings and emotions.
There are two main traditional facial-movement-recognition methods, facial-image analysis [
9,
10] and video analysis [
11,
12], which rely on computer-vision algorithms. So far, computer-vision methods have achieved a high recognition performance for this task. However, there are high requirements for the recording environment. The recognition accuracy will be affected by the camera angle, image resolution and lighting conditions. To avoid the impact of these external factors, physiological signals, such as electromyography (EMG), electroencephalography (EEG), electrocardiography (ECG), etc., become another means of facial movement detection.
EMG refers to the electrical activity generated when the muscle motor unit in the muscle contracts and expands under the control of the nervous system [
13]. These electrical activities objectively describe the anatomical and physiological characteristics of the muscle. According to different collection methods, electrodes can be divided into invasive and non-invasive. The invasive acquisition method needs to insert needle-shaped electrodes into the muscle. Although they can avoid signal crosstalk between muscles and obtain more accurate signal changes, needle-shaped electrodes penetrate the skin and could easily cause trauma to the skin. Conversely, the non-invasive surface electromyography (sEMG) signal-acquisition method involves placing a metal electrode on the skin surface of the measured muscle source. It measures the weak voltage generated during muscle contraction. This non-invasive approach can capture temporal or intensity information related to superficial muscle activation. This collection method is safer and easier to operate, so most experiments use the non-invasive sEMG collection method. For example, when measuring the EMG signal generated via facial movements for smiling, electrodes are placed on the skin surface of the zygomaticus major muscle; for the measurement of frowning, the electrodes are placed on the skin surface of the corrugator supercilii muscle.
In recent years, many efforts have been devoted to developing EMG signal-acquisition systems, of which some commercial devices [
14,
15] are representative. Still, they are generally expensive and have a limited number of channels. In addition, there is still room for further improvement in resolution, portability and data-transmission methods. To solve these problems, scholars have been actively exploring new solutions. For example, Fuentes et al. [
16] designed a low-cost EMG sensor and compared it with a commercial Delsys system. In the study, the authors used three indicators to estimate the similarity between the output signals of two systems. The results show that the designed low-cost and commercial systems agree well. Yang et al. [
17] designed a low-cost, high-sampling-rate and multi-channel EMG acquisition system, which uses wearable EMG sensors combined with a microcontroller unit and Wi-Fi module and the collected signal data are immediately forwarded to the host for further processing. In addition, it has been verified with commercial systems and the correlation coefficients are above 0.8. Zhu et al. [
18] developed an eight-channel wireless acquisition system with a sampling rate of 2 KHz and a resolution of 24 bits and designed a hardware filter and a software filter to improve the accuracy of the measurement system. Moreover, the acquisition unit is packaged in a small 3D-printed shell, which is more convenient to wear. However, this device can only measure a few muscle movements simultaneously due to the limited number of channels. In summary, while previous studies have addressed some issues, challenges remain in efficiently assessing facial-muscle movements. Therefore, while ensuring low cost, we aim to enhance the performance of EMG acquisition devices further to realize capabilities such as multi-channel, high-resolution and wireless transmission.
EMG acquisition devices can be applied to facial-muscle-movement classification and have achieved high recognition rates [
19,
20,
21,
22,
23]. But previously they placed the electrodes directly on the source of the facial muscles during the experiment. Although this placement ensures high-quality acquisition of EMG signals, it also brings some troubles: electrodes, instruments and tapes will give the subject an unnatural feeling and the electrodes’ weight will inhibit facial movements. In addition, the electrodes attached to the face may be displaced by muscle movement. To solve the problems mentioned above, A. Gruebler et al. [
24] proposed a wearable device that utilizes EMG signals for facial-expression detection. In their study, the authors investigated the placement of electrodes on the face to determine the optimal method for signal acquisition. The experimental results demonstrated that the placement of the distal electrode achieved an average recognition rate of 98% for smiles and 96% for frowns. Although they only recognized two facial expressions, the results showed that the recognition performance could also be high with respect to placing the distal electrodes. Another study by Monica et al. [
25] employed distal EMG signals and computer-vision techniques to detect smile action units while ensuring the electrodes did not obstruct the face. The results displayed an accuracy of 81% for AU6 (cheek Lift) and 82% for AU12 (the corners of the mouth pull up). Regrettably, they use limited channels and fewer facial movements were recognized. In this paper, we increased the number of channels for acquiring EMG signals and the number of facial movements identified.
Based on the previous discussion, we designed an EMG signal-acquisition system for capturing the movement signals of facial muscles, including two main components: (1) a multi-channel EMG signal-acquisition device and (2) signal-receiver software with real-time display and data processing. To verify the reliability and practicality of the system, we conducted two experiments: a comparison with a commercial system for reliability assessment, and categorizing facial-muscle movements to demonstrate the practicality. The main contributions of this paper are as follows:
- (1)
The proposed EMG signal-acquisition system is affordable, with numerous channels, enabling measurement across various facial regions. Its wireless transmission enhances portability and leaves an expansion interface so other users can use it to realize more data-acquisition channels. In addition, the device is equipped with hardware unipolar and bipolar switching.
- (2)
We propose a multichannel EMG acquisition scheme around the face that simultaneously acquires multiple facial-muscle movements, providing the possibility of a realistic and natural facial muscle assessment.
- (3)
We collected 484 EMG samples and classified muscle actions using traditional machine learning and neural networks. The experiment showed machine learning’s superiority in classifying muscle action from EMG signals, with high recognition rates even for electrodes not directly placed on the facial-muscle source.
The rest of this paper is organized as follows.
Section 2 describes the hardware and software design of the EMG acquisition system;
Section 3 presents the details of the experiments comparing the EMG acquisition system proposed in this paper with a commercial system;
Section 4 describes the experiments on facial-movement recognition;
Section 5 discusses the features and drawbacks of the EMG signal-acquisition system;
Section 6 summarizes the paper and discusses future work.
3. Equipment-Comparison Experiment for Reliability
The primary purpose of this experiment is to verify the reliability of the EMG acquisition module proposed in this paper by comparing it with a commercial system MP160 produced by BIOPAC [
30].
Table 1 lists the characteristics of the two devices. The commercial system consists of three modules: the EMG data-acquisition unit MP160, the advanced converter module HLT100C and a laptop computer installed with the supporting software AcqKnowledge 5.0. The advanced converter module and the laptop are connected via a network cable to form a local area network (LAN) and the data acquisition module MP160 sends the acquired data to the computer via Wi-Fi for reception, real-time display and storage.
3.1. Experimental Procedure
The purpose of this experiment is to verify the reliability of the device we designed. We recruited four subjects and to ensure that the signals collected by the two devices were comparable, we selected four symmetrical muscles on the face, namely the corrugator supercilii, orbicularis oculi, zygomaticus and orbicularis oris. We placed both systems on symmetrical muscle sources on the left and right sides of the participant’s face to ensure symmetrical electrode placement. Based on these four muscles, we selected four corresponding facial movements: frowning, blinking, smiling and pouting.
To ensure consistency between the acquisition methods of the two systems, we used the bipolar method to acquire EMG signals, mainly because the MP160 module collects signals in a bipolar manner. Moreover, the signal obtained in the bipolar way represents the potential difference between the positive and negative electrodes. This approach partially mitigates signal crosstalk among muscles and improves the received signal’s quality.
The experimental procedure is as follows:
- (1)
First, the subjects needed to understand the four facial movements for comparison and be able to display them.
- (2)
Second, we placed a pair of electrodes from both devices on top of the frowning muscles on the left and right sides of the participant’s face, respectively. Next, the subjects made three frowning movements were addressed. After that, we exchanged the position of the electrodes on the left and right sides of the participant’s face and asked the participant to repeat the frowning action three times. This approach can reduce the interference caused by the asymmetry of facial movements to a certain extent.
- (3)
Subsequently, the subjects completed the acquisition of EMG data for blinking, smiling and pouting in this way.
3.2. Index Evaluation
To better evaluate the difference between the commercial EMG acquisition equipment and the EMG acquisition equipment proposed in this paper, we selected four indicators of the Spearman correlation coefficient, energy ratio, linear correlation coefficient () and cross-correlation coefficient () to evaluate the correlation between the obtained data.
Spearman’s correlation is a non-parametric statistic used to assess the degree of association between two variables and is commonly used to measure a monotonic relationship between two variables. The Spearman correlation coefficient takes values between −1 and 1, where −1 indicates a completely inverse monotonic relationship, 0 indicates no monotonic relationship and 1 indicates an entirely positive monotonic relationship.
The energy ratio is usually used to compare the strength or energy of two EMG signals. The calculation formula is shown below.
E denotes the energy and takes values ranging from 0 to 1, with 0 indicating no similarity and 1 denoting complete similarity and compares the magnitude of the energy level step by step.
denotes the amplitude of the signals at the time point t.
Linear Correlation Coefficient (LCC): The
is calculated using the following equation:
x represents the value of the signal captured by the system proposed in this paper and
y represents the value captured by the commercial system. The values are taken in the range between −1 and 1. When
, it is a positive correlation. If
, it is a negative correlation. If
, there is no correlation between the two variables and it is an uncorrelated relationship.
The cross-correlation coefficient (CCC) is a broad method of comparing EMG signals.
values range from −1 to 1, where −1 indicates a negative correlation and 1 is a positive correlation.
3.3. Result
We used the correlation coefficient to evaluate the difference between our designed system and the commercial system after the same processing of the acquired EMG signal.
Table 2 shows the evaluation results of the two systems under four facial-muscle movements. In the evaluation index of energy ratio, the similarity between the two is the highest. The similarity of the four indicators is above 70%, indicating that the signals collected by the two systems have good consistency. To show the difference between the two devices, we chose the comparison data when one of the subjects performed four facial-muscle movements. As shown in
Figure 3, when frowning and blinking, there is less signal difference between the two devices; when smiling and pouting, our proposed device is more susceptible to sharp peaks.
4. EMG-Based Facial-Muscle-Movement Recognition
To demonstrate the practical value of the EMG acquisition system we designed, we conducted a facial-muscle-movement-recognition task. The task was different from previous facial-movement-recognition tasks using facial EMG (placing electrodes on top of the facial-movement source muscle to collect data). As shown in
Figure 1, we placed electrodes around the face to collect crosstalk signals from the source muscle at the distal end using volume conduction between the muscles. The overall architecture of data analysis is shown in
Figure 4. The specific experimental procedure is described below.
4.1. Electrode Placement
In our experiments, 18 electrodes (Ag/AgCl) were used. One is the reference electrode, another is the negative electrode and the rest were used to collect EMG signals. We placed all electrodes around the face to ensure unrestricted facial-muscle movement. The reference electrode is placed at a more neutral position relative to the face, i.e., behind the ear, while the negative electrode was placed at the forehead position. The remaining electrodes were placed on the left and right sides of the face, labeled L1–L8 and R1–R8. This electrode arrangement is referred to as a distal arrangement because the electrodes are not placed directly on the muscle sources of facial movements. Therefore, the acquired signals belong to the crosstalk signals propagated from the facial-muscle sources to the periphery of the face and such crosstalk signals may propagate in more than one direction. In addition, we considered the weight of the electrodes and the distance between the electrodes and finally chose to use 16 electrodes for signal acquisition.
4.2. Experimental Setup
We recruited ten healthy subjects, including eight males and two females, with an average age of 23.5 years, who actively participated in the experiment that lasted one hour. Before the start of the experiment, to reduce the impedance between the electrodes and the skin to obtain a clean sEMG signal, we cleaned the facial skin of the subjects using wipes containing alcohol. With the cleaning of the face completed, we followed the steps in Experiment 2.1 and distributed 16 electrodes evenly on both sides of the subject’s face, with eight electrodes on each side.
The experiment is described as shown in
Figure 5. After the electrodes were attached, the subjects imitated the five facial-muscle movements (raising the eyebrows, frowning, closing the eyes, pursing the lips and lifting the corners of the mouth) shown on the computer screen. During the experiment, the computer screen displayed each facial-muscle action in two ways, normal and forceful. Then, we used the Psychopy program to instruct the subjects to display facial-muscle movements at these two intensities. The steps of the experiment are as follows:
Step 1: First, the program displays a picture of a facial-muscle movement on the computer screen for a duration of 2 s.
Step 2: Subsequently, an instructional word for facial-muscle-movement intensity (“normal”) was presented on the screen, accompanied by a sound.
Step 3: Subjects display the facial movement described in Steps 1 and 2 for 2 s. After that, the subjects had 3 s of muscle relaxation time.
Step 4: The screen then presented an instructional word (“forceful”) for the facial-muscle-movement intensity accompanied by a sound.
Step 5: Subjects followed Steps 1 and 4 to display the appropriate facial movement for 2 s. After the sound ended, they again returned to their natural state.
The above five steps are executed five times to form a block, capturing sEMG signals of different intensities for five facial-muscle actions. The whole experiment was divided into three blocks, with the same sequence of facial-muscle movements within each block and a 30 s rest period between different blocks. The experiment was divided into three blocks.
4.3. Data Pre-Processing
The pre-processing of the EMG data is realized via Matlab R2021b. The EMG data we collected first passed through a notch filter to remove power-line interference. After that, they passed through a fourth-order Butterworth bandpass filter with a frequency range of 20–350 Hz to preserve the main energy frequencies of the sEMG signals. Next, to make the EMG signals closer to the variations of the original muscle activity, we performed a direct current (DC) removal to improve the signal quality. Subsequently, we used a full-wave rectification to highlight the amplitude variation of the signal. Then, we used a low-pass filter to smooth the signal to obtain a linear signal envelope. Finally, the preprocessed data were used to obtain eight correlation components for feature extraction using the ICA algorithm.
During the experimental acquisition, we utilized a camera to record video of the subjects’ faces. This approach aimed to enhance labeling efficiency when associating EMG signals with specific facial-muscle movements.
Figure 6 illustrates the number of labeled samples for five facial-muscle actions. The results indicate that, under forceful conditions, only three actions—puckering lips, raising eyebrows and lifting mouth corners—reached the desired 60 samples. Closing eyes had the fewest samples among the other facial actions, with only 44. Under normal conditions, the annotated sample count for all five facial actions did not reach the desired 60. The number of samples for frowning was about half of the ideal number. The number for closed eyes is the lowest, only 10.
Combined with the video analysis, we found that due to individual differences, some subjects had smaller movement amplitude when accomplishing closing eye and frowning movements. The electrodes placed around the face could hardly capture the signals. Although
Figure 6 shows that we collected ten samples of closing eye (normal) data. By analyzing the video recordings of the subjects, we found that some of the subjects were actually closing their eyes (forceful) while performing the closing eye (normal). For data consistency, we still consider it a closing eye (normal). After excluding these invalid facial-muscle-movement data, we obtained 484 EMG signal samples for subsequent experimental analysis.
4.4. Source Separation
Due to the placement of electrodes around the face instead of directly on the muscle source responsible for facial movements, the measured facial muscle activity is obtained remotely. Consequently, when facial movements occur, the signal captured by an electrode can represent a combination of signals originating from multiple muscle sources to varying degrees. The Independent Component Analysis (ICA) algorithm can separate these mixed signals and identify their contributions. Applying the ICA algorithm makes it possible to disentangle the underlying muscle sources and enhance the accuracy of signal analysis in facial EMG studies.
Given a set of EMG signals (observations)
,
,
,
, where t is time and n is the number of electrodes, they are assumed to be a linear mixture of independent components.
where A is the
mixing matrix,
,
,
,
is the original signal generated by the muscle source and the purpose of using ICA is to decompose the original signal
,
from the mixed signal
,
.
where
is an unmixing matrix of size
. This algorithm revealed the independent component (IC) weight corresponding to EMG in the mixed signal acquired by each electrode.
4.5. Feature Extraction
Feature selection is an essential step in EMG-based facial-muscle-movement recognition. To fully characterize the EMG signal, five time-domain features (IEMG, VAR, MAV, SSI, RMS) and three frequency-domain features (FC, MF, FRMS) are selected, as listed in
Table 3. During the ICA separation process, the original 16-channel data are transformed, resulting in eight components. We systematically compute eight specific features for each component, resulting in 64 features for facial-muscle movement. These features are then combined and fed into a classifier for facial-muscle-movement classification.
4.6. Classification Implementation
Support Vector Machine (SVM) is a commonly used supervised learning method in machine learning, first proposed by Cortes and Vapnik in 1995 [
31]. SVM is a powerful tool for dealing with small-sample, nonlinear problems. The most significant advantage of SVM is that it only needs a small number of samples to achieve good results. The core of SVM classification is to generate a hyperplane to classify the data. It involves introducing a kernel function to calculate the interval of the data pre-classification plane, aiming to seek a hyperplane that maximally separates different data classes. The mathematical expression of the hyperplane can be expressed as follows:
where
x is the vector on the hyperplane; w is the normal vector of the hyperplane;
b is the intercept of the hyperplane. By solving the quadratic optimization problem, the values of
w and b are obtained as:
where
C is the penalty parameter;
N is the number of samples;
is the Lagrange multiplier vector;
y is the label value. After we obtain the optimal solution
, we choose a component
of
that satisfies condition
. We calculated
and
.
then the classification decision function formula is obtained as follows:
In this paper, a Gaussian kernel function is used as the kernel function of the SVM classifier, which extends the SVM from a linear problem to a nonlinear problem. In order to be able to perform multiple classifications, we adopt the “one pair redundancy” method. That is, we assume that the
mth class of samples is a positive class and the other classes are regarded as negative classes and the final decision function is:
Given a test sample x, take it into the above equation and take the maximum value as the classification result of SVM.
The random forest algorithm is an ensemble learning approach that combines bagging and random feature-selection techniques. Conceived by Breiman in 2001 [
32], this method has entrenched itself as a staple within machine learning. It exhibits versatility, serving as a regression, prediction and feature-selection tool. Central to random forest’s mechanics is the assembly of a classifier through the orchestrated collaboration of multiple decision trees. These decision trees collectively undertake the training and prediction of samples. Each decision tree draws from an independent set of values, engendering diversity through employing a random vector. This vector, derived from a pre-defined probability distribution, shapes the distinctiveness of each tree. Following establishing a forest comprising a specified number of decision trees, the final prediction ensues through a straightforward majority-voting process. This algorithm’s sophistication lies in its ability to harness diverse decision trees, amplifying its predictive capabilities.
Backpropagation neural network (BPNN) is a common type of artificial neural network. It is a multi-layer feedforward neural network composed of an input, hidden and output layer. The training process of this network involves two key steps: forward propagation and backpropagation. During the forward-propagation phase, input data pass through the network layers sequentially, generating computed outputs. Subsequently, in the backpropagation phase, the network’s weights are adjusted using errors by comparing the actual outputs with the expected outputs. Through multiple iterations of these two steps, the network gradually refines its weights to approach the desired results. The BP neural network is renowned for its powerful learning and approximation capabilities and it finds extensive applications in areas such as pattern recognition, predictive analysis, classification tasks and more.
4.7. Result of the Facial-Muscle-Movement Recognition
In this experiment, the experimental design mentioned five facial-muscle movements at two different intensities, so we performed two categorization tasks. One was to classify the five facial-muscle movements (forceful) and the other was to classify ten facial-muscle movements (normal and forceful). The data used for classification had 484 samples, of which the samples of facial-muscle movements (forceful) had 276. Then, we chose SVM, random forest and BPNN as classifiers. In addition, to verify the reliability of the classification results, we used 80% of the data for training and 20% for testing randomly. Three metrics, accuracy, recall and F1 score, were used to evaluate the performance of the classifiers.
Table 4 illustrates the classification results of all the models on the test set. Among the classification results of five facial-muscle movements, the accuracy rates of SVM, random forest and BPNN are 82.14%, 91.79% and 64.28%, respectively. Among the classification results of ten facial-muscle movements, SVM, random forest and BPNN have 71.13%, 77.31% and 52.57% correct classification rates, respectively.
Figure 7 compares the confusion matrices of the three classifiers for the two classification tasks. The confusion matrix clearly lists the correct and incorrect classifications: the diagonal elements represent the number of correctly classified samples for each category and the elements not above the diagonal represent the number of incorrectly classified samples. Comprehensively comparing the two classification tasks, random forest has the best classification results compared to BPNN and SVM. We analyze the classification results of the random forest. Under the intensity of forcefulness, the highest correct classification rate is for closing the eyes and pursing the lips. The lifting of the corners of the mouth is sometimes incorrectly classified as pursing the lips because they have a similarity in the appearance of the facial movement. Raised eyebrows and frowns had some data incorrectly categorized as pursed lips, possibly due to problems with the data themselves. Among the results of the classification of ten facial-muscle movements (normal and forceful), normal eye closure achieved the highest correct classification rate. Partially forceful eye closure was classified as normal eye closure and partially normal pursing of the lips was classified as forceful pursing because subjects did not distinguish between normal and forceful when performing the movements. Partially normal pursing of the lips was also misclassified as normal lifting of the corners of the mouth. All other facial-muscle movements were misclassified in a small fraction of the data, with frowning having the worst results.
5. Discussion
In the past few years, EMG signals have played an essential role in the biomedical field. With the development of technology, many EMG signal-acquisition solutions have been designed. However, the commonly used EMG signal-acquisition devices in the market still have room for improvement in price, number of channels and expansion performance. For this reason, we have designed a low-cost, multi-channel, scalable device. Next, we will discuss this device more deeply based on its features and experiments.
In the design of this system, we employed the daisy chain mode of the ADS1299 chip to realize multi-channel data collection. In theory, this approach can support more channels, but it requires attention to several key considerations. Firstly, as the number of connected chips increases, the transmission delay between chips becomes more pronounced, so special data-transmission processing between chips is required to reduce the delay. Secondly, multi-channel data collection increases data volume, making the size of the controller’s memory particularly crucial. A larger memory provides a more extensive data buffer space, which aids in optimizing the performance coordination between data collection, processing and transmission, especially in dealing with the instability of network transmission. Lastly, it is important to note that the maximum Wi-Fi transmission speed in the current system is limited to 2.9 Mbps/s. We will explore better wireless transmission solutions to improve data-transfer efficiency in future designs.
In addition to expanding the number of channels of the EMG acquisition module, the module also considers the switching between unipolar and bipolar acquisition methods to increase the system’s flexibility. If the experiment only needs to measure the activity of a single muscle or a few muscles, bipolar acquisition is a better choice. Compared to unipolar, the bipolar method can effectively suppress signal crosstalk between different muscles. However, if the experiment needs to measure the entire facial expression, we recommend a unipolar configuration for the periphery of the face. The main reason for this is that the bipolar approach requires the use of more electrodes covering the facial-muscle sources, which may not only feel unnatural to the subject but may also interfere with the normal movement of the facial-muscle sources due to factors such as the weight and size of the electrodes.
In the facial-muscle-movement recognition experiment, we used a unipolar approach with 16 electrodes evenly distributed around the face. This electrode placement did not interfere with the movement of the facial-muscle sources and the subjects could show facial-muscle movements more naturally. However, this method also has some limitations because the electrodes are not directly placed on the facial-muscle sources, so the acquired signals may be relatively weak and even the signals of some facial-muscle movements are almost unable to be acquired, such as blinking (normal) and frowning (normal). In contrast, the EMG signals of the frontalis, orbicularis oculi and zygomaticus muscles, both in the normal state and under exertion, can be effectively captured according to the labeling results. These labeling results indicate that this electrode configuration is feasible for evaluating facial-muscle movements, further providing a solution for facial-expression analysis limited by small sample problems, such as composite expressions, pain studies and micro-expressions. Specifically, with the perifacial EMG device, we can efficiently annotate facial-muscle movements by temporal and spatial variations of facial-muscle signals without affecting facial-image acquisition. This annotation can provide a large amount of sample data to train intelligent analysis of facial expressions under specific tasks.
Moreover, some aspects of this collection method can be explored in depth, starting with the size of the electrodes. In the experiments, we used larger-size Ag/AgCl electrodes. If some smaller-sized flexible electrodes can be used in the future, more electrodes can be placed around the face, thus potentially acquiring finer signals of facial-muscle movements. Secondly, the distance of the electrodes from the facial-muscle source needs to be considered to ensure that the electrodes do not interfere with facial-muscle movement but also to ensure that the full range of facial-muscle-movement signals is captured. If used with computer-vision methods, special consideration needs to be given to the distance between the electrodes and the facial-muscle source. All these issues need to be gradually verified and thoroughly investigated in future experiments to further enhance the performance and application prospects of the EMG signal-acquisition system.
Eventually, we chose a dedicated laboratory for data acquisition, in which only the necessary equipment required for the experiment was kept to minimize the impact of electromagnetic interference, wireless signal interference, etc., on the results. However, there are some challenges in real-world applications, such as environmental diversity, complex wireless and electromagnetic interference in cities, hardware failure or performance degradation, etc., which may negatively affect the proper operation of the equipment. All of these issues will require continuous testing in the future to improve the circuit design of the device. It is also difficult to avoid being affected by the outliers, known as noise or disturbances. To minimize these effects, we can use a variety of methods. First, data-processing techniques, such as smoothing, filtering, interpolation, etc., can be utilized. Secondly, the effect of individual outliers on the results can be reduced by multiple measurements. Finally, we can optimize the algorithm of the signal-acquisition system to make it more robust and better resistant to the effects of noise and outliers.
6. Conclusions and Future Work
This paper presents the design and implementation of a high-resolution, 16-channel wireless EMG measurement system. The multi-channel approach allows the system to measure different muscle activities of an individual simultaneously, increasing the system’s flexibility and applicability. The acquired data can be transferred to a computer via Wi-Fi for display, storage and further analysis. In addition, we compared this designed acquisition system with a commercial device and the correlation between the signals acquired by both devices was above 70%, which illustrates the reliability of our designed device. Finally, we placed electrodes around the periphery of the face to capture crosstalk signals propagating to the edge of the face during facial-muscle movements. We compared three different classifiers to categorize five facial activities, with the random forest providing the best recognition. This study demonstrates the ability of this EMG signal-acquisition device by us to recognize facial movements using distal EMG.
In our future work, we will utilize the proposed EMG signal-acquisition device and its distal electrode configuration scheme to enhance the detection and recognition of facial-muscle movements, achieving a more comprehensive assessment of facial muscle activity. Our electrode-placement method does not obstruct the face, offering a novel approach to the design of wearable facial EMG signal-acquisition devices. Furthermore, due to its non-obstructive nature, this facial EMG signal-acquisition method can be multimodal integrated with computer vision, thereby enhancing the performance of facial-muscle-movement detection and recognition. This technology holds potential application value in hardware-based facial recognition computation and criminal investigation.