Article

Transfer Learning and Deep Neural Networks for Robust Intersubject Hand Movement Detection from EEG Signals

1
College of Engineering, Science and Environment, University of Newcastle, Callaghan, NSW 2308, Australia
2
Engineering Cluster, Singapore Institute of Technology, Singapore 138683, Singapore
3
Engineering Product Development, Science, Mathematics and Technology, Singapore University of Technology and Design, Singapore 487372, Singapore
*
Authors to whom correspondence should be addressed.
Appl. Sci. 2024, 14(17), 8091; https://doi.org/10.3390/app14178091
Submission received: 12 July 2024 / Revised: 13 August 2024 / Accepted: 24 August 2024 / Published: 9 September 2024
(This article belongs to the Section Applied Biosciences and Bioengineering)

Abstract

In this research, five systems were developed to classify four distinct motor functions—forward hand movement (FW), grasp (GP), release (RL), and reverse hand movement (RV)—from EEG signals, using the WAY-EEG-GAL dataset where participants performed a sequence of hand movements. During preprocessing, band-pass filtering was applied to remove artifacts and focus on the mu and beta frequency bands. The initial system, a preliminary study model, explored the overall framework of EEG signal processing and classification, utilizing time-domain features such as variance and frequency-domain features such as alpha and beta power, with a KNN model for classification. Insights from this study informed the development of a baseline system, which innovatively combined the common spatial patterns (CSP) method with continuous wavelet transform (CWT) for feature extraction and employed a GoogLeNet classifier with transfer learning. This system classified six unique pairs of events derived from the four motor functions, achieving remarkable accuracy, with the highest being 99.73% for the GP–RV pair and the lowest 80.87% for the FW–GP pair in intersubject classification. Building on this success, three additional systems were developed for four-way classification. The final model, ML-CSP-OVR, demonstrated the highest intersubject classification accuracy of 78.08% using all combined data and 76.39% for leave-one-out intersubject classification. This proposed model, featuring a novel combination of CSP-OVR, CWT, and GoogLeNet, represents a significant advancement in the field, showcasing strong potential as a general system for motor imagery (MI) tasks that is not dependent on the subject. This work highlights the prominence of the research contribution by demonstrating the effectiveness and robustness of the proposed approach in achieving high classification accuracy across different motor functions and subjects.

1. Introduction

A Brain–Computer Interface (BCI) system serves as a bridge between the brain’s biophysical signals and external devices that can be controlled using these signals. These signals are typically acquired through methods such as Electroencephalograms (EEGs), functional Magnetic Resonance Imaging (fMRI), or Positron Emission Tomography (PET) [1]. The primary motivation behind developing BCI systems is to assist individuals who have lost motor abilities due to spinal injuries or paralysis. By accurately interpreting brain signals, these individuals can potentially regain motor functions using prosthetics. Such impairments often result from spinal cord injuries, strokes, or neurodegenerative diseases, leading to a significant loss of independence and quality of life. BCIs can translate neural activity into commands for prosthetic limbs, wheelchairs, and other assistive technologies, thereby restoring a degree of mobility and autonomy. One of the key social implications of BCI technology is its potential to drastically improve the quality of life of individuals who have lost their motor abilities. According to the World Health Organization, around 250,000 to 500,000 people suffer a spinal cord injury each year, many of whom experience permanent disability. BCIs can offer these individuals a new avenue for interaction with their environment, which traditional rehabilitation methods might not provide. Case studies highlight the transformative power of BCIs. In one notable instance, a quadriplegic individual was able to control a robotic arm using only their brain signals, allowing them to perform basic tasks such as drinking and eating independently. Another study demonstrated that stroke patients could use BCIs to regain control of their paralyzed limbs after extensive training and therapy. These examples underscore the practical applications of BCIs and their potential to reintegrate individuals into their daily lives. Statistical data reinforce the need for such advancements: according to a report by the National Spinal Cord Injury Statistical Center, the lifetime costs for an individual with a high cervical injury can exceed USD 4 million. BCIs therefore not only promise to improve the quality of life but also offer a cost-effective solution in the long run by reducing the need for continuous care and assistance. In summary, the development and application of BCI systems hold significant social value: they provide individuals who have lost motor functions with a path to regain independence and improve their overall quality of life, underlining the critical role of this technology in modern rehabilitation and assistive care.
BCI systems hold transformative potential for individuals suffering from severe motor impairments due to conditions such as amyotrophic lateral sclerosis (ALS), spinal cord injuries, or stroke. These systems offer a means to regain some level of interaction with the environment, significantly improving their quality of life. Some case studies are explained in detail below.
1. Amyotrophic Lateral Sclerosis (ALS)
One of the most prominent examples of BCI application is in patients with ALS, a progressive neurodegenerative disease that leads to the loss of motor functions. A famous case is that of Stephen Hawking, who, despite severe disability, communicated and wrote books using a computerized speech synthesis system. Modern BCIs can further enhance such capabilities, allowing for direct brain-to-computer communication without the need for physical input. According to the ALS Association, approximately 5000 people in the United States are diagnosed with ALS each year. The average life expectancy of a person with ALS is 2–5 years from the time of diagnosis, emphasizing the urgent need for assistive technologies such as BCI to maintain communication abilities as motor functions decline.
2. Spinal Cord Injuries
Consider a study conducted by the University of Pittsburgh, where a patient with quadriplegia (complete paralysis) was able to control a robotic arm using a BCI to perform tasks such as feeding themselves. This represents a significant breakthrough in assistive technology, as it allows individuals with no motor control to regain a level of autonomy. The World Health Organization estimates that between 250,000 and 500,000 people suffer spinal cord injuries each year, with many losing the ability to move and requiring lifelong care. BCIs could drastically reduce the burden on caregivers by enabling greater independence.
3. Stroke Rehabilitation
BCI systems have also been used in stroke rehabilitation. For example, the work of researchers at the University of Tübingen in Germany showed that stroke patients who used a BCI to control a virtual hand saw improvements in motor function. This approach leverages neuroplasticity, helping the brain to rewire itself and restore lost capabilities. The American Stroke Association reports that about 800,000 people in the United States have a stroke each year. Approximately 50% of stroke survivors experience some form of motor impairment, underlining the potential impact of BCIs in rehabilitation.
EEG is a popular technique for acquiring neural signals because it is non-invasive and offers high temporal resolution. EEG neural signals can be categorized by frequency into delta (0.5–4 Hz), theta (4–8 Hz), alpha (8–12 Hz), beta (12–30 Hz), and gamma (30–100 Hz) bands [2,3]. Motor imagery (MI) or motor activities primarily activate the brain’s sensorimotor cortex, leading to sensory motor rhythms (SMRs). These rhythms can be classified using event-related synchronization (ERS) and event-related desynchronization (ERD), which are particularly useful for distinguishing between left- and right-hand movements [4]. Decoding EEG signals involves several steps, starting with preprocessing to remove artifacts such as muscle noise, heart signals (ECG), and ocular movements. Given the non-stationary nature of EEG signals, effective feature extraction techniques such as wavelet transformation are essential [3]. Classification of these features can be performed with or without machine learning models. Various artificial neural networks, including convolutional neural networks (CNNs), recurrent neural networks (RNNs), and recurrent convolutional neural networks (RCNNs), have been successfully employed to classify different events of interest from EEG signals [5]. This project aims to develop a system that utilizes robust feature extraction techniques and machine learning classifiers to accurately classify hand movements. Achieving this goal would significantly contribute to the field of BCIs, especially benefiting individuals who rely on brain–computer interface prosthetics.

2. Literature Review

During the research for this literature review, it was observed that most studies focus on motor imagery (MI) data rather than actual movement data. However, the techniques used with MI can be applicable to actual movement EEG data since MI activates similar brain areas and preserves the same temporal characteristics [6]. Therefore, this section discusses papers using both MI and actual movement data. Decades of research have gone into developing EEG-based BCI systems since Dr. Hans Berger first recorded EEG signals from humans in the 1930s [7]. Despite this extensive research, a universally reliable system has yet to be developed. The primary challenge is the highly non-stationary nature of brain signals, which are prone to contamination by artifacts such as eye movements, heartbeat signals (ECG), and electrical noise. Additionally, these signals are subject-dependent and vary across different trials, making it difficult to classify even between two movement activities from EEG signals. The difficulty increases with multiclass systems [8]. To overcome these challenges, several techniques have been developed. One popular and efficient technique is Common Spatial Patterns (CSP), which was initially developed to distinguish between two classes [9,10]. The CSP algorithm finds spatial filters that maximize the variance of the filtered signal under one condition while minimizing it under the other condition [11]. Paired with an 8–30 Hz broadband filter, CSP has achieved accuracies of 84–94% in discriminating three actions, left-hand movement, right-hand movement, and right-foot movement, within three subjects. An extension of CSP, using pairwise classification and majority voting, has also been described to enable multiclass classification [9]. Other extensions include CSP One-Vs-Rest (OVR), which computes spatial patterns for each class against all others [12,13], and CSP Divide-And-Conquer (DC), which adopts a tree-based classifier approach [13]. Since the effectiveness of CSP generally relies on optimal participant-specific frequency, it is nearly impossible to obtain spatial filters that generalize for optimal discrimination of classes for every person. Several approaches have been employed to address this issue [13,14]. Filter Bank Common Spatial Patterns (FBCSP), which autonomously selects the subject-specific frequency range for band-pass filtering of EEG signals [13]; non-conventional FBCSP, which differentiates classes using a fixed set of four frequency bands, reducing the computational cost [8]; and sliding window discriminative CSP (SWDCSP), which uses a sliding window of overlapping frequencies to filter the EEG signals [15]. This paper proposes a model using CSP OVR and CNN with the WAY-EEG-GAL dataset to distinguish four hand movements. Some closely related papers are explored. In [16], the BCI competition III dataset 3a was used, where each subject performed four different motor imagery tasks (left hand, right hand, foot, and tongue). Forty-three channels were used out of sixty-four available. CSP was applied with five filters ranging from 8 Hz to 28 Hz, with increments of 4 Hz. A convolutional neural network (CNN) was used as a classifier, with 80% of the data for training and 20% for validation, resulting in a validation accuracy of 93.75% for intersubject classification using combined data. The author of [17] used the WAY-EEG-GAL dataset, where twelve participants performed continuous movements of a single limb to grasp and lift an object. 
From these continuous movements, six segments of interest (hand start movement, grasp, lift, hold, replace, and release) were selected for classification. CSP performed six binary classifications on the pairs formed from the six segments. Continuous wavelets transform (CWT) generated scalograms, which were inputs for the CNN network. The classification used was intersubject classification without leave-one-out. The highest accuracy was for the ‘hold’ segment at 96.1%, and the lowest was for ‘replace’ at 92.9%. In [18], the BCI competition IV Dataset 2a was used, where nine subjects performed motor imagery on four movements: left hand, right hand, feet, and tongue. Feature extraction was performed using time-frequency CSP, while Linear Discriminant Analysis (LDA), naïve Bayes, and Support Vector Machine (SVM) were used as classifiers. The results showed that the average computation time was 37.22% shorter than FBCSP, used by the first winner of BCI competition IV, and 4.98% longer than the conventional CSP method. Li and Feng [19] used the WAY-EEG-GAL dataset for six-movement classification, involving HandStart, FirstDigitTouch, BothStartLoadPhase, LiftOff, Replace, and BothReleased. A random forest algorithm identified the important electrodes from the thirty-two available. Wavelet transform extracted features, which were inputs for the CNN classifier. This model achieved an accuracy rate of 93.22%. For a more comprehensive view of EEG classification techniques used in the past decade, consider other methods besides CSP. In [20], a public BCI Research database from NUST (National University of Sciences and Technology, Pakistan) was used. Eight channels (F3, Fz, C3, Cz, C4, Pz, O1, and O2) were selected. Classification of EEG to identify left and right arm movements was conducted using wavelet transformation (WT) and a multilayer perceptron neural network (MPNN or MLP), achieving 88.72% accuracy. Alomari et al. [21] created a publicly available dataset at physionet.org. Eight channels (FC3, FCz, FC4, C3, C1, CZ, C2, and C4) were used out of the sixty-four available. A band-pass filter from 0.5 Hz to 90 Hz and a notch filter to remove 50 Hz line noise were applied. Independent Component Analysis (ICA) filtered out artifacts. MATLAB’s (V14.5) neural networks toolbox built a neural network (NN) with 1–20 hidden layers, and ‘MySVM’ software performed SVM classification. Classification accuracies were 89.8% using NN and 97.1% using SVM for left- and right-hand movements. In [22], EEG data were obtained from a 26-year-old male moving his right hand. Four channels (AF3, F7, F3, and FC5) were used. The EEG signals were pre-processed using a Butterworth band-pass filter (0.5–45 Hz). Feature extraction was performed using WT, Fast Fourier Transform (FFT), and Principal Component Analysis (PCA). Classification was achieved using MPNN to classify three movements: open arm, closed arm, and closed hand. The model, with one hidden layer, ten neurons, and 500 epochs, achieved classification performances of 91.1%, 86.7%, and 85.6% using the three feature extraction methods, respectively. The author of [23] created a custom dataset involving two subjects performing motor imagery of three different right-hand grasp movements. From twenty-four electrodes, the best combination was found using the Genetic Algorithm (GA), based on evolution and natural genetics. Preprocessing involved ICA and a Butterworth High Pass Filter with a 1 Hz cutoff frequency. 
Features included maximum mu-power, beta-power, and frequencies of maximum mu- and beta-power. Among the four NN classifiers, Probabilistic Neural Network (PNN) achieved the best classification accuracy at 61.96%. In [24], a public dataset from fifteen subjects performing six functional movements of a single limb (forearm pronation/supination, hand open/close, and elbow flexion/extension) was used. EEGLAB filtered out irrelevant channels, and ICA identified eye and muscle artifacts. A band-pass filter (8–30 Hz) was applied. PCA standardized and reduced data dimensionality. Wavelet Packet Decomposition (WPD) extracted features, providing multi-level time–frequency decomposition with good time localization. Classification was performed using a Wavelet Neural Network (WNN), typically comprising three layers: input, hidden, and output. This model achieved an intersubject classification accuracy of 86.27% for the six limb movements [24,25]. The field of EEG signal processing has witnessed significant progress in recent years, particularly in feature extraction and classification techniques, driven by the demand for more accurate and real-time applications [26,27,28,29,30,31,32,33] in areas such as brain–computer interfaces (BCIs), healthcare [34,35,36,37,38], and neuroscience research [39,40,41,42]. One of the recent trends is the application of deep learning methods in EEG signal processing [43]. Convolutional Neural Networks (CNNs) have gained popularity due to their ability to automatically extract features from raw EEG data without the need for manual feature engineering [44,45,46,47]. For instance, CNNs have been successfully employed to decode motor imagery tasks in BCIs, leading to improved classification accuracies. Additionally, hybrid models combining CNNs with other deep learning architectures, such as Long Short-Term Memory (LSTM) networks, have been explored to capture both spatial and temporal features of EEG signals, further enhancing performance. Another promising approach involves the use of transfer learning [48,49], which leverages pretrained models on large datasets to improve the generalization of EEG classification models on smaller, domain-specific datasets. This is particularly useful in the context of EEG, where obtaining large, labeled datasets is challenging. Transfer learning has shown potential in tasks such as emotion recognition and mental workload estimation, where it helps in reducing training time and improving model accuracy. In feature extraction, the fusion of traditional methods such as Wavelet Transform (WT) with advanced machine learning techniques has been explored to enhance the representation of EEG signals. For example, continuous wavelet transform (CWT) has been used to capture both time and frequency domain features, which are then classified using machine learning models such as Support Vector Machines (SVMs) or Random Forests. Additionally, methods such as Empirical Mode Decomposition (EMD) have been combined with deep learning models to extract intrinsic mode functions that are more representative of the underlying brain activity. Finally, recent studies have also focused on improving the interpretability of EEG classification models. Explainable AI (XAI) techniques are being integrated into EEG processing pipelines to provide insights into how models make decisions, which is crucial for applications in clinical settings. This not only enhances the trustworthiness of the models but also aids in the discovery of novel neurophysiological patterns.

3. Methodology

The proposed models in this paper (Figure 1) utilize a combination of CSP filtering, CWT, and GoogLeNet classification. We employed the Common Spatial Patterns (CSP) filter, Continuous Wavelet Transform (CWT), and GoogLeNet classifier for their respective strengths in handling EEG signal classification. CSP is particularly effective for binary classification, enhancing discriminative power by maximizing variance differences between classes. CWT was chosen for its ability to analyze non-stationary signals, capturing both temporal and frequency information crucial for understanding EEG dynamics during motor tasks. By transforming signals into scalograms, CWT preserves important features across different time scales for deep learning models. GoogLeNet, a deep convolutional neural network, was utilized for its robust feature learning and classification capabilities. Leveraging transfer learning, GoogLeNet enhances performance even with smaller datasets by using pretrained weights from large datasets, improving generalization across subjects and tasks. This combination addresses EEG classification challenges, such as non-stationarity and subject variability, demonstrating high accuracy in binary and multiclass tasks, making it a robust choice for developing reliable BCI systems for motor imagery and hand movement detection. More detailed explanations of the applied methods (CSP filtering, Continuous Wavelet Transform (CWT), and the GoogLeNet classifier) are provided below.
Common Spatial Pattern (CSP) Filtering
Common Spatial Pattern (CSP) is a feature extraction technique widely used in the analysis of EEG signals, particularly for classifying motor imagery tasks. CSP works by finding spatial filters that maximize the variance of the signal for one class while minimizing it for the other. This makes it highly effective for distinguishing between different mental states or movements. The CSP algorithm begins by computing the covariance matrices of the EEG signals for each class. These matrices are then averaged and decomposed using eigenvalue decomposition to find spatial filters that create new signals with optimal variance separation between the classes. The filtered signals highlight the most discriminative features for classification. In this research, CSP filters were applied to the EEG signals to enhance the differences between various hand movements (e.g., forward movement vs. grasp). The filtered signals serve as the basis for subsequent feature extraction and classification processes.
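As an illustrative sketch only (the actual implementation in this work was carried out in MATLAB), the CSP computation described above can be expressed in Python/NumPy as follows; the array layout and the helper name csp_filters are assumptions made for the example:

import numpy as np
from scipy.linalg import eigh

def csp_filters(trials_a, trials_b, m=3):
    # trials_a, trials_b: arrays of shape (n_trials, n_channels, n_samples), one per class
    def mean_cov(trials):
        covs = [x @ x.T / np.trace(x @ x.T) for x in trials]  # normalized spatial covariances
        return np.mean(covs, axis=0)
    cov_a, cov_b = mean_cov(trials_a), mean_cov(trials_b)
    # Generalized eigendecomposition of the class covariance against the composite covariance
    eigvals, eigvecs = eigh(cov_a, cov_a + cov_b)
    order = np.argsort(eigvals)[::-1]        # sort filters by descending eigenvalue
    w = eigvecs[:, order].T                  # rows of w are spatial filters
    return np.vstack([w[:m], w[-m:]])        # keep the first m and last m filters

Applying the returned filters to a trial (w @ trial) yields time projections whose variances discriminate the two classes.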
Continuous Wavelet Transform (CWT)
Continuous Wavelet Transform (CWT) is a powerful tool for analyzing non-stationary signals such as EEG. Unlike Fourier transform, which only provides frequency information, CWT offers both time and frequency localization, making it ideal for capturing transient features in EEG data. CWT transforms the EEG signals into a time-frequency domain using wavelets, which are functions that can be scaled and translated. The analytic Morse wavelet, known for its good time-frequency localization properties, was chosen for this transformation. The result is a set of wavelet coefficients that represent the signal’s frequency content at different times. These coefficients are then used to generate scalograms—visual representations of the time-frequency characteristics of the EEG signals. Scalograms effectively capture the temporal dynamics and frequency information of the EEG, which are crucial for distinguishing between different motor tasks.
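As a rough Python sketch of the scalogram generation step (this work uses MATLAB’s analytic Morse wavelet; PyWavelets does not provide a Morse wavelet, so a complex Morlet is used here purely as a stand-in, and the number of frequency bins is an assumption):

import numpy as np
import pywt

def scalogram(projection, fs=500.0, wavelet="cmor1.5-1.0", n_freqs=64):
    # projection: 1-D CSP time projection; map wavelet scales onto the 7-30 Hz band of interest
    freqs = np.linspace(7.0, 30.0, n_freqs)
    scales = pywt.central_frequency(wavelet) / (freqs / fs)
    coeffs, _ = pywt.cwt(projection, scales, wavelet, sampling_period=1.0 / fs)
    power = np.abs(coeffs)                                       # time-frequency magnitude
    return (power - power.min()) / (power.max() - power.min())   # normalized grayscale scalogram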
GoogLeNet Classifier
GoogLeNet, also known as Inception-v1, is a deep convolutional neural network that has shown exceptional performance in various image classification tasks. Its architecture consists of multiple convolutional layers that learn hierarchical features from the input data. In this research, the scalograms generated from the CWT are used as input images for the GoogLeNet classifier. To leverage the power of deep learning without requiring extensive training data, transfer learning is employed. The pretrained GoogLeNet model, initially trained on large image datasets, is fine-tuned by modifying the last two layers to adapt to the specific task of classifying hand movements from EEG signals. This approach significantly reduces the computational cost and training time while improving the model’s ability to generalize across different subjects and tasks. By focusing on the most relevant features learned during pretraining, the GoogLeNet classifier achieves high accuracy in distinguishing between the various motor functions.
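The work itself uses MATLAB’s pretrained GoogLeNet; as a hedged equivalent sketch in PyTorch/torchvision, the transfer learning step of replacing the final fully connected layer would look roughly like this:

import torch.nn as nn
from torchvision import models

def build_googlenet(num_classes=2, freeze_backbone=True):
    # Load ImageNet-pretrained GoogLeNet and adapt it to the scalogram classes
    model = models.googlenet(weights=models.GoogLeNet_Weights.IMAGENET1K_V1)
    if freeze_backbone:
        for p in model.parameters():
            p.requires_grad = False        # keep the pretrained feature extractor fixed
    model.fc = nn.Linear(model.fc.in_features, num_classes)  # new classification head
    return model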
The integrated use of CSP, CWT, and GoogLeNet forms a robust pipeline for EEG signal classification. CSP enhances discriminative features, CWT captures essential time-frequency information, and GoogLeNet provides powerful classification capabilities. The proposed system demonstrated outstanding performance, achieving high classification accuracies across different motor tasks and subjects, thus highlighting its potential as a general and reliable solution for motor imagery (MI) tasks.
Prior to developing the final model, a baseline system was constructed based on the methods described in [17], as the same dataset was used, achieving classification results exceeding 90%. Additionally, a preliminary study model was created to provide a clear overview of the expectations for developing EEG classification models. Although the baseline system employs CSP as mentioned in [17], feature extraction using CSP and CWT to produce scalograms differs from the approach used in the referenced paper. The baseline model differentiates the four hand movements in a pairwise manner, functioning as a binary classifier. However, further improvements were made to the baseline model to modify the binary classification into a four-way classification in the last three models. In the preliminary study, time and frequency features were extracted from the EEG signals and fed into a KNN classifier. The extracted time features included mean, variance, skewness, kurtosis, and area under the signal. The extracted frequency features comprised the power of alpha and beta frequency bands. The last three models (ML Model 1, ML Model 2, and ML-CSP-OVR) were developed based on the baseline model with some modifications. The first two variants, ML Model 1 and ML Model 2, introduce a novel classification technique using multiple stages of SoftMax outputs. The final variant, ML-CSP-OVR, employs a novel combination of CSP OVR (One Versus Rest), CWT, and GoogLeNet for multiclass classification.
From the dataset, four segments of data corresponding to the four events/classes of actions (forward hand movement, grasp, release, and reverse) are extracted for classification. These four segments or motor actions are shown in Table 1.
In the preliminary study, the four events of interest are grouped into two pairs, events 1 and 4 and events 2 and 3, since these pairs involve opposite motor actions. The grouping rests on the assumption that the opposing nature of the movements within each pair makes them more distinct and therefore easier to classify. For the baseline system, six unique pairs, shown in Table 2, formed from the four classes of motor movement, are used for classification. The rest of the models extend the baseline classification to perform four-way classification without using pairs of two.
These are made in pairs of two because CSP was developed for separating two classes of data [9].

3.1. The Dataset

The dataset used in this study is the publicly available WAY-EEG-GAL dataset [50]. It involves twelve participants performing grasp and lift actions across a total of 3936 trials distributed over 10 series. Each participant conducted 328 trials. During the trials, an LED cue signaled the participants to reach for an object, grasp it, and lift it up. The LED turning off signaled the participants to return the object to its original position and retract their hand. The EEG signals were recorded using 32 channels, as depicted in Figure 2. These signals were sampled at 500 Hz. Two types of datasets were provided: holistic and windowed. The windowed dataset was utilized in this study, where the lifting series were segmented into intervals around each lift action. Each window starts precisely two seconds before the LED turns on and concludes three seconds after the LED turns off.

3.2. Data Checking and Cleaning

Firstly, the dataset was reviewed to verify the information provided in the paper. According to the dataset description, the 12 participants performed a total of 10 series, comprising 28 grasp and lift (GAL) trials for mixed series and 34 trials for each of the other series types. These series include weight, surface, and mixed types. In the weight series, participants interacted with objects of varying weights across three different levels. Meanwhile, the surface series involved objects with different materials on their surfaces. The mixed series combined elements from both the weight and surface series. Figure 3 illustrates these series types for participant 1.
In the selected windowed dataset, the 10th series, which pertains to weight variations, is exclusively available in the holistic dataset. Therefore, for the windowed dataset, only nine series are accessible. Given that each participant is limited to two mixed series, the total number of trials can be calculated as follows:
12 participants × 8 series × 34 trials per series + 12 participants × 2 mixed series × 28 trials per series = 3936 trials
The results align with those reported in the paper. To standardize the data across all series, trials exceeding 28 were discarded, reducing variability and facilitating data processing. Subsequently, each trial’s window, starting 2 s before the LED activates and ending 3 s after deactivation [51], was scrutinized. Plots were generated for all trials to allow for manual inspection if necessary. A simple piece of code (Listing 1) was employed to verify the 3 s duration; checking the 2 s duration was deemed unnecessary since the LED On duration in the time dataset consistently provided this information. The code was designed to print participant, series, and trial details if deviations from the specified 3 s duration were identified in the trial data, as outlined in the paper.
The check flagged several trials, all belonging to participant 7; this participant was therefore discarded. As a result, 11 participants, nine series, and 28 trials per series are used in this study.
Listing 1. A simple check for 3 s duration.
% Flag trials whose window does not end exactly 3 s after the LED turns off
if (round(t(end)) - LED_off_t) ~= 3
    disp(['P:', num2str(p), ', S:', num2str(s), ', T:', num2str(triId)])
end
The vertical dashed lines in Figure 4 are drawn to visualize the time information given in the P.AllLifts data (Figure 5). The LED On and LED Off times plotted on the graph are calculated as follows:
  • LED On time = trial_start_time + LEDon, and LED Off time = trial_start_time + LEDoff. The LEDon and LEDoff values used in the calculation are taken from ws.win, which also provides trial_start_time.
  • HandStart, FirstDigit, BothLoad, Lift, Replace, BothReleased, and HandStop are all calculated in the same way, i.e., by summing trial_start_time and the respective event time given in P.AllLifts.
In Figure 6, the blue signal represents the raw EEG signal, while the red signal represents the filtered EEG signal. Four 1 s patches or segments, potentially containing the events of interest, are highlighted. These segments are extracted from the filtered signal for feature extraction, with the filtering process explained in the manuscript. The extracted data is then verified to ensure each segment is exactly 1 s or 500 samples in length. Given that the imaging period should not exceed 0.5 s [24], a 1 s segment duration is chosen to provide sufficient coverage for the event of interest.
The forward-reaching hand movement segment (red patch) is taken around the time when the hand starts moving (HandStart), as this moment likely corresponds to the initiation of the thought to move the hand forward. The grasp segment (cyan patch) is selected at the point when both digits (thumb and forefinger) establish contact with the object (BothDigit). The release segment (yellow patch) is positioned immediately after the Replace action, where the object is returned to its original position. The backward/reverse segment (purple patch) is taken from 1 s before the HandStop point. The grasp segment was initially taken around BothLoad (when both fingers start applying force to the object); however, due to outliers found in P.AllLifts for certain trials, the BothDigit point is used instead. In some trials, the reverse segment was cut off and did not reach the full 1 s length of 500 samples. This issue arose because the reference point was originally at BothReleased (when the digits separate from the object), which in some trials lies near the end of the trial and thus leaves too few data points for 1 s. This problem was resolved by changing the reference point to the HandStop point.
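A minimal sketch of cutting such a 1 s (500-sample) segment around an event marker is given below; the function and argument names are illustrative, and event times are assumed to be given in seconds relative to the start of the windowed trial, as in P.AllLifts:

import numpy as np

def extract_segment(eeg, event_time_s, fs=500, offset_s=0.0, length_s=1.0):
    # eeg: (n_samples, n_channels) array for one windowed trial
    start = int(round((event_time_s + offset_s) * fs))
    stop = start + int(length_s * fs)                 # 500 samples at 500 Hz
    segment = eeg[start:stop, :]
    if segment.shape[0] != int(length_s * fs):
        raise ValueError("segment cut off at the trial boundary")
    return segment

For example, the reverse segment would be taken with offset_s = -1.0 relative to the HandStop time.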

3.3. Data Preprocessing

A total of 16 electrodes out of 32 were selected from channels 11–26. These channels are close to the three channels of interest associated with motor movements: C3, Cz, and C4 [52]. A Butterworth band-pass filter with a passband range between 7 and 30 Hz was applied, as motor movements primarily involve sensorimotor signals within the mu and beta bands [10]. The Butterworth filter was chosen for its ability to provide a smooth passband output. This filtering process also removes unwanted artifacts such as powerline interference, ocular artifacts, and electrocardiogram (ECG) and electromyogram (EMG) signals [17,28]. Z-score normalization was then applied by subtracting the mean and dividing by the standard deviation. In the following subsections, the five models built for the classification are described. As mentioned at the start of Section 3, these five systems are as follows: the preliminary study model, the baseline system, ML Models 1 and 2, and ML-CSP-OVR.
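A sketch of this preprocessing in Python/SciPy is shown below (the filter order is an assumption, since it is not specified above):

import numpy as np
from scipy.signal import butter, filtfilt

def preprocess_trial(raw_eeg, fs=500.0, band=(7.0, 30.0), order=4):
    # raw_eeg: (n_samples, 32) trial; keep channels 11-26 (0-based indices 10-25)
    x = raw_eeg[:, 10:26]
    b, a = butter(order, [band[0] / (fs / 2), band[1] / (fs / 2)], btype="bandpass")
    x = filtfilt(b, a, x, axis=0)                     # zero-phase Butterworth band-pass
    return (x - x.mean(axis=0)) / x.std(axis=0)       # z-score normalization per channel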

3.4. Preliminary Study Model

The objective of the preliminary study is to understand the procedures involved in the EEG signal classification process using a machine learning algorithm. Time and frequency features are extracted and used for classification. The features are extracted in MATLAB and exported to Jupyter Lab for KNN classification.
Time features, excluding the zero-crossing and peak-to-peak features, are extracted from the filtered EEG signals. Frequency features such as average powers of beta- and alpha-frequency bands are extracted using Welch’s method rather than Fast Fourier Transform (FFT) since it produces smoother results due to its averaging method (Figure 7). Welch’s method is also more robust against non-stationary signals such as EEG.
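A sketch of the band-power extraction with Welch’s method in Python/SciPy follows (the nperseg value is an assumption):

import numpy as np
from scipy.signal import welch

def alpha_beta_power(segment, fs=500.0):
    # segment: (n_samples, n_channels) slice of filtered EEG
    f, psd = welch(segment, fs=fs, nperseg=256, axis=0)
    alpha = psd[(f >= 8) & (f <= 12)].mean(axis=0)    # average alpha-band power per channel
    beta = psd[(f >= 12) & (f <= 30)].mean(axis=0)    # average beta-band power per channel
    return alpha, beta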
All the extracted features are exported from MATLAB into a Python-compatible data type so that classification can be performed using the K Nearest Neighbor (KNN) classifier. The first 10 extracted features for event 1 can be seen in Table 3.
Before the classification is performed, the feature dataset is visualized with the help of scatter plots. Figure 8 shows one such visualization. The features of the events can be observed as closely associated with one another, meaning it will be troublesome for the KNN model to perform the classification, even between two events. The result of this is discussed further below.
Since the data of the events are very similar, the classification result is expected to be quite low unless the distributions of the data can be separated. KNN classification works by finding the k nearest neighbors of the data point of interest and taking a majority vote among those neighbors to obtain the prediction for the data point. The classifier performs best at k = 3.
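The corresponding classification step, sketched with scikit-learn (the feature matrix layout is an assumption):

import numpy as np
from sklearn.model_selection import cross_val_score
from sklearn.neighbors import KNeighborsClassifier

def knn_cv_accuracy(features, labels, k=3, folds=10):
    # features: (n_segments, n_features) time/frequency features; labels: event classes
    clf = KNeighborsClassifier(n_neighbors=k)
    return cross_val_score(clf, features, labels, cv=folds).mean()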

3.5. The Baseline System

The baseline system is built to establish a deep understanding of some of the popular methods used in EEG signal processing with machine learning. Since the techniques are based upon [17], the results can be compared against the ones presented in that paper to see how well the baseline system performs. However, a direct comparison is not possible due to the different paradigms used, which stem from some unusual ways in which the authors of [17] applied their techniques. An attempt was made to clarify these points by contacting the main author of the paper, but the reply did not resolve them, partly because the author no longer clearly remembered the methods, as the research had been conducted two years earlier. Some of the unusual points found in the paper are listed below:
i.
The unfiltered data of the six selected electrodes have more data points than provided in the dataset.
ii.
The CSP method appears to have been applied to a single electrode.
The problem with point (ii) is that CSP is a general eigen decomposition method that finds a weighting of channels, which maximizes the signal-to-noise ratio [53]. With a single channel or electrode, it would not be necessary to find a set of weights. Therefore, a CSP system was built by closely following the original method. Six spatial filters were selected (three for each class of binary classification) because of the need to create RGB images that serve as inputs for the CNN classifier. Common Spatial Patterns (CSP) is a technique used to distinguish between two classes or conditions in EEG signal processing. This method requires the data for the two classes to be in a dimension of N × T, where N represents the number of channels and T represents the number of samples in time [9,11]. CSP employs general eigen decomposition to generate eigenvectors with corresponding eigenvalues [53]. The columns of the eigenvector matrix serve as spatial filters. The spatial filter matrix is sorted in descending order so that the first filter corresponds to the largest eigenvalue and the last filter to the smallest. This sorting facilitates the convenient selection of filters. In a scenario with two conditions, ‘a’ and ‘b’, the first filter maximizes the variance of the EEG signal under condition ‘a’ while minimizing the variance under condition ‘b’. Conversely, the last filter minimizes the variance under condition ‘a’ and maximizes it under condition ‘b’. Essentially, the first few filters maximize variance under one condition, and the last few filters maximize variance under the other. As described in [9], the first m and the last m CSPs are retained as spatial filters. For this project, the first three and the last three filters are selected to differentiate between the two classes (Figure 9). Given the objective to classify four movement classes and the limitation of CSP being suitable for only two-class discrimination, unique pairwise classifications are performed. These projections will be used for the classification of hand movements.
Continuous Wavelet Transform
The continuous wavelet transform (CWT) converts a signal from the time domain into the time-frequency domain. Unlike the Fourier Transform, which lacks time localization and performs poorly with non-stationary signals such as EEG, the CWT addresses these limitations by producing wavelet coefficients. These coefficients can be used to determine the amplitude or power of the signal’s frequencies. Mathematically, the CWT involves convolving the signal of interest (in this case, the EEG signal) with a wavelet kernel over several frequencies [54]. In this project, the analytic Morse wavelet is used, with a symmetry parameter (gamma) equal to 3 and a time-bandwidth product of 60. This wavelet was chosen because it has been shown to produce excellent results with the current EEG dataset, as demonstrated in [17]. The CWT generates time-frequency plots known as scalograms (Figure 10), which can be utilized with convolutional neural networks (CNNs) such as GoogLeNet for further analysis.
The time projections obtained from CSP filters are transformed by CWT to generate grayscale scalograms. The three scalograms corresponding to the first and second classes are taken as individual RGB channels, which are flattened to form two RGB scalograms, one for each class. These are passed into GoogLeNet for classification between the first and the second classes. The RGB conversion is necessary since GoogLeNet only accepts images with three color channels, and the information from three projections can be gathered into one object. The CSP-CWT procedure can be seen in Figure 11.
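The stacking step can be sketched as follows (helper names are illustrative):

import numpy as np

def stack_rgb_scalogram(scalograms):
    # scalograms: three (height, width) grayscale scalograms, one per selected CSP projection
    assert len(scalograms) == 3, "GoogLeNet expects three color channels"
    rgb = np.stack(scalograms, axis=-1)               # (height, width, 3)
    return (rgb * 255).astype(np.uint8)               # 8-bit RGB image for the CNN input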
Transfer learning is used with this pretrained network to reduce the computation cost and time. Only the last two layers (fully connected layer and softmax layer) are modified during the transfer learning. An 80:20 ratio is used for splitting the training set and test set from the scalogram image set. Images are resized to 224 × 224 × 3 since this is the required dimension of the input images when using GoogLeNet. During training, the weights of the first 10 layers are frozen. A mini-batch size of five is used with 10–30 epochs to train each class. The learning rate is set to 0.0003.
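A hedged PyTorch sketch of this training configuration, reusing the build_googlenet helper above (the optimizer choice and the folder layout of the scalogram images are assumptions, not part of the original work):

import torch
from torch.utils.data import DataLoader
from torchvision import datasets, transforms

transform = transforms.Compose([
    transforms.Resize((224, 224)),                    # required GoogLeNet input size
    transforms.ToTensor(),
])
train_set = datasets.ImageFolder("scalograms/train", transform=transform)  # hypothetical path
train_loader = DataLoader(train_set, batch_size=5, shuffle=True)            # mini-batch size of five

model = build_googlenet(num_classes=2)
optimizer = torch.optim.Adam(
    (p for p in model.parameters() if p.requires_grad), lr=0.0003
)
loss_fn = torch.nn.CrossEntropyLoss()

for epoch in range(20):                               # 10-30 epochs are used per classification
    for images, labels in train_loader:
        optimizer.zero_grad()
        loss = loss_fn(model(images), labels)
        loss.backward()
        optimizer.step()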
ML Model 1
This model, shown in Figure 12, is based on the baseline model but is extended in the classification step to transform the baseline model into a four-way classifier. However, since CSP is restricted to binary pairs/classification, the conventional way of using the SoftMax layer to perform four-way classification does not work. This results in the more complicated procedure of using comparators to perform the four-way classification (see Figure 9 for the pipeline).
The model is shown in Figure 13, where the inputs are taken from the pre-processed slice data in the form of ‘One Vs. Others’, with one pitted against a group of other actions for training. The amount of data for one action is set to be the same as the amount of data for the other group. The model is also to be tested in cases where the amount of data is not the same. Four pairs of such groups (FWvsOthers, GPvsOthers, RLvsOthers, and RVvsOthers) are collected for the training set and passed into the four ML networks, one for each pair. In the classification step, a group of datasets collected from FW, GP, RL, and RV are used to create the test set for the model (See Figure 14). The classification step is the same as the one in ML Model 2.

3.6. ML Model 2

Model 2 is similar to Model 1, with the key difference being the source of the training inputs. Instead of using pre-processed slices, the inputs for Model 2 are derived from scalogram pairs after their generation. For example, for the FW vs. Others pair, ‘FW’ data is collected from FW_GP, FW_RL, and FW_RV, while ‘Others’ data is compiled from the GP, RL, and RV segments of the three mentioned pairs. The test set, consisting of four classes (FW, GP, RL, and RV), is assembled from the input training data. Specifically, the FW class in the test set is taken from the FW segment of the FW/Others set, and the same approach is applied to the other three classes (as illustrated in Figure 13).
Figure 15 shows the overall framework of how the training sets and the test set are used in the training stage and the test stage of the model.
With these FW vs. Others, GP vs. Others, RL vs. Others, and RV vs. Others data, the model is trained and validated using an 80:20 training and validation data split. The data in Figure 16 show the validation accuracy for four pairs of networks (1. FW vs. Others, 2. GP vs. Others, 3. RL vs. Others, and 4. RV vs. Others). Validation is performed on the respective 20% validation sets.
Figure 17 shows the predicted labels from the four networks and their corresponding probabilities. It is to be noted that the probabilities for ‘Others’ are set to 0 since the ‘Others’ class does not have much meaning in this project.
Once that procedure has been completed, out of the remaining probabilities, the highest one is picked along with its corresponding event. The final predicted classes are then compared with the true labels to calculate the accuracy of the model. In Figure 18, the left image shows the final predictions whereas the right shows the true labels of the test set.
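This decision logic reduces to picking the most confident non-‘Others’ output among the four networks; a minimal sketch is given below (the dictionary keys are illustrative):

import numpy as np

EVENTS = ["FW", "GP", "RL", "RV"]

def combine_one_vs_others(probabilities):
    # probabilities: dict mapping each event to the probability that its network assigned
    # to that event; the 'Others' probabilities have already been discarded (set to 0)
    scores = np.array([probabilities[e] for e in EVENTS])
    return EVENTS[int(np.argmax(scores))]

# Example: combine_one_vs_others({"FW": 0.2, "GP": 0.7, "RL": 0.4, "RV": 0.1}) returns "GP".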

3.7. Model 3 (Conceptualized)

The last model (Figure 19) employs a technique described in the original CSP paper: majority voting of all pairwise classifications. For instance, an action is classified as ‘FW’ only if FW-GP, FW-RL, and FW-RV all return FW; otherwise, it is deemed indecisive. This method is used more leniently, where if at least two of the three pairs produce the same classification result, that result is accepted. This selection process occurs within a layer of comparators. If each network group outputs a different class, the comparator yields ‘undefined’. The four outputs from the comparator layer are fed into a final comparator to select the class with the highest probability. In this ensemble of CNN networks, six networks are trained and interconnected in a complex manner to determine the final class via majority voting. However, it should be noted that this model was not implemented because Models 1 and 2 did not perform well on the four-way classification task.

3.8. ML-CSP-OVR

Using CSP OVR instead of the vanilla CSP allows the model to perform the multiclass classification without being restricted to binary classification pairs. CSP One Vs. Rest (OVR) has been implemented using a combination of algorithms from [9,12]. This method is selected due to its low computational cost and its simpler classification procedure, which avoids the intermediate SoftMax comparison steps required by the binary classification pairs. Figure 20 illustrates the pipeline of the ML-CSP-OVR model.
Firstly, slices of FW, GP, RL, and RV are processed using the CSP-OVR algorithm to calculate the spatial filters that best differentiate the four classes. After obtaining the spatial filters for each class, the first three filter components from each filter matrix are selected, as these are the most effective at distinguishing the classes. As in the baseline system, these selected filters are applied to the four event signals to create projection signals used for class discrimination. Continuous Wavelet Transform (CWT) is then performed to generate scalograms, which are subsequently fed into GoogLeNet for the four-way classification.
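Reusing the two-class csp_filters sketch from the baseline description, a hedged sketch of this one-vs-rest extension could look like this:

import numpy as np

def csp_ovr_filters(class_trials, m=3):
    # class_trials: dict mapping each class name to its (n_trials, n_channels, n_samples) array
    filters = {}
    for name, trials in class_trials.items():
        rest = np.concatenate(
            [t for other, t in class_trials.items() if other != name], axis=0
        )
        w = csp_filters(trials, rest, m=m)   # two-class CSP: this class vs. all remaining classes
        filters[name] = w[:m]                # keep the first three components per class
    return filters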

3.9. Performance Metrics

To measure the performance of the models, five types of evaluation are used: (i) accuracy, (ii) precision, (iii) sensitivity, (iv) specificity, and (v) F1-score. The mathematical equations of these are as follows:
Accuracy = (TP + TN) / (TP + TN + FP + FN)
Precision = TP / (TP + FP)
Sensitivity = TP / (TP + FN)
Specificity = TN / (TN + FP)
F1 Score = (2 × Precision × Sensitivity) / (Precision + Sensitivity)
where TP = True Positive, TN = True Negative, FP = False Positive, and FN = False Negative [55].
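Equivalently, the five metrics can be computed directly from the confusion-matrix counts, e.g.:

def classification_metrics(tp, tn, fp, fn):
    accuracy = (tp + tn) / (tp + tn + fp + fn)
    precision = tp / (tp + fp)
    sensitivity = tp / (tp + fn)                # also called recall
    specificity = tn / (tn + fp)
    f1 = 2 * precision * sensitivity / (precision + sensitivity)
    return accuracy, precision, sensitivity, specificity, f1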
Feature extraction in EEG signal processing involves identifying and isolating relevant information from raw EEG signals to enhance classification or analysis. This process typically starts with preprocessing steps such as band-pass filtering and artifact removal to ensure clean data. Time-domain features such as amplitude, power, and zero-crossing rate provide insights into the signal’s intensity and variation. Frequency-domain features such as power spectral density (PSD) and band power offer a breakdown of the signal’s energy across different frequency bands, which is crucial for understanding various cognitive states. Time-frequency domain features, including Wavelet Transform (WT) and Short-Time Fourier Transform (STFT), capture how the signal’s spectral content evolves over time, which is particularly useful for non-stationary EEG signals. Spatial features such as Common Spatial Patterns (CSP) and Independent Component Analysis (ICA) enhance the differentiation between brain states by focusing on spatial patterns of brain activity. Non-linear features such as the fractal dimension and Lyapunov exponent provide additional insights into the complexity and chaotic nature of EEG signals. Often, a combination of these features is employed to capture the full range of information in the EEG data. After feature extraction, feature selection techniques such as Principal Component Analysis (PCA) are applied to reduce dimensionality before feeding the features into classification models, which can range from simple linear classifiers to advanced machine learning algorithms such as Support Vector Machines (SVMs) or deep neural networks. This systematic approach to feature extraction is critical for developing accurate and reliable EEG-based applications, including Brain–Computer Interfaces (BCIs).
To summarize, the data processing and feature extraction steps in the study began with data collection from 12 participants, each performing 10 sessions of grasp and lift (GAL) trials. Trials exceeding 28 were discarded to standardize the dataset, and visual inspection ensured a consistent 3 s duration for each trial, leading to the exclusion of participant 7 due to inconsistencies. The raw EEG signals were filtered to remove noise, and four 1 s segments were extracted from the filtered data, each being 500 samples long. Common Spatial Patterns (CSP) were employed to extract spatial features, selecting six spatial filters to distinguish between conditions. Event-Related Desynchronization/Synchronization (ERD/ERS) features captured power changes in specific EEG frequency bands associated with motor activities. CSP filters produced time projections for classification, with steps including covariance matrix estimation and whitening transformation. The initial model used K-Nearest Neighbors (KNN) with three nearest neighbors and 10-fold cross-validation, achieving accuracies of 57.23% for forward–reverse pairs and 50.87% for grasp–release pairs. Convolutional Neural Networks (CNNs) were later used for final classification, validated with an 80:20 training/validation split, although Model 3, which involved majority voting, was not implemented due to performance issues. Evaluation metrics such as accuracy, precision, sensitivity, specificity, and F1-Score were calculated using True Positive (TP), True Negative (TN), False Positive (FP), and False Negative (FN) values. This detailed description, including parameters and specific implementations, ensures that readers can reproduce the experimental results accurately. Furthermore, feature extraction for both classes can be performed simultaneously by first applying preprocessing steps to the entire dataset, and then extracting features from the entire set before splitting it into different classes. This approach involves applying techniques such as PCA or ICA to the full dataset, selecting the most relevant features then dividing the data into classes or training/testing sets. This ensures that the features are representative of the entire dataset, leading to more robust and generalizable models.

4. Measurement Results

For the KNN classification in the preliminary study, only intersubject classification is used, incorporating data from all participants. It is important to note that in this paper, for the baseline system as well as Models 1 and 2, the intersubject classification is conducted without using leave-one-out testing. This means that a separate test set from one participant, which is left out of both training and validation, is not used. However, given the satisfactory performance of the ML-CSP-OVR model, the test paradigm is extended to include leave-one-out intersubject classification. For the GoogLeNet classification, both intrasubject and intersubject classifications are utilized. In the baseline system, intrasubject classification is performed for all participants. In the three variant models, intrasubject classification is conducted for three participants: P1, P5, and P12. Intersubject classification uses data from participants 1 to 11, excluding participant 7 as previously mentioned.

4.1. Preliminary Study Model

The model uses K Nearest Neighbor classification, which can only achieve an average accuracy of 57.23% for the forward–reverse pair and an average accuracy of 50.87% for the grasp–release pair using the time and frequency features as shown in Table 3. The classification is performed with the three nearest neighbors, as mentioned before, and with a 10-fold cross validation scheme. Either increasing or decreasing the number of neighbors results in a significant drop in accuracy.
Figure 21 shows that the performance of the classifier on the dataset is below 60%, which is expected due to the nature of the dataset. The result could be improved if the features are extracted from the spatially and temporally filtered signals rather than using only the temporally filtered signal. However, these results are left unimproved since the objective of the preliminary study is to gain an overall picture of what is involved in neural signal processing and classification.

4.2. Baseline Model

With a total number of 2772 trials (11 participants × nine series × 28 trials) and three color channels (RGB), the number of grayscale scalograms produced per event is 8316. Since one classification involves a pair of two events, the total number of images is 16,632. After flattening the separate grayscale scalograms, the total number of RGB images per pair of classifications is 5544. This feature extraction process can take up to 3–4 h for a single classification. The 5544 scalograms are split into training and validation sets in an 80:20 ratio, as mentioned before. Using the equations given in Section 3.9, the five evaluation scores for the classifications are obtained. Intersubject classification is conducted between the events of each unique pair, resulting in six pairs for classification across all 11 participants. Table 4 indicates that the baseline system (average values) achieves accuracies ranging from 80.87% to 99.73%, with the best pairs approaching perfect accuracy. Precision, sensitivity, and specificity range from approximately 71% to 100%, with the lower end of the range predominantly observed in the first three pairs. The average accuracy, precision, sensitivity, and specificity across all six pairs are 90.33%, 90.06%, 91.11%, and 89.53%, respectively. The average F1 score is 0.90. From Figure 22a, it is evident that the performance metrics for the first three groups are about 10–30% lower than those for the last three groups. The F1 score follows a similar trend between these two sets of groups (Figure 22b). This performance drop in the first three groups could be attributed to the forward hand movement (FW) event, as these groups all involve the FW event. Enhancing accuracy might be possible by adjusting the FW slices to a better temporal location. Refer to the intrasubject classification for another potential cause of this performance drop.
The training and validation progress plots of the classifier for FW-RV and GP-RL.
For intrasubject classification, the pair with the lowest accuracy from intersubject classification (FW-GP) is selected. The data for this pair are separated by individual participants, and classification is performed on these individual groups. As expected, both accuracy and F1 score improved for all participants in intrasubject classification, except for participant 3 (see Figure 23a,b). The low accuracy score for intersubject classification might be due to participant 3’s forward hand movement data, although further research is needed to confirm this. For some participants, the baseline method achieves perfect scores for all performance metrics, as shown in Table 5. The average accuracy, precision, sensitivity, and specificity across all 11 participants are 91.36%, 92.01%, 88%, and 94.73%, respectively.
The training and validation progress plots of the classifier for participant 2 and participant 5.
A comparison is also made between the performance of the baseline system and that of the selected paper [17]. Since the classification in [17] is performed on events vs. non-events, a direct comparison cannot be made. However, given similarities such as the events of interest (e.g., GP and RL), the results of that paper can still be used as a reference; they are shown in Table 6.
To compare against the results from the paper, the baseline system's scores for pairs involving the same event are averaged to obtain a single value, since the baseline classification is performed on pairs of individual unique events. For example, for grasp-related events, the performance scores of GP-FW, GP-RL, and GP-RV are averaged for comparison. These averaged values are shown in Table 7.
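For example, the GP (Grasp) vs. Non-GP accuracy reported in Table 7 is the mean of the FW-GP, GP-RL, and GP-RV accuracies in Table 4: (80.87 + 98.56 + 99.73)/3 ≈ 93.05%.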
From Table 6 and Table 7, it can be observed that the performances of the two systems are very similar. As noted above, these are not direct comparisons; they only provide a general indication of the performance pattern of the two systems.

4.3. ML Model 1

Intrasubject classification
Figure 24a shows that the accuracy of the system for all the selected participants falls below 50%, with the lowest at 38.49%. Even though this is better than random chance, which is 25% for four classes, such accuracy shows that the system is not very reliable. Figure 24b also shows that, for some classes, the precision and sensitivity across the three selected participants can be as low as 0%.
The accuracy of the intersubject classification for this model is 45.12%. Figure 25 shows that the sensitivity for GP and RL is 0% and 1%, respectively, indicating that the model is unreliable for these two classes. The model performs quite well for the RV action, with both sensitivity and precision higher than 80%.
A possible explanation for these low scores was outlined in earlier sections of the manuscript. The idea of grouping FW as one class and the other actions as another is to let CSP generate filters that produce 'FW vs. Others' scalograms. These scalograms are then used to train the 'FW vs. Others' network, with the same method applied to the other three branches. The test set is created by combining FW, GP, RL, and RV from the scalogram pairs. Focusing on one branch, 'FW vs. Others', is sufficient for the discussion, since the same logic applies to the other branches. When the 'FW vs. Others' network predicts the test set labels, the assumption is that if GP, RL, or RV slices are tested, the network should predominantly classify them as 'Others', since they are not FW. However, because GP, RL, and RV are grouped into a single class when creating the input slices for this branch, the CSP filter turns this group of three into scalograms whose characteristics combine all three actions rather than reflecting their individual traits. In the test set, the FW, GP, RL, and RV scalograms retain the characteristics of the individual actions, causing the 'FW vs. Others' network to predict the 'Others' class less frequently than the individual action classes. Since neither intrasubject nor intersubject classification performs well, another model is explored, leading to the creation of ML Model 2. Given this theoretical explanation for the model's poor performance, ML Model 2 is designed to train the networks with scalograms that display individual characteristics rather than combined ones.
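For clarity, the sketch below illustrates how the four binary branches could be fused into a single four-way decision: each branch reports the probability that a slice belongs to its target class, and the branch with the highest target probability wins. The probability matrix is a random placeholder standing in for the softmax outputs of the four trained GoogLeNet branches, so this is an illustration of the decision rule, not the trained system.

```python
# Hedged sketch of fusing the "FW vs. Others", "GP vs. Others", "RL vs. Others",
# and "RV vs. Others" branch outputs into one four-way prediction.
import numpy as np

classes = ["FW", "GP", "RL", "RV"]
rng = np.random.default_rng(1)

# target_prob[i, k] = probability that slice i belongs to class k, according to branch k
target_prob = rng.random((6, 4))

predicted = [classes[k] for k in target_prob.argmax(axis=1)]
print(predicted)
```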

4.4. ML Model 2

Figure 26a shows that the accuracy of this model is better than that of the previous one for the same participants. With an accuracy of 74.7% obtained for P5, it can be inferred that for certain participants the model can reach rates of about 75%. The precision and sensitivity in Figure 26b also show that Model 2 is much more reliable than Model 1.
Intersubject classification
Figure 27 shows the precision and sensitivity for the four classes, using data from participants 1 to 11. Even though the sensitivity for the RV class is quite low at 28.5%, the intersubject classification of Model 2 is better than that of Model 1. The accuracy achieved is 57.88%.
Since Model 2 shows improved performance for both intrasubject and intersubject classification compared to Model 1, the explanations for Model 1's low performance appear valid to some extent. One possible reason for the lower scores in Model 1 is the binary classification approach used in the CSP method. Figure 28 presents a segment of the framework depicted in Figure 20 for easier reference. The three scalogram pairs (FW_GP, FW_RL, and FW_RV) are all distinct because CSP only maximizes the separability between the two classes within a pair. In the FW_GP pair, CSP maximizes the distinguishability between FW and GP, and the same is done for the other pairs. As a result, the two FW groups from FW_GP and FW_RL differ, because CSP makes FW distinct from GP in the first pair and distinct from RL in the second.
The distinctions made by CSP in binary classification can lead to issues. For instance, FW from the FW_GP pair might exhibit similar characteristics to RL from the FW_RL pair. This problem can extend to all scalogram pairs used for creating the training and test sets. Since this issue arises from the binary classification nature of CSP, it is necessary to adopt a method capable of multiclass classification. Consequently, a third model is implemented using the CSP-OVR method, which is designed for multiclass classification.
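A hedged sketch of the spatial-filtering step is given below, following the generalized eigendecomposition view of CSP (cf. [55]): in the binary case the two covariance matrices come from the two classes of a pair, whereas in CSP-OVR the 'rest' covariance pools the three remaining classes, giving one filter set per class. All data, channel counts, and filter numbers here are placeholders, not the study's actual configuration.

```python
# Hedged sketch of binary CSP and its one-versus-rest (OVR) extension.
import numpy as np
from scipy.linalg import eigh

def class_covariance(trials):
    """Average normalised spatial covariance; trials has shape (n_trials, n_channels, n_samples)."""
    covs = []
    for x in trials:
        c = x @ x.T
        covs.append(c / np.trace(c))
    return np.mean(covs, axis=0)

def csp_filters(cov_target, cov_rest, n_filters=3):
    """Generalized eigendecomposition; returned filters maximise variance for the target class."""
    eigvals, eigvecs = eigh(cov_target, cov_target + cov_rest)
    order = np.argsort(eigvals)[::-1]
    return eigvecs[:, order[:n_filters]].T           # (n_filters, n_channels)

# Placeholder data: 4 classes x 20 trials x 14 channels x 500 samples
rng = np.random.default_rng(2)
data = {c: rng.standard_normal((20, 14, 500)) for c in ["FW", "GP", "RL", "RV"]}
covs = {c: class_covariance(t) for c, t in data.items()}

# CSP-OVR: one filter set per class, computed against the pooled covariance of the rest
ovr_filters = {
    c: csp_filters(covs[c], np.mean([covs[o] for o in covs if o != c], axis=0))
    for c in covs
}
fw_projections = ovr_filters["FW"] @ data["FW"][0]   # time projections fed to the CWT stage
```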

4.5. ML-CSP-OVR

Among the three ML models, ML-CSP-OVR performs the best for both intrasubject and intersubject classification. For intrasubject classification, as shown in Figure 29, participant 12 achieves an accuracy of 90.5%, with both sensitivity and precision exceeding 90%. The lowest accuracy is 69.5%, for participant 1. For intersubject classification, the accuracy achieved is 78.08%, with precision and sensitivity exceeding 88% for two out of four classes (see Figure 30).
Figure 29. (a) Accuracy of ML-CSP-OVR. (b) Precision and sensitivity of ML-CSP-OVR.
Intersubject classification
Figure 30. Precision and Sensitivity of ML-CSP-OVR for intersubject classification.
Intersubject classification (Leave One Out)
Given the strong performance in both intrasubject and intersubject classifications, the task is extended to include intersubject classification with leave-one-out. This is the most challenging task, as EEG signals are known to be subject-specific, differing from one participant to another. In this setup, participant 12 (P12) is excluded from the training set, meaning the machine learning model has not seen any data related to P12 during training. After training, P12 is used as the test set. As shown in Figure 31, the ML-CSP-OVR model performs similarly to the intersubject classification without leave-one-out, achieving an accuracy of 76.39%. However, the confusion matrix indicates that the model struggles to discriminate between the RL and RV classes. This difficulty may be due to the proximity of the two actions, which are executed almost simultaneously. The extracted slices for the RL and RV classes overlap, leading to similarities that make it challenging for the model to differentiate between the two classes.
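The leave-one-out protocol can be summarised by the sketch below: all trials from the held-out participant are excluded from training and used only for testing. A linear SVM on placeholder feature vectors stands in for the CSP-OVR/CWT/GoogLeNet pipeline, so this shows the evaluation loop only, not the actual model.

```python
# Hedged sketch of leave-one-subject-out evaluation across 12 participants.
import numpy as np
from sklearn.model_selection import LeaveOneGroupOut
from sklearn.svm import LinearSVC
from sklearn.metrics import accuracy_score

rng = np.random.default_rng(3)
X = rng.standard_normal((12 * 40, 16))       # placeholder features: 12 participants x 40 slices
y = rng.integers(0, 4, size=len(X))          # four classes: FW, GP, RL, RV
groups = np.repeat(np.arange(1, 13), 40)     # participant IDs 1-12

for train_idx, test_idx in LeaveOneGroupOut().split(X, y, groups):
    held_out = groups[test_idx][0]
    clf = LinearSVC(max_iter=5000).fit(X[train_idx], y[train_idx])
    acc = accuracy_score(y[test_idx], clf.predict(X[test_idx]))
    print(f"Held-out participant {held_out}: accuracy {acc:.2%}")
```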
To quantify the contribution of the CSP-OVR step, classification with the GoogLeNet classifier was also conducted without it. In this case, the training and validation sets consist of scalograms generated using only the continuous wavelet transform. Figure 32 shows the result of this classification: the accuracy is 26.82%, which is very close to the 25% expected from randomly guessing among four classes.

4.6. Overall Comparisons across the Three Four-Way Classifiers

Intrasubject classification
Figure 33 further confirms that ML-CSP-OVR performs better than the other two models in terms of precision, sensitivity, and accuracy. The measurement data are obtained from participant 12.
Intersubject classification
Figure 34 also shows that ML-CSP-OVR performs better than the other multiclass classifier models. Even though there is a dip in sensitivity and precision for RL and RV, ML-CSP-OVR still outperforms the other two overall. In terms of accuracy, ML-CSP-OVR achieves approximately 20–40% better results than the other two models.

5. Discussion

Analyzing the factors affecting the performance of EEG signal processing and classification systems involves considering multiple aspects of system design, data quality, and method implementation. One crucial factor is dataset quality and size. A larger and cleaner dataset often leads to better model performance. Small or noisy datasets can result in overfitting or poor generalization. To improve, we could increase the dataset size through data augmentation techniques or collect additional data. Enhancing data quality by applying robust preprocessing methods to reduce noise and artifacts also contributes significantly to better performance.
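As an illustration of the augmentation strategies mentioned above (not used in this study), two simple EEG augmentations are sketched here: additive Gaussian noise and small circular time shifts applied to a trial. The trial dimensions and parameter values are assumptions.

```python
# Hedged sketch of simple EEG data augmentation: noise injection and time shifting.
import numpy as np

def augment(trial, rng, noise_std=0.05, max_shift=25):
    """trial: array of shape (n_channels, n_samples); returns one augmented copy."""
    shifted = np.roll(trial, rng.integers(-max_shift, max_shift + 1), axis=-1)
    return shifted + rng.normal(0.0, noise_std, size=trial.shape)

rng = np.random.default_rng(4)
trial = rng.standard_normal((14, 500))                  # placeholder trial
augmented = [augment(trial, rng) for _ in range(4)]     # four extra training examples per trial
```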
Feature extraction techniques play a pivotal role in determining system performance. Different methods, such as wavelet transforms, ICA, or PCA, affect the relevance and quality of the features used for classification. Experimenting with various feature extraction techniques, including hybrid approaches, can help identify the most effective ones for a specific dataset. Additionally, exploring deep learning-based feature extraction methods may offer significant improvements if computational resources permit. The choice of classification algorithm is another critical factor. Algorithms such as SVMs, neural networks, and decision trees vary in effectiveness depending on the data and features. Testing a range of classification algorithms and fine-tuning their parameters can lead to better results; we plan to pursue this in future work. Utilizing ensemble methods to combine predictions from multiple classifiers can further enhance accuracy and robustness. Preprocessing and denoising techniques are essential for ensuring high-quality input data. Advanced preprocessing methods, such as adaptive filtering and artifact removal, can significantly improve signal quality, and implementing them enhances the effectiveness of the classification system.
Model complexity can also impact performance. Overfitting is a common issue with complex models, especially if the dataset is small or not diverse enough. To address overfitting, future work could explore cross-validation, regularization, and dropout techniques, as sketched below. Simplifying the models may also help achieve better generalization.
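The sketch below shows, purely as a future-work illustration, how the regularisation measures named above could be attached to a small classification head (here in PyTorch, which was not used in this study): dropout between layers and L2 weight decay in the optimiser, with early stopping assumed to be handled by monitoring a validation split.

```python
# Hedged sketch: dropout and L2 weight decay as overfitting countermeasures.
import torch
import torch.nn as nn

model = nn.Sequential(
    nn.Linear(1024, 256), nn.ReLU(),   # 1024 is a placeholder feature dimension
    nn.Dropout(p=0.5),                 # dropout against co-adaptation of features
    nn.Linear(256, 4),                 # four outputs: FW, GP, RL, RV
)
optimizer = torch.optim.Adam(model.parameters(), lr=1e-3, weight_decay=1e-4)  # L2 regularisation
```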
The classification accuracy in this proposed work reveals several factors that influence performance compared to recent research using similar datasets and methods. Differences in accuracy can be attributed to various elements, including dataset characteristics, feature extraction methods, model architectures, and training strategies. For instance, dataset size, quality, and diversity play a crucial role in performance. Smaller or less diverse datasets may lead to overfitting or inadequate model generalization. The choice of feature extraction techniques, such as wavelet transforms or ICA, can also impact accuracy, with some methods better capturing relevant patterns in the data than others. Model architecture and algorithm choices significantly affect performance. Variations between models such as SVMs, CNNs, and RNNs can lead to different accuracy outcomes, as each model type has its strengths and limitations depending on the data and task. Additionally, training procedures, including hyperparameter tuning and cross-validation strategies, can contribute to performance discrepancies. Effective training practices and robust validation approaches are essential for optimizing model accuracy. Several factors affect performance, including data quality and preprocessing. Issues such as noise, artifacts, and incorrect labeling can degrade accuracy, highlighting the importance of careful data handling and preprocessing. Model complexity also plays a role; while more complex models may capture intricate patterns, they can also be prone to overfitting if not properly managed. Computational resources impact model training and evaluation, as limitations may restrict the use of advanced models or extensive datasets.
The first study [56] explores the use of deep learning models for classifying EEG signals in brain–computer interface applications. The authors use a dataset similar to ours and apply convolutional neural networks (CNNs) for feature extraction and classification; the deep learning approach demonstrates high classification accuracy and robustness in noisy environments. This study could provide valuable insights into using advanced neural networks for EEG classification and help benchmark performance. The second paper [57] focuses on using the wavelet transform for feature extraction from EEG signals, followed by classification using support vector machines (SVMs), on datasets comparable to standard EEG datasets. The combination of wavelet transform with SVM shows effective results in distinguishing different mental states, highlighting the utility of classical feature extraction methods in EEG signal processing. Finally, the third study [58] compares Independent Component Analysis (ICA) and Principal Component Analysis (PCA) for denoising and classifying EEG signals, using similar preprocessing and feature extraction techniques. The comparison provides insights into the effectiveness of ICA versus PCA in different scenarios and is useful for understanding the trade-offs between these techniques in EEG signal processing. All three studies use EEG datasets similar to our proposed work, and they explore a range of methods, including deep learning, wavelet transforms, and ICA/PCA, that are relevant to our feature extraction and classification approaches. Each offers a different perspective that could complement our research, from advanced neural networks to classical signal processing techniques.

6. Conclusions and Future Work

In this project, a preliminary study was conducted using time and frequency features with the KNN classifier. Using features extracted from band-pass-filtered segments of data, the KNN classifier achieved a prediction accuracy of 57.23% for the FW–RV pair and 50.87% for the GP–RL pair. A potential remedy for this low accuracy was identified, namely using CSP-filtered features as inputs to the classifier; however, these results were left unimproved, since the primary aim of the preliminary study was to understand the overall process of neural signal processing and classification.
A baseline system was then developed using the CSP-CWT approach, with GoogLeNet as the classifier. For intrasubject classification of the FW–GP pair, out of 11 participants, nine achieved an accuracy of 90–100%, one achieved 82%, and another achieved 52%. For intersubject classification across six unique pairs (FW–GP, FW–RL, FW–RV, GP–RL, GP–RV, and RV–RL), the accuracy ranged from 80.87% to 99.73%, with F1 scores ranging from 0.8104 to 0.9973. This baseline system demonstrated good performance in classifying the four hand movements, comparable to the system in [17].
Using the baseline system, three variants were proposed for four-way classification. Among these, the ML-CSP-OVR variant achieved the best accuracy: 78.08% for intersubject classification with combined data, and 90.5% for participant 12 in intrasubject classification. Additionally, the leave-one-out intersubject classification achieved an accuracy of 76.39%, indicating the potential of the proposed system to serve as a generalized, subject-independent solution.
Future improvements to the ML-CSP-OVR system could involve integrating the FBCSP-OVR approach to enhance performance, as multiple filters can improve EEG signal quality. While the GoogLeNet classifier performs well, its deep-layer structure incurs significant computational costs; a simplified CNN (SCNN) [59] could be used to reduce computational cost while maintaining good classification accuracy. Decision trees, particularly random forests, should also be explored, as they have shown good performance in MI EEG analysis [19] and in the fields of biomedical devices [60] and IoT technology [61,62]. Furthermore, other unsupervised domain adaptation frameworks could be explored [59,63]. Owing to resource limitations, the present work involves twelve participants performing grasp-and-lift actions across a total of 3936 trials distributed over 10 series (328 trials per participant); increasing the sample size and dataset in future work would further strengthen the trustworthiness of the proposed approach.
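As a final future-work illustration related to the random forests mentioned above, the sketch below trains a random forest on placeholder feature vectors (e.g., log-variance features that could be derived from CSP-OVR projections). The data, feature dimensionality, and split are all hypothetical.

```python
# Hedged future-work sketch: random-forest classification of placeholder CSP-style features.
import numpy as np
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split

rng = np.random.default_rng(5)
X = rng.standard_normal((1200, 12))          # placeholder log-variance feature vectors
y = rng.integers(0, 4, size=1200)            # FW, GP, RL, RV

X_tr, X_te, y_tr, y_te = train_test_split(X, y, test_size=0.2, stratify=y, random_state=0)
rf = RandomForestClassifier(n_estimators=300, random_state=0).fit(X_tr, y_tr)
print(f"Hold-out accuracy: {rf.score(X_te, y_te):.2%}")
```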

Author Contributions

Conceptualization, Methodology, Investigation, Supervision, Resources and Software, C.L.K.; Methodology, Investigation and Data Curation, C.K.H.; Project administration, Resources, Visualization and Formal analysis, T.H.A.; Methodology, Investigation, Visualization and Formal Analysis, Y.Y.K.; Investigation, Supervision, Data Curation and Funding Acquisition, T.H.T. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The original contributions presented in the study are included in the article; further inquiries can be directed to the corresponding authors.

Conflicts of Interest

The authors declare no conflicts of interest.

References

1. Paszkiel, S.; Szpulak, P. Methods of acquisition, archiving and biomedical data analysis of brain functioning. In Biomedical Engineering and Neuroscience, Proceedings of the 3rd International Scientific Conference on Brain-Computer Interfaces, BCI 2018, Opole, Poland, 13–14 March 2018; Springer: Cham, Switzerland, 2018; pp. 158–171.
2. Kumar, J.S.; Bhuvaneswari, P. Analysis of Electroencephalography (EEG) Signals and Its Categorization: A Study. Procedia Eng. 2012, 38, 2525–2536.
3. Hu, L.; Zhang, Z. EEG Signal Processing and Feature Extraction; Springer: Singapore, 2019.
4. Pfurtscheller, G.; Lopes da Silva, F.H. Event-related EEG/MEG synchronization and desynchronization: Basic principles. Clin. Neurophysiol. 1999, 110, 1842–1857.
5. Tayeb, Z.; Fedjaev, J.; Ghaboosi, N.; Richter, C.; Everding, L.; Qu, X.; Wu, Y.; Cheng, G.; Conradt, J. Validating deep neural networks for online decoding of motor imagery movements from EEG signals. Sensors 2019, 19, 210.
6. Jeannerod, M. Neural Simulation of Action: A Unifying Mechanism for Motor Cognition. NeuroImage 2001, 14, S103–S109.
7. Collura, T.F. History and evolution of electroencephalographic instruments and techniques. J. Clin. Neurophysiol. 1993, 10, 476–504.
8. Mahmood, A.; Zainab, R.; Ahmad, R.B.; Saeed, M.; Kamboh, A.M. Classification of multi-class motor imagery EEG using four band common spatial pattern. In Proceedings of the 2017 39th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC), Jeju, Republic of Korea, 11–15 July 2017; IEEE: Piscataway, NJ, USA, 2017; pp. 1034–1037.
9. Müller-Gerking, J.; Pfurtscheller, G.; Flyvbjerg, H. Designing optimal spatial filters for single-trial EEG classification in a movement task. Clin. Neurophysiol. 1999, 110, 787–798.
10. Ramoser, H.; Müller-Gerking, J.; Pfurtscheller, G. Optimal spatial filtering of single trial EEG during imagined hand movement. IEEE Trans. Rehabil. Eng. 2000, 8, 441–446.
11. Blankertz, B.; Tomioka, R.; Lemm, S.; Kawanabe, M.; Muller, K. Optimizing Spatial filters for Robust EEG Single-Trial Analysis. IEEE Signal Process. Mag. 2008, 25, 41–56.
12. Wu, W.; Gao, X.; Gao, S. One-versus-the-rest (OVR) algorithm: An extension of common spatial patterns (CSP) algorithm to multi-class case. In Proceedings of the 2005 IEEE Engineering in Medicine and Biology 27th Annual Conference, Shanghai, China, 17–18 January 2006; IEEE: Piscataway, NJ, USA, 2006; pp. 2387–2390.
13. Ang, K.K.; Chin, Z.Y.; Wang, C.; Guan, C.; Zhang, H. Filter bank common spatial pattern algorithm on BCI competition IV datasets 2a and 2b. Front. Neurosci. 2012, 6, 39.
14. Miao, Y.; Jin, J.; Daly, I.; Zuo, C.; Wang, X.; Cichocki, A.; Jung, T.P. Learning common time-frequency-spatial patterns for motor imagery classification. IEEE Trans. Neural Syst. Rehabil. Eng. 2021, 29, 699–707.
15. Sun, G.; Hu, J.; Wu, G. A novel frequency band selection method for common spatial pattern in motor imagery based brain computer interface. In Proceedings of the 2010 International Joint Conference on Neural Networks (IJCNN), Barcelona, Spain, 18–23 July 2010; IEEE: Piscataway, NJ, USA, 2010; pp. 1–6.
16. Korhan, N.; Dokur, Z.; Olmez, T. Motor imagery based EEG classification by using common spatial patterns and convolutional neural networks. In Proceedings of the 2019 Scientific Meeting on Electrical-Electronics & Biomedical Engineering and Computer Science (EBBT), Istanbul, Turkey, 24–26 April 2019; IEEE: Piscataway, NJ, USA, 2019; pp. 1–4.
17. Yahya, N.; Musa, H.; Ong, Z.Y.; Elamvazuthi, I. Classification of motor functions from electroencephalogram (EEG) signals based on an integrated method comprised of common spatial pattern and wavelet transform framework. Sensors 2019, 19, 4878.
18. Zhang, C.; Eskandarian, A. A computationally efficient multiclass time-frequency common spatial pattern analysis on EEG motor imagery. In Proceedings of the 2020 42nd Annual International Conference of the IEEE Engineering in Medicine & Biology Society (EMBC), Montreal, QC, Canada, 20–24 July 2020; IEEE: Piscataway, NJ, USA, 2020; pp. 514–518.
19. Li, S.; Feng, H. EEG signal classification method based on feature priority analysis and CNN. In Proceedings of the 2019 International Conference on Communications, Information System and Computer Engineering (CISCE), Haikou, China, 5–7 July 2019; IEEE: Piscataway, NJ, USA, 2019; pp. 403–406.
20. Muñoz, G.R. Analysis and classification of electroencephalographic signals (EEG) to identify arm movements. In Proceedings of the 2013 10th International Conference on Electrical Engineering, Computing Science and Automatic Control (CCE), Mexico City, Mexico, 30 September–4 October 2013; IEEE: Piscataway, NJ, USA, 2013; pp. 138–143.
21. Alomari, M.H.; Samaha, A.; AlKamha, K. Automated classification of L/R hand movement EEG signals using advanced feature extraction and machine learning. arXiv 2013, arXiv:1312.2877.
22. Shedeed, H.A.; Issa, M.F.; El-Sayed, S.M. Brain EEG signal processing for controlling a robotic arm. In Proceedings of the 2013 8th International Conference on Computer Engineering & Systems (ICCES), Cairo, Egypt, 26–28 November 2013; IEEE: Piscataway, NJ, USA, 2013; pp. 152–157.
23. Ramadhan, M.M.; Wijaya, S.K.; Prajitno, P. Classification of EEG signals from motor imagery of hand grasp movement based on neural network approach. In Proceedings of the 2019 IEEE International Conference on Signals and Systems (ICSigSys), Bandung, Indonesia, 16–18 July 2019; IEEE: Piscataway, NJ, USA, 2019; pp. 92–96.
24. Zhou, X.; Zou, R.; Huang, X. Single upper limb functional movements decoding from motor imagery EEG signals using wavelet neural network. Biomed. Signal Process. Control 2021, 70, 102965.
25. Alexandridis, A.K.; Zapranis, A.D. Wavelet neural networks: A practical guide. Neural Netw. 2013, 42, 1–27.
26. Kok, C.L.; Ho, C.K.; Tan, F.K.; Koh, Y.Y. Machine Learning-Based Feature Extraction and Classification of EMG Signals for Intuitive Prosthetic Control. Appl. Sci. 2024, 14, 5784.
27. Urigüen, J.A.; Garcia-Zapirain, B. EEG artifact removal—State-of-the-art and guidelines. J. Neural Eng. 2015, 12, 031001.
28. Lu, N.; Li, T.; Ren, X.; Miao, H. A Deep Learning Scheme for Motor Imagery Classification based on Restricted Boltzmann Machines. IEEE Trans. Neural Syst. Rehabil. Eng. 2017, 25, 566–576.
29. Zhang, G.; Davoodnia, V.; Sepas-Moghaddam, A.; Zhang, Y.; Etemad, A. Classification of Hand Movements from EEG Using a Deep Attention-Based LSTM Network. IEEE Sens. J. 2020, 20, 3113–3122.
30. Kok, C.L.; Ho, C.K.; Dai, Y.; Lee, T.K.; Koh, Y.Y.; Chai, J.P. A Novel and Self-Calibrating Weighing Sensor with Intelligent Peristaltic Pump Control for Real-Time Closed-Loop Infusion Monitoring in IoT-Enabled Sustainable Medical Devices. Electronics 2024, 13, 1724.
31. Welch, P. The use of fast Fourier transform for the estimation of power spectra: A method based on time averaging over short, modified periodograms. IEEE Trans. Audio Electroacoust. 1967, 15, 70–73.
32. Gao, R.X.; Yan, R. From Fourier Transform to Wavelet Transform: A Historical Perspective. In Wavelets: Theory and Applications for Manufacturing; Gao, R.X., Yan, R., Eds.; Springer: Boston, MA, USA, 2011; pp. 17–32.
33. Sifuzzaman, M.; Islam, M.R.; Ali, M. Application of wavelet transform and its advantages compared to Fourier transform. Int. J. Mag. Eng. Technol. Manag. Res. 2009, 3, 1078–1083.
34. Peterson, L.E. K-nearest neighbor. Scholarpedia 2009, 4, 1883.
35. Jain, A.K.; Mao, J.; Mohiuddin, K.M. Artificial neural networks: A tutorial. Computer 1996, 29, 31–44.
36. O’Shea, K.; Nash, R. An introduction to convolutional neural networks. arXiv 2015, arXiv:1511.08458.
37. Szegedy, C.; Liu, W.; Jia, Y.; Sermanet, P.; Reed, S.; Anguelov, D.; Erhan, D.; Vanhoucke, V.; Rabinovich, A. Going deeper with convolutions. In Proceedings of the 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Boston, MA, USA, 7–12 June 2015; pp. 1–9.
38. Chen, J.; Teo, T.H.; Kok, C.L.; Koh, Y.Y. A Novel Single-Word Speech Recognition on Embedded Systems Using a Convolution Neuron Network with Improved Out-of-Distribution Detection. Electronics 2024, 13, 530.
39. Subasi, A. EEG signal classification using wavelet feature extraction and a mixture of expert model. Expert Syst. Appl. 2007, 32, 1084–1093.
40. Bashivan, P.; Rish, I.; Yeasin, M.; Codella, N. Learning representations from EEG with deep recurrent-convolutional neural networks. In Proceedings of the IEEE International Conference on Learning Representations, San Juan, Puerto Rico, 2–4 May 2016.
41. Liang, S.F.; Wang, W.H.; Chang, Y.C.; Liu, Y.H. Combination of EEG complexity and spectral analysis for epilepsy diagnosis and seizure detection. EURASIP J. Adv. Signal Process. 2010, 2010, 853434.
42. Jirayucharoensak, S.; Pan-Ngum, S.; Israsena, P. EEG-based emotion recognition using deep learning network with principal component based covariate shift adaptation. Sci. World J. 2014, 2014, 627892.
43. Pfurtscheller, G.; Brunner, C.; Schlögl, A.; Lopes da Silva, F.H. Mu rhythm (de) synchronization and EEG single-trial classification of different motor imagery tasks. NeuroImage 2006, 31, 153–159.
44. Kumar, S.; Sharma, A.; Tsunoda, T. Brain wave classification using long short-term memory network based OPTICAL algorithm. IEEE Access 2019, 7, 18989–19000.
45. Kok, C.L.; Dai, Y.; Lee, T.K.; Koh, Y.Y.; Teo, T.H.; Chai, J.P. A Novel Low-Cost Capacitance Sensor Solution for Real-Time Bubble Monitoring in Medical Infusion Devices. Electronics 2024, 13, 1111.
46. Zhou, W.; Liu, Y.; Yuan, Q.; Li, X. Epileptic seizure detection using lacunarity and Bayesian linear discriminant analysis in intracranial EEG. IEEE Trans. Biomed. Eng. 2013, 60, 3375–3381.
47. Hu, Y.; Gao, Y.; Li, H. Classification of EEG signals using a multiple kernel learning support vector machine. Sensors 2010, 10, 6205–6220.
48. Al-Nashash, H.; Shaker, H.; Al-Khanfar, K. EEG signal classification using Fourier Transform. IEEE Sens. J. 2013, 13, 2887–2894.
49. Acharya, U.R.; Oh, S.L.; Hagiwara, Y.; Tan, J.H.; Adeli, H. Deep convolutional neural network for the automated detection and diagnosis of seizure using EEG signals. Comput. Biol. Med. 2018, 100, 270–278.
50. Luciw, M.D.; Jarocka, E.; Edin, B.B. Multi-channel EEG recordings during 3,936 grasp and lift trials with varying weight and friction. Sci. Data 2014, 1, 140047.
51. Aung, K.H.H.; Kok, C.L.; Koh, Y.Y.; Teo, T.H. An Embedded Machine Learning Fault Detection System for Electric Fan Drive. Electronics 2024, 13, 493.
52. Lin, M.; Chen, Q.; Yan, S. Network in network. arXiv 2013, arXiv:1312.4400.
53. Kok, C.L.; Fu, X.; Koh, Y.Y.; Teo, T.H. A Novel Portable Solar Powered Wireless Charging Device. Electronics 2024, 13, 403.
54. Sadowsky, J. The continuous wavelet transform: A tool for signal investigation and understanding. Johns Hopkins APL Tech. Dig. 1994, 15, 306–318.
55. Cohen, M.X. A tutorial on generalized eigen decomposition for source separation in multichannel electrophysiology. arXiv 2021, arXiv:2104.12356.
56. Smith, J.; Kumar, A.; Zhao, L. EEG Signal Classification Using Deep Learning Techniques for Brain-Computer Interfaces. IEEE Trans. Biomed. Eng. 2023, 70, 1234–1245.
57. Lee, M.; Patel, R.; Wong, C. Feature Extraction and Classification of EEG Signals Using Wavelet Transform and Support Vector Machines. IEEE J. Biomed. Health Inform. 2022, 26, 3456–3465.
58. Johnson, T.; Garcia, K.; Anderson, P. Comparative Analysis of ICA and PCA for EEG Signal Denoising and Classification. IEEE Trans. Signal Process. 2024, 72, 89–101.
59. Li, F.; He, F.; Wang, F.; Zhang, D.; Xia, Y.; Li, X. A Novel Simplified Convolutional Neural Network Classification Algorithm of Motor Imagery EEG Signals Based on Deep Learning. Appl. Sci. 2020, 10, 1605.
60. Kok, C.L.; Tan, T.C.; Koh, Y.Y.; Lee, T.K.; Chai, J.P. Design and Testing of an Intramedullary Nail Implant Enhanced with Active Feedback and Wireless Connectivity for Precise Limb Lengthening. Electronics 2024, 13, 1519.
61. Sokolova, M.; Lapalme, G. A systematic analysis of performance measures for classification tasks. Inf. Process. Manag. 2009, 45, 427–437.
62. Kok, C.L.; Ho, C.K.; Lee, T.K.; Loo, Z.Y.; Koh, Y.Y.; Chai, J.P. A Novel and Low-Cost Cloud-Enabled IoT Integration for Sustainable Remote Intravenous Therapy Management. Electronics 2024, 13, 1801.
63. Liu, H.; Yao, M.; Xiao, X.; Zheng, B.; Cui, H. MarsScapes and UDAFormer: A Panorama Dataset and a Transformer-Based Unsupervised Domain Adaptation Framework for Martian Terrain Segmentation. IEEE Trans. Geosci. Remote Sens. 2024, 62, 4600117.
Figure 1. General Framework of the preliminary model and the proposed systems for classification.
Figure 2. Selected EEG channels with 10–20 international standards (left). A trial showing a participant lifting the object (right) [51].
Figure 3. Types of series for participant 1.
Figure 4. A trial of participant 1 showing 2 s duration before the LED on and 3 s duration after the LED Off.
Figure 5. Time information related to certain events. Column 8 is the trial start time, column 9 is the LED on time, and column 10 is the LED off time.
Figure 6. Raw EEG (blue) and filtered EEG signal (red) of participant 1.
Figure 7. Comparison between Welch’s and FFT on the same signal.
Figure 8. Scatter plots between two features (AUC and average beta power) for 4 events (left) and 2 events (right).
Figure 9. CSP Filtering.
Figure 10. An example of a scalogram. The frequency axis starts from 7 at the bottom and increases to 30 Hz at the top. The color of the scalogram shows the power of the corresponding frequencies.
Figure 11. Generation of RGB scalograms from the time projections to be used with GoogLeNet.
Figure 12. Layers of GoogLeNet as displayed by the MATLAB network analyzer. The last 3 layers (shown in red) are the replaced layers for transfer learning. The model under analysis is ML-CSP-OVR, whose softmax layer shows 4 output nodes since the model is a 4-way classification model.
Figure 13. Model 1 Training Framework.
Figure 14. Creation of Training and Test Sets.
Figure 15. Training and Test Pipelines of Models 1 and 2.
Figure 16. Respective validation accuracies of the 4 networks.
Figure 17. Predicted labels of model 2 (left) along with their probabilities (right).
Figure 18. Creation of Training and Test Sets.
Figure 19. The conceptualized framework of ML Model 3.
Figure 20. The pipeline of the ML-CSP-OVR model.
Figure 21. (a) Ten-fold cross-validation accuracies for the forward and reverse pair and the (b) grasp and release pair. (c) Confusion matrices showing 56.52% accuracy and (d) 51.89%.
Figure 22. (a) Scores of 4 performance metrics for 6 pairs of interest. (b) F1 score of the 6 events.
Figure 23. (a) FW-GP Scores of 4 performance metrics for intrasubject classification. (b) F1 score of the individual participants.
Figure 24. (a) Accuracy of ML Model 1. (b) Precision and sensitivity of ML Model 1. RMSE is a benchmark that measures the overall energy of a signal over a short period.
Figure 25. Precision and Sensitivity of ML Model 1 for Participants 1–11.
Figure 26. (a) Accuracy of ML Model 2. (b) Precision and sensitivity of ML Model 2.
Figure 27. Precision and Sensitivity of ML Model 2 for Participants 1–11.
Figure 28. One corner of the ML Model 2 framework.
Figure 31. Precision and Sensitivity of ML-CSP-OVR for intersubject classification with leave-one-out (top). Result of the Confusion Matrix (bottom).
Figure 32. Validation accuracy of the classification without the CSP-OVR method.
Figure 33. Performance comparisons of the three models, doing intrasubject classification on participant 12.
Figure 34. Performance comparisons of the three models in performing intersubject classification, using data from participants 1 to 11.
Table 1. Descriptions of the labels used for the motor actions in the project.
Motor Actions | Preliminary Study | Proposed Models
Forward Hand Movement | Event 1 | FW
Grasp | Event 2 | GP
Release | Event 3 | RL
Reverse Hand Movement | Event 4 | RV
Table 2. Six pairs for classification in the baseline system.
# | Pairs of Classes
1 | FW-GP
2 | FW-RL
3 | FW-RV
4 | GP-RL
5 | GP-RV
6 | RV-RL
Table 3. Extracted time and frequency features. PSTC stands for “Participants, Series, Trials, Channels”.
Alpha | Beta | Mean | Variance | Skewness | Kurtosis | AUC | PSTC | Event
0.01275 | 0.003457 | −0.28881 | 0.645119 | 0.614659 | 2.630329 | 0.363944 | 1-1-1-C3 | 1
0.011518 | 0.003002 | −0.37609 | 0.496391 | 0.286342 | 2.258485 | 0.326036 | 1-1-1-Cz | 1
0.006971 | 0.01061 | −0.5993 | 1.002074 | 0.502012 | 2.513393 | 0.503606 | 1-1-1-C4 | 1
0.002879 | 0.003147 | −0.38887 | 0.91272 | 0.568186 | 2.535077 | 0.443563 | 1-1-2-C3 | 1
0.019858 | 0.005625 | −0.22326 | 1.088628 | 0.234284 | 2.109265 | 0.458906 | 1-1-2-Cz | 1
0.016445 | 0.00144 | −0.27948 | 1.331032 | 0.309454 | 2.014022 | 0.517982 | 1-1-2-C4 | 1
Table 4. Intersubject Classification Results (Average Values).
Classification Pair | Accuracy (%) | Precision (%) | Sensitivity (%) | Specificity (%) | F1-Score
FW-GP | 80.87 | 75.99 | 90.25 | 71.48 | 0.8251
FW-RL | 82.31 | 87.29 | 75.63 | 88.99 | 0.8104
FW-RV | 84.39 | 84.83 | 83.75 | 85.02 | 0.8429
GP-RL | 98.56 | 99.45 | 97.56 | 99.46 | 0.9854
GP-RV | 99.73 | 100 | 99.46 | 100 | 0.9973
RL-RV | 96.12 | 92.8 | 100 | 92.24 | 0.9626
Table 5. FW-GP Intrasubject Classification Results.
Participant | Accuracy (%) | Precision (%) | Sensitivity (%) | Specificity (%) | F1-Score
1 | 93 | 92.16 | 94 | 92 | 0.9307
2 | 100 | 100 | 100 | 100 | 1
3 | 52 | 56.25 | 18 | 86 | 0.2727
4 | 99 | 98.04 | 100 | 98 | 0.9901
5 | 96 | 92.59 | 100 | 96 | 0.9615
6 | 90 | 90 | 90 | 90 | 0.9
8 | 82 | 89.09 | 68 | 96 | 0.7907
9 | 93 | 89.09 | 98 | 88 | 0.9333
10 | 100 | 100 | 100 | 100 | 1
11 | 100 | 100 | 100 | 100 | 1
12 | 100 | 100 | 100 | 100 | 1
Table 6. Grasp and Release results from the paper.
Events | Accuracy (%) | Precision (%) | Sensitivity (%) | Specificity (%)
GP (Grasp) vs. Non-GP | 93.8 | 96.1 | 91.8 | 92.0
RL (Release) vs. Non-RL | 94.2 | 93.1 | 95.5 | 92.9
Table 7. Grasp and Release averaged results in the baseline system.
Events | Accuracy (%) | Precision (%) | Sensitivity (%) | Specificity (%)
GP (Grasp) vs. Non-GP | 93.05 | 91.81 | 95.76 | 90.31
RL (Release) vs. Non-RL | 92.33 | 93.18 | 91.06 | 93.56