Next Article in Journal
Neuroprotective Potentials of Berberine in Rotenone-Induced Parkinson’s Disease-like Motor Symptoms in Rats
Previous Article in Journal
Ictal Fear or Panic Attack, This Is the Question—A Video–EEG Study
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

A Hybrid EEG-Based Stress State Classification Model Using Multi-Domain Transfer Entropy and PCANet

1
School of Biomedical Engineering (Suzhou), Division of Life Sciences and Medicine, University of Sciences and Technology of China, No.96, Jinzhai Road, Hefei 230026, China
2
Suzhou Institute of Biomedical Engineering and Technology, Chinese Academy of Sciences, No.88, Keling Road, Suzhou 215163, China
3
School of Psychology, Beijing Sport University, Beijing 100084, China
4
Luo Yang Institute of Science and Technology, No. 90, Wangcheng Avenue, Luoyang 471023, China
*
Author to whom correspondence should be addressed.
Brain Sci. 2024, 14(6), 595; https://doi.org/10.3390/brainsci14060595
Submission received: 8 May 2024 / Revised: 6 June 2024 / Accepted: 10 June 2024 / Published: 12 June 2024
(This article belongs to the Section Cognitive Social and Affective Neuroscience)

Abstract

:
This paper proposes a new hybrid model for classifying stress states using EEG signals, combining multi-domain transfer entropy (TrEn) with a two-dimensional PCANet (2D-PCANet) approach. The aim is to create an automated system for identifying stress levels, which is crucial for early intervention and mental health management. A major challenge in this field lies in extracting meaningful emotional information from the complex patterns observed in EEG. Our model addresses this by initially applying independent component analysis (ICA) to purify the EEG signals, enhancing the clarity for further analysis. We then leverage the adaptability of the fractional Fourier transform (FrFT) to represent the EEG data in time, frequency, and time–frequency domains. This multi-domain representation allows for a more nuanced understanding of the brain’s activity in response to stress. The subsequent stage involves the deployment of a two-layer 2D-PCANet network designed to autonomously distill EEG features associated with stress. These features are then classified by a support vector machine (SVM) to determine the stress state. Moreover, stress induction and data acquisition experiments are designed. We employed two distinct tasks known to trigger stress responses. Other stress-inducing elements that enhance the stress response were included in the experimental design, such as time limits and performance feedback. The EEG data collected from 15 participants were retained. The proposed algorithm achieves an average accuracy of over 92% on this self-collected dataset, enabling stress state detection under different task-induced conditions.

1. Introduction

The stress state is an external emotional state that individuals undergo in response to both internal and external tense environmental stimuli. Stressful events are inevitable in life, and they trigger a series of reactions in the human body in unexpected tense situations, leading to stress pressure [1]. For instance, in a state of stress, the body exhibits physical reactions such as muscle tension, elevated blood pressure, and shortness of breath; emotional reactions such as anxiety, tension, and panic; and behavioral responses including avoidance, hesitation, and aggression [2]. While moderate stress pressure can enhance human activity efficiency and adaptive capabilities, excessive or prolonged stress states can have adverse effects on both physical and mental well-being and—in severe cases—pose a threat to human life. Therefore, effective identification of the human stress state is of significant importance for timely stress intervention and regulation to mitigate the associated risks to physical and mental health [3].
Traditional methods for stress assessment mainly rely on non-invasive approaches such as speech and facial expression analysis. Gupta et al. [4] proposed a driver stress detection method by analyzing facial expressions to monitor stress levels based on a hybrid deep convolutional neural network model. Meanwhile, Chu et al. [5] designed an automatic speech analysis program to evaluate stress burden and psychological health. However, these methods often focus on individual subjective tendencies or only consider the external manifestations of stress, resulting in limited evaluation mechanisms and significant individual variations in recognition accuracy. Recognizing and evaluating stress via physiological signals is of great significance since the changes in physiological features during stress are inherently difficult to conceal. Electroencephalogram (EEG) signals, which capture the overall electrical activity of brain neurons on the scalp or cortical surface, contain abundant physiological and psychological information. Consequently, they serve as an important objective basis for human stress state monitoring [6].
The analysis of EEG signals poses significant challenges due to their nonlinearity, non-stationarity, and susceptibility to interference. Currently, commonly used methods for EEG signal analysis include time domain, frequency domain, time–frequency domain analysis, and nonlinear dynamical analysis [7]. In recent years, EEG signals have been widely applied in stress recognition [8]. The literature review shows a major challenge in establishing a successful stress detection model: extracting emotion-related information from complex EEG signals. However, single-angle analyses in existing approaches reveal limited key features, making them ineffective in differentiating between different stress states.
To address this challenge, we propose a stress state recognition algorithm based on multi-domain transfer entropy (TrEn) and principal component analysis network (PCANet). Based on the flexibility of the fractional Fourier transform (FrFT) [9], we establish representations of TrEn in the time domain, frequency domain, and time–frequency domain, cleverly integrating temporal–spectral information and nonlinear information. Furthermore, to overcome the limitations of manual feature extraction, PCANet plays a crucial role in this approach through the automatic extraction of high-level emotional information from EEG signals. This achieves considerable performance gains with lower computational complexity, enhancing the reliability of the model in EEG classification. The present study focuses on exploring an effective stress state detection method through EEG. This study aims to contribute significantly to the field of affective computing and stress-related research, offering a robust and reliable method for stress state detection and analysis.
In summary, the main contributions of this work are listed as follows:
(1)
By ingeniously leveraging the flexibility of fractional Fourier transform in varying the transform order, we achieved the representation of EEG signals in the time domain, frequency domain, and time–frequency domain. This established a multi-domain transfer entropy-based representation scheme for EEG signals, allowing the fusion of temporal–spectral information and better revealing the hidden details in EEG signals.
(2)
We introduced deep PCANet, which automatically learns features from low-level and high-level EEG patterns within a supervised learning framework, instead of relying on manually selected features. This effectively avoids the subjectivity introduced by manual feature selection and enhances the model’s generalization capability and robustness.
(3)
We designed a well-defined stress induction paradigm and collected EEG data from multiple participants. The proposed algorithm was validated on actual stress-inducing EEG data, and the experimental results demonstrated its effectiveness in the automatic detection of stress states based on EEG signals.
The structure of this paper is organized as follows. Section 2 presents the proposed algorithm framework. Section 3 explains the paradigm for EEG data collection. Section 4 analyzes the experimental results. Finally, Section 5 provides the discussion, and Section 6 presents the conclusion.

2. Data Collection Paradigms and Schemes

2.1. Stress Induction Paradigm

We collected and analyzed EEG data from 30 healthy male university students aged 18–23. Each participant performed two sets of game task experiments, employing two stress induction paradigms: the 2-back task [10] and the Balloon Analog Risk Task (BART) [11,12]. Our experimental design includes additional stress-inducing elements that complement the two tasks. For example, we have incorporated time constraints and performance feedback to further enhance the stress response. Previous studies [13,14] have demonstrated the effectiveness of the tests in inducing stress.
For the 2-back paradigm, the experimental stimuli consisted of English letters such as W, R, E, Q, etc. The experiment began with the presentation of a fixation point “+” for 1000 ms, followed by the experimental stimuli for 2000 ms. Participants were instructed to compare the current stimulus with the one presented two trials back. They were required to press the F key if the stimuli were the same and the J key if they differed, with a time limit of 2000 ms to respond. The experiment consisted of a total of 50 trials, and the participants’ response results were recorded by the computer. After the experiment, the average accuracy rate of the participants was calculated. The specific task paradigm is shown in Figure 1.
In the BART experimental paradigm, a deflated balloon appears at the center of the computer screen. With each press of the “inflate” key, the balloon expands by 0.3 cm, and the participant earns 1 point. However, each balloon randomly bursts between 1 and 30 inflations. During the inflation process, participants have the option to halt inflation at any point and transfer the current balloon earnings to their total earnings. In the event of a balloon bursting, the points accumulated for the “current balloon earnings” reset to zero. Participants can either continue inflating until the balloon bursts or press the “stop inflating” key to save the current earnings, thereby ending the trial. Subsequently, another balloon is presented to initiate a new trial, and this process continues until the end of the experimental session. The specific task paradigm is shown in Figure 2.

2.2. Stress Induction Protocol

The basic procedure of the laboratory simulated stress induction experiment is as follows:
(1)
Experimental Setup: The experiment takes place in a quiet environment. Prior to the formal experiment, participants receive instructions regarding the experimental procedures and game controls. They are advised to maintain a calm state of mind. Before stress induction, their pre-stress EEG signals are recorded. After EEG data collection, participants complete the Subjective Anxiety Inventory (SAI) to assess their pre-stress emotional state.
(2)
Stress Task Commencement: Participants engage in a game task under time constraints and negative feedback. During the game, instances of errors prompt real-time negative feedback such as “game failed” or “No points,” accompanied by a time penalty of a 10% reduction in the allocated task time. In order to increase the sense of stress, we will suddenly inform the participant before the task that the results of this task will be ranked and announced, and the final results will affect the final bonus amount.
(3)
Stress Task Completion: After the stress task, participants again fill out the SAI to assess their post-stress emotional state.
Each individual experiment lasts 16 min. The experimental procedure is depicted in Figure 3, with 5 min allocated for resting-state EEG data collection and another 5 min for stress-state EEG data collection. Each participant undergoes EEG data collection for the 2-back task and BART stress tasks on two separate days, with consistent data collection procedures.

2.3. Data Collection and Preprocessing

In this experiment, the EEG amplifier produced by Neuroscan was used for data collection. This equipment consists of 64 unipolar, 4 bipolar, and 2 high-level inputs, totaling 70 channels; while the sampling rate can reach 20,000 Hz. All channels use high-quality 24-bit A/D analog-to-digital chips to process data, which are transmitted to the computer via a USB 2.0 interface. We only applied a total of 32 scalp electrodes, including reference electrodes, positioned in accordance with the international 10–20 electrode placement system. The specific layout is shown in Figure 4. The sample frequency used for this experiment was set to 512 Hz. To ensure the validity and standardization of the experimental data, subjective rating scores before and after stressful scenario simulation assisted in the selection of EEG data. Samples with data loss due to uncontrollable reasons, those with no significant change in rating scores before and after the task, and samples where the game task was not successfully completed were excluded. Consequently, EEG data from 15 participants were retained for each of the two different stress tasks.
Before formal analysis of the EEG data related to stress states, it is necessary to perform simple preprocessing steps to standardize the data. Firstly, all EEG data undergo baseline correction by subtracting this average from each point in the EEG waveform. Subsequently, the Common Average Reference (CAR) method is applied to reduce the coupling effect between acquisition channels. Finally, the EEG signals are subjected to a band-pass filter and a 50 Hz notch filter to remove frequency components outside the range of 0.5–60 Hz and power line interference, respectively [15]. After simple data collation, the quality of EEG signals is preliminarily improved. In this paper, the EEG data are segmented using a sliding window of 2 s in length, without overlap between the windows.

3. Method

The stress state recognition algorithm proposed in this paper, based on multi-domain transfer entropy and PCANet, consists of five stages: independent component analysis (ICA) denoising, multi-domain FrFT decomposition, transfer entropy matrix generation, PCANet feature learning, and feature classification.
The architecture of the proposed algorithm is illustrated in Figure 5. In the first stage, ICA is applied to eliminate ocular and muscular artifacts from the EEG signals. Subsequently, in the second stage, the EEG signals undergo projection into different domains by adjusting the rotation angle of the FrFT. In the third stage, the transfer entropy matrix of the EEG signals is computed under different decomposition perspectives across different domains. In the fourth stage, the 2D-PCANet, based on multi-domain transfer entropy and PCANet, is employed to automatically learn and discover features deeply rooted in the generated multi-domain transfer entropy matrix. This effectively captures the differences among various types of EEG signals. Finally, in the classification process, an SVM is used to assign a label to each extracted PCANet feature.

3.1. Independent Component Analysis (ICA)

ICA is used for array processing and data analysis. The aim is to recover unobserved signals from observed mixed signals under the assumption of signal independence [16,17]. The process begins with principal component analysis (PCA)—equivalent to singular value decomposition—followed by the recombination of these principal components to achieve temporal independence among the signals. Ideally, the number of collected mixed signals should equal or exceed the number of source signals [18]. Unlike other statistical methods, ICA measures independence using metrics such as mutual information, negative entropy, or correlation-based measures [19].
In the statistical model of ICA, considering  m  source signals and  n  observation points, the  n  observed values  x  are modeled as a linear combination of  m  sources:
x i = j = 1 m a i j s j i = 1 , n
where  a i  are real numbers, and  s j  are statistically independent.
The variable  s j  represents latent variables that cannot be directly measured and are referred to as independent components. On the other hand, the observed results,  x i , are the only measurable observations, and they can be represented as a vector matrix:
X = A S
where  X  is an n-dimensional observed random vector,  A  is the mixing matrix, and  S  is the matrix of independent sources, the goal is to estimate the original independent sources  S  from the observed values  X  [20].
Statistical independence between two source signals is essential in ICA. To ensure this, any two functions  h 1  and  h 2  must satisfy:
E h 1 s 1 , h 2 s 2 = E h 1 s 1 E h 2 s 2
Thus, independence implies uncorrelatedness but not vice versa. A new observation matrix  Z  is obtained by whitening the original observed matrix  X . Whitening involves subtracting the mean from  X  and applying principal component analysis to extract the orthogonal components that best represent the data, expressed as:
Z = Q X = Q A S = B S
where  Q  is the whitening matrix, and  B  is an orthogonal matrix, which can be represented as:
Z = B S
Then, it follows that that:
E Z Z T = E B S S T B T = B I B T = I
where  I  is the identity matrix [21,22].
Based on Equation (2), the estimation of independent components is expressed as:
Y = W X
where  Y  represents the estimated  S , and  W  is the estimated demixing matrix, ideally given by  W = A 1 . After whitening the original data, Equation (7) can be rewritten as:
Y = W X = B T Z = B T Q X = B T Q A S
where  Y  is the estimation of the source signals, and  W  is the corresponding demixing matrix, which can be represented as [23]:
W = B T Q

3.2. Fractional Fourier Transform (FrFT)

FrFT is one of the extensions of the Fourier transform that preserves its characteristics while allowing signal transformation into a domain between time and frequency [9]. As a result, FrFT finds wide applications in fields such as communication, radar, and encryption [24].
The pth order FrFT of a signal  x ( t )  is represented as:
X p u = F p x t = + x t K p u , t d t
Here,  F [ · ]  represents the FrFT operator,  p  is the order of the FrFT transformation, and  K p ( u , t )  is the kernel function of the FrFT, given by:
K p u , t = 1 jcot   α 2 π e x p j u 2 + t 2 2 cot   α j u t sin   α α n π δ ( u t ) α = 2 n π δ ( u + t ) α = ( 2 n ± 1 ) π
where  α = p π / 2  is the rotation angle. The flexibility of FrFT arises from the flexible rotation angle  α δ ( t )  represents the Dirac delta function [25].
From the above formulas, it can be observed that when  p = 1 , i.e.,  α = π / 2 , the FrFT reduces to the ordinary Fourier transform:
X 1 u = F 1 x t = + x t K 1 u , t d t = 1 2 π + exp   j u t x t d t
Moreover, for  p = 0 , i.e.,  α = 0 X 0 u = x t , indicating no transformation applied to the signal. It is worth noting that  α  only appears in the trigonometric functions, making it periodic with a period of  2 π . Therefore,  p  has a period of 4. It is sufficient to observe the signal’s behavior for  α ( π , π ]  (or  p ( 2,2 ] ) [26].
The following properties of the kernel function can be derived:
K α u , t = K α u , t
+ K α u , t K α u , t d t = δ u u
Hence, the inverse transformation of the FrFT can be obtained as:
x t = F p X p u = + X p u K p u , t d u
Figure 6 illustrates the time–frequency domain representation of the FrFT.

3.3. Transfer Entropy (TrEn)

Transfer entropy (TrEn), introduced by Schreiber, serves as an enhancement of information entropy. It analyzes information flow between two systems by examining the past values of one system alongside the current observations of another system. It quantifies the dependency between the two systems, with a higher value of transfer entropy indicating increased information transfer and a stronger correlation between the systems [27].
The specific definition of transfer entropy between two random processes,  I  and  J , is as follows:
T J I = p ( i t + k , i t ( k ) , j t ( k ) ) l o g p ( i t + k | i t ( k ) , j t ( k ) ) p ( i t + k | i t ( k ) )
where  i t  and  j t  represent the observed values at time  t . This expression can be simplified to:
T J I = H ( i t + k | i t ) H ( i t + k | i t , j t )
where,  H ( i t + k | i t )  represents the conditional entropy, which measures the uncertainty of the variable at the current time given the knowledge of the past  k  time steps [28].

3.4. Principal Component Analysis Network (PCANet)

Principal Component Analysis Network (PCANet) is a convolutional neural network that builds upon PCA. It aims to address issues such as the need for a large number of training samples and the increasing number of parameters as research complexities increase. PCANet, proposed by Chan et al., 2014 [29], can be divided into three stages, and its network structure is as follows:
Assuming that the dataset comprises  N  training samples of size  m × n , represented as:
I = { I i } i = 1 N
The filter size is set constant,  k 1 × k 2 .
First Stage: Block sampling is performed on each training sample, followed by mean subtraction (centering) on the sampled results. The sampled results of the i-th training sample can be represented as a matrix:
X ¯ i = [ x ¯ i , 1 , x ¯ i , 2 , , x ¯ i , m n ] , x i , 1 , x i , 2 , , x i , m n R k 1 k 2
The same mean subtraction process is applied to each sample in the dataset to yield a new matrix:
X = X ¯ 1 , X ¯ 2 , , X ¯ N R k 1 k 2 × N m n
Assuming the i-th layer has  L i  filters, PCA is used to determine the principal component analysis matrix (standard orthogonal matrix) that minimizes the reconstruction error [30]:
m i n V R k 1 k 2 × L 1 X V V T X F 2 , s . t . V T V = I L 1
where  I L 1  is an  L 1 × L 1  identity matrix. According to the PCA principle,  L 1  is the eigenvector matrix of  X X T . Thus, the PCA filters are obtained by solving:
W l 1 m a t k 1 , k 2 q l X X T R k 1 × k 2 , l = 1 , 2 , , L 1
where  m a t  is a function that maps a vector to a new space, and  q l  is a function that solves the parameter eigenvector matrix. It represents the eigenvectors of the matrix  X X T , which form the mapping matrix containing the main feature information of the samples [31]. The first-layer convolution is performed on the original dataset  I . To ensure that the resulting  I i l  in the first stage matches the original image  I i , zero-padding is applied to the edges of the original image since the mapping in the first stage may reduce the image size. The following formula represents the convolution process of the i-th image with  L 1  feature maps:
I i l I i × W l 1 , i = 1 , 2 , , L 1
The results of the first-layer convolution for the i-th image are concatenated to form a feature map:
F i 1 I i × W l 1 l = 1 L 1 = I i 1 , , I i L 1
Second Stage: Similar to the first stage, the second stage also requires block sampling and mean subtraction on each input feature map. Next, the i-th feature map  I i l  is subjected to mean subtraction [32]:
Y ¯ i l = y ¯ i , l , 1 , y ¯ i , l , 2 , , y ¯ i , l , m n R k 1 k 2 × m n ^
The feature maps produced by convolving the l-th convolutional kernel in the first layer undergo mean subtraction and can be uniformly represented as:
Y l = Y ¯ 1 l , Y ¯ 2 1 , , Y ¯ N l R k 1 k 2 × N m n
Therefore, the entire output of the first layer convolution after mean subtraction can be represented as:
Y = Y 1 , Y 2 , , Y L 1 R k 1 k 2 × L 1 N m n
Next, we solve for the convolutional kernels in the second stage. We set the kernel size to be  k 1 × k 2  and the number of kernels to be  L 2 .
W l 2 m a t k 1 , k 2 q l Y Y T R k 1 × k 2 , l = 1 , 2 , , L 2
In the second stage, when convolving the  I i l  feature maps generated in the first stage,  L 2  corresponding feature maps are produced:
O i l I i l × W l 2 l = 1 L 2
After two rounds of PCA-based feature extraction on the i-th image,  L 1  ×  L 2  corresponding feature maps are obtained in the second stage:
O i l I i l × W l 2 l = 1 L 2 , l = 1 , 2 , , L 1
After the i-th image undergoes two layers of convolution, the final output results are concatenated, represented as:
F i 2 = I i W l 1 l = 1 L 1 × W l 2 l = 1 L 2 = O i 1 , , O i L 1 × L 2
From the previous convolution iterations, it can be observed that PCANet exhibits strong scalability, making it easy to construct deep neural networks for feature extraction [33]. Here, only a brief introduction to PCANet with two iterations is provided.
Third Stage: Due to the exponential growth of feature dimensions with  L 1  ×  L 2  feature maps generated in the second stage, the third stage employs hashing and histogram mapping methods to handle the feature maps and achieve discrete parameters and dimension reduction [34]. The specific steps are as follows:
First, binarization is performed on each feature matrix obtained from the second stage:
H I i l × W l 2 l = 1 L 2
where  H  is a step function: if the original element of a pixel is greater than 0, the corresponding value in the new matrix is set to 1; otherwise, it is set to 0.
Next, hash encoding is performed on the processed feature matrices:
T i l l = 1 L 2 2 l 1 H I i l × W l 2
In other words, the binary representation of cap L2 is treated as a decimal number for each pixel, and the binarized feature maps into integer images.
Then, the encoded matrices are divided into blocks, with histogram statistics performed on each block
f i B h i s t T i 1 , , B h i s t T i L 1 T R 2 L 2 L 1 B
Therefore, in this third stage, the feature maps generated in the previous stage are processed using hashing and histogram mapping techniques to achieve discrete parameters and reduce the feature dimensions [35]. Figure 7 shows the PCANet structure.

3.5. Support Vector Machine (SVM)

SVM, a machine learning algorithm proposed by Chapelle et al., operates on the principle of structural risk minimization. Essentially, it maps data into a higher-dimensional feature space and locates a hyperplane in this projected space that maximizes the margin between classes [36]. Figure 8 illustrates the core principle of SVM. Its widespread use can be attributed to its outstanding problem-solving capacity related to small samples, nonlinearity, and high-dimensional recognition [37]. Due to the high dimensionality of the feature vectors, this study employs a linear kernel SVM. After hyperparameter fine-tuning, the penalty parameter C was set to 2. The value of C is changed from 1 to 10; the one that provided the best results is selected in this paper.

4. Experimental Results and Analysis

4.1. ICA Denoising Analysis

In this study, the amplitude of the EEG signals was susceptible to artifacts such as eye movements and muscle activity. To address this issue and enhance signal quality, the filtered EEG signals were subjected to ICA. Figure 9 shows the original EEG signals alongside those after ICA denoising. From the figure, it can be observed that the EEG waveforms after ICA denoising appear smoother, with significantly reduced disturbances and a substantial reduction in sharp peaks. These observations indicate ICA effectively removes artifacts from the EEG signals.

4.2. Multi-Domain Representation Based on FrFT and TrEn

FrFT, a time–frequency analysis method, integrates both time and frequency domains, providing a unified representation in the time–frequency domain. Leveraging this characteristic, this study employs FrFT with different orders (p) to transform the EEG signals across the time, frequency, and time–frequency domains. Specifically, we set p to 0 (time domain), 0.5 (time-frequency domain), and 1 (frequency domain). To avoid the influence of complex values, we consider the absolute amplitude of the FrFT-transformed signals. Figure 10 illustrates the absolute coefficients of the sample EEG signals after multi-order FrFT transformation across different domains. As p changes, the EEG signals undergo transformations from the time domain to the time–frequency domain and then to the frequency domain, with the coefficients gradually becoming more compact and concentrated.
After the multi-domain representation, TrEn is calculated to quantify the information transfer between different channels of the EEG, enabling the characterization of brain spatial information transfer across different domains. Figure 11 shows the TrEn matrices of the EEG signals in the time domain, frequency domain, and time–frequency domain under different states. These matrices depict variations in information transfer between electrodes across different decomposition domains, indicating the disclosure of distinct brain state information. Analysis of the spatial connectivity matrices of TrEn in the frequency domain and time–frequency domain reveals a significant decrease in the interaction strength of causal information among brain regions during stress tasks. Very strong connections between these brain regions are associated with emotion. This observation indicates that stress can impact the efficiency of brain functioning. Under stress, the brain becomes more active.

4.3. Classification Based on PCANet

The obtained multi-domain TrEn matrices are spatially integrated through cascading and fed into PCANet for automatic feature learning. After multiple training iterations, it was observed that the learning performance of PCANet does not further improve with an increase in layers. Therefore, a two-layer PCANet is sufficient to capture an adequate amount of stress state information. The optimal parameters for the PCANet structure are determined as shown in Table 1.
The obtained multi-domain transfer entropy-based PCANet features are then fed into an SVM classifier for stress state recognition. To be convinced, the 10-fold cross validation method is employed to evaluate the performance of the proposed method in detecting stress states in two tasks, namely BART and 2-back, separately. The results of the binary stress state EEG classification for 15 datasets in the two different tasks are shown in Table 2. From the results in Table 2, it is observed that the proposed algorithm achieves an average recognition accuracy of 92.14%, an average sensitivity of 94.25%, and an average specificity of 87.11% in the BART task. In the 2-back task, the proposed algorithm achieves an average recognition accuracy of 93.31%, an average sensitivity of 93.40%, and an average specificity of 93.22%. However, the stress recognition performance varies significantly among different subjects. For example, in the 2-back task, the highest recognition rate can reach 98.64%, while the lowest can be as low as 88.64%, with a difference of 10%. This variation is mainly due to the individual differences in how stress is manifested, as different subjects may exhibit varying levels of emotional response to the same stress task.
To validate the effectiveness of the proposed multi-domain TrEn representation scheme, the study compared the recognition performance achieved by different single-domain TrEn representations. Figure 10 illustrates the overall recognition performance using different feature representation schemes. Specifically, T-TrEn, TF-TrEn, and F-TrEn represent TrEn representations in the single time domain, time-frequency domain, and frequency domain, respectively. On the other hand, Multi-TrEn represents the cascaded representation of the multi-domain fused TrEn. The proposed Multi-TrEn has achieved the best accuracies for both tasks. The results of other three methods are inferior to 90%. The results in Figure 12 demonstrate that the proposed multi-domain fusion strategy achieves information integration and complementarity, enhancing the learning efficiency of PCANet.
Furthermore, to further validate the performance of the proposed algorithm, this study conducted stress classification experiments in three states: Baseline, 2-Back, and BART. The results are shown in Table 3. The proposed algorithm achieved an average recognition accuracy of 91.81% for stress states in multi-stress tasks, along with an average Sen of 95.21% and an average Spe of 86.93%. The proposed multi-domain TrEn and PCANet models demonstrated significant advantages in both binary and ternary stress detection tasks, indicating the effectiveness and scalability of the algorithm.

5. Discussion

The present study introduces an automatic stress state recognition algorithm based on multi-domain transfer entropy and PCANet, aiming to identify an individual’s stress state through EEG signal analysis. The findings demonstrate that our proposed algorithm achieved an average accuracy rate exceeding 92% on a self-collected dataset, which is significant for the field of stress state detection.
Many researchers are working in this area. Sharma and his colleagues [38] utilized stationary wavelet transform to decompose EEG signals, extracted entropy-based features, and employed various evolutionary heuristic methods to optimize SVM. In their experiments, the Whale Optimization Algorithm optimized SVM and achieved better results. Tsai and his colleagues [39] collected EEG data from table tennis players, extracted features, and compared the performance of logistic regression, support vector machine, decision tree C4.5, classification and regression tree, random forest, and extreme gradient boosting (XGBoost) algorithms. Ultimately, XGBoost achieved an accuracy of 86.49% in classifying three stress states in a general model. Subhani and his colleagues [40] proposed a machine learning framework involving the analysis of EEG signals from stress participants. The results indicated an accuracy of 83.4% for multilevel stress recognition using this framework. The majority of studies have been conducted within the traditional single-domain framework, utilizing self-collected data. In this paper, we have proposed a novel method based on multi-domain TrEn and PCANet, which provides a new perspective for stress detection.
After ICA denoising processing, the quality of EEG signals was significantly enhanced, laying a solid foundation for subsequent feature extraction and state recognition. The EEG waveforms post-ICA denoising were smoother, indicating that this method effectively removed artifactual components from the signals, improving the signal-to-noise ratio. Leveraging the flexibility of the FrFT, we achieved multi-domain representations of EEG signals in the time domain, frequency domain, and time–frequency domain. The calculation of TrEn further revealed the state of information transfer between different brain regions, which is crucial for understanding the dynamic changes in the brain under stress (as shown in Figure 11). Experimental results indicate that as the stress task proceeds, the causal information interaction strength between subjects’ brain regions decreases, suggesting that stress states may affect the brain’s work efficiency. In terms of feature learning, the application of PCANet has shown its advantages in automatically extracting features from EEG signals. Within a supervised learning framework, PCANet can automatically learn both low-level and high-level features of EEG signals, avoiding the subjectivity brought about by manual feature selection, and enhancing the model’s generalization and robustness. SVM as a classifier also demonstrated good performance in this study. In summary, the automatic stress state recognition algorithm proposed in this study shows high accuracy and application potential in automatically detecting an individual’s stress state.
Although this study has achieved encouraging results, there is still room for improvement. One limitation in this paper is that the analysis of brain function was inadequate. Future research will pay more attention to brain function and brain networks under stress. We will expand the scope of experiments, including subjects of different ages, genders, and backgrounds, to enhance the model’s universality. Additionally, further diversifying stress-inducing paradigms and optimizing the algorithm structure may lead to further performance enhancement.

6. Conclusions

This paper proposes an automatic stress state recognition algorithm based on multi-domain TrEn and PCANet, providing a new approach for EEG-based emotion analysis. Based on the flexible transformation characteristics of the FrFT in the time–frequency domain, a multi-domain TrEn representation method is established, which integrates information from both time–frequency and spatial domains. This enables a multi-angle revelation of stress-related information. Furthermore, this study explores the application of PCANet—a deep feature learning model—in stress detection. This facilitates the automatic extraction of high-level and low-level features, eliminating the subjective influence of manual feature selection.
The proposed algorithm undergoes validation using authentic EEG data collected during stress-inducing tasks, showing recognition accuracies of over 92% in different stress-inducing tasks. This demonstrates its effectiveness and potential applications. Despite these promising outcomes, there is room for future research to further expand the experimental scope, diversify stress-inducing paradigms, brain function analysis, and optimize the algorithm structure.

Author Contributions

Y.D.: Conceptualization, methodology, formal analysis, investigation; data curation, visualization, writing—original draft preparation; L.X.: Methodology, formal analysis, investigation, data curation, J.Z.: Methodology, formal analysis, investigation, data curation; D.W.: Methodology, formal analysis, investigation, data curation; H.L.: Investigation, data curation; Y.S.: writing—review and editing, supervision; G.S.: writing—review and editing, supervision; W.F.: writing—review and editing, supervision. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the National Key R&D Program of China (Grant No. 2023YFF0713600).

Institutional Review Board Statement

The study was conducted in accordance with the Declaration of Helsinki, and approved by the Ethics Committee of Beijing Sport University (Ethical Approval Number: 2023313H, 30 November 2023).

Informed Consent Statement

Informed consent was obtained from all patients for being included in the study.

Data Availability Statement

The datasets generated for this study are available on request to corresponding authors. The data are not publicly available due to privacy and ethical reasons.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Jackson, E.D.; Payne, J.D.; Nadel, L.; Jacobs, W.J. Stress differentially modulates fear conditioning in healthy men and women. Biol. Psychiatry 2006, 59, 516–522. [Google Scholar] [CrossRef] [PubMed]
  2. Qi, M.; Gao, H.; Guan, L.; Liu, G.; Yang, J. Subjective stress, salivary cortisol, and electrophysiological responses to psychological stress. Front. Psychol. 2016, 7, 174421. [Google Scholar] [CrossRef] [PubMed]
  3. Kruse, O.; León, I.T.; Stalder, T.; Stark, R.; Klucken, T. Altered reward learning and hippocampal connectivity following psychosocial stress. NeuroImage 2018, 171, 15–25. [Google Scholar] [CrossRef] [PubMed]
  4. Gupta, C.; Kumar, M.; Yadav, A.; Yadav, D. FERNET: An Integrated Hybrid DCNN Model for Driver Stress Monitoring via Facial Expressions. Int. J. Pattern Recognit. Artif. Intell. 2023, 37, 2357002. [Google Scholar] [CrossRef]
  5. Chu, A.M.Y.; Lam, B.S.Y.; Tsang, J.T.Y.; Tiwari, A.; Yuk, H.; Chan, J.N.L.; So, M.K.P. An automatic speech analytics program for digital assessment of stress burden and psychosocial health. Ment. Health Res. 2023, 2, 15. [Google Scholar] [CrossRef] [PubMed]
  6. Pichandi, S.; Balasubramanian, G.; Chakrapani, V. Stress Level Based Emotion Classification Using Hybrid Deep Learning Algorithm. KSII Trans. Internet Inf. Syst. 2023, 17, 3099–3120. [Google Scholar]
  7. Djemili, R.; Djemili, I. Nonlinear and chaos features over EMD/VMD decomposition methods for ictal EEG signals detection. Comput. Methods Biomech. Biomed. Eng. 2023, 1–20. [Google Scholar] [CrossRef] [PubMed]
  8. Wang, Z.; Chen, M.; Feng, G. Study on driver cross-subject emotion recognition based on raw multi-channels EEG data. Electronics 2023, 12, 2359. [Google Scholar] [CrossRef]
  9. Zayed, A. Two-dimensional fractional Fourier transform and some of its properties. Integral Transform. Spec. Funct. 2018, 29, 553–570. [Google Scholar] [CrossRef]
  10. Yuan, Y.; Leung, A.W.; Duan, H.; Zhang, L.; Zhang, K.; Wu, J.; Qin, S. The effects of long-term stress on neural dynamics of working memory processing: An investigation using ERP. Sci. Rep. 2016, 6, 23217. [Google Scholar] [CrossRef]
  11. Zhu, Y.; Wang, Y.; Chen, P.; Lei, Y.; Yan, F.; Yang, Z.; Yang, L.; Wang, L. Effects of acute stress on risky decision-making are related to neuroticism: An fMRI study of the Balloon Analogue Risk Task. J. Affect. Disord. 2023, 340, 120–128. [Google Scholar] [CrossRef] [PubMed]
  12. Lighthall, N.R.; Mather, M.; Gorlick, M.A. Acute stress increases sex differences in risk seeking in the balloon analogue risk task. PLoS ONE 2009, 4, e6002. [Google Scholar] [CrossRef] [PubMed]
  13. Gündogdu, S.; Dogan, E.A.; Gülbetekin, E.; Çolak, Ö.H.; Polat, Ö. Evaluation of the EEG Signals and Eye Tracker Data for Working Different N-back Modes. Trait. Du Signal 2019, 36. [Google Scholar]
  14. Deng, Y.; Wang, M.; Rao, H. Risk-taking research based on the Balloon Analog Risk Task. Adv. Psychol. Sci. 2022, 30, 1377–1392. [Google Scholar] [CrossRef]
  15. Fatma, N.; Singh, P.; Siddiqui, M.K. Epileptic seizure detection in EEG signal using optimized convolutional neural network with selected feature set. Int. J. Artif. Intell. Tools 2023, 32, 2350045. [Google Scholar] [CrossRef]
  16. Grifanti, L.; Salimi-Khorshidi, G.; Beckmann, C.F.; Auerbach, E.J.; Douaud, G.; Sexton, C.E.; Zsoldos, E.; Ebmeier, K.P.; Filippini, N.; Mackay, C.E.; et al. ICA-based artefact removal and accelerated fMRI acquisition for improved resting state network imaging. NeuroImage 2014, 95, 232–247. [Google Scholar] [CrossRef] [PubMed]
  17. Rai, H.M.; Chatterjee, K. Hybrid adaptive algorithm based on wavelet transform and independent component analysis for denoising of MRI images. Measurement 2019, 144, 72–82. [Google Scholar] [CrossRef]
  18. Nolan, H.; Whelan, R.; Reilly, R.B. FASTER: Fully automated statistical thresholding for EEG artifact rejection. J. Neurosci. Methods 2010, 192, 152–162. [Google Scholar] [CrossRef]
  19. Delorme, A.; Palmer, J.; Onton, J.; Oostenveld, R.; Makeig, S. Independent EEG sources are dipolar. PLoS ONE 2012, 7, e30135. [Google Scholar] [CrossRef] [PubMed]
  20. Urigüen, J.A.; Garcia-Zapirain, B. EEG artifact removal-state-of-the-art and guidelines. J. Neural Eng. 2015, 12, 031001. [Google Scholar] [CrossRef]
  21. Chaumon, M.; Bishop, D.V.M.; Busch, N.A. A practical guide to the selection of independent components of the electroencephalogram for artifact correction. J. Neurosci. Methods 2015, 250, 47–63. [Google Scholar] [CrossRef]
  22. Winkler, I.; Debener, S.; Müller, K.R.; Tangermann, M. On the influence of high-pass filtering on ICA-based artifact reduction in EEG-ERP. In Proceedings of the 37th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC), Milan, Italy, 25–29 August 2015; IEEE: Piscataway, NJ, USA, 2015; pp. 4101–4105. [Google Scholar]
  23. Pontifex, M.B.; Miskovic, V.; Laszlo, S. Evaluating the efficacy of fully automated approaches for the selection of eyeblink ICA components. Psychophysiology 2017, 54, 780–791. [Google Scholar] [CrossRef] [PubMed]
  24. Fares, K.; Amine, K.; Salah, E. A robust blind color image watermarking based on Fourier transform domain. Optik 2020, 208, 164562. [Google Scholar] [CrossRef]
  25. You, Y.; Zhong, X.; Liu, G.; Yang, Z. Automatic sleep stage classification: A light and efficient deep neural network model based on time, frequency and fractional Fourier transform domain features. Artif. Intell. Med. 2022, 127, 102279. [Google Scholar] [CrossRef]
  26. Li, M.; Chen, W.; Zhang, T. FuzzyEn-based features in FrFT-WPT domain for epileptic seizure detection. Neural Comput. Appl. 2019, 31, 9335–9348. [Google Scholar] [CrossRef]
  27. Stefan, S.; Schorr, B.; Lopez-Rolon, A.; Kolassa, I.T.; Shock, J.P.; Rosenfelder, M.; Heck, S.; Bender, A. Consciousness indexing and outcome prediction with resting-state EEG in severe disorders of consciousness. Brain Topogr. 2018, 31, 848–862. [Google Scholar] [CrossRef] [PubMed]
  28. Ursino, M.; Ricci, G.; Magosso, E. Transfer entropy as a measure of brain connectivity: A critical analysis with the help of neural mass models. Front. Comput. Neurosci. 2020, 14, 45. [Google Scholar] [CrossRef]
  29. Chan, T.H.; Jia, K.; Gao, S.; Lu, J.; Zeng, Z.; Ma, Y. PCANet: A simple deep learning baseline for image classification. IEEE Trans. Image Process. 2015, 24, 5017–5032. [Google Scholar] [CrossRef] [PubMed]
  30. Yu, J.; Liu, J. Two-dimensional principal component analysis-based convolutional autoencoder for wafer map defect detection. IEEE Trans. Ind. Electron. 2020, 68, 8789–8797. [Google Scholar] [CrossRef]
  31. Ma, Y.; Chen, B.; Li, R.; Wang, C.; Wang, J.; She, Q.; Luo, Z.; Zhang, Y. Driving fatigue detection from EEG using a modified PCANet method. Comput. Intell. Neurosci. 2019, 2019, 4721863. [Google Scholar] [CrossRef]
  32. Li, M.; Chen, W. FFT-based deep feature learning method for EEG classification. Biomed. Signal Process. Control 2021, 66, 102492. [Google Scholar] [CrossRef]
  33. Aly, W.; Aly, S.; Almotairi, S. User-independent American sign language alphabet recognition based on depth image and PCANet features. IEEE Access 2019, 7, 123138–123150. [Google Scholar] [CrossRef]
  34. Abdelbaky, A.; Aly, S. Human action recognition using short-time motion energy template images and PCANet features. Neural Comput. Appl. 2020, 32, 12561–12574. [Google Scholar] [CrossRef]
  35. Soon, F.C.; Khaw, H.Y.; Chuah, J.H.; Kanesa, J. PCANet-based convolutional neural network architecture for a vehicle model recognition system. IEEE Trans. Intell. Transp. Syst. 2018, 20, 749–759. [Google Scholar] [CrossRef]
  36. Chapelle, O.; Haffner, P.; Vapnik, V.N. Support vector machines for histogram-based image classification. IEEE Trans. Neural Netw. 1999, 10, 1055–1064. [Google Scholar] [CrossRef]
  37. Richhariya, B.; Tanveer, M. EEG signal classification using universum support vector machine. Expert Syst. Appl. 2018, 106, 169–182. [Google Scholar] [CrossRef]
  38. Sharma, L.D.; Bohat, V.K.; Habib, M.; Al-Zoubi, A.M.; Faris, H.; Alijarah, I. Evolutionary inspired approach for mental stress detection using EEG signal. Expert Syst. Appl. 2022, 197, 116634. [Google Scholar] [CrossRef]
  39. Tsai, Y.-H.; Wu, S.-K.; Yu, S.-S.; Tsai, M.-H. Analyzing brain waves of table tennis players with machine learning for stress classification. Appl. Sci. 2022, 12, 8052. [Google Scholar] [CrossRef]
  40. Subhani, A.R.; Mumtaz, W.; Saad, M.N.B.M.; Kamel, N.; Malik, A.S. Machine learning framework for the detection of mental stress at multiple levels. IEEE Access 2017, 5, 13545–13555. [Google Scholar] [CrossRef]
Figure 1. 2-back task experimental paradigm.
Figure 1. 2-back task experimental paradigm.
Brainsci 14 00595 g001
Figure 2. BART task experimental paradigm.
Figure 2. BART task experimental paradigm.
Brainsci 14 00595 g002
Figure 3. Experimental Procedure for EEG Data Collection. The “W” is one of the stimulus letters.
Figure 3. Experimental Procedure for EEG Data Collection. The “W” is one of the stimulus letters.
Brainsci 14 00595 g003
Figure 4. Electrodes used and their placement. The 32 electrodes used in this paper is marked in black, while the white ones are spare.
Figure 4. Electrodes used and their placement. The 32 electrodes used in this paper is marked in black, while the white ones are spare.
Brainsci 14 00595 g004
Figure 5. The architecture of the proposed algorithm. The main steps including denoising, muti-domain feature representation, feature learning, classification, and results.
Figure 5. The architecture of the proposed algorithm. The main steps including denoising, muti-domain feature representation, feature learning, classification, and results.
Brainsci 14 00595 g005
Figure 6. Time–frequency decomposition of FrFT. The  α  in red is the rotation angle of the time–frequency plane. The axis after rotation is marked in blue.
Figure 6. Time–frequency decomposition of FrFT. The  α  in red is the rotation angle of the time–frequency plane. The axis after rotation is marked in blue.
Brainsci 14 00595 g006
Figure 7. PCANet network architecture.
Figure 7. PCANet network architecture.
Brainsci 14 00595 g007
Figure 8. Illustration of SVM principle. The red and blue dots represent for two different kinds of samples. The green line represents for hyperplane.
Figure 8. Illustration of SVM principle. The red and blue dots represent for two different kinds of samples. The green line represents for hyperplane.
Brainsci 14 00595 g008
Figure 9. Comparison of original EEG signal and ICA denoised EEG; (a) Baseline, (b) 2-back, (c) BART.
Figure 9. Comparison of original EEG signal and ICA denoised EEG; (a) Baseline, (b) 2-back, (c) BART.
Brainsci 14 00595 g009
Figure 10. Absolute coefficients of the sample EEG after FrFT transformation; (a) p = 0 (time domain), (b) p = 0.5 (time–frequency domain), (c) p = 1 (frequency domain).
Figure 10. Absolute coefficients of the sample EEG after FrFT transformation; (a) p = 0 (time domain), (b) p = 0.5 (time–frequency domain), (c) p = 1 (frequency domain).
Brainsci 14 00595 g010
Figure 11. Connectivity matrix of TrEn for spatial connections in different domains; (a) Baseline, (b) 2-back, (c) BART.
Figure 11. Connectivity matrix of TrEn for spatial connections in different domains; (a) Baseline, (b) 2-back, (c) BART.
Brainsci 14 00595 g011
Figure 12. Performance comparison of TrEn in different single domains and multi-domain fusion. Specifically, T-TrEn, TF-TrEn, and F-TrEn represent TrEn representations in the single time domain, time–frequency domain, and frequency domain, respectively. Multi-TrEn represents the cascaded representation of the multi-domain fused TrEn.
Figure 12. Performance comparison of TrEn in different single domains and multi-domain fusion. Specifically, T-TrEn, TF-TrEn, and F-TrEn represent TrEn representations in the single time domain, time–frequency domain, and frequency domain, respectively. Multi-TrEn represents the cascaded representation of the multi-domain fused TrEn.
Brainsci 14 00595 g012
Table 1. PCANet structural parameters.
Table 1. PCANet structural parameters.
ParameterValueParameterValue
Layer numbers2Filter numbers (L1 × L2)9 × 9
Input matrix size (m × n)32 × 96Block size32 × 32
Patch size (k1 × k2)3 × 3Block overlap ratio0.5
Table 2. Stress detection performance of the proposed method in different stress-inducing tasks. The final average results are in bold.
Table 2. Stress detection performance of the proposed method in different stress-inducing tasks. The final average results are in bold.
GroupBART2-Back
Acc (%)Sen (%)Spe (%)Acc (%)Sen (%)Spe (%)
194.7695.3593.6690.3393.1087.66
294.6596.6091.0093.229096.33
388.3789.8285.6690.5088.2792.66
493.4896.6087.6698.6497.9399.33
593.4895.7189.3392.8896.2089.66
694.5397.58994.0697.9390.33
788.7293.7589.3392.7195.5190
892.558688.3395.5996.2095
990.6991.9688.3394.4093.7995
1093.1395.5388.6690.8490.6891
119093.3983.6692.5492.4192.66
1291.9796.0784.3398.3097.9398.66
1389.5393.578292.7189.3196
1493.7295.3590.6794.2392.4196
1592.5596.608588.6489.3188
Ave92.1494.2587.1193.3193.4093.22
Acc: Accuracy, Sen: Sensitivity, Spe: Specificity.
Table 3. Stress state classification for baseline, 2-Back, and BART modes. The final average results are in bold.
Table 3. Stress state classification for baseline, 2-Back, and BART modes. The final average results are in bold.
GroupsAcc (%)Sen (%)Spe (%)
194.8098.9588.33
298.2799.4797.22
388.7097.7691.58
493.2199.3296.32
591.7195.4186.19
691.8998.3582.86
790.8596.5981.43
890.3594.4787.00
998.8791.5391.67
1087.1395.4182.67
1194.3595.1883.33
1286.9685.1885.00
1386.0092.5981.00
1491.0495.5382.67
1593.0492.4786.67
Ave91.8195.2186.93
Acc: Accuracy, Sen: Sensitivity, Spe: Specificity.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Dong, Y.; Xu, L.; Zheng, J.; Wu, D.; Li, H.; Shao, Y.; Shi, G.; Fu, W. A Hybrid EEG-Based Stress State Classification Model Using Multi-Domain Transfer Entropy and PCANet. Brain Sci. 2024, 14, 595. https://doi.org/10.3390/brainsci14060595

AMA Style

Dong Y, Xu L, Zheng J, Wu D, Li H, Shao Y, Shi G, Fu W. A Hybrid EEG-Based Stress State Classification Model Using Multi-Domain Transfer Entropy and PCANet. Brain Sciences. 2024; 14(6):595. https://doi.org/10.3390/brainsci14060595

Chicago/Turabian Style

Dong, Yuefang, Lin Xu, Jian Zheng, Dandan Wu, Huanli Li, Yongcong Shao, Guohua Shi, and Weiwei Fu. 2024. "A Hybrid EEG-Based Stress State Classification Model Using Multi-Domain Transfer Entropy and PCANet" Brain Sciences 14, no. 6: 595. https://doi.org/10.3390/brainsci14060595

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop