1. Introduction
Brain–computer interface (BCI) technology has emerged as a transformative bridge between human cognition and external devices, offering promising solutions for applications ranging from assistive technologies to rehabilitation systems. A critical challenge in BCI development is the reliable detection and interpretation of cognitive states that can serve as robust control signals [1]. Among these states, attention and meditation have garnered particular interest due to their distinctive neurophysiological signatures and practical implications for BCI applications.
The importance of attention and meditation in BCI systems stems from several key factors. First, attention represents a fundamental cognitive mechanism that directly influences task performance, learning efficiency, and error prevention in human–machine interaction [2]. In BCI applications, attentional states can serve as natural control signals, as they can be voluntarily modulated by users and maintain stability over extended periods. Second, meditation states offer complementary advantages through their association with enhanced signal-to-noise ratios in EEG readings and reduced cognitive interference, potentially improving BCI reliability [3].
Current commercial BCI systems, such as those utilizing NeuroSky technology, employ proprietary algorithms to detect these cognitive states. However, these closed systems present several limitations: lack of transparency in signal processing, inability to customize detection parameters for specific applications, and restricted adaptation to individual user characteristics [4]. These constraints have spurred research interest in developing open, adaptable alternatives that can advance both scientific understanding and practical applications.
EEG has proven particularly valuable for studying attention and meditation due to its high temporal resolution and ability to capture rapid cognitive state transitions [5]. Recent advances in EEG signal processing have demonstrated distinct neural signatures associated with diverse levels of attention and meditative states, particularly in the prefrontal cortex regions [6,7]. These findings suggest the potential for developing more sophisticated detection algorithms that can leverage these neural patterns for enhanced BCI control.
The integration of attention and meditation detection in BCIs has significant practical implications. In rehabilitation settings, accurate detection of attention levels can help optimize therapy sessions and provide objective measures of patient engagement [8]. For assistive technologies, meditation states can serve as stable control signals, particularly beneficial for users with limited motor control. These applications demonstrate the practical value of improving cognitive state detection in BCI systems.
The emergence of advanced machine learning techniques, particularly recurrent neural networks (RNNs), offers new opportunities to address current limitations in cognitive state detection. Long Short-Term Memory (LSTM) and Gated Recurrent Unit (GRU) networks have demonstrated particular promise in capturing temporal dependencies in EEG signals, yet their application to attention and meditation detection remains relatively unexplored [8].
1.1. Hypothesis and Contributions
The central hypothesis of this study is that it is feasible to predict attention and meditation values derived from EEG signals using neural networks. This hypothesis is founded on the premise that the temporal and non-linear characteristics of EEG signals can be effectively captured and modeled by advanced neural architectures. Specifically, this study explores the applicability of these predictive models in accurately estimating the cognitive states of attention and meditation, which are essential for various human–computer interaction applications.
By addressing this hypothesis, the research aims to contribute to the growing body of knowledge on EEG signal processing and its integration with machine learning techniques. The outcomes of this investigation have significant implications for developing real-time applications in neurofeedback, cognitive training, and brain–computer interface systems, offering a pathway for improved user experiences and technological advancements in the field.
This research makes specific contributions to the field:
Development of LSTM and GRU architectures specifically optimized for real-time detection of attention and meditation states from raw EEG signals.
Empirical validation of these models’ performance compared to existing proprietary solutions, with detailed analysis of accuracy, latency, and robustness.
Introduction of a new methodology for processing raw EEG data that enables greater customization and adaptation of BCI systems.
Demonstration of practical applications through case studies in assistive technology and rehabilitation contexts.
Our approach addresses several critical limitations in current BCI systems. By working directly with raw EEG signals rather than preprocessed data, we enable greater transparency and customization possibilities. The use of advanced RNN architectures allows for better capture of temporal dynamics in cognitive-state transitions, potentially improving detection accuracy. Furthermore, our models’ ability to operate in real-time makes them suitable for practical BCI applications.
This research not only advances our understanding of cognitive-state detection in BCI systems but also provides practical tools for improving human–machine interaction in critical applications. The combination of advanced machine learning techniques with raw EEG signal processing represents a significant step toward more adaptable and effective BCI systems.
1.2. Paper Structure
The paper is structured as follows:
Section 2 reviews related works in EEG signal processing and cognitive-state prediction.
Section 3 details the materials and methods, including experimental setup, data acquisition protocols, and the architecture of our LSTM and GRU models.
Section 4 presents results and validation metrics, while
Section 5 discusses findings in relation to the existing literature. Finally,
Section 6 concludes with key contributions and future research directions.
2. Related Works
Neuroscience has experienced a boom in recent decades, especially in exploring the relationship between brain activity and cognitive states such as attention and meditation. EEG has established itself as an essential tool for capturing and analyzing the brain’s electrical activity in real time. As technology advances, researchers have begun to decipher the brainwave patterns associated with sustained attention and meditative states, opening new possibilities for understanding the human mind. These advances not only offer insights into the fundamental nature of consciousness but also have the potential to influence practical applications, from improving cognitive performance to treating neurological disorders.
This section provides a detailed review of EEG-based attention and meditation prediction, incorporating the most recent publications.
In the past decade, the field of EEG has experienced significant advancements, revolutionizing our understanding of brain activity and its applications in various areas of neuroscience. Chaddad et al. (2023) presented a comprehensive review of EEG signal processing methods and techniques, encompassing everything from acquisition to classification and application [2,9]. This review highlights the inherent complexity of EEG signals and underscores the critical need to develop advanced preprocessing and feature extraction methods for their effective analysis. The complexity of these non-invasive signals has spurred researchers to propose innovative approaches to unravel the wealth of information contained in patterns of electrical brain activity. Concurrently, Posner (2023) examined the evolution of attention networks, proposing an integrative approach that combines human and animal studies to address unresolved problems in this field [3]. His work emphasizes the fundamental importance of attention networks in integrating cognitive and neural studies, laying the groundwork for significant advances in cognitive neuroscience. This integrative perspective promises to unveil the mechanisms underlying complex attentional processes and their relationship to other higher cognitive functions.
The integration of emerging technologies with traditional EEG techniques has opened new avenues of research, expanding our understanding of brain processes in more natural and ecologically valid contexts. An innovative 2019 study explored the connections between creative behavior, flow state, and brain activity through the integration of EEG and virtual reality [4]. This research revealed significant correlations between individual creativity levels, flow state, and the quality of creative output, providing valuable insights into the neural substrates of creativity and focused attention. In the realm of meditation, several studies have utilized EEG to investigate the effects of different techniques on brain activity and cognitive performance. A 2022 retrospective analysis compared “internal” versus “external” meditation techniques, shedding light on the relative efficacy of different meditative approaches [5]. Complementarily, a 2020 longitudinal study provided direct evidence of the effectiveness of Focused Attention Meditation (FAM) training in modulating brain activity and improving cognitive performance [6], underscoring the potential of meditative practices in optimizing brain functions.
The convergence of EEG with other emerging technologies has significantly broadened the horizon of neuroscientific research. An innovative 2021 project combined EEG with a brainwave lamp to study real-time attention, meditation, and fatigue values [10], opening new possibilities for monitoring and modulating mental states in various contexts. This multidisciplinary approach not only allows for a more holistic assessment of cognitive and emotional states but also offers promising perspectives for applications in areas such as mental health and cognitive performance. Furthermore, a pioneering 2021 study revealed a significant reorganization of brain network connectivity following intensive meditation training [11]. This research identified changes in key areas such as the right insula, superior temporal gyrus, inferior parietal lobe, and bilateral superior frontal gyrus, providing neurobiological evidence of the long-term effects of meditative practice on the brain’s functional architecture. These collective advances not only demonstrate the immense potential of EEG in understanding brain processes but also lay the foundation for revolutionary applications in various fields of neuroscience, biomedical engineering, and personalized medicine, promising to transform our understanding of the human brain and its functioning in states of health and disease.
These publications provide an in-depth and up-to-date overview of research and advances in the field of EEG-based attention and meditation prediction. The combination of advanced signal-processing techniques, together with innovative approaches to measuring and analyzing attention and meditation, is leading to significant discoveries that may have practical applications in areas such as mental health, education, and general well-being.
One of the primary limitations identified in the current literature is the widespread dependence on NeuroSky’s proprietary algorithm for interpreting EEG signals. This algorithm, designed to determine values such as attention and meditation, has been widely used in numerous studies. For instance, the research conducted by Rușanu et al. (2023) [8], which developed a LabVIEW instrument for brain–computer interface research using the NeuroSky MindWave Mobile headset, does not specify whether it relied on NeuroSky’s algorithm for determining certain values. This dependence on a proprietary algorithm raises questions about the reproducibility and comparability of results across different studies, as well as the flexibility in interpreting EEG data for specific applications.
Another significant limitation of the NeuroSky/Brainlink headband lies in its precision and resolution compared to medical-grade or laboratory EEG systems. As a low-cost device designed for the consumer market, the NeuroSky headband may not offer the same level of fidelity in signal acquisition as more expensive professional equipment. This discrepancy in data quality can have important implications for research, especially in studies that require high precision in measuring brain activity. The limitation in spatial resolution, due to the reduced number of electrodes, also restricts the ability to accurately localize sources of neural activity, which can be crucial in certain cognitive and clinical neuroscience applications.
A significant gap in the current literature is the scarcity of research specifically focusing on the use of raw signals from the NeuroSky headband to determine mental states such as attention and meditation [12]. Many studies rely on NeuroSky’s algorithm-processed data, limiting the exploration of raw EEG signals’ full potential. This research addresses this gap by using RNNs, specifically LSTM and GRU models, to analyze raw EEG data. These architectures are ideal for time series like EEG signals, capturing complex patterns and long-term dependencies [13].
By bypassing the proprietary algorithm, this approach enhances flexibility in data interpretation, uncovering patterns and mental states that NeuroSky’s algorithm might overlook. Analyzing raw data also enables the development of personalized models for attention and meditation, tailored to specific applications.
LSTM and GRU networks are particularly effective in handling EEG’s sequential nature. LSTMs retain relevant information over time, while GRUs efficiently update internal states, making them well-suited to detect subtle brain activity patterns linked to cognitive states.
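To make the gating intuition above concrete, the following NumPy sketch implements a single GRU step. It is purely illustrative (the weight layout and names are our own), not the configuration trained later in this study:

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def gru_step(x, h, W, U, b):
    """One GRU update. The update gate z decides how much of the previous
    hidden state h to keep; the reset gate r controls how much past state
    feeds into the new candidate h_cand."""
    z = sigmoid(W["z"] @ x + U["z"] @ h + b["z"])      # update gate
    r = sigmoid(W["r"] @ x + U["r"] @ h + b["r"])      # reset gate
    h_cand = np.tanh(W["h"] @ x + U["h"] @ (r * h) + b["h"])
    return (1.0 - z) * h + z * h_cand                  # interpolated state

# Toy dimensions: 8 band-power inputs, hidden size 4 (illustrative only).
rng = np.random.default_rng(0)
W = {k: rng.normal(size=(4, 8)) * 0.1 for k in "zrh"}
U = {k: rng.normal(size=(4, 4)) * 0.1 for k in "zrh"}
b = {k: np.zeros(4) for k in "zrh"}
h = np.zeros(4)
for x in rng.normal(size=(10, 8)):     # run over a short input sequence
    h = gru_step(x, h, W, U, b)
```

Because the new state is a gated interpolation between the old state and the candidate, the network can carry information across many time steps, which is what makes these architectures suitable for EEG sequences.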
Additionally, deep learning techniques like RNNs can identify new features and relationships in EEG data, offering insights into brain signals and cognitive states [14]. This could reveal biomarkers for neurological or psychological conditions while improving result interpretability compared to NeuroSky’s opaque “black box” algorithm.
Despite the hardware limitations of devices like the NeuroSky headband, advanced signal processing and RNN-based models improve the functional resolution of data, enabling more precise brain-activity inferences. This enhances the headband’s utility and broadens its application to areas like cognitive neuroscience, clinical psychology, and advanced brain–computer interfaces.
This innovative approach not only addresses the current limitations of the NeuroSky headband but also paves the way for more sophisticated and nuanced analyses of EEG data in general. By leveraging the power of deep learning and working directly with raw signals, researchers can potentially uncover subtle patterns and relationships in brain activity that were previously inaccessible. This could lead to breakthroughs in our understanding of cognitive processes, emotions, and various neurological conditions.
Furthermore, the development of custom RNN-based models for EEG analysis could have far-reaching implications beyond the specific context of the NeuroSky headband. The methodologies and insights gained from this research could be applied to other EEG devices and even to more complex multi-channel EEG systems, potentially revolutionizing the field of brain signal analysis.
In conclusion, while the NeuroSky headband has already made significant contributions to democratizing EEG research, the proposed approach of using RNNs to analyze raw signals represents a crucial next step in unlocking its full potential. Although our system is trained with the results of the headset’s own algorithm, the key contribution is in the ability to predict future states of attention and meditation. This extends the functionality of BCI systems, allowing them to anticipate user needs and improve interaction with external devices [14]. This predictive modeling based on recurrent neural networks opens new pathways for real-time applications, such as BCI-controlled robotic arms or wheelchair systems, where immediate response to cognitive states is crucial to ensure user safety above all.
The summary of related works is shown in Table 1.
5. Methodology
The process to record the dataset, analyze, and organize the information, and train the neural models is divided into the following steps:
Experimental setup: The dataset is recorded following a standardized procedure.
Feature sets: Considering the information captured in the dataset, different feature sets are identified.
Data preprocessing: Data are converted into a structure suitable for a time series, as needed to train the networks.
Training: Data are separated into training and validation sets. RandomSearch is used to find the best hyperparameters.
Cross-validation: Cross-validation is used to verify that results are consistent regardless of which subsets of the dataset are considered.
5.1. Experimental Setup
The study employed a structured data collection approach spanning 6 months (June 2023–December 2023). Data were collected from 5 participants (3 male, 2 female, age range of 21–60 years) using both NeuroSky and Brainlink headsets.
Participants were selected based on the following:
- No history of neurological disorders,
- Normal or corrected-to-normal vision,
- No prior experience with BCI devices.
Recording sessions:
- Two 30-min recording sessions separated by one week,
- Controlled environment settings (22 °C ± 1 °C, 45 dB ambient noise),
- Tasks included free-cognitive-state periods (≥10 min).
Each subject performed the experiments in two sessions separated by several days to ensure reproducibility of the results [26]. Subsequently, the signals from the different subjects were also combined into a continuous dataset in order to achieve a sufficient volume of information to support the training of the LSTM and GRU networks.
Participants ranged in age from 21 to 60 years, with an effort to maintain gender parity. The participants’ data are anonymized: each trial is recorded under a consecutive trial number that does not allow the data to be identified with or associated with the participant. It should be noted that, in male adults close to 60 years of age, data capture was in some cases unfeasible, as no data could be obtained from the prefrontal region; this indicates that non-invasive dry electrodes may be a barrier to consistent data capture in this group.
Experiments were conducted in a controlled environment to minimize external distractions. A NeuroSky device and a Brainlink device were used to record EEG signals, as both devices have the same TGAM-based technology.
During the experiments, participants were asked to act naturally, trying to voluntarily maintain high levels of attention and concentration, according to the real-time values that could be seen in the data-capture application.
The data acquisition process was carefully designed to ensure both the authenticity of the collected signals and the comfort of the participants. To closely replicate real-world conditions, participants were given the freedom to engage in any activity of their choice during the sessions, such as watching films, chatting, reading, or simply relaxing. This approach aimed to capture a diverse range of natural cognitive states while minimizing action bias that could otherwise influence the data and limit their generalizability.
The duration of each session was set between 15 and 30 min, providing an optimal balance between data quantity and participant comfort. Given that the EEG headset records one block of data per second, this setup resulted in a minimum of 900 and up to 1800 data points per session, with each block containing 11 signal values. This design ensured that the dataset was both extensive and reflective of natural behavioral conditions, supporting the development of models robust enough to handle dynamic real-life environments.
5.2. Feature Sets
The dataset captured during the experiments contains the following columns:
Timestamp: Timestamp of the capture.
Attention: Attention value.
Meditation: Meditation value.
Delta, Theta, low Alpha, high Alpha, low Beta, high Beta, low Gamma, and high Gamma: Values of the brain signals.
Signal: This column indicates the quality of the signal. In general, a value of 0 indicates a good signal quality, while higher values indicate a poor signal quality or no signal.
NeuroSky’s patented algorithm uses the Beta signals to compute attention, and the Alpha and Theta signals to compute meditation. Thus, we created two different feature sets. In the complete set, all signals are simultaneously used as inputs to predict the attention and meditation levels. In the partial feature set, only the signals used by the proprietary algorithm are used.
Table 3 shows the relation between input and output signals in each feature set.
Figure 8 shows the attention and meditation data, together with the brain signals, during one of the experiments. Some signals, such as Delta, show much higher values compared to other signals. Attention and meditation signals show fluctuations over time, indicating changes in the levels of attention and meditation.
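As a sketch, the two feature sets can be expressed as column selections over the captured dataset. The column names and helper below are illustrative (ours, not part of the capture software) and would need to match the actual capture files:

```python
import pandas as pd

# Band columns as described in Section 5.2 (names are illustrative).
ALL_BANDS = ["Delta", "Theta", "lowAlpha", "highAlpha",
             "lowBeta", "highBeta", "lowGamma", "highGamma"]

# Complete feature set: every band is an input for both targets.
COMPLETE = {"Attention": ALL_BANDS, "Meditation": ALL_BANDS}

# Partial feature set: only the bands NeuroSky's algorithm reportedly
# uses (Beta for attention; Alpha and Theta for meditation).
PARTIAL = {
    "Attention": ["lowBeta", "highBeta"],
    "Meditation": ["Theta", "lowAlpha", "highAlpha"],
}

def build_xy(df, target, feature_map):
    """Return (X, y) for one target, keeping only good-quality blocks
    (Signal == 0, per the quality column described above)."""
    good = df[df["Signal"] == 0]
    return good[feature_map[target]].to_numpy(), good[target].to_numpy()
```

Keeping the mapping explicit makes it easy to switch between the complete and partial strategies when comparing models.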
5.3. Data Preprocessing
The EEG data underwent a series of carefully designed preprocessing steps to prepare them for training the LSTM and GRU networks. Initially, the raw signals were assessed for quality using the headset’s internal metrics, and any segments with poor signal quality were excluded to minimize the impact of noise or artefacts. The remaining data were then normalized to a standard range from 0 to 1, ensuring consistent scaling and facilitating stable model training. To capture the temporal dependencies inherent in EEG signals, the data were organized into sliding look-back windows, where a fixed number of prior time steps (tested with sizes of 3, 5, 7, and 15) were used as input for predicting subsequent values.
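The normalization and windowing steps can be sketched as follows; this is a minimal illustration of the procedure described above (the function names are ours), assuming a time-major array of samples:

```python
import numpy as np

def minmax_scale(x):
    """Scale values to the [0, 1] range, as described in Section 5.3."""
    lo, hi = x.min(axis=0), x.max(axis=0)
    return (x - lo) / np.where(hi - lo == 0, 1.0, hi - lo)

def make_windows(series, look_back):
    """Build sliding look-back windows: X[t] holds the previous look_back
    time steps and y[t] the value to predict at step t."""
    X = np.stack([series[t - look_back:t]
                  for t in range(look_back, len(series))])
    y = series[look_back:]
    return X, y

# Example with a synthetic 1-D signal and look-back 5 (one of the tested
# sizes: 3, 5, 7, 15).
signal = minmax_scale(np.linspace(0.0, 100.0, 120))
X, y = make_windows(signal, look_back=5)
# X has shape (115, 5); each row holds the 5 steps preceding its target in y.
```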
5.4. Model Training
The dataset was split into training (65%) and testing (35%) subsets, allowing for robust model evaluation and generalizability testing.
We used RandomSearch to determine the best hyperparameters and architecture. As part of the RandomSearch process, several hyperparameters were systematically varied to identify the optimal configuration for the LSTM and GRU models.
Table 4 provides a comprehensive summary of the hyperparameters explored, including their respective ranges and the best values determined through the experiments. This optimization process was crucial for enhancing model performance and ensuring robust predictions.
These hyperparameters were optimized separately for attention and meditation datasets, with consistent performance improvements observed for both.
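The search loop itself is simple. The following framework-agnostic sketch shows the idea; the search space below is illustrative rather than the exact ranges of Table 4, and the toy objective stands in for training an LSTM/GRU and returning its validation RMSE:

```python
import random

# Illustrative search space; the real ranges are those listed in Table 4.
SPACE = {
    "units":         [32, 64, 96, 128],
    "dropout":       [0.0, 0.1, 0.2, 0.3],
    "learning_rate": [1e-2, 1e-3, 1e-4],
    "look_back":     [3, 5, 7, 15],
}

def random_search(evaluate, n_trials=20, seed=0):
    """Draw n_trials random configurations and keep the one with the
    lowest score returned by evaluate(config)."""
    rng = random.Random(seed)
    best_cfg, best_score = None, float("inf")
    for _ in range(n_trials):
        cfg = {k: rng.choice(v) for k, v in SPACE.items()}
        score = evaluate(cfg)   # e.g. validation RMSE of a trained model
        if score < best_score:
            best_cfg, best_score = cfg, score
    return best_cfg, best_score

# Toy objective standing in for "train a model and return its RMSE".
toy_rmse = lambda c: (abs(c["units"] - 64) / 32 + c["dropout"]
                      + abs(c["look_back"] - 7))
best, score = random_search(toy_rmse, n_trials=50)
```

In practice, libraries such as Keras Tuner implement this same loop, training one model per sampled configuration.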
5.5. Cross-Validation
Cross-validation is a vital step in machine learning to ensure that a model performs reliably and is not overly tailored to a specific dataset. This approach ensures that the model is evaluated on different subsets of data, improving its reliability and reducing the bias that might occur if a single train–test split was used. By dividing the data into training and testing subsets, it helps validate the model’s ability to generalize, providing confidence that it will work effectively in real-world scenarios.
To evaluate the performance and generalizability of the LSTM and GRU models, we employed a k-fold cross-validation approach with k = 5, using the values of the hyperparameters obtained in the previous RandomSearch process. This methodology ensures robust performance evaluation while minimizing the risk of overfitting. The process is described as follows:
Dataset partitioning:
The dataset was randomly shuffled and divided into 5 equally sized folds.
At each iteration, 1 fold was used as the test set, while the remaining 4 folds were combined to form the training set.
Training and validation:
The models were trained on the training set and evaluated on the test fold. This process was repeated 5 times, with each fold serving as the test set once.
For each fold, we recorded metrics such as RMSE, MSE, and MAE to measure prediction accuracy.
Performance aggregation:
After completing the 5 iterations, the evaluation metrics were averaged across all folds to obtain a reliable estimate of the model’s performance.
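The partitioning scheme above can be sketched directly with NumPy; the function and parameter names here are ours, for illustration:

```python
import numpy as np

def kfold_indices(n_samples, k=5, seed=42):
    """Shuffle sample indices and yield (train_idx, test_idx) pairs so
    that each of the k folds serves as the test set exactly once."""
    rng = np.random.default_rng(seed)
    idx = rng.permutation(n_samples)
    folds = np.array_split(idx, k)
    for i in range(k):
        test = folds[i]
        train = np.concatenate([folds[j] for j in range(k) if j != i])
        yield train, test

# With k = 5, every sample lands in a test fold exactly once, so averaging
# the per-fold metrics covers the whole dataset.
splits = list(kfold_indices(100, k=5))
```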
5.6. Performance Evaluation Metrics
To evaluate the performance of the LSTM and GRU models in calculating attentional and meditative states, metrics such as RMSE, MSE, MAE, and SMAPE were used. These metrics are essential to determine the accuracy and reliability of the models in predicting cognitive states from EEG signals. The choice of these metrics is based on previous studies that have demonstrated their effectiveness in evaluating deep learning models in EEG-based prediction tasks [27].
In the context of interpreting the performance of a neural network, the choice of the appropriate metric depends on the specific problem and the characteristics of the data. The mentioned metrics (MAE, MSE, RMSE, and SMAPE) have different properties and are applied in different situations. A detailed and well-argued justification for each is provided below:
1. Mean Absolute Error (MAE)
Definition: The MAE is the mean of the absolute values of the errors between predictions and actual values [28]:
$\mathrm{MAE} = \frac{1}{n}\sum_{i=1}^{n}\left|y_i - \hat{y}_i\right|$
where
n is the number of observations,
yi is the actual value,
ŷi is the predicted value.
Advantages:
- It is easy to interpret, as it represents the average error in the same units as the data.
- It is robust to outliers, as it does not penalize large errors as much as the MSE.
Disadvantages:
- It is not differentiable at all points, thus potentially complicating its use in some optimization algorithms.
2. Mean Squared Error (MSE)
Definition: The MSE is the mean of the squared errors between predictions and actual values [28]:
$\mathrm{MSE} = \frac{1}{n}\sum_{i=1}^{n}\left(y_i - \hat{y}_i\right)^2$
where
n is the number of observations,
yi is the actual value,
ŷi is the predicted value.
Advantages:
- It penalizes large errors more heavily, which can be useful if you want to avoid large deviations.
- It is always differentiable, which facilitates its use in neural network optimization.
Disadvantages:
- It is more sensitive to outliers, as large errors have a quadratic impact on the metric.
3. Root Mean Squared Error (RMSE)
Definition: The RMSE is the square root of the MSE [28]:
$\mathrm{RMSE} = \sqrt{\frac{1}{n}\sum_{i=1}^{n}\left(y_i - \hat{y}_i\right)^2}$
where
n is the number of observations,
yi is the actual value,
ŷi is the predicted value.
Advantages:
- Similar to the MSE in terms of penalizing large errors, but returns errors in the same units as the original data, which can be more intuitive.
- Useful when a metric is needed that reflects the magnitude of errors more directly than the MSE.
Disadvantages:
- Shares the same sensitivity to outliers as the MSE.
4. Symmetric Mean Absolute Percentage Error (SMAPE)
Definition: SMAPE is a percentage error metric that is symmetric: it treats overestimation and underestimation errors equally [29,30]:
$\mathrm{SMAPE} = \frac{100\%}{n}\sum_{t=1}^{n}\frac{\left|F_t - A_t\right|}{\left(\left|A_t\right| + \left|F_t\right|\right)/2}$
where
At is the actual value,
Ft is the forecast value,
n is the total number of observations.
Advantages:
- It provides a relative measure of error, which can be useful when comparing errors on different scales.
- It is symmetrical, which makes it suitable for cases where relative errors are to be treated equally.
Disadvantages:
- It can be unstable when actual values or predictions are close to zero, due to division by near-zero denominators.
Final recommendation:
The choice of the most recommendable metric depends on the specific context:
MAE is recommended when an easy-to-interpret metric is needed and the impact of outliers is to be minimized.
MSE and RMSE are useful when you want to penalize larger errors more. RMSE is especially recommended if you need a metric in the same units as the data.
SMAPE is preferable when a relative and symmetric metric is needed, especially in problems where the data may vary in magnitude.
In general, for most neural network regression problems, RMSE is usually the most recommended metric because of its balance between penalizing large errors and easy interpretability in the units of the original data. However, the final selection should consider the specific characteristics of the problem and the objectives of the analysis.
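For reference, the four metrics discussed above can be computed in a few lines of NumPy. This sketch follows the standard definitions, with a small epsilon (our addition) guarding SMAPE's near-zero denominators:

```python
import numpy as np

def mae(y, yhat):
    return float(np.mean(np.abs(y - yhat)))

def mse(y, yhat):
    return float(np.mean((y - yhat) ** 2))

def rmse(y, yhat):
    return float(np.sqrt(mse(y, yhat)))

def smape(y, yhat, eps=1e-8):
    """Symmetric MAPE in percent; eps guards the near-zero denominators
    that make SMAPE unstable."""
    denom = (np.abs(y) + np.abs(yhat)) / 2.0
    return float(100.0 * np.mean(np.abs(y - yhat) / np.maximum(denom, eps)))

# Toy attention values on the 0-100 scale used by the headset.
y_true = np.array([50.0, 60.0, 70.0])
y_pred = np.array([48.0, 63.0, 69.0])
# mae -> 2.0, mse ~ 4.667, rmse ~ 2.16
```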
6. Results
To assess the accuracy and efficacy of these models, performance metrics were selected, as well as cross-validation techniques to ensure the robustness of the models. This comparison methodology is essential to discern the relative strengths and weaknesses of LSTM and GRU networks in the task of prediction from EEG data, thus enabling a comprehensive assessment of their applicability in neurofeedback and BCI contexts [31]. The evaluation metrics RMSE, MSE, MAE, and SMAPE validate the results, in line with the previous literature on deep learning model evaluation methodologies [32].
For the computational process and the calculation of values and metrics with the RandomSearch method, Google Sandbox and Google Colab were used, which significantly reduced running times after selecting the GPU configuration needed to optimize the execution environment. Python 3.10.11 (64-bit) was used as the programming language.
6.1. LSTM Performance
The complete LSTM model validation process, together with the metric values and the optimal hyperparameters for those metrics, can be observed in Figure 9 and Figure 10.
The first analysis performed was the calculation of attention and meditation using the same calculation scheme followed by NeuroSky and Brainlink, segmenting the neural signals and discarding the Delta signal value. The first comparison process was performed for the attention and meditation values using an LSTM network and RandomSearch for the determination of the hyperparameters, as shown in Table 5 for the attention values and Table 6 for the meditation values.
The same process was then repeated while analyzing 100% of the neural-signal values obtained from the headband, without replicating the procedure followed by NeuroSky; the results are shown in Table 7 and Table 8.
6.2. GRU Performance
The same process was performed, but using a GRU network and the RandomSearch calculation structure, as shown in Table 9 for the prediction of attention and Table 10 for the value of meditation.
In the last two tables, repeated look-back values can be seen because, during testing, the values obtained, especially when defining the hyperparameters, were far from what was expected or showed greater dispersion than allowed.
As with the previous model, we performed the prediction with the GRU architecture while maintaining the test conditions, meaning that we maintained the analysis on 100% of the neural signals, with the results shown in Table 11 for the attention value and Table 12 for the meditation value.
6.3. Model Comparison
To compare the prediction performance between the LSTM and GRU networks, we focused on the RMSE metric as the main evaluation metric. The reason for this choice is that RMSE provides a direct measure of error in the same units as the original data, making it easier to interpret. In addition, RMSE penalizes larger errors more heavily, which is crucial in the context of time-series forecasting, where significant errors can affect the practical utility of the model. The comparison will provide information on the strengths and weaknesses of each model in predicting attention and meditation from raw EEG signals [33].
In the process of comparing the above data, the following results can be extracted for the two model architectures and prediction strategies, based on the data obtained in Table 13, Table 14, Table 15, and Table 16.
These values resulting from the RandomSearch calculation are reflected in the following graphs, as shown in
Figure 11 for the attention values and in
Figure 12 for the meditation values.
As a summary, the result of the best prediction based on the RMSE metric is shown in
Table 17, which compares not only the performance of the LSTM and GRU networks but also the size of the time window (look-back), and shows which of the prediction strategies is more promising for our research.
From the above data, we can extract the optimal value obtained, as well as the architecture and the prediction calculation model, as can be seen in
Table 18 for the attention value and
Table 19 for the meditation value.
Continuing the analysis, and with the aim of guaranteeing the prediction process, a new test was carried out on the look-back values, evaluating the values immediately before and after the selected one to determine conclusively the optimal size of the time window for each model. This test followed the same procedure and compared the two RNN architectures since, as can be seen in
Table 15 and
Table 16, in both cases the prediction is more favorable with the model that does not use the calculation structure defined by NeuroSky in the eSense algorithm.
To compare the GRU and LSTM architectures for predicting attention and meditation states from EEG signals, a temporal five-fold cross-validation was implemented to evaluate their performance and stability. The value k = 5 was selected because it produced good results in previous works [
34,
35]. Key evaluation metrics included MAE, MSE, RMSE, and SMAPE, each accompanied by standard deviations to assess consistency across validation folds. The results revealed distinct patterns in the behavior of these architectures, offering valuable insights into their suitability for predicting mental states.
Table 20 and
Table 21 below compare the results obtained from cross-validation; in each case, the best hyperparameter configuration was used.
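A temporal k-fold scheme of the kind described above can be sketched as follows. The expanding-window fold layout, the SMAPE variant, and the naive baseline model are assumptions used only to make the pattern concrete; they are not the paper's exact implementation.

```python
import numpy as np

def temporal_kfold_splits(n_samples: int, k: int = 5):
    """Expanding-window splits: fold i trains on all data before its
    validation block, so temporal order is never violated."""
    fold = n_samples // (k + 1)
    for i in range(1, k + 1):
        train_idx = np.arange(0, i * fold)
        val_idx = np.arange(i * fold, (i + 1) * fold)
        yield train_idx, val_idx

def smape(y, p):
    """Symmetric MAPE in percent (one common variant)."""
    return 100.0 * np.mean(2.0 * np.abs(p - y) / (np.abs(y) + np.abs(p)))

# Report mean and standard deviation of a metric across the five folds.
rng = np.random.default_rng(0)
signal = rng.uniform(0, 100, 600)            # toy per-second values
fold_rmse = []
for tr, va in temporal_kfold_splits(len(signal), k=5):
    pred = np.full(len(va), signal[tr].mean())  # naive baseline "model"
    fold_rmse.append(np.sqrt(np.mean((signal[va] - pred) ** 2)))
print(f"RMSE {np.mean(fold_rmse):.2f} +/- {np.std(fold_rmse):.2f}")
```

Reporting each metric together with its standard deviation across folds, as in the tables, is what allows the stability of the two architectures to be compared rather than just their average error.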
Based on the values obtained and shown in the tables above, we can state that in the case of attention-state prediction, GRU outperformed LSTM across all metrics, with a notably lower MAE compared to LSTM’s. GRU also demonstrated superior stability, as reflected in lower standard deviations, particularly for MSE. However, the difference in SMAPE values between GRU and LSTM was marginal, indicating similar performance in terms of normalized percentage error. This suggests that while GRU is more robust and reliable for attention prediction, both models are comparable when interpretability of normalized errors is prioritized in practical applications.
For meditation-state prediction, the performance of GRU and LSTM was strikingly similar, with almost identical MAE values and parity across all metrics. Both architectures showed greater stability in meditation predictions compared to attention, as evidenced by significantly lower standard deviations. Notably, the SMAPE for meditation was considerably lower, suggesting that meditation states exhibit more consistent and predictable patterns in EEG signals. These findings highlight the distinct characteristics of mental states and their computational modeling potential, offering practical guidance for architecture selection and avenues for further research in deep learning applications for EEG-based mental-state prediction.
To ensure the consistency of the above results, the look-back (LB) window values immediately before and after the selected value were analyzed, since the earlier sweep advanced in steps of two windows and therefore left intermediate values unanalyzed.
The results of the comparison with the previous and subsequent values are shown in
Table 22 and
Table 23 below.
With these data, we can confirm that the selected time-window values are consistent with those calculated previously. These data are reflected in
Figure 13, corresponding to the attention and meditation value.
This verification allows us to specify the metrics and the time-window values that yield the best prediction of attention and meditation; the final results correspond to
Table 24 and
Table 25, which confirm the initial values of the first RandomSearch test.
Thus, we can conclude that the meditation value is best predicted with an LSTM-type RNN and the attention value with a GRU-type RNN.
6.4. Real-Time Deployment and Analysis of Inference Time
With these results, the next step is to measure the networks' inference times when computing the attention and meditation values in a real-time analysis. This measurement is motivated by a limitation of the reading process of the NeuroSky and Brainlink headsets, which supply a block of raw data (Delta, Alpha, Theta, …) once per second, implicitly bounding the time available for inference.
As can be seen in the graphs in
Figure 14, the inference times in the calculation of the attention and meditation values are substantially less than one second, with average values around 50 milliseconds.
The inference times of attention were calculated with a GRU, with LB = 5. In the case of meditation, it was performed with LSTM, with LB = 7. These architectures were used because they provided the best performances, according to
Section 6.3.
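A timing loop of the kind used for this check can be sketched as follows. The `predict` function here is a hypothetical stand-in for the trained GRU/LSTM single-window inference, so the snippet stays self-contained; with a real model the same pattern applies.

```python
import time
import numpy as np

def predict(window: np.ndarray) -> float:
    """Stand-in for the trained network's single-window inference."""
    return float(window.mean())  # dummy computation

window = np.random.rand(5)       # LB = 5, as used for attention
times_ms = []
for _ in range(200):
    t0 = time.perf_counter()
    predict(window)
    times_ms.append((time.perf_counter() - t0) * 1e3)

mean_ms = sum(times_ms) / len(times_ms)
# The headset delivers a raw-data block once per second, so inference
# must finish well inside that 1000 ms budget.
print(f"mean inference time: {mean_ms:.3f} ms")
```

Using `time.perf_counter` and averaging over many repetitions smooths out scheduler jitter, which matters when the quantity of interest is tens of milliseconds.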
To complete the real-time analysis, additional EEG data were collected from a new subject, allowing us to independently validate the previous experimental setup. This approach ensures that the model is evaluated against entirely unseen data that were not included in the training, validation, or cross-validation processes.
The real-time testing was conducted using the GRU network, following insights from the cross-validation results, which demonstrated a slight advantage of this architecture over LSTM in predictive performance.
Using this newly acquired dataset, we proceeded with real-time testing of the GRU-based neural network, incorporating the optimized hyperparameters. The results obtained from this evaluation are presented in
Table 26, and
Figure 15 and
Figure 16.
These results are similar to those presented in
Section 6.3. This additional testing further strengthens the validation of our model in a real-world setting.
7. Discussion
In our study, LSTM and GRU models were used to predict attention and meditation levels from raw EEG data. The results show that both models are able to make predictions with relatively low errors, as indicated by the MAE, MSE, and RMSE metrics.
Comparison with the literature:
“EEG-Based Age and Gender Prediction Using Deep BLSTM-LSTM Network Model” (2019) [
36]: This study demonstrates the effectiveness of LSTM architectures in classifying EEG data, albeit in a different context (age and gender). The high accuracy obtained in that study suggests that LSTMs are suitable for capturing complex temporal features of EEG signals, which is consistent with our finding that LSTMs can successfully predict attentional and meditative states.
“Application of Artificial Intelligence Techniques for Brain–Computer Interface in Mental Fatigue Detection: A Systematic Review (2011–2022)” (2023) [
37]: Although this study focuses on mental-fatigue detection, the systematic review of AI techniques applied to BCI supports the idea that deep learning models are powerful tools for interpreting EEG signals. This reinforces the validity of the study’s approach using LSTM and GRU to predict cognitive states.
“EEG-based Biometric Authentication Using Machine Learning: A Comprehensive Survey” (2022) [
38]: This study provides an overview of machine learning techniques applied to EEG-based biometric authentication. Although the goal is different, the effectiveness of machine learning techniques in classifying EEG signals bodes well for their application in attention and meditation prediction.
In summary, the results obtained are in line with the existing literature regarding the applicability and effectiveness of RNNs, specifically LSTMs and GRUs, for analyzing and predicting cognitive states from EEG signals. The comparison of different architectures and the optimization of hyperparameters in our study provide a valuable contribution to the field of BCI research, demonstrating that, with the right setup, these models can be tuned to improve accuracy in predicting complex mental states. The integration of bioelectric signal acquisition systems with artificial intelligence techniques, as demonstrated in recent work by Laganà et al. (2024), offers promising opportunities for enhancing signal interpretation and clinical diagnosis through the combination of robust hardware design and advanced computational analysis methods [
39]. This synergistic approach can lead to more accurate and reliable diagnostic tools in neurological assessment.
7.1. Analysis of the Strengths and Weaknesses of LSTM and GRU Networks for the Prediction of Attention and Meditation
LSTM and GRU networks are variants of recurrent neural networks that have been widely used to process sequences of data such as EEG signals. Both architectures are designed to capture long-term temporal dependencies, making them suitable for time-series prediction tasks such as predicting attention and meditation from EEG signals. However, each has its own strengths and weaknesses in this context.
Strengths of LSTM:
Memory capacity: LSTMs are designed to avoid the vanishing gradient problem, which allows them to learn long-term dependencies. This is crucial when working with EEG signals, which may contain patterns relevant to attention and meditation over long periods of time.
Accuracy: Studies have shown that LSTMs can be very accurate in classification and prediction tasks, as reflected in the study “EEG-Based Age and Gender Prediction Using Deep BLSTM-LSTM Network Model” (2019), suggesting that they can be equally effective in predicting attention and meditation [
36].
Weaknesses of LSTMs:
Complexity and computational cost: LSTMs have a more complex structure than GRUs, possibly leading to higher computational cost and longer training times, especially on large datasets.
Risk of overfitting: Given their complexity, LSTMs can be prone to overfitting, especially when insufficient training data are available.
Strengths of GRU:
Efficiency: GRUs have a simpler structure than LSTMs, as they combine forgetting and updating gates. This can result in faster training and higher computational efficiency, as suggested in the systematic review “Application of Artificial Intelligence Techniques for Brain–Computer Interface in Mental Fatigue Detection” (2023) [
37].
Flexibility: The simplicity of GRUs can make them more flexible to adapt to different data sizes, which can be advantageous in BCI applications where datasets may be limited or highly varied [
40].
Weaknesses of GRU:
Memory capacity: Although GRUs are efficient, they may have a slightly lower memory capacity compared to LSTMs, potentially posing a drawback when modeling EEG signals that require the capture of long-term information.
Generalization: GRUs may have difficulty generalizing in some cases, especially when dealing with complex or subtle patterns in the data, which could affect the accuracy of attention prediction and meditation.
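The efficiency difference between the two architectures can be made concrete by counting trainable parameters per recurrent layer: an LSTM has four gate blocks and a GRU three, so with equal input and hidden sizes the GRU carries roughly 25% fewer weights. This uses the textbook counting with one bias vector per gate; some implementations (e.g., Keras' default GRU) add a second recurrent bias, changing the totals slightly. The layer sizes below are illustrative.

```python
def rnn_params(n_gates: int, input_dim: int, units: int) -> int:
    """Input weights + recurrent weights + biases, per gate block."""
    return n_gates * (units * input_dim + units * units + units)

# Single-feature input (one EEG-derived value per step), 64 hidden units.
lstm = rnn_params(4, input_dim=1, units=64)  # LSTM: 4 gate blocks
gru  = rnn_params(3, input_dim=1, units=64)  # GRU: 3 gate blocks
print(lstm, gru, f"{100 * (1 - gru / lstm):.0f}% fewer")  # 16896 12672 25% fewer
```

The ratio is fixed at 3/4 regardless of layer size, which is why the GRU's training-time advantage persists across configurations.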
In our study, the final results show that both LSTM and GRU models perform comparably in terms of MAE, MSE, and RMSE metrics. This indicates that, despite their differences, both architectures can capture the dynamics of EEG signals to predict attention and meditation with reasonable accuracy. The choice between LSTM and GRU may depend on factors specific to the dataset and application context, such as the size of the dataset, the availability of computational resources, and the need for fast training.
We can observe how both LSTMs and GRUs have their merits in predicting cognitive states from EEG signals. The choice between them must be based on a balance between desired accuracy and available resources, as well as on the specific nature of the EEG data being worked with. On the other hand, if we also consider the results obtained from the cross-validation process, we can conclude that the results indicate that GRU offers superior performance and stability for attention-state prediction, making it the preferred choice for tasks requiring robust and consistent predictions. However, for meditation-state prediction, both GRU and LSTM demonstrate an equivalent performance, allowing the choice between them to be guided by practical considerations, such as computational efficiency. These findings provide valuable insights into the suitability of these architectures for mental-state modeling and underscore the potential for future research to further optimize their application in EEG-based cognitive-state prediction.
7.2. Implications and Possible Applications of the Research Results
The research results have several significant implications and open the door to multiple practical applications in the field of BCI, cognitive neuroscience, and mental health. The ability to accurately predict attentional and meditative states from EEG signals using LSTM and GRU networks has the potential to positively impact several areas:
7.2.1. Implications for BCI Research and Technology
Improved brain–computer interfaces: LSTM and GRU models could be integrated into BCI devices to provide real-time feedback on users’ attention and meditation states. This could improve human–machine interaction, especially in applications that require sustained concentration, such as learning or driving.
Personalization of user experience: By understanding and predicting cognitive states, applications could dynamically adapt to user needs, improving the experience in virtual reality applications, video games, and educational applications.
7.2.2. Applications in Mental Health and Well-Being
Monitoring and improving mental well-being: Wearable devices equipped with EEG sensors and the predictive models developed could be used to monitor stress levels and mental well-being, providing timely interventions, such as breathing exercises or guided meditation.
Personalized therapies: In the clinical context, the models could help personalize therapies for attention or meditation disorders, such as ADHD or anxiety, by adjusting interventions based on the patient’s brain response in real time [
41].
7.2.3. Implications for Education and Training
Improved educational tools: Education systems could use these models to assess and improve students’ concentration during learning activities, adapting content to maintain optimal attention.
Attention training: In high-performance pursuits, such as sport or music, the models could be used to train individuals in concentration and meditation techniques, improving overall performance.
7.2.4. Future Research in Cognitive Neuroscience
Understanding cognitive processes: The results may provide a basis for further studies on the underlying neural mechanisms of attention and meditation, contributing to scientific knowledge in cognitive neuroscience.
Biomarker development: The ability to predict cognitive states from EEG could lead to the development of biomarkers for various neurological and psychiatric conditions.
7.2.5. Challenges and Ethical Considerations
Data privacy and security: Implementation of these technologies must address the privacy and security of EEG data, which are sensitive biometric information.
Accessibility and equity: It is crucial to consider accessibility and equity in the development and implementation of BCI applications to ensure that the benefits are available to a wide range of users.
In summary, the results of this research have the potential to enrich human–computer interaction, improve mental health and well-being, and advance scientific understanding of cognitive processes. However, it is critical to address ethical and practical challenges in order to maximize the benefits and minimize the potential risks.
For a more detailed and specific discussion of BCI and EEG applications in mental-fatigue detection, the study [
37] provides a relevant systematic review. In addition, the survey [
38] provides an overview of EEG applications in biometric authentication and could provide insights into future applications of LSTM and GRU models in this field.
7.3. Limitations Encountered During the Study
Sample Size and Diversity:
EEG Data Quality:
Complexity of Cognitive States:
Models and Hyperparameters:
Model Interpretation:
Neural networks, especially deep ones such as LSTM and GRU, are often criticized for their lack of interpretability, which can make it difficult to understand how models arrive at their predictions.
7.4. Comparison with the Results Previously Obtained in Similar Studies
Our results demonstrate that both LSTM and GRU models are capable of effectively predicting attention and meditation values, showcasing their suitability for EEG-based cognitive-state analysis. These findings align with previous research that has employed recurrent neural networks for EEG signal processing, reinforcing their capacity to capture the temporal dynamics inherent in this type of data. For example, studies have shown similar predictive performance when using RNN-based architectures for cognitive-state classification [
36,
37]. However, many of these studies relied on preprocessed or proprietary EEG features, whereas our approach uses raw EEG signals, which enhance transparency and adaptability.
One of the notable insights from our work is that GRU models, due to their simpler architecture, provide a computational advantage over LSTM without sacrificing accuracy. This observation is consistent with prior findings in an analysis of the computational and efficiency advantages of the GRU over LSTMs [
40]. However, unlike much of the existing research that relies on multi-channel EEG systems, our study demonstrates the feasibility of using low-cost, single-channel devices, making EEG-based technologies more accessible for practical applications. These distinctions underline the relevance of our study in bridging the gap between advanced predictive models and real-world usability.
Future research could build on these findings by testing the models on larger and more diverse datasets, as well as exploring hybrid architectures or additional neural network approaches to further enhance performance and generalizability. Nonetheless, this study provides a meaningful step toward simplifying and improving EEG-based cognitive-state predictions for practical and scalable applications.
8. Conclusions
This study has explored the application of deep learning models, specifically LSTM and GRU networks, to the prediction of the cognitive states of attention and meditation from raw EEG signals. Our preliminary results indicate that these models can accurately capture the temporal dynamics and long-term dependencies present in EEG signals, which is essential for the accurate prediction of cognitive states [
25]. The performance comparison between LSTM and GRU networks has provided valuable insight into the strengths and weaknesses of each model in this specific domain. The evaluation metrics (RMSE, MSE, MAE, and SMAPE) have been essential for quantifying and comparing the performance of these models [
42].
For attention, the LSTM model with the partial feature set performed best in terms of MAE and MSE, showing lower average and squared errors. Although its SMAPE is slightly higher, this model remains preferable if absolute-error minimization is prioritized. Similarly, for meditation, the LSTM model with the partial feature set consistently outperformed the GRU model that followed the algorithm-based scheme across all metrics, indicating higher accuracy and reliability. These findings highlight the promise of deep learning models in predicting cognitive states from raw EEG signals, paving the way for further exploration of RNNs in applied neuroscience and real-time BCI systems.
In conclusion, this study highlights the potential of LSTM and GRU neural networks to predict attention and meditation states using raw EEG signals collected from single-channel, low-cost devices. Both models demonstrated strong performance, with GRU standing out as a computationally efficient option that does not sacrifice accuracy. These results underscore the practicality of these neural network architectures for real-time cognitive-state monitoring, particularly in accessible applications like neurofeedback and brain–computer interface systems.
Moving forward, future research could build on these findings by involving larger and more diverse participant groups to improve the generalizability of the models. Additionally, integrating EEG data with other physiological signals or exploring hybrid neural network architectures may further enhance prediction accuracy and expand the range of applications. Overall, this work marks an important step toward making EEG-based cognitive-state prediction both simpler and more adaptable for real-world use.