1. Introduction
Sleep, a fundamental biological state of the mind and body, is crucial for maintaining cognitive function, emotional regulation, and overall physical and mental health [
1,
2]. Extensive evidence indicates that sleep quality is closely linked to anxiety, stress, and depression in both clinical and non-clinical populations [
2]. Older adults, in particular, frequently experience disrupted sleep patterns and report poorer sleep quality compared to younger individuals [
3]. However, it is essential to note that sleep problems in older adults are rarely a direct consequence of aging [
4]. Consequently, the study of sleep quality has garnered increasing attention across different age groups.
Sleep quality encompasses several factors, including sleep onset latency, total sleep time, total wake time, sleep fragmentation, sleep efficiency, and disruptions such as apnea [
5]. The Pittsburgh Sleep Quality Index (PSQI) is a widely utilized tool for assessing the overall sleep quality based on individuals’ retrospective evaluations of various sleep factors over the past month [
6,
7]. While the PSQI offers valuable insights and is a reliable indicator of perceived sleep quality, it is limited by the inherent biases associated with self-reporting and subjective measures. Subjective and objective assessments of sleep quality may not always align, with discrepancies typically more pronounced in individuals with poor sleep [
8].
Electroencephalography (EEG) is a widely used, cost-effective, and noninvasive neuroimaging technique that records post-synaptic potentials from many neurons, predominantly cortical pyramidal cells [
9,
10,
11]. Its good temporal resolution makes it particularly useful for objectively assessing sleep quality [
12,
13,
14,
15,
16] and studying aging [
14,
17,
18,
19,
20]. When applied during sleep, EEG frequency sub-band patterns, combined with the information from polysomnography, provide valuable insights into the sleep state and abnormalities [
21]. However, sleep studies conducted in laboratory settings can be challenging. Alternatively, acquiring resting-state wake EEG presents a practical solution, allowing for data collection in a more natural setting while maintaining data quality. Growing evidence suggests that EEG complexity during wakefulness may reflect the underlying processes of sleep regulation [
22,
23], offering indirect insights into sleep quality without requiring overnight recordings. Although this method does not directly capture sleep-specific phenomena—such as rapid eye movement (REM) sleep or slow-wave activity (SWA)—studies have shown associations between wake EEG complexity and quantified SWA during sleep onset [
22]. Consequently, combining wake EEG with self-reported sleep quality data presents a simpler yet innovative research strategy. It circumvents the challenges of traditional sleep recordings and can help assess the reliability of wake EEG in evaluating sleep quality, while preserving the benefits of lower cost, greater accessibility, and reduced participant inconvenience. Despite its potential, only a few studies [
12,
14,
24] have explored sleep quality using awake EEG signals in healthy populations. For instance, Buckelew et al. [
12] compared healthy individuals with good and poor sleep quality using resting-state and sensory/cognitive tasks. They found that theta activity decreased from the eyes-open resting state to sensory attentiveness tasks in individuals with good sleep quality, whereas the opposite trend was observed in those with poor sleep quality. Bartolacci et al. [
24] assessed driving-related cognitive abilities in younger and older adults, revealing that elderly subjects exhibited lower habitual sleep efficiency and poorer performance on psychomotor vigilance, attention, and perception tasks, while younger adults reported higher self-rated sleepiness.
While power spectrum (PS) analysis, commonly used in studies examining aging and sleep, effectively identifies dominant frequency bands and linear correlations in the brain activity, it fails to capture the nonlinear and complex temporal dynamics of EEG data [
25]. Nonlinear EEG processing methods have been applied separately to study either aging [
17] or sleep quality [
16,
26]. Some of these methods include measures of statistical self-similarity, such as the Hurst Exponent (H) [
26], and entropy, which quantifies the level of chaos in a dynamic system. Increased entropy and reduced interhemispheric connectivity have been observed in healthy aging, potentially indicating a decreased ability of the brain to adapt to novel or complex situations [
27,
28]. Jeong et al. [
16] investigated eyes-closed EEG recordings from healthy male participants in both normal and sleep-deprived states. Their analysis revealed that sleep deprivation significantly reduced correlation dimension (D2), a nonlinear measure of system complexity, in specific EEG sensors. Only one study by Penalba-Sánchez et al. [
14] has used nonlinear EEG measures to classify sleep quality in aging. They classified young and older individuals based on their sleep quality using nonlinear features, achieving an accuracy of 85% across all channels and up to 92% in the delta frequency band for specific channels. Their findings suggest that an algorithm combining nine complexity features effectively discriminated between age groups (e.g., young adults with good sleep quality vs. older adults with good sleep quality), particularly highlighting differences in the lower frequency bands (theta and delta). The classification accuracy improved further when comparing younger adults with good sleep to older adults with poor sleep, suggesting a relationship between aging and sleep quality at the neural level.
To further investigate these nonlinear brain dynamics, advanced analytical tools such as sample entropy (SampEn) [
29] and dispersion entropy (DispEn) [
30] have been developed, offering significant advantages over traditional PS analysis for EEG signal evaluation. SampEn and DispEn overcome the limitations of PS analysis by quantifying the regularity and unpredictability of EEG signals, thereby detecting nonlinear patterns that PS might overlook. Building upon these methods, multiscale entropy (MSE) [
31] and multiscale dispersion entropy (MDE) [
32] enhance the analysis by evaluating entropy across multiple temporal scales through a coarse-graining procedure. This multiscale approach recognizes that brain dynamics occur across various time scales—from rapid neural processing to slower network interactions—offering a more detailed and comprehensive evaluation of signal complexity [
25].
MSE or MDE reveals different aspects of brain signal variability at each scale. Fine scales (short time intervals) capture local neural functions and high-frequency activities, reflecting immediate and intricate brain functions [
25]. Coarse scales (longer time intervals) assess large-scale network dynamics and lower-frequency interactions, indicative of broader cognitive and integrative processes. MSE and MDE can distinguish different physiological states, such as differentiating younger from older adults, by highlighting how the complexity is distributed across these scales [
32,
33]. For instance, healthy EEG signals exhibit high complexity across multiple scales, suggesting robust and adaptable brain dynamics. In contrast, pathological or aging-related EEG patterns show reduced complexity, supporting the “complexity-loss” theory [
31,
33]. Thus, MSE and MDE provide nuanced insights into both the linear and nonlinear aspects of brain activity, making it a superior tool for understanding the multifaceted nature of EEG signals compared to conventional PS analysis [
17,
25].
Although MSE and MDE as single-channel entropy measures effectively evaluate the complexity and irregularity of individual EEG channels, they are limited in detecting intricate patterns and interdependencies across multiple channels [
34,
35]. In contrast, multivariate MSE (mvMSE) [
34], and multivariate MDE (mvMDE) [
34,
35] leverage relationships between multiple EEG channels, enabling the identification of coordinated activity patterns that single-channel measures overlook. These multivariate approaches capture the collective dynamics of brain signals, providing a more comprehensive assessment of the data and facilitating the selection of channel subsets that better represent the underlying neural processes. While MSE and mvMSE can assess signal complexity, they may produce undefined or unreliable entropy values for short signals. In contrast, MDE and mvMDE effectively capture nonlinear dynamics and interdependencies across multiple EEG channels, resulting in higher classification accuracy or effect sizes for different health condition states [
17,
32,
35].
A key limitation of applying mvMDE to all EEG channels is its simultaneous inclusion of both intra- and inter-regional dispersion patterns. This dual focus can obscure the analysis when the primary goal is to understand interactions between distinct brain regions, as the local complexity within regions may overshadow the interpretation of inter-regional dynamics. To address this gap, we developed and evaluated two novel strategies for optimizing mvMDE application in EEG analysis. The first strategy, namely Regional Group Analysis, involves dividing EEG sensors into five brain regions and applying mvMDE within each region to isolate intra-regional patterns. These results are then compared with the average MDE of all sensor signals within each brain region to evaluate local complexity. The second strategy, namely Representative Channel Selection, focuses on capturing inter-regional dynamics by selecting the most informative channel from each region using four feature selection methods: maximum correlation (MaxCorr), maximum entropy (MaxEn), mutual information (MI), and principal component analysis (PCA). mvMDE is then applied to these reduced channel sets, and the results are compared with traditional full-channel mvMDE and average MDE. This targeted approach helps identify key channels that capture inter-regional dynamics while reducing memory and computational demands by limiting the number of processed channels. Selecting the representative channels from each major brain region enables mvMDE to capture meaningful interactions between distinct regions, such as the frontal and temporal areas, while minimizing redundant intra-regional patterns. Together, these strategies fill an important methodological gap by improving the interpretability and efficiency of multivariate entropy analysis in EEG research.
To the best of our knowledge, no prior studies have applied multivariate entropy measures to assess sleep quality and aging, particularly using resting–awake EEG recordings. Moreover, the existing research has not explored the integration of channel selection strategies prior to conducting multivariate entropy analysis. This represents a notable gap, as channel selection can significantly reduce computational demands and potentially improve classification accuracy—especially important when working with high-density EEG data. To address these limitations, we investigated the application of mvMDE for distinguishing between younger and older adults, further subdividing each age group into good and poor sleep quality categories.
The contributions of our study are multifaceted. Firstly, we applied a multivariate multiscale entropy type and introduced a novel approach for both region-specific and representative sensor selection. Secondly, we comprehensively compare different sensor selection methods to assess the brain dynamics concerning sleep quality and aging. Both approaches exhibited trends similar to traditional methods, with slightly better results at higher scale factors (SFs), which may be related to aging. Additionally, these approaches significantly reduce computational time and memory usage, which is crucial for analyzing large datasets and high-density EEG channel recordings. Finally, this study is the first to explore both intra- and inter-regional dispersions using mvMDE, offering new insights into the brain complexity and variability across different age groups and sleep quality subgroups.
2. Materials and Methods
The flowchart of our study is shown in
Figure 1. Evidently, after screening and EEG data recording from four groups of healthy individuals, the EEG data are preprocessed. The preprocessed data are then fed into our two proposed scenarios related to intra- and inter-regional analysis. Our proposed approaches include two scenarios that use entropy-based techniques, together with sensor selection. Finally, suitable statistical analysis and machine learning algorithms are applied to the extracted features to provide a comprehensive evaluation of our study. The following sections explain each of these steps in detail.
2.1. Participants
The current study used the eyes-closed resting-state EEG dataset reported by Penalba-Sánchez et al. [
14], which received ethical approval from the Health Research Authority, UK (REC reference: 17/EM/0101). The study was conducted with 58 healthy right-handed participants, divided into the following four groups based on age and sleep quality assessed by the PSQI:
11 young individuals with good sleep quality (Y-GSQ) (5 females, mean age = 23.36 ± 2.70)
13 young individuals with poor sleep quality (Y-PSQ) (5 females, mean age = 25.53 ± 3.54)
9 older individuals with good sleep quality (O-GSQ) (4 females, mean age = 73.77 ± 5.45)
25 older individuals with poor sleep quality (O-PSQ) (17 females, mean age = 72.56 ± 3.40)
It is worth noting that sleep quality was evaluated using the self-reported PSQI questionnaire, which includes 19 items across seven subcategories: subjective sleep quality, sleep duration, sleep onset, sleep efficiency, daytime dysfunction, use of sleeping medication, and sleep disturbances [
6]. This tool is considered highly effective for assessing sleep quality. The subcategories are combined into a global PSQI score, categorizing participants as good or poor sleepers in this study. Scores range from 0 to 21, with scores above 5 indicating poor sleep or with significant sleep disturbance [
6]. Thus, all good sleepers have a PSQI score < 5, and all poor sleepers have a PSQI score > 5. All participants included in the study met the following inclusion criteria: they were fluent in English, had normal or corrected-to-normal vision, and fell within one of the two designated age groups—20–34 years (younger adults) or ≥65 years (older adults). Participants reported no history of cognitive, psychiatric, or neurological disorders and were not taking any medications that could affect EEG recordings. To ensure data quality and minimize potential confounding variables, participants were asked to abstain from alcohol for 24 h and from caffeine and nicotine for at least 3 h prior to the EEG recording session. Exclusion criteria included a history of paranoid or paraphrenic illness, memory impairments, dyslexia, drug abuse, or any psychiatric or neurological disorder (including current or past diagnoses) or medications that could affect EEG recordings. For the older adult group, additional exclusion criteria included a definite or probable diagnosis of Alzheimer’s disease (AD), dementia, or any dementia-related condition; a history of epilepsy or stroke; any evidence of clouded consciousness; and a history of substance abuse or dependence (including alcohol). To further screen for mild cognitive impairment (MCI), older adult participants completed the Hopkins Verbal Learning Test-Revised (HVLT-R) [
36] prior to participation. This test was administered to ensure the healthy older adults were not likely to be experiencing MCI. This test was chosen due to its superior sensitivity and specificity compared to other cognitive assessments, such as the Mini-Mental State Examination (MMSE) [
37]. All participants gave their informed consent before commencing the data recording.
2.2. EEG Data Recording and Preprocessing
The original EEG data were recorded using a BioSemi Active-Two amplifier with 128 Ag/AgCl scalp electrodes (BioSemi Inc., Amsterdam, Netherlands) at a sampling rate of 2048 Hz and digitized at 24-bits. Referencing was performed online using CMS/DRL feedback loop with a low pass filter (5th order since response with a −3 dB at 1/5th sampling rate). During the electrode application, offsets were examined to ensure they were <20
. In addition, seven external electrodes were placed around the eyes and face to assist with the detection and removal of ocular and facial artifacts. EEG recording took place in a specialized laboratory within a purpose-built Faraday cage to minimize electrical interference. The ambient temperature was controlled and maintained between 21–23 ∘C. Data were collected using ActiView V6.05 (National Instruments, TX, USA) on a Windows PC. Participants were seated comfortably with EEG equipment attached and were offered breaks as needed throughout the session. During data collection, they were instructed to remain still, stay awake, and keep their eyes closed throughout the resting-state EEG recording. It is worth noting that data collection was standardized between 10:00 and 14:00 to mitigate possible diurnal differences. For this study, 32 channels based on the 10–20 system (Fp1, Fp2, AF3, AF4, F3, F4, F7, F8, Fz, FC5, FC6, FC1, FC2, C3, C4, Cz, CP1, CP2, CP5, CP6, T7, T8, P3, P4, P7, P8, Pz, PO3, PO4, O1, O2, Oz) were selected, as commonly reported in studies of aging disorders [
9]. This selection reduces computational load while maintaining sufficient spatial coverage for clinical and cognitive analyses. All preprocessing steps were performed using the EEGLAB toolbox [
38]. After re-referencing all channels to a linked mastoid, a bandpass filter with 0.1 and 45 Hz cutoff frequencies was applied to all channels. Furthermore, the DC offset was removed, and the data were downsampled to 256 Hz. The data were then visually inspected, and all noisy segments were removed before applying independent component analysis (ICA) using the Infomax ICA algorithm. Subsequently, the components related to eye blinking or muscle artifacts were excluded, and removed channels were interpolated. Finally, the first minute of data was divided into 4-second epochs for further analysis.
2.3. Proposed Approaches
In this study, we employed two scenarios reflecting intra- and inter-regional analyses to evaluate the proposed approaches comprehensively. In the first scenario, focusing on intra-regional analysis, all EEG sensors were divided into five conventional brain regions based on previous studies, as illustrated in
Figure 2 (left topoplot). The proposed mvMDE was then applied separately to each regional group, yielding a complexity vector of size
for each region. To validate this scenario, MDE was first computed for each channel individually, and then these channel-wise metrics were compared to the mvMDE values averaged across the sensors within each region.
Figure 2 provides an overview of the intra-regional analysis approach.
In the second scenario, centered on inter-regional analysis, we introduced an innovative representative sensor selection approach. First, one sensor was chosen from each of the five main brain regions (frontal, central, temporal, parietal, and occipital) using four different methods—MaxCorr, MaxEn, MI, and PCA. We then applied mvMDE to these selected five sensors to capture the complexity and irregularity of inter-regional EEG dynamics. The resulting complexity vector (
) was subsequently compared with the output from both the conventional mvMDE calculation (using all 32 EEG sensors) and the averaged MDE (calculated from channel-wise MDE values).
Figure 3 provides a flowchart illustrating this representative sensor selection scenario.
As can be observed, the implementation of these two scenarios required several approaches, including MDE and mvMDE calculations, as well as sensor selection methods. The following subsections present the theoretical background of these key methods, detailing their implementation and relevance to the study objectives.
2.4. Multiscale Dispersion Entropy
The computation of MDE involves the following steps: Consider a signal of length
L, represented as
. First, the signal
is divided into non-overlapping segments of size
, known as the temporal SF. The average value of each segment is then computed to produce coarse-grained signals as described in [
33]:
where
N denotes the length of the coarse-grained series. Finally, the DispEn is calculated for each coarse-grained signal.
DispEn is a method based on Shannon entropy and dispersion patterns, offering a computationally efficient and robust approach to entropy estimation. It is particularly effective for analyzing short time series. First introduced in [
30], DispEn has been extensively applied in biomedical signal analysis, including EEG [
39,
40], magnetoencephalography (MEG) [
41], cardiac signals [
42], and heart sounds [
43].
The DispEn for a signal is computed through the following steps:
(1) Each element
is mapped into
classes with integer indices ranging from 1 to
. This mapping is performed using the normal cumulative distribution function (NCDF) to address the problem of skewed class assignments caused by extreme values. The NCDF transforms
into
within the interval [0, 1]:
Here,
and
denote the mean and standard deviation of
. A subsequent linear transformation assigns each
to an integer between 1 and
c using
, where
is the
element of the classified signal [
30].
(2) Vectors
are constructed with an embedding dimension
m and delay
d:
Each vector
is mapped to a dispersion pattern
[
30,
44], where
,
,…,
. The total number of possible patterns is
.
(3) The relative frequency of each pattern
is determined:
where # denotes the count of patterns.
(4) Finally, DispEn is computed as:
2.5. Multivariate Multiscale Dispersion Entropy
The mvMDE algorithm is designed to extend DispEn to multivariate time series, capturing both temporal and spatial complexity. It comprises two primary steps: (1) coarse-graining the multivariate signal, and (2) calculating the multivariate dispersion entropy (mvDE). The detailed procedure is as follows:
Assume we have a multivariate time series with p channels, each of length L.
For each channel
k, the signal is divided into non-overlapping segments of size
, termed the SF. The average of each segment is calculated to generate coarse-grained signals:
Here,
N is the length of the coarse-grained signal. The coarse-grained signals
represent the smoothed versions of the original multivariate time series, preserving the key dynamics at the chosen scale
.
(a) Mapping to Classes: Each element
is mapped to one of
classes with integer indices (1 to
). This mapping is achieved using the NCDF to normalize the data between 0 and 1:
where
and
are the mean and standard deviation of channel
k. A subsequent linear mapping assigns each
to an integer between 1 and
:
(b) Constructing Multivariate Embedded Vectors: Based on Takens’ embedding theorem [
45], multivariate embedded vectors
are constructed to capture both spatial and temporal dependencies:
for
, where
is the embedding dimension and
is the delay for channel
k. For simplicity, we assume
and
for all
k.
(c) Generating Combinations of Patterns: Each multivariate vector comprises elements. For these elements, all combinations of m elements at a time are generated, termed , where . The number of possible combinations for all vectors is .
(d) Calculating Pattern Frequencies: For each combination
and each of the
possible dispersion patterns
, where
,
,…,
, the relative frequency is computed:
where # denotes the count of patterns in the embedded vectors.
(e) Computing mvDE: Finally, mvDE is calculated using Shannon entropy:
2.6. Channel Selection Methods
Channel selection in EEG signal processing is crucial for enhancing computational efficiency, minimizing overfitting, and reducing setup time in applications such as seizure detection, motor imagery classification, and emotion recognition. The primary approaches can be categorized into five strategies: filtering, wrapping, embedding, hybrid, and human-based techniques [
46]. Filtering techniques independently evaluate channels using statistical measures like variance and entropy, offering speed and scalability. Wrapping techniques assess channel subsets using specific classification algorithms, achieving higher accuracy but increasing the computational complexity and overfitting risks. Embedding techniques optimize channel selection within the model training process, balancing efficiency and accuracy. Hybrid methods combine filtering and wrapping strategies, leveraging their strengths for robust performance with reduced computational demands. Human-based techniques rely on expert knowledge to identify relevant channels, which is particularly beneficial in domains requiring specialized expertise. Together, these methods provide a comprehensive toolkit for optimizing the EEG channel selection tailored to specific application needs.
In this context, it is essential to note that our goal is not to evaluate the classification results but to focus on channel selection methods’ impact on the computational efficiency and signal representation. The choice of channel selection strategy should align with the specific requirements and constraints of the application at hand without being solely dependent on the classification-based performance metrics. In this study, we propose four primary methods for channel selection: MaxCorr-based selection, MaxEn-based selection, MI-based selection, and PCA-based selection. Each technique offers unique advantages in identifying the most representative EEG channels that capture the essential characteristics of brain activity within specific regions.
2.6.1. MaxCorr-Based Selection
Correlation-based selection is a method that selects EEG channels based on their average correlation with other channels within the same brain region [
47]. This technique emphasizes channels that are centrally connected to the network of electrodes, thereby capturing the common activity patterns while reducing redundancy. A correlation matrix is computed for each brain region to assess the pairwise correlations between all EEG channels. The average of the absolute correlation values for each channel is then calculated. The channel with the highest average correlation is selected as the representative channel for that region. This process ensures that the chosen channel reflects the central signal of the region, maintaining essential information while minimizing overlapping activity from redundant channels. Using MaxCorr-based selection effectively identifies channels that are most representative of the collective activity within a brain region.
2.6.2. MaxEn-Based Selection
Based on the principle of maximum entropy, the probability distribution that best represents the current state of knowledge about a system is the one with the largest entropy [
48]. In this scenario, we select the channel with the maximum entropy value for each brain region. This approach ensures that the selected channel contains the most and least biased information, providing a comprehensive representation of the EEG signal’s complexity within that region.
2.6.3. MI-Based Selection
MI measures the amount of shared information between channels, capturing both linear and nonlinear dependencies [
49]. Unlike correlation, MI can identify complex relationships between EEG channels, providing a more comprehensive understanding of information sharing. The MI between every pair of EEG channels is computed for each brain region. Subsequently, the average MI with all other channels in the region is calculated for each channel. The channel with the highest average MI is selected as the representative channel for that region. This selection process ensures that the most informative and statistically relevant signals are preserved, capturing intricate dependencies that may not be evident through correlation alone. By leveraging MI for channel selection, this method captures nonlinear dependencies between EEG channels, offering a deeper insight into the underlying brain dynamics.
2.6.4. PCA-Based Selection
PCA is a widely used dimensionality reduction technique that transforms the original correlated EEG channel data into a set of uncorrelated components known as principal components [
50]. These components are ordered by the amount of variance they explain in the data. The primary objective of PCA in our research is to identify the EEG channel that best represents the dominant activity pattern within each brain region. For each predefined brain region, PCA is performed on the EEG data to extract principal components. The first principal component (PC1), which accounts for the maximum variance, is identified. The EEG channel with the highest loading on PC1—indicating the strongest association with the principal component—is selected as the representative channel for that region. This approach effectively reduces the complexity of the EEG data while retaining the most informative features, thereby enhancing the robustness of subsequent entropy measures. By selecting channels based on their contribution to the principal components, PCA ensures that the most significant patterns of brain activity are captured. This minimizes data dimensionality and preserves the essential information necessary for the accurate analysis of brain dynamics.
2.7. Statistical Analysis and Machine Learning
To rigorously evaluate our proposed algorithms, we conducted pairwise comparisons between the two groups (Y-GSQ vs. Y-PSQ and O-GSQ vs. O-PSQ) using both statistical and machine learning methods. First, the normality of the features was assessed using a one-sample Kolmogorov–Smirnov test [
51]. As the features did not follow a normal distribution, the Wilcoxon rank-sum test was applied for statistical comparisons. Given the relatively small sample sizes, Hedge’s g effect size (ES) was also computed to provide a more robust evaluation and comparison of our proposed algorithms with conventional methods. In the machine learning step, we evaluated both conventional and proposed algorithms separately. Significant and informative features were first selected using the Wilcoxon rank-sum test on the training set. The test set was then assessed using two classifiers: support vector machines (SVM) and k-nearest neighbors (KNN). The selection of classifiers’ hyperparameters, as well as informative features, was optimized through a trial-and-error approach aimed at minimizing the classification error. To ensure robust and subject-independent evaluation, we performed leave-one-subject-out cross-validation (LOSOCV), where the data from one subject is used for testing while those of the remaining subjects form the training set. This process was repeated for all subjects to obtain the overall performance of each algorithm. Finally, the model performance was evaluated using specificity, sensitivity, accuracy, and F1-score metrics.
4. Discussion
For the first time, this study utilizes mvMDE within two distinct algorithms—region-specific and representative sensor selection—to assess sleep quality in both young and older adults. These algorithms were statistically compared to the conventional applications of MDE and mvMDE, using ES values to compensate for the limited dataset size in each group. Additionally, each method was evaluated independently using machine learning approaches to examine classification performance.
The proposed approaches introduce significant advancements in EEG signal analysis by incorporating innovative strategies for both intra- and inter-regional complexity assessments. In the first scenario (
Figure 2), mvMDE is applied to predefined EEG regions, enabling a spatially structured analysis that captures intra-regional dynamics and provides deeper insights into brain signals’ localized complexity and irregularity compared to single-channel methods. In the second scenario (
Figure 3), a novel computationally efficient framework identifies representative sensors for inter-regional analysis using MaxCorr, MaxEn, MI, and PCA techniques. This ensures that selected channels retain the most critical information while substantially reducing the computational burden of processing high-density EEG datasets.
The novelty of our algorithm lies in its dual focus: enhancing interpretability through intra- and inter-regional analysis while improving scalability via efficient channel selection. This dual approach not only advances the methodological rigor of EEG analysis but also maintains or improves the accuracy of measurements, paving the way for their applications in clinical and cognitive neuroscience, especially in studies requiring the assessment of sleep quality and aging-related changes.
Our results across both scenarios demonstrate that the proposed algorithms outperform conventional approaches in distinguishing sleepers with good and poor sleep in both young and older adult groups. As shown in
Figure 4, the region-specific analysis reveals that entropy measures exhibit trends similar to MDE across all brain regions. Furthermore, based on
Table 1, our algorithm shows slightly better differentiation between groups, particularly at higher SF values. The highest ES for comparing older adults corresponds to SF = 27, theoretically linked to lower frequency bands, especially delta. As mentioned earlier, each SF (
) is associated with specific frequency bands, calculated as
/(2 ×
), where
is the sampling frequency. Notably, the most significant results (
p = 0.01) in older adults occur in the temporal brain region, a critical area for studies on healthy aging and related disorders [
19,
52]. The hippocampus, situated in the medial temporal lobe, plays a key role in memory formation and retrieval [
53], including declarative memory consolidation during slow-wave sleep [
54]. The anterior hippocampus is part of an anterior-temporal network involved in semantic memory, while the posterior hippocampus is linked to a posterior-medial network associated with episodic memory [
55,
56]. Therefore, examining temporal lobe activity provides valuable insights into age-related memory changes and sleep disturbances. Future aging and sleep studies could combine functional magnetic resonance imaging (fMRI) and EEG to enhance spatiotemporal resolution and provide deeper insights not only into specific regions of the temporal lobe affected by aging and sleep but also into other deep brain structures, such as the locus coeruleus (LC). The LC, a brainstem region and primary source of norepinephrine, modulates wakefulness and has been identified as one of the first regions affected by both sleep processes and aging. Due to its widespread innervation of several cortical regions (e.g., strong afferent connections to the amygdala, hippocampus, and frontal areas) [
57], the LC plays a critical role in regulating fear and pain responses [
58], preserving memory [
59], and supporting attention and decision-making [
60] in both the aging process and in AD.
As illustrated in
Figure 5 and
Figure 6, and detailed in
Table 2, the second scenario—sleep quality comparisons—further highlights the importance of higher SF values, particularly SF = 25, which is associated with the delta band and sleep disturbances in older adults. The significance of slow frequency bands (e.g., delta) has been well-documented in studies on aging [
19,
52,
61,
62,
63] and sleep disorders [
13,
64,
65]. Yeo et al. [
64] recently reported significant correlations between delta and theta band powers and sleep reactivity across brain regions in older adults with insomnia, attributing these changes to aging rather than cognitive decline. Alterations in slow-frequency EEG power, such as decreased theta and delta relative band power, are associated with normal aging [
19,
52,
61,
62,
63], potentially reflecting mild brain atrophy and neurophysiological changes [
64,
66]. Our prior findings demonstrated that decreases in slow relative power during aging align with entropy measures showing similar trends [
17]. Conversely, increases in slow-frequency power and entropy have been linked to dementia and cognitive decline in older adults [
17]. This study supports the hypothesis that O-PSQ is associated with reduced entropy values in the lower frequency bands (higher SFs), as evidenced by the patterns observed in
Figure 4 (bottom panel) and
Figure 6, which reflect a compensatory brain process aimed at maintaining cognitive function and delaying dementia progression. [
67].
In contrast, as observed in
Figure 4 (top panel) and
Figure 5, Y-PSQ exhibit elevated entropy values in the lower frequency bands (higher SFs), suggesting a different response to sleep disturbances compared to older adults. This may stem from differences in tolerance to sleep abnormalities, such as poor sleep, forced sleep, or sleep deprivation, between young and aging brains [
24,
68].
Among all the methods used in the second algorithm for representative sensor selection, PCA achieves the highest ES and produces the most significant results in aging groups (
Table 2). The other methods yield comparable outcomes, with slightly better performance observed when employing MI and MaxCorr for sensor selection. The classification results (
Table 3) are consistent with the statistical findings, demonstrating that the second proposed algorithm, mainly when using PCA, and the first algorithm deliver the best classification performance, with SVM and KNN performing optimally, respectively. In this study we demonstrate an improvement over previous work using the same dataset in comparing O-GSQ vs. O-PSQ, achieving 91.18% accuracy with mvMDE-PCA. Notably, the earlier study [
14] reported accuracies of 81% (for the best channel) and 72% (averaged across all channels). For Y-GSQ vs. Y-PSQ, their algorithm achieved 88% accuracy for the best-performing channel, but only 50% when averaged across all channels. In contrast, our approach reached a more balanced accuracy of 70.83% using mvMDE-PCA, which, although slightly lower than their best-channel performance, was derived from an integrative multichannel analysis. Despite using the same dataset, the key differences between these studies may account for the variation in the results. For instance, their model evaluated each EEG channel independently, optimizing the classification at a local (i.e., channel-specific) level. By contrast, our approach focuses on both intra- and inter-regional dynamics using mvMDE, which inherently accounts for cross-channel interactions. This methodology enables a more holistic assessment of brain complexity, better capturing the distributed nature of neural compensatory mechanisms related to aging and sleep disturbances. Overall, both studies demonstrated that changes in slow-frequency bands are characteristic of typical brain aging and poor sleep quality across the lifespan. This finding aligns with previous research using traditional methodologies, which also suggest that sleep deprivation and poor sleep quality affect theta and delta frequencies [
69]. Specifically, alterations in delta and theta activity in the temporal region appear to be more strongly associated with older adults experiencing poor sleep quality, rather than with normal aging alone. We hypothesize that chronic disruptions in slow-wave activity within temporal regions in older adults may contribute to reduced clearance of brain metabolites, potentially leading to pathological aging [
70]. Furthermore, studies indicate that damage in the temporal and parietal cortices driven by tau and beta-amyloid accumulation is a hallmark of prodromal AD. In particular, the right temporoparietal cortex exhibits reduced activation and increased atrophy in AD, which has been linked to impairments in spatial navigation.
In this pilot study, we evaluated our proposed algorithms, focusing on computational time and efficiency. Notably, the mvMDE function exhibited the highest computational time among all methods. However, by employing the second algorithm with only five selected sensors, we reduced the computational burden by approximately 15-fold compared to using all 32 channels. This reduction is particularly advantageous for high-density EEG recordings, making the use of multivariate entropy measures more practical for big data analysis in such settings. Furthermore, the measured MI demonstrated the shortest running time among all sensor selection methods, followed closely by PCA. Overall, combining mvMDE with the PCA-based sensor selection approach not only delivers the best classification and statistical outcomes for the aging dataset but also significantly enhances computational efficiency, making it a highly effective solution for large-scale EEG studies.
Our findings highlight the practicality of implementing the proposed approach in real-world clinical scenarios. By optimizing the computational method and utilizing conventional machine learning techniques, our method ensures easy integration into the existing clinical workflows without requiring significant computational resources. Selection of the most informative channels within each brain region further enhances efficiency, reducing computational demands while preserving analytical accuracy. Additionally, our study highlights the importance of both intra- and inter-regional approaches with multiscale analysis to identify key brain regions and frequency ranges relevant to sleep and aging-related disorders. Despite the clinical significance and applicability of this study, further validation is essential for successful clinical translation. Independent validation using diverse datasets will help assess the model’s generalizability and mitigate potential biases. Furthermore, cross-site testing across different clinical centers with varied protocols, equipment, and patient demographics will be necessary to confirm the method’s robustness and adaptability. Importantly, by employing an inter-regional approach with representative sensor selection, our method enhances the compatibility across different EEG systems, making it more accessible for use in a variety of clinical environments. These steps will be crucial for ensuring the reliability, safety, and widespread adoption of our approach in future clinical applications.
Despite the novelty of the proposed approaches and the intriguing results, this study has some limitations that should be acknowledged. First, the sample size in this study is relatively small, which may limit the robustness and generalizability of our findings. Additionally, our dataset lacks sufficient diversity in terms of demographic and ethnic representation. Future research should incorporate larger and more diverse datasets, including individuals from multiple nationalities, socioeconomic backgrounds, and age groups within each category. Expanding the sample sizes in this way would enhance the external validity of the study and reduce potential biases. Furthermore, future studies should include additional groups, particularly those related to neurodegenerative disorders such as AD and MCI, to facilitate a more comprehensive evaluation of aging and its associated conditions. Including participants with neurodegenerative disorders would provide valuable insights into the relationship between sleep quality, aging, and cognitive decline. Moreover, examining these populations could enhance our understanding of whether sleep-related biomarkers serve as early indicators of neurodegenerative conditions leading to preventative strategies and therapeutic interventions.