1. Introduction
Alcoholism is a well-known mental illness, recognized for its widespread negative impact and high mortality rates [
1]. According to the World Health Organization (WHO), alcohol use contributes to a significant number of deaths, accounting for approximately 5.3 percent (3 million) of total fatalities in 2022 [
2]. It ranks as the fifth leading cause of mortality [
3] and is identified as the primary risk factor for premature death and disability [
4]. Alcoholism profoundly affects overall health, leading to various ailments such as lung and kidney diseases, psychological disorders, and certain types of cancer.
Excessive alcohol consumption contributes to myriad societal issues, including violent crimes, car accidents, social problems, and family breakdowns [
5,
6]. Individuals with alcoholism often experience mental health challenges, including cognitive impairments, motor difficulties, and prolonged behavioral amendments characterized by restiveness and hopelessness [
7,
8]. The electrophysiological processes in the brain, including neural signaling and connectivity, undergo significant alterations due to alcohol consumption. These changes manifest as variations in the frequency, amplitude, and connectivity of brainwave patterns. For instance, studies have shown that alcohol intake can enhance certain brainwave frequencies while dampening others [
9]. Understanding these complex modifications in brain activity is essential for accurately diagnosing alcoholism and comprehending its neurological implications.
EEG has proven to be a valuable diagnostic and research tool in the field of neuroscience, offering insights into brain dynamics and functioning [
10]. The EEG readings reflect the electrical activity generated by neurons in various brain regions. However, interpreting these intricate EEG signals and extracting relevant information can pose challenges. Expert clinicians generally evaluate the signals visually to distinguish between healthy and alcoholic persons. However, even experienced specialists can miss signal fluctuations due to interference [
11,
12]. Considering the increasing importance of proper neurological abnormality assessment and treatment, this research aims to create an organized investigation structure for efficiently diagnosing alcoholism. A design like this can help with the early detection of probable ailments.
The research literature provides several methods, covering temporal, spectral, non-sequential, auto-regressive (AR), and temporal–spectral strategies, for automatically detecting alcoholic and healthy EEG signals [
13]. Both temporal and spectrum analysis methodologies are unsuitable for a thorough evaluation since EEG signals are unpredictable and display fluctuating properties. It is, therefore, essential to use time–frequency analysis techniques. Prior research has evaluated the power distribution of EEG data for classification purposes using AR models and fast Fourier transform (FFT) techniques [
14]. To improve the identification process, a study [
14] also suggests an automated method incorporating an AR model, a fuzzy-based customizable technique, and a dimension reduction technique.
Numerous studies [
15,
16] have used a variety of nonlinear attributes to distinguish between normal and alcoholic EEG patterns, including “approximate entropy (ApEn)”, “the largest Lyapunov exponent (LLE)”, “sample entropy (SampEn)”, “correlation dimension (CD)”, “the Hurst exponent (H)”, and “higher-order spectral features”. Recently, “cross-frequency coupling” [
17] and “amplitude modulation multiscale entropy” [
18] new nonlinear methods have been proposed, offering enhanced capabilities for characterizing the complexity and dynamics of EEG signals. These newer approaches allow for a more comprehensive and detailed analysis of brain states and have the potential to improve the accuracy of alcoholism detection. In addition, time–frequency techniques have been used in studies [
19] to separate healthy and alcoholic EEG data.
In recent years, the field of alcoholism detection has witnessed a burgeoning interest in graphical approaches, showcasing the innovative potential of data visualization and computational modeling techniques. Prominent among these methods is the second-order difference plot (SODP), which captures subtle patterns in electroencephalogram (EEG) data by highlighting variations in signal intensity over time. However, the applicability of SODPs is limited by their susceptibility to noise, making them less reliable in real-world, noisy EEG environments [
20]. Another approach gaining traction is the analysis of phase-space dynamics, which leverages the concept of attractors to explore the intricate nonlinear dynamics of brain signals. While promising, this method often necessitates the tuning of parameters, making it less user-friendly for non-experts [
21]. Graph neural networks (GNNs) have also revolutionized EEG-based alcoholism prediction by modeling neuronal activity as a graph, allowing for the introduction of complicated connections. Nevertheless, for the best results, GNNs may demand significant resources for processing and large datasets, thus limiting their utility in environments with limited resources [
22].
Ensemble approaches are emerging as a significant tool for improving the efficacy and robustness of prediction models in the field of EEG-based alcoholism diagnosis. To give an accurate depiction of EEG data, these approaches combine the benefits of numerous feature extraction or classification algorithms. These approaches are widely used in the identification of different neural diseases using EEG but rarely used for alcoholism detection. The model used in [
23] integrates input data in tabular, temporal, and picture forms with an ensemble of linear neural networks, long short-term memory (LSTM), and efficient-net classification approaches. This technique, though, has significant disadvantages, including a limiting emphasis on the EEG domain and the use of fundamental engineering traits, high processing costs, and poor classification accuracy. To resolve the aforementioned issues, the key contributions of this study are as follows:
Improved automated distinction between normal and alcoholic EEG signals through a novel design.
Developed a noise removal approach tailored for multivariate time-series data.
Introduced an innovative graphical technique utilizing fast fractional Fourier transform (fast FrFT)) to visualize EEG signals.
Conducted an evaluation of relevant graphical features for alcoholism detection.
Implemented an ensemble feature selection strategy to select the most compelling feature set.
Designed a robust classification model for EEG signal analysis.
The key novelties of this study are the development of a Fast-FrFT-based graphical approach to convert time-series EEGs into topological patterns, graphical feature extraction to understand complex EEG signal behavior, and the proposal of an ensemble feature selection approach for building a realistic alcoholism computerized framework.
2. Materials and Methods
EEG recordings from subjects who were both alcohol- and non-alcohol-consuming were used to develop the dataset for this investigation. Anyone can access this dataset for free and academic purposes at
https://archive.ics.uci.edu/dataset/121/eeg+database (accessed on 1 January 2023). The recordings come from 64 electrodes positioned on each subject’s head in rest state under the recommended sensor positions established by the American Electroencephalographic Association in 1990. The sampling frequency used for the EEG signals was 256 Hz. The dataset is divided into two parts: standard and alcoholic. The EEG data are divided into 32-second (about 16,400 samples) intervals. The three categories of open-access EEG record files include small sets, large sets, and full records, comprising data from two, ten, and 122 subjects, respectively. This work uses small database collection for research purposes. The dataset details are given in
Table 1 [
24].
Figure 1 presents a graphical depiction of both the alcoholic and control EEG signals.
This work introduces a graphical technique based on the fast FrFT for identifying normal and alcoholic EEG signals. The comprehensive process of the proposed new framework is segmented into several modules, including preprocessing, a novel graphical technique employing the fractional Fourier transform, feature extraction, ensemble feature selection, and classification. These modules are illustrated in
Figure 2. The components mentioned above are discussed further below.
2.1. Module 1: Preprocessing
The dataset contains EEG signals captured at a 256 Hz sampling rate and 12-bit resolution for 32 s (about 16,400 samples). The 32 s long EEG recordings are divided into four equal parts, each containing 2048 samples within an 8 s frame. This study uses smaller datasets for research, and the baseline filter efficiently eliminates interference like blinking and physical mobility (>73.3 μV) [
25]. The massive electroencephalogram (EEG) data are split into an eight-second frame with four equal parts of 2048 samples for subsequent analysis.
The EEG signals that are recorded from a subject’s scalp are delicate, brittle, and vulnerable to different types of interference, including electrical noise, structured noise, eye movement noise, and others. These interferences have the following mathematical expression:
In the above equation,
stands for the information and
for the artifact in the signal. The objective is to create a method that efficiently removes noise from the raw signal while maintaining the data in
. A superior technique for locating correlated data and establishing the direction of the linear relationship between two categories is principal component analysis (PCA). It is crucial to use time–frequency wavelet processing to deal with the EEG signal’s properties because it is nonlinear and not stationary. The use of wavelet decomposition, for this reason, has been tried here. A denoising technique was developed by fusing PCA and the wavelet transform. The algorithm described in [
26] is summarized below:
Apply the wavelet transform to all channel signals to break them down into their n levels.
Apply principal component analysis (PCA) on the estimated and detailed matrices of the decomposed signals.
Apply the inverse wavelet transform to the resulting principal components (PCs).
Consider applying PCA to the equivalent matrix acquired in the previous step to obtain a filtered EEG data sequence. Only a few PCs were preserved in the developed system based on the Kaiser rule, which specifies that PCs with eigenvalues greater than the corresponding individual eigenvalues should be retained. Five layers of wavelets were chosen after multiple testing. To generate the detail and approximation signals, the Sym4 wavelet function was empirically chosen.
Figure 3 presents a graphical representation of the preprocessing effects. It is evident from
Figure 3c,d that MSPCA retains the information while effectively removing the noise.
2.2. Module 2: Fast Fractional Fourier Transform as Graphical Approach
Fast fractional Fourier transform (fast FrFT) enables the investigation of signals with variable frequencies over time, making it a useful tool for studying non−stationary data like EEG signals [
27]. The fast FrFT as a graphical approach may be applied to the analysis of EEG signals to convert time-domain signals into the FrFT domain, producing a 2D representation of the signal in the time–frequency plane. This temporal visual representation of the signal’s frequency content can help in frequency component identification, alcoholism detection, and better interpretation of complex signals. With the fast FrFT’s graphical method, it is possible to identify particular frequency components and their time-varying behavior, leading to more precise neurological condition diagnosis and therapies. The method also offers a more natural way to visualize the signal and spot transitory events that could be challenging to spot when using conventional analysis methods. All things considered, the fast FrFT is a potent tool that can improve the interpretation and comprehension of EEG signals [
28]. The fast FrFT, as a graphical approach, enables the identification of specific frequency components and their time-varying behavior, allowing for more accurate diagnoses and treatments of neurological disorders. Additionally, the method provides a more intuitive way of visualizing the signal and detecting transient events that may be difficult to identify using traditional analysis techniques. The fast FrFT is a powerful tool that can enhance the analysis and understanding of EEG signals [
28]. As a graphical tool, fast FrFT is described in depth in Algorithm 1.
Algorithm 1 Fast FrFT Algorithm with Plotting for EEG Signals. |
Define Parameters: |
|
Algorithm Steps: |
- 1.
Input: EEG signal of length N and desired fractional Fourier transform angle for alcoholism detection. - 2.
Output: EEG signal in the fractional Fourier domain and a scatter plot of the real and imaginary parts of the fast FrFT coefficients. - 3.
Set the initial value of to . - 4.
Calculate the scaling factor . - 5.
Initialize k to 1. - 6.
While do: - (a)
Set FFT length to . - (b)
Calculate the filter coefficients :
- (c)
Filter the EEG signal using the filter coefficients and perform an FFT to obtain the signal in the fractional Fourier domain. - (d)
Multiply the signal in the fractional Fourier domain by the scaling factor . - (e)
Inverse FFT the signal to obtain the EEG signal in the time domain. - (f)
Set to the filtered EEG signal obtained in the previous step. - (g)
Increment k by 1.
- 7.
Output EEG signal in the fractional Fourier domain. - 8.
Calculate the fast FrFT coefficients :
- 9.
Generate a scatter plot of the real and imaginary parts of the fast FrFT coefficients for alcoholism detection.
|
A 2D representation of fast FrFT is shown in
Figure 4.
2.3. Module 3: Feature Extraction
Time-domain features, such as mean amplitude, variance, skewness, and kurtosis, define the amplitude and time duration of EEG waves. These traits reveal details regarding the overall structure of the EEG waveform, but they may miss vital frequency-specific facts that can indicate neurological processes. The frequency-domain characteristics describe the power or energy distribution of EEG signals over distinct spectrum bands, such as alpha, beta, theta, and delta. These characteristics can show changes in neurological activity linked with different cognitive or behavioral states and can help recognize frequency-specific behavioral trends in activity.
Graphical features visually depict the EEG signal by combining data from the temporal and frequency domains. These characteristics enable frequency distribution changes over time to be examined and specific trends or factors that may be significant for alcoholic EEG signal categorization to be identified. Graphical aspects have the beneficial feature of providing a more understandable and clear depiction of the EEG signal than numerical values alone. Investigators can find complicated connections and patterns that would be challenging to determine from numerical data alone by visualizing the signal in this manner. It is possible for individuals who consume alcohol to have temporal and spectral changes, and connectivity of brain waves in their EEG signals. For example, alcohol can decrease specific brainwave patterns while enhancing others [
6].
This study proposes a set of thirty-four novel graphical features, as illustrated in
Figure 5, for quantifying and analyzing electroencephalogram (EEG) signals in the context of alcoholism identification. These features capture various aspects of the EEG signal’s characteristics and are designed to provide insights into the variation, complexity, self-similarity, scatter rate, symmetry, and distribution of data points on a 2D space. The details of the thirty-four novel graphical features utilized for the experiments are as follows [
21].
F1 (summation of consecutive circles area (SCCA)) measures the variation in the graphical fast FrFT of EEG signals by summing the areas of subsequent circles. F2 (summation of consecutive triangles area (SCTA)) quantifies the 2D dynamics with more flexibility by adding the areas of consecutive triangles. F3 (summation of Heron’s circulars area (SHCA)) captures the self-similarity of the phase-space dynamics through the summation of the areas of Heron’s circulars. F4 (summation of distances between Heron’s circulars (SDHC)) calculates self-sameness and intricacy by adding the distances among succeeding Heron’s circulars. F5 (summation of the angles between Heron’s circulars (SAHC)) quantifies the similarity between 2D shapes using the angles between successive Heron’s circular centers. F6 (summation of successive vector lengths (SSVLs)) captures the amplitude variation in the time domain by summing the lengths of successive vectors. F7 (shortest distance from each point relative to the 45-degree line (SH45)) and F8 (summation of shortest distance from each point relative to the 135-degree line (SH135)) measure the scatter rate of data in different quarters of the 2D shape relative to specific lines. F9 (area of octagon (AOCT)) quantifies the extent of data expansion through the calculation of the area of an octagon. F10 (summation of distances to a coordinate center (SDTC)) measures the variation in the 2D structure from the central point by summing the distances to the center. F11 (summation of angles between three consecutive points (SATP)) captures the smoothness of the 2D shape by summing the angles between three consecutive points.
F12 (summation of triangles areas made by successive points and coordinate center (TACR)) combines the features of SSVLs and SDTC to measure variation and self-similarity simultaneously. It calculates the sum of the areas of triangles formed by connecting successive points to the coordinate center. F13 (summation of consecutive rectangular area (SCRA)) incorporates the features of SH45, SH135, and SDTC to quantify data scattering from multiple reference lines simultaneously. It sums the areas of rectangles formed by the points on specific lines. F14 (two-dimensional standard descriptors (TDSDs)) uses two lines to depict the dispersion of data values on the grid. F15 (elliptical area (ELPA)) captures the elliptical pattern of the EEG signal’s phase-space dynamics by calculating the area of the fitted ellipse. Finally, F16–F34 (central tendency measures (CTMs)) are central tendency measures that provide insights into the distribution of points on the coordinate plane. These measures are calculated for different percentiles and visually represented as circles of varying sizes, reflecting the variability of data points.
Collectively, these thirty-four graphical features offer a comprehensive framework for analyzing EEG signals in the context of alcoholism identification. They provide valuable information about various aspects of the signals’ characteristics and can contribute to more accurate and insightful assessments in this domain.
2.4. Module 4: Feature Selection
Feature selection plays a crucial role in the development of a reliable and accurate algorithm for detecting alcoholism based on EEG signals. The complexity and information contained in EEG signals are immense. Therefore, it is essential to choose pertinent features that accurately depict the symptoms of alcoholism in EEG signals. Feature selection aims to decrease the feature space’s dimensionality while maintaining critical information. As a result, the algorithm becomes more effective and reliable and can precisely identify alcoholism from EEG patterns. We used a novel ensemble feature selection approach in this study. The use of an ensemble feature selection method for EEG feature selection is motivated by the need to circumvent the constraints imposed by individual feature selection methods. Because EEG signals are complex and multidimensional, no feature selection approach can capture all of the vital information from these signals. Ensemble feature selection methods incorporate different feature selection strategies to combine the capabilities of several feature selection methods, thus can overcome the limits of individual methods and give more robust and accurate feature subsets. The proposed ensemble feature selection method is shown in
Figure 6 and explained in the subsequent Algorithm 2.
Algorithm 2 Ensemble Feature Selection. |
- 1.
Input: A dataset D with m samples and n features, and a positive integer K indicating the number of features to select. - 2.
Output: A subset of K features that are highly correlated with the target variable but uncorrelated with each other. - 3.
Extract graphical features using fast fractional Fourier transform. - 4.
For to n: - (a)
Calculate information gain (IG) for feature i using the dataset D and feature A:
- (b)
Calculate ReliefF score for feature i based on differences between samples:
- (c)
Calculate variance score for feature i:
- (d)
Calculate NCA score for feature i based on the conditional probability :
- (e)
Calculate CFS score for feature i by considering correlations between features and the target variable:
- 5.
Combine scores for each feature by taking their average or using a weighted average. - 6.
Select the top-K features based on their scores.
|
2.5. Module 5: Classification
The features are then utilized as the input for the classification stage after choosing the optimum feature subset. For EEG alcoholism categorization, several neural network topologies are used in this study. The single-layer neural network (NN), multilayer neural network (MLNN), feed-forward neural network (FFNN), cascade forward neural network (CFNN), recurrent neural network (RNN), and generalized regression neural network (GRNN) are examples of these architectures. Each of these architectures has specific abilities and characteristics that can help to accurately classify EEG data in alcoholism identification.
8. Discussion
In this study, our objective was to develop a robust graphical technique for distinguishing between normal and alcoholic EEG signals, aiming to achieve improved outcomes. We initiated this process by applying the MSPCA approach to extract clean signals from multivariate EEG data. Subsequently, we delved into the complex and unpredictable behavior of EEG signals using a 2D fractional Fourier transform-aided graphical approach to differentiate between normal and alcoholic classes. Our analysis revealed distinctive characteristics of the 2D shape of EEG signals in the alcoholic group, including a more prominent and wider area, with broader dispersion trends originating from the coordinate center and bisector of trigonometric regions. These findings suggest the potential suitability of this graphical representation as a visual indicator for alcoholism examination in medical practice, facilitating neurologists in comprehending the impact of alcohol on the brain.
A closer look at the mean values among the normal and alcoholic categories, as presented in
Table 3, helps in interpreting the observed discrepancies. For instance, feature 1 exhibited a lower mean value in alcoholic subjects, implying a deficiency of this feature in alcoholics compared to non-alcoholics. In contrast, feature 11 displayed a higher mean value in alcoholic subjects, indicating a higher occurrence of this feature in alcoholics. These distinctions provide valuable insights into the dataset’s properties and trends, contributing to a clearer understanding of the association between graphical features and alcoholism. An effective feature selection approach is essential to effectively comprehend the structural characteristics of normal and alcoholic EEG signals. In response to the challenges posed by various feature selection methods, we proposed a novel correlation-based ensemble feature selection method for EEGs.
Table 5 provides a comprehensive analysis of our proposed computerized approach for alcoholism detection using EEG signals in comparison with existing studies, including [
24,
29,
30,
31,
32,
33,
34,
35], which predominantly employ wavelet analysis. While wavelet techniques have been widely utilized in EEG signal processing, it is essential to acknowledge their inherent limitations, including the trade-off between time and frequency resolution, subjectivity in parameter selection, challenges in interpretability, and the assumption of signal stationarity. In light of these considerations, the proposed method offers several advantages, as described below.
Parameter independence: The primary benefit of our suggested method is that it does not rely on any specific parameter adjustments. The fast FrFT technique is based on a desirable fractional Fourier transform angle , which can be selected based on the unique analytic objectives. This parameter independence removes the requirement for parameter fine-tuning or optimization, making the procedure simpler for users and less susceptible to subjective bias.
Time–frequency localization: The fast FrFT approach uses the time–frequency plane representation produced from EEG signal processing. This approach allows a more localized view of frequency components over time, allowing more accurate analysis of non-stationary signals. By visualizing the signal in the time–frequency plane, our suggested method can aid in understanding and detecting transitory occurrences or patterns by facilitating the recognition of specific frequency components and their time-varying behavior.
Graphical representation: A scatter plot of the real and imaginary sections of the fast FrFT coefficients is generated using the fast FrFT method. This graphical representation enables easy visualization and investigation of the signal’s properties. Visually identifying and examining patterns, anomalies, or specific features provides vital insights into the fundamental structure of the EEG data. This graphical approach can be beneficial for spotting complex patterns or minor alterations that would be difficult to discover using other techniques.
Table 5 includes details about the method and features used, feature selection approaches, cross-validation methods, classifiers, and accuracy, sensitivity, and specificity scores. Our proposed work uses the FHWT with matrix determinant as features for detecting alcoholism, without employing feature selection. The classifier used is RNN, and a 10-fold cross-validation approach is applied. Our proposed work achieves an accuracy of 93.3%, with equal sensitivity and specificity. Comparing our work with existing studies, it is evident that various methods, features, feature selection strategies, cross-validation methods, and classifiers have been employed in the literature. While our proposed work achieves an accuracy of 93.3%, some previous research reports higher accuracy values, such as 97.91%, 98.91%, and 99.16%. It is important to note that our work does not employ feature selection, which might explain the slightly lower accuracy compared to certain previous studies.
Furthermore, our research includes two variations: one that utilizes all features obtained from FHWT and employs the CFNN classifier, resulting in an accuracy of 96.3% and sensitivity and specificity of 95.8%. The alternative variation uses the CFS technique to select specific features and employs the RNN classifier, achieving an accuracy of 97.5%, a sensitivity of 96.7%, and a specificity of 98.3%. In summary, our proposed computerized study demonstrates competitive accuracy in alcoholism detection using EEG signals.
It is also important to consider the computational expense of our suggested method, as indicated in
Figure 9. The complexity varies for each step, with the fast FrFT step having a computational complexity of
, where
N represents the length of the EEG signal. The Pearson correlation step involves a complexity of
when evaluating correlations among features and the target variable and among pairs of features, with a combined step complexity of
. The top-
K selection step has a complexity of
. Importantly, our method offers parameter independence, removing the need for parameter fine-tuning or optimization, making it user-friendly and less susceptible to bias. It also provides improved time–frequency localization and graphical representation, which facilitates the recognition of specific frequency components and time-varying behaviors in EEG data.
Figure 10 presents a combined plot overlaying the original sequence with gradient-weighted class activation mapping (Grad-CAM). Grad-CAM helps to visualize the time steps in the input sequence that were crucial for the alcoholism identification, as shown by dashed horizontal lines in
Figure 10. This approach not only aids in identifying influential time steps relied upon by the model but also facilitates the detection of potential data irregularities, ensuring the reliability and robustness of the analysis.
While our study presents promising results, it is essential to acknowledge some limitations and consider future research directions:
- 1.
Data size: The study’s efficacy can be further evaluated with larger datasets to enhance the model’s robustness and generalizability.
- 2.
Clinical validation: Collaborating with medical professionals for clinical validation and testing on real-world EEG datasets is essential.
- 3.
Model optimization: Further optimization of model parameters and architecture might enhance the accuracy and efficiency.
- 4.
Incorporation of explainable machine learning: Understanding the importance of explainable artificial intelligence, particularly Shapley additive explanations; we plan to include these methods in our future work. This addition will contribute to the interpretability of our models, providing valuable insights into the features and patterns influencing the predictions.
By addressing these limitations and pursuing future research in these directions, we aim to provide a more comprehensive and effective solution for EEG-based alcoholism detection.