Information on Drivers’ Sex Improves EEG-Based Drowsiness Detection Model

Stancin, Igor; Zeba, Mirta Zelenika; Friganovic, Kresimir; Cifrek, Mario; Jovic, Alan

doi:10.3390/app12168146


by Igor Stancin ¹, Mirta Zelenika Zeba ², Kresimir Friganovic ²,³, Mario Cifrek ² and Alan Jovic ¹,*

¹ Department of Electronics, Microelectronics, Computer and Intelligent Systems, Faculty of Electrical Engineering and Computing, University of Zagreb, Unska 3, 10000 Zagreb, Croatia

² Department of Electronic Systems and Information Processing, Faculty of Electrical Engineering and Computing, University of Zagreb, Unska 3, 10000 Zagreb, Croatia

³ Genome Institute of Singapore, Agency for Science Technology and Research, 1 Fusionopolis Way, #20-10 Connexis, Singapore 138632, Singapore

* Author to whom correspondence should be addressed.

Appl. Sci. 2022, 12(16), 8146; https://doi.org/10.3390/app12168146

Submission received: 19 July 2022 / Revised: 10 August 2022 / Accepted: 12 August 2022 / Published: 15 August 2022

(This article belongs to the Section Biomedical Engineering)


Abstract:

Objective detection of a driver’s drowsiness is important for improving driving safety, and the most prominent indicator of drowsiness is a change in electroencephalographic (EEG) activity. Despite extensively documented behavioral differences between male and female drivers, previous studies have not differentiated drowsiness detection models based on drivers’ sex. Therefore, the overall aim of this study is to demonstrate that drowsiness detection can be improved by using information on drivers’ sex, either as a feature or through separate sex-dependent datasets. Additionally, we aim to provide a reliable EEG-based sex classification model. The dataset consists of 17 male and 17 female drivers who were evaluated during alert and drowsy sessions. Frequency-domain and recurrence quantification analysis EEG features were used. Four classification algorithms and three feature selection methods were applied to build the models. The accuracy of drowsiness detection based on sex-dependent datasets is 84% for male drivers and 88% for female drivers, which is 3 and 7 percentage points better, respectively, than the classification without information about the driver’s sex (81%). The model for sex classification based on EEG achieved high accuracy: 97% of participants were correctly identified in alert sessions and 96% in drowsy sessions. All participants were correctly classified after the application of majority voting on five algorithm runs. The results suggest that sex-dependent datasets improve the accuracy of drowsiness models, which may be relevant to a variety of drowsiness detection systems currently being developed in the field.

Keywords:

drowsiness detection; EEG features; machine learning; recurrence quantification analysis; sex classification; sex differences

1. Introduction

Driving requires various cognitive skills such as visual perception, attention, memory, executive functions, and motor skills [1]. Performance in these cognitive domains is related to driving performance and driving ability. In addition, psychophysiological aspects such as drowsiness and fatigue can affect cognitive processes during driving and lead to traffic accidents [2]. Moreover, fatigue and drowsy driving are some of the major causes of traffic accidents [3,4]. Therefore, objective detection of a driver’s drowsiness is an important factor that could improve driving safety.

However, drowsiness in drivers is not easy to detect. The most used measures are self-assessments of drowsiness. The European Union regulation for the type-approval of motor vehicles with regard to their drivers’ drowsiness and attention warning systems is based on the self-assessment of drowsiness with the Karolinska Sleepiness Scale [5]. Other methods for inferring correlates of drowsiness are image-based methods, vehicle-based methods, physiological-based methods, and hybrid methods [6]. Each of these categories can be further split into subcategories, e.g., physiological-based methods can be based on electroencephalogram (EEG), electrocardiogram (ECG), respiration, electrooculogram, electromyogram, galvanic skin response, or skin temperature [7].

ECG records the heart’s electrical activity (heartbeats), while heart rate variability (HRV) refers to the variations in times between two adjacent heartbeats. HRV is considered a good drowsiness predictor and accuracies of drowsiness detection based on HRV range from 56.6% to 95% among studies [8]. Respiration can be derived from HRV but is also measured using several other techniques. Siddiqui et al. [9], in their recent research, used radar for non-invasive respiration measurement and obtained 87% drowsiness classification accuracy. Measuring skin temperature can also be used for the assessment of drowsiness. Gielen and Aerts [10] measured temperature on the nose and wrist, and obtained accuracies of 68.4% and 88.9%, respectively. Electrooculogram measures eye movement characteristics. It is often used for drowsiness detection, achieving accuracies that range from 64% to 99% among studies [11].

EEG provides a very accurate assessment of the driver’s mental state [12], achieving accuracies of drowsiness detection that range from 67% to 99% among studies [13]. Furthermore, changes in EEG activity are considered biomarkers of mental fatigue and drowsiness [14]. The most prominent indicator associated with mental fatigue and drowsiness is increased theta activity in frontal, central, and posterior cortical sites. In addition, increased alpha activity is associated with individual variability in cortical changes related to mental fatigue and drowsiness. Many studies also use ratio indices between these frequency bands as indicators of drowsiness [15], and a recent study suggests that multichannel ratio indices could bring an additional increase in drowsiness detection accuracy [16]. However, these results come from studies that either included only male participants or pooled males and females into a single group, without separating them by sex.

Differences in driving behavior between men and women have been extensively documented using experimental driving tasks, attitudes toward driving, behavioral analysis, risk perception, and the number of accidents.

Drowsiness also has differential effects on driving behavior between men and women. For example, self-rated levels of drowsiness while driving tend to be higher in women than in men [17]. Women also tend to report longer ideal sleep durations than men [18]. As a result, it appears that women’s greater need for sleep leads to higher ratings of drowsiness.

Studies also report sex differences in brain organization that influence the regulation of brain activity during awake state and sleep [19]. In addition, sleep deprivation has a differential effect on brain activity between females and males. Therefore, it is reasonable to assume that changes in EEG activity related to drowsiness differ between men and women.

There are various methods of EEG signal analysis to detect drowsiness and alertness in drivers. They are mostly based on the different types of features extracted from the signal (i.e., frequency-domain features, recurrence quantification analysis features, entropies, etc.) [20,21,22,23,24,25]. To the best of our knowledge, there is no research that includes drivers’ sex information in drowsiness detection systems.

A preliminary version of this work has been reported [26] and it shows the statistical difference between EEG features of alert male and female drivers. The main goal of this substantially extended study is to improve drowsiness detection by including the information about drivers’ sex in the classifier. This is done in two ways: (1) by considering sex as a feature, and (2) by separating the datasets into male and female. In addition, the study aims to develop a reliable EEG-based sex classification model, where correlations between features are introduced as a novel feature that differentiates between male and female drivers.

2. Methodology

2.1. Experimental Design

In our study, 34 healthy participants were recorded during two sessions in a driving simulation. All participants had a valid driver’s license and drove a car regularly but were not professional drivers. The EEG was recorded with a 32-channel actiCHamp EEG amplifier (Brain Products, Munich, Germany) with passive sintered Ag/AgCl electrodes. The electrodes were located at prefrontal (Fp1, Fp2), frontal (F3, F4), central frontal (FC5, FC1, FC2, FC6), inferior frontal (F7, F8), midline frontal (Fz), central (C3, C4), midline central (Cz), midline parietal (Pz), central parietal (CP5, CP1, CP2, CP6), midline occipital (Oz), inferior temporal (T7, T8), posterior temporal (TP9, TP10), parietal (P3, P4), inferior parietal (P7, P8), posterior occipital (PO9, PO10), and occipital scalp sites (O1, O2). The electrodes were positioned according to the International 10–20 system guidelines. The electrode at the FCz position was used as a reference. A ground electrode was placed on the forehead. BrainVision Recorder software was used for the impedance check and on-line monitoring. The EEG signals were recorded with a 1000 Hz sampling rate. A simulation scenario was shown on three wide LCD screens. The car was controlled with a professional steering wheel joystick and pedals. All participants were instructed to follow the traffic rules. Male participants averaged 30.24 years of age with a standard deviation of 6.86 years and female participants averaged 30.12 years of age with a standard deviation of 6.98 years. The driving scenario was the same for all participants and consisted of driving on state roads and highways, and in an urban environment. The study was approved by the Ethics Committee of the University of Zagreb, Faculty of Electrical Engineering and Computing.

Each participant had two recording sessions. The first recording session (alert session) began at 2:00 p.m. and the second one (drowsy session) began at midnight. The sessions were recorded on different days. The driving scenario was the same for both sessions: after 15 min of adaptation driving (simple acceleration, braking, and turning exercises), all participants were instructed to drive on a highway for 90 min, with day/night lighting adjusted according to the time of the session. Highway driving was monotonous, with very few other cars in traffic, and did not include any unexpected events (such as road crossings of animals or pedestrians). Room lighting, temperature, and humidity were controlled and were the same for both sessions.

Each EEG recording was divided into 10 s epochs with five seconds’ overlap between epochs. The first 250 epochs (~21 min) of highway driving in the alert session were labeled as periods with the highest alertness level for all participants. The last 250 epochs of the drowsy session (approximately after 70 min of highway driving) were labeled as periods with the highest drowsiness level for all participants. This labeling, where the beginning of the session is labeled as alert, and the end is labeled as drowsy, is commonly used in practice [27,28,29,30].
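The epoch arithmetic above can be checked with a short sketch. It assumes a highway recording of exactly 90 min and epoch start times on a 5 s grid; the function name and variables are illustrative and not taken from the authors' software.

```python
def epoch_starts(duration_s, epoch_s=10, step_s=5):
    """Start times (in seconds) of all full epochs that fit in the recording."""
    last_start = duration_s - epoch_s
    return list(range(0, last_start + 1, step_s))

# Assumption: exactly 90 min of highway driving per session.
starts = epoch_starts(90 * 60)

alert_epochs = starts[:250]     # first 250 epochs of the alert session
drowsy_epochs = starts[-250:]   # last 250 epochs of the drowsy session

# Time covered by the first 250 epochs, and onset of the last 250 epochs.
alert_span_min = (alert_epochs[-1] + 10) / 60
drowsy_onset_min = drowsy_epochs[0] / 60

print(len(starts), round(alert_span_min, 1), round(drowsy_onset_min, 1))
```

Under these assumptions, the first 250 epochs span roughly 21 min and the last 250 epochs begin shortly before the 70-minute mark, which agrees with the figures given in the text.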

This two-class labeling was additionally confirmed by an expert in psychophysiological behavior, based on the participants’ self-assessment of drowsiness (Karolinska Sleepiness Scale) before and after the session and on visual inspection of the participants. Participants indicated their drowsiness level on the Karolinska Sleepiness Scale for the alert session as extremely alert, very alert, alert, or fairly alert. Additionally, their total score on the Fatigue Assessment Scale indicated no fatigue before the alert session. Table 1 shows the number of male and female drivers in each class.

Although we used a 32-channel EEG, we wanted to describe dependencies and differences between specific brain regions—front left (FL), front right (FR), occipital left (OL), and occipital right (OR) regions. We calculated the mean value for each feature from five channels in each region (see Table 2). In addition to the channels from these four regions, we also included Oz, Pz, and Cz channels in our analysis.

2.2. Preprocessing and Feature Extraction

The first step of preprocessing was to filter the raw EEG signals to remove unwanted artifacts from the signal. For filtering, we used a Butterworth bandpass filter from 0.5 Hz to 40 Hz. In this way, we also filtered out the line noise (50 Hz). Independent component analysis (ICA) was also used to remove artifacts such as eye movements [31]. The Infomax ICA algorithm was used for separating the original signal into independent components [32]. If any of these components contained waveforms characteristic of eye movements, we removed these eye movements from the signal. Blinks were removed for all participants and left–right eye movements were removed whenever present (in approximately 50% of participants).

EEG features were calculated based on 10 s epochs with a five-second overlap between epochs. We computed the basic frequency-domain features: the relative power of the delta (0.5–4 Hz), theta (4–8 Hz), alpha (8–12 Hz), beta (12–30 Hz), and gamma (30–50 Hz) frequencies. They were calculated using the Thomson multitaper method [33] to obtain the power spectral density. We also used recurrence quantification analysis (RQA) [34,35,36] features: determinism (Det), laminarity (Lam), recurrence rate (RR), trapped time (TT), determinism divided by recurrence rate (Det/RR), longest diagonal (Lmax), longest vertical line (Vmax), average diagonal line length (Adll), divergence (Div), and entropy (Ent). The RQA features were calculated from the recurrence plot (RP) of the signal. RP is a 2D representation of the phase space trajectory of the signal [37]. It is a matrix with dimensions N × N, where N is the length of the signal. The position (i, j) in the matrix is marked as one if the i-th and j-th points in the signal are close to each other.

The total number of extracted features was 15 per channel/region. We used four regions (with averaged feature value from 5 channels) and three additional channels (see above), which gave us a total of 105 features.
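The feature count can be reproduced by enumerating every combination of site and feature. The naming scheme below (for example `FL_theta`) is a hypothetical convention chosen for illustration.

```python
from itertools import product

# Five frequency-domain features and ten RQA features, as listed in Section 2.2.
FREQ = ["delta", "theta", "alpha", "beta", "gamma"]
RQA = ["Det", "Lam", "RR", "TT", "Det/RR", "Lmax", "Vmax", "Adll", "Div", "Ent"]

# Four averaged regions plus three individual channels.
SITES = ["FL", "FR", "OL", "OR", "Oz", "Pz", "Cz"]

# Hypothetical naming convention: "<site>_<feature>".
feature_names = [f"{site}_{feat}" for site, feat in product(SITES, FREQ + RQA)]
print(len(feature_names))
```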

2.3. Drowsiness Detection

The 105 features described in Section 2.2 served as the initial feature set for the first drowsiness detection model. The aim of this analysis was to investigate whether sex as a feature could improve drowsiness detection.

The best combination of algorithms and hyperparameters was found using a grid search approach. The experiment was conducted using four classification algorithms (XGBoost, naïve Bayes, random forest, and support vector machines) and three feature selection algorithms (chi2, information gain, and ANOVA F-test). Table 3 shows the hyperparameters that were explored for classification algorithm selection on the training set. In addition, we also searched for the optimal number of features to include in the model (selection of 10, 20, 30, 40, 50, 60, and 100). Grid search was applied to all these hyperparameters, classification algorithms, and feature selection methods simultaneously. The decision was made based on maximizing the accuracy of a model without including sex as a feature.
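The size of the outer search space can be sketched as follows. Only the classifier, feature selection method, and number of features are enumerated; the per-classifier hyperparameters from Table 3 are not reproduced here and would multiply the number of candidates further. The helper `select_best` and its `score_fn` argument are hypothetical placeholders for whatever accuracy estimate is used.

```python
from itertools import product

CLASSIFIERS = ["XGBoost", "naive Bayes", "random forest", "SVM"]
SELECTORS = ["chi2", "information gain", "ANOVA F-test"]
N_FEATURES = [10, 20, 30, 40, 50, 60, 100]

def search_space():
    """Every (classifier, selector, number of features) combination."""
    return list(product(CLASSIFIERS, SELECTORS, N_FEATURES))

def select_best(candidates, score_fn):
    """Return the candidate with the highest score, e.g. validation accuracy.

    score_fn is a hypothetical callable that trains and evaluates one candidate.
    """
    return max(candidates, key=score_fn)

grid = search_space()
print(len(grid))
```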

Once the best algorithm and feature set were obtained, the evaluation metrics for the drowsiness detection model without sex as a feature were calculated. The classification was performed for each epoch of each participant, resulting in a total of 17,000 epochs (two classes, each with 34 recording sessions of 250 epochs) for drowsiness detection. A random two-thirds (66%) of the dataset was used for training the classifier and the remaining third (33%) was used for testing the classifier.

Then, sex as a feature was added to the dataset, and classification was performed in the same way. In the second analysis, to further investigate the influence of driver’s sex on drowsiness detection, the dataset was divided into two subsets containing only male and female drivers, respectively. The classification was performed in the same way for these two subsets of data.

2.4. Sex Classification Model

A reliable EEG-based sex classification model was developed to make the previous findings applicable in cases where the dataset does not contain information about drivers’ sex. For the sex classification model, we divided each channel/region of each participant into nine segments (the first eight segments with 27 epochs and the ninth segment with the remaining 34 epochs). The values of each feature in each of these segments were averaged, as shown in Figure 1. We chose nine segments to increase our dataset for classification to 306 records (34 participants with 9 segments each), which was sufficient to properly split the dataset into training and test subsets. Moreover, each of the nine segments had enough epochs to compute correlations between features. The final feature set consisted of the 105 features explained in Section 2.2 and the correlations between each pair of these features as new features. In total, this amounted to 5565 features. The training dataset consisted of each participant’s six randomly selected segments and the test dataset consisted of each participant’s remaining three segments. We also used leave-one-subject-out cross-validation (LOSOCV). Hyperparameter optimization and algorithm selection were performed on the training dataset using 5-fold cross-validation. The same grid search approach as in Section 2.3 was used for hyperparameter optimization.
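The segment construction and the resulting feature count can be sketched as follows. The sketch assumes that the correlation between two features is the Pearson correlation of their per-epoch values within a segment; the text does not name the correlation coefficient, so this choice, like all function names here, is an assumption.

```python
from itertools import combinations
from math import comb, sqrt

def segment_bounds(n_epochs=250, n_segments=9, seg_len=27):
    """Eight segments of 27 epochs; the ninth takes the remaining epochs."""
    bounds = [(k * seg_len, (k + 1) * seg_len) for k in range(n_segments - 1)]
    bounds.append(((n_segments - 1) * seg_len, n_epochs))
    return bounds

def pearson(a, b):
    """Pearson correlation (assumed here); returns 0.0 for constant input."""
    ma, mb = sum(a) / len(a), sum(b) / len(b)
    cov = sum((x - ma) * (y - mb) for x, y in zip(a, b))
    va = sum((x - ma) ** 2 for x in a)
    vb = sum((y - mb) ** 2 for y in b)
    return cov / sqrt(va * vb) if va and vb else 0.0

def segment_record(features):
    """features: dict mapping feature name to its per-epoch values in one segment.

    Returns the segment mean of each feature plus one correlation per feature pair.
    """
    record = {name: sum(v) / len(v) for name, v in features.items()}
    for a, b in combinations(sorted(features), 2):
        record[f"corr({a},{b})"] = pearson(features[a], features[b])
    return record

bounds = segment_bounds()
n_records = 34 * len(bounds)          # participants x segments
n_features = 105 + comb(105, 2)       # base features + pairwise correlations
print(bounds[-1], n_records, n_features)
```

With 105 base features there are 5460 distinct pairs, which together with the base features gives the 5565 features reported above.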

3. Results

3.1. Improvement of Drowsiness Detection with Addition of Sex as a Feature

Based on the grid search, XGBoost with default parameters, along with 50 features selected by the chi2 method, was selected for our drowsiness detection model. The decision was made based on maximizing the accuracy of a model without including sex as a feature.

Table 4 shows the evaluation metrics of the drowsiness detection model with and without sex as a feature. Including sex as an additional feature in the feature set resulted in only an incremental improvement in the evaluation metrics. However, splitting the dataset into two datasets based on the sex information led to a significant improvement in the drowsiness detection evaluation metrics. Table 4 shows that the accuracy of drowsiness detection is 84% for male drivers and 88% for female drivers, which is 3 and 7 percentage points better, respectively, than the classification without information about the driver’s sex. Additionally, precision and recall are higher for both classes (alert and drowsy) in both the male and the female dataset.

3.2. Model for Sex Classification

The best classification model for drivers’ sex classification was XGBoost. The following hyperparameters were selected as the best ones and were the same for both groups (alert and drowsy): eta equals 0.1, gamma equals 0, max_depth equals 10, reg_alpha equals 0, reg_lambda equals 1, and learning_rate equals 0.1 (in XGBoost, eta is an alias of learning_rate). The chi2 method was selected as the feature selection method, and the 40 best-ranked features were selected for our final classification model.

Interestingly, among the selected features, there were no frequency-domain features. There were 36 selected RQA features and four selected correlation features. The same five features were selected for all seven channels/regions: Det, Lam, Det/RR, Lmax, and Vmax. Besides these 35 features, the model also selected the RR feature from the Cz channel, the correlation between Cz Theta and Cz Delta, the correlation between FL Theta and Cz Gamma, the correlation between FR Theta and Cz Gamma, and the correlation between OL Theta and OL Delta. All the selected correlations for the drowsy group were between frequency-domain features. The selected features were distributed almost uniformly across channels/regions, with 5 ± 1 features per channel/region.

In general, the final accuracy of any classification model may vary with a different selection of the test set. Since we randomly selected our test set as described in Section 2.4, we applied the methodology described there five times (as seen in Table 5 and Table 6) to verify the stability and robustness of the classification model.

Table 5 and Table 6 show the accuracy of the classification models for alert and drowsy participants, respectively. Each participant had three randomly selected segments in the test set. The classification accuracy of these segments is marked with blue. If at least two out of three of these segments have a correctly classified sex (accuracy 0.67 or 1.00), the participant is marked as correctly classified in the right, green part of the table. The average classification accuracy was calculated for each participant and each run of the methodology. The average classification accuracy of the segments (marked in blue) was 93% (alert participants) and 92% (drowsy participants), while the average classification accuracy of the participants (marked in green) was 97% (alert participants) and 96% (drowsy participants). The classification accuracy with LOSOCV was 82%.
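The two accuracy levels described above can be illustrated with a small sketch. The participant identifiers and outcomes below are toy values invented for illustration and are not taken from Table 5 or Table 6.

```python
def participant_correct(segment_hits):
    """True if more than half of the participant's test segments are correct."""
    return sum(segment_hits) * 2 > len(segment_hits)

def summarize(run):
    """run: dict mapping participant to a list of per-segment outcomes (booleans).

    Returns (segment-level accuracy, participant-level accuracy).
    """
    n_segments = sum(len(hits) for hits in run.values())
    seg_acc = sum(sum(hits) for hits in run.values()) / n_segments
    part_acc = sum(participant_correct(hits) for hits in run.values()) / len(run)
    return seg_acc, part_acc

# Toy data only: three hypothetical participants with three test segments each.
toy_run = {
    "P01": [True, True, True],
    "P02": [True, False, True],
    "P03": [True, True, False],
}
seg_acc, part_acc = summarize(toy_run)
print(round(seg_acc, 2), round(part_acc, 2))
```

Because one misclassified segment out of three does not change the participant-level decision, participant-level accuracy can exceed segment-level accuracy, as it does in the reported results.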

4. Discussion

Splitting the dataset into two subsets, male and female, resulted in a significant improvement in drowsiness detection (confirmed with the Mann–Whitney U test). After adding sex as a feature to the dataset, drowsiness detection was only incrementally improved, but splitting into two sex-dependent datasets yielded an improvement of 3 and 7 percentage points for male and female drivers, respectively. Precision and recall were higher for alert and drowsy states for both male and female subsets. Interestingly, precision and recall were higher for female drivers than for male drivers. This increase in prediction accuracy, precision, and recall for female drivers suggests differential changes in EEG activity associated with drowsiness compared to male drivers.

Theoretically, approaches using sex as a feature and manual split of the dataset could have the same accuracy. The reason why this is not the case could be the relatively high dimensionality of the dataset and the relatively small number of examples in the dataset. For this reason, the algorithm was unable to optimize the hyperplane to artificially separate the data based on the sex feature.

A recent review of EEG signal features and their application in driver drowsiness detection systems summarized 39 papers, none of which used participant’s sex as a feature [13]. Based on our results, it is reasonable to assume that the model accuracy presented in many of these papers could be further increased by using sex as a feature or by splitting the dataset into male and female subsets.

For these results to be applicable to all datasets, a high-accuracy sex classifier is needed. The average classification accuracy of the segment’s sex (blue markers in Table 5 and Table 6) is 93% and 92% for alert and drowsy drivers, respectively. On average, the classifiers correctly classify 97% of the alert participants and 96% of the drowsy participants (green markers in Table 5 and Table 6). When majority voting is applied to each participant across all five runs, the accuracy of both classifiers is 100%.
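Majority voting across runs can be sketched as follows. The sketch assumes that each run yields one predicted label per participant; the labels shown are toy values, not the study's predictions.

```python
from collections import Counter

def majority_vote(predictions):
    """Most frequent label among the per-run predictions for one participant."""
    return Counter(predictions).most_common(1)[0][0]

# Toy data only: predicted sex of one hypothetical participant in five runs.
runs = ["F", "F", "M", "F", "F"]
voted = majority_vote(runs)
print(voted)
```

With an odd number of runs and two labels a tie cannot occur, so a participant misclassified in one or two of the five runs is still assigned the correct label.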

Other studies reporting high accuracy of EEG-based sex classification models are those of Kaur et al. [38] and Kaushik et al. [39], with classification accuracies of 96.7% and 97.5%, respectively. The experimental design was the same in both papers: participants were measured in a relaxed resting position with their eyes closed. In both works, the discrete wavelet transform was used to obtain the frequency-domain features, and the final classifiers were based only on these features. In comparison, our participants were measured while driving, which is a complex mental activity. Kaur et al. achieved their accuracy using LOSOCV on 60 participants. Their high accuracy suggests that a larger number of participants could bring our LOSOCV accuracy closer to the accuracy obtained with the segment-based split.

Our previous work showed that frequency-domain features and RQA features differ significantly between alert male and female drivers [26]. Similar statistical results were found when brain activity was analyzed between male and female drowsy drivers. Female drowsy drivers showed significantly higher relative beta power in all regions and significantly higher relative alpha power in all regions except the Cz electrode. Male drowsy drivers showed significantly higher relative delta power in all regions and relative theta power in the occipital region. On the other hand, relative gamma power showed a different pattern between male and female drowsy drivers. In female drowsy drivers, relative gamma power was significantly higher in the occipital region, whereas in male drowsy drivers, relative gamma power was higher in the frontal region. Furthermore, the results of this paper showed that the features in the frequency domain are more correlated in males than in females during both alert and drowsy sessions.

In the current work, Figure 2 shows the difference between the average feature value for all male and all female drowsy drivers for two RQA features with the smallest p-values (Vmax in the OR region and Lmax in the Pz channel) and two frequency-domain features (beta in the OR region and the OL region). Figure 3 shows a high correlation (0.79) between the Det feature from the FR region and the Lam feature from the Cz channel for male drowsy drivers. The right part of Figure 3 shows the same features for female drowsy drivers, which are only weakly correlated (0.37). These differences in the correlation of the features are the main reason for their use as features.

It should be noted that, although our dataset consisted of frequency-domain features, our feature selection method filtered out all the frequency-domain features for sex classification. The final feature sets for the sex classification model thus consisted of only RQA features and correlation features. Since RQA features discriminate male and female drivers better than frequency-domain features in our work, it is reasonable to assume that the studies (Kaur et al. [38] and Kaushik et al. [39]) reporting high accuracy of the sex classification models would report even higher results with the inclusion of RQA features, but further research should be conducted to confirm this assumption.

Our system is based on 10 s epochs with 5 s overlaps, so an application in a real-world scenario could make a decision every 5 s. The classifiers make their decisions in less than a second, which means that if the epochs had a step of one second instead of five, the system would still be able to make decisions on time.
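A streaming version of this epoching can be sketched with a fixed-size buffer. The generator below is a hypothetical illustration of the decision cadence and is not the authors' implementation; the toy sampling rate of 2 Hz is used only to keep the example small.

```python
from collections import deque

def stream_epochs(samples, fs, epoch_s=10, step_s=5):
    """Yield the latest full epoch every step_s seconds of incoming samples."""
    size, step = epoch_s * fs, step_s * fs
    buf = deque(maxlen=size)      # keeps only the most recent epoch_s seconds
    since_last = 0                # samples received since the last emitted epoch
    for s in samples:
        buf.append(s)
        since_last += 1
        if len(buf) == size and since_last >= step:
            since_last = 0
            yield list(buf)

# Toy stream: 30 s of samples at an assumed rate of 2 Hz.
epochs_5s = list(stream_epochs(range(60), fs=2))
epochs_1s = list(stream_epochs(range(60), fs=2, step_s=1))
print(len(epochs_5s), len(epochs_1s))
```

Each yielded epoch would be passed to feature extraction and the classifier. Reducing the step from five seconds to one increases the number of decisions without changing the epoch length.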

There are also some drawbacks to this work. One is the relatively small dataset, with 34 participants in 68 recording sessions. This results in a lower LOSOCV accuracy (82%), and an increase in the number of participants would be expected to increase it. Another drawback is the exclusive use of binary drowsiness classification. In real-world applications, at least three levels of drowsiness are usually targeted [39,40].

For future work, the number of participants and the number of features considered should be increased. For example, an interesting point would be to observe driving performance data (e.g., line crossings, distance from an ideal position on the road) based on participants’ sex and EEG features. Since our data suggest functional differences between males and females during drowsy driving, the next step would be to investigate whether these differences are related to driving performance. Investigating the relation between driving performance and EEG features could provide insight into how people drive and explain potential differences. The underlying mechanisms related to driving would therefore provide a more accurate model for driving safety systems. Another interesting topic for future work is to investigate the influence of driver’s sex as a feature on deep learning models that have lately been used extensively and are showing promising results [40]. In addition, combining our findings (sex as a feature and RQA features as good drowsiness predictors) with decreasing the number of electrodes used [41] could lead to a reliable system that is also easy to implement in practice. Such a system’s accuracy could also benefit from the usage of blink-related features derived from EEG [42].

5. Conclusions

This research has shown that including the information about driver’s sex increases the accuracy of drowsiness detection. Furthermore, a reliable sex classifier based on EEG signals was developed. Although it is hard to implement exactly the same system in a real-time environment, due to the high number of electrodes, these important findings may benefit all other systems that are less intrusive simply by including sex as a feature in the existing systems.

Drowsiness detection models for drivers are usually based on EEG features without sex as a feature. After adding sex as an additional feature in the dataset, only incremental improvements in drowsiness detection accuracy were achieved. With the further step of manually splitting the dataset into male and female subsets, the drowsiness detection model accuracy increased by 3 and 7 percentage points for male and female datasets, respectively. We consider these results relevant to a variety of drowsiness detection systems currently being developed in the field.

The sex classification model based on EEG features achieved high accuracy. All participants were correctly classified after applying majority voting to the results of all five runs. Correlations between features used as features scored high on the feature selection list, suggesting that correlations between features from different brain regions/channels should be used more frequently as features.

Author Contributions

Conceptualization, I.S., M.Z.Z., K.F., M.C. and A.J.; Methodology, I.S., M.Z.Z., K.F. and A.J.; Software, I.S.; Validation, I.S., M.Z.Z. and A.J.; Formal Analysis, M.C. and A.J.; Investigation, I.S., M.Z.Z. and K.F.; Resources, M.C. and A.J.; Data Curation, I.S., M.Z.Z. and K.F.; Writing—Original Draft Preparation, I.S. and M.Z.Z.; Writing—Review and Editing, I.S., M.Z.Z., K.F., M.C. and A.J.; Visualization, I.S.; Supervision, M.C. and A.J.; Project Administration, M.C. and A.J.; Funding Acquisition, M.C. and A.J. All authors have read and agreed to the published version of the manuscript.

Funding

This work has been carried out within the project “Research and development of the system for driver drowsiness and distraction identification—DFDM” (KK.01.2.1.01.0136), funded by the European Regional Development Fund in the Republic of Croatia under the Operational Programme Competitiveness and Cohesion 2014–2020.

Institutional Review Board Statement

The study was conducted according to the guidelines of the Declaration of Helsinki, and approved by the Ethics Committee of the University of Zagreb, Faculty of Electrical Engineering and Computing (on 26 September 2020).

Informed Consent Statement

Informed consent was obtained from all subjects involved in the study.

Data Availability Statement

Restrictions apply to the availability of these data. The data used in this work were obtained from Xylon d.o.o., Croatia, to which any request for data should be addressed ([email protected]).

Conflicts of Interest

The authors declare no conflict of interest.

References

Ledger, S.; Bennett, J.M.; Chekaluk, E.; Batchelor, J.; Di Meco, A. Cognitive function and driving in middle adulthood: Does age matter? Transp. Res. Part F Traffic Psychol. Behav. 2019, 66, 471–484.
Chuang, C.-H.; Cao, Z.; King, J.-T.; Wu, B.-S.; Wang, Y.; Lin, C.-T. Brain Electrodynamic and Hemodynamic Signatures against Fatigue during Driving. Front. Neurosci. 2018, 12, 181.
National Highway Traffic Safety Administration. Drowsy Driving 2015; National Highway Traffic Safety Administration: Washington, DC, USA, 2017.
Owens, J.M.; Dingus, T.A.; Guo, F.; Fang, Y.; Perez, M.; McClafferty, J.; Tefft, B. Prevalence of Drowsy Driving Crashes: Estimates from a Large-Scale Naturalistic Driving Study; Foundation for Traffic Safety: Washington, DC, USA, 2018.
European Commission. Commission Delegated Regulation (EU) 2021/1341; Official Journal of the European Union; European Commission: Brussels, Belgium, 2021.
Albadawi, Y.; Takruri, M.; Awad, M. A Review of Recent Developments in Driver Drowsiness Detection Systems. Sensors 2022, 22, 2069.
Chowdhury, A.; Shankaran, R.; Kavakli, M.; Haque, M. Sensor Applications and Physiological Features in Drivers’ Drowsiness Detection: A Review. IEEE Sens. J. 2018, 18, 3055–3067.
Burlacu, A.; Brinza, C.; Brezulianu, A.; Covic, A. Accurate and early detection of sleepiness, fatigue and stress levels in drivers through Heart Rate Variability parameters: A systematic review. Rev. Cardiovasc. Med. 2021, 22, 845–852.
Siddiqui, H.; Saleem, A.; Brown, R.; Bademci, B.; Lee, E.; Rustam, F.; Dudley, S. Non-Invasive Driver Drowsiness Detection System. Sensors 2021, 21, 4833.
Gielen, J.; Aerts, J.-M. Feature Extraction and Evaluation for Driver Drowsiness Detection Based on Thermoregulation. Appl. Sci. 2019, 9, 3555.
Tian, Y.; Cao, J. Fatigue driving detection based on electrooculography: A review. EURASIP J. Image Video Process. 2021, 2021, 1–17.
Johnson, R.R.; Popovic, D.P.; Olmstead, R.E.; Stikic, M.; Levendowski, D.J.; Berka, C. Drowsiness/alertness algorithm development and validation using synchronized EEG and cognitive performance to individualize a generalized model. Biol. Psychol. 2011, 87, 241–250.
Stancin, I.; Cifrek, M.; Jovic, A. A Review of EEG Signal Features and Their Application in Driver Drowsiness Detection Systems. Sensors 2021, 21, 3786.
Tran, Y.; Craig, A.; Craig, R.; Chai, R.; Nguyen, H. The influence of mental fatigue on brain activity: Evidence from a systematic review with meta-analyses. Psychophysiology 2020, 57, e13554.
Lal, S.K.; Craig, A. A critical review of the psychophysiology of driver fatigue. Biol. Psychol. 2001, 55, 173–194.
Stancin, I.; Frid, N.; Cifrek, M.; Jovic, A. EEG Signal Multichannel Frequency-Domain Ratio Indices for Drowsiness Detection Based on Multicriteria Optimization. Sensors 2021, 21, 6932.
Soares, S.; Monteiro, T.; Lobo, A.; Couto, A.; Cunha, L.; Ferreira, S. Analyzing Driver Drowsiness: From Causes to Effects. Sustainability 2020, 12, 1971.
Tonetti, L.; Fabbri, M.; Natale, V. Sex Difference in Sleep-Time Preference and Sleep Need: A Cross-Sectional Survey among Italian Pre-Adolescents, Adolescents, and Adults. Chronobiol. Int. 2008, 25, 745–759.
Corsi-Cabrera, M.; Sánchez, A.I.; Del-Río-Portilla, Y.; Villanueva, Y.; Pérez-Garci, E. Effect of 38 h of total sleep deprivation on the waking EEG in women: Sex differences. Int. J. Psychophysiol. 2003, 50, 213–224.
Borghini, G.; Astolfi, L.; Vecchiato, G.; Mattia, D.; Babiloni, F. Measuring neurophysiological signals in aircraft pilots and car drivers for the assessment of mental workload, fatigue and drowsiness. Neurosci. Biobehav. Rev. 2012, 44, 58–75.
Chen, L.-L.; Zhao, Y.; Zhang, J.; Zou, J.-Z. Automatic detection of alertness/drowsiness from physiological signals using wavelet-based nonlinear features and machine learning. Expert Syst. Appl. 2015, 42, 7344–7355.
Cai, Q.; Gao, Z.-K.; Yang, Y.-X.; Dang, W.-D.; Grebogi, C. Multiplex Limited Penetrable Horizontal Visibility Graph from EEG Signals for Driver Fatigue Detection. Int. J. Neural Syst. 2019, 29, 1850057.
Min, J.; Wang, P.; Hu, J. Driver fatigue detection through multiple entropy fusion analysis in an EEG-based system. PLoS ONE 2017, 12, e0188756.
Nguyen, T.; Ahn, S.; Jang, H.; Jun, S.C.; Kim, J.G. Utilization of a combined EEG/NIRS system to predict driver drowsiness. Sci. Rep. 2017, 7, 43933.
Lin, C.-T.; King, J.-T.; Chuang, C.-H.; Ding, W.; Chuang, W.-Y.; Liao, L.-D.; Wang, Y. Exploring the Brain Responses to Driving Fatigue through Simultaneous EEG and fNIRS Measurements. Int. J. Neural Syst. 2019, 30, 1950018.
Stancin, I.; Friganovic, K.; Zeba, M.Z.; Jovic, A.; Cifrek, M. Gender Differences in EEG Features While Driving. In Proceedings of the 2nd International Conference on Advances in Signal Processing and Artificial Intelligence, Berlin, Germany, 30 June–2 July 2020; pp. 125–130.
Chen, J.; Wang, H.; Wang, Q.; Hua, C. Exploring the fatigue affecting electroencephalography based functional brain networks during real driving in young males. Neuropsychologia 2019, 129, 200–211.
Chen, J.; Wang, H.; Hua, C. Electroencephalography based fatigue detection using a novel feature fusion and extreme learning machine. Cogn. Syst. Res. 2018, 52, 715–728.
Hu, J.; Min, J. Automated detection of driver fatigue based on EEG signals using gradient boosting decision tree model. Cogn. Neurodynamics 2018, 12, 431–440.
Dimitrakopoulos, G.N.; Kakkos, I.; Dai, Z.; Wang, H.; Sgarbas, K.; Thakor, N.; Bezerianos, A.; Sun, Y. Functional Connectivity Analysis of Mental Fatigue Reveals Different Network Topological Alterations between Driving and Vigilance Tasks. IEEE Trans. Neural Syst. Rehabil. Eng. 2018, 26, 740–749.
Makeig, S.; Bell, A.J.; Jung, T.-P.; Sejnowski, T.J. Independent Component Analysis of Electroencephalographic Data. In Proceedings of the 8th International Conference on Neural Information Processing Systems, Denver, CO, USA, 27–30 November 1995; pp. 145–151.
Bell, A.J.; Sejnowski, T.J. An Information-Maximization Approach to Blind Separation and Blind Deconvolution. Neural Comput. 1995, 7, 1129–1159.
Thomson, D.J. Spectrum estimation and harmonic analysis. Proc. IEEE 1982, 70, 1055–1096.
Shabani, H.; Mikaili, M.; Noori, S.M.R. Assessment of recurrence quantification analysis (RQA) of EEG for development of a novel drowsiness detection system. Biomed. Eng. Lett. 2016, 6, 196–204.
Webber, C.; Zbilut, J. Recurrence Quantification Analysis of Nonlinear Dynamical Systems. In Tutorials in Contemporary Nonlinear Methods for the Behavioral Sciences; National Science Foundation: Alexandria, VA, USA, 2005.
Acharya, U.R.; Sree, S.V.; Chattopadhyay, S.; Yu, W.; Ang, P.C.A. Application of recurrence quantification analysis for the automated identification of epileptic EEG signals. Int. J. Neural Syst. 2011, 21, 199–211.
Eckmann, J.-P.; Kamphorst, S.O.; Ruelle, D. Recurrence Plots of Dynamical Systems. Eur. Lett. 1987, 4, 973–977.
Kaur, B.; Singh, D.; Roy, P.P. Age and gender classification using brain–computer interface. Neural Comput. Appl. 2018, 31, 5887–5900.
Kaushik, P.; Gupta, A.; Roy, P.P.; Dogra, D.P. EEG-Based Age and Gender Prediction Using Deep BLSTM-LSTM Network Model. IEEE Sens. J. 2018, 19, 2634–2641.
Chaabene, S.; Bouaziz, B.; Boudaya, A.; Hökelmann, A.; Ammar, A.; Chaari, L. Convolutional Neural Network for Drowsiness Detection Using EEG Signals. Sensors 2021, 21, 1734.
Ko, L.-W.; Komarov, O.; Lai, W.-K.; Liang, W.-G.; Jung, T.-P. Eyeblink recognition improves fatigue prediction from single-channel forehead EEG in a realistic sustained attention task. J. Neural Eng. 2020, 17, 036015.
Shahbakhti, M.; Beiramvand, M.; Rejer, I.; Augustyniak, P.; Broniec-Wojcik, A.; Wierzchon, M.; Marozas, V. Simultaneous Eye Blink Characterization and Elimination from Low-Channel Prefrontal EEG Signals Enhances Driver Drowsiness Detection. IEEE J. Biomed. Health Inform. 2021, 26, 1001–1012.

Figure 1. Illustration of segment averaging.

Figure 2. The difference between the average features of all male and all female drowsy drivers: Vmax from the OR region, Lmax from the Pz channel, relative beta power from the OR region, and relative beta power from the OL region. The Y-axis shows the values of observed features. The bold line is the mean of the feature, and the filled area represents the standard deviation.

Figure 3. Left: highly correlated (0.79) determinism (Det) from the FR region and laminarity (Lam) from the Cz channel for male drivers; right: weakly correlated (0.37) determinism (Det) from the FR region and laminarity (Lam) from the Cz channel for female drivers. The Y-axis shows the values of the observed features. The bold line is the mean of the feature, and the filled area represents the standard deviation.

Table 1. The number of participants in each class.

	Male	Female
Alertness	17	17
Drowsiness	17	17

Table 2. Channels in brain regions.

Region	Channels
Front left (FL)	F7, F3, FC5, FC1, T7
Front right (FR)	F8, F4, FC6, FC2, T8
Occipital left (OL)	O1, P7, P3, CP5, CP1
Occipital right (OR)	O2, P4, P8, CP6, CP2

Table 3. Hyperparameters optimized with a grid search and their candidate values. Naïve Bayes had no hyperparameters to optimize.

Algorithm	Hyperparameter	Candidate Values
XGBoost	eta	{0.1, 0.3, 0.4, 0.9}
XGBoost	gamma	{0, 1, 5}
XGBoost	learning_rate	{0.05, 0.1, 0.5, 1}
XGBoost	reg_alpha	{0, 0.5}
XGBoost	reg_lambda	{0.5, 1}
Random Forest	n_estimators	{30, 100, 200, 500}
Random Forest	min_samples_split	{2, 4, 6}
Support Vector Machine	C	{0.5, 1, 10}
Support Vector Machine	kernel	{linear, rbf}
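To give a sense of the size of the search described in Table 3, the following is a minimal sketch that expands each grid into its full set of candidate configurations. It uses only the Python standard library and does not train any model; the helper name `expand_grid` is hypothetical, the Random Forest parameter is written with scikit-learn's spelling `min_samples_split`, and every entry is treated as a separate grid dimension exactly as the table lists it.

```python
from itertools import product

# Candidate values copied from Table 3 (Naive Bayes has nothing to tune).
GRIDS = {
    "XGBoost": {
        "eta": [0.1, 0.3, 0.4, 0.9],
        "gamma": [0, 1, 5],
        "learning_rate": [0.05, 0.1, 0.5, 1],
        "reg_alpha": [0, 0.5],
        "reg_lambda": [0.5, 1],
    },
    "Random Forest": {
        "n_estimators": [30, 100, 200, 500],
        "min_samples_split": [2, 4, 6],
    },
    "Support Vector Machine": {
        "C": [0.5, 1, 10],
        "kernel": ["linear", "rbf"],
    },
}


def expand_grid(grid):
    """Return every combination of a {name: values} grid as a list of dicts."""
    names = list(grid)
    value_lists = [grid[name] for name in names]
    return [dict(zip(names, combo)) for combo in product(*value_lists)]


# Number of configurations a full grid search would have to evaluate.
grid_sizes = {name: len(expand_grid(grid)) for name, grid in GRIDS.items()}

for name, size in grid_sizes.items():
    print(f"{name}: {size} candidate configurations")
```

Each dictionary returned by `expand_grid` could be passed as keyword arguments to the corresponding classifier inside a cross-validation loop; how the authors actually ran the search is not stated in this section.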

Table 4. Results of the drowsiness classification model.

	Without Sex Information
	Precision	Recall	Accuracy
Alertness	0.81	0.82	0.81
Drowsiness	0.82	0.80	0.81
	With Sex Information
	Precision	Recall	Accuracy
Alertness	0.81	0.83	0.82
Drowsiness	0.82	0.81	0.82
	Only Male Drivers
	Precision	Recall	Accuracy
Alertness	0.82	0.86	0.84
Drowsiness	0.86	0.82	0.84
	Only Female Drivers
	Precision	Recall	Accuracy
Alertness	0.87	0.89	0.88
Drowsiness	0.89	0.87	0.88
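The figures in Table 4 can be cross-checked with a short calculation. The sketch below assumes that the alert and drowsy classes contain the same number of samples (Table 1 lists equal numbers of sessions, but the per-class segment counts are an assumption here), in which case overall accuracy equals the mean of the two recalls. It also expresses each model's accuracy as a gain in percentage points over the model without sex information. The variable names are illustrative only.

```python
# (alert recall, drowsy recall, reported accuracy) copied from Table 4.
RESULTS = {
    "without sex information": (0.82, 0.80, 0.81),
    "with sex information": (0.83, 0.81, 0.82),
    "only male drivers": (0.86, 0.82, 0.84),
    "only female drivers": (0.89, 0.87, 0.88),
}


def mean_recall(alert_recall, drowsy_recall):
    """Accuracy implied by the two recalls when both classes are equally sized."""
    return (alert_recall + drowsy_recall) / 2


baseline_accuracy = RESULTS["without sex information"][2]

summary = {}
for name, (alert_recall, drowsy_recall, accuracy) in RESULTS.items():
    # Allow for the two-decimal rounding used in the table.
    consistent = abs(mean_recall(alert_recall, drowsy_recall) - accuracy) < 0.005
    gain_points = round((accuracy - baseline_accuracy) * 100)
    summary[name] = (consistent, gain_points)
    print(f"{name}: consistent={consistent}, gain={gain_points} percentage points")
```

Under that assumption all four reported accuracies agree with their recalls, and the gains come out as 1, 3, and 7 percentage points for the three models that use sex information.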

Table 5. Classification accuracies on the test data for alert participants.

Participant	Accuracy						Target Class	Correctly Classified
Participant	Run1	Run2	Run3	Run4	Run5	Avg	Target Class	Run1	Run2	Run3	Run4	Run5	Avg
1	1.00	1.00	1.00	1.00	1.00	1.00	F	1	1	1	1	1	1
2	1.00	1.00	1.00	1.00	1.00	1.00	F	1	1	1	1	1	1
3	1.00	1.00	1.00	1.00	1.00	1.00	F	1	1	1	1	1	1
4	0.67	1.00	1.00	0.67	1.00	0.87	M	1	1	1	1	1	1
5	1.00	1.00	1.00	1.00	1.00	1.00	M	1	1	1	1	1	1
6	1.00	1.00	1.00	1.00	1.00	1.00	F	1	1	1	1	1	1
7	0.67	1.00	1.00	1.00	1.00	0.93	M	1	1	1	1	1	1
8	1.00	1.00	1.00	1.00	1.00	1.00	M	1	1	1	1	1	1
9	1.00	1.00	1.00	1.00	1.00	1.00	M	1	1	1	1	1	1
10	1.00	1.00	1.00	1.00	1.00	1.00	F	1	1	1	1	1	1
11	1.00	1.00	1.00	1.00	1.00	1.00	M	1	1	1	1	1	1
12	1.00	1.00	1.00	1.00	1.00	1.00	F	1	1	1	1	1	1
13	1.00	1.00	1.00	1.00	1.00	1.00	M	1	1	1	1	1	1
14	1.00	1.00	0.67	1.00	1.00	0.93	M	1	1	1	1	1	1
15	1.00	1.00	1.00	1.00	0.67	0.93	M	1	1	1	1	1	1
16	1.00	1.00	1.00	1.00	1.00	1.00	F	1	1	1	1	1	1
17	1.00	1.00	0.67	0.67	0.67	0.80	F	1	1	1	1	1	1
18	1.00	1.00	0.67	0.67	1.00	0.87	M	1	1	1	1	1	1
19	1.00	1.00	1.00	0.67	1.00	0.93	F	1	1	1	1	1	1
20	1.00	1.00	1.00	1.00	1.00	1.00	M	1	1	1	1	1	1
21	1.00	1.00	1.00	1.00	1.00	1.00	F	1	1	1	1	1	1
22	1.00	1.00	1.00	1.00	1.00	1.00	F	1	1	1	1	1	1
23	1.00	1.00	1.00	1.00	1.00	1.00	F	1	1	1	1	1	1
24	1.00	1.00	1.00	1.00	1.00	1.00	F	1	1	1	1	1	1
25	1.00	1.00	1.00	1.00	1.00	1.00	M	1	1	1	1	1	1
26	0.67	0.67	1.00	1.00	0.67	0.80	M	1	1	1	1	1	1
27	1.00	1.00	1.00	1.00	1.00	1.00	M	1	1	1	1	1	1
28	1.00	0.67	0.67	1.00	1.00	0.87	M	1	1	1	1	1	1
29	0.33	0.67	0.67	1.00	0.67	0.67	M	0	1	1	1	1	0.8
30	0.33	0.33	1.00	0.67	0.67	0.60	F	0	0	1	1	1	0.6
31	1.00	0.67	0.67	0.67	0.67	0.73	F	1	1	1	1	1	1
32	1.00	1.00	0.33	0.33	1.00	0.73	F	1	1	0	0	1	0.6
33	1.00	1.00	1.00	1.00	1.00	1.00	M	1	1	1	1	1	1
34	1.00	1.00	1.00	1.00	1.00	1.00	F	1	1	1	1	1	1
Average:	0.93	0.94	0.92	0.92	0.94	0.93	Sum:	32	33	33	33	34	33
							Accuracy:	0.94	0.97	0.97	0.97	1.00	0.97
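The relationship between the two halves of Table 5 can be illustrated with a small sketch. It assumes, as the table values suggest but this section does not state explicitly, that a participant counts as correctly classified in a run when more than half of that participant's test segments are classified correctly, and that the final decision is a majority vote over the five runs. The function names are hypothetical, and only the three participants with at least one misclassified run are included.

```python
# Per-run segment-level accuracies for three participants, copied from Table 5.
SEGMENT_ACCURACY = {
    29: [0.33, 0.67, 0.67, 1.00, 0.67],
    30: [0.33, 0.33, 1.00, 0.67, 0.67],
    32: [1.00, 1.00, 0.33, 0.33, 1.00],
}


def run_is_correct(segment_accuracy):
    """Assumed rule: a run is correct when most segments are classified correctly."""
    return segment_accuracy > 0.5


def majority_vote(run_flags):
    """The final decision is correct when more than half of the runs are correct."""
    return sum(run_flags) > len(run_flags) / 2


final = {}
for participant, accuracies in SEGMENT_ACCURACY.items():
    flags = [run_is_correct(a) for a in accuracies]
    share_correct = sum(flags) / len(flags)  # the "Avg" column on the right
    final[participant] = (share_correct, majority_vote(flags))
    print(participant, flags, share_correct, final[participant][1])
```

The shares reproduce the right-hand "Avg" column for these participants (0.8, 0.6, 0.6), and each of them is still classified correctly once the five runs are combined by majority vote.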

Table 6. Classification accuracies on the test data for drowsy participants.

Participant	Accuracy						Target Class	Correctly Classified
Participant	Run1	Run2	Run3	Run4	Run5	Avg	Target Class	Run1	Run2	Run3	Run4	Run5	Avg
1	1.00	1.00	1.00	1.00	1.00	1.00	F	1	1	1	1	1	1
2	0.67	1.00	0.33	1.00	0.67	0.73	F	1	1	0	1	1	0.8
3	1.00	1.00	0.67	1.00	1.00	0.93	F	1	1	1	1	1	1
4	1.00	1.00	1.00	1.00	1.00	1.00	M	1	1	1	1	1	1
5	1.00	0.67	1.00	0.67	0.67	0.80	M	1	1	1	1	1	1
6	1.00	1.00	1.00	1.00	0.67	0.93	M	1	1	1	1	1	1
7	1.00	0.67	1.00	1.00	0.67	0.87	F	1	1	1	1	1	1
8	1.00	1.00	1.00	1.00	1.00	1.00	M	1	1	1	1	1	1
9	1.00	1.00	0.67	1.00	1.00	0.93	F	1	1	1	1	1	1
10	0.67	1.00	1.00	0.67	0.67	0.80	M	1	1	1	1	1	1
11	0.33	1.00	0.33	1.00	0.67	0.67	M	0	1	0	1	1	0.6
12	1.00	1.00	0.67	1.00	0.67	0.87	M	1	1	1	1	1	1
13	1.00	1.00	1.00	1.00	1.00	1.00	M	1	1	1	1	1	1
14	1.00	1.00	1.00	0.67	1.00	0.93	F	1	1	1	1	1	1
15	0.67	1.00	1.00	0.67	0.67	0.80	M	1	1	1	1	1	1
16	1.00	1.00	1.00	1.00	1.00	1.00	F	1	1	1	1	1	1
17	1.00	1.00	1.00	1.00	1.00	1.00	M	1	1	1	1	1	1
18	1.00	1.00	1.00	1.00	1.00	1.00	F	1	1	1	1	1	1
19	1.00	1.00	1.00	1.00	1.00	1.00	F	1	1	1	1	1	1
20	0.67	0.67	0.67	1.00	0.33	0.67	F	1	1	1	1	0	0.8
21	1.00	1.00	1.00	1.00	1.00	1.00	F	1	1	1	1	1	1
22	1.00	1.00	1.00	1.00	1.00	1.00	F	1	1	1	1	1	1
23	1.00	1.00	1.00	1.00	1.00	1.00	M	1	1	1	1	1	1
24	1.00	1.00	1.00	1.00	1.00	1.00	M	1	1	1	1	1	1
25	0.67	0.67	1.00	0.33	1.00	0.73	M	1	1	1	0	1	0.8
26	1.00	1.00	1.00	1.00	1.00	1.00	F	1	1	1	1	1	1
27	1.00	1.00	1.00	1.00	1.00	1.00	M	1	1	1	1	1	1
28	1.00	1.00	1.00	1.00	1.00	1.00	M	1	1	1	1	1	1
29	1.00	1.00	1.00	1.00	1.00	1.00	M	1	1	1	1	1	1
30	1.00	0.67	0.33	1.00	0.67	0.73	F	1	1	0	1	1	0.8
31	1.00	0.67	1.00	1.00	1.00	0.93	F	1	1	1	1	1	1
32	1.00	1.00	1.00	1.00	1.00	1.00	M	1	1	1	1	1	1
33	1.00	1.00	1.00	1.00	1.00	1.00	F	1	1	1	1	1	1
34	1.00	1.00	1.00	1.00	1.00	1.00	F	1	1	1	1	1	1
Average:	0.93	0.94	0.90	0.94	0.89	0.92	Sum:	33	34	31	33	33	32.8
							Accuracy:	0.97	1.00	0.91	0.97	0.97	0.96
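The bottom two rows of Table 6 follow from simple arithmetic, sketched below: the per-run accuracy is the number of correctly classified participants divided by the 34 participants, and the average accuracy uses the mean count over the five runs. The variable names are illustrative only.

```python
N_PARTICIPANTS = 34

# "Sum" row of Table 6: correctly classified drowsy participants in each run.
CORRECT_PER_RUN = [33, 34, 31, 33, 33]

# Participant-level accuracy of each run, rounded as in the table.
run_accuracy = [round(count / N_PARTICIPANTS, 2) for count in CORRECT_PER_RUN]

# Mean number of correctly classified participants and the matching accuracy.
mean_correct = sum(CORRECT_PER_RUN) / len(CORRECT_PER_RUN)
mean_accuracy = round(mean_correct / N_PARTICIPANTS, 2)

print(run_accuracy, mean_correct, mean_accuracy)
```

The same calculation applied to the "Sum" row of Table 5 gives that table's accuracy row.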

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
