Next Article in Journal
Prolonged Zaleplon Treatment Increases the Expression of Proteins Involved in GABAergic and Glutamatergic Signaling in the Rat Hippocampus
Next Article in Special Issue
Limitations in Evaluating Machine Learning Models for Imbalanced Binary Outcome Classification in Spine Surgery: A Systematic Review
Previous Article in Journal
Exercise Modifies the Brain Metabolic Response to Chronic Cocaine Exposure Inhibiting the Stria Terminalis
Previous Article in Special Issue
Automatic Diagnosis of Major Depressive Disorder Using a High- and Low-Frequency Feature Fusion Framework
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Distinguishing Laparoscopic Surgery Experts from Novices Using EEG Topographic Features

1
School of Engineering, University of Lincoln, Lincoln LN6 7TS, UK
2
Centre for Modeling, Simulation, and Imaging in Medicine, Rensselaer Polytechnic Institute, Troy, MI 12180, USA
3
Department of Industrial and Systems Engineering, University at Buffalo, Buffalo, NY 14260, USA
4
Department of Biomedical Engineering, Rensselaer Polytechnic Institute, Troy, MI 12180, USA
5
School of Medicine and Biomedical Sciences, University at Buffalo, Buffalo, NY 14203, USA
6
College of Engineering, Florida A&M University-Florida State University, Tallahassee, FL 32310, USA
*
Author to whom correspondence should be addressed.
Brain Sci. 2023, 13(12), 1706; https://doi.org/10.3390/brainsci13121706
Submission received: 2 November 2023 / Revised: 27 November 2023 / Accepted: 1 December 2023 / Published: 11 December 2023
(This article belongs to the Special Issue Advances of AI in Neuroimaging)

Abstract

:
The study aimed to differentiate experts from novices in laparoscopic surgery tasks using electroencephalogram (EEG) topographic features. A microstate-based common spatial pattern (CSP) analysis with linear discriminant analysis (LDA) was compared to a topography-preserving convolutional neural network (CNN) approach. Expert surgeons (N = 10) and novice medical residents (N = 13) performed laparoscopic suturing tasks, and EEG data from 8 experts and 13 novices were analysed. Microstate-based CSP with LDA revealed distinct spatial patterns in the frontal and parietal cortices for experts, while novices showed frontal cortex involvement. The 3D CNN model (ESNet) demonstrated a superior classification performance (accuracy > 98%, sensitivity 99.30%, specificity 99.70%, F1 score 98.51%, MCC 97.56%) compared to the microstate based CSP analysis with LDA (accuracy ~90%). Combining spatial and temporal information in the 3D CNN model enhanced classifier accuracy and highlighted the importance of the parietal–temporal–occipital association region in differentiating experts and novices.

1. Introduction

Laparoscopic surgery training is a comprehensive module under the Fundamentals of Laparoscopic Surgery (FLS) curriculum, aimed at equipping medical professionals, scientists, and doctors with basic surgical skills required for successful laparoscopic procedures. FLS is a joint program by the American Gastrointestinal and Surgery Association and the American Academy of Surgery for general surgery [1]. FLS certification involves five psychomotor tasks of increasing complexity: pegboard transfer, pattern cutting, placement of a ligating loop, suturing with extracorporeal knot tying, and suturing with intracorporeal knot tying. This training focuses on cognitive and psychological abilities essential for minimally invasive surgery and serves as a standardised assessment of physicians’ capabilities where brain correlates are important to robustly identify expertise [2].
To evaluate brain correlates of FLS skills, it is proposed that the brain forms a cognitive-perceptual mental model [3,4] during the initial stages of the skill acquisition in a novel laparoscopy environment [5,6,7,8]. Here, Fitts and Posner proposed a three-stage model for motor skill acquisition, comprising the cognitive stage, the associative stage, and the autonomous stage [9]. During these stages for motor skill acquisition, the brain–behaviour relationship can be explored based on portable brain imaging. Brain circuit mechanisms, driven by motor skill proficiency, involve selective attention and cortical alterations in motor planning and execution [6]. For example, specific brain mechanisms [10] subserved by motor skill proficiency may represent dissociable selective attention or local excitability alterations in the cortex during motor planning and execution that are postulated to be driven by a supplementary motor area, premotor cortex, and cerebellum [11]—all communicating via the thalamus [12] and the corticothalamic loops [13,14]. In this study, we hypothesised that these semi-stable brain states involving selective attention and cortical alterations in motor planning and execution can be estimated based on the topography of electroencephalogram (EEG). The majority of contemporary brain–computer interfaces utilizing EEG rely on machine learning algorithms [15] and a wide array of classifier types is employed within this domain including facial expressions as control commands [16].
In this study, it is postulated that while experts will already have a cognitive perceptual model for FLS task performance based on their prior experience, the novices will start building the cognitive perceptual model on their first exposure to the FLS task [3,4]. Semi-stable brain states have been analysed in prior studies based on microstates [17], which can be estimated based on the scalp potential field or EEG topography [18], and are differentially modulated by the vigilance level [19]. Microstates approach to analyse brain states has an a priori assumption that only one spatial map accurately defines the brain’s global state at a given time and that the residuals are considered noise [18]. This microstate-based analysis has been applied to error-based learning using EEG in conjunction with functional near-infrared spectroscopy during a complex surgical motor task [20]. Based on related prior studies [14,21], we postulated in this study that the EEG topography during FLS suturing with intracorporal knot tying task will differ between experts and novices. First, we applied conventional common spatial pattern (CSP) approach, one of the most common methods for feature extraction in brain–computer interfaces [22], to classify two skill levels, experts and novices. We improved the traditional CSP method that is known to suffer from noise sensitivity due to the L2 norm in its optimisation problem to find a spatial filter [23]. We developed a microstate-based CSP approach that performed better due to a metacriterion that favours the highest signal-to-noise ratio [24]. We presented these preliminary results using microstate-based CSP approach at the Interservice/Industry Training, Simulation, and Education Conference (I/ITSEC) 2022 [21]. In our I/ITSEC presentation, we focused on a cohort of 10 expert surgeons and 10 novice medical residents. Through the application of a linear discriminant analysis with 10-fold cross-validation, we successfully attained a classification accuracy exceeding 90%, utilizing spatial pattern vectors extracted from the scalp. In this paper, we present a more advanced convolutional neural network (CNN) based approach with Grad-CAM analysis [25] and compare the results with our microstate-based CSP approach.
Our current study is motivated by prior studies that show non-Markovian and nonstationary microstates [26] can classify cognitive states using the attention-based time series deep learning framework [27]. However, the challenge remains in pre-selecting scalp potential topographies (microstates) that are considered stable [18], while short periods of unstable EEG topographies may occur, e.g., during errors. Instead of preselecting EEG microstates, one can use EEG topography as 3D tensor input for attention features in a CNN, e.g., see ESNet [28], where topography-preserving EEG-based temporal attentive pooling may be neurophysiologically interpretable [29] using Grad-CAM analysis [25]. Therefore, we adapted ESNet [28] to classify experts versus novices and compared the temporally important topography-preserving time segments with the microstate-based CSP approach [21] for mechanistic insights into skill learning [30]. Mechanistic insights were guided by prior studies [30,31] using functional near-infrared spectroscopy (fNIRS) that found cortical activation in the right prefrontal cortex, the right precentral gyrus, and the right postcentral gyrus at the start of the FLS task. Then, the left inferior frontal gyrus, the optical part, the left superior frontal gyrus, the medial orbital, the left postcentral gyrus, the gyrus, the left superior temporal gyrus, right superior frontal gyrus, and the medial orbital cortical areas of the cortical areas of the orbital showed significant differences between experts and novices in the error epochs [20]. In the current study, we analysed a subset of participants from prior studies [20,30] where simultaneous EEG measurements were conducted for our EEG topographic feature analysis.

2. Methods

2.1. Subjects and Experimental Setup

The study received approval from the Institutional Review Board of the University at Buffalo, USA, and all procedures adhered to local research regulations for human subjects. Thirteen neurologically normal novice medical students (seven females) and ten experienced surgeons (five females), all right-handed, participated after providing written informed consent. The EEG data utilised in this study are a subset of a previously published study [20,30] where multimodal fNIRS-EEG data were collected.
The experienced surgeons, with 1 to 25 years of expertise in FLS laparoscopic suturing tasks, were compared to novices new to FLS suturing with intracorporal knot tying. A prior study [30] demonstrated a statistically significant superior performance of experienced surgeons (overall score: 370.4, SD: 61.3) compared to novices (overall score: 84.2, SD: 65.3). Participants received verbal instructions and were equipped with laparoscopic tools for the task, which involved suturing through two marks in a Penrose drain and tying specific knots using needle drivers operated by both hands. The task began with the ‘start’ command, recorded in the multimodal data, and concluded when the subject cut both ends of the suture within a 180 s timeframe.
A customised multimodal fNIRS-EEG montage with 32 active gel electrodes (Figure 1A) was used to record brain activation signals. The EEG signals were captured by a wireless LiveAmp system (Brain Vision, Brain Vision, LLC-515 N. Greenfield Parkway, Suite 100, Garner, NC 27529, USA) at a rate of 500 Hz through 32 channels, as indicated by the grey ‘E’ discs in Figure 1A.

2.2. Data Preprocessing in EEGLab

The EEG data underwent comprehensive pre-processing and offline analysis using the EEGLab toolbox, an open-source software (https://sccn.ucsd.edu/eeglab/index.php accessed on 27 November 2023), designed for microstate analysis [18]. Initially, the data were downsampled to 250 Hz and high pass filtered at 1 Hz. To eliminate line noise, the ‘cleanline’ function was applied, followed by the ‘clean_rawdata’ function to identify and reject problematic channels. The interpolation of bad channels was accomplished using spherical splines [32] within the ‘clean_rawdata’ function, followed by re-referencing the EEG time series to the global average.
Task epochs were defined from the ‘start’ trigger by the experimenter, marking the initiation of the FLS task for each subject. The data then underwent artefact subspace reconstruction (ASR) using default settings in EEGLab, followed by re-referencing the EEG time series to the global average. ASR, an automated method, effectively removed transient EEG artefacts [33]. The default ASR parameter value of 20 was used, balancing the removal of non-brain signals with retaining brain activity, with the optimal range typically between 20 and 30 [33].
To focus on cortical sources corresponding with fNIRS HbO activity [20], a Laplacian spatial filter was applied to eliminate volume conduction from subcortical sources. Two expert subjects were excluded from the analysis due to the presence of ≥5 bad channels, as reliable microstate analysis requires the maximum number of bad channels to be less than five per subject [34]. Consequently, eight expert subjects remained in the study.

2.3. Data Processing of EEGLab and BCILab for Microstate-Based CSP Analysis

Microstate analysis was performed using the EEGlab toolbox [35] after aggregating EEG data during the FLS task from all experts (N = 8) and novices (N = 13), which is detailed in our published study [20]. In this study, we investigated the FLS complex task onset response where a previous study [36] demonstrated that the concentration of oxyhemoglobin peaked within 10 seconds during complex motor actions. Therefore, a 10 s duration was deemed sufficient for investigating the FLS complex task onset response using EEG and functional near-infrared spectroscopy [20]. FLS task related EEG dynamics will continue beyond the initial 10 seconds, which was not investigated in this study. First, we identified EEG microstate prototypes based on modified K-means clustering in EEGlab. The modified clustering of K-means was based on the goodness of fit of the microstate segmentation determined from the global explained variance (GEV) and the cross-validation criterion (CV). Here, the GEV criterion should theoretically become monotonically larger with increasing number of clusters [35]. The modified clustering of K-means in EEGlab found topographical maps of polarity-invariant microstate prototypes [35] from spontaneous EEG data during the FLS task (and the rest periods between the trials). Here, global field power (GFP) peaks are used to segment the EEG time series. The minimum peak distance was set at 10 ms (default) and 1000 randomly selected peaks (default) per subject were used for segmentation. Then, we rejected the GFP peaks that exceeded the standard deviation of all GFPs of all maps one time to segment the EEG data into a predefined number (2 to 8) of microstates. Here, the goal is to maximise the similarity between the EEG samples and the prototypes of the microstates they are assigned to using the modified K-means algorithm [35]. The modified K-means algorithm also sorts the microstate prototypes in decreasing GEV. We had set 100 random numbers of initialisations and 1000 maximum iterations for the modified K-means algorithm with 1 × 10−6 (default) as the relative threshold of convergence [35]. These microstates provided prototypes for subsequent microstate-based CSP analysis [21].
Microstate labels were applied to EEG samples from experts and novices based on topographical similarity (called backfitting) using the EEGlab toolbox [35]. Modified K-means algorithm [35] benefits of using k-means++ [37] for initialisation and the squared Euclidean metric for similarity calculation. Since short periods of unstable EEG topographies can occur, we applied temporal smoothing. Then, the temporally smoothed EEG topographies of experts (N = 8) and novices (N = 13) at the start of the FLS task in a 10 s time window were subjected to CSP analysis and classification using BCILab [38]. Here, if X1 and X2 are the EEG topographies from the experts and novices at the start of the FLS task, viz., X1 is a matrix of rows 250 Hz × 10 s (=2500 data points) and columns 32 channels, then, the desired spatial filter is obtained by, argmax w J ( w ) = w T X 1 T X 1 w w T X 2 T X 2 w = w T C 1 w w T C 2 w , where w denotes the spatial filter, and C1 and C2 represent the covariance matrices of X1 and X2, respectively. Using the Lagrange multiplier approach, the optimisation problem can be written as, L ( λ , w ) = w T C 1 w λ ( w T C 2 w ) , where λ is the Lagrange multiplier. The optimisation problem to find the spatial filter, w, requires the derivative set to zero, i.e., δ L δ w = 2 w T C 1 2 λ w T C 2 = 0 . The solution to this optimisation problem are the eigenvectors, M = C 2 1 C 1 , representing the spatial pattern vectors on the scalp. Here, the regularised CSP can improve robustness in small sample setting [39], and the largest eigenvector from, M 1 = ( C 2 + α K ) 1 C 1 and M 2 = ( C 1 + α K ) 1 C 2 ,   represent the spatial pattern vectors on the scalp with K assumed as an identity matrix [40]. Then, the classification was performed using a simple linear discriminant analysis (LDA) with a 10-fold cross-validation. The computational pipeline, starting from the raw EEG data to the classification, is shown in Figure 1B. Figure 1C shows the repeated measure design with 3 min for the FLS task and 2 min for the rest period.

2.4. Data Processing for Topography-Preserving CNN Approach

The procedure to convert the EEG data into a cuboid tensor is represented in Figure 2. Since spatiotemporal patterns were considered distinctive between experts and novices, spatiotemporal patterns were represented as 3D data that contained spatial information in two dimensions as well as temporal information in the third dimension. The EEG time series obtained from each channel was first downsampled at 120 Hz [28] and then projected to the corresponding position (from the scalp EEG montage; see Figure 1A) into a 2D image of a 16 × 16 square grid (height × width) using an azimuthal equidistant projection. We followed [28] so the EEG data during the FLS task from 2 s before the start trigger was divided into 3 s segments using a 1-second sliding window. The process was repeated in all sliding time steps, and the empty locations on the 16 × 16 grid between the projected electrode locations were interpolated using the griddata() function, Matlab (Mathworks, Inc., Natick, MA, USA) built-in function. Here, the griddata() function grids process 2D or 3D scattered data with a desired interpolation method. We used the v4 interpolation method in Matlab (Mathworks, Inc., USA) for better quality instead of cubic spline interpolation in ESNet [28]. Finally, the EEG data was shaped as a 3D tensor that included spatio-temporal information (that is, height × width × time). Here, we generated X e e g 16 × 16 × 360 3D EEG image for each 3-seconds (360 data points) EEG time window according to ESNet [28]. Therefore, 21 subjects (8 Experts and 13 Novices), with each subject performing the task three times (trials or reps), provided 21 subjects × 3 reps × 180 time windows (=11,340 cuboid tensors).

2.5. CNN for the 3D-EEG Tensor Classification of Expert versus Novice

A 3D CNN model, called ESNet [28], takes into account both spatial and temporal information by implementing a specific pooling layer called temporal attentive pooling (TAP) layer that compresses temporal information efficiently. The structure of the 3D CNN model is shown in Figure 3, which we adapted from ESNet [28]. The model consisted of three convolutional layers, and each of them is followed by a rectified linear unit (ReLU) activation function. Each layer inputs the channel information and doubles it as an output. Short-length kernels and strides were used for spatial information (i.e., the first and second axes) in each convolutional layer. Then, for temporal information (i.e., the third axis), short length kernels and stride were used in the second and third convolutional layers, while longer kernels and stride lengths were used in the first convolutional layer [28]. In summary, we determined the size of [kernel, stride] = [(2, 2, 10), (2, 2, 4)] for the first layer, [(2, 2, 2), (2, 2, 2)] for the second layer, and [(2, 2, 3), (2, 2, 2)] for the third layer. After the three convolutional layers, the TAP layer followed an efficient pooling process in the CNN model. The TAP layer first conducts Spatial Attentive Pooling (SAP), where the characteristic after the third convolutional layer is multiplied element by element by a trainable parameter, φ∈ 2 × 2 × 46 × 64 , and then computes the sum along the spatial axis for the characteristic, resulting in the SAP feature shape of 1 × 1 × 46 × 64 . Then, the SAP feature was classified by a Fully Connected (FC) layer, followed by the Softmax activation function, and multiplied element by the original feature after the third convolutional layer, and then, the sum along the temporal axis for the result was calculated. In total, the entire TAP process converted the shape of the original feature 2 × 2 × 46 × 64 to 2 × 2 × 1 × 64 , providing a larger weight to the relevant temporal information and, therefore, compressing it efficiently. The detailed structure and concept of the TAP layer are described in the original paper [28]. Then, the feature after the TAP layer was passed through the FC layer, followed by the ReLU function, Dropout, and Softmax layer. In addition to the dropout layer, we further implemented the batch normalisation between each convolutional layer and the ReLU function, after the TAP layer and FC layer and the ReLU function. The L2 regularisation was also adapted in a kernel and bias at each FC layer with a regularisation factor of 0.01 to prevent overfitting. The mechanistic insights were based on Gradient-weighted Class Activation Mapping (Grad-CAM) for “visual explanations” of decisions from our CNN-based model [25]. GradCAM uses the gradients of the EEG map flowing into the final convolutional layer to produce a coarse localisation map highlighting the salient regions for expert versus novice classification.

2.6. CNN Classification & Evaluation Criteria

We applied our customised ESNet [28], as shown in Figure 3, to classify experts versus novices with a five-fold cross-validation. In the five-fold cross-validation and testing, we divided the participants data in the 9:1 ratio, in which 10% of the total experts (8 subjects × 3 reps × 180 time windows) and novices (13 subjects × 3 reps × 180 time windows) data were used as holdout test data, and 90% of the total data were used for ten repeats of five-fold cross-validation. In each five-fold cross-validation, we trained the model using 80% of the 90% of total experts (8 subjects × 3 reps × 180 time windows) and novices (13 subjects × 3 reps × 180 time windows) data and cross-validated the model using 20% of the 90% total data, as shown in Figure 4. In each training epoch, the batch size was set at 32, and the training epoch was repeated 200 times, with five iterations within a five-fold cross-validation, with the learning rate set at 0.001. Then, for testing, this five-fold cross-validation was repeated 10 times that generated a new training and validation splits of the trials, at random, where we initialised the weights each time using the Keras initialiser function ‘glorot_uniform’, in which random values are pulled as initialised variables. The results of each iteration were evaluated with the holdout test data (10% of the total data) using indices: accuracy, F1 score, Mathews correlation coefficient (MCC), sensitivity, and specificity. The definition of F1 score, MCC, sensitivity, and specificity are as follows:
F 1 = ( p r e c i s i o n ) × ( r e c a l l ) ( p r e c i s i o n ) + ( r e c a l l ) × 2    
M C C = ( T P × T N F P × F N ) ( T P + F P ) ( T P + F N ) ( T N + F P ) ( T N + F N )
S e n s i t i v i t y = R e c a l l = T P T P + F N
S p e c i f i c i t y = T N F P + T N
Here, precision is shown as follows:
P r e c i s i o n = T P F P + F P  
Here, TP, FP, FN, and TN are the elements of the confusion matrix for binary classification.
C = [ T P ,   F P ;   F N ,   T N ]  
Moreover, TP (true positive) and FP (false positive) are the ratios of data correctly and falsely classified as positive data (i.e., Expert), respectively. Then, FN (false negative) and TN (true negative) are the numbers of data correctly and falsely classified as negative data (i.e., Novice), respectively.

3. Results

3.1. Microstate-Based CSP Classification of Expert versus Novice

We selected six microstate EEG prototypes based on the global explained variance (GEV) and the cross-validation criterion (CV) that was published earlier [20]. Here, the CV criterion, which points to the best clustering solution at its smallest value, reaches the minimum value for six microstates that are shown in Figure 5, sorted in decreasing GEV. As expected for a visuomotor task, the highest GEV is for microstate 1, corresponding to activation of the visual cortex (visual imagery [41]). The six microstate prototypes were backfitted to the EEG for 10 s at the start of the FLS task where it is postulated to be the start of building a cognitive-perceptual model [3,4] by the novices [6].
The global field power (GFP) of the first active microstates at the start of the FLS task was subjected to CSP analysis to find the spatial filters. Then, after applying spatial filters to the expert and novice EEG data and extracting features from them, the experts and novices were classified by LDA. We compared our microstate-based regularised CSP approach with conventional regularised CSP, where the microstate-based regularised CSP approach achieved a classification accuracy of 90.84% compared to 82.26% with conventional regularised CSP. The scalp topography for the first spatial filter using microstate based regularised CSP approach identified topographical maps from microstates 2 and 4 as the most significant eigenvectors. Also, our microstate based regularised CSP approach achieved classification accuracy greater than 90%. Here, microstate analysis applied a meta-criterion favouring the highest signal-to-noise ratio [24] that improved the accuracy when compared to that of conventional regularised CSP. Furthermore, microstates 2 and 4 as the most significant eigenvectors illustrated the importance of EEG electrodes in the parietal–temporal–occipital region for the classification of experts and novices during the FLS task. Microstate 2 was dominant in novices, while microstate 4 was dominant in the experts. We also computed the Kappa coefficient, which is a statistical method to measure the degree of agreement between classes. The Kappa coefficient method assigns zero to random classification and one to perfect classification [42], which is a more robust criterion than classification accuracy by considering random agreement. The microstate based regularised CSP approach outperformed conventional regularised CSP with a Kappa coefficient of more than 0.9. Importantly, the regularised CSP approach identified topographical maps from microstates 2 and 4 as the largest eigenvectors (from M 2 = ( C 1 + α K ) 1 C 2 and M 1 = ( C 2 + α K ) 1 ,   respectively ).

3.2. CNN for EEG 3D Tensor Classification of Expert versus Novice

Five-Fold Cross-Validation

Figure 6 and Figure 7 show the loss function and accuracy of the model, respectively, during 200 epochs of training and validation processes. The learning curve converges at the middle (100th epoch) of the training epochs, and the accuracy performance gap between training and validation stays within 2.5% by the end. Table 1 shows the average and maximum accuracies during the learning phase are shown for each of the 20 epochs.

3.3. Ten Evaluations with the Holdout Test Dataset

Table 2 shows the mean ± standard deviation and maximum for F1, MCC, precision, sensitivity, and specificity for the holdout test dataset over 10 repetitions of five-fold cross-validation. In Table 2, the highest values of each assessment (F1, MCC, accuracy, sensitivity, and specificity) are highlighted in red. Since our previous results of the classification accuracy using microstate-based CSP and LDA were 90.53%, so the topography-preserving CNN resulted is a significant improvement with >98% classification accuracy. The highest sensitivity, which indicates the percentage of correct predictions in the data labelled as positive (i.e., Expert), is 99.30% maximum. Then, the specificity, the rate of correct predictions in data labelled as negative (i.e., Novice), is 99.70% maximum. Furthermore, the F1 score, which evaluates the trade-off between recall and precision, reached 98.51%, indicating the equivalence of the model classification. Finally, the Matthews correlation coefficient (MCC), which evaluates the trade-off between the precision of positive and negative classifications (ranging from −100% to 100%), had a maximum of 97.56%. This implied that there was almost no classification bias among novices and experts even after five-fold cross-validation was repeated 10 times that generated a new training and validation splits of the trials, at random—see Table 2.

3.4. Gradient-Weighted Class Activation Mapping (Grad-CAM) Assessment of the CNN

The input of 16 × 16 EEG grid data to CNN (see Figure 3) is shown in the supplementary materials in Figure S1A for experts and novices. Then, the Grad-CAM heatmap for the convolutional layer 1 is shown in Figure S1B, for convolutional layer 2 is shown in Figure S1C, for the convolutional layer 3 is shown in Figure S1D, and for the TAP layer is shown in Figure 8. Note that the TAP layer first conducts Spatial Attentive Pooling which can provide insights into salient brain areas distinctive between experts and novices. The heatmap shows the salient regions in the topography-preserving convolutional layers from 1 to 3 (see Figure 3) where the time compressed central tendency of the heatmap for the TAP layer is shown in the top panel of Figure 8. The bottom left quadrant of the 16 × 16 EEG grid data is the discriminating salient region between experts and the novices. This bottom left quadrant is denoted by ‘2′ on the 1D x-axis in the bottom panel of Figure 8 showing temporal activation with flattened 2D space. The discriminating salient regions from the TAP layer corresponded (see E9, E10, E11, E12, E13, E14, and E15 in Figure 1A) with the centro-parietal (CCP5h, CCP3h), parietal (P3, P5, P7), and parieto–occipital (PO3, PO7) regions that partly overlap with the significantly different regions between experts and novices in our prior study [20].

4. Discussion

In this study, the novice trainees attempted a complex visuomotor task in a novel laparoscopic environment; therefore, they had to start building the perceptual model of the novel 3D environment based on 2D video and tactile feedback [43]. EEG topography, for example, microstate topographies, can be used as a marker of proficiency such that FLS psychomotor tasks with increasing task complexity can progress in the simulator as the novice trainee achieves proficiency towards FLS certification. For example, microstate 4 (see Figure 5) has been associated with the activation of the left inferior parietal lobe [44] related to the level of expert skill [45]. In our previous study [20], the microstate 4 was found to be more common in experts who are expected to have the action semantic knowledge [45]. Furthermore, global gestalt perception [46] is postulated to be present in experts due to their experience. Here, EEG topography can provide neurophysiological insights [20], e.g., microstate 1 (corresponding to visual cortex [41]), microstate 3 (corresponding to attention reorientation [41] and medial frontal cortex activation [47], and microstates 5 (topography comparable to microstate 3) during task performance can be considered markers of expertise. Then, the sequential flow of information between different brain states can be related to microstate sequences corresponding to the perception–action coupling [20].
We postulated that the CNN approach can learn the underlying temporal dynamics and provide latent representations that can be sensitive to other factors such as mental stress [26]. In this study, the ESNet approach [28] using EEG topography was adapted to classify experts from novices that provided a significant improvement with the highest sensitivity of 99.30% and the highest specificity at 99.70%. Since our CNN is topography-preserving, the Grad-CAM heatmap highlighting the bottom left quadrant of the TAP layer aligned with microstate 2 (see Figure 5) was found dominant in the novices from microstate-based CSP analysis. Here, microstate 2 is comparable to right-frontal left-posterior microstate A of the prototypical microstate classes [18,48], whereas microstate 4 hotspot that overlies the temporoparietal junction and the left inferior parietal lobe [44] may be related to the intact perception of global gestalt [49]. In particular, the Grad-CAM heatmap in the bottom panel of Figure 8D highlighted the parietal–occipital association area in novices (when compared to experts) at the beginning of the FLS task. This requires further investigation based on higher density EEG source localisation, since parietal hotspot was also found to be important for discrimination (relevant for spatial binding [46] based on CSP analysis which aligns well with the cognitive perception models [3,4]). Then, the supplementary motor area complex (SMA) is postulated to play a central role in the descent from the prefrontal to the motor cortex for the flow of skill-related information [7]. Here, SMA is known to be involved in planning complex motor finger tasks [50], and considered the programming area for motor effector subroutines in bimanual coordination tasks. Additionally, SMA has been suggested to form a queue of time-ordered motor commands before voluntary movement is executed via the descending pathways of the primary motor cortex (M1). In the current study, microstate 5 is postulated to capture SMA-related brain activity, which was found to be more frequent in experts than novices.
In the context of the perception–action cycle [51], investigation of motor skill acquisition with different virtual and physical simulation technologies [52] can provide insights into the neurophysiology of skill learning. Specifically, perception and action form a functional system through which behaviour is adapted in novices during exploratory actions to develop perceptual memory at the beginning in a novel environment. Then, perceptual memory allows action planning for improved skilled behaviour by updating the action parameters and refining them in executive memory, a continuous process of exploitative learning to reduce task variability. The two crucial attributes of the perception–action cycle are perceptual and executive memory [53], which are subserved by the frontoparietal network [54]. Here, the EEG-based analysis using microstate-based CSP and ESNet identified primarily the parietal–temporal–occipital EEG electrodes (microstates 2 and 4, the most significant eigenvectors) that illustrated the importance of the parietal–temporal–occipital association region for the classification of experts and novices.
In our previous study [55], we found that average fNIRS HbO-based cortical activation in novices was mainly in the left pars opercularis of the inferior frontal gyrus involved in cognitive control [56]. The inferior frontal gyrus is postulated to be crucial for error-based learning [20] since published studies have shown that the inferior frontal gyrus and the presupplementary motor area (pre-SMA) are involved in stop signal task performance [57] that is relevant in error correction. Then, the prefrontal area [20,58] was found to be more active based on the activation of fNIRS HbO, which may be related to the manipulation of structured information [59]. Therefore, the fNIRS-guided attention network (FGANet) [28] may improve neurophysiological interpretation by capturing the frontoparietal hemodynamic network [54]. Specifically, neuroimaging of the rostrocaudal characteristics of the frontal lobes that are associated with varying degrees of information processing complexity [60] can be improved with fNIRS-EEG fusion where spatially important regions can be identified from fNIRS signals while temporally detailed neuronal activation, e.g., microstates, can be extracted from the EEG signals [20].
The main limitation of the current study is the low-density EEG montage since microstate analysis is more reliable with higher electrode densities [34,61,62]. Furthermore, a higher EEG electrode density can allow robust source localisation [63] to establish the regions of the brain underlying salient microstates that support skilled behaviour. Also, the limited number of subjects (8 experts and 13 novices) did not allow classification of individual skill level. Here, we conducted group comparison of EEG topography that may be too nonspecific to support clear conclusions about the skill level of individual subjects [64]. Therefore, the current study showed the feasibility of the CNN approach that substantially improved (>98%) EEG topography-based group classification of experts versus novices for FLS suturing with intracorporeal knot tying task when compared to microstate-based CSP analysis with LDA (~90%). Here, the accuracy performance gap between training and validation stayed within 2.5% by the end due to a limited number of subjects (see Figure 6 and Figure 7), and that gap did not lead to classification bias even after five-fold cross-validation was repeated 10 times and generated new training and validation splits, at random—see Table 2. A potential pitfall in using artifact removal using ICA in EEGlab is a decrease in rank that can cause decreased accuracy in the CSP implementation in BCIlab used in the current study [65]. Therefore, we have verified the spatial filters that they are not complex numbers in the current CSP analysis with LDA. Task onset trigger was set manually by the experimenter when the start command was assigned by him/her to the subject to start the FLS task, which can affect CSP analysis with LDA; however, the temporal attentive pooling layer of ESNet [28] can find temporally important time segments despite small (<10 s) misalignments. Then, a weakness of Grad-CAM used in this study is its partial derivative approach that can miss multiple occurrences of the same class and/or can lead to inaccurate localisation of a heatmap; therefore, Grad-CAM++ may be preferred in the future [66].

5. Conclusions and Future Research

We postulated that testing ESNet [28] for our application can provide mechanistic insights from EEG topography-preserving CNN approach that can be enhanced with a temporal attentive pooling layer using simultaneous fNIRS signals (see FGANet [28]). In the future research, FGANet [28] approach of online fNIRS-EEG fusion may drive closed-loop adaptive FLS simulators in virtual reality such that task difficulty may be individually paced according to brain-based metrics to develop “coping” to handle cognitive stress response (sympathetic vasoconstriction or ‘choking’ [67,68] monitored with portable neuroimaging [26]. Furthermore, subject-specific portable neuroimaging skill learning may provide brain-based error prediction [20] that can be compared with actual task errors from 3D (behaviour) video data (from FLS box trainer) to develop predictive fNIRS-EEG-video fusion. An expected task error can be highlighted in the 2D video feedback to novices to facilitate visuospatial attention for corrective action in the early stage of skill learning. Here, a distinction is necessary between sensory prediction error [69], which is postulated to be important at the initial perceptual-cognitive stage of skill learning [70], and task error which is postulated to be important in the later stages for strategy learning [9] to achieve expert performance. Then, the CNN with Grad-CAM approach provided insights into the main brain areas that differentiated experts from novices, which may be facilitated with neuroimaging-guided non-invasive brain stimulation—[58,71]. For example, non-invasive cerebellar stimulation may facilitate sensory prediction error and/or non-invasive frontal stimulation may facilitate task error feedback to improve FLS task performance and demonstrate brain-behaviour causality [72]. Also, a simultaneous multimodal EEG-fNIRS approach to measure task and/or non-invasive brain stimulation related brain response can provide important mechanistic insights, e.g., during non-invasive brain stimulation facilitated skill learning, where neurovascular coupling may be modulated by endogenous [73] and exogenous [74] arousals, e.g., due to sympathetic vasoconstriction [20,67,75].

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/brainsci13121706/s1.

Author Contributions

Conceptualization, A.D.; methodology, T.M. and A.D.; software, T.M.; validation, T.M., F.N.U.R. and A.D.; formal analysis, A.D.; investigation, T.M. and Y.F.; resources, L.C. and S.D.S.; data curation, T.M.; writing—original draft preparation, T.M.; writing—review and editing, T.M., F.N.U.R., L.C. and A.D.; visualization, T.M.; supervision, A.D. and L.C.; project administration, A.D., L.C., S.D.S. and S.D.; funding acquisition, X.I., S.D.S., L.C., A.D. and S.D. All authors have read and agreed to the published version of the manuscript.

Funding

The authors appreciate the support of this study through the Medical Technology Enterprise Consortium (MTEC) award, #W81XWH2090019 (2020-628), and the US Army Futures Command, Combat Capabilities Development Command Soldier Centre STTC cooperative research agreement, #W912CG-21-2-0001. T.M. was funded by the pump priming grant from the school of engineering, University of Lincoln, UK for the writing—review and editing.

Institutional Review Board Statement

The study was conducted in accordance with the Declaration of Helsinki and approved by the Institutional Review Board of the University at Buffalo, USA. Approval code STUDY00004789. Approval date 26 August 2020.

Informed Consent Statement

Informed consent was obtained from all subjects involved in the study.

Data Availability Statement

Data available on request due to privacy/ethical restrictions.

Acknowledgments

The authors appreciate the support of pump priming from the school of engineering, University of Lincoln, UK for 2023 summer internship of Takahiro Manabe.

Conflicts of Interest

All authors declare that they have no conflict of interest.

References

  1. Birkmeyer, J.D.; Finks, J.F.; O’Reilly, A.; Oerline, M.; Carlin, A.M.; Nunn, A.R.; Dimick, J.; Banerjee, M.; Birkmeyer, N.J.O. Surgical Skill and Complication Rates after Bariatric Surgery. N. Engl. J. Med. 2013, 369, 1434–1442. [Google Scholar] [CrossRef] [PubMed]
  2. Dehabadi, M.; Fernando, B.; Berlingieri, P. The Use of Simulation in the Acquisition of Laparoscopic Suturing Skills. Int. J. Surg. 2014, 12, 258–268. [Google Scholar] [CrossRef] [PubMed]
  3. Cioffi, D. Beyond Attentional Strategies: Cognitive-Perceptual Model of Somatic Interpretation. Psychol. Bull. 1991, 109, 25–41. [Google Scholar] [CrossRef] [PubMed]
  4. Renner, R.S.; Velichkovsky, B.M.; Helmert, J.R. The Perception of Egocentric Distances in Virtual Environments—A Review. ACM Comput. Surv. 2013, 46, 23:1–23:40. [Google Scholar] [CrossRef]
  5. Marucci, M.; Di Flumeri, G.; Borghini, G.; Sciaraffa, N.; Scandola, M.; Pavone, E.F.; Babiloni, F.; Betti, V.; Aricò, P. The Impact of Multisensory Integration and Perceptual Load in Virtual Reality Settings on Performance, Workload and Presence. Sci. Rep. 2021, 11, 4831. [Google Scholar] [CrossRef] [PubMed]
  6. Kamat, A.; Makled, B.; Norfleet, J.; Schwaitzberg, S.D.; Intes, X.; De, S.; Dutta, A. Directed Information Flow during Laparoscopic Surgical Skill Acquisition Dissociated Skill Level and Medical Simulation Technology. NPJ Sci. Learn. 2022, 7, 1–13. [Google Scholar] [CrossRef]
  7. Kamat, A.; Intes, X.; De, S.; Dutta, A. Efference Information Flow during Skill Acquisition Mediates Its Interaction with Medical Simulation Technology. In Proceedings of the Biophotonics Congress: Biomedical Optics 2022 (Translational, Microscopy, OCT, OTS, BRAIN), Fort Lauderdale, FL, USA, 24–27 April 2022; paper JTu3A.33. Optica Publishing Group: Washington, DC, USA, 2022. [Google Scholar]
  8. Riener, R.; Harders, M. Virtual Reality in Medicine; Springer: London, UK, 2012; ISBN 978-1-4471-4010-8. [Google Scholar]
  9. Taylor, J.A.; Ivry, R.B. The Role of Strategies in Motor Learning. Ann. N. Y. Acad. Sci. 2012, 1251, 1–12. [Google Scholar] [CrossRef] [PubMed]
  10. Gu, Q.L.; Lam, N.H.; Wimmer, R.D.; Halassa, M.M.; Murray, J.D. Computational Circuit Mechanisms Underlying Thalamic Control of Attention; Tufts University: Medford, MA, USA, 2021. [Google Scholar]
  11. Guillot, A.; Collet, C.; Nguyen, V.A.; Malouin, F.; Richards, C.; Doyon, J. Brain Activity during Visual versus Kinesthetic Imagery: An fMRI Study. Hum. Brain Mapp. 2009, 30, 2157–2172. [Google Scholar] [CrossRef]
  12. Crick, F. Function of the Thalamic Reticular Complex: The Searchlight Hypothesis. Proc. Natl. Acad. Sci. USA 1984, 81, 4586–4590. [Google Scholar] [CrossRef]
  13. Collins, D.P.; Anastasiades, P.G. Cellular Specificity of Cortico-Thalamic Loops for Motor Planning. J. Neurosci. 2019, 39, 2577–2580. [Google Scholar] [CrossRef]
  14. Guo, K.; Yamawaki, N.; Svoboda, K.; Shepherd, G.M.G. Anterolateral Motor Cortex Connects with a Medial Subdivision of Ventromedial Thalamus through Cell Type-Specific Circuits, Forming an Excitatory Thalamo-Cortico-Thalamic Loop via Layer 1 Apical Tuft Dendrites of Layer 5B Pyramidal Tract Type Neurons. J. Neurosci. 2018, 38, 8787–8797. [Google Scholar] [CrossRef] [PubMed]
  15. Lotte, F.; Bougrain, L.; Cichocki, A.; Clerc, M.; Congedo, M.; Rakotomamonjy, A.; Yger, F. A Review of Classification Algorithms for EEG-Based Brain-Computer Interfaces: A 10 Year Update. J. Neural. Eng. 2018, 15, 031005. [Google Scholar] [CrossRef] [PubMed]
  16. Pawuś, D.; Paszkiel, S. BCI Wheelchair Control Using Expert System Classifying EEG Signals Based on Power Spectrum Estimation and Nervous Tics Detection. Appl. Sci. 2022, 12, 10385. [Google Scholar] [CrossRef]
  17. Pascual-Marqui, R.D.; Michel, C.M.; Lehmann, D. Segmentation of Brain Electrical Activity into Microstates: Model Estimation and Validation. IEEE Trans. Biomed. Eng. 1995, 42, 658–665. [Google Scholar] [CrossRef] [PubMed]
  18. Michel, C.M.; Koenig, T. EEG Microstates as a Tool for Studying the Temporal Dynamics of Whole-Brain Neuronal Networks: A Review. NeuroImage 2018, 180, 577–593. [Google Scholar] [CrossRef]
  19. Krylova, M.; Alizadeh, S.; Izyurov, I.; Teckentrup, V.; Chang, C.; van der Meer, J.; Erb, M.; Kroemer, N.; Koenig, T.; Walter, M.; et al. Evidence for Modulation of EEG Microstate Sequence by Vigilance Level. NeuroImage 2021, 224, 117393. [Google Scholar] [CrossRef]
  20. Walia, P.; Fu, Y.; Norfleet, J.; Schwaitzberg, S.D.; Intes, X.; De, S.; Cavuoto, L.; Dutta, A. Error-Related Brain State Analysis Using Electroencephalography in Conjunction with Functional near-Infrared Spectroscopy during a Complex Surgical Motor Task. Brain Inform. 2022, 9, 29. [Google Scholar] [CrossRef]
  21. Manabe, T.; Walia, P.; Fu, Y.; Intes, X.; De, S.; Schwaitzberg, S.; Cavuoto, L.; Dutta, A. EEG Topographic Features for Assessing Skill Levels during Laparoscopic Surgical Training. Res. Sq. 2022. [Google Scholar] [CrossRef]
  22. Yu, H.; Lu, H.; Wang, S.; Xia, K.; Jiang, Y.; Qian, P. A General Common Spatial Patterns for EEG Analysis with Applications to Vigilance Detection. IEEE Access 2019, 7, 111102–111114. [Google Scholar] [CrossRef]
  23. Koles, Z.J. The Quantitative Extraction and Topographic Mapping of the Abnormal Components in the Clinical EEG. Electroencephalogr. Clin. Neurophysiol. 1991, 79, 440–447. [Google Scholar] [CrossRef] [PubMed]
  24. Custo, A.; Van De Ville, D.; Wells, W.M.; Tomescu, M.I.; Brunet, D.; Michel, C.M. Electroencephalographic Resting-State Networks: Source Localization of Microstates. Brain Connect 2017, 7, 671–682. [Google Scholar] [CrossRef] [PubMed]
  25. Selvaraju, R.R.; Cogswell, M.; Das, A.; Vedantam, R.; Parikh, D.; Batra, D. Grad-CAM: Visual Explanations from Deep Networks via Gradient-Based Localization. Int. J. Comput. Vis. 2020, 128, 336–359. [Google Scholar] [CrossRef]
  26. Sikka, A.; Jamalabadi, H.; Krylova, M.; Alizadeh, S.; van der Meer, J.N.; Danyeli, L.; Deliano, M.; Vicheva, P.; Hahn, T.; Koenig, T.; et al. Investigating the Temporal Dynamics of Electroencephalogram (EEG) Microstates Using Recurrent Neural Networks. Hum. Brain Mapp. 2020, 41, 2334–2346. [Google Scholar] [CrossRef] [PubMed]
  27. Agrawal, S.; Chinnadurai, V.; Sharma, R. Hemodynamic Functional Connectivity Optimization of Frequency EEG Microstates Enables Attention LSTM Framework to Classify Distinct Temporal Cortical Communications of Different Cognitive Tasks. Brain Inform. 2022, 9, 25. [Google Scholar] [CrossRef]
  28. Kwak, Y.; Song, W.-J.; Kim, S.-E. FGANet: fNIRS-Guided Attention Network for Hybrid EEG-fNIRS Brain-Computer Interfaces. IEEE Trans. Neural. Syst. Rehabil. Eng. 2022, 30, 329–339. [Google Scholar] [CrossRef]
  29. Kumar, N.; Michmizos, K.P. A Neurophysiologically Interpretable Deep Neural Network Predicts Complex Movement Components from Brain Activity. Sci. Rep. 2022, 12, 1101. [Google Scholar] [CrossRef]
  30. Walia, P.; Fu, Y.; Schwaitzberg, S.D.; Intes, X.; De, S.; Dutta, A.; Cavuoto, L. Portable Neuroimaging Differentiates Novices from Those with Experience for the Fundamentals of Laparoscopic Surgery (FLS) Suturing with Intracorporeal Knot Tying Task. Surg. Endosc. 2022. [Google Scholar] [CrossRef]
  31. Fu, Y.; Walia, P.; Schwaitzberg, S.D.; Intes, X.; De, S.; Dutta, A.; Cavuoto, L. Changes in Functional Neuroimaging Measures as Novices Gain Proficiency on the Fundamentals of Laparoscopic Surgery Suturing Task. Neurophotonics 2023, 10, 023521. [Google Scholar] [CrossRef]
  32. Perrin, F.; Pernier, J.; Bertrand, O.; Echallier, J.F. Spherical Splines for Scalp Potential and Current Density Mapping. Electroencephalogr. Clin. Neurophysiol. 1989, 72, 184–187. [Google Scholar] [CrossRef]
  33. Chang, C.-Y.; Hsu, S.-H.; Pion-Tonachini, L.; Jung, T.-P. Evaluation of Artifact Subspace Reconstruction for Automatic Artifact Components Removal in Multi-Channel EEG Recordings. IEEE Trans. Biomed. Eng. 2020, 67, 1114–1121. [Google Scholar] [CrossRef] [PubMed]
  34. Zhang, K.; Shi, W.; Wang, C.; Li, Y.; Liu, Z.; Liu, T.; Li, J.; Yan, X.; Wang, Q.; Cao, Z.; et al. Reliability of EEG Microstate Analysis at Different Electrode Densities during Propofol-Induced Transitions of Brain States. NeuroImage 2021, 231, 117861. [Google Scholar] [CrossRef] [PubMed]
  35. Poulsen, A.T.; Pedroni, A.; Langer, N.; Hansen, L.K. Microstate EEGlab Toolbox: An Introductory Guide. bioRxiv 2018. [Google Scholar] [CrossRef]
  36. Li, X.; Krol, M.A.; Jahani, S.; Boas, D.A.; Tager-Flusberg, H.; Yücel, M.A. Brain Correlates of Motor Complexity during Observed and Executed Actions. Sci. Rep. 2020, 10, 10965. [Google Scholar] [CrossRef]
  37. Arthur, D.; Vassilvitskii, S. K-Means++: The Advantages of Careful Seeding. In Proceedings of the Eighteenth Annual ACM-SIAM Symposium on Discrete Algorithms, New Orleans, LA, USA, 7–9 January 2007; Society for Industrial and Applied Mathematics: Philadelphia, PA, USA, 2007; pp. 1027–1035. [Google Scholar]
  38. Kothe, C.A.; Makeig, S. BCILAB: A Platform for Brain-Computer Interface Development. J. Neural. Eng. 2013, 10, 056014. [Google Scholar] [CrossRef] [PubMed]
  39. Lu, H.; Eng, H.-L.; Guan, C.; Plataniotis, K.N.; Venetsanopoulos, A.N. Regularized Common Spatial Pattern with Aggregation for EEG Classification in Small-Sample Setting. IEEE Trans. Biomed. Eng. 2010, 57, 2936–2946. [Google Scholar] [CrossRef] [PubMed]
  40. Lotte, F.; Guan, C. Regularizing Common Spatial Patterns to Improve BCI Designs: Unified Theory and New Algorithms. IEEE Trans. Biomed. Eng. 2011, 58, 355–362. [Google Scholar] [CrossRef]
  41. Britz, J.; Van De Ville, D.; Michel, C.M. BOLD Correlates of EEG Topography Reveal Rapid Resting-State Network Dynamics. Neuroimage 2010, 52, 1162–1170. [Google Scholar] [CrossRef]
  42. Townsend, G.; Graimann, B.; Pfurtscheller, G. A Comparison of Common Spatial Patterns with Complex Band Power Features in a Four-Class BCI Experiment. IEEE Trans. Biomed. Eng. 2006, 53, 642–651. [Google Scholar] [CrossRef]
  43. Tanagho, Y.S.; Andriole, G.L.; Paradis, A.G.; Madison, K.M.; Sandhu, G.S.; Varela, J.E.; Benway, B.M. 2D versus 3D Visualization: Impact on Laparoscopic Proficiency Using the Fundamentals of Laparoscopic Surgery Skill Set. J. Laparoendosc. Adv. Surg. Tech. A 2012, 22, 865–870. [Google Scholar] [CrossRef]
  44. Numssen, O.; Bzdok, D.; Hartwigsen, G. Functional Specialization within the Inferior Parietal Lobes across Cognitive Domains. eLife 2021, 10, e63591. [Google Scholar] [CrossRef] [PubMed]
  45. van Elk, M. The Left Inferior Parietal Lobe Represents Stored Hand-Postures for Object Use and Action Prediction. Front. Psychol. 2014, 5. [Google Scholar] [CrossRef] [PubMed]
  46. Zaretskaya, N.; Anstis, S.; Bartels, A. Parietal Cortex Mediates Conscious Perception of Illusory Gestalt. J. Neurosci. 2013, 33, 523–531. [Google Scholar] [CrossRef] [PubMed]
  47. Gehring, W.J.; Fencsik, D.E. Functions of the Medial Frontal Cortex in the Processing of Conflict and Errors. J. Neurosci. 2001, 21, 9430–9437. [Google Scholar] [CrossRef] [PubMed]
  48. Khanna, A.; Pascual-Leone, A.; Michel, C.M.; Farzan, F. Microstates in Resting-State EEG: Current Status and Future Directions. Neurosci. Biobehav. Rev. 2015, 49, 105–113. [Google Scholar] [CrossRef] [PubMed]
  49. Rennig, J.; Bilalic, M.; Huberle, E.; Karnath, H.-O.; Himmelbach, M. The Temporo-Parietal Junction Contributes to Global Gestalt Perception—Evidence from Studies in Chess Experts. Front. Hum. Neurosci. 2013, 7. [Google Scholar] [CrossRef] [PubMed]
  50. Roland, P.E.; Larsen, B.; Lassen, N.A.; Skinhoj, E. Supplementary Motor Area and Other Cortical Areas in Organization of Voluntary Movements in Man. J. Neurophysiol. 1980, 43, 118–136. [Google Scholar] [CrossRef]
  51. Fuster, J.M. Chapter 8—Prefrontal Cortex in Decision-Making: The Perception–Action Cycle. In Decision Neuroscience; Dreher, J.-C., Tremblay, L., Eds.; Academic Press: San Diego, CA, USA, 2017; pp. 95–105. ISBN 978-0-12-805308-9. [Google Scholar]
  52. Dutta, A.; Kamat, A.; Makled, B.; Norfleet, J.; Intes, X.; De, S. Interhemispheric Functional Connectivity in the Primary Motor Cortex Distinguishes between Training on a Physical and a Virtual Surgical Simulator. In Proceedings of the Medical Image Computing and Computer Assisted Intervention—MICCAI 2021, Strasbourg, France, 27 September–1 October 2021; de Bruijne, M., Cattin, P.C., Cotin, S., Padoy, N., Speidel, S., Zheng, Y., Essert, C., Eds.; Springer International Publishing: Cham, Switzerland, 2021; pp. 636–644. [Google Scholar]
  53. Fuster, J.M. Upper Processing Stages of the Perception-Action Cycle. Trends Cogn. Sci. 2004, 8, 143–145. [Google Scholar] [CrossRef]
  54. Marek, S.; Dosenbach, N.U.F. The Frontoparietal Network: Function, Electrophysiology, and Importance of Individual Precision Mapping. Dialogues Clin. Neurosci. 2018, 20, 133–140. [Google Scholar] [CrossRef]
  55. Walia, P.; Fu, Y.; Schwaitzberg, S.D.; Intes, X.; De, S.; Cavuoto, L.; Dutta, A. Neuroimaging Guided tES to Facilitate Complex Laparoscopic Surgical Tasks—Insights from Functional Near-Infrared Spectroscopy. In Proceedings of the 2021 43rd Annual International Conference of the IEEE Engineering in Medicine & Biology Society (EMBC), Guadalajara, Mexico, 1–5 November 2021. [Google Scholar]
  56. Levy, B.J.; Wagner, A.D. Cognitive Control and Right Ventrolateral Prefrontal Cortex: Reflexive Reorienting, Motor Inhibition, and Action Updating. Ann. N. Y. Acad. Sci. 2011, 1224, 40–62. [Google Scholar] [CrossRef]
  57. Seidler, R.D.; Kwak, Y.; Fling, B.W.; Bernard, J.A. Neurocognitive Mechanisms of Error-Based Motor Learning; Advances in Experimental Medicine and Biology; Springer: New York, NY, USA, 2013; Volume 782. [Google Scholar] [CrossRef]
  58. Walia, P.; Kumar, K.N.; Dutta, A. Neuroimaging Guided Transcranial Electrical Stimulation in Enhancing Surgical Skill Acquisition. Comment on Hung et al. The Efficacy of Transcranial Direct Current Stimulation in Enhancing Surgical Skill Acquisition: A Preliminary Meta-Analysis of Randomized Controlled Trials. Brain Sci. 2021, 11, 707. Brain Sci. 2021, 11, 1078. [Google Scholar] [CrossRef]
  59. Kroger, J.; Kim, C. Frontopolar Cortex Specializes for Manipulation of Structured Information. Front. Syst. Neurosci. 2022, 16. [Google Scholar] [CrossRef] [PubMed]
  60. Thiebaut de Schotten, M.; Urbanski, M.; Batrancourt, B.; Levy, R.; Dubois, B.; Cerliani, L.; Volle, E. Rostro-Caudal Architecture of the Frontal Lobes in Humans. Cereb. Cortex 2017, 27, 4033–4047. [Google Scholar] [CrossRef] [PubMed]
  61. Bréchet, L.; Michel, C.M. EEG Microstates in Altered States of Consciousness. Front. Psychol. 2022, 13, 856697. [Google Scholar] [CrossRef] [PubMed]
  62. Dinov, M.; Leech, R. Modeling Uncertainties in EEG Microstates: Analysis of Real and Imagined Motor Movements Using Probabilistic Clustering-Driven Training of Probabilistic Neural Networks. Front. Hum. Neurosci. 2017, 11. [Google Scholar] [CrossRef] [PubMed]
  63. Michel, C.M.; Brunet, D. EEG Source Imaging: A Practical Review of the Analysis Steps. Front. Neurol. 2019, 10, 325. [Google Scholar] [CrossRef] [PubMed]
  64. Cook, D.A. Much Ado about Differences: Why Expert-Novice Comparisons Add Little to the Validity Argument. Adv. Health Sci. Educ. Theory Pract. 2015, 20, 829–834. [Google Scholar] [CrossRef]
  65. Rybar, M.; Daly, I.; Poli, R. Potential Pitfalls of Widely Used Implementations of Common Spatial Patterns. Annu. Int. Conf. IEEE Eng. Med. Biol. Soc. 2020, 2020, 196–199. [Google Scholar] [CrossRef]
  66. Chattopadhay, A.; Sarkar, A.; Howlader, P.; Balasubramanian, V.N. Grad-CAM++: Generalized Gradient-Based Visual Explanations for Deep Convolutional Networks. In Proceedings of the 2018 IEEE Winter Conference on Applications of Computer Vision (WACV), Lake Tahoe, NV, USA, 12–15 March 2018; pp. 839–847. [Google Scholar]
  67. Hannah, T.C.; Turner, D.; Kellner, R.; Bederson, J.; Putrino, D.; Kellner, C.P. Neuromonitoring Correlates of Expertise Level in Surgical Performers: A Systematic Review. Front. Hum. Neurosci. 2022, 16, 705238. [Google Scholar] [CrossRef]
  68. Modi, H.N.; Singh, H.; Orihuela-Espina, F.; Athanasiou, T.; Fiorentino, F.; Yang, G.-Z.; Darzi, A.; Leff, D.R. Temporal Stress in the Operating Room: Brain Engagement Promotes “Coping” and Disengagement Prompts “Choking”. Ann. Surg. 2018, 267, 683–691. [Google Scholar] [CrossRef]
  69. Schlerf, J.; Ivry, R.B.; Diedrichsen, J. Encoding of Sensory Prediction Errors in the Human Cerebellum. J. Neurosci. 2012, 32, 4913–4922. [Google Scholar] [CrossRef] [PubMed]
  70. Broadbent, D.P.; Causer, J.; Williams, A.M.; Ford, P.R. Perceptual-Cognitive Skill Training and Its Transfer to Expert Performance in the Field: Future Research Directions. Eur. J. Sport Sci. 2015, 15, 322–331. [Google Scholar] [CrossRef] [PubMed]
  71. Rahul, F.N.U.; Dutta, A.; Subedi, A.; Makled, B.; Norfleet, J.; Intes, X.; De, S. A Deep Learning Model for a Priori Estimation of Spatiotemporal Regions for Neuroimaging Guided Non-Invasive Brain Stimulation. Brain Stimul. Basic Transl. Clin. Res. Neuromodulation 2021, 14, 1689. [Google Scholar] [CrossRef]
  72. Brain-Behavior Analysis of Transcranial Direct Current Stimulation Effects on a Complex Surgical Motor Task. Available online: https://www.researchsquare.com (accessed on 4 October 2023).
  73. Han, F.; Gu, Y.; Liu, X. A Neurophysiological Event of Arousal Modulation May Underlie fMRI-EEG Correlations. Front. Neurosci. 2019, 13. [Google Scholar] [CrossRef]
  74. Arora, Y.; Dutta, A. Human-in-the-Loop Optimization of Transcranial Electrical Stimulation at the Point of Care: A Computational Perspective. Brain Sci. 2022, 12, 1294. [Google Scholar] [CrossRef]
  75. Arora, Y.; Dutta, A. Perspective: Disentangling the Effects of tES on Neurovascular Unit. Front. Neurol. 2023, 13, 1038700. [Google Scholar] [CrossRef]
Figure 1. (A) Multimodal sensor montage where the 32 active EEG gel electrodes (E1–E32) are shown with grey discs. The fNIRS source (S1–S16) and detectors (D1–D15) were not used in the current study. (B) EEG data processing pipeline in EEGLab and BCILab for the CSP-based classification of experts versus novices. (C) Repeated measures of FLS task (3 min) with rest periods (2 min).
Figure 1. (A) Multimodal sensor montage where the 32 active EEG gel electrodes (E1–E32) are shown with grey discs. The fNIRS source (S1–S16) and detectors (D1–D15) were not used in the current study. (B) EEG data processing pipeline in EEGLab and BCILab for the CSP-based classification of experts versus novices. (C) Repeated measures of FLS task (3 min) with rest periods (2 min).
Brainsci 13 01706 g001aBrainsci 13 01706 g001b
Figure 2. Transformation of 3 s of EEG time window into a 3D tensor for CNN analysis.
Figure 2. Transformation of 3 s of EEG time window into a 3D tensor for CNN analysis.
Brainsci 13 01706 g002
Figure 3. Our customised ESNet [28] architecture. The red box shows a temporal attentive pooling layer that was designed to compress temporal information extracted by consecutive convolutional layers efficiently. This pooling layer assigns higher weights to crucial time segments, enhancing the model’s focus on temporally significant features within the 3D feature representation.
Figure 3. Our customised ESNet [28] architecture. The red box shows a temporal attentive pooling layer that was designed to compress temporal information extracted by consecutive convolutional layers efficiently. This pooling layer assigns higher weights to crucial time segments, enhancing the model’s focus on temporally significant features within the 3D feature representation.
Brainsci 13 01706 g003
Figure 4. Data distribution for the five-fold cross-validation procedure that was repeated 10 times that generated a new training and validation splits of the trials, at random. Test data, shown in orange, remained the same for all the 10 repeats of the five-fold cross-validation.
Figure 4. Data distribution for the five-fold cross-validation procedure that was repeated 10 times that generated a new training and validation splits of the trials, at random. Test data, shown in orange, remained the same for all the 10 repeats of the five-fold cross-validation.
Brainsci 13 01706 g004
Figure 5. The 1–6 EEG microstate prototypes are sorted by decreasing global explained variance. These six EEG microstates are thought to represent basic building blocks of cognitive and perceptual processes during FLS skill learning [20].
Figure 5. The 1–6 EEG microstate prototypes are sorted by decreasing global explained variance. These six EEG microstates are thought to represent basic building blocks of cognitive and perceptual processes during FLS skill learning [20].
Brainsci 13 01706 g005
Figure 6. Training and validation loss function performance of the model.
Figure 6. Training and validation loss function performance of the model.
Brainsci 13 01706 g006
Figure 7. Accuracy performance of model training and validation.
Figure 7. Accuracy performance of model training and validation.
Brainsci 13 01706 g007
Figure 8. Spatial and temporal information are integrated through the incorporation of a dedicated pooling layer known as the temporal attentive pooling (TAP) layer, designed to efficiently condense temporal information. An illustrative Gradient-weighted Class Activation Mapping (Grad-CAM) assessment of TAP layer is shown where the top panel shows the compressed central tendency, while the bottom panel shows temporal activation (time in seconds) with flattened 2D grid (256 = 16 × 16) on the 1D x-axis.
Figure 8. Spatial and temporal information are integrated through the incorporation of a dedicated pooling layer known as the temporal attentive pooling (TAP) layer, designed to efficiently condense temporal information. An illustrative Gradient-weighted Class Activation Mapping (Grad-CAM) assessment of TAP layer is shown where the top panel shows the compressed central tendency, while the bottom panel shows temporal activation (time in seconds) with flattened 2D grid (256 = 16 × 16) on the 1D x-axis.
Brainsci 13 01706 g008
Table 1. Training and validation results: mean ± standard deviation and maximum accuracy during training and validation.
Table 1. Training and validation results: mean ± standard deviation and maximum accuracy during training and validation.
Training AccuracyValidation Accuracy
EpochMean ± Standard Deviation Maximum Mean ± Standard Deviation Maximum
10.7895 ± 0.01570.83690.5522 ± 0.03980.7339
200.9883 ± 0.00160.99270.9570 ± 0.01400.9889
400.9934 ± 0.00130.99740.9685 ± 0.00990.9912
600.9952 ± 0.00080.99800.9699 ± 0.01260.9918
800.9956 ± 0.00110.99940.9709 ± 0.01050.9924
1000.9962 ± 0.00100.99880.9779 ± 0.00800.9924
1200.9966 ± 0.00090.99900.9796 ± 0.00620.9936
1400.9973 ± 0.00080.99900.9794 ± 0.00700.9912
1600.9972 ± 0.00080.99940.9793 ± 0.00650.9912
1800.9971 ± 0.00080.99910.9778 ± 0.00960.9936
2000.9975 ± 0.00080.99960.9836 ± 0.00380.9936
Table 2. Test results with the holdout test data (five-fold cross-validation repeated 10 times that generated a new training and validation splits of the trials): mean ± standard deviation and maximum for F1, MCC, precision, sensitivity, and specificity (confusion matrix for each iteration is shown in Table S1 in the supplementary materials). Numbers in Bold are highest across iterations.
Table 2. Test results with the holdout test data (five-fold cross-validation repeated 10 times that generated a new training and validation splits of the trials): mean ± standard deviation and maximum for F1, MCC, precision, sensitivity, and specificity (confusion matrix for each iteration is shown in Table S1 in the supplementary materials). Numbers in Bold are highest across iterations.
IterationF1MCCAccuracySensitivitySpecificity
Mean ± Standard DeviationMaximumMean ± Standard DeviationMaximumMean ± Standard DeviationMaximumMean ± Standard DeviationMaximumMean ± Standard DeviationMaximum
10.9843 ± 0.01620.99280.9756 ± 0.02210.98870.9886 ± 0.01260.99470.9930 ± 0.01971.00000.9863 ± 0.02291.0000
20.9851 ± 0.01710.99130.9765 ± 0.02370.98640.9891 ± 0.01380.99370.9824 ± 0.03300.99420.9931 ± 0.01300.9983
30.9784 ± 0.02390.98690.9660 ± 0.03340.97960.9842 ± 0.01930.99050.9822 ± 0.04010.99710.9855 ± 0.01090.9882
40.9767 ± 0.02780.98710.9633 ± 0.03820.97960.9828 ± 0.02250.99050.9783 ± 0.05251.00000.9860 ± 0.03000.9950
50.9792 ± 0.01420.98430.9674 ± 0.01970.97540.9849 ± 0.01100.98840.9799 ± 0.02360.99400.9878 ± 0.02130.9983
60.9746 ± 0.02280.98430.9600 ± 0.03040.97520.9813 ± 0.01740.98840.9707 ± 0.03500.99080.9880 ± 0.03330.9967
70.9821 ± 0.01440.98710.9719 ± 0.01930.97960.9870 ± 0.01100.99050.9861 ± 0.01920.99400.9876 ± 0.02230.9950
80.9745 ± 0.04030.99420.9594 ± 0.05520.99090.9804 ± 0.03350.99580.9550 ± 0.06920.99420.9970 ± 0.01171.0000
90.9829 ± 0.01670.98710.9730 ± 0.02320.97980.9874 ± 0.01360.99050.9758 ± 0.03400.99120.9944 ± 0.01390.9983
100.9758 ± 0.03660.98990.9618 ± 0.04910.98410.9821 ± 0.02840.99260.9741 ± 0.04250.99120.9874 ± 0.04021.0000
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Manabe, T.; Rahul, F.N.U.; Fu, Y.; Intes, X.; Schwaitzberg, S.D.; De, S.; Cavuoto, L.; Dutta, A. Distinguishing Laparoscopic Surgery Experts from Novices Using EEG Topographic Features. Brain Sci. 2023, 13, 1706. https://doi.org/10.3390/brainsci13121706

AMA Style

Manabe T, Rahul FNU, Fu Y, Intes X, Schwaitzberg SD, De S, Cavuoto L, Dutta A. Distinguishing Laparoscopic Surgery Experts from Novices Using EEG Topographic Features. Brain Sciences. 2023; 13(12):1706. https://doi.org/10.3390/brainsci13121706

Chicago/Turabian Style

Manabe, Takahiro, F.N.U. Rahul, Yaoyu Fu, Xavier Intes, Steven D. Schwaitzberg, Suvranu De, Lora Cavuoto, and Anirban Dutta. 2023. "Distinguishing Laparoscopic Surgery Experts from Novices Using EEG Topographic Features" Brain Sciences 13, no. 12: 1706. https://doi.org/10.3390/brainsci13121706

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop