1. Introduction
Spinal cord injury (SCI) and multiple sclerosis (MS) are two highly debilitating neurological disorders affecting the central nervous system, causing significant sensory, motor, and autonomic impairments that drastically reduce quality of life [
1,
2]. SCI and MS were chosen due to their shared pathophysiology and the availability of EP-based ML studies, which are scarce for other neurological conditions. SCI, often resulting from trauma like motor vehicle accidents or falls, leads to immediate and frequently irreversible spinal cord damage [
1]. This damage disrupts neural pathways, causing paralysis, sensory loss, and impaired autonomic functions. SCI severity ranges from incomplete lesions, where some function remains, to complete lesions with total functional loss below the injury site. The pathophysiology involves an initial mechanical injury followed by secondary injury processes, including inflammation, edema, and oxidative stress, which worsen neuronal damage and hinder recovery [
1]. These complex mechanisms make predicting recovery and tailoring rehabilitation challenging, highlighting the need for accurate prognostic tools. Multiple sclerosis is a chronic autoimmune disease involving recurrent inflammatory attacks on the myelin sheath of the brain and spinal cord nerve fibers [
2]. These attacks cause demyelination, axonal damage, and neurodegeneration, leading to diverse symptoms including muscle weakness, fatigue, and cognitive impairment. MS typically begins with a relapsing-remitting course, with periods of neurological dysfunction followed by partial or complete recovery; however, many patients transition to a secondary progressive phase with accumulating disability [
2]. The unpredictable nature and heterogeneous presentation of MS complicate prognosis and treatment. Both SCI and MS share pathological features—demyelination, inflammation, and axonal damage—that disrupt neural conduction [
1,
2]. These shared mechanisms make them suitable for evaluation using electrophysiological techniques like evoked potentials (EPs) [
3,
4]. EPs, including somatosensory evoked potentials (SSEPs) and motor evoked potentials (MEPs), are crucial for assessing sensory and motor pathway integrity in SCI and MS [
3,
4]. SSEPs measure electrical activity in response to peripheral nerve stimulation, reflecting sensory pathway conduction efficiency. MEPs, elicited by transcranial magnetic stimulation, assess motor pathway function through muscle response recordings. In SCI, prolonged SSEP latencies or absent amplitudes indicate disrupted sensory conduction, often correlating with axonal damage, while MEP abnormalities reflect motor pathway impairment, aiding functional recovery predictions [
3]. In MS, EP abnormalities, such as delayed N20 or P40 latencies, signal demyelination, and reduced MEP amplitudes, suggest motor tract dysfunction, providing insights into disease progression [
4]. These electrophysiological measures offer valuable insights into the functional consequences of pathological processes, making them potentially valuable prognostic biomarkers.
The integration of machine learning (ML), which learns patterns from data to make predictive models, with EPs, offers a promising approach to improve prognostic accuracy in SCI and MS [
5,
6,
7]. ML algorithms, such as Random Forests, Support Vector Machines, and Deep Learning, effectively analyze complex multimodal data, identifying patterns in EP metrics (latency, amplitude, time series) that correlate with clinical outcomes like the American Spinal Injury Association (ASIA) scale for SCI or the Expanded Disability Status Scale (EDSS) for MS [
7,
8]. By using large datasets, ML models can uncover subtle relationships between EP features and disease outcomes, providing personalized predictions that surpass traditional statistical methods. For example, in SCI, ML might prioritize SSEP amplitude as a predictor of motor recovery, reflecting axonal integrity, while in MS, MEP time series might capture dynamic changes in motor function, informing disease trajectory. Despite the potential of EP-based ML models, their application in SCI and MS remains under-explored, with limited efforts to synthesize evidence across studies [
9,
10,
11,
12,
13,
14]. This meta-analysis addresses this gap by synthesizing evidence from five studies (n = 583) on EP-based ML models for predicting SCI recovery and MS progression [
12,
15]. Using a random-effects model to account for heterogeneity, the analysis evaluates the accuracy and area under the curve (AUC) of these models, providing a robust estimate of their prognostic utility. Complementary imaging modalities, such as T2-weighted MRI and diffusion tensor imaging (DTI), can further enhance the prognostic value of EPs by providing structural insights into lesion burden and axonal disruption [
5,
6]. This study aims to highlight the potential of EP-based ML in precision neurology, guiding personalized rehabilitation for SCI and disease-modifying therapies for MS, while identifying areas for future research to improve clinical translation.
1.1. Pathophysiological Framework: Linking EPs to Disease Mechanisms
SCI and MS impair sensory and motor conduction through shared pathological processes, positioning EPs as vital biomarkers [
1,
2]. In SCI, trauma triggers oligodendrocyte apoptosis, reducing myelin basic protein and causing demyelination, which prolongs EP latencies, such as N20 or P40, reflecting slowed neural conduction [
1,
3]. Inflammation in SCI involves acute cytokine release, including TNF-α and IL-6, leading to reduced EP amplitudes that signal neuronal dysfunction [
1]. Axonal damage in SCI results from calpain-mediated transection, with absent EP amplitudes indicating irreversible loss, often marked by reduced N-acetylaspartate levels [
1]. Neurodegeneration and repair failure in SCI, driven by glial scar formation, result in persistent EP abnormalities, highlighting failed remyelination [
1]. In MS, autoimmune attacks on myelin components, such as MBP and MOG, cause multifocal demyelination, detectable through prolonged EP latencies [
2,
4]. Chronic inflammation in MS, mediated by cytokines like TNF-α and IL-17, reduces EP amplitudes, reflecting ongoing neuronal impairment [
2]. Axonal loss in MS, secondary to chronic demyelination, is marked by reduced N-acetylaspartate and absent EP amplitudes, indicating severe damage [
2]. Impaired oligodendrocyte differentiation in MS hinders remyelination, leading to persistent EP abnormalities [
2,
6]. These mechanisms demonstrate EPs’ role in capturing disease processes, informing ML-based prognostic models [
7,
15,
16].
1.2. Imaging Correlations: Complementing EPs with Structural Insights
Imaging modalities enhance the prognostic utility of EPs by providing structural insights into SCI and MS pathology [
5,
6,
17]. T2-weighted MRI reveals edema or necrosis in SCI and demyelinating plaques in MS, with EP abnormalities correlating with lesion burden in both conditions [
5,
16,
18]. Diffusion tensor imaging (DTI) shows reduced fractional anisotropy (FA), indicating axonal disruption, which aligns with EP changes in sensory and motor pathways [
15,
16,
19]. Magnetization transfer imaging (MTI) detects decreased myelin content through reduced magnetization transfer ratios (MTR), complementing EP findings of demyelination [
8,
20,
21]. Magnetic resonance spectroscopy (MRS) identifies reduced N-acetylaspartate (NAA) and elevated choline, signaling axonal loss and inflammation, respectively, which correspond to EP amplitude reductions [
22,
23,
24]. In MS, lesion volume shows moderate positive correlations with SSEP latency (r ≈ 0.4–0.6, Yperman et al., 2020 [
25,
26]), while in SCI, lower FA is associated with reduced MEP amplitude (r ≈ 0.3–0.5, Yoo H.J. et al., 2024 [
27]; Chrysanthakopoulou et al., 2025 [
15]). These correlations vary due to differences in imaging protocols and EP metrics. The integration of EPs with imaging strengthens ML models by combining functional and structural data, improving prognostic accuracy [
7,
18].
4. Discussion
This meta-analysis, using a random-effects model, synthesizes evidence from five studies (n = 583) on the application of machine learning to evoked potentials for prognostic modeling in SCI and MS [
12,
15,
24,
26,
27]. The pooled accuracy 77.7%, AUC 0.82 underscore the robust predictive utility of EP-based ML models, with SSEP latency and MEP time series emerging as universal predictors due to their sensitivity to demyelination [
3,
4]. Compared to prior reviews (accuracy ~70–75%), our results highlight ML’s ability to integrate EP features (SSEP latency, MEP time-series) with imaging (T2 lesions, DTI FA) and clinical data (ASIA, EDSS). SSEPs, sensitive to dorsal column demyelination, and MEPs, reflecting corticospinal axonal integrity, align with pathophysiological mechanisms (MBP loss, TNF-α elevation, NAA reduction). The pooled accuracy and AUC support predictions of functional milestones, such as ASIA score improvements in SCI and improved MS prognosis (Hypothetical Clinical Scenarios, pp. 8–9), though minimal clinically important differences for EP-based outcomes remain undefined, warranting future research. SSEP latency’s prominence in MS likely reflects demyelination-induced conduction delays, while MEP amplitude drives SCI predictions due to axonal loss (Imaging Improvements); future ML explainability studies (e.g., using SHAP values) could clarify feature prioritization [
24,
27]. Multimodal approaches (e.g., SSEP-MRI-Gait in Yoo H.J., et al., 2024 [
27]) improved SCI prognosis, but limited reporting of unimodal vs. multimodal AUCs prevented quantitative comparison, underscoring the need for future studies [
27]. Sensitivity analysis excluding an animal study (n = 551) yielded slightly lower accuracy (76.6%, I² = 55%) and confirms stability without the rat model, supporting human applicability. [
25]. The higher accuracy in SCI studies compared to MS reflects the stronger association of EPs, particularly amplitude, with axonal damage in SCI, which is more directly tied to functional outcomes like motor recovery [
15,
25,
27].
Table 1 clarifies the variability across studies, highlighting diverse ML approaches and EP types. Peeters et al. (2020) demonstrated that MEP time-series features improve EDSS progression prediction [
26]. Yoo H.J., et al. (2024) showed that combining SSEPs with MRI data enhances recovery prediction [
27]. These findings align with prior evidence that EPs are sensitive biomarkers; however, ML significantly enhances their prognostic power [
7,
8]. Precision neurology aims to deliver personalized diagnostic and therapeutic strategies [
21].
This approach leverages advanced technologies, such as machine learning, to integrate diverse data sources—electrophysiological, imaging, and clinical—to generate precise predictions of disease trajectories and treatment responses. In this meta-analysis, EP-based ML models exemplify precision neurology by harnessing the functional insights provided by SSEPs and MEPs to predict outcomes in SCI and MS [
12,
15,
24,
26,
27]. For SCI, the ability of ML to prioritize SSEP amplitude reflects its sensitivity to axonal integrity, a critical determinant of motor recovery potential [
15,
27]. In MS, the emphasis on MEP time series captures the dynamic interplay of demyelination and inflammation, enabling predictions of EDSS worsening [
24,
26]. The high predictive performance (accuracy 80.0%, AUC 0.82) demonstrates the potential of these models to inform individualized care, guide rehabilitation plans for SCI patients, or identify MS patients at risk of rapid progression for early intervention with disease-modifying therapies [
21].
The integration of multimodal data is crucial in precision neurology, and this meta-analysis highlights the synergistic role of EPs and imaging in enhancing prognostic accuracy [
5,
6,
18]. T2-weighted MRI, DTI, and MRS provide structural and biochemical insights into lesion burden, axonal disruption, and inflammation, complementing the functional data from EPs [
5,
6]. For instance, in SCI, reduced fractional anisotropy on DTI correlates with SSEP amplitude reductions, reflecting axonal loss, while in MS, T2 lesion load aligns with prolonged SSEP latencies, indicating demyelination [
5,
6]. ML models that combine these modalities can create comprehensive patient profiles, enabling more accurate predictions of outcomes [
18,
27]. Despite its promise, precision neurology faces challenges in the context of EP-based ML models. These challenges include variability in EP acquisition protocols, limited sample sizes, and the need for standardized multimodal data integration, as discussed below. Variability in EP acquisition protocols (e.g., stimulation parameters or electrode placement) contributes to heterogeneity (I
2 = 60–62%), but the limited number of studies (n = 5) precluded subgroup analyses by EP methodology (SSEP vs. MEP) or ML algorithm type (e.g., Random Forest vs. deep learning). Future studies with larger samples should explore these stratifications. The moderate heterogeneity observed in this meta-analysis (I
2 = 55-57%) reflects variability in EP types, populations, ML algorithms, and outcomes, which complicates model generalizability [
12,
15,
24,
26,
27]. The inclusion of one rodent study (Cui et al., 2019) introduces potential interspecies variability in neurophysiology, mitigated by sensitivity analysis (accuracy 78.5%, AUC 0.81, n = 448) confirming robust human-only results [
24]. The analysis of multiple EP features (e.g., latency, amplitude) may increase Type I error risk in primary studies, though their validation methods and our pooled metrics mitigate false discovery concerns. The study was not pre-registered in PROSPERO, but covariate selection was justified post hoc based on clinical relevance, minimizing selective reporting risks. The focus on short-term outcomes (e.g., 6-month ASIA recovery) limits insights into longitudinal EP variability (e.g., due to remyelination), necessitating future studies on chronic trajectories like 2-year EDSS progression. Standardizing EP acquisition protocols is essential to ensure consistency across studies and facilitate the development of robust, widely applicable models. Additionally, the reliance on retrospective data and the limited number of studies (n = 6) constrain statistical power and clinical applicability. Prospective, multicenter studies with larger cohorts are needed to validate EP-based ML models and establish their utility in real-world clinical settings. Furthermore, the integration of real-time EP monitoring could enhance the dynamic assessment of disease progression, allowing for adaptive prognostic models [
21].
The findings of this meta-analysis position EP-based ML models as a cornerstone of precision neurology, with significant implications for clinical practice [
21]. In SCI, these models can guide rehabilitation by identifying patients likely to achieve motor recovery [
15,
27]. In MS, they can inform treatment decisions by predicting progression risk [
12,
24,
26]. The cost-effectiveness and non-invasive nature of EPs make them particularly suitable for widespread clinical adoption. Looking forward, the development of integrated platforms that combine EP data with imaging, genetic, and clinical variables could further advance precision neurology, creating a holistic approach to prognosis and management in SCI and MS [
21]. To illustrate the practical utility of EP-based ML models, consider two hypothetical clinical scenarios.
In an SCI case, a 35-year-old patient with a C6 incomplete injury undergoes an SSEP assessment one-month post-injury. An ML model, leveraging SSEP amplitude, predicts with 83% accuracy a high likelihood of ASIA score improvement within six months. The clinician uses this prognosis to prioritize intensive physical therapy and consider transcranial magnetic stimulation, optimizing resource allocation and patient motivation.
In an MS case, a 42-year-old patient with relapsing-remitting MS shows prolonged SSEP latency and variable MEP time series. An ML model predicts a 70% risk of EDSS worsening within two years. The neurologist initiates ocrelizumab earlier than planned, tailoring therapy to mitigate progression.
These scenarios demonstrate how the high predictive performance of EP-based ML models can guide personalized interventions, enhancing clinical decision-making in SCI and MS [
12,
15,
21,
24,
26,
27]. Spinal cord injury poses significant challenges due to its acute onset and lasting consequences [
1]. The primary injury, often caused by mechanical trauma, severs axons and disrupts neural circuits, leading to immediate loss of function. The secondary injury phase, characterized by inflammation, oxidative stress, and excitotoxicity, amplifies tissue damage, creating a hostile environment for neural repair. This cascade of events results in glial scarring and cystic cavity formation, which impede axonal regeneration and remyelination. EPs capture these disruptions by detecting prolonged latencies (indicative of demyelination) and reduced amplitudes (reflecting axonal loss), providing a functional measure of injury severity [
3]. The ability of ML to prioritize features like SSEP amplitude in SCI underscores its potential to predict recovery trajectories, such as improvements in the ASIA scale, guiding rehabilitation strategies tailored to individual patients [
15,
27]. The heterogeneity in SCI studies, driven by differences in injury levels and EP metrics, highlights the need for disease-specific interpretations [
5]. Multiple sclerosis presents a dynamic and progressive challenge due to its autoimmune etiology [
2]. The immune system’s attack on myelin leads to multifocal lesions, disrupting neural conduction and causing a spectrum of symptoms that evolve over time. Early in MS, inflammatory demyelination dominates, producing relapses that may resolve partially or fully. Over time, chronic demyelination and axonal loss drive progressive disability, often measured by the EDSS. EPs, particularly SSEP latency and MEP time series, detect these changes by quantifying conduction delays and motor dysfunction, offering insights into disease activity and progression [
4]. ML models excel at integrating these temporal and multimodal EP features, predicting EDSS progression with high accuracy [
12,
24,
26]. However, the lower performance in MS studies compared to SCI may reflect the disease’s heterogeneity, with variable lesion locations and clinical courses complicating prognostic modeling [
12,
24,
26]. The heterogeneity in MS studies reflects the variability in EEG outcomes and clinical domains [
12]. The focus on long-term outcomes could further improve model precision [
24]. The findings align with prior evidence that EPs are sensitive biomarkers of EEG variability, with ML enhancing predictive power [
7]. The integration of multimodal EEG data improves prognostic accuracy, supporting multimodal approaches [
6]. The random-effects model’s flexibility accounts for variability in study populations and outcomes, ensuring robust evidence synthesis [
12,
27].
The moderate heterogeneity (I
2 = 55–57%) in the meta-analysis, driven by differences in EP types, populations, and ML methods, underscores the complexity of applying ML across these disorders [
12,
15,
24,
26,
27]. The findings align with prior evidence that EPs are sensitive biomarkers of neural dysfunction, but the integration of ML significantly enhances their prognostic power over traditional methods [
3,
4,
7,
8]. The ability of ML to handle high-dimensional EP data, such as time series and amplitude profiles, enables the identification of subtle patterns that correlate with clinical outcomes. In SCI, the emphasis on amplitude reflects its direct link to axonal integrity, a critical determinant of motor recovery [
15,
26]. In MS, the focus on time series captures the dynamic interplay of demyelination and inflammation, which drive disease progression [
12,
25,
27]. The random-effects model’s wider confidence intervals appropriately account for the variability introduced by animal model inclusion and differing outcomes (ASIA vs. EDSS vs. injury location), ensuring a robust synthesis of evidence [
12,
15,
24,
26,
27].
As a main limitation of the present study, the limited number of studies (n = 6) restricts statistical power and the precision of between-study variance (τ
2) estimation. The inclusion of an animal model also limits clinical applicability, as rat models do not fully replicate human pathophysiology [
25]. Hypothetical pooling assumptions, such as estimated accuracy for Yperman et al. (2020) and AUC for Cui et al. (2019), introduce potential bias [
24,
25]. The retrospective design of most studies further constrains causal inference. The random-effects model mitigates some of these issues by accommodating heterogeneity, but its wider confidence intervals reflect underlying uncertainty. Future meta-analyses should prioritize larger, prospective human studies to improve robustness. The cost-effectiveness and non-invasive nature of EPs, combined with ML’s analytical power, position EP-based models as practical tools for precision neurology [
21]. In SCI, these models could inform rehabilitation by predicting motor recovery potential [
15]. In MS, they could guide disease-modifying therapies by identifying patients at risk of rapid progression [
12,
26]. The integration of EPs with imaging, such as MRI or DTI, could further enhance model performance, offering a multimodal approach to prognosis [
5,
6,
18,
27]. This meta-analysis represents the first effort to quantify the performance of EP-based machine learning models across SCI and MS [
12,
15,
26,
27]. It establishes a pathophysiological linkage, demonstrating that EPs reflect demyelination through myelin basic protein loss, inflammation via cytokine activity, and axonal damage indicated by reduced N-acetylaspartate [
1,
2]. The analysis highlights cross-disease contributions, identifying shared predictors like latency and time series, as well as disease-specific predictors, such as amplitude in SCI [
15,
26,
27]. It supports the clinical translation of EP-based ML tools for personalized prognosis, offering a framework for tailored interventions [
21]. The study’s scientific impact lies in consolidating EPs’ role in ML-driven prognosis, while its clinical impact includes informing rehabilitation strategies for SCI and disease-modifying therapies for MS [
15,
26,
27]. By setting a foundation for future multimodal ML meta-analyses, it advances the field of precision neurology [
21].