Next Article in Journal
A 2.5D Self-Training Strategy for Carotid Artery Segmentation in T1-Weighted Brain Magnetic Resonance Images
Previous Article in Journal
Imaging Based Techniques Combined with Color Measurements for the Enhancement of Medieval Wall Paintings in the Framework of EHEM Project
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Hybrid Ensemble Deep Learning Model for Advancing Ischemic Brain Stroke Detection and Classification in Clinical Application

1
Department of Computer Science, Al-Quds University, Jerusalem P.O. Box 20002, Palestine
2
Department of Computer Engineering, Istinye University, Istanbul 34010, Turkey
3
Department of Medical Imaging, Al-Quds University, Jerusalem P.O. Box 20002, Palestine
4
Al Quds Business Center for Innovation, Technology, and Entrepreneurship, Al-Quds University, Jerusalem P.O. Box 20002, Palestine
5
Faculty of Medicine, Al-Quds University, Jerusalem P.O. Box 20002, Palestine
*
Author to whom correspondence should be addressed.
J. Imaging 2024, 10(7), 160; https://doi.org/10.3390/jimaging10070160
Submission received: 30 March 2024 / Revised: 16 May 2024 / Accepted: 28 May 2024 / Published: 2 July 2024
(This article belongs to the Section AI in Imaging)

Abstract

:
Ischemic brain strokes are severe medical conditions that occur due to blockages in the brain’s blood flow, often caused by blood clots or artery blockages. Early detection is crucial for effective treatment. This study aims to improve the detection and classification of ischemic brain strokes in clinical settings by introducing a new approach that integrates the stroke precision enhancement, ensemble deep learning, and intelligent lesion detection and segmentation models. The proposed hybrid model was trained and tested using a dataset of 10,000 computed tomography scans. A 25-fold cross-validation technique was employed, while the model’s performance was evaluated using accuracy, precision, recall, and F1 score. The findings indicate significant improvements in accuracy for different stages of stroke images when enhanced using the SPEM model with contrast-limited adaptive histogram equalization set to 4. Specifically, accuracy showed significant improvement (from 0.876 to 0.933) for hyper-acute stroke images; from 0.881 to 0.948 for acute stroke images, from 0.927 to 0.974 for sub-acute stroke images, and from 0.928 to 0.982 for chronic stroke images. Thus, the study shows significant promise for the detection and classification of ischemic brain strokes. Further research is needed to validate its performance on larger datasets and enhance its integration into clinical settings.

1. Introduction

Brain strokes, or brain attacks, represent a severe global health problem, significantly contributing to the burden of illness, disability, and mortality worldwide [1,2]. The World Health Organization (WHO) reports that cerebrovascular accidents, commonly known as strokes, are the second leading cause of death and the third leading cause of disability globally [3]. The severity of stroke can vary greatly, from mild cases that resolve with little or no residual disability to severe cases that can lead to death or lifelong dependence on care [4,5].
Speedy treatment of stroke is crucial as delays can cause permanent brain damage or increase the severity of the incident on the patient’s healthcare. However, accurately diagnosing strokes quickly in an emergency is difficult due to several factors, including varying levels of expertise among doctors, availability (or lack thereof) of Magnetic resonance imaging (MRI) and Computed tomography (CT) machines for overcrowded hospitals, and a long processing time due to the heavy computational load from processing medical images. These issues can lead to missed or delayed diagnoses and increase the risk of long-term disabilities.
Several methods for ensuring early detection and timely treatment of stroke have been studied. For instance, deep learning, a relatively new yet rapidly developing field of artificial intelligence, has the potential to revolutionize the diagnosis and treatment of stroke. Deep learning models can be trained on large datasets of medical images to learn to identify stroke lesions with high accuracy, even in small or complex cases [6]. These models leverage complex algorithms to extract intelligent, high-dimensional patterns not easily detectable through standard diagnostic procedures. The efficacy of deep learning in stroke diagnosis is gaining attention, primarily due to its proficiency in separating significant features and facilitating differentiation between diverse patient scenarios [5,7,8,9,10,11].
In recent years, studies have indicated that deep learning models can outperform human radiologists in detecting and classifying stroke lesions on CT scans and MRI scans [6,7]. Exploring this cutting-edge field, recent research has identified several benchmark deep learning models—including AlexNet, Visual Geometry Group (VGGNet), Inception, and Residual Network (ResNet)—as effective tools in the early detection of brain stroke [12].
Employed as transfer learning algorithms, these models have demonstrated considerable success in infarct lesion detection and classification [5,6,7,13]. Within their functionality is an encoder–decoder structure that emphasizes semantic segmentation tasks, empowering these models to precisely identify and categorize stroke-related abnormalities within CT images of the brain [14]. Nevertheless, the effectiveness of these advanced models in automating brain infarction diagnosis is complicatedly connected to the selection of the architectural framework.
Moreover, researchers have introduced the Hybrid ensemble deep learning (HEDL) models as advanced approaches that combine several deep learning methods, aiming to obtain even better prediction performances [13]. Several studies have used the hybrid model by combining various algorithms, like convolutional neural networks (CNNs), recurrent neural networks (RNNs), and long short-term memory networks (LSTM), all with varying results [15,16,17].
Despite promising results, machine learning faces several significant challenges in detecting early-stage ischemic strokes. One major issue is the quality and variety of data. Medical imaging data can be inconsistent due to differences in equipment, scanning protocols, and patient conditions. Limited labeled data is another critical challenge. Annotating medical images requires expert knowledge, making it difficult to amass large datasets for training machine learning models.
The subtle and varied nature of stroke features further complicates detection. Early-stage ischemic strokes often present with minute changes in the brain that can be easily overlooked. These features can vary widely between patients, necessitating robust models that can generalize across diverse cases. Additionally, the inherent complexity of medical images, with their high-dimensional and noisy nature, poses a significant obstacle in accurately identifying stroke-related abnormalities.
Extracting important features from these images and making models that perform well in different settings is also challenging. Feature extraction requires sophisticated techniques to capture relevant information without losing critical details. Ensuring that these models work effectively across various clinical environments, with different imaging devices and patient demographics, adds another layer of difficulty.
Considering the above, this study seeks to introduce a novel approach for speedy detection of stroke in emergency situations by combining advanced image processing techniques with a unique deep learning network. The image processing techniques simplify and classify ischemic strokes more accurately, overcoming previous obstacles and improving diagnosis reliability. This technological advance not only boosts diagnostic precision in a shorter time but also integrates smoothly into clinical workflows, enhancing patient care and reducing the chances of long-term after-effects from strokes.

2. Materials and Methods

This study utilized both CT and MRI scans from Istishari Arab Hospital (IAH) to capture different stroke stages, including normal, acute, chronic, hyper-acute, and sub-acute. The equipment used includes the Philips Incisive scanner and the Philips Ingenia MRI scanner (Philips Healthcare, Amsterdam, Netherlands).
The brain CT scans were performed using the Philips Incisive scanner (128 slices) at IAH. The technical specifications for the CT scans were a tube current of 300 mA, a peak voltage of 120 kVp, a window width/level (WW/WL) of 80/40 HU, a matrix size of 512 × 512, a slice thickness of 0.625 mm, and a reconstruction technique utilizing iDose 3.
For the diffusion-weighted MRI (DW-MRI) sequences, the Philips Ingenia 1.5 T MRI scanner at IAH was employed. The technical specifications for the DW-MRI sequences included an echo time (TE) of 85 ms, a repetition time (TR) of 4000 ms, a field of view (FOV) of 230 × 230 mm, a matrix size of 152 × 106, and a b-Factor of 0 & 1000.
Neurologists and radiologists classified the strokes into hyper-acute, acute, sub-acute, and chronic categories, following recognized medical standards. Each classification was verified by clinical diagnosis and follow-up. The research protocol was approved by the Al-Quds University Institutional Review Board (IRB), under code 410/REC/2023.

2.1. Dataset

In 2023, our study collected a dataset of 10,000 images from public hospitals across Palestine, featuring five categories: normal, hyper-acute, acute, sub-acute, and chronic, with 2000 cases each. The images, standardized to 640 × 640 pixels and JPEG format, trained our advanced diagnostic model.
The dataset was divided into four training groups: Normal-Hyper-acute, Normal-Acute, Normal-Sub-acute, and Normal-Chronic. To enhance diversity and prevent overfitting, we applied data augmentation techniques, quadrupling the dataset size. The augmented dataset was split into training (70%), testing (20%), and validation (10%) segments, using 25-fold cross-validation and dropout layers. Additionally, a supplementary set of 500 real-world clinical images validated the model’s accuracy and clinical relevance.

2.2. Hybrid Ensemble Deep Learning for Clinical Stroke Diagnosis

This study introduces a novel HEDL model for diagnosing and classifying ischemic brain strokes in CT medical images. The model has three main phases: (1) medical image enhancement and data preparation, using the Software Process Engineering Metamodel (SPEM) model to improve image quality and clarity for clinical stroke diagnosis; (2) the HEDL model, combining Dense Convolutional Network with 121 layers (DenseNet121) and Ensemble Deep Random Vector Functional Link (edRVFL) algorithms for their effectiveness with stroke image datasets; and (3) a novel approach for stroke lesion detection and segmentation, utilizing Medical Segmentation Anything Model (MedSAM) and the “You Only Look Once version 5” (YOLOv5) object detection model [18,19,20].
We employed transfer learning to adapt the Visual Geometry Group 16-layer model (VGG16) and DenseNet121 architectures within the CNN to our stroke image dataset. A comparative analysis of both architectures helped identify the most effective model for integration with the edRVFL outputs, creating a robust hybrid deep learning framework. Figure 1 illustrates the architecture of the model, highlighting the integration process and the flow of data through the different components.

2.2.1. Stroke Precision Enhancement Model (SPEM)

The study introduced the stroke precision enhancement model (SPEM) as an approach for enhancing CT image quality to aid in stroke prediction through deep learning analysis. SPEM employs morphological erosion to reduce noise and simplify raw CT images, enhancing visibility for stroke diagnosis [21]. It then uses contrast-limited adaptive histogram equalization (CLAHE) to redistribute pixel intensity values, enhancing contrast crucial for accurate stroke prediction [22]. The effectiveness of CLAHE in SPEM is evaluated using the effective measure of enhancement (EME) and peak signal-to-noise ratio (PSNR), ensuring that SPEM enhances images while maintaining the integrity and accuracy needed for reliable stroke prediction.
The following equation is utilized for EME:
E M E = 1 K 1 K 2 L = 1 K 2 K = 1 K 1 20   l o g I m a x k , l I m i n k , l
where K 1 , K 2 are the number of horizontal and vertical blocks in the image, I m a x k , l , and I m i n k , l are the maximum and minimum pixel values in each block.
The peak signal-to-noise ratio (PSNR) was used to measure the deviation of the current image from the original image with respect to the peak value of the gray level. Given a reference image f and a test image g , both of size M × N , the PSNR between f and g is defined by:
P S N R ( f , g ) = 10 255 2 / M S E ( f , g )
where
M S E ( f , g ) = 1 M N i = 1 M j = 1 N f i j g i j 2
In SPEM, as the Mean Squared Error (MSE) approaches zero, the PSNR increases, indicating improved image quality. A higher PSNR means the enhanced image closely resembles the original CT scan, retaining essential details for accurate stroke prediction.
SPEM also uses the Laplacian of Gaussian (LoG) edge enhancement technique, which begins with Gaussian smoothing followed by the application of a Laplacian operator to highlight edges of pathological structures, crucial for stroke analysis [23]. Furthermore, SPEM employs unsharp masking to sharpen CT image details. The masking technique involves creating a blurred, or “unsharp”, version of the image and then subtracting it from the original. The resulting high-contrast mask is then added back to the original image. This step enhances the sharpness and clarity of the image without introducing unwanted noise or artifacts, maintaining the integrity of diagnostic details [24].

2.2.2. Adaptation of Ensemble Deep Random Vector Functional Link for Stroke Classification

In the context of our study, the ensemble deep random vector functional link (edRVFL) has been adapted for classifying strokes, which integrates deep and ensemble learning techniques, furthering the random vector functional link (RVFL) network’s legacy. Initially brought forward by Hu et al. (2023) and built upon the single-layer feedforward network introduced by Zhang et al. (2016), RVFL is praised for its straightforward design which is remarkably capable in several machine learning models [25,26,27].
The RVFL network is known for its effectiveness in machine learning, image classification, pattern recognition, and signal processing. It has a simple architecture with three layers:
  • Input layer: processes incoming data.
  • Hidden layer: uses randomly initialized weights for unbiased data processing.
  • Output layer: produces the final output.
During training, the focus is on adjusting the weights of the output layer, often by solving a linear least squares problem. Methods like the conjugate gradient method can improve this process’s efficiency.
In the RVFL, features from each model level are merged with the original input data after undergoing nonlinear transformations, enhancing the model’s complexity and effectiveness. The core computation of the RVFL is outlined by a specific mathematical formula that describes this process of feature integration and transformation:
y = W o   t a n h   t a n h   W i x + b i + b o  
wherein:
  • y stands for the predicted output.
  • Wo and Wi designate the weights associated with the output and hidden layers, respectively.
  • bi and bo are the bias terms corresponding to the hidden and output layers.
  • x is representative of the input data.
The edRVFL approach improves upon the basic RVFL by creating an ensemble of multiple RVFL networks, each trained on different segments of the dataset. This ensures that each network specializes in a unique subset of data. The outputs from these networks are then combined to form a more robust model. This integration results in reduced overfitting and improved generalization across diverse datasets. The mathematical framework of the edRVFL reflects this enhanced approach (see Figure 2):
e d R V F L x = F X W 1 + b 1 W 2 + b 2  
For clarity:
  • x symbolizes the input data.
  • W1 and W2 represent weight matrices corresponding to the sequential RVFL networks.
  • b1 and b2 denote bias vectors that are aligned with the respective RVFL networks.
  • F is indicative of a nonlinear activation function.
The training process for the edRVFL involves several key steps:
  • Begin by randomly assigning the weight matrices and bias vectors in the RVFL networks.
  • Conduct individual training sessions for each RVFL network, tailoring them to specific data subsets for the best results.
  • Combine the outputs from all RVFL networks to create a unified edRVFL output.
  • Repeatedly adjust and fine-tune the process by revisiting steps 2 and 3 until the model reaches optimal performance.

2.2.3. Enhancing Stroke Detection by Advancing edRVFL Classification Capabilities

Our research has refined the edRVFL algorithm parameters to better classify strokes, emphasizing early detection of acute events. We have enhanced the algorithm’s efficiency and accuracy in stroke classification, focusing on early-stage identification, reducing execution times, and minimizing reliance on conventional feature engineering.

In-Depth Hyperparameter Optimization

Our research focuses on optimizing edRVFL’s hyperparameters for better stroke classification. We have identified three key hyperparameters: enhancement node count, regularization parameter, and hidden layer quantity. Their precise calibration balances algorithm performance and resource efficiency.
We adjust enhancement nodes for optimal classification and manage the computational load. The regularization parameter is fine-tuned to prevent overfitting and ensure memory efficiency. The optimal number of hidden layers is crucial for handling complex data features while considering computational demands and execution speed.
The optimal hyperparameter values found are:
  • Enhancement nodes: 10
  • Regularization parameter: 100
  • Hidden layers: 10

Enhancing Feature Extraction and Clustering Techniques

We have enhanced the edRVFL framework by integrating DenseNet 121, a state-of-the-art deep learning architecture, improving its ability to detect complex patterns in stroke images and simplifying classification. This approach outperforms standard feature engineering in biological classification.
Our method surpasses traditional clustering techniques by leveraging the hierarchical structure of data and capturing subtle image relationships. We used a 25-fold cross-validation technique in the clustering process to ensure the robustness and reliability of our results. This is crucial for analyzing our curated dataset of 500 stroke cases, augmented for increased robustness and diversity. Our hierarchical clustering strategy achieves more precise and relevant grouping outcomes by accurately reflecting the nuanced relationships between images.

2.2.4. Ablation Study

In our study, we conducted an ablation study to dissect the contribution of individual components within our proposed ensemble model for diagnosing and classifying ischemic brain strokes in CT medical images. The baseline ensemble model consists of a combination of customized CNN, VGG16, and DenseNet121 architectures.
To assess the impact of each component within the ensemble model, we conducted a series of experiments wherein we systematically removed one model from the ensemble at a time. For instance, we eliminated the CNN and observed the resultant effects on performance metrics such as accuracy, precision, recall, F1 score, and AUC (area under the curve). The subsequent configurations after the elimination of one model include the ensembles of CNN + VGG16, CNN + DenseNet121, and VGG16 + DenseNet121, as well as individual models such as CNN, VGG16, and DenseNet121.
Objective measures, including accuracy, precision, recall, F1 score, and AUC, are employed to quantitatively evaluate the effectiveness of the model in enhancing stroke diagnosis accuracy.

2.2.5. Developing a Novel Method for Lesion Detection and Segmentation in Medical Imaging

Quantitative analysis in medical imaging relies heavily on segmentation, but traditional methods are limited by the need for extensive manual data labeling. Foundation models like Meta’s Segment Anything Model (SAM) by Alexander Kirillov et al. in 2023, and MedSAM by Bo Wang et al. in 2023, show promise in overcoming these limitations by pre-training on diverse datasets [18]. SAM provides broad segmentation capabilities across imaging types using text prompts. MedSAM, trained on over one million image–mask pairs, excels in medical segmentation, outperforming specialist models in some cases due to its superior generalization [19]. Furthermore, in our study, the YOLOv5 object detection model on the dataset was employed [20]. YOLOv5 was chosen for its exceptional performance in real-time object detection. It was trained for 250 epochs using the stochastic gradient descent (SGD) optimizer, with an initial learning rate of 0.001, and common data augmentation techniques like flipping, rotations, and cropping.
Our primary metric for segmentation accuracy was the Dice similarity coefficient (DSC), a widely used metric in medical imaging for evaluating segmentation performance. It measures the spatial overlap between two segmentation masks on a scale from 0 to 1, with a score of 1 indicating perfect overlap. The formula for calculating the DSC is:
D S C = 2   | A     B | | A | + | B |
A: Predicted Segmentation
B: True Segmentation

2.3. Model Validation

We evaluated various machine learning models for stroke prediction on a clinical dataset of 500 CT brain scans, comparing results with actual diagnoses. The evaluation used 25-fold cross-validation and metrics like accuracy, precision, recall, F1 score, and AUC to assess consistency and generalization, identifying the most effective algorithm for stroke detection.
The hybrid approach combines supervised detection and unsupervised segmentation to reduce manual annotation dependency. We employed YOLOv5 for real-time detection efficiency, achieving high accuracy in lesion detection on annotated scans from Palestinian hospitals. MedSAM was then used for precise lesion segmentation, guided by YOLOv5’s detections. This method leverages the strengths of both models, offering a robust solution for accurate medical image segmentation.

3. Results

3.1. Hybrid Model Performance before Enhancement

Table 1’s analysis reveals the performance of various machine learning classifiers on an original brain ischemic stroke dataset before integrating the SPEM model. For hyper-acute strokes, SVM led in accuracy (94.9%), closely followed by random forest (92.0%), with random forest (41.87 s) being quicker than SVM (53.05 s). In acute strokes, edRVFL was most accurate (95.4%), with logistic regression nearly matching (95.3%). Nonetheless, LR was faster in processing time than the edRVFL at 0.06 s and 1.01 s, respectively. Sub-acute stroke detection saw SVM and edRVFL both achieving 96.3% accuracy, with SVM faster at 0.34 s compared to edRVFL’s 0.45 s. For chronic strokes, SVM and edRVFL impressed with 97.5% and 97.9% accuracies, respectively, and were efficient in time, with SVM at 0.61 s and edRVFL at 0.74 s. Overall, SVM consistently offered high accuracy and time-efficiency across stroke types, with logistic regression and edRVFL also showing strong performance in both accuracy and speed.

3.2. Hybrid Model Performance with SPEM Image Enhancement Top of Form

Table 2 reveals the impact of the SPEM model alongside DenseNet121 on enhancing machine learning classifiers’ performance on an enhanced dataset. In hyper-acute stroke detection, edRVFL and logistic regression led with accuracies of 0.9958 and 0.9957, respectively, combining high precision with low processing times of 0.74 and 0.04 s. SVM also showed strong performance with a 0.9915 accuracy and 0.52-s processing time, while random forest, despite its lower accuracy of 0.9832, completed predictions in 3.30 s.
For acute strokes, SVM, logistic regression, and edRVFL classifiers all recorded accuracies above 0.99. SVM’s accuracy stood at 0.9930, requiring 24.74 s, whereas logistic regression showed a near-identical accuracy of 0.9922 but with a much quicker 0.42 s processing time. edRVFL achieved the highest accuracy at 0.9933, with a processing time of 2.46 s. Random Forest lagged slightly in accuracy (0.9716) and took longer at 42.73 s.
In sub-acute stroke analysis, edRVFL had the top accuracy of 0.9816, with a quick 0.45 s processing time. SVM and logistic regression both notched accuracies of 0.975, with logistic regression being the fastest at 0.03 s, slightly edging out SVM’s 0.31 s. Random Forest was less accurate (0.9509) but maintained efficiency with a 2.07 s processing time.
In chronic stroke analysis, classifier performance notably improved. The edRVFL classifier achieved an impressive accuracy of 0.9980 with a processing time of 1.02 s. SVM and logistic regression both recorded accuracies of 0.9959, with SVM slightly less time-efficient at 1.83 s, compared to logistic regression’s swift 0.06 s. Random Forest, although the least accurate at 0.9736, maintained a quick processing time of 8.36 s.

3.3. Insights from Ablation Study

The results of the ablation study, as summarized in Table 3, provide valuable insights into the significance of each model within the ensemble for stroke diagnosis.
The results indicate that DenseNet121 plays a crucial role in contributing to the overall performance metrics of the ensemble model. DenseNet121 significantly enhances the final performance when included in the ensemble, as evidenced by the high accuracy scores achieved in both the hyper-acute and acute stages. In particular, DenseNet121 achieved the highest accuracy scores of 0.9958 and 0.9933 for the hyper-acute and acute stages, respectively, surpassing the performance of the baseline model.
Moreover, removing DenseNet121 from the ensemble resulted in a significant decrease in performance across all stages, highlighting its substantial impact on the overall effectiveness of the model.
However, while the customized CNN architecture achieves the best result for the sub-acute stage, it does not perform as well as DenseNet121 in other stages. Interestingly, the combination of CNN and DenseNet121 in the ensemble resulted in the highest accuracy for the chronic stage, showcasing the complementary strengths of these two architectures when used together.
Based on these findings, only DenseNet121 was used in our proposed architecture. This decision is motivated by its superior performance across different stages and the desire to streamline the model architecture, thus avoiding unnecessary computational overhead and reducing processing time.
It is worth noting that all experiments were conducted on enhanced datasets utilizing contrast limited adaptive histogram equalization (CLAHE) with a clip limit of 4. Additionally, the final classification was performed using the enhanced discriminative random vector functional link (edRVFL) algorithm, ensuring robustness and accuracy in the diagnosis of ischemic brain strokes.

3.4. Hybrid Model’s Validation in Clinical Setting

Table 4 presents a comparison of hierarchical clustering’s performance on raw versus SPEM-enhanced stroke images using the edRVFL model. For hyper-acute stroke images, accuracy increased from 0.876 in the raw dataset to 0.933 with SPEM enhancement, demonstrating clear accuracy benefits for hyper-acute cases. In acute stroke images, accuracy improved from 0.881 with no enhancement to 0.948 post-SPEM application, underscoring the value of image enhancement in acute stroke classification.
Sub-acute stroke images saw a marked accuracy increase from 0.927 in the original to 0.974 with SPEM enhancement, indicating the technique’s effectiveness in sub-acute cases. Chronic stroke images experienced the most significant accuracy boost, from 0.928 in the raw dataset to 0.982 with enhancement, highlighting SPEM’s role in accurately classifying chronic strokes.

3.5. Hybrid Segmentation Model

Results in Table 5 indicated the success of a hybrid segmentation method combining MEDSAM and YOLOv5, validated with clinical data from the Palestinian healthcare system. The results indicated the hybrid model’s segmentation efficiency using Dice similarity coefficients (DSCs), comparing original and SPEM-enhanced data. Original data showed moderate to high segmentation efficiency across stroke types, with DSCs ranging from 0.7 to 0.96, the highest being for chronic strokes. SPEM enhancement significantly improved DSCs for acute, hyper-acute, and sub-acute strokes to 0.91, 0.92, and 0.98 respectively.
Results in Figure 3 reveal that the hybrid segmentation method, especially with SPEM-enhanced data, excels in accurately identifying stroke lesions in CT scans. The noticeable boost in Dice similarity coefficients (DSCs) for hyper-acute, acute, and sub-acute strokes highlights the significant role of image enhancement in increasing lesion detection accuracy for these stroke types. On the other hand, for chronic strokes, the unenhanced original data yielded marginally superior outcomes, indicating that SPEM’s effectiveness may differ based on the stroke’s nature and features.

4. Discussion

This study successfully demonstrates the potential of a HEDL model, empowered by image enhancement, for improving brain stroke detection and classification in clinical applications. This model leverages the complementary strengths of diverse architectures, notably DenseNet 121 and ensemble deep random vector functional link (edRVFL) networks, to achieve high accuracy across different stroke types. Furthermore, the integration of the SPEM image enhancement technique significantly elevates the model’s performance, particularly for hyper-acute and acute stroke types. These findings align with previous research highlighting the efficacy of hybrid deep learning approaches and image preprocessing for enhancing medical image analysis for diagnoses and classification [16,17,28].
The synergy within the hybrid model lies in the strategic combination of distinct strengths. edRVFL networks, recognized for their robustness and interpretability in medical applications [15,16], provide a reliable foundation for accurate stroke classification. DenseNet121, known for its exceptional efficiency and speed [17], complements edRVFL by ensuring rapid processing, a crucial factor for real-world clinical applications. This combination empowers the model to deliver high accuracy while maintaining practical feasibility. These findings align with other studies indicating the efficiency and robustness of DenseNet121 in medical diagnosis and object recognition [29,30].
Secondly, this study emphasizes the remarkable impact of image enhancement on model performance. The SPEM model, by enhancing image clarity and highlighting stroke-related abnormalities, significantly boosts the accuracy of all classifiers across diverse stroke types. This finding corroborates previous research advocating for the importance of image preprocessing and enhancement in boosting the accuracy and reliability of deep learning models in medical image analysis [5,6,8,31]. Notably, the accuracy improvement was most noticeable for hyper-acute, acute, and sub-acute stroke cases, suggesting that image enhancement may be particularly beneficial for these challenging categories.
The hybrid segmentation model, integrating MedSAM and YOLOv5, further strengthens the model’s capabilities by enabling accurate lesion detection and visualization. This approach, combining the object detection ability of YOLOv5 with MEDSAM’s lesion segmentation expertise, achieved high accuracy in identifying stroke lesions, particularly in acute and hyper-acute cases. This aligns with prior studies demonstrating the effectiveness of hybrid segmentation models for medical image analysis tasks [9,14,32].
Therefore, this study demonstrates the efficacy of a hybrid deep learning model with image processing for superior stroke detection and classification in clinical settings. The model’s high accuracy and operational efficiency, particularly through the hybrid prediction method, indicate its readiness for clinical use. The SPEM image enhancement technique boosts the precision of classifiers in identifying ischemic strokes, especially in hyper-acute and acute cases.
The integration of MEDSAM and YOLOv5 further improves accuracy, especially in acute and hyper-acute strokes.

5. Limitations

Despite these promising results, some limitations necessitate further investigation. First, the model requires validation and testing on larger and more diverse datasets to ensure generalizability and robustness. Second, enhancing the model’s interpretability would provide valuable insights into its decision-making process and build trust among clinicians. Finally, seamless integration with existing clinical workflows is crucial for successful real-world implementation. Addressing these limitations will promote the real-world deployment of this model, enabling more accurate and efficient brain stroke diagnosis and management, and thus better care and quality of life for stroke patients.

6. Conclusions

This study highlights the potential of a hybrid ensemble deep learning model with image enhancement for improved brain stroke detection and classification in clinical applications. The model’s high accuracy and efficiency, coupled with the effectiveness of the hybrid segmentation approach, suggests its potential for clinical adoption. The study demonstrates the effectiveness of the SPEM image enhancement technique in significantly improving the accuracy of machine learning classifiers for predicting ischemic strokes. Before enhancement, SVM, logistic regression, and edRVFL showed promise in accuracy and processing times. However, after enhancement, accuracy levels notably increased, particularly for hyper-acute and acute strokes. edRVFL and logistic regression consistently achieved high accuracy with low processing times, making them practical choices. SVM maintained high accuracy but required a slightly longer processing time, while random forest offered a balance between accuracy and speed. These results emphasize the potential of image enhancement methods for enhancing stroke prediction models and improving early diagnosis and treatment. This study encourages further research in this field to benefit healthcare professionals and patient outcomes.

Author Contributions

Conceptualization, R.Q.; methodology, R.Q. and I.Q.; software, O.D. and S.T.; validation, O.D., S.T. and S.A.; formal analysis, R.Q. and S.T.; data curation, I.Q. and O.D.; writing—original draft preparation, R.Q.; writing—review and editing, S.V.P., S.A., and D.A.A.-H.; resources, D.A.A.-H. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

The original data presented in the study are available on request.

Acknowledgments

The authors would like to acknowledge the Dunya Center for Cancer in Palestine for its vast logistical support in the implementation of this research, as well as the participants, doctors, radiologists, and field workers who have supported the successful completion of the study.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Fan, J.; Li, X.; Yu, X.; Liu, Z.; Jiang, Y.; Fang, Y.; Zong, M.; Suo, C.; Man, Q.; Xiong, L. Global Burden, Risk Factor Analysis, and Prediction Study of Ischemic Stroke, 1990–2030. Neurology 2023, 101, E137–E150. [Google Scholar] [CrossRef] [PubMed]
  2. Feigin, V.L.; Krishnamurthi, R.V.; Parmar, P.; Norrving, B.; Mensah, G.A.; Bennett, D.A.; Barker-Collo, S.; Moran, A.E.; Sacco, R.L.; Truelsen, T.; et al. Update on the global burden of ischemic and hemorrhagic stroke in 1990–2013: The GBD 2013 study. Neuroepidemiology 2015, 45, 161–176. [Google Scholar] [CrossRef] [PubMed]
  3. Martins, S.; de Silva, D. Activation Toolkit: Welcome to World Stroke Day, 29 October 2022. Available online: https://www.world-stroke.org/assets/downloads/WSD2022_Campaign_Toolkit_LR.pdf (accessed on 29 March 2024).
  4. Andersen, K.K.; Olsen, T.S.; Dehlendorff, C.; Kammersgaard, L.P. Hemorrhagic and ischemic strokes compared: Stroke severity, mortality, and risk factors. Stroke 2009, 40, 2068–2072. [Google Scholar] [CrossRef]
  5. Rahman, S.; Hasan, M.; Sarkar, A.K. Prediction of Brain Stroke using Machine Learning Algorithms and Deep Neural Network Techniques. Eur. J. Electr. Eng. Comput. Sci. 2023, 7, 23–30. [Google Scholar] [CrossRef]
  6. Reddy, M.K.; Kovuri, K.; Avanija, J.; Sakthivel, M.; Kaleru, S. Brain Stroke Prediction Using Deep Learning: A CNN Approach. In Proceedings of the 4th International Conference on Inventive Research in Computing Applications (ICIRCA), Coimbatore, India, 21–23 September 2022; pp. 775–780. [Google Scholar] [CrossRef]
  7. Chen, Y.-T.; Chen, Y.-L.; Chen, Y.-Y.; Huang, Y.-T.; Wong, H.-F.; Yan, J.-L.; Wang, J.-J. Deep Learning–Based Brain Computed Tomography Image Classification with Hyperparameter Optimization through Transfer Learning for Stroke. Diagnostics 2022, 12, 807. [Google Scholar] [CrossRef]
  8. Chen, R.; Cai, Y.; Wu, J.; Liu, H.; Peng, Z.; Xie, Y.; Xu, C.; Peng, X. Artificial intelligence-based identification of brain CT medical images. SPIE 2023, 1256009, 53–58. [Google Scholar] [CrossRef]
  9. Islam, R.; Debnath, S.; Palash, T.I. Predictive Analysis for Risk of Stroke Using Machine Learning Techniques. In Proceedings of the 6th International Conference on Computer, Communication, Chemical, Materials and Electronic Engineering (IC4ME2), Rajshahi, Bangladesh, 26–27 December 2021. [Google Scholar] [CrossRef]
  10. Omarov, B.; Tursynova, A.; Uzak, M. Deep Learning Enhanced Internet of Medical Things to Analyze Brain Computed Tomography Images of Stroke Patients. Int. J. Adv. Comput. Sci. Appl. 2023, 14, 668–676. [Google Scholar] [CrossRef]
  11. Xu, Y.; Holanda, G.; Souza, L.F.d.F.; Silva, H.; Gomes, A.; Silva, I.; Ferreira, M.; Jia, C.; Han, T.; de Albuquerque, V.H.C.; et al. Deep Learning-Enhanced Internet of Medical Things to Analyze Brain CT Scans of Hemorrhagic Stroke Patients: A New Approach. IEEE Sens. J. 2021, 21, 24941–24951. [Google Scholar] [CrossRef]
  12. Tasci, B. Automated ischemic acute infarction detection using pre-trained CNN models’ deep features. Biomed. Signal Process. Control 2023, 82, 104603. [Google Scholar] [CrossRef]
  13. Vamsi, B.; Bhattacharyya, D.; Midhunchakkravarthy, D.; Kim, J.Y. Early Detection of Hemorrhagic Stroke Using a Lightweight Deep Learning Neural Network Model. Trait. Du Signal 2021, 38, 1727–1736. [Google Scholar] [CrossRef]
  14. Gaidhani, B.R.; Rajamenakshi, R.; Sonavane, S. Brain stroke detection using convolutional neural network and deep learning models. In Proceedings of the 2019 2nd International Conference on Intelligent Communication and Computational Techniques (ICCT), Jaipur, India, 28–29 September 2019; pp. 242–249. [Google Scholar] [CrossRef]
  15. Al-Mekhlafi, Z.G.; Senan, E.M.; Rassem, T.H.; Mohammed, B.A.; Makbol, N.M.; Alanazi, A.A.; Almurayziq, T.S.; Ghaleb, F.A. Deep Learning and Machine Learning for Early Detection of Stroke and Haemorrhage. Comput. Mater. Contin. 2022, 72, 775–796. [Google Scholar] [CrossRef]
  16. Al-Mekhlafi, Z.G.; Senan, E.M.; Rassem, T.H.; Mohammed, B.A.; Makbol, N.M.; Alanazi, A.A.; Almurayziq, T.S.; Ghaleb, F.A. A Hybrid Stacked CNN and Residual Feedback GMDH-LSTM Deep Learning Model for Stroke Prediction Applied on Mobile AI Smart Hospital Platform. Sensors 2023, 23, 3500. [Google Scholar] [CrossRef]
  17. Prasad, K.S.; Pasupathy, S. A Machine Learning Framework with Hybrid Feature Engineering for Leveraging Brain Stroke Detection Performance. Math. Stat. Eng. Appl. 2022, 71, 1805–1824. [Google Scholar]
  18. Kirillov, A.; Mintun, E.; Ravi, N.; Mao, H.; Rolland, C.; Gustafson, L.; Xiao, T.; Whitehead, S.; Berg, A.C.; Lo, W.-Y.; et al. Segment Anything. arXiv 2023, arXiv:2304.02643. [Google Scholar]
  19. Ma, J.; He, Y.; Li, F.; Han, L.; You, C.; Wang, B. Segment Anything in Medical Images. Nat. Commun. 2024, 15, 654. [Google Scholar] [CrossRef] [PubMed]
  20. Shou, Y.; Meng, T.; Ai, W.; Xie, C.; Liu, H.; Wang, Y. Object Detection in Medical Images Based on Hierarchical Transformer and Mask Mechanism. Comput. Intell. Neurosci. 2022, 2022, 1–12. [Google Scholar] [CrossRef] [PubMed]
  21. Mahmoud, T.A.; Marshall, S. Medical image enhancement using threshold decomposition driven adaptive morphological filter. In Proceedings of the 2008 16th European Signal Processing Conference, Lausanne, Switzerland, 25–29 August 2008. [Google Scholar]
  22. Isa, I.S.; Sulaiman, S.N.; Mustapha, M.; Karim, N.K. Automatic contrast enhancement of brain MR images using Average Intensity Replacement based on Adaptive Histogram Equalization (AIR-AHE). Biocybern. Biomed. Eng. 2017, 37, 24–34. [Google Scholar] [CrossRef]
  23. Przelaskowski, A.; Sklinda, K.; Bargieł, P.; Walecki, J.; Biesiadko-Matuszewska, M.; Kazubek, M. Improved early stroke detection: Wavelet-based perception enhancement of computerized tomography exams. Comput. Biol. Med. 2007, 37, 524–533. [Google Scholar] [CrossRef]
  24. Wubuli, A.; Zhen-Hong, J.; Xi-Zhong, Q.; Jie, Y.; Kasabov, N. Medical image enhancement based on shearlet transform and unsharp masking. J. Med. Imaging Health Inform. 2014, 4, 814–818. [Google Scholar] [CrossRef]
  25. Shi, Q.; Katuwal, R.; Suganthan, P.N.; Tanveer, M. Random vector functional link neural network based ensemble deep learning. Pattern Recognit. 2021, 117, 107978. [Google Scholar] [CrossRef]
  26. Malik, A.; Gao, R.; Ganaie, M.; Tanveer, M.; Suganthan, P.N. Random vector functional link network: Recent developments, applications, and future directions. Appl. Soft.Comput. 2023, 143, 110377. [Google Scholar] [CrossRef]
  27. Hu, M.; Chion, J.H.; Suganthan, P.N.; Katuwal, R.K. Ensemble deep random vector functional link neural network for regression. IEEE Trans. Syst. Man Cybern. Syst. 2022, 53, 2604–2615. [Google Scholar] [CrossRef]
  28. Dhiman, H.; Kumar, R.; Rani, P. A Hybrid Model for Early Prediction of Stroke Disease. In Proceedings of the 2022 10th International Conference on Reliability, Infocom Technologies and Optimization (Trends and Future Directions) (ICRITO), Noida, India, 13–14 October 2022; pp. 1–6. [Google Scholar] [CrossRef]
  29. Lu, Z.; Oda, M.; Hayashi, Y.; Hu, T.; Itoh, H.; Watadani, T.; Abe, O.; Hashimoto, M.; Jinzaki, M.; Mori, K. Extremely imbalanced subarachnoid hemorrhage detection based on DenseNet-LSTM network with class-balanced loss and transfer learning. SPIE-Intl. Soc. Opt. Eng. 2021, 11597, 68. [Google Scholar]
  30. Zhou, Q.; Zhu, W.; Li, F.; Yuan, M.; Zheng, L.; Liu, X. Transfer learning of the ResNet-18 and DenseNet-121 model used to diagnose intracranial hemorrhage in CT scanning. Curr. Pharm. Des. 2022, 28, 287–295. [Google Scholar] [CrossRef]
  31. Chen, W.; Huang, Y.; Chong, C. Editorial: Post-stroke complications: Mechanisms, diagnosis, and therapies. Front. Neurol. 2023, 14, 1292562. [Google Scholar] [CrossRef]
  32. Islam, R.; Hossain, A.; Haque, M.S. Deep Neural Network based Algorithm for Stroke Prediction. In Proceedings of the International Conference on Recent Progresses in Science, Engineering and Technology (ICRPSET), Rajshahi, Bangladesh, 26–27 December 2022; pp. 1–4. [Google Scholar] [CrossRef]
Figure 1. Architecture of the model.
Figure 1. Architecture of the model.
Jimaging 10 00160 g001
Figure 2. The ensemble deep RVFL neural network architecture.
Figure 2. The ensemble deep RVFL neural network architecture.
Jimaging 10 00160 g002
Figure 3. Hybrid segmentation models (a) show the results of hyper-acute segmentation, and (b) show the chronic segmentation results.
Figure 3. Hybrid segmentation models (a) show the results of hyper-acute segmentation, and (b) show the chronic segmentation results.
Jimaging 10 00160 g003
Table 1. Performance evaluation of the original data using different machine learning classifiers, concerning the DenseNet121 Deep Learning Model.
Table 1. Performance evaluation of the original data using different machine learning classifiers, concerning the DenseNet121 Deep Learning Model.
Brain Ischemic Stroke Prediction
CasesClassifierAccuracyPrecisionRecallF1AUCTime (s)
Hyper-acuteRandom Forest0.9200.9040.9260.9150.97541.87
SVM0.9490.9400.9520.9460.98953.05
Logistic Regression0.9080.9070.8950.9010.9670.4
edRVFL0.9100.9100.9100.9100.9722.4
AcuteRandom Forest0.9070.9210.9270.9240.9688.63
SVM0.9390.9630.9370.9500.9872.52
Logistic Regression0.9530.9630.9600.9620.9880.06
edRVFL0.9540.9480.9470.9470.9901.01
Sub-acuteRandom Forest0.9330.9320.9190.9250.9922.35
SVM0.9630.9860.9320.9580.9920.34
Logistic Regression0.9570.9470.9600.9530.9940.04
edRVFL0.9630.9630.9630.9630.9960.45
ChronicRandom Forest0.9450.9500.9430.9470.9863.44
SVM0.9750.9830.9670.9750.9980.61
Logistic Regression0.9780.9880.9590.9790.9990.05
edRVFL0.9790.9800.9790.9790.9990.74
Table 2. Performance evaluation of the enhanced data using the SPEM model, using different machine learning classifiers, concerning the DenseNet121 Deep Learning Model.
Table 2. Performance evaluation of the enhanced data using the SPEM model, using different machine learning classifiers, concerning the DenseNet121 Deep Learning Model.
Enhanced Dataset at CLAHE (Clip Limit = 4)
CasesClassifierAccuracyPrecisionRecallF1AUCTime (s)
Hyper-acuteRandom Forest0.98320.99180.9760.98380.99953.30
SVM0.99150.98300.99450.99150.99980.52
Logistic Regression0.99570.99140.99320.99570.99990.04
edRVFL0.99580.99570.99580.99580.99990.74
AcuteRandom Forest0.97160.95340.98960.97110.996142.73
SVM0.99300.99310.99300.99300.999724.74
Logistic Regression0.99220.99190.99190.99190.99940.42
edRVFL0.99330.99330.99330.99330.99942.46
Sub-acuteRandom Forest0.95090.93980.96300.95120.99182.07
SVM0.97540.96390.98760.97560.99930.31
Logistic Regression0.97550.97530.97530.97530.99860.03
edRVFL0.98160.98170.98160.98170.99880.45
ChronicRandom Forest0.97360.96490.98920.97690.99808.36
SVM0.99590.99280.99410.99640.99991.83
Logistic Regression0.99590.99280.99760.99640.99970.06
edRVFL0.99800.99800.99800.99800.99991.02
Table 3. Comparative ablation study results for stroke diagnosis model configurations.
Table 3. Comparative ablation study results for stroke diagnosis model configurations.
CasesExperimentAccuracyPrecisionRecallF1 ScoreAUC
Hyper-acuteBaseline Model0.99230.99220.99230.99230.9998
Ensemble of CNN, VGG160.99060.99050.99060.99060.9996
Ensemble of CNN, DenseNet0.99360.99350.99360.99360.9999
Ensemble of VGG16, DenseNet0.99270.99260.99280.99270.9998
CNN0.99120.99110.99110.99110.9998
VGG160.98980.98960.98980.98970.9996
DenseNet1210.99580.99570.99580.99580.9999
AcuteBaseline Model0.99250.99260.99250.99250.9994
Ensemble of CNN, VGG160.99240.99250.99250.99250.9992
Ensemble of CNN, DenseNet0.99320.99310.99310.99310.9994
Ensemble of VGG16, DenseNet0.99260.99250.99260.99260.9993
CNN0.99280.99290.99290.99290.9994
VGG160.99220.99210.99210.99210.9992
DenseNet1210.99330.99330.99330.99330.9994
Sub-acuteBaseline Model0.98550.98540.98540.98540.9988
Ensemble of CNN, VGG160.98710.98710.98720.98720.9989
Ensemble of CNN, DenseNet0.98780.98770.98760.98770.9991
Ensemble of VGG16, DenseNet0.98720.98710.98710.98710.9987
CNN0.99320.99340.99330.99330.9991
VGG160.98110.98120.98110.98110.9989
DenseNet1210.98160.98170.98160.98170.9988
ChronicBaseline Model0.99610.99620.99630.99630.9998
Ensemble of CNN, VGG160.99520.99510.99510.99510.9996
Ensemble of CNN, DenseNet0.99810.99820.99810.99810.9998
Ensemble of VGG16, DenseNet0.99750.99760.99760.99760.9997
CNN0.99780.99790.99780.99780.9998
VGG160.99160.99150.99150.99150.9993
DenseNet1210.99800.99800.99800.99800.9999
Table 4. Comparative performance evaluation of hierarchical clustering on raw stroke images and images enhanced with the use of hybrid edRVFL model.
Table 4. Comparative performance evaluation of hierarchical clustering on raw stroke images and images enhanced with the use of hybrid edRVFL model.
AccuracyTime (s)
Hyper-AcuteAcuteSub-AcuteChronic
Original
without enhancement
0.8760.8810.9270.9280.2922
SPEM
at
CLAHE = 4
0.9330.9480.9740.9820.2550
Table 5. The efficiency of the hybrid segmentation model based on Dice similarity coefficients (DSCs) estimation compared to MedSAM.
Table 5. The efficiency of the hybrid segmentation model based on Dice similarity coefficients (DSCs) estimation compared to MedSAM.
Stroke LevelDice Similarity Coefficient (MedSAM)Dice Similarity Coefficient (SPEM)
Acute0.70.91
Hyper-acute0.710.92
Sub-acute0.80.92
Chronic0.960.98
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Qasrawi, R.; Qdaih, I.; Daraghmeh, O.; Thwib, S.; Vicuna Polo, S.; Atari, S.; Abu Al-Halawa, D. Hybrid Ensemble Deep Learning Model for Advancing Ischemic Brain Stroke Detection and Classification in Clinical Application. J. Imaging 2024, 10, 160. https://doi.org/10.3390/jimaging10070160

AMA Style

Qasrawi R, Qdaih I, Daraghmeh O, Thwib S, Vicuna Polo S, Atari S, Abu Al-Halawa D. Hybrid Ensemble Deep Learning Model for Advancing Ischemic Brain Stroke Detection and Classification in Clinical Application. Journal of Imaging. 2024; 10(7):160. https://doi.org/10.3390/jimaging10070160

Chicago/Turabian Style

Qasrawi, Radwan, Ibrahem Qdaih, Omar Daraghmeh, Suliman Thwib, Stephanny Vicuna Polo, Siham Atari, and Diala Abu Al-Halawa. 2024. "Hybrid Ensemble Deep Learning Model for Advancing Ischemic Brain Stroke Detection and Classification in Clinical Application" Journal of Imaging 10, no. 7: 160. https://doi.org/10.3390/jimaging10070160

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop