Next Article in Journal
Association between ADAM33 Single-Nucleotide Polymorphisms and Treatment Response to Inhaled Corticosteroids and a Long-Acting Beta-Agonist in Asthma
Previous Article in Journal
Preoperative Hilar and Mediastinal Lymph Node Staging in Patients with Suspected or Diagnosed Lung Cancer: Accuracy of 18F-FDG-PET/CT:A Retrospective Cohort Study of 138 Patients
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Computer-Aided Diagnosis System for Blood Diseases Using EfficientNet-B3 Based on a Dynamic Learning Algorithm

by
Sameh Abd El-Ghany
1,*,
Mohammed Elmogy
2 and
A. A. Abd El-Aziz
1
1
Department of Information Systems, College of Computer and Information Sciences, Jouf University, Sakakah 42421, Saudi Arabia
2
Information Technology Department, Faculty of Computers and Information, Mansoura University, Mansoura 35516, Egypt
*
Author to whom correspondence should be addressed.
Diagnostics 2023, 13(3), 404; https://doi.org/10.3390/diagnostics13030404
Submission received: 31 December 2022 / Revised: 17 January 2023 / Accepted: 20 January 2023 / Published: 22 January 2023
(This article belongs to the Section Machine Learning and Artificial Intelligence in Diagnostics)

Abstract

:
The immune system’s overproduction of white blood cells (WBCs) results in the most common blood cancer, leukemia. It accounts for about 25% of childhood cancers and is one of the primary causes of death worldwide. The most well-known type of leukemia found in the human bone marrow is acute lymphoblastic leukemia (ALL). It is a disease that affects the bone marrow and kills white blood cells. Better treatment and a higher likelihood of survival can be helped by early and precise cancer detection. As a result, doctors can use computer-aided diagnostic (CAD) models to detect early leukemia effectively. In this research, we proposed a classification model based on the EfficientNet-B3 convolutional neural network (CNN) model to distinguish ALL as an automated model that automatically changes the learning rate (LR). We set up a custom LR that compared the loss value and training accuracy at the beginning of each epoch. We evaluated the proposed model on the C-NMC_Leukemia dataset. The dataset was pre-processed with normalization and balancing. The proposed model was evaluated and compared with recent classifiers. The proposed model’s average precision, recall, specificity, accuracy, and Disc similarity coefficient (DSC) were 98.29%, 97.83%, 97.82%, 98.31%, and 98.05%, respectively. Moreover, the proposed model was used to examine microscopic images of the blood to identify the malaria parasite. Our proposed model’s average precision, recall, specificity, accuracy, and DSC were 97.69%, 97.68%, 97.67%, 97.68%, and 97.68%, respectively. Therefore, the evaluation of the proposed model showed that it is an unrivaled perceptive outcome with tuning as opposed to other ongoing existing models.

1. Introduction

The human body is supplied with essential substances by blood. The three main components of blood cells in the human body are erythrocytes (red blood cells (RBCs)), leukocytes (white blood cells (WBCs)), and thrombocytes (platelets). RBCs transport oxygen throughout the body and platelets assist with blood clotting in the event of injury. Human infections are prevented and fought by WBCs. Although WBCs only comprise one percent of the blood’s volume, even small changes can significantly impact the human immune system. Changes in the blood’s WBC count present a challenge. An abnormally high number of WBCs can prevent infection [1].
One of the most lethal diseases known is cancer, defined as abnormal and uncontrolled WBC growth [2]. The world health organization (WHO) estimates that approximately 10 million people will die from cancer in 2020, representing an increase of nearly 60% from 2000. Around 19.3 million people will be diagnosed with the disease [3]. Between now and 2040, it is anticipated that the number of people affected will increase by approximately 50% [4]. The disease’s age determines the four main types of leukemia, which are acute myelogenous leukemia (AML), chronic lymphocytic leukemia (CLL), chronic myeloid leukemia (CML), and acute lymphoblastic leukemia (ALL) [5,6]. The bone marrow, where blood cells are made, is typically where ALL, a type of blood cancer, begins. WBCs are associated with this type of cancer. The abnormal cells in intense leukemia develop and spread rapidly, requiring brief therapy, though ongoing leukemia is hard to distinguish in its beginning phases. The immune system is made more vulnerable because of the blood’s inability to perform its normal function. Additionally, most of the body is at risk due to the bone marrow’s inability to produce healthy platelets and red blood cells [7].
Bone marrow produces many abnormal WBCs in ALL patients. These WBCs have the potential to enter the bloodstream, causing harm to the liver, spleen, brain, and kidneys, among other organs, that can lead to additional cancers that could kill the human. Because ALL can disseminate throughout the body quickly, if it is not treated or diagnosed early, it can sometimes result in death. The essential information regarding ALL reveals that more than 5690 ALL cases will occur in the USA in 2021 [8]. More than 1550 people, including children and adults, are expected to die. Leukemia, and especially ALL, can be easily treated when diagnosed early. Leukemia can be challenging to diagnose because its symptoms resemble joint pain, bone pain, anemia, weakness, and fever.
In most cases, specific symptoms and signs lead doctors to suspect ALL in patients, while various clinical examinations validate the ALL diagnosis. Blood tests are frequently performed during the initial stage on suspected ALL patients. Complete blood counts and peripheral blood smears are performed to monitor changes in the number and appearance of WBCs and other blood cells. Utilizing chromosome-based tests such as cytogenetics, fluorescence in situ hybridization, and polymerase chain reaction, in which chromosomes are observed to recognize unusual blood cells, improves the accuracy of the diagnosis of ALL [4].
Hematologists normally utilize various intrusive techniques to analyze diseases. A biopsy, an invasive procedure that examines spinal fluid, bone marrow, or blood, is frequently performed [9]. During these examinations, the doctor checks to see if there are enough WBCs and other relevant physical signs to indicate the presence of ALL. These techniques are time-consuming, expensive, and painful. Because the results of these manual and expert-specific techniques are heavily dependent on the expertise and knowledge of the expert performing the analysis, they are also susceptible to error [10]. Medical image-analysis-based techniques provide quicker, safer, and less expensive solutions while avoiding the complications of such invasive procedures. It is simple to generalize image processing and computer-vision-based methods and eliminate human error.
Radiologists can easily perform image-based analysis. Even though they are non-invasive, these techniques can still have the same drawbacks as invasive ones. When large datasets containing hundreds of thousands of medical images are required to be analyzed by human experts, the manual analysis performed by radiologists becomes extremely laborious, error-prone, and time-consuming. In medical images, particularly histopathology images, inherent texture and morphology, which are overlapping features, make the task more difficult. Figure 1 depicts images of ALL cancerous and healthy cells [11].
Because of their low interclass detachability and high intraclass homogeneity, leukocytes are difficult to distinguish. Therefore, it is essential to develop methods for the accurate and dependable identification of leukemia for prompt diagnosis and early treatment. Nonetheless, as a rule, ALL malignant growth cells vary from solid cells based on different variables, including morphology, cell size, shape, and surface [12]. A fully automated solution based on deep learning (DL) can utilize any of these attributes to recognize healthy cells and ALL disease images.
In recent years, DL, or deep neural networks (DNNs), has emerged as a cutting-edge technology for speech recognition and computer vision [13]. DL is an artificial intelligence (AI) technique considering the portrayal of information learning. DL methods employ supervised, semi-supervised, and unsupervised learning. Learning models are very different depending on the learning framework. The basis of DL is the efficient manual replacement of features using hierarchical feature extraction and semi-supervised or unsupervised feature learning [14]. Between the input layer and the output layer in DL, there are several hidden layers. It has several practical advantages. DL supports both labeled and unlabeled datasets. It can extract high-level features from the dataset as a result. The CNN model is the most common method for analyzing medical images [15]. The backpropagation calculation enables the CNN model to adaptively learn spatial highlights in clinical images.
For ALL detection models to acquire characteristic spatial domain features, they must be trained with sufficient images. DL methods work well when many images are available for learning [16]. The use and availability of images for training are considered when selecting the hyperparameters and the model parameters. After that, the model uses the utilized dataset to learn from and classify images by updating the new weights for the specified number of classes following each training step. To improve the performance of the artificial neural network (ANN) model in specific applications, hyperparameters such as the learning rate (LR), batch size (BS), and momentum need to be fine-tuned. The most critical hyperparameter used to improve model accuracy during CNN model training is the LR. The conventional LR strategies of exponential decay, step decay, and constant LR use trial and error to determine the application-appropriate LR. Model learning with a fixed LR strategy is utilized as a baseline method rather than its alternatives. The model converges slowly when the LR is too low, and when the LR is too high, the model learning diverges, resulting in inadequate solutions. The network converges in fewer iterations at optimal LRs. The LR determines the extent of the loss gradient that is backpropagated to move to global minima. When the gradient reaches its local minimum, only the computational cost of making progress is worth it. If there is no improvement in accuracy after a few epochs or if the LR remains stuck at local minima, adaptive LR training methods use an LR that fluctuates by a predetermined value. In contrast, in the nonadaptive schedule, the LR will either decrease gradually over time at each epoch in small steps or remain constant throughout the training.
The cyclical learning rate (CLR) [17], stochastic gradient descent (SGD) with warm restarts (SGDWR) [18], also known as stochastic weight averaging (SWA), and cosine annealing are three additional dynamic LR methods that have recently emerged [19]. During training, the CLR fluctuates between predetermined upper and lower boundary values. The LR is kept at a very low level but gradually rises until it reaches its highest point. In contrast, in the nonadaptive schedule, the LR decreases gradually at each epoch in small steps or remains constant throughout the training.
The LR then drops back to the underlying worth of finishing one cycle. Consequently, a cycle has two steps and a fixed step size—the number of loops over which the LR rises to its highest value. The pattern repeats itself after each training cycle until the final epoch of the triangular LR. While increasing the LR will affect accuracy in the short term, it will also reduce training-related loss in the long run [17].
The training algorithm for deep neural networks (DNN) is the DNN with SGD [18]. After each epoch, the optimizer updates the parameters. Figure 2 shows that convergence time increases at saddle point plateaus when the learning rate is low, even though optimization occurs in small steps. In nonconvex optimization problems, avoiding saddle points is possible by increasing the learning rate.
Another modality of the dynamic learning rate schedule is cosine annealing [19]. The annealing schedule depends on the cosine function and begins with a large learning rate that gradually decreases to a minimum value before rapidly increasing again. The cosine annealing schedule is represented by Equation (1). For the ith run, each batch’s learning rate decreases with cosine annealing.
ηt = ηi min + 0.5 (ηi max − ηi min)(1 + cos(Tcur*π/Ti)
where ηi min and ηi max are the learning rate ranges and Tcur is how many epochs have passed since the most recent restart [20].
In this research, we set up a custom LR that compared the loss value and training accuracy at the beginning of each epoch to that of the previous epoch. We increased the LR if they were smaller and decreased the LR if they were larger.
In this research, we proposed a fully automated solution based on the CNN model to classify ALL cell images automatically. The proposed model used the EfficientNet-B3 model to classify images of ALL and healthy cells. The C-NMC_Leukemia dataset was pre-processed with normalization and balancing to train the designed DL model from scratch to find the most relevant parameter values and improve network convergence. By manipulating the LR, we investigated the best settings for the EfficientNet-B3 model to achieve a high classification accuracy. During the training phase, we used dynamic learning. We set up a custom LR that compared the loss value and training accuracy at the beginning of each epoch to that of the previous epoch. We increased the LR if they were smaller and decreased the LR if they were larger. In addition to the detection of ALL, the proposed model was used to distinguish between uninfected and parasitized microscopic images with an average accuracy of 97.68%. As a result, the model performed better and achieved a relatively high-level performance than traditional methods with a fixed LR. The following is a summary of the contributions made in this research:
  • For ALL disease prediction, a robust model using the EfficientNet-B3 CNN model and dynamic LR was proposed to distinguish between benign and malignant cells accurately and reliably.
  • We compared the proposed model with five other techniques: EfficientNet-B0, EfficientNet-B1, EfficientNet-B2, InceptionResNetV2, and DenseNet121.
  • With an average accuracy of 97.68%, the proposed model differentiated between parasitized and uninfected microscopic images.
The remainder of this paper is formulated as follows. Section 2 presents a literature review of CAD diagnostic systems. Section 3 displays the methodology of the proposed framework, the pre-processing of the two datasets, and the training of five classifiers. The experimental results of the proposed framework are shown in Section 4. Finally, the conclusion of our proposed framework is provided in Section 5.

2. Literature Review

Abir et al. [1] proposed a model to recognize ALL as an automated method that used various transfer learning models. This method employs local interpretable model-agnostic explanations (LIME) to guarantee reliability and validity. The proposed technique using the InceptionV3 model achieved an accuracy of 98.38%. Different approaches to transfer learning, such as InceptionResNetV2, VGG19, and ResNet101V2, were tested, and the results were found to be consistent with the proposed method using the LIME algorithm for explainable artificial intelligence (XAI).
In Mondal et al. [4], the ALL-recognition task was automated using CNN models. A weighted ensemble of deep CNN models was investigated to recommend a better ALL-cell classifier. A variety of data augmentations and pre-processing was incorporated to improve the network’s generalizability. The C-NMC-2019 ALL dataset was used to train and evaluate the proposed model. The proposed model had an area under the curve (AUC) of 0.948, a balanced accuracy of 88.3%, and a weighted F1-score of 89.7%. On the other hand, the ensemble models, such as InceptionResNet-V2, DenseNet-121, Xception, MobileNet, and VGG-16, typically had coarse and scattered learned areas not present in the ensemble.
A non-invasive, medical image-based diagnosis method based on CNN models was presented by Amin et al. [12]. A CNN-based model was used in the proposed solution to extract higher-quality features from the dataset using a module called efficient channel attention (ECA) and the visual geometry group from Oxford (VGG16). The proposed approach demonstrated that the ECA module aided in overcoming the morphological similarities that exist between images of healthy cells and ALL cancer. Training data quantity and quality were also increased using various augmentation methods. The proposed CNN model successfully extracted in-depth features with an accuracy of 91.1%. The results demonstrated that pathologists would gain from the proposed method’s ability to diagnose ALL.
Khandekar et al. [21] proposed an automation system using AI to automate the detection of blast cells. A method for object detection was incorporated into the proposed automation system, using images of microscopic blood smears to predict leukemic cells. The authors used the You Only Look Once (YOLO) algorithm for cell classification and detection in its fourth version. As a result, the classification was performed as a binary problem, with each cell classified as either healthy cells (HEM) or blast cells (ALL). Images from the ALL_IDB1 and C_NMC_2019 datasets were training and testing grounds for the Object Detection algorithm. The ALL-IDB1 dataset had a mean average precision (mAP) of 96.06%, while the C_NMC_2019 dataset had an mAP of 98.7%. During pre-screening, this proposed blast cell detection algorithm could help identify leukemia from microscopic blood smear images.
Almadhor et al. [22] used naive Bayes (NB), K-nearest neighbor (KNN), random forest (RF), and support vector machine (SVM) in proposing an ensemble automated prediction strategy. They used the C-NMC leukemia dataset from Kaggle. The C-NMC leukemia dataset is broken up into two groups: healthy and cancer cells. The results showed that SVM outperforms other algorithms with an accuracy of 90%.
Kasani et al. [23] proposed an aggregated DL model to classify leukemic B-lymphoblasts. The authors used data augmentation methods to create more training samples for the small dataset. A transfer learning concept was used to accelerate the training process and further improve the proposed network’s results to create a deep learner that was both reliable and accurate. With a test dataset accuracy of 96.58% for Leukemic B-lymphoblast diagnosis, the proposed method outperformed individual networks by combining features from the best deep learning models.
Liu et al. [24] proposed a ternary stream fine-grained classification model to differentiate lymphoblasts from normal white blood cells and reactive lymphocytes. The proposed model is based on microscopic images of peripheral blood smears. Using the C-NMC dataset, the model achieved outstanding accuracy (approximately 91.90%) and showed a promising performance in distinguishing morphological cell types.
From the previous review of the current studies conducted recently, we can conclude that no study achieved ideal classification of ALL diseases and detection of malarial parasites. However, with the C-NMC_Leukemia dataset, our proposed model achieved an average precision, recall, specificity, accuracy, and Disc similarity coefficient (DSC)98.29% or 97.83%, 97.82%, 98.31%, and 98.05%, respectively. Moreover, with the National Institutes of Health (NIH) dataset, the proposed model achieved an average precision, recall, specificity, accuracy, and DSC of 97.69%, 97.68%, 97.67%, 97.68%, and 97.68%, respectively.

3. Materials and Methods

3.1. Datasets Description

The proposed model used the C-NMC_Leukemia dataset [11] for ALL-cell prediction and the Giemsa-stained thin blood images from the dataset created by the NIH for malaria diagnosis [25]. ALL causes about 25% of pediatric cancers. Under the microscope, it is generally hard to differentiate between normal cells and immature leukemic blasts due to the similar morphological appearance of the two types of cells. An experienced oncologist assigned a normal or malignant classification to images in this dataset. The C-NMC_Leukemia dataset was divided into three sets: the training set with 9594 (90%) images, the validation set with 533 images (0.05%), and the test set with 534 images (0.05%). The NIH’s dataset images were taken from 50 healthy people and 150 people with Plasmodium falciparum infection. The dataset has a total of 27,558 images of red blood cells. Thirteen thousand seven hundred seventy-nine people were infected, and 13,779 were not. NIH’s dataset was divided into three sets: the training set with 24,804 (90%) images, the validation set with 1378 images (0.05%), and the test set with 1378 images (0.05%).

3.2. Model Architecture and Training

The EfficientNet-B3 is currently demonstrating true success in classification. An automated system for processing C-NMC_Leukemia’s images to identify ALL-cell disease is the goal of the comprehensively proposed model. The following sections will discuss the proposed model. Figure 3 details the proposed model’s steps: (1) image pre-processing, (2) dataset normalization, (3) model training, (4) training evaluation, and (5) test evaluation.
The proposed model began with downloading the two image datasets and continued with image pre-processing. The two image datasets were pre-processed with normalization, contrast handling, resizing, and artifact removal. Pre-processing the two datasets effectively yields accurate results, making it one of the most important steps. After pre-processing the two datasets, we divided them into the training set (90%), validation set (0.05%), and test set (0.05%). In the third step, we used transfer learning to train the EfficientNet- B3. The first step of transfer learning is supervised pre-training, in which we downloaded the ImageNet neural network with parameters that had been trained beforehand on a large dataset. In the second step of the transfer learning, we used the C-NMC_Leukemia and the NIH datasets to fine-tune the EfficientNet-B3 network. In the fourth step, we calculated the training dataset’s error. If the training dataset’s error was not low, we re-trained the model. If the training dataset’s error was low, we calculated the test dataset’s error. If the test dataset’s error was not low, we re-trained the model. Table 1 lists the algorithm for the dynamic LR.

3.2.1. Data Pre-Processing

In this research, data pre-processing was necessary since the images differ in resolution, pixel-level noise, size, bright text, and symbols. An image mask was executed on the images to address such artifacts using Equation (2). Moreover, the contrast of images might differ. The training images’ contrasts were normalized in the training phase to solve this problem. After that, we filtered the images to remove noise. Each pixel image was precisely subtracted from the three main colors’ average: red, green, and blue (RGB).
M a s k ( m , n ) = { max i   , i ( m , n ) min i   0   o t h e r w i s e .
By normalizing the data in various ways, the effect of various pixel intensities can be defined. The normalized data PI* is obtained using the normalization method from the pixel intensity recorded as PI. This pixel has a value that ranges from 0 to 255 for each of the three primary colors. Consequently, the value of each pixel was divided by 255 to achieve normalization. The normalization was based on the maximum and minimum values of the experiment, as presented by Equation (3). The pictures were resized to a decent goal of 300 by 300. Figure 4 depicts a blood cell before and after the pre-processing.
P I * = ( P I l M i n o l d ) M a x n e w M i n n e w M a x o l d M i n o l d + M i n n e w ,   l [ 0 , n ]

3.2.2. EfficientNet-B3

Recently, Tan and Le [26] investigated the connection between the width and depth of CNN models and devised an effective approach for designing CNN models with fewer parameters but a greater classification accuracy. They proposed seven such models, which they referred to as EfficientNet-B0 to EfficientNet-B7. They referred to them as EfficientNet CNN models. When EfficientNet CNN models were applied to the ImageNet dataset, they demonstrated that their models outperformed all recent models regarding the number of Top-1 accuracy and parameters [27].
A novel approach to scaling CNN models is the foundation for the EfficientNet family. It makes use of a straightforward yet powerful compound coefficient. Uniquely in contrast to conventional strategies that scale aspects of organizations, such as width, profundity, and goal, EfficientNet scales each aspect with a proper set of scaling coefficients consistently. Scaling individual aspects works on model execution but adjusting all organization components regarding the accessible assets works on overall execution.
EfficientNet is much smaller than other models, with ImageNet accuracy comparable to its own. For instance, the ResNet50 model, as found in the Keras application, has 23,534,592 boundaries. Still, it needs to meet the expectations of the littlest EfficientNet (called EffecientNet-B0), which has 5,330,564 boundaries. We proposed an effective model based on the EfficientNet-B3 CNN model because it strikes a good balance between accuracy and computational power [27].
The primary component of the EfficientNet model family is mobile inverted bottleneck convolution (MBConv). The MobileNet models’ concepts were the foundation for MBConv [28]. One central idea was to use depthwise separable convolutions, which entailed layering a pointwise and a depthwise convolution. The following two additional concepts were taken from MobileNet-V2, the second improved version of MobileNet: (1) residual connections that were inverted and (2) linear bottlenecks.
The EfficientNet model family begins with its stem, which is where all the experimenting with the architecture starts. The stem is common to all eight models and the final layers.
After the stem, there are seven blocks. Additionally, these blocks have varying sub-blocks, and their number grows as they progress from EfficientNet-B0 to EfficientNet-B7. The total number of layers in EfficientNet-B0 is 237, while the total number in EfficientNet-B7 is 813. The second module is the foundation for the first sub-block of the seven main blocks, except the first. Module 3 is connected to all the sub-blocks via a skip connection. The skip connection in the first sub-blocks is combined with this Module 4. Module 5 brings together each sub-block by connecting it in a skip fashion to the one before it. Finally, sub-blocks are created by combining these modules with being used in particular ways in the blocks [27]. Figure 5 shows the structure of the EfficientNetB3 model.

3.2.3. Dwell

When the validation loss on the current epoch is greater than that on the previous epoch, the DWELL callback can be useful for training a model. The proposed model has reached a point in N space (where N is the number of trainable parameters) that is less favorable than that for the previous epoch when that occurs. The callback checks for this condition and sets the model weights to those of the epoch with the least amount of validation loss if it finds it. Additionally, it slows down learning. We will remain at the same unfavorable point in N space if the LR for the subsequent epoch is kept.

4. Model Implementation and Evaluation

4.1. Model Evaluation Metrics

The proposed model was evaluated using the accuracy, precision, sensitivity, specificity, and DSC, which are presented in Equations (4)–(8).
Accuracy = (TP + TN)/(TP + TN + FP + FN)
Precision = TP/(TP + FP)
Sensitivity = TP/(TP + FN)
Specificity = TN/(TN + FP)
DSC = (2 TP)/(2 ∗ TP + FP + FN)
True positive, true negative, false positive, and false negative are represented by TP, TN, FP, and FN, respectively. Precision is defined as how much of the samples correctly predicted by the model would be positive. Sensitivity is defined as the ratio of the number of actual positives to the number of true positives. Specificity is defined as the ratio of the number of actual negatives to the number of true negatives. DSC is the harmonic average of recall and precision.

4.2. Model Implementation

The C-NMC_Leukemia dataset was split into 70% for the training, 17% for the testing, and 12% for the validation. The implementation was carried out in the Kaggle environment. The characteristics of the PC used for the experiments are x64-based Intel (R) Core (TM) i7-10510U CPU with 1.80 GHz and 2.30 GHz, 16 GB of memory, and a 64-bit Windows platform.
Table 2 shows the experiments’ results for EfficientNet-B3 and five other CNN models: EfficientNet-B0, EfficientNet-B1, EfficientNet-B2, InceptionResnetV2, and DenseNet121 using fixed LR. All CNN models were applied to the C-NMC_Leukemia dataset for the binary classification, in which we distinguished benign and malignant cells. Table 2 shows that EfficientNet-B3, EfficientNet-B0, EfficientNet-B1, EfficientNet-B2, InceptionResnetV2, and DenseNet121 had accuracy averages of 97.57%, 93.82%, 94.38%, 95.51%, 93.07%, and 82.21%, respectively. The EfficientNet-B3 model had the highest precision, recall, accuracy, and DSC average, equal to 97.42%, 96.96%, 97.57%, and 97.18%, respectively. The EfficientNet-B2 model had the highest average specificity, equal to 98.90%.
Table 2 shows the binary classification results of the six CNN models, EfficientNet-B3, EfficientNet-B0, EfficientNet-B1, EfficientNet-B2, InceptionResnetV2, and DenseNet121, on the C-NMC_Leukemia dataset. This experiment distinguished ALL (cancer) and Hem (healthy) classes.
For the ALL class, precision, recall, specificity, accuracy, and DSC were 97.82%, 98.63%, 95.29%, 97.57%, and 98.22% for EfficientNet-B3, respectively. EfficientNet-B3 and EfficientNet-B1 had the highest precision, 97.01% and 99.28%, respectively. EfficientNet-B3, EfficientNet-B0, and EfficientNet-B2 had the highest recall of 98.63%, 99.73%, and 98.9%, respectively. EfficientNet-B3 and EfficientNet-B1 achieved the highest specificity, 95.29% and 95.8%, respectively. EfficientNet-B3 and EfficientNet-B2 had the highest accuracy, 97.57% and 95.5%, respectively. EfficientNet-B3 and EfficientNet-B2 achieved the highest DSC, equal to 98.22% and 96.77%, respectively.
For the Hem class, precision, recall, specificity, accuracy, and DSC were 97.01%, 95.29%, 98.63%, 97.57%, and 96.14% for EfficientNet-B3, respectively. EfficientNet-B3 and EfficientNet-B0 had the highest precision, 97.82% and 97.99%, respectively. EfficientNet-B3, EfficientNet-B1, and EfficientNet-B2 had the highest recall of 95.29%, 95.88%, and 95.51%, respectively. EfficientNet-B3, EfficientNet-B0, EfficientNet-B2, and InceptionResNetV2 achieved the highest specificity of 98.63%, 99.73%, 98.90%, and 97.53%, respectively. EfficientNet-B3 and EfficientNet-B2 had the highest accuracy, 97.57% and 95.51%, respectively. EfficientNet-B3 and EfficientNet-B2 achieved the highest DSC, 96.14% and 92.59%, respectively. Figure 6 shows the training and validation losses and accuracy of the six CNN models using a fixed LR.
Figure 7 shows the test dataset’s confusion matrix for the six CNN models using the fixed LR. The test dataset was classified into two classes: ALL (cancer) with 364 images and Hem (healthy) with 170 images. For class ALL, the accuracy of the EfficientNet-B3 model was 98.6%, as it predicted 359 images correctly out of 364. The accuracy of the EfficientNet-B0 model was 99.7%, as it predicted 363 images correctly. The accuracy of the EfficientNet-B1model was 93.6%, as it predicted 341 images correctly. The accuracy of the EfficientNet-B2 model was 98.9%, as it predicted 360 images correctly. The accuracy of the InceptionResNetV2 model was 97.5%, as it predicted 355 images correctly, and the accuracy of the DenesNet121 was 81.5%, as it predicted 297 images correctly.
Table 3 depicts the outcomes of the experiments for EfficientNet-B3 and five additional CNN models using a dynamic LR, EfficientNet-B0, EfficientNet-B1, EfficientNet-B2, InceptionResnetV2, and DenseNet121. We used all CNN models on the C-NMC_Leukemia dataset for the binary classification to distinguish between benign and malignant cells.
Table 3 shows that EfficientNet-B3, EfficientNet-B0, EfficientNet-B1, EfficientNet-B2, InceptionResnetV2, and DenseNet121 had average accuracies of 98.31%, 97.57%, 97.19%, 97.00%, 95.32%, and 94.38%, respectively. The EfficientNet-B3 model had the highest average of precision, recall, accuracy, and DSC at 98.29%, 97.83%, 98.31%, and 98.05%, respectively. EfficientNet-B1 and EfficientNet-B2 achieved the highest average specificity, 98.08%. In Table 3, we distinguished between two classes in this experiment: Hem (healthy) and ALL (cancer). In Table 4, we compared in accuracy among the six CNN models using the proposed dynamic LR and using the fixed LR for ALL diseases.
For the ALL class, precision, recall, specificity, accuracy, and DSC were 98.37%, 99.18%, 96.47%, 98.31%, and 98.77% for EfficientNet-B3, respectively. The precision of EfficientNet-B3, EfficientNet-B0, EfficientNet-B1, and EfficientNet-B2 was 98.37%, 98.37%, 97.81%, and 97.54%, respectively. EfficientNet-B3, EfficientNet-B0, and InceptionResNetV2 had the highest recall, with 99.18%, 99.18%, and 98.35%, respectively. EfficientNet-B3 and EfficientNet-B0 achieved specificity scores of 96.47%. The most accurate models, EfficientNet-B3, EfficientNet-B0, and EfficientNet-B1, were 98.31%, 98.31%, and 97.19%, respectively. EfficientNet-B3, EfficientNet-B0, and EfficientNet-B1 accomplished the most elevated DSC, with 98.77%, 98.77%, and 97.94%, respectively.
For the Hem class, precision, recall, specificity, accuracy, and DSC were 98.20%, 96.47%, 99.18%, 98.31%, and 97.33% for EfficientNet-B3, respectively. EfficientNet-B3 and EfficientNet-B0 achieved the highest precision, recall, accuracy, and DSC, at 98.20%, 96.47, 98.31%, and 97.33%, respectively. The most specific results were obtained by EfficientNet-B3, EfficientNet-B0, and InceptionResNetV2, with 99.18%, 99.18%, and 98.35%, respectively.
Figure 8 demonstrates the accuracy and loss for the six CNN models trained and validated at a dynamic LR. Figure 9 shows the confusion matrix for the six CNN models that used a dynamic LR in the test dataset. There were two classes of the test dataset: 364 images in the ALL (cancer) class and 170 in the Hem (healthy) class. The EfficientNet-B3 model correctly predicted 360 images out of 364 for class ALL, with an accuracy of 98.9%. The EfficientNet-B0 model correctly predicted 356 images, giving it an accuracy of 97.8%. The EfficientNet-B1 and the EfficientNet-B2 models correctly predicted 357 images, achieving an accuracy of 98%. The InceptionResNetV2 model was 98.3% accurate, correctly predicting 358 images, while the DenesNet121 model was 94.2% accurate, correctly predicting 343 images.
After we experimented with the proposed classification model based on the EfficientNet-B3 CNN to predict ALL diseases, we experimented with the proposed model to classify uninfected and parasitized microscopic images on Giemsa-stained thin blood images from the dataset created by the NIH model. The outcomes of the experiment for EfficientNet-B3 and the five CNN models (EfficientNet-B0, EfficientNet-B1, EfficientNet-B2, InceptionResnetV2, and DenseNet121) using a fixed LR are presented in Table 5. EfficientNet-B3, EfficientNet-B0, EfficientNet-B1, EfficientNet-B2, InceptionResnetV2, and DenseNet121 achieved averaged accuracies of 85.22%, 92.91%, 96.60%, 75.89%, 75.89%, 97%, 97.19% respectively. The precision, recall, specificity, accuracy, and DSC average of the EfficientNet-B3 model were 88.70%, 85.22%, 85.11%, 85.22%, and 85.49%, respectively. The models with the highest average precision were Efficient-Net-B1 and Dense-Net121 at 97.18%. The model with the highest average recall was EfficientNet-B2 at 99.57%. The models with the highest average specificity were EfficientNet-B1 and Dense-Net121 at 97.17%. Dense-Net121 achieved the highest average accuracy and DSC with 97.19% and 97.17%, respectively.
Table 6 portrays the results of the test dataset for EfficientNet-B3 and five other CNN models utilizing a dynamic LR. We performed the binary classification using all CNN models to differentiate between uninfected and parasitized microscopic images on Giemsa-stained thin blood images from the dataset created by the NIH. EfficientNet-B3, EfficientNet-B0, EfficientNet-B1, EfficientNet-B2, InceptionResnetV2, and DenseNet121 achieved averaged accuracies of 97.68%, 97.61%, 97.31%, 97.39%, 97.10%, 97.61% respectively. The precision, recall, specificity, accuracy, and DSC average of the EfficientNet-B3 model were 97.69%, 97.68%, 97.67%, 97.68%, and 97.68%, respectively. EfficientNet-B3 achieved the highest average recall, specificity, accuracy, and DSC, while EfficientNet-B2 achieved the highest precision. In Table 7, we presented a comparison of accuracy of the six CNN models using the proposed dynamic LR and using the fixed LR for malaria parasite

4.3. Model Result Comparison with the Literature

Regarding thte binary classification accuracy, Atefeh et al. [29] used a system that recommends diagnoses automatically with an accuracy of 93.12%. Efthakhar et al. [30] used a machine-learning-based statistical model from a patient’s genome series with an accuracy of 95%. Therefore, the proposed EfficientNet-B3 model outperformed the most recently listed methods, as shown in Table 8. It had a remarkable accuracy rate of 98.31% for ALL predictions and 97.68% for malaria detection. Moreover, from the patient’s point of view, our model would improve the procedure by reducing the cost of diagnosis, speeding up diagnosis, and delaying the progression of the disease. The evaluation of the proposed model revealed that it is an unparalleled perceptive result with tuning compared to other ongoing models. Moreover, we compared the proposed model and different models that used dynamic LR in Table 8

5. Conclusions

This paper proposed a robust classifier to distinguish ALL as an automated model that changes the LR automatically. The classification model is based on the EfficientNet-B3 CNN model. A custom LR that compared the loss value at the beginning of each epoch to that of the previous epoch was set up in the proposed model. If it was smaller, we increased the LR; if it was larger, we decreased it. Consequently, the model performed better than a conventional method with a fixed LR and achieved a relatively high level of performance. The proposed model was evaluated using the C-NMC_Leukemia dataset, and we used normalization and balancing to pre-process the dataset. Recent classifiers were evaluated and compared to the proposed model. Our proposed model’s precision, recall, specificity, accuracy, and DSC were 98.29%, 97.83%, 97.82%, 98.31%, and 98.05%, respectively. The proposed model will help clinicians use the knowledge they extract from data stores to aid in the accurate and efficient diagnosis of ALL.. Moreover, the proposed model was used to examine microscopic images of the blood to identify the malaria parasite. Our proposed model’s average precision, recall, specificity, accuracy, and DSC were 97.69%, 97.68%, 97.67%, 97.68%, and 97.68%, respectively. The prediction time is a limitation of our proposed framework. Therefore, our future work is to improve the prediction time of the proposed framework by utilizing more optimization and feature selection approaches. Furthermore, we will use a moving average loss value by aggregating loss values across the previous N epochs.

Author Contributions

Conceptualization, S.A.E.-G. and M.E.; methodology, S.A.E.-G. and M.E.; database search, data curation, assessment of bias, A.A.A.E.-A.; writing—original draft preparation, A.A.A.E.-A.; writing—review and editing, S.A.E.-G., M.E., and A.A.A.E.-A.; supervision, S.A.E.-G. All authors have read and agreed to the published version of the manuscript.

Funding

This work is funded through research grant No. (DSR-2021-02-0206).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Acknowledgments

The authors extend their appreciation to the Deanship of Scientific Research at Jouf University for funding this work through research grant No. (DSR-2021-02-0206).

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Abir, W.H.; Uddin, F.; Khanam, F.R.; Tazin, T.; Khan, M.M.; Masud, M.; Aljahdali, S. Explainable AI in Diagnosing and Anticipating Leukemia Using Transfer Learning Method. Comput. Intell. Neurosci. 2022, 2022, 5140148. [Google Scholar] [CrossRef] [PubMed]
  2. Gehlot, S.; Gupta, A.; Gupta, R. SDCT-auxNetθ: DCT Augmented Stain Deconvolutional CNN with Auxiliary Classifier For Cancer Diagnosis. Med. Image Anal. 2020, 61, 101661. [Google Scholar] [CrossRef] [PubMed]
  3. World Health Organization. Breast Cancer Now Most Common Form of Cancer: WHO Taking Action. Available online: https://tinyurl.com/93eccmnv (accessed on 3 February 2021).
  4. Mondal, C.; Hasan, K.; Ahmad, M.; Awal, A.; Jawad, T.; Dutta, A.; Islam, R.; Moni, M.A. Ensemble of Convolutional Neural Networks to Diagnose Acute Lymphoblastic Leukemia from Microscopic Images. Inform. Med. Unlocked 2021, 27, 100794. [Google Scholar] [CrossRef]
  5. Laosai, J.; Chamnongthai, K. Classification of Acute Leukemia Using Medical-Knowledge-Based Morphology and CD Marker. Biomed. Signal Process. Control. 2018, 44, 127–137. [Google Scholar] [CrossRef]
  6. Vogado, L.H.; Veras, R.M.; Araujo, F.H.; Silva, R.R.; Aires, K.R. Leukemia Diagnosis in Blood Slides using Transfer Learning in Cnns and Svm for Classification. Eng. Appl. Artif. Intell. 2018, 72, 415–422. [Google Scholar] [CrossRef]
  7. American Society of Hematology. Hematology. Available online: https://www.hematology.org (accessed on 24 April 2021).
  8. American Cancer Society. Key Statistics for Acute Lymphocytic Leukemia. Available online: https://www.cancer.org/cancer/acute-lymphocytic-leukemia/about/key-statistics.html (accessed on 24 April 2021).
  9. Curesearch for Children’s Cancer Research. Curesearch. Available online: https://curesearch.org/Acute-Lymphoblastic-Leukemia-in-Children (accessed on 20 April 2021).
  10. Mohamed, M.; Far, B.; Guaily, A. An Efficient Technique for White Blood Cells Nuclei Automatic Segmentation. In Proceedings of the 2012 IEEE International Conference on Systems, Man, and Cybernetics (SMC), Seoul, Republic of Korea, 14 October 2012. [Google Scholar]
  11. Zakir Ullah, M.; Zheng, Y.; Song, J.; Aslam, S.; Xu, C.; Kiazolu, G.D.; Wang, L. An Attention-Based Convolutional Neural Network for Acute Lymphoblastic Leukemia Classification. Appl. Sci. 2021, 11, 10662. [Google Scholar] [CrossRef]
  12. Amin, M.M.; Kermani, S.; Talebi, A.; Oghli, M.G. Recognition of Acute Lymphoblastic Leukemia Cells in Microscopic Images using K-Means Clustering and Support Vector Machine Classifier. J. Med. Signals Sens. 2015, 5, 49–58. [Google Scholar] [PubMed]
  13. Shrestha, Y.R.; Krishna, V.; von Krogh, G. Augmenting Organizational Decision-Making with Deep Learning Algorithms: Principles, Promises, and Challenges. J. Bus. Res. 2021, 123, 588–603. [Google Scholar] [CrossRef]
  14. Xin, Y.; Kong, L.; Liu, Z.; Chen, Y.; Li, Y.; Zhu, H.; Gao, M.; Hou, H.; Wang, C. Machine Learning and Deep Learning Methods for Cybersecurity. IEEE Access 2018, 6, 35365–35381. [Google Scholar] [CrossRef]
  15. Suzuki, K. Overview of Deep Learning in Medical Imaging. Radiol. Phys. Technol. 2017, 10, 257–273. [Google Scholar] [CrossRef] [PubMed]
  16. Shen, D.; Wu, G.; Suk, H.I. Deep Learning in Medical Image Analysis. Annu. Rev. Biomed. Eng. 2017, 19, 221–248. [Google Scholar] [CrossRef] [PubMed]
  17. Smith, L.N. Cyclical Learning Rates for Training Neural Networks. In Proceedings of the 2017 IEEE Winter Conference on Applications of Computer Vision (WACV), Santa Rosa, CA, USA, 24 March 2017. [Google Scholar]
  18. Loshchilov, I.; Hutter, F. SGDR: Stochastic Gradient Descent with Warm Restarts. arXiv 2017, arXiv:1608.03983. [Google Scholar]
  19. Izmailov, P.; Podoprikhin, D.; Garipov, T.; Vetrov, D.; Wilson, A.G. Averaging Weights Leads to Wider Optima and Better Generalization. arXiv 2018, arXiv:1803.05407. [Google Scholar]
  20. Johny, A.; Madhusoodanan, K.N. Dynamic Learning Rate in Deep CNN Model for Metastasis Detection and Classification of Histopathology Images. Comput. Math. Methods Med. 2021, 2021, 5557168. [Google Scholar] [CrossRef] [PubMed]
  21. Khandekar, R.; Shastry, P.; Jaishankar, S.; Faust, O.; Sampathila, N. Automated Blast Cell Detection for Acute Lymphoblastic Leukemia Diagnosis. Biomed. Signal Process. Control. 2021, 68, 102690. [Google Scholar] [CrossRef]
  22. Almadhor, A.; Sattar, U.; Al Hejaili, A.; Mohammad, U.G.; Tariq, U.; Ben Chikha, H. An Efficient Computer Vision-Based Approach for Acute Lymphoblastic Leukemia Prediction. Front. Comput. Neurosci. 2022, 16, 1083649. [Google Scholar] [CrossRef] [PubMed]
  23. Kasani, P.H.; Won Park, S.; Won Jang, J. An Aggregated-Based Deep Learning Method for Leukemic B-Lymphoblast Classification. Diagnostics 2020, 10, 1064. [Google Scholar] [CrossRef] [PubMed]
  24. Liu, Y.; Chen, P.; Zhang, J.; Liu, N.; Liu, Y. Weakly Supervised Ternary Stream Data Augmentation Fine-Grained Classification Network for Identifying Acute Lymphoblastic Leukemia. Diagnostics 2022, 12, 16. [Google Scholar] [CrossRef] [PubMed]
  25. Malaria Cell Images Dataset; National Institute of Health (NIH). Available online: https://ceb.nlm.nih.gov/repositories/malaria-datasets (accessed on 21 December 2020).
  26. Tan, M.; Le, Q. EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks. In Proceedings of the 36th International Conference on Machine Learning, Long Beach, CA, USA, 10–15 June 2020. [Google Scholar]
  27. Alhichri, H.; Alswayed, A.S.; Bazi, Y.; Ammour, N.; Alajlan, N.A. Classification of Remote Sensing Images Using Efficientnet-B3 CNN Model with Attention. IEEE Access 2021, 9, 14078–14094. [Google Scholar] [CrossRef]
  28. Sandler, M.; Howard, A.; Zhu, M.; Zhmoginov, A.; Chen, L.-C. MobileNetV2: Inverted Residuals and Linear Bottlenecks. In Proceedings of the 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA, 18 June 2018. [Google Scholar]
  29. Torkamana, A.; Charkarib, N.M.; Aghaeipourc, M. An Approach for Leukemia Classification Based on Cooperative Game Theory. Anal. Cell. Pathol. 2011, 34, 235–246. [Google Scholar] [CrossRef]
  30. Alam, E.U.; Banik, S.; Chowdhury, L. A Statistical Approach to Classify the Leukemia Patients from Generic Gene Features. In Proceedings of the 2020 International Conference on Computer Communication and Informatics (ICCCI-2020), Coimbatore, India, 22–24 January 2020. [Google Scholar]
Figure 1. ALL cell image samples. The images in (AC) depict ALL cancer cells, while the images in (DF) depict healthy cells.
Figure 1. ALL cell image samples. The images in (AC) depict ALL cancer cells, while the images in (DF) depict healthy cells.
Diagnostics 13 00404 g001
Figure 2. The saddle point in the loss landscape.
Figure 2. The saddle point in the loss landscape.
Diagnostics 13 00404 g002
Figure 3. Flowchart of the proposed model.
Figure 3. Flowchart of the proposed model.
Diagnostics 13 00404 g003
Figure 4. An example of applying the pre-processing stage on blood cells.
Figure 4. An example of applying the pre-processing stage on blood cells.
Diagnostics 13 00404 g004
Figure 5. The structure of the EfficientNet-B3 model.
Figure 5. The structure of the EfficientNet-B3 model.
Diagnostics 13 00404 g005
Figure 6. The loss and accuracy curves for the six CNN models using a fixed LR.
Figure 6. The loss and accuracy curves for the six CNN models using a fixed LR.
Diagnostics 13 00404 g006
Figure 7. The test dataset’s confusion matrix for the six CNN models using a fixed LR.
Figure 7. The test dataset’s confusion matrix for the six CNN models using a fixed LR.
Diagnostics 13 00404 g007
Figure 8. The loss and accuracy curves for the six CNN models using dynamic LR.
Figure 8. The loss and accuracy curves for the six CNN models using dynamic LR.
Diagnostics 13 00404 g008
Figure 9. The test dataset’s confusion matrix for the six CNN models using dynamic LR.
Figure 9. The test dataset’s confusion matrix for the six CNN models using dynamic LR.
Diagnostics 13 00404 g009
Table 1. The dynamic LR algorithm.
Table 1. The dynamic LR algorithm.
The Dynamic Learning Rate Algorithm
Input: no_epoch, factor (factor between 0.0 and 1.0.)
Output: Dynamically Learning Rate.
Begin:
   1-while (no_epoch > 0)
    1.1-while (no_epoch == 1)
      1.1.1- Input: I (H to halt training or N to continue training).
      1.1.2 - If (I == N)
         1.1.2.1-no _epoch = N.
      1.1.3- else         
         1.1.3.1- Exit
         End if
    End while
   2- Input: dwell
   3- if dwell == True
    3.1 calculate curr_valid_loss
    3.2 calculate curr_train_accu
    3.3- if (curr_valid_loss > prev_valid_loss)
      3.3.1- curr_W = prev_W
      3.3.2- curr_B = prev_B
      3.3.3- next_lr = current_lr * factor
      End if
     3.4- if (curr_train_accu < prev_train_accu)
       3.4.1-curr_W = prev_W
       3.4.2-curr_B = prev_B
       3.4.3- next_lr = current_lr * factor
       End if
     End if
   4- no_epoch- = 1
  End while
End.
Table 2. The performance of the six CNN models using a fixed LR.
Table 2. The performance of the six CNN models using a fixed LR.
ModelClassPrecision (%)Recall (%)Specificity (%)Accuracy (%)DSC (%)
EfficientNet-B3all97.8298.6395.2997.5798.22
hem97.0195.2998.6397.5796.14
Average 97.4296.9696.9697.5797.18
EfficientNet-B0all91.9099.7381.1893.8295.65
hem99.2881.1899.7393.8289.32
Average 95.5990.4690.4593.8289.32
EfficientNet-B1all97.9993.6895.88943895.78
hem87.6395.8893.6894.3891.57
Average 87.6395.8893.6894.3891.57
EfficientNet-B2all94.7498.9088.2495.5196.77
hem95.5995.5198.9095.5192.59
Average 95.5995.5198.9095.5192.59
InceptionResNetV2all93.6997.5383.5393.0795.05
hem94.0483.5397.5393.0788.47
Average 93.8790.5390.5393.0791.76
DenseNet121all91.3881.5983.5382.2186.21
hem67.9483.5381.5982.2174.93
Average 79.6682.5682.5682.2180.57
Table 3. Performance of the six CNN models using the proposed dynamic LR.
Table 3. Performance of the six CNN models using the proposed dynamic LR.
ModelClassPrecision (%)Recall (%)Specificity (%)Accuracy (%)DSC (%)
EfficientNet-B3all98.3799.1896.4798.3198.77
hem98.2096.4799.1898.3197.33
Average 98.2997.8397.8298.3198.05
EfficientNet-B0all98.3799.1896.4798.3198.77
hem98.2096.4799.1898.3197.33
Average 95.3897.0697.8097.5796.21
EfficientNet-B1all97.8198.0895.2997.1997.94
hem95.8695.2998.0897.1995.58
Average 95.8695.2998.0897.1995.58
EfficientNet-B2all97.5498.0894.7097.0097.80
hem95.8394.7198.0897.0095.27
Average 95.8394.7198.0897.0095.27
InceptionResNetV2all94.9698.3588.8295.3296.63
hem96.1888.8298.3595.3292.35
Average 95.5793.5993.5995.3294.49
DenseNet121all97.4494.2394.7194.3895.81
hem88.4694.7194.2394.3891.48
Average 92.9594.4794.4794.3893.64
Table 4. Comparison of accuracy of the six CNN models using the proposed dynamic LR and using the fixed LR for ALL diseases.
Table 4. Comparison of accuracy of the six CNN models using the proposed dynamic LR and using the fixed LR for ALL diseases.
ModelClassAccuracy (%) of Fixed LRAccuracy (%) of Dynamic LR
EfficientNet-B3all97.5798.31
hem97.5798.31
Average 97.5798.31
EfficientNet-B0all93.8298.31
hem93.8298.31
Average 93.8297.57
EfficientNet-B1all943897.19
hem94.3897.19
Average 94.3897.19
EfficientNet-B2all95.5197.00
hem95.5197.00
Average 95.5197.00
InceptionResNetV2all93.0795.32
hem93.0795.32
Average 93.0795.32
DenseNet121all82.2194.38
hem82.2194.38
Average 82.2194.38
Table 5. The performance of binary classification of malaria parasite for the six CNN models using a fixed LR.
Table 5. The performance of binary classification of malaria parasite for the six CNN models using a fixed LR.
ModelClassPrecision (%)Recall (%)Specificity (%)Accuracy (%)DSC (%)
EfficientNet-B3Parasitized99.3970.8682.7399.5785.49
Uninfected78.0199.5787.4870.8685.49
Average 88.7085.2285.1185.2285.49
EfficientNet-B0Parasitized99.1586.5492.4299.2993.03
Uninfected88.4599.2993.5686.5493.03
Average 93.8092.9292.9992.9193.03
EfficientNet-B1Parasitized97.6196.697.197.7297.16
Uninfected96.7697.7297.2496.6097.17
Average 97.1897.1697.1797.1697.16
EfficientNet-B2Parasitized99.4275.8986.0799.5787.95
Uninfected81.0999.5789.3975.8987.95
Average 81.0999.5789.3975.8987.95
InceptionResNetV2Parasitized98.0395.8696.9398.1497.02
Uninfected96.0998.1597.1195.8697.02
Average 97.0697.0197.0297.0097.02
DenseNet121Parasitized95.9698.3797.1596.0197.17
Uninfected98.3996.0197.1998.3797.17
Average 97.1897.1997.1797.1997.17
Table 6. The performance of binary classification of malaria parasites for the six CNN models using the proposed dynamic LR.
Table 6. The performance of binary classification of malaria parasites for the six CNN models using the proposed dynamic LR.
ModelClassPrecision (%)Recall (%)Specificity (%)Accuracy (%)DSC (%)
EfficientNet-B3Parasitized97.9297.3498.0197.6797.63
Uninfected97.4598.0197.3497.6897.73
Average 97.6997.6897.6797.6897.68
EfficientNet-B0Parasitized98.3596.7598.4397.6197.54
Uninfected96.9198.4396.7497.6197.67
Average 97.6397.5997.5997.6197.67
EfficientNet-B1Parasitized97.3397.1997.4397.3197.26
Uninfected97.30974497.1997.3197.37
Average 97.3097.4497.1997.3197.37
EfficientNet-B2Parasitized 97.0697.6397.1597.3897.34
Uninfected 97.7197.1597.6397.3997.43
Average 97.7197.1597.6397.3997.43
InceptionResNetV2Parasitized 97.8996.1598.0197.0997.01
Uninfected 96.3698.0196.1597.1097.18
Average 97.1397.0897.0897.1097.10
DenseNet121Parasitized 98.2096.8998.2997.6197.54
Uninfected 97.0598.2996.8997.6197.66
Average 97.6397.5997.5997.6197.60
Table 7. Comparison of accuracy of the six CNN models using the proposed dynamic LR and using the fixed LR for malaria parasite.
Table 7. Comparison of accuracy of the six CNN models using the proposed dynamic LR and using the fixed LR for malaria parasite.
ModelClassAccuracy (%) of the Dynamic LRAccuracy (%) of the Fixed LR
EfficientNet-B3Parasitized97.6799.57
Uninfected97.6870.86
Average 97.6885.22
EfficientNet-B0Parasitized97.6199.29
Uninfected97.6186.54
Average 97.6192.91
EfficientNet-B1Parasitized97.3197.72
Uninfected97.3196.60
Average 97.3197.16
EfficientNet-B2Parasitized97.3899.57
Uninfected97.3975.89
Average 97.3975.89
InceptionResNetV2Parasitized97.0998.14
Uninfected97.1095.86
Average 97.1097.00
DenseNet121Parasitized 97.6196.01
Uninfected 97.6198.37
Average 97.6197.19
Table 8. The comparison results between the proposed model and other recent models.
Table 8. The comparison results between the proposed model and other recent models.
StudyMethodologyTested MetricsDatasets
Abir et al. [1]InceptionV3Accuracy 80%,
F-Score 79.8
C-NMC _Leukemia
Mondal et al. [4]Ensemble of Xception, VGG-16, DenseNet-121, MobileNet, and InceptionResNet-V288.3%C-NMC _Leukemia
Amin et al. [12]ECA-Net Based on VGG1691.1%C-NMC _Leukemia
Khandekar et al. [21]YOLOv4, VGG16, ResNet-50, Darknet52, CSPDarknet53 or ResNext50.For C-NMC_Leukemia dataset: Weighted F1-score on the test set of 92% with Mean Average Precision of 98.57% and recall of 96%.
For ALL-IDB1 dataset: Mean Average Precision of 95.57%, Recall of 92% and F1 score of 0.92
C-NMC _Leukemia and ALL-IDB1
Almadhor et al. [22]NB, KNN, RF, and SVM in proposing an ensemble automated prediction strategySVM outperforms other algorithms with an accuracy of 90%.C-NMC _Leukemia
Liu et al. [24]AlexNet, VGGNet, NASNet, Xception, DenseNet, InceptionV3, MobileNet, and ShuffleNet96.58%C-NMC _Leukemia
Tan and Le [26]ternary stream fine-grained model91.9%C-NMC _Leukemia
Atefeh et al. [29]A recommender system (MDSS)93.12%Data collected from Iran Blood Transfusion Organization (IBTO)
Efthakhar et al. [30]Naive Bayes95%NCBI GEO dataset
proposed model EfficientNet-B3 model98.31%C-NMC _Leukemia
proposed model for malaria detection EfficientNet-B3 model97.68%NIH
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Abd El-Ghany, S.; Elmogy, M.; El-Aziz, A.A.A. Computer-Aided Diagnosis System for Blood Diseases Using EfficientNet-B3 Based on a Dynamic Learning Algorithm. Diagnostics 2023, 13, 404. https://doi.org/10.3390/diagnostics13030404

AMA Style

Abd El-Ghany S, Elmogy M, El-Aziz AAA. Computer-Aided Diagnosis System for Blood Diseases Using EfficientNet-B3 Based on a Dynamic Learning Algorithm. Diagnostics. 2023; 13(3):404. https://doi.org/10.3390/diagnostics13030404

Chicago/Turabian Style

Abd El-Ghany, Sameh, Mohammed Elmogy, and A. A. Abd El-Aziz. 2023. "Computer-Aided Diagnosis System for Blood Diseases Using EfficientNet-B3 Based on a Dynamic Learning Algorithm" Diagnostics 13, no. 3: 404. https://doi.org/10.3390/diagnostics13030404

APA Style

Abd El-Ghany, S., Elmogy, M., & El-Aziz, A. A. A. (2023). Computer-Aided Diagnosis System for Blood Diseases Using EfficientNet-B3 Based on a Dynamic Learning Algorithm. Diagnostics, 13(3), 404. https://doi.org/10.3390/diagnostics13030404

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop