Systematic Review

Computer-Aided Diagnosis Systems for Automatic Malaria Parasite Detection and Classification: A Systematic Review

1 Department of Information Engineering, Electronics and Telecommunications (DIET), “La Sapienza” University of Rome, 00184 Rome, Italy
2 National Transport Authority (NTA), D02 WT20 Dublin, Ireland
3 Faculty of Lifelong Learning, South East Technological University (SETU), R93 V960 Carlow, Ireland
4 Research Unit of Genetics of Complex Phenotypes, Bambino Gesù Children’s Hospital, IRCCS, 00165 Rome, Italy
* Author to whom correspondence should be addressed.
Electronics 2024, 13(16), 3174; https://doi.org/10.3390/electronics13163174
Submission received: 1 July 2024 / Revised: 6 August 2024 / Accepted: 9 August 2024 / Published: 11 August 2024
(This article belongs to the Special Issue Machine Learning for Biomedical Imaging Applications)

Abstract: Malaria is a disease that affects millions of people worldwide and carries a considerable mortality rate. Light microscope examination is the gold standard for detecting infection by malaria parasites, but it is limited by long timescales and requires a high level of expertise from pathologists. Early diagnosis of this disease is necessary to achieve timely and effective treatment and to avoid tragic consequences; this need has led to the development of computer-aided diagnosis systems based on artificial intelligence (AI) for the detection and classification of blood cells infected with the malaria parasite in blood smear images. Such systems involve an articulated pipeline, culminating in the use of machine learning and deep learning approaches, the main branches of AI. Here, we present a systematic literature review of recent research on the use of automated algorithms to identify and classify malaria parasites in blood smear images. Based on the PRISMA 2020 criteria, a search was conducted using several electronic databases, including PubMed, Scopus, and arXiv, with inclusion/exclusion filters applied. From the 606 initial records identified, 135 eligible studies were selected and analyzed. Many promising results were achieved, and some mobile and web applications have been developed to address resource and expertise limitations in developing countries.

1. Introduction

Malaria is a deadly disease worldwide and is known as a major cause of infant mortality. Statistics from the World Malaria Report 2023 [1] compiled by the World Health Organization (WHO) show that 249 million cases of malaria were detected globally in 2022, 5 million more than the previous year. The main countries contributing to the increase in cases between 2021 and 2022 were Pakistan, Ethiopia, Nigeria, Uganda, and Papua New Guinea. Between 2000 and 2022, the mortality rate for malaria halved from 28.8 per 100,000 population at risk to 14.3, and the proportion of total deaths from malaria among children under the age of 5 decreased from 86.8% in 2000 to 76.0% in 2022.

1.1. Malaria Parasite Transmission and Life Cycle

Malaria transmission occurs through the bite of an infected female Anopheles mosquito, which transfers the parasite to humans, where it attacks their red blood cells (RBCs). The parasitic protozoan that causes malaria infection belongs to the genus Plasmodium [2], which comprises four species, namely P. falciparum, P. vivax, P. ovale, and P. malariae, of which the first is the most dangerous. Plasmodium matures and reproduces through a complex cycle [3,4] that varies with the temperature of the environment [5], slowing down in cold periods, and during which it undergoes changes in shape, size, morphology, and color depending on the phase it is in, i.e., ring, trophozoite, schizont, or gametocyte [6]. The different stages in the life cycle of the Plasmodium parasite are shown in Figure 1.

1.2. Malaria Diagnosis and Treatment

To date, there is no fully effective vaccine against malaria, due to the high diversity and adaptability of Plasmodium antigens, which would therefore require the development of specific solutions [7]. However, it is clear from recent WHO analyses that malaria deaths have decreased in recent years, despite an increase in cases worldwide. Malaria is a curable disease if diagnosed early. Currently, the gold standard for the diagnosis and detection of the different types of malaria is the visual inspection of Giemsa-stained blood smears under light microscopy [8], which requires an interpretation of the results by an experienced pathologist. Microscopic examination can be conducted on both thin and thick blood smears. The thick blood smear is 6 to 20 times thicker than a single layer of RBCs, making it 20 to 40 times more sensitive in detecting malaria parasites than a thin smear. Indeed, the thick blood smear method is often suitable for detecting parasites, with an 11-fold higher sensitivity rate [9], but not for counting infected RBCs; it is, therefore, a method to be avoided in conditions of high parasitemia. In addition, image artifacts in thick smears interfere with reading and pose further image processing challenges. On the other hand, the thin blood smear preserves the shape of the RBCs, allowing them to be counted, and is used to visualize a parasite within a cell. Additional techniques for malaria detection include the antigen test, fast but less precise, and the polymerase chain reaction, precise but expensive. Microscopic analysis is, therefore, the method generally used, but it requires laboratory instruments, considerable experience, and time. Since people can be infected with more than one malaria parasite at the same time, it is necessary to examine the entire blood smear sample, a very time-consuming procedure for pathologists who examine voluminous samples. The manual evaluation and cell count in a thin blood smear sample take an expert about 15 min [10], and the accuracy of the diagnosis depends directly on their experience. Due to the lack of equipment and human resources, malaria is difficult to detect in rural areas and developing countries, where only the patient’s history and symptoms are considered to reach a diagnosis [11], resulting in delayed diagnosis. To formulate an effective and timely treatment for malaria, in addition to early diagnosis, it is essential to recognize the degree of spread, quantify the number of parasites, and identify their life stages.
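As a reference for the quantification step just mentioned, parasitemia on a thin smear is commonly expressed as the percentage of parasitized erythrocytes; this is the standard formula, not one taken from any specific study reviewed here:

$$\text{Parasitemia (\%)} = \frac{\text{number of parasitized RBCs}}{\text{total number of RBCs counted}} \times 100$$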

1.3. Automatic Malaria Detection and Diagnosis

To overcome the obstacles just discussed and to ensure rapid and effective malaria diagnosis, with low cost and consistent accuracy worldwide, automated methods must be used. Conventional image processing techniques, whose simplicity and low computational cost make them potentially suitable for contexts where large-scale labeled data cannot be acquired, have shown promising results in various scenarios, including the determination of blood film parasitization [12]. However, computer-aided diagnosis (CAD) systems based on artificial intelligence (AI) have gained much more popularity in recent decades [13] and can also be adapted for malaria parasite detection. The traditional pipeline of CAD systems for malaria diagnosis comprises the following steps: data acquisition and pre-processing, cell segmentation, feature extraction, and finally object detection and classification of infected cells, parasite type, and life stage.

The first step is to acquire images of the blood smears using the cameras already mounted on the light microscope or using external devices such as smartphones. Samples are generally stained with Giemsa, the recommended and most reliable procedure for staining thick and thin blood films to detect and identify blood parasites. The Giemsa solution consists of eosin, which stains the parasite nucleus red, and methylene blue, which stains the cytoplasm blue. However, some published works on automated malaria diagnosis use the Giemsa reagent combined with other types of dyes, e.g., Wright–Giemsa, May Grünwald–Giemsa, and Leishman–Giemsa.

Once the data have been acquired, pre-processing methods aim to improve the quality of blood smear images by removing impurities that may affect the accuracy of subsequent processing steps, especially cell segmentation. To this end, Gaussian, median, and geometric filtering techniques and morphological operators are commonly used for noise reduction [14,15,16], while adaptive thresholding and histogram equalization are used to improve image contrast [12]. In addition, normalization techniques are applied to reduce illumination variation across images. To increase the size of image datasets and, consequently, the performance of AI models, data augmentation (DA) transformations are applied to the initial data to create new artificial examples [17]. Among the classic operations used for DA are rotation, zooming, cropping, noise addition, scaling, and translation [18]; beyond these, more advanced transformations such as changing contrast or brightness can be applied. DA techniques are also used to balance the data across classes.

The next step, cell segmentation, is crucial in automatic malaria detection systems and aims to identify RBCs, white blood cells (WBCs), parasites, and other artifacts, and then separate them from each other and from the background. This can be performed manually by experts, requiring very long time frames, or by image processing techniques. In the latter case, thresholding or watershed methods are the most frequently used, along with machine learning (ML) and deep learning (DL) techniques such as clustering algorithms and deep artificial neural networks (ANNs). Features relevant for their subsequent classification are then extracted from the segmented cells and parasites.
This can be carried out manually, aiming to extract shape, color, and texture information, typically preferring the HSV color space and the green channel of the RGB color space [19], or automatically using AI algorithms. The last step in the pipeline of CAD systems for automatic malaria diagnosis is to detect and classify parasites. At this stage, once Plasmodium-infected cells have been identified, a cell count can be performed to determine the degree of parasitemia, as well as the infecting type of parasite and its life cycle stage. Classic ML methods make use of the features extracted in the previous processing step, while DL methods automatically extract the information most relevant to the purpose of the analysis. Although these two approaches are generally used separately, some works propose hybrid ML/DL models or even ensemble learning systems to increase the diagnostic performance of CAD systems. In particular, ensemble learning reduces model variance by optimally combining the predictions of multiple models, decreasing the sensitivity to the choice of training data and training algorithms. The CAD system pipeline for malaria diagnosis just discussed is briefly summarized in Figure 2.
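As an illustration of the pre-processing and hand-crafted color-feature steps described above, the following is a minimal sketch using OpenCV; the file name, filter size, and choice of Otsu thresholding are illustrative assumptions, not taken from any specific study reviewed here.

```python
import cv2
import numpy as np

# Load a blood smear image (the file name is an illustrative placeholder).
img = cv2.imread("smear.png")

# Noise reduction with a median filter, one of the options cited above.
denoised = cv2.medianBlur(img, 5)

# Contrast enhancement: histogram equalization on the luminance channel.
lab = cv2.cvtColor(denoised, cv2.COLOR_BGR2LAB)
lab[..., 0] = cv2.equalizeHist(lab[..., 0])
enhanced = cv2.cvtColor(lab, cv2.COLOR_LAB2BGR)

# Rough cell/background separation by Otsu thresholding on the gray image.
gray = cv2.cvtColor(enhanced, cv2.COLOR_BGR2GRAY)
_, mask = cv2.threshold(gray, 0, 255, cv2.THRESH_BINARY_INV + cv2.THRESH_OTSU)

# Hand-crafted color features: HSV channels and the green channel of RGB
# (OpenCV stores images as BGR, so index 1 is the green channel).
hsv = cv2.cvtColor(enhanced, cv2.COLOR_BGR2HSV)
green = enhanced[..., 1]
features = np.array([hsv[..., 0].mean(), hsv[..., 1].mean(),
                     hsv[..., 2].mean(), green.mean()])
```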
CAD systems for malaria diagnosis are proving to be useful tools for increasing diagnostic accuracy, limiting intra- and inter-operator variability, and reducing time and human resources. Moreover, some of these systems have been integrated into mobile applications that can be used by anyone anywhere in the world, even without an internet connection. This systematic review aims to provide a comprehensive overview of the last ten years of the literature on the applications of AI to the automatic diagnosis of malaria, and to analyze their performance.

2. Methods

The Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidelines [20] were followed in the present study. More details on the search, screening, and article selection strategies are given in the following subsections.

2.1. Data Sources and Search Strategy

To identify recently published articles concerning the use of ML and DL for malaria diagnosis from tissue or cell images, the electronic databases PubMed, Scopus, and arXiv were considered, restricting the timespan to between 1 January 2014 and 13 February 2024. The keywords Artificial Intelligence, Machine Learning, Deep Learning, Malaria, Images, Classification, and Diagnosis were used and combined with the logical operators “AND” and “OR”, as illustrated in Figure 3.
In addition, we set filters to limit the publication language to English and to select only original articles in the Scopus database. After collecting all the articles resulting from the automatic search, duplicates were removed.
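The exact combination of terms is reported in Figure 3; an illustrative query of this form (a reconstruction for clarity, not the verbatim search string) is:

```
("Artificial Intelligence" OR "Machine Learning" OR "Deep Learning")
AND Malaria AND Images AND (Classification OR Diagnosis)
```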

2.2. Inclusion and Exclusion Criteria

Since this systematic review aims to collect and analyze studies on the applications of ML and DL to the automatic image-based diagnosis of malaria, we established the following exclusion criteria: lack of application of automatic diagnosis techniques, implementation of segmentation tasks not followed by automatic classification, focus on diseases other than malaria, and analysis of data other than images. In addition, we excluded reviews and systematic reviews, non-publicly accessible articles, and works with unclear or non-explicit results. Moreover, the search criteria described above already excluded papers not written in English and those published before 2014. The inclusion criteria were the negation of the exclusion criteria.

2.3. Quality Assessment

Quality assessment of primary research is a key step in systematic reviews: it is necessary for a complete and transparent review process whose results are not influenced by poorly conducted data collection. Therefore, in the present work, the standard checklist of questions (SCQ) designed in [21] was applied, and, following [22], articles that answered “yes” to at least 7 of the 10 questions in Table 1 were deemed eligible.

2.4. Data Extraction

Data extraction was performed following the PRISMA guidelines [20] (Figure 4).
The examination and screening of all the search results were conducted by two independent authors (F.G. and A.A.). The initial search of the electronic databases generated 606 articles (257 from PubMed, 314 from Scopus, and 35 from arXiv). Among these, 398 were eliminated as duplicate records. After reading the titles and abstracts, 45 articles were discarded because they were outside the topic of the present review, following the exclusion criteria described above. In the next step, the remaining 163 articles were carefully evaluated by full-text reading: 22 articles were excluded after reapplying the previously established inclusion/exclusion criteria, and a further 6 failed the quality assessment. Finally, a total of 135 articles were deemed suitable for a full evaluation of the reported results. Data were collected and extracted from the selected articles by highlighting the following key aspects: malaria dataset used, pre-processing technique applied, task performed, AI tools used, results obtained (quantified by evaluation metrics), cross-validation and external validation conducted, and eXplainable AI (XAI) method used, where present. This information has been sorted and set out in summary tables following the organization of the manuscript.

3. Artificial Intelligence Background

The purpose of this systematic review is to summarize and analyze recent applications of AI and CAD systems to malaria diagnosis. In this section, we briefly summarize the main information behind these concepts to provide a general overview of what is discussed in the following sections.

3.1. ML and DL

AI, which tries to perform some of the tasks characteristic of human intelligence, such as learning, has now become part of our daily lives, including in medical applications. Two main branches of AI are ML and DL, families of models and algorithms that autonomously extract useful information from data and learn from it with the aim of making predictions on new, previously unseen data. The difference between the two is that DL, the most successful ML solution, exploits complex architectures characterized by many layers and many parameters. The principal ML models include Decision Trees (DTs) [23], Support Vector Machines (SVMs) [24], K-Nearest Neighbors (KNNs) [25], and simple ANNs [26]. In DL, instead, the most effective and widely used algorithms in computer vision (CV) are convolutional neural networks (CNNs or ConvNets), networks composed of several layers, such as convolutional, max-pooling, normalization, dropout, fully connected, and output layers, each with its own specific function [27]; some of these layers have trainable parameters, while others simply implement a fixed function. Through these layers, each input is transformed until the last layer produces class probabilities for the image. Other DL approaches commonly used in CV are autoencoders (AEs) [28], widely applied in dimensionality reduction and feature learning, and Generative Adversarial Networks (GANs) [29], a class of deep networks for generative modeling. Training DL models with randomly initialized parameters requires a large amount of labeled data, which is not readily available. Transfer learning (TL) represents the best solution, allowing the reuse of the knowledge (the weights) of a CNN model pre-trained on large labeled datasets that achieved good results in the source domain. The pre-trained network can be used as a feature extractor or fine-tuned by freezing or unfreezing various layers. Several CNN architectures are available as pre-trained models [30,31,32], and the most commonly used ones are summarized in Table 2.
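As a concrete illustration of the TL scheme just described, the following minimal PyTorch sketch freezes a pre-trained ResNet-50 and replaces its head for a two-class problem; the architecture choice and learning rate are illustrative assumptions.

```python
import torch
import torch.nn as nn
from torchvision import models

# Load a CNN pre-trained on ImageNet and reuse its weights (transfer learning).
model = models.resnet50(weights=models.ResNet50_Weights.IMAGENET1K_V1)

# Freeze the pre-trained layers so only the new head is trained.
for param in model.parameters():
    param.requires_grad = False

# Replace the final fully connected layer for a 2-class problem
# (e.g., parasitized vs. uninfected cells).
model.fc = nn.Linear(model.fc.in_features, 2)

# Only the parameters of the new head are passed to the optimizer.
optimizer = torch.optim.Adam(model.fc.parameters(), lr=1e-3)
```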

3.2. Evaluation Metrics

To evaluate the learning algorithms in a classification task, it is possible to use some synthetic indices [33], summarized in Table 3, computed from the counts of true positive (TP), true negative (TN), false positive (FP), and false negative (FN) predictions. By P and N we denote the positive and negative instances ($P = TP + FN$, $N = TN + FP$), and by P′ and N′ the instances for which the output of the classification (correct or incorrect) is the positive and the negative class, respectively ($P' = TP + FP$, $N' = TN + FN$).
In medical diagnosis, the goal is to achieve a high sensitivity, indicating the correct classification of the disease, but also a low false positive rate (FPR) to reduce patient and physician concerns. In addition, it is crucial to achieve a high precision, that is, a high frequency with which a positive diagnosis actually corresponds to a sick patient. If the dataset is balanced, the accuracy metric is valid for evaluating the model; otherwise, the other metrics are more reliable. For example, Receiver Operating Characteristic (ROC) curves [34] are used to select classification models based on their true positive rate (TPR) and FPR values, calculated by varying the decision threshold of the classifier. From the ROC graph, it is possible to derive the Area Under the Curve (AUC), representing the area enclosed under the ROC curve: the closer the AUC value is to 1, the better the classifier.
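For concreteness, the following sketch computes the main indices above with scikit-learn; the labels, scores, and 0.5 decision threshold are illustrative placeholders.

```python
from sklearn.metrics import confusion_matrix, roc_auc_score, roc_curve

# y_true: ground-truth labels; y_score: predicted probability of the positive
# class; y_pred: thresholded predictions (all illustrative placeholders).
y_true = [1, 0, 1, 1, 0, 0, 1, 0]
y_score = [0.9, 0.2, 0.7, 0.4, 0.1, 0.6, 0.8, 0.3]
y_pred = [1 if s >= 0.5 else 0 for s in y_score]

tn, fp, fn, tp = confusion_matrix(y_true, y_pred).ravel()
accuracy = (tp + tn) / (tp + tn + fp + fn)
sensitivity = tp / (tp + fn)                       # true positive rate (TPR)
specificity = tn / (tn + fp)                       # 1 - FPR
auc = roc_auc_score(y_true, y_score)               # area under the ROC curve
fpr, tpr, thresholds = roc_curve(y_true, y_score)  # points of the ROC curve
```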

3.3. Cross-Validation

After training, ML and DL models are tested on new data to assess their performance. In common practice, the dataset is divided into two parts, one used to train the model and the other to test it. This method, called Holdout Cross-Validation, is easy to implement and computationally inexpensive. However, not all of the dataset is used for training, and the split can lead to estimates that are highly dependent on how the data were split. To obtain a reliable estimate of the model’s ability to generalize to previously unseen data, it is better to use other statistical cross-validation techniques. The most commonly used is the k-fold method [35], in which the dataset is divided into k subgroups (or “folds”) of approximately equal size; the model is trained k times, each time using k − 1 folds for training and the remaining fold for validation; finally, the performance of the model is evaluated as the average of the performance on the k folds. Usually, k is set to 5 or 10. The extreme case of k-fold cross-validation is the so-called Leave-One-Out Cross-Validation, where k is equal to the number of instances in the dataset; in this variant, every instance is used once as the validation fold, with all the others used for training.
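A minimal k-fold example with scikit-learn follows; the data are random placeholders and the SVM is chosen only for illustration.

```python
import numpy as np
from sklearn.model_selection import KFold, cross_val_score
from sklearn.svm import SVC

X = np.random.rand(100, 16)           # placeholder feature vectors
y = np.random.randint(0, 2, 100)      # placeholder binary labels

# 5-fold cross-validation: train on 4 folds, validate on the remaining one,
# and report the mean performance over the 5 rotations.
cv = KFold(n_splits=5, shuffle=True, random_state=0)
scores = cross_val_score(SVC(), X, y, cv=cv, scoring="accuracy")
print(scores.mean(), scores.std())
```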

3.4. External Validation

External validation refers to the validation of ML and DL models on sources external to those used during training and testing. In the medical field, such models tend to perform well on data from the same cohort but not as well on new data (from new patients, perhaps from other centers), often due to overfitting or covariate shift [36]. Therefore, to evaluate the application of medical AI, it is necessary to perform such validation [37] on external datasets, either public or private.

4. Public Malaria Datasets

The growing interest in AI applications in the medical field, and in particular for the diagnosis of malaria, has led researchers around the world to collect images of thick and thin blood smears of malaria patients. Many of the data collected are then made public to offer the possibility of using them to develop ML and DL approaches and to compare their performance with other works. The most used public datasets for malaria classification are listed below.
  • The National Institutes of Health (NIH) dataset consists of a set of thick and thin smear images of infected (P. falciparum and P. vivax) and uninfected patients [38] and a set of already segmented data [39]. The latter is the most widely used and contains 27,558 cells segmented from images of thin Giemsa-stained blood films captured with a mobile phone mounted on a microscope. The cells, manually annotated by an expert, are perfectly balanced between healthy and P. falciparum-infected.
  • The MaMic Image Database [40] contains images of blood slides used to identify P. falciparum. These images are captured with a microscope camera that scans the entire slide, obtaining approximately 549 images with a resolution of 1280 × 1024 pixels. A total of 16,991 cells are segmented and manually annotated as definite or uncertain parasites by a trained expert.
  • The MP-IDB dataset [41] contains four different species of malaria parasites, namely P. vivax, P. falciparum, P. ovale, and P. malariae, each with four distinct life cycle stages (ring, trophozoite, schizont, and gametocyte). In total, 48,000 blood cells are extracted from the 229 images of thin Giemsa-stained blood smears, which are then labeled by experienced pathologists.
  • The BBBC041 dataset [42] contains a total of 1364 images manually annotated by three world-class experts. The 80,000 cells contained in the images belong to six distinct classes, two of which are uninfected (RBCs and leucocytes) and four parasitized (gametocytes, rings, trophozoites, and schizonts). In addition to the phase labels, the coordinates of the bounding boxes for each cell are also provided.
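For orientation, a typical input pipeline for the NIH segmented-cell set looks like the following sketch; it assumes the commonly distributed folder layout with one subdirectory per class (e.g., Parasitized/ and Uninfected/), and the path and image size are illustrative.

```python
from torch.utils.data import DataLoader
from torchvision import datasets, transforms

# Resize the variable-sized cell crops to a fixed shape and convert to tensors.
transform = transforms.Compose([
    transforms.Resize((128, 128)),
    transforms.ToTensor(),
])

# ImageFolder infers the labels from the per-class subdirectories.
dataset = datasets.ImageFolder("cell_images", transform=transform)
loader = DataLoader(dataset, batch_size=64, shuffle=True)
```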

5. Results

This section summarizes articles on the use of AI for malaria detection and diagnosis collected from the literature. Of the 606 articles resulting from the initial search, 135 were considered eligible (see Section 2.4 and Figure 4). The organization of these studies is as follows: first, all articles using the public NIH malaria dataset are described; then, the remaining ones are analyzed. Special subsections are then dedicated to articles proposing generalizable approaches also tested on the automatic malaria diagnosis task and articles proposing methods for multiple diagnoses, including malaria.

5.1. Articles Using the NIH Malaria Dataset

For the recognition and classification of malaria on the NIH dataset, several ML and DL approaches (based on custom CNNs or pre-trained networks) have been proposed. In this section, we summarize the retrieved works, dividing them according to the AI method used. The extracted information is summarized in dedicated tables, reporting only ACC and AUC performance because the dataset is balanced; other metrics are reported only where ACC and AUC are not available.

5.1.1. Pre-Trained Networks

For the task of automatic diagnosis of malaria in RBC images, among the most frequently adopted strategies is TL with pre-trained neural networks, which are considered an excellent compromise between performance and computational efficiency. In this scenario, Zhao et al. [43] used SSD300 [44] for object detection in blood smears, developed a resolution upscaling model using a Fast Super-Resolution CNN (FSRCNN) for cases where the resolution of the RBC images is insufficient, and classified them using a modified VGG-16 network. Sriporn et al. [45] used a pre-trained Xception with the Mish activation function [46] and the Nadam optimizer [47]. Kassim et al. [48] proposed the PlasmodiumVF-Net framework for the analysis of thick smear microscopy images for malaria diagnosis at the image and patient levels. They used two Mask R-CNNs to detect candidate RBCs for the presence of the parasite, and then two pre-trained ResNet-50s (one after each previous model) to filter FPs. Finally, they classified the patient based on the number and score of parasites detected in the first step, so that if fewer than two RBCs are found to be parasitized, the patient is labeled as healthy. Sangameswaran et al. [49] developed Mass-AI-Scope (MAIScope), a novel, low-cost, portable device that automatically detects malaria parasites with two subsystems: the first is a pre-trained EfficientNet-lite4 network that detects RBCs, followed by a pre-trained MobileNetV2 that recognizes malaria parasites, while the second is the hardware, consisting of components such as a Raspberry Pi, a camera, a touchscreen display, and an innovative low-cost microscope. Alnussairi et al. [50] classified segmented patches from microscopic images of RBC smears using pre-trained networks such as VGG-19, ResNet-50, and MobileNetV2 trained for a few epochs. Sinha et al. [51] reported an automated CAD method to classify thin smear blood cell images as parasitized or uninfected using a pre-trained ResNet-50. Kundu et al. [52] proposed an intelligent hyperparameter-tuned DL-enabled malaria parasite detection and classification (HPTDL-MPDC) technique for blood smear images, which incorporates data pre-processing, VGG-19-based feature extraction, both a deep ANN with Long Short-Term Memory (LSTM) and a custom CNN for classification, and Adagrad-based hyperparameter tuning.
Extra details on data manipulation, pre-trained network used, and results are given in Table 4.
Staying on the topic of using TL for automatic malaria diagnosis, several papers report comparisons of different networks, both customized and pre-trained, with the pre-trained ones generally proving superior. Rajaraman et al. [55] proposed a system for cell segmentation and classification of infected and non-infected cells, for which they tested custom CNNs and several pre-trained networks, the best of which turned out to be ResNet-50 after hyperparameter optimization with the randomized grid search method. Rahman et al. [56] focused on improving malaria detection from segmented patches of microscopic images of RBC smears by introducing a deep CNN. They tested a custom CNN, a hybrid VGG-16 + SVM solution, an ensemble learning approach, and the pre-trained VGG-16, of which the latter proved to be the best. Jameela et al. [57] developed a Malaria Diagnosis System for automatic diagnosis based on the intensity features of Plasmodium parasites and erythrocytes using the pre-trained ResNet-50, ResNet-34, VGG-16, and VGG-19, the last being the best. Loddo et al. [58] compared 11 existing architectures for the multiclass classification of parasite life stages and evaluated the robustness of the models on two different public datasets. Turuk et al. [59] tested three pre-trained networks, demonstrating the superiority of a modified VGG-19 over all of them. Furthermore, Qadir et al. [60] tested different TL approaches and reported the best results with DenseNet in its variants DenseNet-121, DenseNet-169, and DenseNet-201. Hassan et al. [61] developed a malaria CNN by fine-tuning InceptionV3, ResNet-50, and VGG-19, the last of which had the best classification result. Shewajo et al. [62] proposed an efficient and robust image processing method based on tiles containing the malaria parasite and tested three variants of YOLOv4-based object detectors: a modified version (YOLOv4-MOD) and two versions with a lightweight architecture, YOLOv4-tiny with two detector heads and YOLOv4-tiny-3L with three detector heads. To conduct a performance analysis of DL algorithms in the diagnosis of malaria, Hemachandran et al. [63] compared a custom CNN with MobileNetV2 and ResNet-50, with the latter being the best. Dath et al. [64], after applying noise reduction techniques, image segmentation, and GANs as a DA strategy, implemented a custom CNN, a pre-trained ResNet-50, and a pre-trained VGG-19, of which the last was the best.
Further information on the work just discussed is given in Table 5 to complete the discussion.

5.1.2. Custom CNN

Besides pre-trained networks, the most widely explored solutions for the task of malaria parasite detection are custom CNNs. Many authors, such as Sarkar et al. [67], Masud et al. [68], Umer et al. [69], Alqudah et al. [70], Malhotra et al. [71], Delgado-Ortet et al. [72], Irmak [73], Maqsood et al. [74], Magotra et al. [75], Uzun Ozsahin et al. [76], Hcini et al. [77], Cho et al. [78], Yebasse et al. [79], and Shambhu et al. [80], have proposed custom CNNs involving the types of layers commonly used in CV, with architectures and hyperparameters optimized to achieve the best performance. Some authors have given names to their custom CNNs, for example, Optimized Step-Increase CNN [81], DilationNet [82], Falcon [83], DACNN [84], Mosquito-Net [85], and MozzieNet [86]. In [87], different CNN architectures were generated using Autokeras, an open-source DL framework that identifies the optimal neural network for the classification task on the basis of performance analysis. Custom CNNs with unconventional blocks have also been proposed for malaria parasite classification. For example, Khan et al. [88] proposed an STM-SB-RENet block-based CNN that consists of three split–transform–merge (STM)-based convolutional blocks with the same topology, in each of which modified channel squeezing–boosting (SB) and TL are integrated; finally, fully connected and dropout layers are added. Islam et al. [89] proposed a multiheaded attention-based transformer model with a compact convolutional transformer and visualization of heat maps highlighting the regions most significant for the network. Alonso-Ramírez et al. [90] used a cascade of two networks: CNN-LSTM, which integrates the LSTM algorithm into a CNN, and CNN-BiLSTM, similar to the former but with Bidirectional LSTM (BiLSTM). Amin et al. [91] developed a CAD system, called TL-SGAN, by combining a semi-supervised GAN and TL, in which the model discriminator is replaced by a modified pre-trained VGG-16, while the generator is a custom CNN.
Details of the operations performed on the data, the custom network developed, and how the results were evaluated are given in Table 6.
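To make the structure of such custom CNNs concrete, the following is a minimal sketch in the spirit of the networks above, not a reproduction of any cited architecture; all layer sizes are illustrative.

```python
import torch.nn as nn

# A minimal custom CNN: stacked convolutional and max-pooling layers, then
# dropout and fully connected layers ending in two class scores.
class MalariaCNN(nn.Module):
    def __init__(self):
        super().__init__()
        self.features = nn.Sequential(
            nn.Conv2d(3, 16, 3, padding=1), nn.ReLU(), nn.MaxPool2d(2),
            nn.Conv2d(16, 32, 3, padding=1), nn.ReLU(), nn.MaxPool2d(2),
            nn.Conv2d(32, 64, 3, padding=1), nn.ReLU(), nn.MaxPool2d(2),
        )
        self.classifier = nn.Sequential(
            nn.Flatten(),
            nn.Linear(64 * 16 * 16, 128), nn.ReLU(), nn.Dropout(0.5),
            nn.Linear(128, 2),  # parasitized vs. uninfected
        )

    def forward(self, x):  # expects 128x128 RGB input
        return self.classifier(self.features(x))
```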

5.1.3. Autoencoders

AE models, generally used as image generators or dimensionality reducers, have also been employed as blood cell classifiers. In particular, Fuhad et al. [92] trained a complete AE model in which they then replaced the decoder with a flatten layer followed by two fully connected layers, the last of which is dedicated to binary image classification. Huq et al. [93] proposed an image classification technique based on outlier detection, which exploits an AE model called AnoMalNet; this model is trained entirely on non-infected images, and in the inference phase images are labeled as parasitic if their reconstruction loss is greater than the mean plus three times the standard deviation of the loss obtained on the training dataset.
Table 7 provides information on the works just mentioned.
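The outlier-detection rule described for AnoMalNet can be sketched as follows; the tiny autoencoder and image size are illustrative, while the mean-plus-three-standard-deviations threshold follows the description above.

```python
import torch
import torch.nn as nn

# Illustrative autoencoder: the encoder compresses a 128x128 RGB image to a
# 64-dimensional code and the decoder reconstructs it.
class AE(nn.Module):
    def __init__(self):
        super().__init__()
        self.enc = nn.Sequential(nn.Flatten(),
                                 nn.Linear(128 * 128 * 3, 64), nn.ReLU())
        self.dec = nn.Sequential(nn.Linear(64, 128 * 128 * 3), nn.Sigmoid())

    def forward(self, x):
        return self.dec(self.enc(x)).view_as(x)

def reconstruction_loss(model, x):
    # Per-image mean squared reconstruction error.
    return ((model(x) - x) ** 2).mean(dim=(1, 2, 3))

# After training the AE on uninfected images only:
# train_losses = reconstruction_loss(model, train_batch).detach()
# threshold = train_losses.mean() + 3 * train_losses.std()
# is_parasitized = reconstruction_loss(model, test_batch) > threshold
```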

5.1.4. Hybrid Models

Hybrid models combining ML and DL have also been proposed for the classification of malaria on the NIH dataset, exploiting CNNs as feature extractors and classical ML models as classifiers. Diker [95] optimized the parameters of a deep residual network with the Bayesian method, performed neighborhood component analysis to automatically select features and increase classifier performance, and finally used SVM and KNN to perform the final classification. Raihan et al. [96] used a custom CNN to extract features from images transformed with a 2D wavelet packet transform, selected the effective features using the Whale Optimization Algorithm (WOA), and finally classified the remaining features with XGBoost. Li et al. [97] presented a new hybrid model dubbed RAL-CNN-SVM, which incorporates residual attention learning into a CNN with a ResNet-50 backbone and performs the classification with an SVM with a radial basis function (RBF) kernel. Jones et al. [98] extracted and selected features using a custom CNN and an improved Grey Wolf optimization method, respectively; an SVM with RBF kernel was used as the classifier. Goni et al. [99] combined a custom CNN-based feature extractor with a double-hidden-layer extreme learning machine (ELM) classifier, called DELM. Abubakar et al. [100] tested the combination of six different pre-trained feature extractor networks and four ML models, of which VGG-16 + SVM was the best solution. Dutta et al. [101] introduced a new Barnacles Mating Optimizer (BMO) with Deep TL (DTL) Enabled Biomedical Malaria Parasite Detection and Classification (BMODTL-BMPC) model, which uses the ELM model for the identification and classification of malaria parasites. Alassaf et al. [102] developed an intelligent deep-TL-based malaria parasite detection and classification (IDTL-MPDC) model that performs feature vector extraction with Res2Net, whose hyperparameters were optimally adjusted with the differential evolution algorithm, and uses KNN for classification. Imran et al. [103] merged the features extracted with the pre-trained DarkNet-53 and DenseNet-201 networks, performed dimensionality reduction using the WOA, and applied several ML classifiers, among which Linear Regression (LR) was the best. Madhu et al. [104] developed an innovative approach based on an inception-based capsule network: a diagnostic model incorporating the pre-trained InceptionV3 network to extract features from images and an imperative capsule network to detect the malaria parasite. Qadri et al. [105] proposed the use of the pre-trained Neural Search Architecture Network (NASNet) model to extract spatial features from images, the Random Forest (RF) algorithm to derive class prediction probability features from the spatial ones, and a final SVM classifier. Amin et al. [106] performed feature extraction both manually and automatically using pre-trained ResNet-50 and ResNet-18, selected a portion of these features with the generalized normal distribution optimization method, and performed classification with SVM (linear, quadratic, and cubic) and ANN classifiers (narrow, wide, and medium). Finally, Murmu et al. [107] developed a hybrid Deep-CNN-RF model for malaria parasite detection, combining pre-trained networks with RF.
Table 8 summarizes the hybrid models used to detect and classify malaria parasites on the NIH dataset.
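The common hybrid scheme, a pre-trained CNN as feature extractor feeding a classic ML classifier, can be sketched as follows; ResNet-50 features into an RBF-kernel SVM is one of the combinations reported above, and the data tensors here are placeholders.

```python
import torch
import torch.nn as nn
from torchvision import models
from sklearn.svm import SVC

# Pre-trained CNN with its classification head removed: a feature extractor.
backbone = models.resnet50(weights=models.ResNet50_Weights.IMAGENET1K_V1)
backbone.fc = nn.Identity()
backbone.eval()

with torch.no_grad():
    X_train = backbone(torch.rand(32, 3, 224, 224)).numpy()  # 2048-d features
    X_test = backbone(torch.rand(8, 3, 224, 224)).numpy()

y_train = [0, 1] * 16                 # placeholder labels
clf = SVC(kernel="rbf").fit(X_train, y_train)
predictions = clf.predict(X_test)
```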

5.1.5. Ensemble Learning

The technique of ensemble learning has also been explored for the task of malaria parasite detection and classification. In particular, Marques et al. [108] reported an automatic malaria detection model based on an ensemble of 10 modified pre-trained EfficientNetB0 models. Zhu et al. [109] proposed a pre-trained ResNet-18 backbone used as a feature extractor, with the random vector functional link (RVFL), Schmidt neural network (SNN), and ELM as classifiers; the final result is the ensemble output of the three networks obtained by majority voting. Rajaraman et al. [110] combined the predictions of the VGG-19 and SqueezeNet models to reduce the variance of the model, thus improving its robustness and generalization. A combination of the pre-trained VGG-16, VGG-19, and DenseNet-201 was run two-by-two with the adaptive weighted average method in [111]; in addition, a maximum-vote ensemble technique was applied in combination with the adaptive weighted average ensemble models to reduce the dispersion of the predictions. Finally, Nayak et al. [112] proposed an ensemble AI-enabled IoMT automated diagnosis model to classify malaria-parasitized cells from microscopic images. This model involves two parallel phases: the first is a CBRM annealing snapshot lightweight ensemble learning model created from a combination of three distinct CNN layers, while the second is an ensemble of the three transfer learning models DenseNet-121, ResNet-101, and VGG-19, whose results are averaged. The final result of the two parallel phases is obtained by majority vote.
An overview of the ensemble learning models used for malaria parasite classification performed on the NIH dataset is reported in Table 9.
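Majority voting, the combination rule used by several of the ensembles above, reduces variance by letting independently trained models outvote each other's errors; a minimal sketch with placeholder predictions follows.

```python
import numpy as np

# Hard predictions from three independently trained models (placeholders).
preds_model_a = np.array([1, 0, 1, 1, 0])
preds_model_b = np.array([1, 1, 1, 0, 0])
preds_model_c = np.array([0, 0, 1, 1, 0])

# Majority vote: class 1 wins when at least 2 of the 3 models predict it.
votes = preds_model_a + preds_model_b + preds_model_c
ensemble_pred = (votes >= 2).astype(int)
```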

5.1.6. Classic ML Models

Although DL and hybrid approaches are in vogue both for CV tasks in general and for malaria recognition and classification on the NIH dataset, classical ML approaches have also been proposed. Hossain et al. [113] proposed a system that integrates a variational quantum circuit (VQC) approach to parasite detection with a rule-based expert system to detect malaria types. The optimal ML-based automated malaria parasite detection and classification (OML-AMPDC) model is proposed in [114] for pre-processing, feature extraction using Local Derivative Radial Patterns, and classification of blood smear images with an RF classifier, whose optimal parameters are chosen using the particle swarm optimization algorithm. Phumkuea et al. [115] applied the Information Gain filter to select the most significant features from the pre-processed images and analyzed the histogram of the HSV images using a DT classifier. To improve the accuracy of malaria detection and classification, Telang et al. [116] computed the first four statistical intensity moments for the pixels in each RGB color channel, performed feature selection to reduce the dimensionality of the resulting feature vectors, and classified the data using the RF, KNN, Linear SVM + LR, DT, Kernel SVM, and Naïve Bayes (NB) approaches.
More details on the results just discussed are given in Table 10.
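As an illustration of these feature-based pipelines, the following sketch computes the first four statistical intensity moments per RGB channel, in the spirit of Telang et al. [116], and feeds them to an RF classifier; the images and labels are placeholders.

```python
import numpy as np
from scipy.stats import skew, kurtosis
from sklearn.ensemble import RandomForestClassifier

def color_moments(img):
    """Mean, standard deviation, skewness, and kurtosis of the pixel
    intensities in each RGB channel: a 12-dimensional feature vector."""
    feats = []
    for c in range(3):
        channel = img[..., c].ravel().astype(float)
        feats += [channel.mean(), channel.std(),
                  skew(channel), kurtosis(channel)]
    return feats

# Placeholder data; in practice the inputs are segmented cell images.
images = [np.random.randint(0, 256, (64, 64, 3)) for _ in range(20)]
labels = [0, 1] * 10
X = [color_moments(img) for img in images]
clf = RandomForestClassifier(n_estimators=100).fit(X, labels)
```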

5.1.7. Mobile Applications

The aim of some works was to develop mobile applications supporting rapid, efficient, and cost-effective malaria diagnosis as a support tool in less developed areas. In this sense, Yu et al. [117] designed an Android mobile application called Malaria Screener that consists of a slide screening module with three sequential sub-modules for acquisition, parasite detection, and display of results; a data management module that stores images and the corresponding metadata acquired during slide screening sessions; and finally, a data upload module that transfers local data to an online repository for record keeping and further training of the system. Yang et al. [118] developed an image processing method that searches for parasite candidates using an intensity-based iterative global minimum screening and classifies them using a custom CNN.
Table 11 summarizes the results of mobile applications for malaria diagnosis on the NIH dataset.

5.2. Articles Using Other Public and Private Datasets

In this section, we review works proposing AI methods for the automatic diagnosis of malaria using private datasets or public non-NIH datasets. We summarize the results by distinguishing between works developing DL approaches, works developing classical ML approaches, and those developing web or mobile applications. The information extracted from the various articles is reported in dedicated tables that also highlight the specific task under consideration.

5.2.1. DL Approaches

This review considers recent research on the development of automatic image-based malaria diagnosis applications. As this is a CV task, DL approaches are the most commonly used. Hung et al. [119] proposed the identification of RBCs and the recognition of their stages in bright-field microscopic images of malaria-infected blood using a pre-trained Faster Region-based Convolutional Neural Network (Faster R-CNN) followed by an AlexNet. A focus-stacking approach to the automated quantitative detection of the malaria parasite Plasmodium falciparum was proposed by Gopakumar et al. [120]: once the best-focused images are chosen for each field of view (FoV), an SVM trained on the statistical and structural features extracted from the positions of suspected parasites (A) is compared with a CNN trained on patches surrounding the suspected positions (B) and with a CNN trained on the patch stack surrounding the suspected positions (C). Jagtap et al. proposed a Cuckoo Search-Based Ensemble Classifier (CSEC) [121] to overcome the limitations of the monoclassifier strategy in the detection of malaria-infected erythrocytes. For the identification of malaria parasites in thick blood smears, Pattanaik et al. proposed a CAD system that includes an ANN with two stacked models for malaria parasite detection, followed by a softmax classifier for binary classification [122]. The same authors also conducted other experiments. For example, in [123], they presented a deep filter bridge combining a multi-rolling stacked denoising AE, fisher vectors, and ELM to automatically classify the different types of single cells in microscopic blood smear images as either infected or uninfected. In addition, for the automatic classification of microscopic blood smear images at multiple magnifications, they [124] proposed an effective multi-enhanced deep residual neural network (MM-ResNet) consisting of ResNets with different inputs, the results of which are concatenated and then classified using a dense layer. For rapid automated diagnosis of malaria, Manescu et al. [125] applied Extended Depth-of-Field (EDoF) to thick film microscopy and developed an EDoF-CNN that can rapidly compute EDoF images from z-stacks while improving the spatial resolution of the resulting image. The same authors also proposed the Deep Malaria Convolutional Neural Network (DeepMCNN) classifier [126], which performs object detection with a RetinaNet and binary classification of stacks at the patient level using a pre-trained VGG-19; total malaria parasite (MP) and WBC counts are also performed, allowing for the estimation of parasitemia in MP/μL, as recommended by the WHO. A low-cost, portable digital microscope scanner capable of both bright-field and fluorescence imaging was developed by Holmström et al. [127]: two supervised DL algorithms, called DLS1 and DLS2, are trained to detect P. falciparum parasites; the first network detects all RBCs, and in a second step the two networks in parallel determine whether the RBCs are healthy or diseased. Vijayalakshmi [128] proposed a pre-trained VGG with the final layers replaced by an SVM classifier with RBF kernel. Pardede et al. [129] implemented the RetinaNet network with ResNet-101 as a backend to detect normal and infected erythrocytes in thin blood smear images; the detected objects were counted per label and malaria index values were calculated to perform a diagnosis of malaria. Nakasi et al.
[130] tested three networks, identifying Faster R-CNN as the best for classification and SSD as a possible best solution for mobile implementation. The main objective of the project by Chowdhury et al. [131] was to create a low-cost CNN-based automated system to identify and count blood cells (RBCs, platelets or thrombocytes, and different types of WBCs) and also detect malaria infection in a digitized blood smear image. The proposed Modified Yolo network is a smaller, optimized version of the YOLO network that provides faster automatic detection in hardware implementations without compromising performance too much. Lebel et al. [132] demonstrated that DL allows for both the life-stage classification of malaria parasites and the accurate quantification of parasitemia in unlabeled images obtained by bright-field microscopy of live, unstained RBCs. To do this, a standard optical microscope with visible and near-UV illumination and a microscope with deep-UV illumination were used, exhibiting performance superior to that of manually scored Giemsa-stained smears evaluated by experts. The workflow involved semantic cell segmentation with ResNet-50, post-processing of the resulting masks, and parasite stage classification using GoogLeNet. To overcome the strong imbalance between classes, they used a weighted cross-entropy loss function that introduces a normalization term for the weights based on the fractional representation of each class. To automatically detect and quantify blood cells at different life stages of the malaria parasite, Loh et al. [133] used a pre-trained Mask R-CNN that returns binary masks and bounding coordinates; in addition, they performed segmentation and cell counting and then calculated the ratio between infected and healthy cells. Li et al. [134] developed a malaria stage recognition approach for blood smear images using a deep transfer graph convolutional network (DTGCN) consisting of a feature extractor based on the pre-trained ResNet-50, a source transfer graph construction component, and an unsupervised GCN. Molina et al. [135] proposed the use of the pre-trained VGG-16 model for the differentiation of RBCs, both normal erythrocytes and erythrocytes with other types of inclusions that have a parasite-like morphology. Davidson et al. [136] performed RBC detection with a pre-trained Faster R-CNN with a ResNet-50 backbone, cell differentiation into infected/uninfected with a pre-trained ResNet-50, and parasite life stage categorization with a pre-trained ResNet-34. For the automated diagnosis of malaria, Ufuktepe et al. [137] proposed the Channel-wise Feature Pyramid Network for Medicine (CFPNet-M)—Detection, Extraction, and Counting (CFPNet-M-DEC), in which the cell detection module includes a modified CFPNet-M network that takes two input images and returns the truncated distance transformation for each, whereas the cell extraction and counting module is based on a modified CFPNet with a two-channel input, one for each output of the previous module. To perform automatic P. falciparum screening on thick blood smears, Abdurahman et al. [138] modified the pre-trained YOLOv3 (YOLOv3-MOD1 and YOLOv3-MOD2) and YOLOv4 (YOLOv4-MOD) to improve their ability to detect small objects by extending the feature scales and adding more detection levels; YOLOv4-MOD turned out to be the best. Yang et al. [139] proposed a system for the segmentation and classification of malaria cells.
The parasite segmentation is performed using a modified U-Net (U-Net/VGG-19) with a VGG-19 encoder and a customized loss function based on the Jaccard index; the largest parasite detected in each image is then classified by a Light-Net. Jabbar et al. [140] compared three different CNNs, with/without DA and with/without noise reduction pre-processing, the best of which has five layers and uses a connected convolution architecture. For a quick classification of RBCs, Preißinger et al. [141] proposed a system that, after choosing the microscopy technique for acquisition and performing some pre-processing operations, selects the RBC cross-sections that best capture the features associated with the presence/absence of the parasites and their life stage, and classifies them using a custom CNN. The localization, detection, and counting of parasites and WBCs were performed by Nakasi et al. [66] using the two pre-trained models Faster R-CNN with a ResNet-101 backbone and Single Shot Multibox Detector (SSD) with MobileNetV2; while the former had better performance, the latter was more suitable for mobile applications. For the life cycle classification of P. vivax, Sengar et al. [142] designed a method that combines four modules, namely image acquisition, data augmentation, parasitic data generation with a GAN, and classification with the vision transformer (ViT) model; the generation of additional training samples increased the robustness of the resulting model. Arshad et al. [143] proposed a method to locate Plasmodium parasites by segmentation with U-Net, whose binary masks are refined with the watershed method, and to classify them in two stages: binary classification followed by multiclass classification with ResNet50V2. For pathogen detection in microscope images of thick blood smears, Koirala et al. [144] developed a customized YOLO-mp architecture by modifying the pre-trained YOLO architecture with three and four levels (YOLO-mp-3l and YOLO-mp-4l). Acherar et al. [145] constructed an image dataset of P. falciparum-infected and non-infected blood components and evaluated different networks for their classification: after segmenting the cells using the watershed algorithm, a custom CNN and the pre-trained VGG-19, ResNet-50, and EfficientNetB7 networks were compared, with the last being the best. Meng et al. [146] studied a multi-stage malaria parasite recognition problem, proposing a novel Neighbor Correlated Graph Convolutional Network (NCGCN) that extracts image representations using a ResNet-50, uses them to formulate the correlated relationships between the samples by combining the KNN and epsilon-radius graph construction methods, and finally performs the classification with a two-layer Graph Convolutional Network (GCN). To determine infected cells in thin whole blood smear images, Sukumarran et al. [147] implemented and compared the three object detectors YOLOv4, modified Faster R-CNN, and modified SSD-300, of which the first was the best for end-to-end identification and localization of malaria-infected cells. Katharina et al. [148] proposed the Malaria Stage Classifier, which performs the identification of individual RBCs, one-dimensional cross-section feature extraction, and the classification of malaria blood stages with a custom CNN. Liu et al. [149] developed an AI-based object detection system for malaria diagnosis, called AIDMAN, that combines YOLOv5 for cell recognition and a CNN with an attentional alignment model for cell classification.
This model was optimized by changing the number of heads, achieving the best performance with a value of 2. For the automatic identification of Plasmodium species and their developmental stages in thin blood smears, Wang et al. [150] used the YOLOv7 network with the Cross Stage Partial Network as a backbone and the PAN algorithm for multiscale detection and feature fusion. Fasihfar et al. [151] proposed a cell segmentation approach with subsequent classification using custom six-layer CNNs. Fu et al. [152] used the ResNeSt-50 attention mechanism, applying BYOL self-supervised learning to pre-train the network with positive samples and alleviate the problem of insufficient labeled data. To detect malaria parasites with high efficiency and accuracy, Zedda et al. [153] proposed a modified version of the pre-trained YOLOv8m architecture, called YOLO-PAM. With a trial-and-error approach, Saxena et al. [154] developed five DL models, of which the best was a custom CNN with four convolutional layers, no dropout layers, and a categorical output. Guemas et al. [155] proposed a real-time detection transformer (RT-DETR) object detection algorithm for discriminating Plasmodium species; the proposed architecture consists of a backbone, an efficient hybrid encoder, and a transformer decoder with auxiliary prediction heads. Ilyas et al. [156] simultaneously addressed domain variation, life cycle, and magnification with a new encoder–decoder architecture with a multiclass focus for robust malaria diagnosis. In particular, they used the Fourier Adaptive Recognition System (FARS), integrating adversarial training and color Domain Aware Fourier Domain Adaptation to normalize staining in histopathology, and the Segment Anything Model as a binary segmentation model. This integrated approach offers a robust and reliable solution for cross-domain, multi-stage, and multi-magnification malaria parasite recognition.
An overview of the articles just summarized is given in Table 12.
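The class-weighted cross-entropy used by Lebel et al. [132] to counter stage imbalance can be sketched as follows; the class counts and the exact weighting scheme here are illustrative assumptions.

```python
import torch
import torch.nn as nn

# Class weights based on the fractional representation of each class:
# rarer classes receive larger weights (counts are illustrative).
class_counts = torch.tensor([8000.0, 1200.0, 500.0, 300.0])
weights = class_counts.sum() / class_counts
weights = weights / weights.sum()        # normalize the weights

criterion = nn.CrossEntropyLoss(weight=weights)

logits = torch.randn(16, 4)              # placeholder model outputs
targets = torch.randint(0, 4, (16,))     # placeholder stage labels
loss = criterion(logits, targets)
```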

5.2.2. Classic ML Approaches

A minority of works have proposed classical ML approaches to automatic malaria diagnosis, staging, and parasite counting. For example, Molina et al. [166] developed an ML system able to discriminate parasitized RBCs not only from normal RBCs but also from RBCs with other erythrocyte inclusions. After an initial pre-processing and segmentation phase, they extracted color and texture features from each component in different color spaces. The proposed system is composed of three sequential modules: the first recognizes normal RBCs against RBCs containing inclusions (including malaria), the second identifies whether the inclusion corresponds to malaria or not, and the third discriminates between Howell–Jolly bodies, Pappenheimer bodies, basophilic stippling, and platelets. The result of the overall system is the automatic classification of RBC images into one of six classes. SVM, KNN, Linear Discriminant Analysis (LDA), RF, and Gaussian NB models were tested at each stage, and the best model together with the most appropriate number of features was chosen on the basis of the ACC: SVM with the 7 most relevant features for the first module, LDA with the 610 most relevant features for the second module, and LDA with the 700 most relevant features for the third module. Abbas et al. [167] developed an automated approach to the detection, counting, and classification of erythrocytes infected with the malaria parasite at different life cycle stages. From the original images, cell segmentation is performed, followed by the detection of clumps and overlapping RBCs, which are then separated. Once the individual RBCs are obtained, the infected ones are detected by assessing the presence of “child” borders within their outer edges, and automatic counting is then easily implemented; the results of the automatic count are compared with those obtained manually by experts using Pearson’s correlation coefficient. For the classification of the malaria parasite’s life cycle stage (ring, trophozoite, schizont, gametocyte) on the basis of histograms of oriented gradients (HOGs) and local binary patterns (LBPs), the multiclass KNN, NB, and SVM models are used, of which SVM is the best. The automation of the malaria diagnosis process through ML techniques was pursued by Sadiq et al. [168], who followed a pipeline consisting of several steps: image pre-processing to improve illumination and reduce noise; cell segmentation and categorization using the K-means clustering algorithm; extraction of texture and morphology features with subsequent selection of those with a high covariance between the two classes; and finally, classification using DT. Kudisthalert et al. [169] proposed an automated malaria parasite counting and classification system for Giemsa-stained thin-film images. Morphological image processing was used to clean up noise and reconstruct object structures, and the Hough transform was used to segment RBC objects. Color features are concatenated with those extracted using a pre-trained AlexNet and then classified with an improved version of ELM called Weighted Similarity ELM (WELM), implemented with a Euclidean kernel and used in place of the last layer of a Multi-Layer Perceptron (MLP). Poostchi et al. [170] developed an end-to-end automatic detection system to identify and quantify P. falciparum malaria parasites in thin Wright–Giemsa-stained blood smears from both humans and mice.
The image processing pipeline comprises an initial phase of cell segmentation, followed by a phase of extraction, selection, and dimensionality reduction of color and texture features, and, finally, a phase of cell classification using linear SVM and ANN classifiers. Das et al. [171] proposed an automated system for the characterization and classification of P. vivax- and P. falciparum-infected stages using Leishman-stained microscopic images of thin blood smears. After segmenting the images with the watershed algorithm, the authors extracted texture and morphological features, selected only the most relevant ones, and finally classified them with different models, among which MLP was the best. To recognize the parasite species between P. falciparum and P. vivax and assess their staging, Aris et al. [172] performed a standardized segmentation framework using thresholding and grouping; extracted size, shape, texture, and color features from the thick smear, as well as size, shape, and color features from the thin smear; and finally designed a multistage RF classifier with 32 trees. The work of RK et al. [173] focused on the design of a new statistical image feature processing model that aims to negate the impact of the brightness and illumination variation caused by the lack of standardization in microscopic settings, leading to increased performance in malaria classification. After segmenting the erythrocytes, texture features are extracted using GLCM, and a non-linear regression model is used to identify the optimum brightness factor (OBF) specific to a selected statistical feature. The feature correction factor (CF) is estimated based on the OBF, and classification is performed using SVM. The significant increase in performance with the use of CF is evidenced by p-value analysis.
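To make the recurring structure of these pipelines concrete, the following minimal sketch illustrates the segmentation–features–classifier pattern with GLCM texture descriptors and a linear SVM, implemented with scikit-image and scikit-learn. The synthetic patches, the chosen GLCM properties, and all parameter values are illustrative assumptions, not the exact configuration of any cited work; the HOG/LBP and K-means variants described above differ only in the feature extractor or the segmentation step.

```python
import numpy as np
from skimage.feature import graycomatrix, graycoprops
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split
from sklearn.svm import SVC

def glcm_texture_features(cell_gray):
    """Compute GLCM texture descriptors for one segmented RBC patch
    (a 2D uint8 grayscale array)."""
    glcm = graycomatrix(cell_gray, distances=[1], angles=[0, np.pi / 2],
                        levels=256, symmetric=True, normed=True)
    props = ("contrast", "homogeneity", "energy", "correlation")
    return np.hstack([graycoprops(glcm, p).ravel() for p in props])

# Synthetic stand-in data so the sketch runs end to end; in practice, these
# would be segmented RBC patches and their expert 0/1 annotations.
rng = np.random.default_rng(0)
cells = [rng.integers(0, 256, size=(64, 64), dtype=np.uint8) for _ in range(60)]
labels = rng.integers(0, 2, size=60)

X = np.array([glcm_texture_features(c) for c in cells])
X_tr, X_te, y_tr, y_te = train_test_split(X, labels, test_size=0.2,
                                          stratify=labels, random_state=0)

clf = SVC(kernel="linear").fit(X_tr, y_tr)   # linear SVM classifier
print("ACC:", accuracy_score(y_te, clf.predict(X_te)))
```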
Table 13 provides an overview of the work using classic ML approaches to malaria automatic diagnosis.

5.2.3. Mobile and Web Applications

Several web and smartphone applications have been proposed for the rapid and effective diagnosis of malaria. The design and development of a web-based distributed health management system for the automatic diagnosis of malaria was proposed by Maity et al. [175]. The proposed methodology includes image pre-processing; erythrocyte segmentation; extraction and selection of color, morphology, and texture features; and automatic classification using NB, Instance-Based Learning (IB1), and C4.5 DT individually, of which the last one, coupled with Correlation-based Feature Selection, was the best in terms of performance. The primary entities of the proposed system are the central server; the local servers (indirectly connected to each other through the central server); the database, organized in two separate modules (the local database, which stores local health data for patients visited in the local hospital, and the central database, which periodically retrieves the local databases from all connected local centers and combines them into one centralized database); and the network connectivity. Chibuta et al. [176] developed an image processing pipeline that runs in real time on low-cost devices (mobile phones and desktop computers with basic specifications). By mounting a mobile phone or camera on an optical microscope and using a modified YOLOV3 detection algorithm, they proposed a malaria diagnosis system for low-resource settings. The network was trained using images resized to 224 × 224, 544 × 544, and 800 × 800 pixels, achieving the best results with the largest size. Aiming to develop a new automated, low-cost, mobile-based diagnostic system to identify the P. falciparum species at the ring development stage, Oliveira et al. [177] combined digital image processing techniques and a learning process based on AI algorithms. The pre-processing phase is aimed at identifying RBCs and limiting the algorithm’s search space, and classification is then performed by creating DTs called classifier cascades, with 10, 15, or 20 stages. The result of the training was a degenerative DT with two classes that worked best using the 20-stage cascade. To use both local and remote resources, Chen et al. [178] designed a diagnostic pipeline implemented in a cloud-edge paradigm. The idea is that, after the datasets sent by the hospitals are collected, pre-processing with automatic color equalization and classification using a supervised U-Net are performed, and the generated coarse probability maps and the normalized images are then uploaded to the cloud server. The maps are used to weakly supervise the stacked dilated U-Net (SDU-Net), which performs segmentation by replacing the standard second convolution module in each encoding/decoding operation with multiple dilated convolutions and by concatenating all convolution outputs, instead of a single second convolution output, as input to the next level. Finally, the classification of parasite species and stage is performed with a pre-trained MobileNetV1, and the result is immediately sent back to the original local hospital through the cloud. Blockchain technology is adopted during the data transmission process. To automate the malaria diagnostic process, Maturana et al. [179] developed a system in which a prototype of 3D-printed parts is designed for the robotization of conventional light microscopy, capable of automatically focusing the sample and tracing the entire slide, and a pre-trained YOLOV5x is used for the detection of Plasmodium parasites. 
The whole system, which also performs the calculation of parasite density in thick blood smear samples, was integrated into the iMAGING smartphone application. More details about the mobile and web applications for malaria detection are summarized in Table 14.
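As a schematic illustration of how such systems ship a trained model to low-cost devices, the sketch below converts a small Keras CNN to TensorFlow Lite with post-training dynamic-range quantization. The toy architecture, input size, and file name are assumptions made for illustration and do not reproduce the networks used by the cited applications.

```python
import tensorflow as tf

# Toy stand-in for a trained malaria-patch classifier; in practice, the
# actual trained model would be loaded here instead.
model = tf.keras.Sequential([
    tf.keras.layers.Input(shape=(128, 128, 3)),
    tf.keras.layers.Conv2D(16, 3, activation="relu"),
    tf.keras.layers.MaxPooling2D(),
    tf.keras.layers.Conv2D(32, 3, activation="relu"),
    tf.keras.layers.GlobalAveragePooling2D(),
    tf.keras.layers.Dense(1, activation="sigmoid"),  # P(parasitized)
])

# Post-training dynamic-range quantization shrinks the model and speeds up
# on-device inference, which matters on entry-level smartphones.
converter = tf.lite.TFLiteConverter.from_keras_model(model)
converter.optimizations = [tf.lite.Optimize.DEFAULT]
with open("malaria_classifier.tflite", "wb") as f:
    f.write(converter.convert())
```

The resulting file can then be executed offline by the TFLite interpreter embedded in a mobile application, which is one of the practical requirements for deployment in low-resource settings.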

5.3. Articles on Generalizing Malaria Diagnosis

This section is dedicated to summarizing all those studies that show the generalizability of the approaches proposed for the malaria dataset. First of all, we review studies that were not designed for a specific dataset or for the automatic diagnosis of a specific disease, but rather for an overall view, and that show their ability to generalize to different datasets including those of malaria. For example, Horstmeyer et al. [180] presented a framework to jointly optimize the physical parameters of a microscope and the CNN weights used to classify the images previously generated. Muthumbi et al. [181] proposed a method to improve the image classification performance of a standard microscope by adding a simple LED array and optimizing its illumination pattern within an improved deep CNN. Hung et al. [182] created a programming interface called Keras R-CNN that implements DL techniques for object detection, showing its potential for the identification and classification of a large number of cells. Batch and online variational learning were studied by Manouchehri et al. [183] through the comparison of four algorithms, the best of which was the online variational multivariate Beta mixture model (OVMBMM). In their work, Yao et al. [184] demonstrated a multi-lens microscopic imaging system that superimposes multiple independent FoVs on a single sensor for automated sample analysis with high efficiency. Schwarz Schuler et al. [185] proposed an improved scheme to reduce the complexity of point convolutions in deep CNNs for image classification based on interleaved grouped filters without divisibility constraints. Gupta et al. [186] presented an innovative framework, called SimSearch, for quick and easy user-guided training of a DL model for rapid ROI detection in large-scale microscopy experiments. A new DL approach called IMNet, which combines multiple incremental modular CNNs, was proposed by Ali et al. [187]; this approach is computationally efficient for exploiting small data for learning generalizable, domain-invariant representations in various medical imaging applications. Musaev et al. [188] proposed a CNN ensemble called ICNN-Ensemble based on the representation of high-resolution image channels and a systematic model dropout ensemble that allows for the selection of possible members. In their research, El-Bana et al. [189] evaluated the efficiency of wavelet pooling (WP) in lightweight MobileNets architectures, showing that the use of WP achieves average data savings of more than 30% compared to traditional pooling techniques.
Additional information on the methods just discussed can be found in Table 15.
In addition to these results, some of the articles that emerged from the research conducted for the present review are focused on the diagnosis of diseases other than malaria, but they report tests of the proposed models on the detection of parasitized cells. Ferreira et al. [190] aimed to distinguish five different cancer types through the RNA-Seq datasets of the thyroid, skin, stomach, breast, and lung using a pre-trained AE bound to the upper layers of a CNN. To demonstrate the generalizability of the proposed approach, they performed tests for the classification of malaria and breast cancer. The effective and early detection of acute lymphoblastic leukaemia (ALL) using CAD was proposed by Abd El-Ghany et al. [191], who developed a classification model based on EfficientNet-B3. This model was also used to examine microscopic blood images to identify the malaria parasite.
See Table 16 for further details.

5.4. Articles on Multiple Diagnoses

The results of the research conducted for this review show that some articles are not only focused on the identification and classification of malaria, but also propose valid approaches to the automatic analysis of different diseases/conditions, including malaria. Table 17 summarizes the details of this work, including the objective of the study, the analysis conducted, the proposed approach, the dataset used for malaria classification, and the results obtained.

6. Discussion

Malaria is a disease that affects millions of people worldwide, with a considerable mortality rate. The early identification and diagnosis of malaria is essential to determine the best treatment for the patient and to increase the survival rate. The light microscope examination of blood smears is the gold standard for detecting infection by malaria parasites, but it proves to be time-consuming and difficult. By using CAD systems, this procedure can become much easier, faster, and more accurate. This systematic literature review aimed to provide an overview of the development of CAD systems for the automatic identification and diagnosis of malaria. Scientific publications released between 2014 and 2024 related to ML and DL approaches to the detection and classification of the malaria parasite were selected. The searches, conducted in the PubMed, Scopus, and arXiv databases, resulted in the selection of 135 research articles that focused on malaria detection and classification using histological images and reported the results obtained in terms of model performance. Based on the data collected from the selected articles, we conducted a qualitative and quantitative analysis and highlighted the main critical issues found.

6.1. Qualitative and Quantitative Analysis

Although the search included publications since 2014, the first study reporting AI approaches applied to the automatic diagnosis of malaria was published in 2015, and more than half of the selected articles were published between 2022 and 2024. Interestingly, almost 86% of the selected articles were published within the past five years. In addition, over half of the ML articles were published by 2021, whereas over half of the articles using DL, including hybrid and ensemble learning models, were published from January 2022 to February 2024. The trend of recent publications on the use of CAD systems for the automatic diagnosis of malaria is shown in Figure 5.
To provide as complete an overview as possible of the applications of AI to malaria diagnosis, we have included in this systematic review both articles that used public datasets and those that exploited private datasets. Slightly more than 20% of the articles analyzed used non-publicly available datasets for training the automated models; among the remaining articles, the NIH dataset was the most frequent, being used in more than 50% of all articles (69.16% of those using public datasets). A graphical representation of this analysis is provided in Figure 6.
It is evident that the NIH dataset is the most used, both among the public datasets and overall; hence, we provide a more detailed analysis of the AI approaches involving it in the training phase. Concerning the articles summarized in Section 5.1, the following statistics are derived: DL techniques were used in 94% of cases, including pre-trained networks (28.13%), custom CNNs (43.75%), hybrid models (20.31%), and ensemble learning (7.81%). In general, the most commonly used pre-trained network is VGG, and in particular VGG-19. In hybrid approaches, SVM is the most popular ML model. Solutions based entirely on ML models are proposed with an overall percentage of 6%, suggesting that their simplicity is not particularly suited to the task of malaria classification. Figure 7 shows the results of the analysis conducted.
As previously introduced, since the NIH dataset is balanced between the classes of infected and non-infected RBCs, we report in the summary tables of Section 5.1 only the performance in terms of ACC and AUC. However, in Table 18 we also list the other performance metrics used by the authors of the cited articles to provide a possible comparison for those interested.
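For readers who wish to reproduce such comparisons, the following short sketch computes the metrics most frequently reported in Table 18 using scikit-learn; the label and score arrays are invented stand-ins for a model’s outputs on a held-out test split.

```python
import numpy as np
from sklearn.metrics import (accuracy_score, f1_score, precision_score,
                             recall_score, roc_auc_score)

# Illustrative ground-truth labels (1 = parasitized) and predicted scores;
# in practice, these come from the trained model on the test set.
y_true = np.array([1, 0, 1, 1, 0, 0, 1, 0])
y_prob = np.array([0.9, 0.2, 0.7, 0.4, 0.1, 0.3, 0.8, 0.6])
y_pred = (y_prob >= 0.5).astype(int)             # default decision threshold

print(f"ACC: {accuracy_score(y_true, y_pred):.4f}")
print(f"AUC: {roc_auc_score(y_true, y_prob):.4f}")   # needs scores, not labels
print(f"PRE: {precision_score(y_true, y_pred):.4f}")
print(f"SEN: {recall_score(y_true, y_pred):.4f}")    # sensitivity = recall
print(f"F1 : {f1_score(y_true, y_pred):.4f}")
```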
Although the NIH dataset has been used in many applications, especially in recent years, some authors have shown that segmented data contain annotation errors. In particular, Fuhad et al. [92] had the images evaluated by an expert, who eliminated 647 falsely labeled parasites and 750 non-infected suspects. This modified dataset has been publicly shared, and several authors have already used it in their studies. If these errors were confirmed, most of the results published so far in the field of automatic malaria diagnosis would be called into question. Therefore, further investigations should be carried out.

6.2. Validation Analysis

One of the critical aspects to consider when evaluating automated ML and DL systems is their ability to generalize to data never seen before, coming from sources other than the training data. Only in this way is it possible to estimate the extent to which these models are applicable in the real world. Therefore, so-called external validation is crucial, especially in the medical field. The analysis of the articles selected for this paper shows that only 22 of the 120 articles focusing solely on the diagnosis of malaria (18.33%) perform this validation. Furthermore, to enhance the generalizability of the proposed models, it would be appropriate to test them on public datasets so that performance can be compared with that obtained by other researchers, a key point of scientific research.
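The protocol is simple to express in code: train on one source and report performance on a second, independently acquired source. In the sketch below, `load_features` is a hypothetical helper standing in for any feature extraction pipeline, the dataset names are placeholders, and logistic regression is only an example classifier.

```python
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import roc_auc_score

# Hypothetical helper: returns a feature matrix X and a label vector y for a
# named dataset. External validation requires that the two sources differ
# (e.g., laboratory, scanner, or staining protocol).
X_train, y_train = load_features("training_source")
X_ext, y_ext = load_features("independent_external_source")

model = LogisticRegression(max_iter=1000).fit(X_train, y_train)

# The internal score is typically optimistic; the external score is the more
# realistic estimate of real-world behavior.
print("internal AUC:", roc_auc_score(y_train, model.predict_proba(X_train)[:, 1]))
print("external AUC:", roc_auc_score(y_ext, model.predict_proba(X_ext)[:, 1]))
```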

6.3. Explainability

To introduce CAD systems into clinical and diagnostic practice, it is necessary to be able to explain the decisions made by AI models. While ML algorithms are generally transparent due to their simplicity, DL algorithms, and in particular CNNs, are opaque in their decision-making due to the high complexity of the networks [197]. For this reason, such models are also called “black boxes”. In an attempt to explain and interpret the reasoning behind the decisions of DL models, reference can be made to the emerging XAI techniques, which in CV aim to generate heat maps that highlight the regions and pixels of the image that contribute the most to the final prediction. The most commonly used XAI techniques are Class Activation Mapping (CAM) and its gradient-weighted version (Grad-CAM) [198], as well as SHapley Additive exPlanations (SHAP) [199]. These techniques help to increase the confidence of medical experts in CAD systems by providing a visual explanation of automatic diagnosis models. The analysis of the papers selected for this systematic review shows that only a small percentage of the published articles, i.e., 10%, used XAI methods. These papers are shown in Table 19, distinguishing between those that used the NIH dataset and those that exploited other data.
XAI methods are still poorly integrated into CAD systems for malaria diagnosis, but it is worth remembering that their use is crucial in all fields where important decisions are required, such as medicine.
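As an indication of how lightweight such an analysis can be, the following sketch implements the core of Grad-CAM for a Keras CNN, following the standard formulation [198]: gradients of the class score are averaged over the spatial dimensions to weight the last convolutional feature maps. The layer name passed by the caller is an assumption, and the commented call at the bottom is purely illustrative.

```python
import tensorflow as tf

def grad_cam(model, image, last_conv_layer_name):
    """Return a normalized Grad-CAM heat map for a single preprocessed
    input `image` of shape (1, H, W, C)."""
    grad_model = tf.keras.Model(
        model.inputs,
        [model.get_layer(last_conv_layer_name).output, model.output],
    )
    with tf.GradientTape() as tape:
        conv_out, preds = grad_model(image)
        score = preds[:, 0]                     # score of the predicted class
    grads = tape.gradient(score, conv_out)      # d(score) / d(feature maps)
    weights = tf.reduce_mean(grads, axis=(0, 1, 2))   # average-pool over space
    cam = tf.nn.relu(tf.reduce_sum(conv_out[0] * weights, axis=-1))
    return (cam / (tf.reduce_max(cam) + 1e-8)).numpy()

# Illustrative call (names assumed): upsampled to the input size, the map
# highlights the pixels driving the parasitized/uninfected decision.
# heatmap = grad_cam(trained_cnn, rbc_patch, "last_conv")
```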

6.4. Mobile and Web Applications

Malaria is a widespread disease throughout the world, but its mortality rate is higher in less developed areas, a fact linked to the reduced availability of instrumental and human resources. Therefore, mobile applications running on smartphones and web applications with simple graphical interfaces are a useful tool in these disadvantaged situations. In Section 5.1.7 and Section 5.2.3, mobile and web applications supporting the rapid and cost-effective diagnosis of malaria are discussed. Although this is not the primary focus of other works, many have nevertheless developed solutions for the online and offline diagnosis of malaria. Table 20 shows the works that contribute to this area, distinguishing between mobile and web applications.
To introduce these mobile and web-based systems in real-world settings, such as underdeveloped countries where malaria is a serious health problem, some considerations have to be made regarding their feasibility. Feasibility depends heavily on the type of device required (type of smartphone/computer or specific equipment), the possibility of working without an internet connection, and the availability of intelligent instruments that perform well with heterogeneous, uncleaned data, under different acquisition conditions (type of instrumentation, setting, and lighting), and with different sample staining. In addition, CAD systems in general, and those designed for digital pathology as in this case, must achieve a performance that is not inferior to that of experts and, ideally, superior to it in order to be considered suitable for diagnostic routines [200].

6.5. Future Directions of AI Approaches to Malaria Diagnosis

The histological evaluation of tissue samples remains a key tool for the assessment of different types of diseases, including malaria infection. However, this examination is time-consuming and tiring for operators, limitations that prompted researchers to investigate the effectiveness of AI techniques in this field. Although ML and DL techniques have shown exciting results in histopathological diagnosis by reducing diagnosis time and achieving high performance, uncertainties remain as to the current feasibility of their application in clinical practice due to several gaps to be filled, which include the absence of standardized image acquisition/storage/analysis protocols, the validation of AI models in real-world settings, and verification by experienced pathologists [201]. These and other weaknesses were also found in the articles examined in this review. A lack of homogeneity was found in the choice of datasets used for training and testing the proposed approaches, as well as the absence of a validation phase with datasets other than the training datasets and of expert validation conducted by pathologists. Furthermore, the NIH dataset has been questioned by some authors. Since this dataset is the most frequently used public data source, it is worthwhile to check its reliability and, consequently, the credibility of the results obtained so far. Another limitation found in some of the articles analyzed is the lack of detailed information on the models developed and the pre-processing and segmentation techniques applied, a problem easily solved by making the source code of the algorithms public, which would increase the transparency of the processes and provide a starting point for further modifications in the future. Finally, we found a low rate of application of model explainability analysis, which is necessary to provide a clear and transparent picture to medical users and to increase their confidence in these new intelligent systems. Despite these weaknesses, overall, AI techniques to support medical diagnosis, and in particular malaria diagnosis, have attracted the interest of many researchers and have shown promising results. However, to introduce AI in the medical field, the ethical aspects of cybersecurity, protecting patients’ privacy, and ensuring fair access to AI-based healthcare must be considered in the future [202], in addition to filling the above-mentioned gaps.

6.6. General Considerations on This Review

Like any systematic review, this work may suffer from biases related to search strategies, inclusion and exclusion criteria, and the limited number of electronic databases consulted (see Section 2). For the purpose of this work, the search strategies, and in particular the keywords chosen and their combinations using the “AND” and “OR” operators, led to very satisfactory search results from both a quantitative and a qualitative point of view. With regard to the inclusion and exclusion criteria, language and time restrictions do not seem to hinder the identification of relevant articles, as we note that the number of publications on automated applications for malaria diagnosis showed an increase after 2016 (2 years later than the lower end of the time range considered), and, in addition, no articles written in languages other than English were identified. Furthermore, although we excluded articles focused on the diagnosis of diseases other than malaria, we still included all those articles of generic applications or focused on different diseases but tested on malaria datasets. The decision not to include articles that use AI approaches for tasks other than classification and object detection (i.e., only for segmentation) left some work out but did not affect the objective of reviewing recent research on CAD systems. Another aspect to be assessed when conducting a systematic literature review is the quality of the databases from which articles are selected. In particular, we wondered whether the arXiv database, which is not peer-reviewed, could undermine the credibility and reliability of the review results. To this end, we consulted the research of Gusenbauer et al. [203], who aimed to outline guidance on which search systems are suitable for systematic searches. Their study analyzed 28 academic search systems widely used by researchers, including those used in this systematic review (PubMed, Scopus, and arXiv), making their qualities and limitations transparent. As a result of this analysis, the electronic databases PubMed and Scopus turned out to be principal resources, while arXiv was labeled as a supplementary resource, i.e., a resource to be used “in addition to a principal resource for its specific qualities that could retrieve additional records and to further improve the evidence base”. This database is also classified as gray literature, which the authors consider useful to maximize the completeness of systematic reviews and mitigate publication bias. Based on these findings, we decided to include the arXiv database in this systematic review, also bearing in mind that only 8 articles were extracted from it, a small number compared to the total number of papers reviewed. Therefore, the only real limitation of this systematic review might be the limited number of databases considered (PubMed, Scopus, and arXiv). However, we believe that this limitation weighs less heavily than the highlighted strengths.

7. Conclusions

The current systematic review analyzes the literature evidence on the application of AI to the improvement of malaria diagnosis. Although the results of our review may be limited by the inclusion/exclusion criteria and the choice of electronic databases, its strength can be attributed to the transparency of the methodology used for the search, which makes the results reproducible and exhaustive in summarizing the evidence of AI approaches in malaria diagnosis. Our analysis showed that since 2014, there has been a progressive increase in the application of ML and DL algorithms to the diagnosis of malaria. In particular, most of the studies analyzed chose to rely on DL for the automatic analysis of blood smear images. Although these algorithms have demonstrated excellent performance in CV tasks, the implications regarding the large amount of data and high computational costs required to train the models must be considered. However, once the model is trained, it can be deployed on mobile or web applications and used quickly and easily even in underdeveloped areas. Interestingly, one promising AI approach emerged as potentially translatable into clinical use in the near future. Indeed, in agreement with what was highlighted by a very recent systematic review [204], our analysis showed that DL approaches have notably evolved in this field. In particular, we observed a progression from single CNNs (custom or pre-trained) to hybrid models combining DL models and classical ML approaches, up to more complex ensemble architectures that train multiple models and combine their predictions for the specific task. These latter models often outperform the classical approaches, but their performance strongly depends on several factors, including the ensemble system design, the choice of fusion method, and the high computational costs. The implementation of these approaches could be of great relevance in the future to enable early differential diagnosis of the malaria strain and its life cycle, thus providing effective guidance for treatments. Notably, despite significant research progress in this field, many challenges still need to be addressed before CAD systems can be safely and reliably introduced into clinical and diagnostic practice. Future research should focus on overcoming the problems outlined above, as well as on increased collaboration with medical experts in the field to also try to close the gaps that limit the diagnostic accuracy of intelligent systems.
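As a final concrete note on the fusion methods mentioned above, the sketch below shows one of the simplest and most common choices, weighted soft voting over the ensemble members’ predicted probabilities. The three score vectors are invented for illustration and do not come from any reviewed model.

```python
import numpy as np

def soft_vote(prob_list, weights=None):
    """Fuse per-model parasitized probabilities by (weighted) averaging.
    `prob_list` holds one array of shape (n_samples,) per ensemble member."""
    probs = np.stack(prob_list)                  # (n_models, n_samples)
    w = np.ones(len(prob_list)) if weights is None else np.asarray(weights, float)
    w = w / w.sum()                              # normalize member weights
    return np.tensordot(w, probs, axes=1)        # weighted mean per sample

# Illustrative outputs of three hypothetical CNN members on four cells:
p1 = np.array([0.91, 0.12, 0.55, 0.80])
p2 = np.array([0.85, 0.25, 0.40, 0.90])
p3 = np.array([0.95, 0.05, 0.70, 0.60])
fused = soft_vote([p1, p2, p3])
print((fused >= 0.5).astype(int))                # ensemble decision: [1 0 1 1]
```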

Author Contributions

Conceptualization, F.G.; methodology, F.G., P.S. and A.A.; formal analysis, F.G. and A.A.; data curation, F.G.; writing—original draft preparation, F.G.; writing—review and editing, F.G., P.S. and A.A.; supervision, F.F. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

Not applicable.

Acknowledgments

F.F. acknowledges the partial funding of the research activity through financial support from ICSC—Centro Nazionale di Ricerca in “High Performance Computing, Big Data and Quantum Computing”, funded by European Union—NextGenerationEU.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. The World Health Organization Website. Available online: https://www.who.int/publications/i/item/9789240086173 (accessed on 20 June 2024).
  2. Collins, W.E. Plasmodium knowlesi: A malaria parasite of monkeys and humans. Annu. Rev. Entomol. 2012, 57, 107–121. [Google Scholar] [CrossRef] [PubMed]
  3. Sato, S. Plasmodium—A brief introduction to the parasites causing human malaria and their basic biology. J. Physiol. Anthropol. 2021, 40, 1. [Google Scholar] [CrossRef] [PubMed]
  4. Tuteja, R. Malaria—An overview. FEBS J. 2007, 274, 4670–4679. [Google Scholar] [CrossRef] [PubMed]
  5. Paaijmans, K.P.; Thomas, M.B. Relevant temperatures in mosquito and malaria biology. In Ecology of Parasite-Vector Interactions; Wageningen Academic Publishers: Wageningen, The Netherlands, 2007; pp. 32–58. [Google Scholar]
  6. May, Z.; Aziz, S.S.A.M.; Salamat, R. Automated quantification and classification of malaria parasites in thin blood smears. In Proceedings of the 2013 IEEE International Conference on Signal and Image Processing Applications, Melaka, Malaysia, 8–10 October 2013; pp. 369–373. [Google Scholar]
  7. Cowman, A.F.; Berry, D.; Baum, J. The cellular and molecular basis for malaria parasite invasion of the human red blood cell. J. Cell Biol. 2012, 198, 961–971. [Google Scholar] [CrossRef] [PubMed]
  8. Makler, M.T.; Palmer, C.J.; Ager, A.L. A review of practical techniques for the diagnosis of malaria. Ann. Trop. Med. Parasitol. 1998, 92, 419–434. [Google Scholar] [PubMed]
  9. Dowling, M.A.C.; Shute, G.T. A comparative study of thick and thin blood films in the diagnosis of scanty malaria parasitaemia. Bull. World Health Organ. 1966, 34, 249. [Google Scholar] [PubMed]
  10. The World Health Organization Website. Available online: https://www.who.int/publications/i/item/9789241597531 (accessed on 20 June 2024).
  11. Quan, Q.; Wang, J.; Liu, L. An effective convolutional neural network for classifying red blood cells in malaria diseases. Interdiscip. Sci. 2020, 12, 217–225. [Google Scholar] [CrossRef]
  12. Arco, J.E.; Górriz, J.M.; Ramírez, J.; Álvarez, I.; Puntonet, C.G. Digital image analysis for automatic enumeration of malaria parasites using morphological operations. Expert Syst. Appl. 2015, 42, 3041–3047. [Google Scholar] [CrossRef]
  13. Ozsahin, I.; Sekeroglu, B.; Musa, M.; Mustapha, M.; Uzun Ozsahin, D. Review on Diagnosis of COVID-19 from Chest CT Images Using Artificial Intelligence. Comput. Math. Methods Med. 2020, 2020, 9756518. [Google Scholar] [CrossRef]
  14. Fatima, T.; Farid, M.S. Automatic detection of Plasmodium parasites from microscopic blood images. J. Parasit. Dis. 2020, 44, 69–78. [Google Scholar] [CrossRef]
  15. Das, D.K.; Ghosh, M.; Pal, M.; Maiti, A.K.; Chakraborty, C. Machine learning approach for automated screening of malaria parasite using light microscopic images. Micron 2013, 45, 97–106. [Google Scholar] [CrossRef] [PubMed]
  16. Di Ruberto, C.; Dempster, A.; Khan, S.; Jarra, B. Analysis of infected blood cell images using morphological operators. Image Vis. Comput. 2002, 20, 133–146. [Google Scholar] [CrossRef]
  17. Wang, J.; Perez, L. The effectiveness of data augmentation in image classification using deep learning. arXiv 2017, arXiv:1712.04621. [Google Scholar]
  18. Shorten, C.; Khoshgoftaar, T.M. A survey on image data augmentation for deep learning. J. Big Data 2019, 6, 1–48. [Google Scholar] [CrossRef]
  19. Abdul Nasir, A.S.; Mashor, M.Y.; Mohamed, Z. Segmentation based approach for detection of malaria parasites using moving k-means clustering. In Proceedings of the 2012 IEEE-EMBS Conference on Biomedical Engineering and Sciences, Langkawi, Malaysia, 17–19 December 2012; pp. 653–658. [Google Scholar]
  20. Page, M.J.; McKenzie, J.E.; Bossuyt, P.M.; Boutron, I.; Hoffmann, T.C.; Mulrow, C.D.; Shamseer, L.; Tetzlaff, J.M.; Akl, E.A.; Brennan, S.E.; et al. The PRISMA 2020 statement: An updated guideline for reporting systematic reviews. BMJ 2021, 372, n71. [Google Scholar] [CrossRef] [PubMed]
  21. Keele, S. Guidelines for Performing Systematic Literature Reviews in Software Engineering, Technical Report, version 2.3 ESBE Technical Report; ESBE; Keele University: Keele, UK, 2007. [Google Scholar]
  22. Nasser, M.; Yusof, U.K. Deep Learning Based Methods for Breast Cancer Diagnosis: A Systematic Review and Future Direction. Diagnostics 2023, 13, 161. [Google Scholar] [CrossRef] [PubMed]
  23. Kotsiantis, S.B. Decision trees: A recent overview. Artif. Intell. Rev. 2013, 39, 261–283. [Google Scholar] [CrossRef]
  24. Boser, B.; Guyon, I.; Vapnik, V. A training algorithm for optimal margin classifiers. In Proceedings of the Fifth Annual Workshop on Computational Learning Theory, Pittsburgh, PA, USA, 27–29 July 1992; pp. 144–152. [Google Scholar]
  25. Altman, N. An introduction to kernel and nearest-neighbor nonparametric regression. Am. Stat. 1992, 46, 175–185. [Google Scholar] [CrossRef]
  26. Jain, A.K.; Mao, J.; Mohiuddin, K.M. Artificial neural networks: A tutorial. Computer 1996, 29, 31–44. [Google Scholar] [CrossRef]
  27. Albawi, S.; Mohammed, T.A.; Al-Zawi, S. Understanding of a convolutional neural network. In Proceedings of the 2017 International Conference on Engineering and Technology (ICET), Antalya, Turkey, 21–23 August 2017; pp. 1–6. [Google Scholar]
  28. Bank, D.; Koenigstein, N.; Giryes, R. Autoencoders. arXiv 2021, arXiv:2003.05991v2. [Google Scholar]
  29. Pan, Z.; Yu, W.; Yi, X.; Khan, A.; Yuan, F.; Zheng, Y. Recent progress on generative adversarial networks (GANs): A survey. IEEE Access 2019, 7, 36322–36333. [Google Scholar] [CrossRef]
  30. Alzu’bi, A.; Albalas, F.; Al-Hadhrami, T.; Younis, L.B.; Bashayreh, A. Masked face recognition using deep learning: A review. Electronics 2021, 10, 2666. [Google Scholar] [CrossRef]
  31. Zhou, T.; Ye, X.; Lu, H.; Zheng, X.; Qiu, S.; Liu, Y. Dense convolutional network and its application in medical image analysis. Biomed. Res. Int. 2022, 2022, 2384830. [Google Scholar] [CrossRef] [PubMed]
  32. Krichen, M. Convolutional neural networks: A survey. Computers 2023, 12, 151. [Google Scholar] [CrossRef]
  33. Hossin, M.; Sulaiman, M.N. A review on evaluation metrics for data classification evaluations. Int. J. Data Min. Knowl. Manag. Process 2015, 5, 1. [Google Scholar]
  34. Bradley, A.P. The use of the area under the ROC curve in the evaluation of machine learning algorithms. Pattern Recognit. 1997, 30, 1145–1159. [Google Scholar] [CrossRef]
  35. Yadav, S.; Shukla, S. Analysis of k-fold cross-validation over hold-out validation on colossal datasets for quality classification. In Proceedings of the 2016 IEEE 6th International Conference on Advanced Computing (IACC), Bhimavaram, India, 27–28 February 2016; pp. 78–83. [Google Scholar]
  36. Cabitza, F.; Campagner, A.; Soares, F.; de Guadiana-Romualdo, L.G.; Challa, F.; Sulejmani, A.; Seghezzi, M.; Carobene, A. The importance of being external. methodological insights for the external validation of machine learning models in medicine. Comput. Methods Programs Biomed. 2021, 208, 106288. [Google Scholar] [CrossRef] [PubMed]
  37. König, I.R.; Malley, J.D.; Weimar, C.; Diener, H.C.; Ziegler, A. Practical experiences on the necessity of external validation. Stat Med. 2007, 26, 5499–5511. [Google Scholar] [CrossRef] [PubMed]
  38. The National Library of Medicine Website. Available online: https://lhncbc.nlm.nih.gov/LHC-research/LHC-projects/image-processing/malaria-datasheet.html (accessed on 20 June 2024).
  39. Kaggle Website. Available online: https://www.kaggle.com/datasets/iarunava/cell-images-for-detecting-malaria (accessed on 20 June 2024).
  40. Linder, N.; Turkki, R.; Walliander, M.; Mårtensson, A.; Diwan, V.; Rahtu, E.; Pietikäinen, M.; Lundin, M.; Lundin, J. A malaria diagnostic tool based on computer vision screening and visualization of plasmodium falciparum candidate areas in digitized blood smears. PLoS ONE 2014, 9, e104855. [Google Scholar] [CrossRef]
  41. Loddo, A.; Di Ruberto, C.; Kocher, M.; Prod’Hom, G. MP-IDB: The Malaria Parasite Image Database for Image Processing and Analysis. In Proceedings of the Processing and Analysis of Biomedical Information—First International SIPAIM Workshop, SaMBa 2018, Held in Conjunction with MICCAI 2018, Granada, Spain, 20 September 2018; Revised Selected Papers. Leporé, N., Brieva, J., Romero, E., Racoceanu, D., Joskowicz, L., Eds.; Springer: Berlin/Heidelberg, Germany, 2018; Volume 11379, pp. 57–65. [Google Scholar]
  42. Broad Bioimage Benchmark Collection Website. Available online: https://data.broadinstitute.org/bbbc/BBBC041/ (accessed on 20 June 2024).
  43. Zhao, O.S.; Kolluri, N.; Anand, A.; Chu, N.; Bhavaraju, R.; Ojha, A.; Tiku, S.; Nguyen, D.; Chen, R.; Morales, A.; et al. Convolutional neural networks to automate the screening of malaria in low-resource countries. PeerJ 2020, 8, e9674. [Google Scholar] [CrossRef]
  44. Liu, W.; Anguelov, D.; Erhan, D.; Szegedy, C.; Reed, S.; Fu, C.Y.; Berg, A.C. SSD: Single Shot MultiBox Detector. In Proceedings of the Computer Vision—ECCV 2016: 14th European Conference, Amsterdam, The Netherlands, 11–14 October 2016; Proceedings, Part I 14. Springer International Publishing: Berlin/Heidelberg, Germany, 2016; pp. 21–37. [Google Scholar]
  45. Sriporn, K.; Tsai, C.F.; Tsai, C.E.; Wang, P. Analyzing malaria disease using effective deep learning approach. Diagnostics 2020, 10, 744. [Google Scholar] [CrossRef] [PubMed]
  46. Misra, D. Mish: A Self Regularized Non-Monotonic Neural Activation Function. arXiv 2019, arXiv:1908.08681. [Google Scholar]
  47. Kc, K.; Yin, Z.; Wu, M.; Wu, Z. Depthwise separable convolution architectures for plant disease classification. Comput. Electron. Agric. 2019, 165, 104948. [Google Scholar] [CrossRef]
  48. Kassim, Y.M.; Yang, F.; Yu, H.; Maude, R.J.; Jaeger, S. Diagnosing malaria patients with plasmodium falciparum and vivax using deep learning for thick smear images. Diagnostics 2021, 11, 1994. [Google Scholar] [CrossRef] [PubMed]
  49. Sangameswaran, R. MAIScope: A low-cost portable microscope with built-in vision AI to automate microscopic diagnosis of diseases in remote rural settings. arXiv 2022, arXiv:2208.06114. [Google Scholar]
  50. Alnussairi, M.H.D.; İbrahim, A.A. Malaria parasite detection using deep learning algorithms based on (CNNs) technique. Comput. Electr. Eng. 2022, 103, 108316. [Google Scholar] [CrossRef]
  51. Sinha, S.; Gupta, N. Computer-aided Diagnosis of Malaria through Transfer Learning using the ResNet50 Backbone. arXiv 2023, arXiv:2304.02925. [Google Scholar]
  52. Kundu, T.K.; Anguraj, D.K.; Sudha, S.V. Modeling a Novel Hyper-Parameter Tuned Deep Learning Enabled Malaria Parasite Detection and Classification. Comput. Mater. Contin. 2023, 77, 3289–3304. [Google Scholar] [CrossRef]
  53. Zhang, H.; Cisse, M.; Dauphin, Y.N. Mixup: Beyond Empirical Risk Minimization. arXiv 2017, arXiv:1710.09412. [Google Scholar]
  54. Yun, S.; Han, D.; Oh, S.J.; Chun, S.; Choe, J.; Yoo, Y. CutMix: Regularization Strategy to Train Strong Classifiers with Localizable Features. In Proceedings of the IEEE/CVF International Conference on Computer Vision, Seoul, Republic of Korea, 27 October–2 November 2019; pp. 6023–6032. [Google Scholar]
  55. Rajaraman, S.; Antani, S.K.; Poostchi, M.; Silamut, K.; Hossain, M.A.; Maude, R.J.; Jaeger, S.; Thoma, G.R. Pre-trained convolutional neural networks as feature extractors toward improved malaria parasite detection in thin blood smear images. PeerJ 2018, 6, e4568. [Google Scholar] [CrossRef]
  56. Rahman, A.; Zunair, H.; Rahman, M.S.; Yuki, J.Q.; Biswas, S.; Alam, M.A.; Alam, N.B.; Mahdy, M.R.C. Improving malaria parasite detection from red blood cell using deep convolutional neural networks. arXiv 2019, arXiv:1907.10418. [Google Scholar]
  57. Jameela, T.; Athota, K.; Singh, N.; Gunjan, V.K.; Kahali, S. Deep learning and transfer learning for malaria detection. Comput. Intell. Neurosci. 2022, 2022, 2221728. [Google Scholar] [CrossRef] [PubMed]
  58. Loddo, A.; Fadda, C.; Di Ruberto, C. An empirical evaluation of convolutional networks for malaria diagnosis. J. Imaging 2022, 8, 66. [Google Scholar] [CrossRef] [PubMed]
  59. Turuk, M.; Sreemathy, R.; Kadiyala, S.; Kotecha, S.; Kulkarni, V. CNN Based Deep Learning Approach for Automatic Malaria Parasite Detection. IAENG Int. J. Comput. Sci. 2022, 49, 745–753. [Google Scholar]
  60. Qadir, A.M.; Abdalla, P.A.; Ghareb, M.I. Malaria Parasite Identification from Red Blood Cell Images Using Transfer Learning Models. Passer J. Basic Appl. Sci. 2022, 4, 63–79. [Google Scholar] [CrossRef]
  61. Hassan, E.; Shams, M.Y.; Hikal, N.A.; Elmougy, S. A novel convolutional neural network model for malaria cell images classification. Comput. Mater. Contin. 2022, 72, 5889–5907. [Google Scholar] [CrossRef]
  62. Shewajo, F.A.; Fante, K.A. Tile-based microscopic image processing for malaria screening using a deep learning approach. BMC Med. Inform. 2023, 23, 39. [Google Scholar] [CrossRef]
  63. Hemachandran, K.; Alasiry, A.; Marzougui, M.; Ganie, S.M.; Pise, A.A.; Alouane, M.T.H.; Chola, C. Performance analysis of deep learning algorithms in diagnosis of malaria disease. Diagnostics 2023, 13, 534. [Google Scholar] [CrossRef] [PubMed]
  64. Dath, M.K.; Nazir, N.; Dhankhar, A.; Solanki, K.; Dahiya, O. Malarial Diagnosis with Deep Learning and Image Processing Approaches. Int. J. Recent Innov. Trends Comput. 2023, 11, 210–222. [Google Scholar] [CrossRef]
  65. Quinn, J.A.; Andama, A.; Munabi, I.; Kiwanuka, F. Automated blood smear analysis for mobile malaria diagnosis. Mob. Point Care Monit. Diagn. Device Des. 2014, 31, 115. [Google Scholar]
  66. Nakasi, R.; Mwebaze, E.; Zawedde, A. Mobile-aware deep learning algorithms for malaria parasites and white blood cells localization in thick blood smears. Algorithms 2021, 14, 17. [Google Scholar] [CrossRef]
  67. Sarkar, S.; Sharma, R.; Shah, K. Malaria detection from RBC images using shallow Convolutional Neural Networks. arXiv 2020, arXiv:2010.11521. [Google Scholar]
  68. Masud, M.; Alhumyani, H.; Alshamrani, S.S.; Cheikhrouhou, O.; Ibrahim, S.; Muhammad, G.; Shamim Hossain, M.; Shorfuzzaman, M. Leveraging deep learning techniques for malaria parasite detection using mobile application. Wirel. Commun. Mob. Comput. 2020, 2020, 1–15. [Google Scholar] [CrossRef]
  69. Umer, M.; Sadiq, S.; Ahmad, M.; Ullah, S.; Choi, G.S.; Mehmood, A. A novel stacked CNN for malarial parasite detection in thin blood smear images. IEEE Access 2020, 8, 93782–93792. [Google Scholar] [CrossRef]
  70. Alqudah, A.; Alqudah, A.M.; Qazan, S. Lightweight Deep Learning for Malaria Parasite Detection Using Cell-Image of Blood Smear Images. Rev. d’Intelligence Artif. 2020, 34, 571–576. [Google Scholar] [CrossRef]
  71. Malhotra, R.; Joshi, D.; Shin, K.Y. Approaching Bio Cellular Classification for Malaria Infected Cells Using Machine Learning and then Deep Learning to compare & analyze K-Nearest neighbors and Deep CNNs. arXiv 2020, arXiv:2005.11417. [Google Scholar]
  72. Delgado-Ortet, M.; Molina, A.; Alférez, S.; Rodellar, J.; Merino, A. A deep learning approach for segmentation of red blood cell images and malaria detection. Entropy 2020, 22, 657. [Google Scholar] [CrossRef] [PubMed]
  73. Irmak, E. A novel implementation of deep-learning approach on malaria parasite detection from thin blood cell images. Electrica 2021, 21, 216–224. [Google Scholar] [CrossRef]
  74. Maqsood, A.; Farid, M.S.; Khan, M.H.; Grzegorzek, M. Deep malaria parasite detection in thin blood smear microscopic images. Appl. Sci. 2021, 11, 2284. [Google Scholar] [CrossRef]
  75. Magotra, V.; Rohil, M.K. Malaria diagnosis using a lightweight deep convolutional neural network. Int. J. Telemed. Appl. 2022, 2022, 4176982. [Google Scholar] [CrossRef]
  76. Uzun Ozsahin, D.; Mustapha, M.T.; Bartholomew Duwa, B.; Ozsahin, I. Evaluating the performance of deep learning frameworks for malaria parasite detection using microscopic images of peripheral blood smears. Diagnostics 2022, 12, 2702. [Google Scholar] [CrossRef] [PubMed]
  77. Hcini, G.; Jdey, I.; Ltifi, H. Improving Malaria Detection Using L1 Regularization Neural Network. J. Univ. Comput. Sci. 2022, 285, 1087–1107. [Google Scholar] [CrossRef]
  78. Cho, Y.S.; Hong, P.C. Applying Machine Learning to Healthcare Operations Management: CNN-Based Model for Malaria Diagnosis. Healthcare 2023, 11, 1779. [Google Scholar] [CrossRef] [PubMed]
  79. Yebasse, M.; Cheoi, K.J.; Ko, J. Malaria Disease Cell Classification With Highlighting Small Infected Regions. IEEE Access 2023, 11, 15945–15953. [Google Scholar] [CrossRef]
  80. Shambhu, S.; Koundal, D.; Das, P. Deep learning-based computer assisted detection techniques for malaria parasite using blood smear images. Int. J. Adv. Technol. Eng. Explor. 2021, 10, 990. [Google Scholar]
  81. Kashtriya, V.; Doegar, A.; Gupta, V.; Kashtriya, P. Identifying malaria infection in red blood cells using optimized stepincrease convolutional neural network model. Int. J. Innovative Technol. Exploring Eng. 2019, 8, 813–818. [Google Scholar]
  82. Mahmud, T.; Fattah, S.A. Automatic Diagnosis of Malaria from Thin Blood Smear Images using Deep Convolutional Neural Network with Multi-Resolution Feature Fusion. arXiv 2020, arXiv:2012.05350. [Google Scholar]
  83. Banerjee, T.; Jain, A.; Sethuraman, S.C.; Satapathy, S.C.; Karthikeyan, S.; Jubilson, A. Deep Convolutional Neural Network (Falcon) and transfer learning-based approach to detect malarial parasite. Multimed. Tools Appl. 2022, 81, 13237–13251. [Google Scholar] [CrossRef]
  84. Oyewola, D.O.; Dada, E.G.; Misra, S.; Damaševičius, R. A novel data augmentation convolutional neural network for detecting malaria parasite in blood smear images. Appl. Artif. Intell. 2022, 36, 2033473. [Google Scholar] [CrossRef]
  85. Kumar, A.; Singh, S.B.; Satapathy, S.C.; Rout, M. MOSQUITO-NET: A deep learning based CADx system for malaria diagnosis along with model interpretation using GradCam and class activation maps. Expert Syst. 2022, 39, e12695. [Google Scholar] [CrossRef]
  86. Asif, S.; Khan, S.U.R.; Zheng, X.; Zhao, M. MozzieNet: A deep learning approach to efficiently detect malaria parasites in blood smear images. Int. J. Imaging Syst. Technol. 2024, 34, e22953. [Google Scholar] [CrossRef]
  87. Alaiad, A.; Migdady, A.; Al-Khatib, R.E.M.; Alzoubi, O.; Zitar, R.A.; Abualigah, L. Autokeras Approach: A Robust Automated Deep Learning Network for Diagnosis Disease Cases in Medical Images. J. Imaging 2023, 9, 64. [Google Scholar] [CrossRef]
  88. Khan, S.H.; Shah, N.S.; Nuzhat, R.; Majid, A.; Alquhayz, H.; Khan, A. Malaria parasite classification framework using a novel channel squeezed and boosted CNN. Microscopy 2022, 71, 271–282. [Google Scholar] [CrossRef]
  89. Islam, M.R.; Nahiduzzaman, M.; Goni, M.O.F.; Sayeed, A.; Anower, M.S.; Ahsan, M.; Haider, J. Explainable transformer-based deep learning model for the detection of malaria parasites from blood cell images. Sensors 2022, 22, 4358. [Google Scholar] [CrossRef] [PubMed]
  90. Alonso-Ramírez, A.A.; García-Capulín, C.H.; Rostro-González, H.; Prado-Olivarez, J.; Gutiérrez-López, M.; Barranco-Gutiérrez, A.I. Classifying parasitized and uninfected malaria red blood cells using convolutional-recurrent neural networks. IEEE Access 2022, 10, 97348–97359. [Google Scholar] [CrossRef]
  91. Amin, I.; Hassan, S.; Belhaouari, S.B.; Azam, M.H. Transfer Learning-Based Semi-Supervised Generative Adversarial Network for Malaria Classification. Comput. Mater. Contin. 2023, 74, 6335–6349. [Google Scholar] [CrossRef]
  92. Fuhad, K.F.; Tuba, J.F.; Sarker, M.R.A.; Momen, S.; Mohammed, N.; Rahman, T. Deep learning based automatic malaria parasite detection from blood smear and its smartphone based application. Diagnostics 2020, 10, 329. [Google Scholar] [CrossRef]
  93. Huq, A.; Reza, M.T.; Hossain, S.; Dipto, S.M. AnoMalNet: Outlier Detection based Malaria Cell Image Classification Method Leveraging Deep Autoencoder. arXiv 2023, arXiv:2303.05789. [Google Scholar] [CrossRef]
  94. Tek, F.B.; Dempster, A.G.; Kale, I. Computer vision for microscopy diagnosis of malaria. Malar. J. 2009, 8, 153. [Google Scholar] [CrossRef]
  95. Diker, A. An efficient model of residual based convolutional neural network with Bayesian optimization for the classification of malarial cell images. Comput. Biol. Med. 2022, 148, 105635. [Google Scholar] [CrossRef]
  96. Raihan, M.J.; Nahid, A.A. Malaria cell image classification by explainable artificial intelligence. Health Technol. 2022, 12, 47–58. [Google Scholar] [CrossRef]
  97. Li, D.; Ma, Z. Residual attention learning network and SVM for malaria parasite detection. Multimed. Tools Appl. 2022, 81, 10935–10960. [Google Scholar] [CrossRef]
  98. Jones, C.B.; Murugamani, C. Malaria parasite detection on microscopic blood smear images with integrated deep learning algorithms. Int. Arab J. Inf. Technol. 2023, 20, 170–179. [Google Scholar] [CrossRef] [PubMed]
  99. Goni, M.O.F.; Mondal, M.N.I.; Islam, S.R.; Nahiduzzaman, M.; Islam, M.R.; Anower, M.S.; Kwak, K.S. Diagnosis of Malaria Using Double Hidden Layer Extreme Learning Machine Algorithm with CNN Feature Extraction and Parasite Inflator. IEEE Access 2023, 11, 4117–4130. [Google Scholar] [CrossRef]
  100. Abubakar, A.; Ajuji, M.; Yahya, I.U. DeepFMD: Computational analysis for malaria detection in blood-smear images using deep-learning features. Appl. Syst. Innov. 2021, 4, 82. [Google Scholar] [CrossRef]
  101. Dutta, A.K.; Mageswari, R.U.; Gayathri, A.; Dallfin Bruxella, J.M.; Ishak, M.K.; Mostafa, S.M.; Hamam, H. Barnacles mating optimizer with deep transfer learning enabled biomedical malaria parasite detection and classification. Comput. Intell. Neurosci. 2022, 2022, 7776319. [Google Scholar] [CrossRef] [PubMed]
  102. Alassaf, A.; Sikkandar, M.Y. Intelligent Deep Transfer Learning Based Malaria Parasite Detection and Classification Model Using Biomedical Image. Comput. Mater. Contin. 2022, 72, 5273–5285. [Google Scholar] [CrossRef]
  103. Imran, T.; Sharif, M.; Tariq, U.; Zhang, Y.-D.; Nam, Y.; Nam, Y.; Kang, B.-G. Malaria Blood Smear Classification Using Deep Learning and Best Features Selection. Comput. Mater. Contin. 2020, 70, 1875–1891. [Google Scholar] [CrossRef]
  104. Madhu, G.; Mohamed, A.W.; Kautish, S.; Shah, M.A.; Ali, I. Intelligent diagnostic model for malaria parasite detection and classification using imperative inception-based capsule neural networks. Sci. Rep. 2023, 13, 13377. [Google Scholar] [CrossRef]
  105. Qadri, A.M.; Raza, A.; Eid, F.; Abualigah, L. A novel transfer learning-based model for diagnosing malaria from parasitized and uninfected red blood cell images. Decis. Anal. 2023, 9, 100352. [Google Scholar] [CrossRef]
  106. Amin, J.; Anjum, M.A.; Ahmad, A.; Sharif, M.I.; Kadry, S.; Kim, J. Microscopic parasite malaria classification using best feature selection based on generalized normal distribution optimization. PeerJ Comput. Sci. 2024, 10, e1744. [Google Scholar] [CrossRef] [PubMed]
  107. Murmu, A.; Kumar, P. DLRFNet: Deep learning with random forest network for classification and detection of malaria parasite in blood smear. Multimed. Tools Appl. 2024, 83, 63593–63615. [Google Scholar] [CrossRef]
  108. Marques, G.; Ferreras, A.; de la Torre-Diez, I. An ensemble-based approach for automated medical diagnosis of malaria using EfficientNet. Multimed. Tools Appl. 2022, 81, 28061–28078. [Google Scholar] [CrossRef] [PubMed]
  109. Zhu, Z.; Wang, S.; Zhang, Y. ROENet: A ResNet-based output ensemble for malaria parasite classification. Electronics 2022, 11, 2040. [Google Scholar] [CrossRef] [PubMed]
  110. Rajaraman, S.; Jaeger, S.; Antani, S.K. Performance evaluation of deep neural ensembles toward malaria parasite detection in thin-blood smear images. PeerJ 2019, 7, e6977. [Google Scholar] [CrossRef] [PubMed]
  111. Bhuiyan, M.; Islam, M.S. A new ensemble learning approach to detect malaria from microscopic red blood cell images. Sens. Int. 2023, 4, 100209. [Google Scholar] [CrossRef]
  112. Nayak, S.R.; Nayak, J.; Vimal, S.; Arora, V.; Sinha, U. An ensemble artificial intelligence-enabled MIoT for automated diagnosis of malaria parasite. Expert Syst. 2022, 39, e12906. [Google Scholar] [CrossRef]
  113. Hossain, M.M.; Rahim, M.A.; Bahar, A.N.; Rahman, M.M. Automatic malaria disease detection from blood cell images using the variational quantum circuit. Inform. Med. Unlock. 2021, 26, 100743. [Google Scholar] [CrossRef]
  114. Kundu, T.K.; Anguraj, D.K. Optimal Machine Learning Based Automated Malaria Parasite Detection and Classification Model Using Blood Smear Images. Traitement du Signal 2023, 40, 91–99. [Google Scholar] [CrossRef]
  115. Phumkuea, T.; Nilvisut, P.; Wongsirichot, T.; Damkliang, K. A New Computer-Aided Diagnosis of Precise Malaria Parasite Detection in Microscopic Images Using a Decision Tree Model with Selective Optimal Features. Biomed. Eng. Appl. Basis Commun. 2023, 35, 2350004. [Google Scholar] [CrossRef]
  116. Telang, H.; Sonawane, K. COVID-19 and Malaria Parasite Detection and Classification by Bins Approach with Statistical Moments Using Machine Learning. Int. J. Image Graph. Signal Process. 2023, 3, 1–13. [Google Scholar] [CrossRef]
  117. Yu, H.; Yang, F.; Rajaraman, S.; Ersoy, I.; Moallem, G.; Poostchi, M.; Palaniappan, K.; Antani, S.; Maude, R.J.; Jaeger, S. Malaria Screener: A smartphone application for automated malaria screening. BMC Infect. Dis. 2020, 20, 825. [Google Scholar] [CrossRef] [PubMed]
  118. Yang, F.; Poostchi, M.; Yu, H.; Zhou, Z.; Silamut, K.; Yu, J.; Maude, R.J.; Jaeger, S.; Antani, S. Deep learning for smartphone-based malaria parasite detection in thick blood smears. IEEE J. Biomed. Health Inform. 2020, 24, 1427–1438. [Google Scholar] [CrossRef] [PubMed]
  119. Hung, J.; Carpenter, A. Applying faster R-CNN for object detection on malaria images. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, Honolulu, HI, USA, 21–26 July 2017; pp. 56–61. [Google Scholar]
  120. Gopakumar, G.P.; Swetha, M.; Sai Siva, G.; Sai Subrahmanyam, G.R.K. Convolutional neural network-based malaria diagnosis from focus stack of blood smear images acquired using custom-built slide scanner. J. Biophotonics 2018, 11, e201700003. [Google Scholar] [CrossRef] [PubMed]
  121. Jagtap, D.C.; Rani, N.U. Cuckoo Search based Ensemble Classifier for Predictive Analysis of Malaria Infection Scope on thin Blood Smears. Indian J. Public Health Res. Develop. 2019, 10, 1019–1031. [Google Scholar] [CrossRef]
  122. Pattanaik, P.A.; Mittal, M.; Khan, M.Z. Unsupervised deep learning cad scheme for the detection of malaria in blood smear microscopic images. IEEE Access 2020, 8, 94936–94946. [Google Scholar] [CrossRef]
  123. Pattanaik, P.A.; Swarnkar, T.; Swain, D. Deep filter bridge for malaria identification and classification in microscopic blood smear images. Int. J. Adv. Intell. Paradig. 2021, 20, 126–137. [Google Scholar] [CrossRef]
  124. Pattanaik, P.A.; Mittal, M.; Khan, M.Z.; Panda, S.N. Malaria detection using deep residual networks with mobile microscopy. J. King Saud-Univ.-Comput. Inf. Sci. 2022, 34, 1700–1705. [Google Scholar] [CrossRef]
  125. Manescu, P.; Neary-Zajiczek, L.; Shaw, M.J.; Elmi, M.; Claveau, R.; Pawar, V.; John Shawe-Taylor, J.; Kokkinos, I.; Srinivasan, M.A.; Lagunju, I.; et al. Deep learning enhanced extended depth-of-field for thick blood-film malaria high-throughput microscopy. arXiv 2019, arXiv:1906.07496. [Google Scholar]
  126. Manescu, P.; Shaw, M.J.; Elmi, M.; Neary-Zajiczek, L.; Claveau, R.; Pawar, V.; Kokkinos, I.; Oyinloye, G.; Bendkowski, C.; Oladejo, O.A.; et al. Expert-level automated malaria diagnosis on routine blood films with deep neural networks. Am. J. Hematol. 2020, 95, 883–891. [Google Scholar] [CrossRef]
  127. Holmström, O.; Stenman, S.; Suutala, A.; Moilanen, H.; Kücükel, H.; Ngasala, B.; Mårtensson, A.; Mhamilawa, L.; Aydin-Schmidt, B.; Lundin, M.; et al. A novel deep learning-based point-of-care diagnostic method for detecting Plasmodium falciparum with fluorescence digital microscopy. PLoS ONE 2020, 15, e0242355. [Google Scholar] [CrossRef] [PubMed]
  128. Vijayalakshmi, A. Deep learning approach to detect malaria from microscopic images. Multimed. Tools Appl. 2020, 79, 15297–15317. [Google Scholar] [CrossRef]
  129. Pardede, J.; Dewi, I.A.; Fadilah, R.; Triyani, Y. Automated malaria diagnosis using object detection retina-net based on thin blood smear image. J. Theor. Appl. Inf. Technol. 2020, 98, 757–767. [Google Scholar]
  130. Nakasi, R.; Mwebaze, E.; Zawedde, A.; Tusubira, J.; Akera, B.; Maiga, G. A new approach for microscopic diagnosis of malaria parasites in thick blood smears using pre-trained deep learning models. SN Appl. Sci. 2020, 2, 1255. [Google Scholar] [CrossRef]
  131. Chowdhury, A.B.; Roberson, J.; Hukkoo, A.; Bodapati, S.; Cappelleri, D.J. Automated complete blood cell count and malaria pathogen detection using convolution neural network. IEEE Robot. Autom. Lett. 2020, 5, 1047–1054. [Google Scholar] [CrossRef]
  132. Lebel, P.; Dial, R.; Vemuri, V.N.; Garcia, V.; DeRisi, J.; Gómez-Sjöberg, R. Label-free imaging and classification of live P. falciparum enables high performance parasitemia quantification without fixation or staining. PLoS Comput. Biol. 2021, 17, e1009257. [Google Scholar] [CrossRef]
  133. Loh, D.R.; Yong, W.X.; Yapeter, J.; Subburaj, K.; Chandramohanadas, R. A deep learning approach to the screening of malaria infection: Automated and rapid cell counting, object detection and instance segmentation using Mask R-CNN. Comput. Med Imaging Graph. 2021, 88, 101845. [Google Scholar] [CrossRef]
  134. Li, S.; Du, Z.; Meng, X.; Zhang, Y. Multi-stage malaria parasite recognition by deep learning. GigaScience 2021, 10, giab040. [Google Scholar] [CrossRef] [PubMed]
  135. Molina, A.; Rodellar, J.; Boldú, L.; Acevedo, A.; Alférez, S.; Merino, A. Automatic identification of malaria and other red blood cell inclusions using convolutional neural networks. Comput. Biol. Med. 2021, 136, 104680. [Google Scholar] [CrossRef]
  136. Davidson, M.S.; Andradi-Brown, C.; Yahiya, S.; Chmielewski, J.; O’Donnell, A.J.; Gurung, P.; Jeninga, M.D.; Prommana, P.; Andrew, D.W.; Petter, M.; et al. Automated detection and staging of malaria parasites from cytological smears using convolutional neural networks. Biol. Imaging 2021, 1, e2. [Google Scholar] [CrossRef]
  137. Ufuktepe, D.K.; Yang, F.; Kassim, Y.M.; Yu, H.; Maude, R.J.; Palaniappan, K.; Jaeger, S. Deep learning-based cell detection and extraction in thin blood smears for malaria diagnosis. In Proceedings of the 2021 IEEE Applied Imagery Pattern Recognition Workshop (AIPR), Washington, DC, USA, 12–14 October 2021; pp. 1–6. [Google Scholar]
  138. Abdurahman, F.; Fante, K.A.; Aliy, M. Malaria parasite detection in thick blood smear microscopic images using modified YOLOV3 and YOLOV4 models. BMC Bioinform. 2021, 22, 112. [Google Scholar] [CrossRef] [PubMed]
  139. Yang, Z.; Benhabiles, H.; Hammoudi, K.; Windal, F.; He, R.; Collard, D. A generalized deep learning-based framework for assistance to the human malaria diagnosis from microscopic images. Neural Comput. Appl. 2022, 34, 14223–14238. [Google Scholar] [CrossRef]
  140. Jabbar, M.A.; Radhi, A.M. Diagnosis of malaria infected blood cell digital images using deep convolutional neural networks. Iraq. J. Sci. 2022, 63, 380–396. [Google Scholar] [CrossRef]
  141. Preißinger, K.; Kellermayer, M.; Vértessy, B.G.; Kézsmárki, I.; Török, J. Reducing data dimension boosts neural network-based stage-specific malaria detection. Sci. Rep. 2022, 12, 16389. [Google Scholar] [CrossRef]
  142. Sengar, N.; Burget, R.; Dutta, M.K. A vision transformer based approach for analysis of plasmodium vivax life cycle for malaria prediction using thin blood smear microscopic images. Comput. Methods Programs Biomed. 2022, 224, 106996. [Google Scholar] [CrossRef]
  143. Arshad, Q.A.; Ali, M.; Hassan, S.; Chen, C.; Imran, A.; Rasul, G.; Sultani, W. A dataset and benchmark for malaria life-cycle classification in thin blood smear images. Neural Comput. Appl. 2022, 34, 4473–4485. [Google Scholar] [CrossRef]
  144. Koirala, A.; Jha, M.; Bodapati, S.; Mishra, A.; Chetty, G.; Sahu, P.K.; Mohanty, S.; Padhan, T.K.; Mattoo, J.; Hukkoo, A. Deep Learning for Real-Time Malaria Parasite Detection and Counting Using YOLO-mp. IEEE Access 2022, 10, 102157–102172. [Google Scholar] [CrossRef]
  145. Acherar, A.; Tantaoui, I.; Thellier, M.; Lampros, A.; Piarroux, R.; Tannier, X. Real-life evaluation of deep learning models trained on two datasets for Plasmodium falciparum detection with thin blood smear images at 500x magnification. Inform. Med. Unlock. 2022, 35, 101132. [Google Scholar] [CrossRef]
  146. Meng, X.; Ha, Y.; Tian, J. Neighbor Correlated Graph Convolutional Network for multi-stage malaria parasite recognition. Multimed. Tools Appl. 2022, 81, 11393–11414. [Google Scholar] [CrossRef]
  147. Sukumarran, D.; Hasikin, K.; Mohd Khairuddin, A.S.; Ngui, R.; Wan Sulaiman, W.Y.; Vythilingam, I.; Divis, P.C.S. An automated malaria cells detection from thin blood smear images using deep learning. Trop. Biomed. 2023, 40, 208–219. [Google Scholar]
148. Preißinger, K.; Kézsmárki, I.; Török, J. An automated neural network-based stage-specific malaria detection software using dimension reduction: The malaria microscopy classifier. MethodsX 2023, 10, 102189. [Google Scholar] [CrossRef] [PubMed]
  149. Liu, R.; Liu, T.; Dan, T.; Yang, S.; Li, Y.; Luo, B.; Zhuang, Y.; Fan, X.; Zhang, X.; Cai, H.; et al. AIDMAN: An AI-based object detection system for malaria diagnosis from smartphone thin-blood-smear images. Patterns 2023, 4, 100806. [Google Scholar] [CrossRef] [PubMed]
  150. Wang, G.; Luo, G.; Lian, H.; Chen, L.; Wu, W.; Liu, H. Application of deep learning in clinical settings for detecting and classifying malaria parasites in thin blood smears. Open Forum Infect. Dis. 2023, 10, ofad469. [Google Scholar] [CrossRef] [PubMed]
  151. Fasihfar, Z.; Rokhsati, H.; Sadeghsalehi, H.; Ghaderzadeh, M.; Gheisari, M. AI-Driven Malaria Diagnosis: Developing a Robust Model for Accurate Detection and Classification of Malaria Parasites. Iran. J. Blood Cancer 2023, 15, 112–124. [Google Scholar] [CrossRef]
  152. Fu, M.; Wu, K.; Li, Y.; Luo, L.; Huang, W.; Zhang, Q. An intelligent detection method for plasmodium based on self-supervised learning and attention mechanism. Front. Med. 2023, 10, 1117192. [Google Scholar] [CrossRef] [PubMed]
  153. Zedda, L.; Loddo, A.; Di Ruberto, C. YOLO-PAM: Parasite-Attention-Based Model for Efficient Malaria Detection. J. Imaging 2023, 9, 266. [Google Scholar] [CrossRef] [PubMed]
  154. Saxena, S.; Sanyal, P.; Bajpai, M.; Prakash, R.; Kumar, S. Trials and tribulations: Developing an artificial intelligence for screening malaria parasite from peripheral blood smears. Med. J. Armed Forces India, 2023; in press. [Google Scholar] [CrossRef]
  155. Guemas, E.; Routier, B.; Ghelfenstein-Ferreira, T.; Cordier, C.; Hartuis, S.; Marion, B.; Bertout, S.; Varlet-Marie, E.; Costa, D.; Pasquier, G. Automatic patient-level recognition of four Plasmodium species on thin blood smear by a real-time detection transformer (RT-DETR) object detection algorithm: A proof-of-concept and evaluation. Microbiol. Spectr. 2024, 12, e01440-23. [Google Scholar] [CrossRef] [PubMed]
  156. Ilyas, T.; Ahmad, K.; Arsa, D.M.S.; Jeong, Y.C.; Kim, H. Enhancing medical image analysis with unsupervised domain adaptation approach across microscopes and magnifications. Comput. Biol. Med. 2024, 108055. [Google Scholar] [CrossRef]
157. Centers for Disease Control and Prevention (CDC) Website. Available online: https://www.cdc.gov/dpdx/malaria/index.html (accessed on 20 June 2024).
  158. Rezatofighi, S.H.; Soltanian-Zadeh, H. Automatic recognition of five types of white blood cells in peripheral blood. Comput. Med. Imaging Graph. 2011, 35, 333–343. [Google Scholar] [CrossRef]
  159. Sarrafzadeh, O.; Rabbani, H.; Talebi, A.; Banaem, H.U. Selection of the best features for leukocytes classification in blood smear microscopic images. In Proceedings of the Medical Imaging 2014: Digital Pathology, San Diego, CA, USA, 15–20 February 2014; Volume 9041, pp. 159–166. [Google Scholar]
  160. Mendeley Data Website. Available online: https://data.mendeley.com/datasets/5bf2kmwvfn/1 (accessed on 20 June 2024).
  161. Mendeley Data Website. Available online: https://data.mendeley.com/datasets/2v6h4j48cx/1 (accessed on 20 June 2024).
  162. Tek, F.B.; Dempster, A.G.; Kale, I. Parasite detection and identification for automated thin blood film malaria diagnosis. Comput. Vis. Image Underst. 2010, 114, 21–32. [Google Scholar] [CrossRef]
  163. Ljosa, V.; Sokolnicki, K.L.; Carpenter, A.E. Annotated high-throughput microscopy image sets for validation. Nat. Methods 2012, 9, 637. [Google Scholar] [CrossRef] [PubMed]
  164. Makerere AI Lab. Website. Available online: https://air.ug/datasets/ (accessed on 20 June 2024).
  165. Sultani, W.; Nawaz, W.; Javed, S.; Danish, M.S.; Saadia, A.; Ali, M. Towards low-cost and efficient malaria detection. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2022, New Orleans, LA, USA, 18–24 June 2022; IEEE: Piscataway, NJ, USA, 2022; pp. 20655–20664. [Google Scholar]
  166. Molina, A.; Alférez, S.; Boldú, L.; Acevedo, A.; Rodellar, J.; Merino, A. Sequential classification system for recognition of malaria infection using peripheral blood cell images. J. Clin. Pathol. 2020, 73, 665–670. [Google Scholar] [CrossRef] [PubMed]
  167. Abbas, N.; Saba, T.; Rehman, A.; Mehmood, Z.; Kolivand, H.; Uddin, M.; Anjum, A. Plasmodium life cycle stage classification based quantification of malaria parasitaemia in thin blood smears. Microsc. Res. Tech. 2019, 82, 283–295. [Google Scholar] [CrossRef] [PubMed]
  168. Sadiq, M.J.; Balaram, V.V.S.S.S. DTBC: Decision tree based binary classification using with feature selection and optimization for malaria infected erythrocyte detection. Int. J. Appl. Eng. Res. 2017, 12, 15923–15934. [Google Scholar]
  169. Kudisthalert, W.; Pasupa, K.; Tongsima, S. Counting and classification of malarial parasite from giemsa-stained thin film images. IEEE Access 2020, 8, 78663–78682. [Google Scholar] [CrossRef]
  170. Poostchi, M.; Ersoy, I.; McMenamin, K.; Gordon, E.; Palaniappan, N.; Pierce, S.; Maude, R.J.; Bansal, A.; Srinivasan, P.; Miller, L.; et al. Malaria parasite detection and cell counting for human and mouse using thin blood smear microscopy. J. Med. Imaging 2018, 5, 044506. [Google Scholar] [CrossRef] [PubMed]
  171. Das, D.K.; Maiti, A.K.; Chakraborty, C. Automated system for characterization and classification of malaria-infected stages using light microscopic images of thin blood smears. J. Microsc. 2015, 257, 238–252. [Google Scholar] [CrossRef] [PubMed]
  172. Aris, T.A.; Nasir, A.S.A.; Mustafa, W.A.; Mashor, M.Y.; Haryanto, E.V.; Mohamed, Z. Robust Image Processing Framework for Intelligent Multi-Stage Malaria Parasite Recognition of Thick and Thin Smear Images. Diagnostics 2023, 13, 511. [Google Scholar] [CrossRef]
  173. RK, B. Extrinsic parameter’s adjustment and potential implications in Plasmodium falciparum malaria diagnosis. Microsc. Res. Tech. 2021, 85, 685–696. [Google Scholar]
  174. Bio-Sig Data Website. Available online: http://www.biosigdata.com/?download=malaria-image (accessed on 20 June 2024).
  175. Maity, M.; Dhane, D.; Mungle, T.; Maiti, A.K.; Chakraborty, C. Web-enabled distributed health-care framework for automated malaria parasite classification: An e-health approach. J. Med. Syst. 2017, 41, 192. [Google Scholar] [CrossRef] [PubMed]
  176. Chibuta, S.; Acar, A.C. Real-time malaria parasite screening in thick blood smears for low-resource setting. J. Digit. Imaging 2020, 33, 763–775. [Google Scholar] [CrossRef] [PubMed]
  177. Oliveira, A.; Prats, C.; Espasa, M.; Serrat, F.Z.; Sales, C.M.; Silgado, A.; Codina, D.L.; Arruda, M.E.; i Prat, J.G.; Albuquerque, J. The Malaria System MicroApp: A New, Mobile Device-Based Tool for Malaria Diagnosis. JMIR Res. Protoc. 2017, 6, e70. [Google Scholar] [CrossRef] [PubMed]
  178. Chen, S.; Zhao, S.; Huang, C. An Automatic Malaria Disease Diagnosis Framework Integrating Blockchain Enabled Cloud-edge Computing and Deep Learning. IEEE Internet Things J. 2023, 10, 21544–21553. [Google Scholar] [CrossRef]
  179. Maturana, C.R.; de Oliveira, A.D.; Nadal, S.; Serrat, F.Z.; Sulleiro, E.; Ruiz, E.; Bilalli, B.; Veiga, A.; Espasa, M.; Abelló, A.; et al. iMAGING: A novel automated system for malaria diagnosis by using artificial intelligence tools and a universal low-cost robotized microscope. Front. Microbiol. 2023, 14, 1240936. [Google Scholar] [CrossRef] [PubMed]
  180. Horstmeyer, R.; Chen, R.Y.; Kappes, B.; Judkewitz, B. Convolutional neural networks that teach microscopes how to image. arXiv 2017, arXiv:1709.07223. [Google Scholar]
  181. Muthumbi, A.; Chaware, A.; Kim, K.; Zhou, K.C.; Konda, P.C.; Chen, R.; Judkewitz, B.; Erdmann, A.; Kappes, B.; Horstmeyer, R. Learned sensing: Jointly optimized microscope hardware for accurate image classification. Biomed. Opt. Express 2019, 10, 6351–6369. [Google Scholar] [CrossRef] [PubMed]
  182. Hung, J.; Goodman, A.; Ravel, D.; Lopes, S.C.; Rangel, G.W.; Nery, O.A.; Malleret, B.; Nosten, F.; Lacerda, M.V.; Ferreira, M.U.; et al. Keras R-CNN: Library for cell detection in biological images using deep neural networks. BMC Bioinform. 2020, 21, 300. [Google Scholar] [CrossRef] [PubMed]
  183. Manouchehri, N.; Kalra, M.; Bouguila, N. Online variational inference on finite multivariate beta mixture models for medical applications. IET Image Process. 2021, 15, 1869–1882. [Google Scholar] [CrossRef]
  184. Yao, X.; Pathak, V.; Xi, H.; Chaware, A.; Cooke, C.; Kim, K.; Xu, S.; Li, Y.; Dunn, T.; Konda, P.C.; et al. Increasing a microscope’s effective field of view via overlapped imaging and machine learning. Opt. Express 2022, 30, 1745–1761. [Google Scholar] [CrossRef]
  185. Schwarz Schuler, J.P.; Also, S.R.; Puig, D.; Rashwan, H.; Abdel-Nasser, M. An enhanced scheme for reducing the complexity of pointwise convolutions in CNNs for image classification based on interleaved grouped filters without divisibility constraints. Entropy 2022, 24, 1264. [Google Scholar] [CrossRef] [PubMed]
  186. Gupta, A.; Sabirsh, A.; Wählby, C.; Sintorn, I.M. SimSearch: A human-in-the-loop learning framework for fast detection of regions of interest in microscopy images. IEEE J. Biomed. Health Inform. 2022, 26, 4079–4089. [Google Scholar] [CrossRef] [PubMed]
  187. Ali, R.; Hardie, R.C.; Narayanan, B.N.; Kebede, T.M. IMNets: Deep learning using an incremental modular network synthesis approach for medical imaging applications. Appl. Sci. 2022, 12, 5500. [Google Scholar] [CrossRef]
  188. Musaev, J.; Anorboev, A.; Seo, Y.S.; Nguyen, N.T.; Hwang, D. ICNN-Ensemble: An Improved Convolutional Neural Network Ensemble Model for Medical Image Classification. IEEE Access 2023, 11, 86285–86296. [Google Scholar] [CrossRef]
  189. El-Bana, S.; Al-Kabbany, A.; Elragal, H.M.; Said, E.K. Evaluating the Potential of Wavelet Pooling on Improving the Data Efficiency of Light-Weight CNNs. IEEE Access 2023, 11, 51199–51213. [Google Scholar] [CrossRef]
  190. Ferreira, M.F.; Camacho, R.; Teixeira, L.F. Using autoencoders as a weight initialization method on deep neural networks for disease detection. BMC Med. Inform. Decis. Mak. 2020, 20, 1–18. [Google Scholar] [CrossRef] [PubMed]
  191. Abd El-Ghany, S.; Elmogy, M.; El-Aziz, A.A. Computer-Aided Diagnosis System for Blood Diseases Using EfficientNet-B3 Based on a Dynamic Learning Algorithm. Diagnostics 2023, 13, 404. [Google Scholar] [CrossRef] [PubMed]
  192. Fraunhofer AICOS Portugal. DEMalariaScope—Automatic Detection of Malaria in Blood Smears Using Smartphones. Available online: https://www.aicos.fraunhofer.pt/en/our_work/projects/malariascope.html (accessed on 18 March 2020).
193. Quinn, J.A.; Nakasi, R.; Mugagga, P.K.B.; Byanyima, P.; Lubega, W.; Andama, A. Deep Convolutional Neural Networks for Microscopy-Based Point of Care Diagnostics. Proc. Mach. Learn. Res. 2016, 56, 271–281. [Google Scholar]
  194. Traore, B.B.; Kamsu-Foguem, B.; Tangara, F. Deep convolution neural network for image recognition. Ecol. Inform. 2018, 48, 257–268. [Google Scholar] [CrossRef]
  195. Open Science Framework Website. Available online: https://osf.io/3kc2d/ (accessed on 20 June 2024).
  196. Kanakasabapathy, M.K.; Thirumalaraju, P.; Kandula, H.; Doshi, F.; Sivakumar, A.D.; Kartik, D.; Gupta, R.; Pooniwala, R.; Branda, J.A.; Tsibris, A.M.; et al. Adaptive adversarial neural networks for the analysis of lossy and domain-shifted datasets of medical images. Nat. Biomed. Eng. 2021, 5, 571–585. [Google Scholar] [CrossRef]
  197. Gunning, D.; Aha, D. DARPA’s explainable artificial intelligence (XAI) program. AI Mag. 2019, 40, 44–58. [Google Scholar]
  198. Wang, S.; Zhang, Y. Grad-CAM: Understanding AI models. Comput. Mater. Contin. 2023, 76, 1321–1324. [Google Scholar]
  199. Sundararajan, M.; Najmi, A. The many Shapley values for model explanation. In Proceedings of the 37th International Conference on Machine Learning, PMLR, Online, 13–18 July 2020; pp. 9269–9278. [Google Scholar]
  200. Bera, K.; Schalper, K.A.; Rimm, D.L.; Velcheti, V.; Madabhushi, A. Artificial intelligence in digital pathology—new tools for diagnosis and precision oncology. Nat. Rev. Clin. Oncol. 2019, 16, 703–715. [Google Scholar] [CrossRef] [PubMed]
  201. Park, S.H.; Han, K. Methodologic guide for evaluating clinical performance and effect of artificial intelligence technology for medical diagnosis and prediction. Radiology 2018, 286, 800–809. [Google Scholar] [CrossRef]
202. Briganti, G.; Le Moine, O. Artificial intelligence in medicine: Today and tomorrow. Front. Med. 2020, 7, 27. [Google Scholar] [CrossRef]
  203. Gusenbauer, M.; Haddaway, N.R. Which academic search systems are suitable for systematic reviews or meta-analyses? Evaluating retrieval qualities of Google Scholar, PubMed, and 26 other resources. Res. Synth. Methods 2020, 11, 181–217. [Google Scholar] [CrossRef]
  204. Sukumarran, D.; Hasikin, K.; Khairuddin, A.S.M.; Ngui, R.; Sulaiman, W.Y.W.; Vythilingam, I.; Divis, P.C. Machine and deep learning methods in identifying malaria through microscopic blood smear: A systematic review. Eng. Appl. Artif. Intell. 2024, 133, 108529. [Google Scholar] [CrossRef]
Figure 1. Plasmodium parasite transmission and life cycle.
Figure 2. CAD pipeline for automatic malaria diagnosis.
Figure 3. Combination of keywords for the search strategy.
Figure 4. PRISMA flow diagram.
Figure 5. The trend of publications over the past 10 years.
Figure 6. Pie chart of the percentages of use of public and private datasets.
Figure 7. Analysis of models trained on the NIH dataset.
Table 1. Quality checklist.

No | Quality Question
SCQ1 | Is the report clear and coherent?
SCQ2 | Is the aim of the research clearly specified?
SCQ3 | Is the data collection method clearly described?
SCQ4 | Have the diversity contexts been well explored?
SCQ5 | Are the findings of the study reliable?
SCQ6 | Are there links between the data, interpretation, and conclusion?
SCQ7 | Are the methodology and experimentation process clear?
SCQ8 | Are the research procedures documented adequately?
SCQ9 | If credible, are the findings important?
SCQ10 | Could the research findings be replicated?
Table 2. Overview of common pre-trained models.

Model | Variants | Parameters | Layers | Input Size
ResNet | ResNet-18 | 11.17 M | 18 | 224 × 224
ResNet | ResNet-50 | 25.60 M | 50 | 224 × 224
ResNet | ResNet-101 | 44 M | 101 | 224 × 224
DenseNet | DenseNet-121 | 17 M | 121 | 224 × 224
DenseNet | DenseNet-169 | 29 M | 169 | 224 × 224
DenseNet | DenseNet-201 | 20.2 M | 201 | 224 × 224
Inception-ResNet | Inception-ResNetV1 | 24 M | 60 | 299 × 299
Inception-ResNet | Inception-ResNetV2 | 56 M | 164 | 299 × 299
MobileNet | MobileNet | 13 M | 30 | 224 × 224
MobileNet | MobileNetV2 | 3.5 M | 53 | 224 × 224
VGG | VGG-16 | 138 M | 16 | 224 × 224
VGG | VGG-19 | 143 M | 19 | 224 × 224
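To make the role of these backbones concrete, the following minimal sketch illustrates the transfer-learning pattern shared by most reviewed works: a pre-trained network (here VGG-16, one of the variants in Table 2) is frozen and a small binary head is trained on cell crops. The framework (TensorFlow/Keras), head architecture, and hyperparameters are illustrative assumptions, not the configuration of any cited study.

```python
# Illustrative transfer-learning sketch (assumed setup; not from any reviewed paper).
import tensorflow as tf

base = tf.keras.applications.VGG16(
    weights="imagenet",         # ImageNet pre-training, as for the Table 2 models
    include_top=False,          # discard the original 1000-class head
    input_shape=(224, 224, 3),  # input size reported in Table 2
)
base.trainable = False          # freeze the convolutional features

model = tf.keras.Sequential([
    base,
    tf.keras.layers.GlobalAveragePooling2D(),
    tf.keras.layers.Dense(256, activation="relu"),
    tf.keras.layers.Dropout(0.5),
    tf.keras.layers.Dense(1, activation="sigmoid"),  # parasitized vs. uninfected
])
model.compile(optimizer="adam", loss="binary_crossentropy", metrics=["accuracy"])
```

Fine-tuning variants of this pattern additionally unfreeze the last convolutional blocks once the new head has converged.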
Table 3. Summary of the evaluation metrics.

Metric | Formula | Description
Recall (REC) or Sensitivity | $\mathrm{REC} = \frac{TP}{TP + FN}$ | A measure of an algorithm's ability to correctly predict TP out of the total number of positive observations
Specificity (SP) | $\mathrm{SP} = \frac{TN}{TN + FP}$ | The percentage of TN out of the total number of negative observations predicted by an algorithm
Precision (PR) | $\mathrm{PR} = \frac{TP}{TP + FP}$ | The extent to which the algorithm is correct in terms of positive results; calculated as TP over the total number of predicted positive observations
Accuracy (ACC) | $\mathrm{ACC} = \frac{TP + TN}{TP + TN + FP + FN}$ | General description of the performance of the model on all classes; the ratio of correct predictions to the total number of predictions
F1-score (F1) | $F1 = \frac{2 \times PR \times REC}{PR + REC}$ | The harmonic mean of precision and sensitivity
Matthews correlation coefficient (MCC) | $\mathrm{MCC} = \frac{TP \times TN - FP \times FN}{\sqrt{(TP + FP)(TP + FN)(TN + FP)(TN + FN)}}$ | Evaluation of a binary classification model's performance; values lie in [−1, 1], where the minimum indicates a poor classifier and the maximum a perfect classifier
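Since the reviewed studies each report a subset of these metrics, a short self-contained helper may clarify how all of them derive from the four confusion-matrix counts; the counts in the example call are arbitrary toy values.

```python
# The metrics of Table 3 computed from confusion-matrix counts (toy values only).
from math import sqrt

def metrics(tp: int, tn: int, fp: int, fn: int) -> dict:
    rec = tp / (tp + fn)                      # recall / sensitivity
    sp = tn / (tn + fp)                       # specificity
    pr = tp / (tp + fp)                       # precision
    acc = (tp + tn) / (tp + tn + fp + fn)     # accuracy
    f1 = 2 * pr * rec / (pr + rec)            # harmonic mean of PR and REC
    mcc = (tp * tn - fp * fn) / sqrt(
        (tp + fp) * (tp + fn) * (tn + fp) * (tn + fn)
    )
    return {"REC": rec, "SP": sp, "PR": pr, "ACC": acc, "F1": f1, "MCC": mcc}

print(metrics(tp=90, tn=85, fp=15, fn=10))
```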
Table 4. Overview of cited works using pre-trained networks to classify NIH datasets (results have been rounded). The symbol “-” indicates that no information is provided on a particular operation. The abbreviation “Av” indicates the average results over several repetitions of the experiment.

Pre-Processing | Data Augmentation | Network | Results | Cross-Validation | External Validation | Author and Year
Rescaling 128 × 128 × 3 | - | SSD300 + modified pre-trained VGG-16 | Original images: ACC 96.00 ± 0.3%; AUC 99.2 ± 0.2%. Images after FSRCNN: ACC 96.53 ± 0.43%; AUC 99.40 ± 0.10% | 5-fold | BBBC041: AUC 94.50 ± 2.5% | Zhao et al., 2020 [43]
Rescaling 299 × 299 × 3; normalization [0, 1] | Random rotation between 0° and 270° | Pre-trained Xception | ACC 98.86% | Hold-Out | - | Sriporn et al., 2020 [45]
- | - | Mask R-CNN + 2 pre-trained ResNet-50 | Image level: Av ACC 83.9%. Patient level: Av ACC 92.3% | 5-fold | - | Kassim et al., 2021 [48]
Rescaling 224 × 224 × 3 | Rotation, tone, etc. | Pre-trained EfficientDet-lite4 + MobileNetV2 (MAIScope) | Av ACC 89.90% | Hold-Out | Images of malaria field microscope slides provided by Cancyte Labs, India | Sangameswaran et al., 2022 [49]
Rescaling 50 × 50 × 3; normalization [0, 1]; color standardization | Rotation, flipping, horizontal/vertical shift, lightening or darkening, zooming in and out | Pre-trained VGG-19, ResNet-50, MobileNetV2 | ACC 100.00% | Hold-Out | - | Alnussairi et al., 2022 [50]
- | MixUp [53] and CutMix [54] algorithms | Pre-trained ResNet-50 | ACC 98.75% | Hold-Out | - | Sinha et al., 2023 [51]
Rescaling 120 × 120 × 3 | Rotation (20°), zooming (0.05), width/height shifting (±0.05), shearing (0.05), and flipping | Pre-trained VGG-19 + Custom CNNs (HPTDL-MPDC) | ACC 89.00% | 5-fold | - | Kundu et al., 2023 [52]
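The augmentation operations recurring in this and the following tables (rotations, shifts, zooms, flips) are typically applied on the fly during training. The sketch below expresses such a pipeline with Keras preprocessing layers; the parameter values are placeholders, not those of any cited work.

```python
# Hedged sketch of a typical augmentation pipeline (parameter values are placeholders).
import tensorflow as tf

augment = tf.keras.Sequential([
    tf.keras.layers.RandomRotation(20 / 360),     # roughly ±20 degrees
    tf.keras.layers.RandomTranslation(0.1, 0.1),  # height/width shift
    tf.keras.layers.RandomZoom(0.1),
    tf.keras.layers.RandomFlip("horizontal_and_vertical"),
])

# Usually applied per batch during training, e.g.:
# train_ds = train_ds.map(lambda x, y: (augment(x, training=True), y))
```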
Table 5. Overview of cited works comparing some custom and pre-trained CNN approaches to classify NIH datasets (results have been rounded). The symbol “-” indicates that no information is provided on a particular operation. The abbreviation “Av” indicates the average results over several repetitions of the experiment.

Pre-Processing | Data Augmentation | Best Model | Results | Cross-Validation | External Validation | Author and Year
Rescaling 224 × 224 × 3 | - | ResNet-50 | Cellular level: ACC 98.60%; AUC 99.90%. Patient level: ACC 95.9 ± 0.8%; AUC 99.10 ± 0.5% | 5-fold | - | Rajaraman et al., 2018 [55]
Rescaling 200 × 200 × 3; Gaussian blurring; darkening and lightening; ZCA whitening; feature-wise standardization | Vertical/horizontal flipping and shifting, rotation (±25°), clipping 0–20% of height/width, translation (±0.2), and cutting (±25) | Pre-trained VGG-16 | Image level: ACC 97.00 ± 0.5%. Patient level: ACC 93.00%; AUC 86.50% | 5-fold and Hold-Out | - | Rahman et al., 2019 [56]
Rescaling 224 × 224 × 3; normalization | - | Pre-trained VGG-19 | ACC 97.21% | Hold-Out | - | Jameela et al., 2022 [57]
Color adjustment (multiplication of each channel by the ratio of the average of the three channels to the average of the channel under consideration) | Random rotations (±90°), random translations (±10 pixels) on the X-axis and Y-axis, and 50% reflection around the X-axis and Y-axis | Pre-trained ResNet-18 and DenseNet-201 | ResNet-18 binary classification on NIH: ACC 97.68%. DenseNet-201 multiclass classification on MP-IDB-FC: ACC 99.40% | 5-fold | DenseNet-201 trained on NIH and tuned to MP-IDB-FC: ACC 97.45 ± 1.40%. DenseNet-201 trained on MP-IDB-FC and tuned to MP-IDB-VC (P. vivax): ACC 87.10 ± 1.15% | Loddo et al., 2022 [58]
- | - | Pre-trained VGG-19 | ACC 93.89% | Hold-Out | - | Turuk et al., 2022 [59]
- | - | Pre-trained DenseNet | DenseNet-121: REC 94.90%. DenseNet-169: PR 95.97%. DenseNet-201: ACC 93.39% | Hold-Out | - | Qadir et al., 2022 [60]
Rescaling 224 × 224 × 3; normalization [0, 1] | Rotation (range of 20°), width/height shift (0.2), shear (0.2), horizontal/vertical flip | Pre-trained VGG-19 | ACC 98.90% | Hold-Out | - | Hassan et al., 2022 [61]
Tile cropping | - | Modified YOLO-v4 (YOLO-v4-tiny) | REC 95.30%; Av PR 87.10% | Hold-Out | On [65]: Av PR 83.40%; REC 94.70%. On [66]: Av PR 73.10%; REC 96.30% | Shewajo et al., 2023 [62]
- | - | Pre-trained ResNet-50 | ACC 98.49% | Hold-Out | - | Hemachandran et al., 2023 [63]
- | GAN | Pre-trained VGG-19 | ACC 98.01% | Hold-Out | - | Dath et al., 2023 [64]
Table 6. Overview of cited works using custom CNN approaches to classify NIH datasets (results have been rounded). The symbol “-” indicates that no information is provided on a particular operation. The abbreviation “Av” indicates the average results over several repetitions of the experiment.

Pre-Processing | Data Augmentation | CNN Details | Results | Cross-Validation | External Validation | Author and Year
Rescaling; median filtering | - | 3 convolutional layers | Binary: ACC 98.93%. Multi-class: ACC 99.06% | 10-fold | - | Sarkar et al., 2020 [67]
Rescaling 64 × 64 × 3; grayscale conversion; flattening | - | 4 convolutional blocks | ACC 92.00% | Hold-Out | - | Masud et al., 2020 [68]
Rescaling 120 × 120 × 3; YUV conversion; intensity equalization; BGR conversion; stain normalization | - | 22 layers | ACC 99.96 ± 0.001% | 5-fold | - | Umer et al., 2020 [69]
Rescaling 64 × 64 × 3 | - | 18 layers | ACC 98.85% | Hold-Out | - | Alqudah et al., 2020 [70]
Rescaling 64 × 64 × 3 | - | 5 convolutional layers | ACC 94.56% | 4-fold | - | Malhotra et al., 2020 [71]
Rescaling 181 × 181 × 3 | - | 13 layers | ACC 95.00% | Hold-Out | Thin and thick blood films from the Core Laboratory at the Hospital Clínic of Barcelona: ACC 75.39% | Delgado-Ortet et al., 2020 [72]
Rescaling 44 × 44 × 3 | - | 20 layers | ACC 95.28%; AUC 98.86% | 5-fold | - | Irmak, 2021 [73]
Rescaling 125 × 125 × 3; bilateral filtering | Zooming (0.1), rotation (25°), shearing (0.05), horizontal flipping, width/height shifting (0.1) | 5 convolutional layers | ACC 96.82% | 5-fold | - | Maqsood et al., 2021 [74]
Rescaling 134 × 134 × 3 | Rotation, shearing and zooming, width/height shifting, and horizontal flipping | 6 convolutional layers | ACC 96.23% | Hold-Out | - | Magotra et al., 2022 [75]
Rescaling 64 × 64 × 3 | - | 4 convolutional layers | Thin blood smears: ACC 96.06%. Thick blood smears: ACC 96.97% | Hold-Out | - | Uzun Ozsahin et al., 2022 [76]
Rescaling 64 × 64 × 3 | - | - | ACC 99.70% | Hold-Out | - | Hcini et al., 2022 [77]
Rescaling; Gaussian blurring | - | 2 convolutional layers | ACC 97.81% | 4-fold | - | Cho et al., 2023 [78]
Adaptive thresholding; increased intensity of the red channel; reassembly of the 3 RGB channels | - | 1 convolutional layer | ACC 97.21 ± 0.3% | 5-fold | - | Yebasse et al., 2023 [79]
Rescaling 64 × 64 × 3 | - | 2 convolutional layers | ACC 96.02% | Hold-Out | - | Shambhu et al., 2023 [80]
Rescaling 100 × 100 × 3 | Rotations (90°, 180°, and 270°), and reflections along the X-axis and Y-axis | 4 convolutional layers | ACC 98.3% | 5-fold | - | Kashtriya et al., 2019 [81]
Rescaling 32 × 32 × 3, 64 × 64 × 3, 128 × 128 × 3, 256 × 256 × 3; normalization | - | 4 parallel CNNs with 2 convolutional layers each | ACC 99.50%; AUC 99.90% | Hold-Out | - | Mahmud et al., 2020 [82]
Rescaling 128 × 128 × 3; normalization [0, 1] | - | 9 layers | ACC 92.94% | Hold-Out | - | Banerjee et al., 2022 [83]
- | Rotation (±20°, to the right or left), translation (±3 pixels), horizontal/vertical random scaling (between ±1) | - | ACC 94.79% | Hold-Out | - | Oyewola et al., 2022 [84]
Rescaling 120 × 120 × 3; normalization with mean of 0.5 and standard deviation of 0.5 | Random rotation, random horizontal/vertical flip, and color jitter of 0.05% | 3 convolutional layers | ACC 96.6%; AUC 99.01% | Hold-Out | - | Kumar et al., 2022 [85]
Rescaling 50 × 50 × 3 | Horizontal flipping, rotation (20°), width/height shifting (0.1), zooming (0.1), shearing (0.1), and filling mode “nearest” | 4 blocks | ACC 96.73%; AUC 99.35% | Hold-Out | - | Asif et al., 2024 [86]
Rescaling 32 × 32 × 3 | - | 2 convolutional layers | ACC 95.60% | Hold-Out | - | Alaiad et al., 2023 [87]
Rescaling 124 × 124 × 3; discrete wavelet transform (DWT); merging the inverse DWTs at low–low and high–high resolution, moving to a size 82 × 82 × 1 | Rotation (range ±5°), shearing (±0.05), translation, reflection (±1) | 3 convolutional blocks | ACC 97.98%; AUC 99.6% | Hold-Out | - | Khan et al., 2022 [88]
Rescaling 96 × 96 × 3 | - | - | Original dataset: ACC 96.41%; AUC 99.11%. Modified dataset: ACC 99.25%; AUC 99.99% | Hold-Out | - | Islam et al., 2022 [89]
Rescaling 96 × 96 × 3 | - | 3 convolutional layers | CNN-LSTM: ACC 99.86%. CNN-BiLSTM: ACC 99.89% | Hold-Out | - | Alonso-Ramírez et al., 2022 [90]
Rescaling 128 × 128 × 3; normalization [0, 1] | - | VGG-16 + CNN | ACC 96.6%; AUC 98.00% | 5-fold | - | Amin et al., 2023 [91]
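As a point of reference for the "CNN Details" column, a compact custom network of the kind summarized above might look as follows. The depth, filter counts, and input size are assumptions chosen to match the most common entries (a few convolutional blocks on 64 × 64 × 3 crops), not any single cited architecture.

```python
# Illustrative custom CNN in the spirit of the Table 6 entries (assumed architecture).
import tensorflow as tf

model = tf.keras.Sequential([
    tf.keras.layers.Input(shape=(64, 64, 3)),
    tf.keras.layers.Conv2D(32, 3, activation="relu", padding="same"),
    tf.keras.layers.MaxPooling2D(),
    tf.keras.layers.Conv2D(64, 3, activation="relu", padding="same"),
    tf.keras.layers.MaxPooling2D(),
    tf.keras.layers.Conv2D(128, 3, activation="relu", padding="same"),
    tf.keras.layers.MaxPooling2D(),
    tf.keras.layers.Flatten(),
    tf.keras.layers.Dense(128, activation="relu"),
    tf.keras.layers.Dense(1, activation="sigmoid"),  # parasitized vs. uninfected
])
model.compile(optimizer="adam", loss="binary_crossentropy", metrics=["accuracy"])
model.summary()
```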
Table 7. Overview of cited works using AE to classify NIH datasets (results have been rounded). The symbol “-” indicates that no information is provided on a particular operation. The abbreviation “Av” indicates the average results over several repetitions of the experiment.

Pre-Processing | Data Augmentation | Model | Results | Cross-Validation | External Validation | Author and Year
Rescaling 32 × 32 × 3; normalization [0, 1] | Random rotation (20°) and zooming (0.05), width/height shift (±0.05), shearing (0.05), and horizontal flipping | Custom CNN based on AE | ACC 99.23% | Hold-Out | On [94]: ACC 98.79% | Fuhad et al., 2020 [92]
Rescaling 32 × 32 × 3; grayscale conversion | - | Custom AE | ACC 98.49% | Hold-Out | - | Huq et al., 2023 [93]
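The AE-based strategy of these works can be sketched as follows, with all architectural details assumed: a convolutional autoencoder is first trained to reconstruct unlabeled cell crops, and its encoder is then reused as the backbone of a classifier.

```python
# Hedged sketch of AE-based initialization (assumed architecture, not a cited design).
import tensorflow as tf

inp = tf.keras.Input(shape=(32, 32, 3))
x = tf.keras.layers.Conv2D(32, 3, activation="relu", padding="same")(inp)
x = tf.keras.layers.MaxPooling2D()(x)
encoded = tf.keras.layers.Conv2D(64, 3, activation="relu", padding="same")(x)

x = tf.keras.layers.UpSampling2D()(encoded)
decoded = tf.keras.layers.Conv2D(3, 3, activation="sigmoid", padding="same")(x)

autoencoder = tf.keras.Model(inp, decoded)
autoencoder.compile(optimizer="adam", loss="mse")
# autoencoder.fit(cell_crops, cell_crops, epochs=10)  # reconstruction pre-training

# Reuse the trained encoder for classification:
encoder = tf.keras.Model(inp, encoded)
clf = tf.keras.Sequential([
    encoder,
    tf.keras.layers.GlobalAveragePooling2D(),
    tf.keras.layers.Dense(1, activation="sigmoid"),
])
```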
Table 8. Overview of cited works using hybrid models to classify NIH datasets (results have been rounded). The symbol “-” indicates that no information is provided on a particular operation. The abbreviation “Av” indicates the average results over several repetitions of the experiment.

Pre-Processing | Data Augmentation | Model | Results | Cross-Validation | External Validation | Author and Year
Rescaling 224 × 224 × 3; color constancy method | - | Custom Deep Residual CNN + SVM | ACC 99.90% | Hold-Out | - | Diker, 2022 [95]
Rescaling 120 × 120 × 3; grayscale conversion | - | Custom CNN + XGBoost | ACC 94.78% | Hold-Out | - | Raihan et al., 2022 [96]
Rescaling 224 × 224 × 3; normalization [0, 1]; median filtering; data cleaning | - | Custom RAL-CNN + SVM (RAL-CNN-SVM) | ACC 99.70%; AUC 99.90% | Hold-Out | - | Li et al., 2022 [97]
Bilateral filtering | - | Custom CNN + SVM | ACC 94.00% | Hold-Out | - | Jones et al., 2023 [98]
Rescaling 32 × 32 × 3; normalization [0, 1] | - | Custom CNN + DELM (CNN-DELM) | Original dataset: ACC 97.79%; AUC 99.52%. Modified dataset: ACC 99.66%; AUC 99.96% | Hold-Out | - | Goni et al., 2023 [99]
Rescaling 224 × 224 × 3 | - | Pre-trained VGG-16 + SVM | ACC 94.88%; AUC 98.50% | 10-fold | - | Abubakar et al., 2021 [100]
Gaussian filtering | - | NasNetLarge + ELM (BMODTL-BMPC) | ACC 99.04% | Hold-Out | - | Dutta et al., 2022 [101]
Median filtering | - | Pre-trained Res2Net + KNN (IDTL-MPDC) | ACC 95.86%; AUC 98.63% | Hold-Out | - | Alassaf et al., 2022 [102]
L*a*b* conversion; intensity adjustment; histogram equalization and leveling | - | Pre-trained DarkNet-53 and DenseNet-201 + LR | ACC 99.67% | 5-fold | - | Imran et al., 2022 [103]
Rescaling 128 × 128 × 3 | - | Pre-trained Inception-v3 + Capsule ANN | ACC 99.35%; AUC 99.73% | Hold-Out | - | Madhu et al., 2023 [104]
- | - | Pre-trained NASNet + RF + SVM | ACC 98.60 ± 0.18% | 10-fold | - | Qadri et al., 2023 [105]
Bilateral filtering | - | Pre-trained ResNet-18 + Custom ANN | Wide ANN: ACC 99.48%; AUC 98.00% | 10-fold | - | Amin et al., 2024 [106]
Rescaling 224 × 224 × 3; normalization [0, 1]; thresholding | Rotation | Pre-trained VGG-16 and VGG-19 + RF | ACC 93.35% | Hold-Out | - | Murmu et al., 2024 [107]
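The hybrid pattern common to this table is deep feature extraction followed by a classic classifier. A minimal sketch, assuming a frozen ImageNet ResNet-50 and an RBF SVM (the specific CNN/classifier pairings differ across the cited works):

```python
# Hedged sketch of the CNN-feature + SVM hybrid (dataset arrays are placeholders).
import numpy as np
import tensorflow as tf
from sklearn.svm import SVC

backbone = tf.keras.applications.ResNet50(
    weights="imagenet", include_top=False, pooling="avg",
    input_shape=(224, 224, 3),
)

def deep_features(images: np.ndarray) -> np.ndarray:
    x = tf.keras.applications.resnet50.preprocess_input(images.astype("float32"))
    return backbone.predict(x, verbose=0)   # one 2048-D vector per image

# X_train: (n, 224, 224, 3) uint8 cell crops; y_train: 0/1 labels (placeholders)
# svm = SVC(kernel="rbf").fit(deep_features(X_train), y_train)
# y_pred = svm.predict(deep_features(X_test))
```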
Table 9. Overview of cited works using ensemble learning approaches to classify NIH datasets (results have been rounded). The symbol “-” indicates that no information is provided on a particular operation. The abbreviation “Av” indicates the average results over several repetitions of the experiment.

Pre-Processing | Data Augmentation | Model | Results | Cross-Validation | External Validation | Author and Year
Normalization [0, 1] | - | Ensemble of 10 pre-trained EfficientNetB0 | ACC 96.23% | 10-fold | - | Marques et al., 2022 [108]
Rescaling 224 × 224 × 3 | - | Pre-trained ResNet-18 + Ensemble RVFL/SNN/ELM | ACC 95.73 ± 2.63% | 5-fold | - | Zhu et al., 2022 [109]
Rescaling 100 × 100 × 3; mean normalization | Rotation, translation, cutting, zooming, and flipping | Ensemble pre-trained VGG-19/SqueezeNet | ACC 99.51 ± 0.1%; AUC 99.92 ± 0.1% | 5-fold | - | Rajaraman et al., 2019 [110]
Rescaling 64 × 64 × 3; normalization [0, 1] | Rotation (0.2), zooming (0.2), horizontal/vertical flipping | Ensemble of pre-trained VGG-16, VGG-19, and DenseNet-201 | ACC 97.92% | Hold-Out | - | Bhuiyan et al., 2023 [111]
Rescaling 224 × 224 × 3; normalization [0, 1] | Horizontal flipping, rotation (15°), zooming (10%), lighting (20%) | Ensemble CNN/pre-trained DenseNet-121, ResNet-101, VGG-19 | Av ensemble: ACC 99.13%; AUC 99.60% | Hold-Out | - | Nayak et al., 2022 [112]
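Fusion schemes vary across these works (weighted averaging, stacking, learned combiners); the simplest variant, soft voting over per-model probabilities, is sketched below with placeholder model handles.

```python
# Minimal soft-voting ensemble sketch (assumed mechanics, not a specific cited paper).
import numpy as np

def ensemble_predict(models, images):
    # each model returns per-image probabilities of the "parasitized" class
    probs = np.stack([m.predict(images, verbose=0).ravel() for m in models])
    return (probs.mean(axis=0) > 0.5).astype(int)  # average the soft votes

# y_pred = ensemble_predict([vgg16_model, vgg19_model, densenet_model], X_test)
```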
Table 10. Overview of cited works using classic ML models to classify NIH datasets (results have been rounded). The symbol “-” indicates that no information is provided on a particular operation. The abbreviation “Av” indicates the average results over several repetitions of the experiment.

Pre-Processing | Data Augmentation | Model | Results | Cross-Validation | External Validation | Author and Year
Rescaling; median filtering | - | VQC | Binary: ACC 98.93%. Multiclass: ACC 99.06% | 10-fold | - | Hossain et al., 2021 [113]
Noise removal and contrast enhancement using the CLAHE technique | - | RF (OML-AMPDC) | ACC 90.32% | Hold-Out | - | Kundu et al., 2023 [114]
Rescaling 100 × 100 × 3; 24-bit color code conversion; HSV conversion | - | DT | ACC 95.60% | 10-fold | - | Phumkuea et al., 2023 [115]
- | - | RF, KNN, Linear SVM + LR, DT, Kernel SVM and NB | ACC between 95.00% and 99.00% | 10-fold | - | Telang et al., 2023 [116]
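Classic ML pipelines like those above couple handcrafted features with a shallow classifier. A hedged sketch, assuming simple per-channel color histograms and a random forest (the cited works use richer, study-specific features):

```python
# Sketch of a classic ML pipeline: handcrafted color features + random forest.
# Feature choice and parameters are illustrative assumptions.
import numpy as np
from sklearn.ensemble import RandomForestClassifier

def color_histogram(image: np.ndarray, bins: int = 16) -> np.ndarray:
    # per-channel intensity histograms of an RGB cell crop, concatenated
    return np.concatenate(
        [np.histogram(image[..., c], bins=bins, range=(0, 255))[0] for c in range(3)]
    )

demo = (np.random.rand(50, 50, 3) * 255).astype(np.uint8)  # placeholder crop
print(color_histogram(demo).shape)  # (48,) feature vector

# X: list of RGB crops (uint8), y: 0/1 labels (placeholders)
# feats = np.array([color_histogram(img) for img in X])
# rf = RandomForestClassifier(n_estimators=200).fit(feats, y)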
Table 11. Overview of cited works developing mobile applications for malaria diagnosis on NIH datasets (results have been rounded). The symbol “-” indicates that no information is provided on a particular operation. The abbreviation “Av” indicates the average results over several repetitions of the experiment.

Pre-Processing | Data Augmentation | Model | Results | Cross-Validation | External Validation | Author and Year
- | - | Pre-trained CNN | Thick smears: patch level ACC 96.89%, AUC 98.48%; patient level ACC 78.00%, AUC 84.90%. Thin smears: patch level ACC 98.60%, AUC 99.90%; patient level ACC 95.90%, AUC 99.10% | 5-fold | - | Yu et al., 2020 [117]
Rescaling 44 × 44 × 3 | - | Custom CNN | ACC 93.46 ± 0.32%; AUC 98.39 ± 0.18% | 5-fold | Part of the original dataset: ACC 97.26%; AUC 97.34% | Yang et al., 2019 [118]
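For deployment on a phone, a trained classifier is commonly converted to a compact on-device format. The sketch below shows one plausible route via TensorFlow Lite with a placeholder model, without implying that either cited app was built this way.

```python
# Hedged sketch of packaging a classifier for mobile inference (TensorFlow Lite).
import tensorflow as tf

# placeholder model standing in for a trained malaria classifier
model = tf.keras.Sequential([
    tf.keras.layers.Input(shape=(44, 44, 3)),
    tf.keras.layers.Conv2D(16, 3, activation="relu"),
    tf.keras.layers.GlobalAveragePooling2D(),
    tf.keras.layers.Dense(1, activation="sigmoid"),
])

converter = tf.lite.TFLiteConverter.from_keras_model(model)
converter.optimizations = [tf.lite.Optimize.DEFAULT]  # post-training quantization
tflite_bytes = converter.convert()
with open("malaria_classifier.tflite", "wb") as f:
    f.write(tflite_bytes)
```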
Table 12. Overview of cited works using DL approaches to malaria parasite diagnosis (results have been rounded). The symbol “-” indicates that no information is provided on a particular operation. The abbreviation “Av” indicates the average results over several repetitions of the experiment.

Dataset | Task | Model | Results | Cross-Validation | External Validation | Author and Year
1300 Giemsa-stained images infected with P. vivax, containing approximately 100,000 cells | Binary classification | Faster R-CNN + AlexNet | ACC 98% | Hold-Out | - | Hung et al., 2017 [119]
Images of slides stained with Leishman, from which 1400 positive patches and 326,934 negative patches are cut out | Binary classification and cell counting | SVM and custom CNN | Binary classification: (A) SE 96.38 ± 0.88%; SP 95.43 ± 0.85%; MCC 91.81 ± 1.50%; AUC 99.10 ± 0.37%. (B) SE 98.91 ± 0.36%; SP 99.39 ± 0.31%; MCC 98.31 ± 0.39%; AUC 99.87 ± 9.9871 × 10⁻²%. (C) SE 99.14 ± 0.37%; SP 99.62 ± 0.18%; MCC 98.77 ± 0.32%; AUC 99.92 ± 7.5764 × 10⁻²%. Cell counting: (A) SE 92.87%; SP 93.84%; MCC 44.35%. (B) SE 96.56%; SP 98.27%; MCC 70.33%. (C) SE 96.98%; SP 98.50%; MCC 73.02% | 10-fold | - | Gopakumar et al., 2018 [120]
2867 samples, of which 1244 normal and 1623 infected | Binary classification | Ensemble learning (CSEC) | PR 95.64%; SE 92.80%; SP 94.96%; ACC 93.78% | 3-fold | - | Jagtap et al., 2019 [121]
1182 microscopic images of thick blood smears collected by the Makerere University AI research group | Binary classification | ANN with stacked models | ACC 89.10%; SE 93.90%; SP 83.10%; F1 94.50% | 10-fold | - | Pattanaik et al., 2020 [122]
DPDx dataset: 39,000 cells extracted from 108 patch-cut images [157] | Binary and multiclass classification | Deep filter bridge | Binary: Av F1 98.36%; Av ACC 98.12%. Multiclass: ACC P. falciparum 90.00%, P. ovale 98.80%, P. vivax 87.47%, P. malariae 98.60%, and P. knowlesi 90.00% | 10-fold | - | Pattanaik et al., 2021 [123]
1182 images with 3 different magnifications and frames of 7245 parasites | Binary classification | MM-ResNet | Av ACC 98.08% | 5-fold | - | Pattanaik et al., 2022 [124]
Z-stacks of thick blood smears stained with Giemsa | Malaria parasite detection | EDoF-CNN | REC 74.00%; PR 79.00% | Hold-Out | - | Manescu et al., 2019 [125]
Z-stacks of Giemsa-stained thick and thin blood smears from 239 patients. The 4168 extracted cells are annotated by pathologists | Binary classification | RetinaNet + VGG-19 (DeepMCNN) | SE 92.00%; SP 90.00%; PR 91.00%; PPV 92.00%; NPV 90.00% | Hold-Out | - | Manescu et al., 2020 [126]
125 thin blood smears stained with DAPI collected from 100 patients | Malaria detection | DLS 1 and DLS 2 | r = 0.99, p < 0.01 | Hold-Out | - | Holmström et al., 2020 [127]
50 microscopic images obtained from Giemsa-stained blood smears + MaMic database | Binary classification | VGG-19 + SVM | ACC 93.13%; SE 93.44%; SP 92.92%; PR 89.95%; F1 91.66% | 5-fold | - | Vijayalakshmi, 2020 [128]
27 images from [119] and 79 images from the Dr. Yani Triyani repository | Object detection | RetinaNet (ResNet-101 backbone) | Object detection: Av PR 94.00%; Av REC 74.00%; Av ACC 73.00%. Diagnosis: SP 98.00%; SE 99.00% | Hold-Out | - | Pardede et al., 2020 [129]
643 images of thick blood smears with parasite delineation boxes annotated by experienced laboratory technicians | Parasite detection | Faster R-CNN (ResNet-101 backbone) | Av PR 94.48%; PR 77.91%; REC 89.81% | Hold-Out | - | Nakasi et al., 2020 [130]
(1) Leukocyte Images for Segmentation and Classification (LISC) database [158] (five types of WBCs on a background of RBCs); (2) Isfahan University of Medical Science (IUMC) database [159] (individual WBCs with binary masks); (3) MAMIC database; (4) segmented NIH | Multiclass classification + object detection | Modified YOLO network | Cell classification: Av PR 95.56%. Malaria detection: SE 100.00%; SP 97.47% | Hold-Out | Combination of LISC and MAMIC images not contained in the training set: Av PR 93.66% | Chowdhury et al., 2020 [131]
64,966 cells from the first system and 14,219 from the second. The images are annotated by hand in 4 classes | Multiclass classification | ResNet-50 + GoogLeNet | First system: ACC 96.80%. Second system: ACC 98.80% | Hold-Out | - | Lebel et al., 2021 [132]
297 images acquired from Giemsa-stained samples and annotated by hand | Multiclass classification | Pre-trained Mask R-CNN | Detection: ACC 94.57%; Av PR 73.10%; PR 82.00%. Cell counting: ACC 89.56% | Hold-Out | - | Loh et al., 2021 [133]
BBBC041 | Multi-stage classification + binary adaptation | DTGCN | Staging: ACC 98.30 ± 0.03%; PR 98.50 ± 0.02%; REC 98.30 ± 0.02%; F1 98.30 ± 0.03% | - | Binary classification on NIH: ACC 95.4 ± 0.05%; PR 95.4 ± 0.07%; REC 95.4 ± 0.06%; F1 95.4 ± 0.07% | Li et al., 2021 [134]
6415 images of RBCs segmented from 53 thin peripheral blood smears stained with May Grünwald–Giemsa | Multiclass classification at the cellular level, and binary at the patient level | Pre-trained VGG-16 | ACC 99.75%; PR 97.00%; SE 100.00% | Hold-Out | 23 independent blood smears: ACC 99.50%; PR 95.70%; F1 97.40%; SE 99.20%, and SP 99.90% on RBCs. SE 100.00% and SP 91.70% at the patient level | Molina et al., 2021 [135]
38,000 RBCs from Giemsa-stained blood smears with P. falciparum and P. vivax | Parasite detection + binary classification | Faster R-CNN + ResNet-50 and ResNet-34 | Object detection: Av PR 98.00% on P. vivax; Av PR 99.00% on P. falciparum. Binary classification: ACC 99.80%; AUC 97.90% | Hold-Out | - | Davidson et al., 2021 [136]
955 microscopy images from 193 patients (148 P. falciparum-infected and 45 non-infected) | Cell counting | CFPNet-M-DEC | Image level: ACC 92.28%. Patient level: ACC 92.20% | Hold-Out | - | Ufuktepe et al., 2021 [137]
1182 images, of which 948 were malaria-infected (with 7628 P. falciparum parasites) and 234 normal with artifacts due to impurities [65] | Object detection | YOLOv3-MOD1, YOLOv3-MOD2, and YOLOv4-MOD | YOLOv4-MOD: Av PR 96.32%; PR 95.00%; REC 94.00%; F1 94.00%. YOLOv3-MOD2: Av PR 96.14%. YOLOv3-MOD1: Av PR 95.46% | Hold-Out | - | Abdurahman et al., 2021 [138]
Giemsa-stained images from the MP-IDB dataset (1) and [160] (2) | Multiclass classification | U-Net_VGG-19 + Custom CNN (Light-Net) | Cell classification: Av PR 88.00% (1); ACC 99.50% (2). Patient classification: ACC 100.00% (1) (2) | Hold-Out | Detection rate (dt) 99.00% on [161]. Species classification (SpC) ACC 63.00% on [55]. dt 99.00% and SpC ACC 84.00% on [162]. dt 85.00% and SpC ACC 31.00% on [163] | Yang et al., 2021 [139]
2565 RBC images belonging to two classes | Binary classification | Custom CNN | ACC 99.22%; REC 98.37%; PR 100.00%; F1 99.17% | Hold-Out | - | Jabbar et al., 2022 [140]
792 Giemsa-stained microscopy images of 17 malaria-infected patients, totaling 26,223 cells [160], manually labeled by three experts | Multiclass classification | Custom CNN | ACC 97.90% | Hold-Out | Same training data acquired by fluorescence microscopy (FM) (1232 images, 45,726 RBCs): ACC 96.30%; and acquired by atomic force microscopy (AFM) (173 images, 7386 RBCs): ACC 98.90% | Preißinger et al., 2022 [141]
903 field-stain images of blood smears manually annotated by experienced laboratory technicians | Multiclass detection | Faster R-CNN and SSD MobileNetV2 | Faster R-CNN: Av PR 66.09%. SSD MobileNetV2: Av PR 62.69% | Hold-Out | - | Nakasi et al., 2021 [66]
345 images from thin Giemsa-stained blood smears, each containing approximately 111 cells. The labels refer to the stage and location of the parasite | Parasite life stage classification | ViT | ACC 90.03% | Hold-Out | - | Sengar et al., 2022 [142]
IML-Malaria dataset: 345 images of Giemsa-stained blood samples containing P. vivax in four phases. A total of 38,000 cells | Binary + multiclass classification | ResNet50V2 | Av ACC 79.61%; F1 82.04% | Hold-Out | NIH: Av ACC 96.26%; F1 96.18%. BBBC041: Av ACC 99.62%; F1 93.07% | Arshad et al., 2022 [143]
1182 images of thick blood smears treated with field stain (948 P. falciparum-infected and 234 negative) [164] | Binary classification | YOLO-mp | YOLO-mp-3l: Av PR 93.99% (IoU = 0.5). YOLO-mp-4l: Av PR 94.07% (IoU = 0.5) | 5-fold | 2703 images from 133 thick smears treated with field stain. YOLO-mp-3l: Av PR 79.32%. YOLO-mp-4l: Av PR 80.92% | Koirala et al., 2022 [144]
Images of Giemsa-stained thin blood smears of 202 patients (half healthy, half malaria-infected) | Binary classification | Pre-trained networks | Cell level, ResNet-50: ACC 99.40 ± 0.3%; PR 99.10 ± 0.4%; SE 98.50 ± 0.5%; SP 99.70 ± 0.1%; F1 98.80 ± 0.5%. Patient level, VGG-19: SE 98.00%; SP 56.00% | 5-fold | NIH. Cell level: ACC 98.50 ± 0.1%; PR 98.60 ± 1.2%; SE 98.40 ± 0.6%; SP 98.6 ± 1%; F1 98.60 ± 1.2%. Patient level: SE 100.00%; SP 0.02% | Acherar et al., 2022 [145]
1364 images of cells captured from Giemsa-stained blood smears and labeled in six classes | Multiclass classification | NCGCN | ACC 94.17%; PR 94.84%; REC 94.17%; F1 94.20%; AUC 99.00% | Hold-Out | Binary classification on 2149 infected and 5302 healthy cells: ACC 95.46%; REC 95.21%; PR 95.46%; F1 95.12% | Meng et al., 2022 [146]
MP-IDB dataset + 236 Giemsa-stained thin blood films and 3038 cells (all parasite types in various stages of infection) | Object detection | YOLOv4 | Av PR 93.87%; PR 83.00%; REC 95.00%; F1 89.00% | Hold-Out | - | Sukumarran et al., 2023 [147]
See [141] | Multiclass classification | Malaria Stage Classifier | ACC 98.70% | Hold-Out | AFM images: ACC 99.30%. FM images: ACC 98.00% | Preißinger et al., 2023 [148]
1822 Giemsa-stained thin-blood-smear images (5654 patches) from 140 patients | Binary classification | AIDMAN | Patch level: ACC, PR, SE, SP, F1 98.62%; AUC 99.92%. Whole-image level: ACC 97.00%; PR 97.13%; SE 97.00%; SP 94.18%; F1 96.96%; AUC 98.84% | Hold-Out | - | Liu et al., 2023 [149]
12,708 Wright–Giemsa thin blood smear images of 380 patients; 12,546 malaria-infected RBCs in various stages | Multiclass classification | YOLOv7 | Av PR 90.20%; REC 96.00%; PR 94.90%; SE 96.80%; SP 99.30%; AUC > 99.90% | Hold-Out | - | Wang et al., 2023 [150]
624 images of P. falciparum, 548 images of P. vivax, 588 images of P. ovale, and 160 images of suspected healthy individuals | Multiclass classification | Custom CNN | ACC 99.59%; SE 99.58%; SP 99.59%; PR 95.62%; REC 99.58%; F1 99.58%; MCC 98.68% | 5-fold | - | Fasihfar et al., 2023 [151]
Blood smear images | Binary classification | ResNetSt50 + BYOL | ACC 97.80%; SE 96.50%; SP 98.90% | Hold-Out | - | Fu et al., 2023 [152]
MP-IDB + IML-Malaria dataset [143] | Multiclass classification | YOLO-PAM | MP-IDB: Av PR 83.60%. IML: Av PR 59.90% | Hold-Out | - | Zedda et al., 2023 [153]
352 images of Leishman–Giemsa-stained peripheral blood smears containing normal and infected RBCs | Multiclass classification | Custom CNN | SE 85.00%; SP 94.00% | Hold-Out | - | Saxena et al., 2023 [154]
24,720 images from 475 thin blood smears stained with May Grünwald–Giemsa, corresponding to 2,002,597 labels | Object detection + binary classification | RT-DETR | - | - | 4508 images taken from 170 thin blood smears of 54 healthy and 116 infected patients. Patient level: 6-class ACC 68.20%; 5-class ACC 79.40%; binary ACC 91.80%. Label level: PR 68.60%; REC 66.90%; Av PR 63.80%; F1 67.70%; MCC 63.30% | Guemas et al., 2024 [155]
M5 dataset [165]: 7543 large-scale multi-microscope images, for a total of 20,331 labeled nuclei | Object detection | FARS | Highest Av PR 67.13% (HCM at 1000× magnification) | Hold-Out | - | Ilyas et al., 2024 [156]
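Many of the detection entries above fine-tune one-stage or two-stage detectors on parasite annotations. As a generic, hedged illustration of the inference step only, the following uses torchvision's off-the-shelf Faster R-CNN with COCO weights on a placeholder image; the reviewed systems retrain the detection head on parasite classes before use.

```python
# Illustrative detection inference with a generic detector (not a cited model).
import torch
import torchvision

model = torchvision.models.detection.fasterrcnn_resnet50_fpn(weights="DEFAULT")
model.eval()

image = torch.rand(3, 800, 800)  # placeholder blood-smear field of view, values in [0, 1]
with torch.no_grad():
    out = model([image])[0]      # dict with "boxes", "labels", "scores"

keep = out["scores"] > 0.5       # simple confidence threshold
print(out["boxes"][keep], out["scores"][keep])
```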
Table 13. Overview of cited works using ML approaches to malaria parasite diagnosis (results have been rounded). The symbol “-” indicates that no information is provided on a particular operation. The abbreviation “Av” indicates the average results over several repetitions of the experiment.

Dataset | Task | Model | Results | Cross-Validation | External Validation | Author and Year
15,660 RBC images from 87 smears stained with May Grünwald–Giemsa, labeled by experts | Multiclass classification (6 classes) | SVM and LDA | Image level: ACC 97.7%. Patient level: SE 100.00%; SP 90.00% | 5-fold | - | Molina et al., 2020 [166]
DPDx dataset [157] | Cell counting + multiclass classification (4 classes) | SVM | Cell counting: PR 96.18%; REC 98.03%; F1 97.04%. Object detection: SE 83.79%; SP 95.67%. Classification: SE 80.54%; SP 91.99% | Hold-Out | - | Abbas et al., 2019 [167]
MaMic dataset and Bio-Sig data [174] | Binary classification | DT | ACC 92.60%; PR 96.64%; SE 93.12%; SP 91.21% | Hold-Out | - | Sadiq et al., 2017 [168]
23,248 RBC images from Giemsa-stained thin blood smears: 22,290 healthy RBCs, 437 ring shapes, 327 trophozoites, and 194 schizonts | Binary classification + multiclass classification of infected RBCs (ring, trophozoite, and schizont) | WELM | Cell counting: ACC 97.94%. Classification: ACC 95.99% | 10-fold | SE 94.64% and SP 93.50% on NIH | Kudisthalert et al., 2020 [169]
Human-NIAID dataset | Binary classification | SVM and ANN | ANN: ACC 88.00%; REC 91.00%; PR 99.00%; F1 90.00%. SVM: ACC 83.00%; REC 91.00%; PR 98.00%; F1 97.00% | Hold-Out | - | Poostchi et al., 2018 [170]
Leishman-stained thin blood smear images of 100 healthy subjects and 50 with malaria. 750 images obtained for P. falciparum- and P. vivax-infected blood smears labeled by a pathologist, with a total of 888 RBCs | Multiclass classification (6 classes: 5 infected and 1 healthy) | MLP | SP 98.64%; SE 100.00%; PR 96.84% | 3-fold | - | Das et al., 2015 [171]
Private dataset: 50 Giemsa-stained blood samples. Public dataset: MP-IDB | Binary classification | SVM | Private dataset: ACC 86.11%; F1 90.90%. Public dataset: ACC 95.38%; F1 97.11% | Hold-Out | - | RK et al., 2021 [173]
500 images from 4 thick blood smears and 4035 images from 34 thin blood smears, both under different illumination conditions | Binary + multiclass classification | RF | Parasite detection: ACC 86.89% on thick smears. Species recognition: ACC 98.82%. Parasite staging: ACC 90.78% | Hold-Out | - | Aris et al., 2023 [172]
Table 14. Overview of cited works developing mobile and web applications for malaria parasite diagnosis (results have been rounded). The symbol “-” indicates that no information is provided on a particular operation. The abbreviation “Av” indicates the average results over several repetitions of the experiment.

Dataset | Task | Model | Results | Cross-Validation | External Validation | Author and Year
500 images of Leishman-stained blood smears from different FoVs. The ratio between P. falciparum and P. vivax is 2:3 | Multiclass classification (6 classes: 5 infected and 1 healthy) | C4.5 DT | ACC 99.24%; SE 99.20%; SP 99.60%; F1 99.00%; AUC 100.00% | Hold-Out | 10 images acquired from two different microscopes with different settings. The developed model correctly classifies each image | Maity et al., 2017 [175]
Dataset A: coordinates of 49,900 P. falciparum parasites in 2703 images. Dataset B: coordinates of 7628 P. falciparum parasites in 1182 images | Object (parasite) detection at cell and blood smear image levels | Modified pre-trained YOLOv3 | Parasite level: Av PR 88.70% (A) and 90.20% (B). Whole-image level: ACC 99.07% (A) and 97.46% (B) | Hold-Out | Parasite level: Av PR 80.10% for the network trained on dataset A and tested on dataset B; Av PR 83.20% for the network trained on dataset B and tested on dataset A | Chibuta et al., 2020 [176]
500 images of Giemsa-stained samples, annotated by expert parasitologists | Binary classification | DT | ACC 91.62 ± 2.26%; SP 96.11 ± 2.72%; PR 66.34 ± 10.68%; SE 54.91 ± 24.46% | 10-fold | - | Oliveira et al., 2017 [177]
MP-IDB | Multiclass classification | U-Net + SDU-Net + MobileNetV1 | Parasite type: ACC 98.47%; SP 96.80%; SE 97.99%. Parasite stage: ACC 98.83%; SP 97.91%; SE 98.54% | Hold-Out | - | Chen et al., 2023 [178]
2571 images of Giemsa-stained thick blood smears infected with parasites | Object detection and multiclass classification | iMAGING | YOLOv5x: PR 92.10%; REC 93.50%; F1 92.79%; Av PR 94.40%; mAP 50.00% | Hold-Out | 116 images of negative samples: ACC 96.98%. 50 images of Plasmodium samples: ACC 94.00% | Maturana et al., 2023 [179]
Table 15. Overview of cited works proposing general approaches tested on malaria diagnosis (results have been rounded). The abbreviation “Av” indicates the average results over several repetitions of the experiment.

Aim of the Study | Malaria Dataset | Method Used | Results | Author and Year
Optimize the image acquisition and classification methods | Hema-stained single-layer smears. 563 uninfected and 695 infected samples | Discretized physical CNN (DP-CNN) network | Av ACC 94.90 ± 2.0% | Horstmeyer et al., 2017 [180]
Co-optimizing image illumination and automated classification | 15 Hema-stained images from 3 thin smears, and 2126 samples from thick smears | Custom CNN with physical layers | Thin smears: Av ACC 88.20 ± 1.9%; PR 93.90%; REC 81.80%. Thick smears: Av ACC 99.00 ± 0.1%; PR 99.90%; REC 98.10% | Muthumbi et al., 2019 [181]
Identification and classification of a large number of cells | BBBC041 | Keras R-CNN | Av PR 78.00% | Hung et al., 2020 [182]
Study of batch and online variational learning for multiple tasks | NIH | OVMBMM | ACC 92.50%; PR 90.47%; REC 95.11%; F1 92.68% | Manouchehri et al., 2021 [183]
Study of a multi-lens microscopic imaging system | SimuData [118]: 1800 large-FOV images from 150 individual patients | 10-layer VGG | 2 lenses: AUC 98.50%. 7 lenses: AUC 95.30% | Yao et al., 2022 [184]
Simplify the pointwise convolutions in deep CNNs | NIH | Modified EfficientNet-B0 | With 16 channels per group: ACC 97.61% | Schwarz Schuler et al., 2022 [185]
Make training of a DL model quick and easy | BBBC041 | Pre-trained ResNet-18 | AUC > 70.00% | Gupta et al., 2022 [186]
Provide an efficient approach to medical imaging applications | NIH | IMNet (incremental modular network) | ACC 97.00 ± 0.36%; SP 97.90 ± 0.39%; SE 96.10 ± 0.63%; AUC 99.50 ± 0.1% | Ali et al., 2022 [187]
Improve model accuracy using EL | NIH | ICNN-Ensemble | ACC 99.67%; PR 96.60%; REC 90.00%; F1 96.60% | Musaev et al., 2023 [188]
Demonstrate the efficiency of WP in medical diagnostics | NIH | Lightweight MobileNet | ACC 97.18%; PR 97.43 ± 1.20%; REC 97.44 ± 1.28%; F1 97.4 ± 1.30% | El-Bana et al., 2023 [189]
Table 16. Overview of cited works tested on parasite detection (results have been rounded). The abbreviation “Av” indicates the average results over several repetitions of the experiment.

Aim of the Study | Generalization Task | Malaria Dataset | Method Used | Results | Author and Year
Distinguish five different types of cancer through RNA sequences | Malaria and breast cancer diagnosis | 26,839 samples and 1058 features, collected by the Fraunhofer AICOS institution through the MalariaScope project [192] | Pre-trained AE bound to the final layers of a custom CNN | ACC 92.36 ± 0.46%; MCC 84.00 ± 1%; PR 88.91 ± 2.14%; REC 91.11 ± 2.49%; F1 89.95 ± 0.61% | Ferreira et al., 2020 [190]
Automatic early detection of ALL | Malaria parasite identification | NIH | EfficientNet-B3 | ACC 97.68%; Av PR 97.69%; REC 97.68%; SP 97.67% | Abd El-Ghany et al., 2023 [191]
Table 17. Overview of cited works focusing on multiple diagnoses (results have been rounded). The abbreviation “Av” indicates the average results over several repetitions of the experiment.

Aim of the Study | Task | Malaria Dataset | Method Used | Results | Author and Year
Development of point-of-care diagnostics using microscopes and smartphones | Automatic diagnosis of malaria, tuberculosis, and intestinal parasites | Thick blood smears stained with Giemsa. 7245 Plasmodium objects in 1182 annotated images | Custom CNN | AUC 100.00%; Av PR 97.00% | Quinn et al., 2016 [193]
Make microscopes able to indicate the presence of an epidemic pathogen in a sample | Identification of the malaria and Vibrio cholerae pathogens | Images downloaded from the web | Custom CNN | ACC 94.64% | Traore et al., 2018 [194]
Show that adversarial learning can be used efficiently in the medical field | Human embryo evaluation, human sperm morphology quantification, and malaria diagnosis | Thin blood smears of 8 patients stained with Giemsa, captured using 3 different microscopes [195] | Unsupervised adversarial neural network using ResNet-50 trained on the NIH dataset as backbone (MD-net-ResNet-50) | AUC 95.2% (95% CI = 93.5–96.6); AUC 95.4% (95% CI = 92.6–97.4); AUC 92.00% (95% CI = 88.5–94.7) | Kanakasabapathy et al., 2021 [196]
Table 18. Evaluation metrics used by articles based on the NIH dataset.

Evaluation Metrics | References
PR | [49,50,51,52,59,60,61,62,63,64,67,68,69,70,73,74,80,86,87,88,89,90,91,92,93,96,97,98,99,100,101,102,103,107,108,110,111,112,113,114,115,117,118]
REC | [43,50,51,52,55,57,59,60,61,62,63,64,67,68,69,70,73,74,75,80,82,83,85,86,87,88,89,90,91,92,93,95,96,97,98,99,100,101,102,103,104,106,107,108,109,111,112,113,114,115,117,118]
SP | [43,55,57,59,61,67,70,73,74,75,80,82,85,86,88,91,92,95,98,101,102,103,104,106,109,112,113,117,118]
F1 | [43,50,52,55,57,59,60,61,63,64,67,68,69,74,80,83,85,86,87,88,89,90,91,92,93,95,96,97,98,99,100,101,102,103,104,107,108,109,110,111,112,114,115,117]
MCC | [43,55,61,67,68,74,85,86,95,98,103,110]
Table 19. XAI methods implemented in malaria CAD systems.

XAI Method | Dataset Used | References
CAM | NIH | [70,73,107]
CAM | Other | [146]
Grad-CAM | NIH | [67,85,86,87,89,97,106]
Grad-CAM | Other | -
SHAP | NIH | [96]
SHAP | Other | -
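Among the listed methods, Grad-CAM is the most frequently adopted. Its standard formulation (pooled gradients weighting the last convolutional feature maps) can be sketched as follows; the layer name and the single-logit output are assumptions about the model being explained, not details taken from the cited implementations.

```python
# Minimal Grad-CAM sketch (standard formulation, not a specific cited implementation).
import numpy as np
import tensorflow as tf

def grad_cam(model, image, last_conv_layer_name):
    grad_model = tf.keras.Model(
        model.inputs,
        [model.get_layer(last_conv_layer_name).output, model.output],
    )
    with tf.GradientTape() as tape:
        conv_out, preds = grad_model(image[np.newaxis, ...])
        score = preds[:, 0]                       # assumed single "parasitized" logit
    grads = tape.gradient(score, conv_out)        # d(score)/d(feature maps)
    weights = tf.reduce_mean(grads, axis=(1, 2))  # global-average-pooled gradients
    cam = tf.reduce_sum(conv_out * weights[:, tf.newaxis, tf.newaxis, :], axis=-1)
    cam = tf.nn.relu(cam)[0]
    return (cam / (tf.reduce_max(cam) + 1e-8)).numpy()  # normalized heatmap

# heatmap = grad_cam(model, cell_image, "block5_conv3")  # VGG-16-style layer name (assumed)
```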
Table 20. Overview of articles on developing mobile and web applications for automatic malaria diagnosis.

Application Type | References
Mobile | [57,59,85,92,110,136]
Web | [43,66,68,86,92,143,179]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.
