1. Introduction
The demand for food items like rice is increasing day by day due to the growing population all over the world. Almost 23% of people get most of their calories from rice [
1]. Considering its impacts in global nutrition, it holds significant importance among the multitude of crops available all over the world. The effective management of rice diseases can increase the production of rice in agricultural innovations.
Multiple diseases affecting rice, such as viral, bacterial and fungal effects degrade the quality and quantity of rice which creates major problems for the global economy. It is important to identify paddy disease very early for proper management and treatment in the agricultural field. Due to the viral, bacterial and fungal effects, a variety of rice diseases, namely, bacterial blight, rice brown spot, rice blast and leaf smut are seen in paddy fields [
2]. Each disease has a particular color, shape and pattern of affected portions on leaves. Rice bacterial blight shows water-soaked stripes on leaf edges, yellow or white stripes on leaf edges [
3]. Brown spots can be identified from a distance by their brownish scorched appearance on the leaves.
Though manual monitoring can indicate disease, it is tedious and erroneous for large fields [
4]. In order to initiate disease treatment, individuals are required to bring the disease sample to specialists and await further instructions which is time-consuming and costly in agricultural field management [
5,
6].
In this case, researchers concentrate on detecting plant disease through image processing instead of manual observations [
7]. Various image-processing approaches have been employed for disease management in the agricultural field. It is quite challenging to detect rice disease at an early stage properly based on colors, spots or streaks on leaves or stems using image processing techniques [
8,
9,
10].
Therefore, a Convolutional Neural Network (CNN) is incorporated with pre-processing of images to get reliable outcomes in rice disease detections and classifications [
11]. CNNs are used for the classification of datasets with proper feature extractions. Segmentation, thresholding and clustering are used to analyze the color, shape and pattern of affected spots on leaves in rice disease classifications [
12].
So, the overall task involves a series of objectives related to rice leaf disease classification. Firstly, an image processing technique is applied to extract the diseased portions from rice leaf images. This step allows for targeted analysis and identification of the affected areas. Next, a Convolutional Neural Network (CNN) is constructed, which is trained and utilized for the purpose of classification. The CNN is fine-tuned by adjusting various parameters and modifying layers to achieve optimal performance in disease identification. Subsequently, the model is tested using rice leaf disease images and its accuracy is compared against known data. Lastly, the model is integrated into an Android-based smartphone application, enabling real-time detection of rice leaf diseases. This comprehensive approach aims to enhance the identification and management of rice leaf diseases through advanced image processing techniques and machine learning algorithms.
A literature review, a study of related papers, is discussed in
Section 2.
Section 3 contains the methodology we have used for our work. The result and discussion are given in
Section 4.
Section 5 explains how the trained model has been plugged into a mobile application. The conclusion and probable future work are highlighted in
Section 7.
2. Literature Review
Stephen et al. [
13] used CNN architectures for the identification of healthy and diseased leaves. The researchers applied self-attention with ResNet34 and ResNet18 for the improvement of feature selection and avoided gradient problems by using ResNet50 and ResNet34. They suggested that self-attention with ResNet34 achieved 98.54% average accuracy in multiclass classification.
The authors of [
14] reviewed ML and AI methods for the detailed identification of rice diseases. As rice is an economically important food crop all across the globe, they focused on deep learning, machine learning and AI tools for recognizing rice diseases.
The authors of [
15] detected paddy leaf disease by using faster region-based CNN (Faster R-CNN) in real-time. It was proposed that if the regional proposal network (RPN) was faster the efficiency of the R-CNN was increased. The candidate regions were generated because RPN can locate the location of the targeted object in a precise way. They used already available datasets and also produced their own. They combined 650 healthy leaf images, 600 rice blast images, 650 brown spots and 500 hispa images, and produced a total dataset of 2400 images. By concentrating on 2230, 11 and 2022 plants, and 4 of 17 on rice blast, brown spot, and hispa classes, the authors recorded 98.09%, 98.85% and 99.17% accuracy, respectively. They identified healthy rice leaves with 99.25% average accuracy.
To classify the rice grains, the authors of [
16] proposed the use of a CNN for image-based datasets and a Deep Neural Network (DNN) or ANN for feature-based datasets. No specific rice diseases were targeted in this study because the researchers focused on different types of rice grains for the detection of healthy leaves. They collected 15,000 photographs of five types of rice grains: Arborio, Basmati, Ipsala, Jasmine and Karacadag and finally collected 75,000 images in total. After extracting 106 features from these images, they produced a feature-based dataset including morphological, four-shape and color features. According to the findings, the average grain classification for ANN was reported as 99.87%, while DNN achieved a classification rate of 99.95% and CNN demonstrated a perfect classification rate of 100%.
The authors of [
17] introduced a new CNNIR-OWELM-based algorithm for the categorization of paddy diseases by combining optimal weighted extreme learning machines (WELM) and Residual Network (ResNet) v2 based on CNN. This system combined IoT for segmenting the infected areas through histogram segmentation and capturing images, followed by the use of deep learning inception (ResNet v2) for the extraction of features.
The authors of [
18] used a freely accessible dataset and produced 3500 images of diseased and healthy paddy leaves. They introduced the researchers to highly efficient crop care systems by using a Convolutional Neural Network compared to other models. This study suggested a very fast solution to classifying healthy and diseased crops by locating the affected areas of plants. They obtained 70% accuracy by creating a classification module with CNN and building the model for 1–10 epochs. For eight epochs, 72.17% of the validation accuracy was achieved.
The researchers of [
19] generated ensemble models for the classification of various kinds of diseases such as brown spot, bacterial stripe disease, sheath blight, false smut and leaf blast. The findings of the study achieved 91% overall accuracy.
Feng et al. [
20] detected paddy leaf diseases by employing hyperspectral imaging (HIS) and generated CNN architectures by using deep transfer learning techniques. The results of the study indicated that fine-tuning was an efficient solution that provided 88% accuracy.
Upadhyay et al. [
21] identified and classified rice plant diseases by analyzing the size, color and shape of lesions present on leaf images using the CNN method, and achieved 99.7% accuracy on the dataset.
Chen et al. [
22] presented the BLSNet strategy, which could identify and detect the damage to leaves caused by Bacterial Leaf Streak (BLS) disease. This disease affects the quantity and quality of rice growth. In a comparative analysis with other benchmark models, BLSNet exhibited superior performance in accurately detecting and identifying damage and its level of severity.
The authors of [
23] used color features for the classification of paddy diseases. The researchers extracted a total of 172 features from each channel after analyzing 14 color spaces. A dataset of 619 images belonging to four classes (sheath blight, healthy leaves, bacterial leaf blight and rice blast) was used. For testing their methodology, they used different classifiers, including discriminant classifiers (DCs), SVM, K-NN, NB, DT, Random Forest (RF) and LR; 94.65% of the SVM’s highest accuracy was reported.
The researchers of [
24] built a system by integrating machine learning (ML) and image processing technologies. For the identification of plant diseases, they developed an application that also predicts the amount of fertilizer to be used for diseased crops. They produced a dataset of 1000 images (for three diseases) and separated them into different folders according to the type of disease. Image processing techniques were used for obtaining their relevant features and creating two different classification models: Back Propagation Neural Network (BPNN) and Support Vector Machine (SVM).
Chen et al. [
25] achieved impressive bed disease classification and image processing through deep learning techniques. They achieved 94.07% overall accuracy on the public dataset and 98.63% on rice disease image classification by combining the Inception and DenseNet modules.
To categorize the rice crop diseases: false smuts, bacterial leaf blight and brown spots [
26], it was proposed by the authors to use a Support Vector Machine (SVM) classifier. They also proposed that Bag of Word (BoW) and Scale-Invariant Feature Transform (SIFT) should be used for extracting the features. Moreover, after the SVM classifier, the use of Brute-Force (BF) matchers and K-means clustering was proposed. A dataset comprising 400 images was used, which belonged to different sources: Rice Research Institute (RRI), Rice Knowledge Bank (RKB) and the American Psychopathological Society (APS). The researchers indicated 90.9% precision, 91.6% recall and 94.16% average accuracy, but the dataset was very small because the SVM classifier is susceptible to overfitting regarding multiclass classification.
The authors of [
27] reviewed image processing techniques and provided an outline to classify and detect plant diseases. Image processing techniques can help in the detection of leaf diseases at an early phase and they can be controlled to prevent them from spreading further. The researchers indicated that color co-occurrence, Neural Network and K-means clustering may be employed to identify and categorize plant diseases.
T. Islam et al. [
28] recommended an approach with color feature to identify rice diseases, namely, rice blight and rice black spots. They extracted RGB values of impacted portions of leaves using a naive Bayesian classifier. They achieved over 89% categorization reliability for rice disease detection and classifications considering the percentage of the RGB affected area as a feature in their work.
Phadikar and Goswami [
29] employed image processing to identify rice blast and rice brown spot in the agricultural field. They considered noise removal and segmentation criteria to get better performance in disease detection techniques. They obtained only 84% accuracy with five features of homogeneity, contrast, correlation, energy and entropy for disease classification through image processing.
Joshi and Jadhav [
30] proposed a model to detect rice disease, namely, rice blast, rice bacterial blight, rice brown spot and rice sheath rot. They used the Minimum Distance Classifier (MDC) and K-Nearest Neighbor classifier (K-NN) to extract features of like shape and color in their model. They obtained 89.23% classification accuracy with the steps of pre-processing and segmentation in their model.
Qiu et al. [
31] employed a deep convolutional network to construct a paddy disease identification model. They utilized the Keras deep learning architecture for training and explored various convolution kernel sizes and pooling functions to investigate the classification and recognition of three different rice diseases. Impressively, their model achieved an accuracy exceeding 90%.
Krishnamoorthy et al. [
32] introduced an innovative transfer learning approach using the InceptionResNetV2 model. They effectively combined feature weights and fine-tuned hyperparameters to accurately identify three distinct rice diseases, achieving recognition accuracy of 95.67%.
In reference [
33] the authors aimed to enhance the accuracy of rice disease diagnosis using VGG-16 and GoogLeNet models. These models were trained on a dataset comprising three distinct species of painless diseases. The experimental results demonstrated that the average classification accuracies achieved by GoogLeNet and VGG-16 were 91.28% and 92.24%, respectively.
Early and accurate recognition of plant diseases is crucial for safeguarding grain production. Vimal K. Shrivastava et al. [
34] addressed the limitations of traditional plant disease detection systems. Their study involved four classes, including three disease categories and one class for healthy leaves. To accomplish this, they leveraged a pre-trained deep CNN model, namely AlexNet, along with an SVM classifier and transfer learning techniques. Impressively, their approach yielded an accuracy of 91.37% in disease classification.
Dengshan Li et al. [
35] introduced an innovative mechanism for real-time rice leaf disease detection using deep learning techniques. Their approach involved employing faster-RCNN for image detection from video streams. Additionally, they explored several deep CNN models, including YOLOv3, ResNet-101, ResNet-50 and VGG16, to enhance the accuracy and performance of their detection system.
According to the literature review, it was revealed that researchers have used different classifiers for rice disease detection and classifications through image processing. They obtained 80~99% classification accuracy with different classification algorithms such as SVM, Naive Bayes Classifier, K-NN MDC, etc. They achieved faster and non-invasive criteria in rice disease detections considering the RGB percentage of the affected leaves. It is quite challenging to achieve more reliable outcomes based on the color features in image processing. Therefore, the proposed model incorporates K-means clustering with CNN techniques to obtain more reliable outcomes in agricultural innovations. There are hardly any models with a trade-off between the accuracy and size of the network. A smaller CNN network with lower run time and space can be designed to keep significant accuracy considering the use of the network in mobile applications.
4. Results and Discussion
According to
Figure 8, the activated areas which are actually the affected portions of the leaves are visualized when the background is removed using thresholding criteria in image processing. The pattern, edge and color are extracted as features that are responsible for predicting the class.
The dataset was obtained on different rice disease samples including bacterial blight, brown spot and leaf smut. The model was trained using a dataset of 2700 images and validated with 1200 images. The model was tested through 630 images of disease-affected leaves. The training initiates with a learning rate of 0.001 and continues until the validation criteria are fulfilled as shown in
Figure 9.
According to
Figure 9, training accuracy reaches 1 at or near the 80th iteration while validation accuracy converges after the 130th iteration, alternatively said, after the 5th epoch. The validation accuracy of the progress changes with iteration and reaches around 97.9% after 130 iterations as shown in
Figure 10.
Beside our main dataset another dataset with three disease classes was trained and tested for our network. For most cases it provided promising accuracy. Though for dataset-2 with field background behind multiple leaves, it fell below 80% as shown in
Table 3. The separating capability of the model, for this case only, is not very reliable as the leaves represented in the image are overlapped and cannot be segmented properly.
After training the model, the confusion matrix was obtained to classify the diseases of bacterial leaf blight, brown spot and leaf smut as shown in
Figure 11. According to the confusion matrix in
Figure 11, the classification accuracy was seen as 97.9% whereas the error rate was obtained as 2.1% for rice disease classifications.
The confusion matrix shows the test accuracy of bacterial leaf blight, brown spot and leaf smut as 97.2%, 96.8% and 100% accuracy as shown in
Figure 11 for predicting the disease classes.
The ablation technique was used to determine the effect of two image processing steps by comparing their accuracy for each case. Incorporating these two steps raised accuracy by about 8% as shown in
Table 4.
A higher area under curve (AUC) indicates better overall performance, indicating a higher probability of correctly classifying positive instances compared to negative instances. For our model, leaf smut covered most area under curve and brown spot the least as shown by the ROC curve in
Figure 12.
A high precision indicates that, when the model makes a positive prediction for a sample, it indicates a higher likelihood of being accurate. A high recall indicates that the model demonstrates a high capability to accurately identify a significant portion of positive samples. It assesses the model’s capacity to avoid false negatives. It is clear from
Table 5 precision and specificity for brown spot are highest and the recall for leaf smut is highest. For brown spot, the f1 score is also highest and the other two have nearly equal value.
6. Comparison to Existing Approaches
The researchers used image processing to detect rice diseases so that farmers can easily identify the disease for proper disease management in the agricultural field. The proposed model includes image thresholding and K-means clustering as image pre-processing criteria for getting better performance in disease detection techniques. The model included a CNN approach to obtain more accuracy in image classification with proper feature extractions. The reliability of the suggested model was compared with other existing models for paddy leaf disease detection or classification as shown in
Figure 15.
According to the accuracy diagram in
Figure 15, KNN (k = 1) shows an accuracy of 90% whereas KNN (k = 3) shows an accuracy of 72% in the performance of the disease detection model. The model with VGG-16 criteria obtained an accuracy of 92% for rice disease detections. Though the accuracy obtained by Inception-V3 criteria [
9] was higher than the CNN-based model, the transfer learning steps increased the computational complexity of the Inception-V3-based model. In the proposed model, we made a trade-off between network sizes and performance for rice disease detections and classifications. This is because the trained model was faster and more competent for considering small network sizes in mobile applications.
Table 6 shows the reliability of the suggested model compared with the TensorFlow models through Mobile Net, Inception-V3 and AlexNet criteria based on the parameters of processing time and average accuracy in mobile applications. The processing time and accuracy were obtained based on a performance evaluation for 10 affected images in disease detections and classifications. The Mobile Net criteria showed 83% accuracy with a 145 ms processing time while Inception-V3 showed 99% accuracy with 230 ms processing time and AlexNet showed 99.3% accuracy with 195 ms processing time in smartphone applications. Our model showed 97.9% average accuracy but was faster than most other existing models with 103 ms processing time in mobile applications.
Only the affected area of the leaf is processed in the proposed model instead of processing the whole leaf which takes significantly less time than entire leaf processing as shown in
Table 6. The proposed model for rice leaf disease classification performed well with test accuracy of 97.9% and processing time of 103 ms to use in smart agriculture. IoT-based automated farming also requires this type of model to classify diseases and take necessary steps to manage those diseases efficiently without the necessity of human vision. As rice is the most popular food and the global economy mostly depends on agriculture, the application level of this research is a role-playing concept in the growth and development of agricultural production.
The advantages of the mobile application can also be used to classify diseases at the grass root level. The application can be compatibly used with a properly functioning camera to classify these three popular rice leaf diseases. Thus, paddy leaf disease classification using CNN and image processing technologies with compatibility to use in mobile applications can bring confidence of accurately classifying the diseases in agriculture.