Next Article in Journal
Fuzzy Partial Metric Spaces and Fixed Point Theorems
Next Article in Special Issue
Thyroid Nodule Segmentation in Ultrasound Image Based on Information Fusion of Suggestion and Enhancement Networks
Previous Article in Journal
Fixed-Time Convergent Gradient Neural Network for Solving Online Sylvester Equation
Previous Article in Special Issue
AI-Empowered Attack Detection and Prevention Scheme for Smart Grid System
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Deep Learning-Based Plant-Image Classification Using a Small Training Dataset

Division of Electronics and Electrical Engineering, Dongguk University, 30 Pildong-ro, 1-gil, Jung-gu, Seoul 04620, Korea
*
Author to whom correspondence should be addressed.
Mathematics 2022, 10(17), 3091; https://doi.org/10.3390/math10173091
Submission received: 6 July 2022 / Revised: 10 August 2022 / Accepted: 23 August 2022 / Published: 28 August 2022
(This article belongs to the Special Issue Computational Intelligent and Image Processing)

Abstract

:
Extensive research has been conducted on image augmentation, segmentation, detection, and classification based on plant images. Specifically, previous studies on plant image classification have used various plant datasets (fruits, vegetables, flowers, trees, etc., and their leaves). However, existing plant-based image datasets are generally small. Furthermore, there are limitations in the construction of large-scale datasets. Consequently, previous research on plant classification using small training datasets encountered difficulties in achieving high accuracy. However, research on plant image classification based on small training datasets is insufficient. Accordingly, this study performed classification by reducing the number of training images of plant-image datasets by 70%, 50%, 30%, and 10%, respectively. Then, the number of images was increased back through augmentation methods for training. This ultimately improved the plant-image classification performance. Based on the respective preliminary experimental results, this study proposed a plant-image classification convolutional neural network (PI-CNN) based on plant image augmentation using a plant-image generative adversarial network (PI-GAN). Our proposed method showed the higher classification accuracies compared to the state-of-the-art methods when the experiments were conducted using four open datasets of PlantVillage, PlantDoc, Fruits-360, and Plants.

1. Introduction

Numerous plant image-based classification methods exist [1,2,3,4,5,6,7,8,9,10,11,12,13,14]. However, existing plant-image datasets have few training image data [14]. This hinders the achievement of high classification accuracy. In addition, it is challenging to construct large-scale training datasets. Therefore, this study examined a method for obtaining a higher accuracy using fewer number of training image data. In this study, four open datasets including PlantVillage dataset [15], PlantDoc dataset [16], Fruits-360 dataset [17], and Plants dataset [18] were used in various experiments. The total number of image data of training sets in each plant dataset was reduced by 70%, 50%, 30%, and 10% for comparative experiments. The experiments demonstrated how the classification accuracy is reduced to a certain extent depending on the amount of data. In addition, the datasets in which the amount of data were reduced by 70%, 50%, 30%, and 10% were applied with conventional augmentation methods [19,20], tutorial on image data augmentation in Keras [21], and available libraries [22] to restore the amount of data to 100% for additional experiments. These experiments demonstrated how the classification accuracy improves accordingly. Furthermore, the accuracy could be improved further by augmenting the datasets through a plant-image augmentation method based on a generative adversarial network (GAN). Classification was performed using a variety of reduced datasets and the datasets that were augmented using various methods. Two types of conventional plant-image classification methods and the convolutional neural network (CNN)-based plant classification method proposed in this study were used. The experiments conducted in this study showed that the proposed CNN-based method achieved an accuracy higher than that achieved by the conventional methods.

2. Related Works

Conventional studies on plant image classification can be divided into large dataset-based methods and small dataset-based methods. The large dataset-based methods include the following. For an attention-based fruit classification method [1], a MobileNetV2-based lightweight deep learning model was proposed and a MobileNetV2 pre-trained model with ImageNet was used as a backbone. A trilinear convolutional neural network model (T-CNN) was proposed for a crop and crop-disease classification method [14]. Therein, the PlantVillage dataset [15], the PlantDoc dataset [16], and a pre-trained model with ImageNet were used for conducting diverse experiments. The ExtResnet model was proposed for a grape-variety recognition method [7]. Therein, a wine grape instance segmentation dataset (WGISD) [23] was used to conduct experiments. The Fruits-360 image dataset was used to conduct experiments with a fruit recognition method [10]. Furthermore, various feature extraction methods (Hu moments, Haralick texture, and color histogram) as well as machine learning methods (decision tree, k-nearest neighbors, linear discriminant analysis, logistic regression, naïve bayes, random forest, and support vector machine) were used to conduct experiments and compare the results. A multi-class CNN model was proposed for a fruit classification method [6]. The FIDS30 dataset [24] and Fruits-360 dataset were used to conduct experiments in that study. Two deep learning models (a light model and a pretrained model) were proposed for a fruit classification method [8]. A supermarket product dataset and an in-house dataset were used to conduct experiments in that study. EfficientNet-B0 and Fruit-360 were used to conduct experiments with a fruit recognition method [5]. Histogram of Oriented Gradient (HOG) was used to conduct a classification experiment with a fruit classification method [13]. A deep convolutional neural network (DCNN) was used to conduct a classification experiment with an autonomous fruit recognition method [4]. Bag of features (BoF), conventional CNN, and AlexNet were used to conduct a classification experiment with a fruit recognition method [9]. Furthermore, the accuracy of these methods was compared using the Fruit-360 dataset [17]. Inception v3 [25] and VGG16 [26] were used to conduct a classification experiment with a fruit image classification method [3]. Furthermore, the accuracy of these methods was compared using the Fruit-360 dataset. FruitNet was proposed for a fruit-image classification method [2]. Fourteen deep learning methods were compared in that study. However, the accuracy was compared using the Fruit-360 dataset. ShuffleNet V2 and Fruit-360 dataset were used for a fruit-image classification method [12]. CNN and Fruit-360 dataset were used with a fruit-variety classification method [11]. Furthermore, ROIs were generated from the original apple image using YOLO V3 [27].
In general, plant-based image datasets are small in size. Furthermore, it is difficult to construct large datasets. However, none of the above-mentioned methods considered small training datasets. Hence, this study proposed a new image augmentation and classification method for plant image classification based on small training datasets. Table 1 presents the advantages and disadvantages of the proposed method and conventional plant-image classification methods.
This study is novel compared with previous studies in terms of the following three aspects:
-
Thereby, this study proposed a plant-image classification convolutional neural network (PI-CNN). It outperforms conventional plant-classification methods. The proposed PI-CNN was configured as a residual block-based shallow model to reduce the number of training parameters. It demonstrated high accuracy on datasets of various sizes.
-
This study proposed a new plant-image augmentation method, namely, a plant-image generative adversarial network (PI-GAN). It uses two types of input images from which the features are aggregated to generate new training images.
-
The models designed in this study are disclosed [28] for fair performance evaluation by other researchers.
The remaining parts of this paper are organized as follows. The proposed method is explained in detail in Section 3. The experimental results and analyses are presented in Section 4. Finally, the discussion and conclusions are presented in Section 5 and Section 6, respectively.

3. Materials and Methods

3.1. Overall Procedure of Proposed Method

The proposed methods are explained in detail in this section. Figure 1 shows the flowcharts of the methods. As shown in Figure 1a, image augmentation by PI-GAN involves the use of two types of input images (different plant images) to augment plant images. Moreover, as shown in Figure 1b, the concepts of the visual geometry group network (VGG-Net) [26] and residual network (ResNet) [29] were combined for plant image classification by the PI-CNN proposed in this study. The size of input images and number of output classes vary across the four datasets used in this study. For example, the size of an input image in Figure 1a is 100 × 100 × 3 pixels, whereas the output of PI-GAN is 100 × 100 × 3 pixels. Furthermore, the size of an input image in Figure 1b is 256 × 256 × 3 pixels, whereas the output class of PI-CNN is 14. As shown in Figure 1a, the augmented plant image is used for training PI-CNN in Figure 1b.

3.2. Detailed Structure of Proposed PI-GAN and PI-CNN

The detailed structure of the PI-GAN proposed in this study is presented in Table 2, Table 3, Table 4, Table 5, Table 6 and Table 7 and Figure 2. The generator and discriminator networks of the PI-GAN method are shown in Table 2 and Table 6. Furthermore, Table 8 and Figure 3 explain the proposed PI-CNN structure in detail. The structure shown in Table 2, Table 3, Table 4, Table 5, Table 6, Table 7 and Table 8 includes the input layer (input_layer), convolutional layer (conv2d), max pooling layer (max_pool), encoder block (encoder), decoder block (decoder), concatenate layer (concat), residual block (res_block), rectified linear unit (lrelu), parametric rectified linear unit (prelu), up sampling layer (Up2), additional operation layer (add), discriminator block (disc_block), and fully connected layer (FC). Furthermore, tanh and sigmoid represent activation functions. The stride and padding in Table 2, Table 3, Table 4 and Table 5 are (1 × 1) and (1 × 1), respectively. Meanwhile, the padding in Table 6 and Table 7 is (1 × 1). The input of a generator network is a 100 × 100 × 3 plant image, as shown in Figure 2, whereas the output is a 100 × 100 × 3 augmented plant image. The input of a discriminator network is a 100 × 100 × 3 plant image as in Figure 2, whereas the output is 1 × 1. In addition, a 100 × 100 × 3 image is used as an input of the proposed PI-CNN, whereas the output comprises 14 × 1 probabilities. The number of classes of output in Table 6 and Table 8 is 2 and 14, respectively. However, as explained in Section 3.3, the size of an input image in Table 2, Table 6, and Table 8 varies across the four types of datasets used in this study. Furthermore, the number of classes of output in Table 6 and Table 8 vary. The “Times” columns in Table 2 and Table 6 indicate the number of times each layer is used.

3.3. Dataset and Experimental Setup

In this study, the experiments were conducted using the PlantVillage dataset [15], PlantDoc dataset [16], Fruits-360 dataset [17], and Plants dataset [18]. The datasets comprise images of plants, fruits, and plant diseases acquired in different environments. The Fruits-360 dataset is composed of manually cropped images. The size and depth of the images are 100 × 100 × 3 pixels and 24 bits, respectively. The total number of images in the train and test sets are 41,322 and 13,877, respectively. Here, the test set is divided again to test and validation sets, and the number images of the test and validation sets are 12,877 and 1000, respectively. The PlantVillage dataset is also composed of manually cropped images. The size and depth of the images are 256 × 256 × 3 pixels and 24 bits, respectively. The total number of images is 54,305. Here, the dataset is divided to train, test, and validation sets, and each has 40,000, 13,305, and 1000 images, respectively. The PlantVillage dataset includes grayscale images and segmented images. The PlantDoc dataset is composed of manually cropped and original images. The size of images varies significantly: the smallest and largest sizes are 150 × 150 × 3 and 5616 × 3744 × 3, respectively. The depth of images is 24 bits. The total number of images in the train and test sets are 2336 and 236, respectively. Here, the test set is divided again to test and validation sets, and the number images of the test and validation sets are 200 and 36, respectively. The Plants dataset is composed of manually cropped and original images. The depth and size of the images are similar to those for the PlantDoc dataset. This dataset consists of a training set, a test set, and validation sets, and each contains 13,149, 5218, and 1521 images, respectively. The above-mentioned information is summarized in Table 9.
Figure 4, Figure 5, Figure 6 and Figure 7 show example images of the Fruits-360 dataset, PlantDoc dataset, PlantVillage dataset, and Plants dataset.
The process of training and testing both image augmentation and classification algorithms was performed using a desktop computer equipped with an Intel Core i7-6700 [email protected] GHz, an Nvidia GeForce GTX TITAN X graphics processing unit (GPU) card [30], and a random-access memory (RAM) of 32 GB. The proposed model and algorithm were implemented using OpenCV library (version 4.3.0) (Intel Corporation, CA, USA) [31], Python (version 3.5.4) (Rossum, G.V., DE, USA) [32], and the Keras application programming interface (API) (version 2.1.6-tf) (Chollet, F., CA, USA) with a TensorFlow backend engine (version 1.9.0) (Google, CA, USA) [33].

3.4. Data Augmentation

In this subsection, the augmented images obtained through the proposed plant-image augmentation method are explained in detail. As shown in Figure 2, when two images are input into PI-GAN and the features of input_2 are mixed with input_1 by using channel-wise concatenation as Layer 6 in Table 2 and L6 in Figure 2, a new image is generated through a decoder. The classification results were compared by conducting experiments wherein the number of images in the training set was increased from 70% to 100%, from 50% to 100%, from 30% to 100%, and from 10% to 100%. The experiments were conducted to compare the conventional augmentation methods and the PI-GAN-based augmentation methods. For example, a reduction in the total number of images in the training sets of the four datasets explained in Table 10 and Section 3.3 (Fruits-360 dataset: 41,322, PlantVillage dataset: 40,000, PlantDoc dataset: 2336, and Plants dataset: 13,149) by 70% results in 28,926, 28,000, 1635, and 9204 images, respectively. Furthermore, a reduction by 50% results in 20,661, 20,000, 1168, and 6574 images, respectively; a reduction by 30% results in 12,396, 12,000, 700, and 3944 images, respectively; and a reduction by 10% results in 4132, 4000, 233, and 1314 images, respectively.
The reduced training sets were augmented back to 100% (41,322, 40,000, 2336, and 13,149) through conventional augmentation methods (shifting in eight directions, in-plane rotation, flipping, blurring, scaling, brightness, and contrast). The training datasets were also augmented to 100% through the PI-GAN-based method. The PlantVillage and PlantDoc datasets were divided further into disease and crop sub-datasets in the experiment. Furthermore, training was performed by combining the training sets of the Plants and PlantDoc datasets (the numbers with ‘*’ in Table 11) in the training phase. Thus, the numbers of train subsets and test subsets are four (Fruits-360, PlantVillage crop, PlantVillage disease, and PlantDoc crop + PlantDoc disease + Plants) and six (Fruits-360, PlantVillage crop, PlantVillage disease, PlantDoc crop, PlantDoc disease, and Plants), respectively, as in Table 11.
Furthermore, training subsets of the original datasets, reduced datasets, datasets augmented by the conventional method, and datasets augmented by PI-GAN were constructed. The numbers of training subsets are 4 (100% of Fruits-360, PlantVillage crop, PlantVillage disease, and Plant), 16 (70%, 50%, 30%, and 10% of the four training subsets), 16 (70%, 50%, 30%, and 10% of the subsets were increased to 100%, 100%, 100%, and 100% of them by the conventional method), and 16 (70%, 50%, 30%, and 10% of the subsets were increased to 100%, 100%, 100%, and 100% of them by PI-GAN), respectively. Thus, the total number of training subsets is 52, whereas the total number of test subsets is 6. Detailed explanations are presented in Table 10 and Table 11.
In this study, the results from the original four training subsets (six test subsets) and the reduced 16 training subsets (6 test subsets) are compared in Section 4.2.1. In addition, the results obtained by augmenting the reduced datasets through conventional augmentation methods (16 training subsets and 6 test subsets) and through PI-GAN-based augmentation methods (16 training subsets and 6 test subsets) are compared in Section 4.2.2 and Section 4.2.3, respectively.

4. Experimental Results

This section is divided into four subsections. These address the training setup, ablation study, comparisons with the state-of-the-art methods, and processing time. The training setup of the training phase such as hyperparameters and training loss are explained in Section 4.1. The results obtained from the ablation study are presented in Section 4.2. Furthermore, Section 4.2.1 compares the classification results by using 30 sub-datasets of different sizes, Section 4.2.2 compares the classification results by using 24 datasets augmented through conventional augmentation methods, and Section 4.2.3 compares the classification results by using 24 datasets augmented through the GAN-based augmentation methods. Section 4.3 compares the experimental results of 78 datasets obtained through the existing plant-image classification methods and the proposed method. Finally, the processing time is recommended in Section 4.4.

4.1. Training Setup

The training setups of the PI-GAN-based plant image augmentation method and proposed PI-CNN-based plant image classification method were as follows. The batch size, training epoch, and learning rate were set to 8, 50, and 0.0001 for the PI-GAN and to 8, 40, and 0.0001 for the PI-CNN, respectively. Moreover, we used the binary cross-entropy loss [34] for both generator and discriminator losses and used the categorical cross-entropy loss [35] for the PI-CNN loss. Adaptive moment estimation (Adam) [36] was used as an optimizer in both PI-CNN and discriminator networks. Figure 8a,b show the training loss and validation loss curves of the proposed PI-GAN per epoch, respectively, whereas Figure 8c,d show the training and validation loss and accuracy curves of the proposed PI-CNN per epoch, respectively. As shown by the convergences of the training loss and accuracy curves in Figure 8, the proposed PI-GAN and PI-CNN were trained sufficiently with the training data. Furthermore, as shown by the convergences of the validation loss and validation accuracy curves in Figure 8, the proposed PI-GAN and PI-CNN are not overfitted to the training data. Table 12 and Table 13 show the search spaces and selected values of hyperparameters for PI-GAN and PI-CNN.

4.2. Ablation Study

The results of various ablation studies are presented in this subsection. The accuracy of the plant classification was measured using four types of metrics in Equations (1)–(4). Here, the true positive rate (TPR), positive predictive values (PPV), accuracy (ACC) [37], and F1-score [38] are presented. In the equations given below, TP, FP, FN, and TN refer to true positive, false positive, false negative, and true negative, respectively. Here, “#” indicates “the number of”.
TPR = # TP # TP   + # FN
PPV   = # TP # TP   + # FP
ACC = # TP + # TN # TP   + # TN   + # FP   + # FN
F 1 - score = 2 · PPV   ·   TPR PPV   +   TPR

4.2.1. Plant Image Classification

In this subsection, we conducted experiments to obtain a good generator structure for PI-GAN, and to obtain a classification structure for PI-CNN. Four different generator networks were compared by using Fruits-360 dataset and PI-CNN, as shown in Table 14. Moreover, four different networks were compared for PI-CNN by using Fruits-360 dataset and without using PI-GAN, as shown in Table 15. As shown in Table 14, a generator network with four encoder-decoders showed higher accuracy compared to others. Moreover, as shown in Table 15, a network with sixteen residual blocks showed higher accuracy compared to others. Therefore, we used a generator network with four encoder-decoders for PI-GAN and a network with sixteen residual blocks for PI-CNN.
In addition, four training sub-datasets (PlantVillage disease, PlantVillage crop, PlantDoc disease + PlantDoc crop + Plants, and Fruits-360) were divided further according to five sizes (100%, 70%, 50%, 30%, and 10%), as explained in Section 3.4, to train the image classification models. The classification accuracy was measured using six testing sub-datasets (PlantVillage disease, PlantVillage crop, PlantDoc disease, PlantDoc crop, Plants, and Fruits-360). The results are compared in Table 16, Table 17 and Table 18. As shown in these tables, the classification accuracy decreases as the size of the training dataset decreases.

4.2.2. Plant Image Classification with Conventional Image Augmentation Methods

In this subsection, image classification was repeated using the training datasets augmented through conventional augmentation methods (image shifting to four directions, in-plane rotation, flipping, Gaussian blurring, scaling, and the adjustments of brightness and contrast) to improve the classification accuracy. Figure 9 shows an example of augmented image data. The results of the experiments are compared in Table 19, Table 20 and Table 21.
Additionally, we conducted experiments using random cropping [45] for augmentation and showed the results in Table 22, Table 23 and Table 24. A comparison between Table 19, Table 20, Table 21, Table 22, Table 23 and Table 24 and Table 16, Table 17 and Table 18 reveals that the augmentation improved the classification accuracy.

4.2.3. Plant Image Classification with PI-GAN-Based Augmentation Methods

In this subsection, image classification was performed using the training datasets augmented back by the PI-GAN-based augmentation method to improve the classification accuracy shown in Table 16, Table 17 and Table 18. Figure 10 shows an example of augmented image data. The results of the experiments are compared in Table 25, Table 26 and Table 27. A comparison of these tables with Table 16, Table 17 and Table 18 reveals that the augmentation improved the classification accuracy.
Figure 11, Figure 12, Figure 13 and Figure 14 present a comparison of F1-scores between the reduced datasets and augmented datasets conveniently. The results in the figures are presented in the following order (top to bottom): original dataset (O), reduced dataset (R), dataset augmented by conventional method (T), and dataset augmented by PI-GAN (G).
As shown in Figure 11, Figure 12, Figure 13 and Figure 14, lower F1-scores were obtained when the reduced datasets were used, whereas higher F1-scores were obtained when the datasets augmented by conventional methods were used. Furthermore, higher F1-scores were obtained when the datasets augmented by PI-GAN-based methods were used. In Table 28, Table 29 and Table 30, the existing augmentation method [46] based on conditional GAN is used to perform the classification for additional experiments. As shown in Table 25, Table 26, Table 27, Table 28, Table 29 and Table 30, classification accuracies by using the proposed PI-GAN are higher compared to those using the existing method [46].
As shown in Table 16, Table 17 and Table 18, we achieved higher accuracy when using a bigger dataset. Moreover, we achieved higher accuracy when using image augmentation, as shown in Table 19, Table 20, Table 21, Table 22, Table 23, Table 24, Table 25, Table 26, Table 27, Table 28, Table 29 and Table 30. Thus, increasing the size of the dataset will not be detrimental to PI-CNN.

4.3. Comparisons with State-of-the-Art Methods

In this subsection, existing plant-image classification methods are compared with the proposed PI-CNN, as shown in Table 31, Table 32 and Table 33. As shown in Table 31, Table 32 and Table 33, the proposed PI-CNN achieved a high classification accuracy compared with the state-of-the-art methods.

4.4. Processing Time

The processing time of the PI-GAN method and classification method (PI-CNN) in the testing phase is shown in Table 34. The processing time was measured in the environments explained in Section 3.3. As shown in Table 34, the frame rate of the PI-GAN method is approximately 21.78 frames per second (fps) (=1000/45.92). Moreover, the frame rate of the proposed PI-CNN is approximately 19.29 fps (=1000/51.85). The total frame rate including both image augmentation and classification method is approximately 10.23 fps (1000/97.77).

5. Discussion

In general, the plant-image classification performance is affected by the structure of the classification model, as well as the image quality and number of images in the datasets. Existing plant-image open datasets are typically small in size compared with open datasets of other fields. Furthermore, only a small number of open datasets exist. Therefore, previous models trained using a small number of plant images demonstrate a lower classification accuracy. This study verified that image classification performed using a small number of training images yielded a lower classification accuracy as the number of images decreased. Accordingly, experiments were conducted in this study by increasing the number of images by using the augmentation method involving PI-GAN. The experimental results obtained with four open databases demonstrated that the classification accuracy was improved compared with those of the state-of-the-art methods for all the reduced datasets. However, there are error cases when augmenting images using PI-GAN, as shown in Figure 15. For example, augmented image is presented in Figure 15a, where the date, carambula, and augmented date image are presented. In the case of the augmented date image, the pattern of the date in the image was lost and looks more like a rock. Moreover, as shown in Figure 15b, pineapple, cactus fruit, and augmented pineapple are presented. Here, the augmented pineapple image became blurry. These errors arise because the two input images are only combined by using a channel-wise concatenation operation rather than by analyzing a pattern style of an input image and applying the pattern style of one image to another input image.
Moreover, example images misclassified by the proposed PI-CNN are presented in Figure 16. For example, apple, peach, and augmented apple images are presented from the left to right in Figure 16a, and peach, apple, and augmented peach images are presented in the same way in Figure 16b. In case of Figure 16a, an apple image (input_1) is augmented based on a peach image (input_2). Thus, the augmented apple (output) image looks a little bit like a peach image. On the contrary, a peach image (input_1) is augmented based on an apple image (input_2) in Figure 16b. So, the augmented peach image looks similar to the apple image. That is, the classification errors occur due to the loss of class information in the augmented images used for the training of PI-CNN in both Figure 16a,b.
We obtained the PI-GAN and PI-CNN structures based on experimental results through ablation studies (Section 4.2). For the model structure adjustment and optimization for the specific domain of plant images, we conducted experiments by using various parameters (Table 12 and Table 13) and structures (Table 14 and Table 15).
Compared to previous GAN-based image augmentation methods [47,48,49], the proposed PI-GAN is novel as follows:
-
Inputs are random numbers (noise data) in [47,48,49], whereas inputs of the PI-GAN are plant images.
-
The methods [47,48,49] use a single input, whereas the proposed PI-GAN uses two images as inputs.
-
The generators of the methods [47,48,49] were designed based on up-sampling networks, whereas the generator of the proposed PI-GAN was designed based on an encoder-decoder network.
-
In the methods [47,48,49], a generator network is trained to generate an image from noise data, whereas the generator network in the proposed PI-GAN extracts features from two different plant images and generates a plant image by combining the extracted features.
Using a small dataset causes the overfitting problem. In the training phase, we can monitor overfitting based on the curves of the loss of validation dataset. For example, we can identify overfitting when validation loss stops decreasing after a certain number of training epochs while the training loss keeps decreasing. The overfitting occurs in cases where the training metric keeps searching for the best fit only for the training dataset. The solution to overcome the overfitting is to increase the number of training data.
In this paper, we conducted various experiments by using training datasets with different sizes. Training and validation loss curves of the experiments are presented in the following figures to determine whether the models were overfitted.
As shown in Figure 17d, the training loss is decreased more than those in Figure 17a–c. This is because the number of training images is too small in Figure 17d, and the model is fitted easily. However, the validation loss in Figure 17d is not decreased after epoch 10, which confirms overfitting caused by a small training set. Similarly, overfitting occurs in Figure 17c owing to the same reason. The testing accuracies with the augmented data of Figure 17g,h become higher than those with the original data of Figure 17c,d as shown in Table 25, Table 26 and Table 27. In addition, the validation losses with the augmented data of Figure 17g,h are a little lower than those with the original data of Figure 17c,d.
There have been various experiments conducted based on different numbers of parameters and layers [26,29]. As shown in [29], the error rates by the CNN model usually increase when the number of layers and parameters decrease. The experimental results with VGG Net in [26] confirm that the error rates by the CNN model usually increase when the number of layers decrease. Therefore, we confirm that other conventional CNN models usually achieve worse results by reducing the parameters.

6. Conclusions

This study proposed plant-image augmentation and classification methods and performed various experiments. New plant images were obtained by combining two types of input plant images in the plant image augmentation by PI-GAN. The proposed classification by PI-CNN involved the classification of images using augmented and original images. Moreover, the accuracy in a case where a small number of training datasets were used was compared with that in a case where a large number of training datasets were used. The size of the original dataset was set to 100%, and the dataset size was reduced by 70%, 50%, 30%, and 10% to compare the difference in accuracy. Subsequently, the accuracy was compared again by augmenting the training datasets that had been reduced by 70%, 50%, 30%, and 10%, back to 100%, using conventional and PI-GAN-based augmentation methods. As shown in Table 25, Table 26 and Table 27, classification accuracies obtained by using the proposed PI-GAN augmentation method are higher compared to those obtained by using conventional data augmentation methods as in Table 19, Table 20, Table 21, Table 22, Table 23 and Table 24. This confirms that the proposed PI-GAN outperforms the conventional augmentation methods. To augment a plant image, the proposed PI-GAN extracts feature from two different plant images and combines them to generate a new plant image, whereas conventional augmentation methods change a plant image by in-plane rotation, shifting, flipping, and adjustments of brightness and contrasts to generate different images. In addition, as shown in Table 25, Table 26 and Table 27, classification accuracies obtained by using PI-CNN with PI-GAN are higher compared to those obtained by using PI-CNN with conventional augmentation methods. Moreover, classification accuracies obtained by using PI-CNN with PI-GAN are higher compared to those obtained by using PI-CNN with the existing conditional GAN-based method [46], as shown in Table 28, Table 29 and Table 30. This confirms that the proposed PI-GAN works better with the proposed PI-CNN. The results of the experiments performed using four open datasets (namely, PlantDoc, PlantVillage, Plants, and Fruits-360) verified that the proposed PI-GAN and PI-CNN achieved a high classification accuracy compared with the state-of-the-art methods in all the cases.
In future studies, a variety of explainable artificial intelligence (XAI) methods [39,40,41,42,43,44] will be considered to examine the methods further to improve the accuracy of PI-CNN methods for image classification. In addition, we will conduct further research into methods for maintaining the class information in the augmented images in order to minimize the classification errors.

Author Contributions

Methodology, G.B.; validation, S.H.N.; supervision, K.R.P.; writing—original draft, G.B.; writing—review and editing, K.R.P. All authors have read and agreed to the published version of the manuscript.

Funding

This research was supported in part by the National Research Foundation of Korea (NRF) funded by the Ministry of Science and ICT (MSIT) through the Basic Science Research Program (NRF-2022R1F1A1064291), in part by the NRF funded by the MSIT through the Basic Science Research Program (NRF-2021R1F1A1045587), and in part by the NRF funded by the MSIT through the Basic Science Research Program (NRF-2020R1A2C1006179).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Shahi, T.B.; Sitaula, C.; Neupane, A.; Guo, W. Fruit classification using attention-based MobileNetV2 for industrial applications. PLoS ONE 2022, 17, e0264586. [Google Scholar] [CrossRef] [PubMed]
  2. Siddiqi, R. Comparative performance of various deep learning based models in fruit image classification. In Proceedings of the 11th International Conference on Advances in Information Technology, Bangkok, Thailand, 1–3 July 2020; Volume 14, pp. 1–9. [Google Scholar] [CrossRef]
  3. Siddiqi, R. Effectiveness of transfer learning and fine tuning in automated fruit image classification. In Proceedings of the 2019 3rd International Conference on Deep Learning Technologies, Xiamen, China, 5–7 July 2019; pp. 91–100. [Google Scholar] [CrossRef]
  4. Savant, S.P.; Khanagoudar, P.S. Autonomous fruit recognition system based on deep convolutional neural network. Int. J. Sci. Res. Sci. Eng. Technol. 2020, 7, 666–669. [Google Scholar] [CrossRef]
  5. Srivastava, S.; Singh, T.; Sharma, S.; Verma, A. A fruit recognition system based on modern deep learning technique. Int. J. Eng. Res. Technol. 2020, 9, 896–898. [Google Scholar] [CrossRef]
  6. Biswas, B.; Ghosh, S.K.; Ghosh, A. A robust multi-label fruit classification based on deep convolution neural network. In Computational Intelligence in Pattern Recognition. Advances in Intelligent Systems and Computing; Das, A., Nayak, J., Naik, B., Pati, S., Pelusi, D., Eds.; Springer: Singapore, 2020; Volume 999. [Google Scholar] [CrossRef]
  7. Franczyk, B.; Hernes, M.; Kozierkiewicz, A.; Kozina, A.; Pietranik, M.; Roemer, I.; Schieck, M. Deep learning for grape variety recognition. Procedia Comput. Sci. 2020, 176, 1211–1220. [Google Scholar] [CrossRef]
  8. Hossain, M.S.; Al-Hammadi, M.; Muhammad, G. Automatic fruit classification using deep learning for industrial applications. IEEE Trans. Ind. Inform. 2019, 15, 1027–1034. [Google Scholar] [CrossRef]
  9. Hamid, N.N.A.A.; Razali, R.A.; Ibrahim, Z. Comparing bags of features, conventional convolutional neural network and AlexNet for fruit recognition. Indones. J. Electr. Eng. Comput. Sci. 2019, 14, 333–339. [Google Scholar] [CrossRef]
  10. Kader, A.; Sharif, S.; Bhowmick, P.; Mim, F.H.; Srizon, A.Y. Effective workflow for high-performance recognition of fruits using machine learning approaches. Int. Res. J. Eng. Technol. 2020, 7, 1516–1521. [Google Scholar]
  11. Katarzyna, R.; Paweł, M.A. Vision-based method utilizing deep convolutional neural networks for fruit variety classification in uncertainty conditions of retail sales. Appl. Sci. 2019, 9, 3971. [Google Scholar] [CrossRef] [Green Version]
  12. Ghosh, S.; Mondal, M.J.; Sen, S.; Chatterjee, S.; Kar Roy, N.; Patnaik, S. A novel approach to detect and classify fruits using ShuffleNet V2. In Proceedings of the IEEE Applied Signal Processing Conference, Kolkata, India, 7–9 October 2020; pp. 163–167. [Google Scholar] [CrossRef]
  13. Muhathir, M.; Santoso, M.H.; Muliono, R. Analysis naïve bayes in classifying fruit by utilizing HOG feature extraction. J. Inform. Telecommun. Eng. 2020, 4, 250–259. [Google Scholar] [CrossRef]
  14. Wang, D.; Wang, J.; Li, W.; Guan, P. T-CNN: Trilinear convolutional neural networks model for visual detection of plant diseases. Comput. Electron. Agric. 2021, 190, 106468. [Google Scholar] [CrossRef]
  15. PlantVillage Dataset. Available online: https://www.kaggle.com/datasets/emmarex/plantdisease (accessed on 5 July 2022).
  16. Singh, D.; Jain, N.; Jain, P.; Kayal, P.; Kumawat, S.; Batra, N. PlantDoc: A dataset for visual plant disease detection. In Proceedings of the 7th ACM IKDD CoDS and 25th COMAD, Hyderabad, India, 5–7 January 2020; pp. 249–253. [Google Scholar] [CrossRef]
  17. Fruits-360 Dataset. Available online: https://www.kaggle.com/datasets/moltean/fruits (accessed on 5 July 2022).
  18. Plants Dataset. Available online: https://www.kaggle.com/datasets/muhammadjawad1998/plants-dataset99-classes/metadata (accessed on 5 July 2022).
  19. Pawara, P.; Okafor, E.; Schomaker, L.; Wiering, M. Data augmentation for plant classification. In Proceedings of the Advanced Concepts for Intelligent Vision, Antwerp, Belgium, 18–21 September 2017; pp. 1–12. [Google Scholar] [CrossRef]
  20. Ghesquiere, M.; Ngxande, M. Deep learning for plant disease detection. In Advances in Computer Vision and Computational Biology; Springer International Publishing: Berlin/Heidelberg, Germany, 2021; pp. 69–84. [Google Scholar] [CrossRef]
  21. Image Data Augmentation. Available online: https://machinelearningmastery.com/how-to-configure-image-data-augmentation-when-training-deep-learning-neural-networks/ (accessed on 5 July 2022).
  22. Popular Image Augmentation Packages. Available online: https://www.kaggle.com/code/parulpandey/overview-of-popular-image-augmentation-packages/notebook (accessed on 5 July 2022).
  23. Santos, T.; Leonardo, D.S.; Andreza, D.S.; Sandra, A. Embrapa wine grape instance segmentation dataset—Embrapa WGISD (1.0.0) [Data set]. Zenodo 2019. Available online: https://zenodo.org/record/3361736#.Ywgs0nZByUk (accessed on 5 July 2022). [CrossRef]
  24. FIDS30 Dataset. Available online: https://www.kaggle.com/datasets/arnavmehta710a/fids30 (accessed on 5 July 2022).
  25. Szegedy, C.; Vanhoucke, V.; Ioffe, S.; Shlens, J.; Wojna, Z. Rethinking the inception architecture for computer vision. arXiv 2015, arXiv:1512.00567. [Google Scholar]
  26. Simonyan, K.; Zisserman, A. Very deep convolutional networks for large-scale image recognition. arXiv 2015, arXiv:1409.1556v6. [Google Scholar]
  27. Redmon, J.; Farhadi, A. Yolo V3: An incremental improvement. arXiv 2018, arXiv:1804.02767. [Google Scholar]
  28. PI-GAN and PI-CNN Models. Available online: https://github.com/ganav/PI-GAN-and-PI-CNN-networks (accessed on 5 July 2022).
  29. He, K.; Zhang, X.; Ren, S.; Sun, J. Deep residual learning for image recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA, 27–30 June 2016; pp. 770–778. [Google Scholar] [CrossRef]
  30. Nvidia GeForce GTX TITAN X. Available online: https://www.nvidia.com/en-us/geforce/products/10series/titan-x-pascal/ (accessed on 5 July 2022).
  31. OpenCV. Available online: http://opencv.org/ (accessed on 5 July 2022).
  32. Python. Available online: https://www.python.org/ (accessed on 5 July 2022).
  33. Keras. Available online: https://keras.io/ (accessed on 5 July 2022).
  34. Binary Cross-Entropy Loss. Available online: https://towardsdatascience.com/understanding-binary-cross-entropy-log-loss-a-visual-explanation-a3ac6025181a (accessed on 5 July 2022).
  35. Categorical Cross-Entropy Loss. Available online: https://peltarion.com/knowledge-center/documentation/modeling-view/build-an-ai-model/loss-functions/categorical-crossentropy (accessed on 5 July 2022).
  36. Kingma, D.P.; Ba, J.B. ADAM: A method for stochastic optimization. In Proceedings of the 3rd International Conference on Learning Representations, San Diego, CA, USA, 7–9 May 2015; pp. 1–15. [Google Scholar]
  37. Powers, D.M.W. Evaluation: From precision, recall and f-measure to roc, informedness, markedness & correlation. Mach. Learn. Technol. 2011, 2, 37–63. [Google Scholar]
  38. Derczynski, L. Complementarity, F-score, and NLP evaluation. In Proceedings of the Tenth International Conference on Language Resources and Evaluation 2016, Portorož, Slovenia, 23–28 May 2016; European Language Resources Association; pp. 261–266. Available online: https://aclanthology.org/L16-1040 (accessed on 5 July 2022).
  39. Wang, W.; Wang, Z.; Du, M.; Yang, F.; Zhang, Z.; Ding, S.; Mardziel, P.; Hu, X. Score-CAM: Score-weighted visual explanations for convolutional neural networks. arXiv 2020, arXiv:1910.01279v2. [Google Scholar] [CrossRef]
  40. Selvaraju, R.R.; Cogswell, M.; Das, A.; Vedantam, R.; Parikh, D.; Batra, D. Grad-CAM: Visual explanations from deep networks via gradient-based localization. In Proceedings of the IEEE International Conference on Computer Vision, Venice, Italy, 22–29 October 2017; pp. 618–626. [Google Scholar] [CrossRef] [Green Version]
  41. Feng, Z.; Zhu, M.; Stanković, L.; Ji, H. Self-Matching CAM: A novel accurate visual explanation of CNNs for SAR image interpretation. Remote Sens. 2021, 13, 1772. [Google Scholar] [CrossRef]
  42. Desai, S.; Ramaswamy, H.G. Ablation-CAM: Visual explanations for deep convolutional network via gradient-free localization. In Proceedings of the IEEE Winter Conference on Applications of Computer Vision, Snowmass Village, CO, USA, 1–5 March 2020; pp. 972–980. [Google Scholar] [CrossRef]
  43. Chattopadhay, A.; Sarkar, A.; Howlader, P.; Balasubramanian, V.N. Grad-CAM++: Generalized gradient-based visual explanations for deep convolutional networks. In Proceedings of the IEEE Winter Conference on Applications of Computer Vision, Lake Tahoe, NV, USA, 12–15 March 2018; pp. 839–847. [Google Scholar] [CrossRef]
  44. Zhou, B.; Khosla, A.; Lapedriza, A.; Oliva, A.; Torralba, A. Learning deep features for discriminative localization. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA, 27–30 June 2016; pp. 2921–2929. [Google Scholar] [CrossRef]
  45. Takahashi, R.; Matsubara, T.; Uehara, K. Data Augmentation Using Random Image Cropping and Patching for Deep CNNs. IEEE Trans. Circuits Syst. Video Technol. 2020, 30, 2917–2931. [Google Scholar] [CrossRef]
  46. Jordan, J.B.; Chloe, M.B.; Luis, J.M.; Anikó, E.; Diego, R.F. Fruit quality and defect image classification with conditional GAN data augmentation. Sci. Hortic. 2022, 293, 110684. [Google Scholar] [CrossRef]
  47. Frid-Adar, M.; Klang, E.; Amitai, M.; Goldberger, J.; Greenspan, H. Synthetic data augmentation using GAN for improved liver lesion classification. In Proceedings of the 2018 IEEE 15th International Symposium on Biomedical Imaging (ISBI 2018), Washington, DC, USA, 4–7 April 2018; pp. 289–293. [Google Scholar]
  48. Singh, A.; Bansal, A.; Chauhan, N.; Sahu, S.P.; Dewangan, D.K. Image generation using GAN and its classification using SVM and CNN. In Proceedings of the International Conference on Emerging Trends and Technologies on Intelligent Systems, Online, 4–5 March 2021; Springer: Singapore, 2021; pp. 89–100. [Google Scholar]
  49. Bhattacharya, D.; Banerjee, S.; Bhattacharya, S.; Uma Shankar, B.; Mitra, S. GAN-based novel approach for data augmentation with improved disease classification. In Advancement of Machine Intelligence in Interactive Medical Image Analysis; Springer: Singapore, 2020; pp. 229–239. [Google Scholar]
Figure 1. Overview of the methods designed in this study. (a) Proposed PI-GAN for plant image augmentation; (b) proposed PI-CNN for plant image classification.
Figure 1. Overview of the methods designed in this study. (a) Proposed PI-GAN for plant image augmentation; (b) proposed PI-CNN for plant image classification.
Mathematics 10 03091 g001
Figure 2. Example of the detailed structure of PI-GAN.
Figure 2. Example of the detailed structure of PI-GAN.
Mathematics 10 03091 g002
Figure 3. Example of the detailed structure of the proposed PI-CNN.
Figure 3. Example of the detailed structure of the proposed PI-CNN.
Mathematics 10 03091 g003
Figure 4. Example images of Fruits-360 dataset used in our experiment. Left to right: images of apple, banana, peach, and grape.
Figure 4. Example images of Fruits-360 dataset used in our experiment. Left to right: images of apple, banana, peach, and grape.
Mathematics 10 03091 g004
Figure 5. Example images of PlantDoc dataset used in our experiment. Left to right: images of apple leaf, apple leaf with rust disease, healthy bell pepper leaf, and bell pepper leaf with spot disease.
Figure 5. Example images of PlantDoc dataset used in our experiment. Left to right: images of apple leaf, apple leaf with rust disease, healthy bell pepper leaf, and bell pepper leaf with spot disease.
Mathematics 10 03091 g005
Figure 6. Example images of PlantVillage dataset used in our experiment. Left to right: images of healthy apple leaf, apple leaf with black rot disease, healthy cherry leaf, and cherry leaf with powdery mildew disease.
Figure 6. Example images of PlantVillage dataset used in our experiment. Left to right: images of healthy apple leaf, apple leaf with black rot disease, healthy cherry leaf, and cherry leaf with powdery mildew disease.
Mathematics 10 03091 g006
Figure 7. Example images of Plants dataset used in our experiment. Left to right: images of aeonium, almond, asparagus, and sunflower.
Figure 7. Example images of Plants dataset used in our experiment. Left to right: images of aeonium, almond, asparagus, and sunflower.
Mathematics 10 03091 g007
Figure 8. Training loss, validation loss, and validation accuracy curves of PI-GAN and PI-CNN with Fruits-360 dataset. (a,b) training loss and validation loss curves, respectively, of PI-GAN; (c,d) show the training and validation loss and accuracy curves of the proposed PI-CNN, respectively.
Figure 8. Training loss, validation loss, and validation accuracy curves of PI-GAN and PI-CNN with Fruits-360 dataset. (a,b) training loss and validation loss curves, respectively, of PI-GAN; (c,d) show the training and validation loss and accuracy curves of the proposed PI-CNN, respectively.
Mathematics 10 03091 g008aMathematics 10 03091 g008b
Figure 9. Examples of images augmented by conventional augmentation methods. Images augmented by (a) flipping and in-plane rotation; (b) image shifting in four directions; (c) the adjustments of brightness and contrast; (d) Gaussian blurring; (e) scaling.
Figure 9. Examples of images augmented by conventional augmentation methods. Images augmented by (a) flipping and in-plane rotation; (b) image shifting in four directions; (c) the adjustments of brightness and contrast; (d) Gaussian blurring; (e) scaling.
Mathematics 10 03091 g009aMathematics 10 03091 g009b
Figure 10. Examples of images augmented by PI-GAN. Left to right: (a) mango, date, and augmented image; (b) pear, mango, and augmented image; (c) huckleberry, date, and augmented image; (d) maracuja, tamarillo, and augmented image.
Figure 10. Examples of images augmented by PI-GAN. Left to right: (a) mango, date, and augmented image; (b) pear, mango, and augmented image; (c) huckleberry, date, and augmented image; (d) maracuja, tamarillo, and augmented image.
Mathematics 10 03091 g010
Figure 11. Comparison of F1-scores based on all the datasets (original dataset (O), reduced dataset (R), dataset augmented by conventional method (T), and dataset augmented by PI-GAN (G)). The datasets have been reduced to 70%.
Figure 11. Comparison of F1-scores based on all the datasets (original dataset (O), reduced dataset (R), dataset augmented by conventional method (T), and dataset augmented by PI-GAN (G)). The datasets have been reduced to 70%.
Mathematics 10 03091 g011
Figure 12. Comparison of F1-scores based on all the datasets (original dataset (O), reduced dataset (R), dataset augmented by conventional method (T), and dataset augmented by PI-GAN (G)). The datasets have been reduced to 50%.
Figure 12. Comparison of F1-scores based on all the datasets (original dataset (O), reduced dataset (R), dataset augmented by conventional method (T), and dataset augmented by PI-GAN (G)). The datasets have been reduced to 50%.
Mathematics 10 03091 g012
Figure 13. Comparison of F1-scores based on all the datasets (original dataset (O), reduced dataset (R), dataset augmented by conventional method (T), and dataset augmented by PI-GAN (G)). The datasets have been reduced to 30%.
Figure 13. Comparison of F1-scores based on all the datasets (original dataset (O), reduced dataset (R), dataset augmented by conventional method (T), and dataset augmented by PI-GAN (G)). The datasets have been reduced to 30%.
Mathematics 10 03091 g013
Figure 14. Comparison of F1-scores based on all the datasets (original dataset (O), reduced dataset (R), dataset augmented by conventional method (T), and dataset augmented by PI-GAN (G)). The datasets have been reduced to 10%.
Figure 14. Comparison of F1-scores based on all the datasets (original dataset (O), reduced dataset (R), dataset augmented by conventional method (T), and dataset augmented by PI-GAN (G)). The datasets have been reduced to 10%.
Mathematics 10 03091 g014
Figure 15. Error cases by the proposed PI-GAN. From the left to right: (a) date, carambula, and augmented date; (b) pineapple, cactus fruit, and augmented pineapple.
Figure 15. Error cases by the proposed PI-GAN. From the left to right: (a) date, carambula, and augmented date; (b) pineapple, cactus fruit, and augmented pineapple.
Mathematics 10 03091 g015
Figure 16. Error cases by the proposed PI-CNN. From the left to right: (a) apple, peach, and augmented apple; (b) peach, apple, and augmented peach.
Figure 16. Error cases by the proposed PI-CNN. From the left to right: (a) apple, peach, and augmented apple; (b) peach, apple, and augmented peach.
Mathematics 10 03091 g016
Figure 17. Training and validation loss curves. Curves obtained by using training sets with size of (a) 70%, (b) 50%, (c) 30%, (d) 10%. Curves obtained by using training sets augmented from training sets with size of (e) 70%, (f) 50%, (g) 30%, (h) 10%.
Figure 17. Training and validation loss curves. Curves obtained by using training sets with size of (a) 70%, (b) 50%, (c) 30%, (d) 10%. Curves obtained by using training sets augmented from training sets with size of (e) 70%, (f) 50%, (g) 30%, (h) 10%.
Mathematics 10 03091 g017aMathematics 10 03091 g017b
Table 1. Comparison between the proposed and previous methods for plant image classification.
Table 1. Comparison between the proposed and previous methods for plant image classification.
MethodsAdvantagesDisadvantages
Large training set-based[1,2,3,4,5,6,7,8,9,10,11,12,13,14]High accuracyDoes not consider small training sets
Small training set-basedProposed methodConsiders various sizes of training sets. Considers various sizes of training setsLower accuracy than that achieved when a large training set is used
Table 2. Description of the generator network used in our PI-GAN.
Table 2. Description of the generator network used in our PI-GAN.
Layer NumberLayer TypeTimesNumber of FiltersNumber of
Parameters
Layer Connection
(Connected to)
0input_layer_1×100input_1
1input_layer_2×100input_2
2encoder_1×4128605,696input_layer_1
3encoder_2×4128605,696input_layer_2
4res_block_1×2128590,592encoder_1
5res_block_2×2128590,592encoder_2
6concat×100res_block_1 & res_block_2
7res_block_3×3256885,888concat
8decoder×41281,328,640res_block_3
9conv2d (tanh)×133459decoder
Total number of trainable parameters: 4,610,563.
Table 3. Description of an encoder block of the generator network.
Table 3. Description of an encoder block of the generator network.
Layer NumberLayer TypeLayer Connection
(Connected to)
1conv2d_1input
2prelu_1conv2d_1
3conv2d_2prelu_1
4prelu_2conv2d_2
5max_poolprelu_2
Table 4. Description of a decoder block of the generator network.
Table 4. Description of a decoder block of the generator network.
Layer NumberLayer TypeLayer Connection
(Connected to)
1conv2d_1input
2prelu_1conv2d_1
3conv2d_2prelu_1
4prelu_2conv2d_2
5Up2prelu_2
Table 5. Description of a residual block of the generator network.
Table 5. Description of a residual block of the generator network.
Layer NumberLayer TypeLayer Connection
(Connected to)
1conv2d_1input
2preluconv2d_1
3conv2d_2prelu
4addconv2d_2 & input
Table 6. Description of the discriminator network of PI-GAN.
Table 6. Description of the discriminator network of PI-GAN.
Layer NumberLayer TypeTimesNumber of
Filters
Number of StridesNumber of
Parameters
Layer Connection
(Connected to)
0input layer×1000input
1conv2d×112813584input layer
2lrelu_1×1000conv2d
3disc_block×5128, 128
256, 256, 256
1, 1
2, 2, 2
1,770,496lrelu_1
4lrelu_2×1000disc_block
5FC (sigmoid)×1class#0173,057lrelu_2
Total number of trainable parameters: 1,947,137.
Table 7. Description of a convolution block of the discriminator network.
Table 7. Description of a convolution block of the discriminator network.
Layer NumberLayer TypeLayer Connection
(Connected to)
1conv2dinput
2lreluconv2d
Table 8. Description of the proposed PI-CNN.
Table 8. Description of the proposed PI-CNN.
Layer NumberLayer TypeNumber of
Filters
Number of
Parameters
Layer Connection
(Connected to)
1input layer_100input
2conv2d_1641792input layer_1
3conv2d_26436,928conv2d_1
4max_pool_100conv2d_2
5res_block_16473,920max_pool_1
6res_block_26473,920res_block_1
7res_block_36473,920res_block_2
8res_block_46473,920res_block_3
9conv2d_312873,856res_block_4
10conv2d_4128147,584conv2d_3
11max_pool_200conv2d_4
12res_block_5128295,296max_pool_2
13res_block_6128295,296res_block_5
14res_block_7128295,296res_block_6
15res_block_8128295,296res_block_7
16conv2d_5128147,584res_block_8
17conv2d_6128147,584conv2d_5
18max_pool_300conv2d_6
19res_block_9128295,296max_pool_3
20res_block_10128295,296res_block_9
21res_block_11128295,296res_block_10
22res_block_12128295,296res_block_11
23conv2d_7128147,584res_block_12
24conv2d_8128147,584conv2d_7
25max_pool_400conv2d_8
26res_block_13128295,296max_pool_4
27res_block_14128295,296res_block_13
28res_block_15128295,296res_block_14
29res_block_16128295,296res_block_15
30FC (softmax)class#258,062res_block_16
Total number of trainable parameters: 4,947,790.
Table 9. Summary of the datasets.
Table 9. Summary of the datasets.
DatasetsTraining SetsTest SetsValidation SetsDimensionDepthExtensionClass#
Fruits-36041,32212,8771000100 × 10024jpg81
PlantVillage40,00013,3051000256 × 25624jpg38
PlantDoc233620036150 × 150–5616 × 374424jpg27
Plants13,14952181521104 × 104–3168 × 475224jpg99
Table 10. Detailed explanations of train and test sets used in our experiments.
Table 10. Detailed explanations of train and test sets used in our experiments.
DatasetsTraining SetsTest SetsValidation Sets
100%70%50%30%10%
Fruits-360 41,32228,92620,66112,396413212,8771000
PlantVillage 40,00028,00020,00012,000400013,3051000
PlantDoc23361635116870023320036
Plants13,149920465743944131452181521
Table 11. Detailed explanations of train and test subsets used in our experiments. Training was performed by combining the training sets of the Plants and PlantDoc datasets (the numbers with ‘*’).
Table 11. Detailed explanations of train and test subsets used in our experiments. Training was performed by combining the training sets of the Plants and PlantDoc datasets (the numbers with ‘*’).
DatasetsTrain SetsTest SetsValidation Sets
CropDiseaseCropDiseaseCropDisease
Fruits-36041,322 12,877 1000
PlantVillage11,22228,77835929713270730
PlantDoc757 *1579 *761241422
Plants13,149 * 5218 1521
Table 12. Search space and selected values of hyperparameters for PI-GAN.
Table 12. Search space and selected values of hyperparameters for PI-GAN.
ParametersWeight Decay
(Weight Regularization L2)
LossKernel
Initializer
Bias
Initializer
OptimizerLearning RateBeta_1Beta_2EpsilonEpochsBatch
Size
Search Space[0.001, 0.01, 0.1][“binary cross-entropy,”
“VGG-19”]
“glorot uniform”“zeros”[“SGD,” “adam”][0.0001, 0.001, 0.01, 0.1][0.7, 0.8, 0.9][0.8, 0.9, 0.999][1 × 10−9,
1 × 10−8,
1 × 10−7]
[1–50][1, 4, 8, 16]
Selected Value0.01“binary cross-entropy”“glorot uniform”“zeros”“adam”0.00010.90.9991 × 10−8508
Table 13. Search space and selected values of hyperparameters for PI-CNN.
Table 13. Search space and selected values of hyperparameters for PI-CNN.
ParametersLearning Rate Decay (for SGD)Momentum
(for SGD)
LossMetricsOptimizerLearning RateEpochsBatch
Size
Search Space[0.000001, 0.00001, 0.0001][0.9, 0.8, 0.7]“categorical cross-entropy”[“categorical_accuracy”, “accuracy”][“SGD,” “adam”][0.0001, 0.001, 0.01, 0.1][1–40][1, 4, 8, 16]
Selected Value0.000010.9“categorical cross-entropy”” accuracy”“adam”0.0001308
Table 14. Comparison of accuracies by the proposed PI-GAN generator and variants of generators in the Fruits-360 training set of 50%.
Table 14. Comparison of accuracies by the proposed PI-GAN generator and variants of generators in the Fruits-360 training set of 50%.
MethodsTPRPPVACCF1-Score
encoder-decoder ×176.0080.2986.7478.08
encoder-decoder ×276.4381.6286.0778.94
encoder-decoder ×376.5680.4087.4978.43
encoder-decoder ×4 (proposed)76.7081.6587.9479.18
encoder-decoder ×575.8581.1286.6278.40
Table 15. Comparison of accuracies by the proposed PI-CNN and variants of CNN in the Fruits-360 dataset.
Table 15. Comparison of accuracies by the proposed PI-CNN and variants of CNN in the Fruits-360 dataset.
MethodsTPRPPVACCF1-Score
res_block-1495.0193.1597.3894.07
res _block-1595.2492.4398.5393.81
res _block-16 (proposed)95.4094.3398.5994.87
res _block-1795.1492.4696.9493.78
Table 16. Comparison of accuracies achieved by the proposed PI-CNN in the original and reduced datasets. The accuracies were achieved in the PlantVillage dataset (unit: %).
Table 16. Comparison of accuracies achieved by the proposed PI-CNN in the original and reduced datasets. The accuracies were achieved in the PlantVillage dataset (unit: %).
DatasetDiseaseCrop
TPRPPVACCF1-ScoreTPRPPVACCF1-Score
Original100%89.6295.1397.2692.3898.4894.7699.5796.62
Reduced70%86.2089.1192.3487.6689.1086.2790.4187.69
50%71.8978.6784.7075.2871.1078.6084.6474.85
30%52.8047.4865.5450.1453.2045.8763.1349.54
10%31.8434.2943.5533.0732.9833.2543.9133.12
Table 17. Comparison of accuracies achieved by the proposed PI-CNN in the original and reduced datasets. The accuracies were achieved in the PlantDoc dataset (unit: %).
Table 17. Comparison of accuracies achieved by the proposed PI-CNN in the original and reduced datasets. The accuracies were achieved in the PlantDoc dataset (unit: %).
DatasetDiseaseCrop
TPRPPVACCF1-ScoreTPRPPVACCF1-Score
Original100%66.1367.8780.3667.0078.2674.6686.5776.46
Reduced70%59.5454.4874.0857.0170.3764.7880.8167.58
50%40.2039.2161.0539.7151.9548.1572.5550.05
30%36.6229.3145.2132.9741.6235.3153.1438.47
10%21.3022.5129.1221.9121.9822.4132.1822.20
Table 18. Comparison of accuracies achieved by the proposed PI-CNN in the original and reduced datasets. The accuracies were achieved in the Plants and Fruits-360 dataset (unit: %).
Table 18. Comparison of accuracies achieved by the proposed PI-CNN in the original and reduced datasets. The accuracies were achieved in the Plants and Fruits-360 dataset (unit: %).
DatasetPlants DatasetFruits-360 Dataset
TPRPPVACCF1-ScoreTPRPPVACCF1-Score
Original100%90.1086.1196.5288.1195.4094.3398.5994.87
Reduced70%80.4676.4092.8178.4391.7089.0091.8090.35
50%62.7160.2284.1061.4772.4778.6383.6375.55
30%49.7441.5060.3445.6252.6845.8665.4449.27
10%30.6631.3340.8531.0039.3736.1948.3137.78
Table 19. Comparison of accuracies achieved by the proposed PI-CNN in the original and augmented datasets. The accuracies were achieved in the PlantVillage dataset (unit: %).
Table 19. Comparison of accuracies achieved by the proposed PI-CNN in the original and augmented datasets. The accuracies were achieved in the PlantVillage dataset (unit: %).
DatasetDiseaseCrop
Before
Augmentation
After
Augmentation
TPRPPVACCF1-ScoreTPRPPVACCF1-Score
100%100%89.6295.1397.2692.3898.4894.7699.5796.62
70%100%86.8990.1793.8788.5389.7688.9692.6089.36
50%100%71.6679.8886.9775.7773.5880.8484.8277.21
30%100%54.8649.3466.1352.1053.1146.9563.3450.03
10%100%32.7136.2144.2834.4632.8034.9745.2933.89
Table 20. Comparison of accuracies achieved by the proposed PI-CNN in the original and augmented datasets. The accuracies were achieved in the PlantDoc dataset (unit: %).
Table 20. Comparison of accuracies achieved by the proposed PI-CNN in the original and augmented datasets. The accuracies were achieved in the PlantDoc dataset (unit: %).
DatasetDiseaseCrop
Before
Augmentation
After
Augmentation
TPRPPVACCF1-ScoreTPRPPVACCF1-Score
100%100%66.1367.8780.3667.0078.2674.6686.5776.46
70%100%60.1256.2376.8258.1871.8665.8082.4068.83
50%100%42.8339.5562.6941.1953.3650.9272.3152.14
30%100%38.8733.7148.836.2944.5338.258.5742.87
10%100%23.4022.8830.3023.1424.4523.8034.6924.13
Table 21. Comparison of accuracies achieved by the proposed PI-CNN in the original and augmented datasets. The accuracies were achieved in the Plants and Fruits-360 dataset (unit: %).
Table 21. Comparison of accuracies achieved by the proposed PI-CNN in the original and augmented datasets. The accuracies were achieved in the Plants and Fruits-360 dataset (unit: %).
DatasetPlants DatasetFruits-360 Dataset
Before
Augmentation
After
Augmentation
TPRPPVACCF1-ScoreTPRPPVACCF1-Score
100%100%90.1086.1196.5288.1195.4094.3398.5994.87
70%100%82.1976.3092.8679.2592.3090.7293.6191.51
50%100%63.3261.9485.9362.6373.4078.8083.6676.10
30%100%51.5241.3062.4746.4154.3947.2567.5050.82
10%100%30.7833.6041.1632.1939.5737.2648.3338.42
Table 22. Comparison of accuracies achieved by the proposed PI-CNN in the original and augmented datasets. The accuracies were achieved in the PlantVillage dataset (unit: %).
Table 22. Comparison of accuracies achieved by the proposed PI-CNN in the original and augmented datasets. The accuracies were achieved in the PlantVillage dataset (unit: %).
DatasetDiseaseCrop
Before
Augmentation
After
Augmentation
TPRPPVACCF1-ScoreTPRPPVACCF1-Score
100%100%89.6295.1397.2692.3898.4894.7699.5796.62
70%100%86.3988.4593.3987.4188.4588.8491.7288.65
50%100%70.9979.1885.6774.8672.9080.0984.5876.32
30%100%53.0748.4964.3650.6852.6546.0861.5249.15
10%100%31.5336.0643.3033.6430.8534.9144.9732.75
Table 23. Comparison of accuracies achieved by the proposed PI-CNN in the original and augmented datasets. The accuracies were achieved in the PlantDoc dataset (unit: %).
Table 23. Comparison of accuracies achieved by the proposed PI-CNN in the original and augmented datasets. The accuracies were achieved in the PlantDoc dataset (unit: %).
DatasetDiseaseCrop
Before
Augmentation
After
Augmentation
TPRPPVACCF1-ScoreTPRPPVACCF1-Score
100%100%66.1367.8780.3667.0078.2674.6686.5776.46
70%100%59.6055.7276.4057.5970.6364.4680.9667.40
50%100%41.6838.8760.7440.2253.2350.5971.1151.88
30%100%37.3033.5547.8335.3343.4738.0057.3240.55
10%100%22.0422.1830.1322.1123.1222.2733.9222.69
Table 24. Comparison of accuracies achieved by the proposed PI-CNN in the original and augmented datasets. The accuracies were achieved in the Plants and Fruits-360 dataset (unit: %).
Table 24. Comparison of accuracies achieved by the proposed PI-CNN in the original and augmented datasets. The accuracies were achieved in the Plants and Fruits-360 dataset (unit: %).
DatasetPlants datasetFruits-360 dataset
Before
Augmentation
After
Augmentation
TPRPPVACCF1-ScoreTPRPPVACCF1-Score
100%100%90.1086.1196.5288.1195.4094.3398.5994.87
70%100%80.5676.2291.8378.3391.4790.6292.3791.04
50%100%61.3761.0784.3261.2273.1478.6182.7875.78
30%100%50.4141.2761.0945.3852.4345.4465.8348.68
10%100%29.1833.1140.9331.0237.9535.8448.2236.87
Table 25. Comparison of accuracies achieved by the proposed PI-CNN in the original and PI-GAN-augmented datasets. The accuracies were achieved in the PlantVillage dataset (unit: %).
Table 25. Comparison of accuracies achieved by the proposed PI-CNN in the original and PI-GAN-augmented datasets. The accuracies were achieved in the PlantVillage dataset (unit: %).
Dataset DivisionDiseaseCrop
Before
Augmentation
After
Augmentation
TPRPPVACCF1-ScoreTPRPPVACCF1-Score
100%100%89.6295.1397.2692.3898.4894.7699.5796.62
70%100%88.6693.3696.4091.0193.6190.9293.6592.27
50%100%74.5081.2387.6777.8774.3082.4988.1078.40
30%100%56.0051.3668.3353.6856.1548.4666.6252.31
10%100%34.3438.6047.5936.4736.8136.2046.4536.51
Table 26. Comparison of accuracies achieved by the proposed PI-CNN in the original and PI-GAN-augmented datasets. The accuracies were achieved in the PlantDoc dataset (unit: %).
Table 26. Comparison of accuracies achieved by the proposed PI-CNN in the original and PI-GAN-augmented datasets. The accuracies were achieved in the PlantDoc dataset (unit: %).
Dataset DivisionDiseaseCrop
Before
Augmentation
After
Augmentation
TPRPPVACCF1-ScoreTPRPPVACCF1-Score
100%100%66.1367.8780.3667.0078.2674.6686.5776.46
70%100%63.6857.1078.5060.3973.1668.1783.6270.67
50%100%43.6943.6865.4143.6955.4752.9276.5954.20
30%100%41.1536.2450.438.5147.7342.362.2845.57
10%100%26.4825.8732.1426.1828.8026.9136.3327.86
Table 27. Comparison of accuracies achieved by the proposed PI-CNN in the original and PI-GAN-augmented datasets. The accuracies were achieved in the Plants and Fruits-360 datasets (unit: %).
Table 27. Comparison of accuracies achieved by the proposed PI-CNN in the original and PI-GAN-augmented datasets. The accuracies were achieved in the Plants and Fruits-360 datasets (unit: %).
Dataset DivisionPlantsFruits-360
Before
Augmentation
After
Augmentation
TPRPPVACCF1-ScoreTPRPPVACCF1-Score
100%100%90.1086.1196.5288.1195.4094.3398.5994.87
70%100%84.5280.6696.1882.5994.5392.3194.4893.42
50%100%66.6464.5388.5265.5976.7081.6587.9479.18
30%100%53.2845.3463.9349.3155.1249.6269.8952.37
10%100%34.5235.9644.3135.2443.4039.7051.9041.55
Table 28. Comparison of accuracies achieved by the proposed PI-CNN in the original and PI-GAN-augmented datasets. The accuracies were achieved in the PlantVillage dataset (unit: %).
Table 28. Comparison of accuracies achieved by the proposed PI-CNN in the original and PI-GAN-augmented datasets. The accuracies were achieved in the PlantVillage dataset (unit: %).
Dataset DivisionDiseaseCrop
Before
Augmentation
After
Augmentation
TPRPPVACCF1-ScoreTPRPPVACCF1-Score
100%100%89.6295.1397.2692.3898.4894.7699.5796.62
70%100%87.3492.1095.2989.6692.4390.8592.6491.63
50%100%73.2680.7686.8176.8373.9882.2887.9277.91
30%100%55.3250.1368.2752.6055.7447.6865.4651.40
10%100%33.8838.6046.4336.0936.3935.9245.5636.15
Table 29. Comparison of accuracies achieved by the proposed PI-CNN in the original and PI-GAN-augmented datasets. The accuracies were achieved in the PlantDoc dataset (unit: %).
Table 29. Comparison of accuracies achieved by the proposed PI-CNN in the original and PI-GAN-augmented datasets. The accuracies were achieved in the PlantDoc dataset (unit: %).
Dataset DivisionDiseaseCrop
Before
Augmentation
After
Augmentation
TPRPPVACCF1-ScoreTPRPPVACCF1-Score
100%100%66.1367.8780.3667.0078.2674.6686.5776.46
70%100%63.0156.7577.2159.7271.7367.4182.2469.51
50%100%42.8143.2964.1343.0454.1752.9175.2253.53
30%100%39.6535.3449.4137.3747.1541.8260.8244.33
10%100%26.2724.3931.8125.2927.9825.9835.8126.95
Table 30. Comparison of accuracies achieved by the proposed PI-CNN in the original and PI-GAN-augmented datasets. The accuracies were achieved in the Plants and Fruits-360 datasets (unit: %).
Table 30. Comparison of accuracies achieved by the proposed PI-CNN in the original and PI-GAN-augmented datasets. The accuracies were achieved in the Plants and Fruits-360 datasets (unit: %).
Dataset DivisionPlantsFruits-360
Before
Augmentation
After
Augmentation
TPRPPVACCF1-ScoreTPRPPVACCF1-Score
100%100%90.1086.1196.5288.1195.4094.3398.5994.87
70%100%83.7979.5994.9081.6393.9092.2994.1293.09
50%100%65.9163.2387.7264.5575.4281.2987.2678.25
30%100%52.7945.0063.7348.5854.5349.0869.3651.66
10%100%33.5535.1243.3634.3243.1538.7050.5340.80
Table 31. Comparison of accuracies achieved by the proposed PI-CNN and the existing methods in the original and reduced datasets. The accuracies were achieved in the PlantVillage dataset (unit: %).
Table 31. Comparison of accuracies achieved by the proposed PI-CNN and the existing methods in the original and reduced datasets. The accuracies were achieved in the PlantVillage dataset (unit: %).
MethodsDatasetDiseaseCrop
TPRPPVACCF1-ScoreTPRPPVACCF1-Score
Wang [14]100%88.3194.0296.3491.1797.1593.5498.3595.35
70%85.6588.8491.4887.2588.8485.7489.9887.29
50%70.8577.6484.0474.2570.5877.5183.2474.05
30%51.5646.3364.9248.9552.8444.1763.8448.51
10%30.5433.8943.1232.2232.0132.9843.5432.50
Shahi [1]100%89.3294.5396.9491.9397.9394.3298.1296.13
70%85.8488.1791.7487.0188.1783.4489.2485.81
50%69.4576.1782.6172.8170.8276.3282.4273.57
30%51.5246.4463.1448.9851.4343.6662.4547.55
10%29.1433.4842.7831.3131.5232.7442.4932.13
Srivastava [5]100%88.9094.8697.1091.7997.7394.5198.4096.10
70%84.9088.2591.1286.5589.0085.3588.9287.14
50%70.8677.8083.6974.1769.6677.5884.1473.40
30%52.3046.9165.1349.4652.3545.6963.1348.80
10%31.0833.0243.3732.0231.5232.2442.6631.88
Jordan [46]100%88.2094.1296.9191.0697.1093.8598.6295.45
70%86.0689.0191.3787.5189.0185.4890.0587.21
50%71.3978.6584.3574.8570.0178.3684.5873.95
30%52.0747.1165.2049.4652.1245.2562.6748.44
10%31.3833.7042.5332.5032.9632.7942.9332.87
Ours100%89.6295.1397.2692.3898.4894.7699.5796.62
70%86.2089.1192.3487.6689.1086.2790.4187.69
50%71.8978.6784.7075.2871.1078.6084.6474.85
30%52.8047.4865.5450.1453.2045.8763.1349.54
10%31.8434.2943.5533.0732.9833.2543.9133.12
Table 32. Comparison of accuracies achieved by the proposed PI-CNN and the existing methods in the original and reduced datasets. The accuracies were achieved in the PlantDoc dataset (unit: %).
Table 32. Comparison of accuracies achieved by the proposed PI-CNN and the existing methods in the original and reduced datasets. The accuracies were achieved in the PlantDoc dataset (unit: %).
MethodsDatasetDiseaseCrop
TPRPPVACCF1-ScoreTPRPPVACCF1-Score
Wang [14]100%65.1266.4979.8265.8177.3773.5484.9575.46
70%58.2654.3973.0756.3368.5064.7479.8966.62
50%38.3038.2060.6738.2551.2547.8271.3149.54
30%35.8528.3844.1832.1240.1734.8852.6937.53
10%19.4221.3127.6820.3720.2621.2930.2120.78
Shahi [1]100%64.5866.6678.8365.6277.7973.4286.0275.61
70%58.9053.7272.7656.3168.8563.6980.4266.27
50%38.7938.6459.7338.7250.4147.9671.0749.19
30%35.7928.5944.5032.1939.7634.0551.5036.91
10%21.0322.0628.3321.5521.5821.0131.3521.30
Srivastava [5]100%65.6667.4379.0366.5377.2574.0085.9775.59
70%58.5553.1373.7055.7169.1164.4979.7266.72
50%39.9038.9360.8839.4150.8347.1972.0748.95
30%36.1129.0044.1332.1740.4034.8152.3337.40
10%19.9921.0427.6920.5020.7621.7131.7221.22
Jordan [46]100%65.1267.6479.7866.3676.7874.6486.2875.69
70%58.1653.7272.9855.8570.3364.0680.5967.05
50%39.5637.7760.2738.6450.9246.9672.3748.86
30%35.6028.9243.7831.9140.2333.9551.9236.83
10%19.9221.4627.9320.6620.5921.2631.6320.92
Ours100%66.1367.8780.3667.0078.2674.6686.5776.46
70%59.5454.4874.0857.0170.3764.7880.8167.58
50%40.2039.2161.0539.7151.9548.1572.5550.05
30%36.6229.3145.2132.9741.6235.3153.1438.47
10%21.3022.5129.1221.9121.9822.4132.1822.20
Table 33. Comparison of accuracies achieved by the proposed PI-CNN and the existing methods in the original and reduced datasets. The accuracies were achieved in the Plants and Fruits-360 datasets (unit: %).
Table 33. Comparison of accuracies achieved by the proposed PI-CNN and the existing methods in the original and reduced datasets. The accuracies were achieved in the Plants and Fruits-360 datasets (unit: %).
MethodsDatasetPlantsFruits-360
TPRPPVACCF1-ScoreTPRPPVACCF1-Score
Wang [14]100%89.7285.2995.6387.5194.4794.0497.8894.26
70%80.2576.3592.2378.3091.2488.7790.8790.01
50%62.5559.8584.1061.2072.2478.4982.9375.37
30%48.8940.7159.5144.8052.0645.2065.1648.63
10%29.6830.6040.6130.1438.8435.2447.5437.04
Shahi [1]100%89.3584.4794.7586.9193.5393.7497.1793.64
70%80.0476.3091.6678.1790.7788.5489.9489.66
50%62.3959.4884.0960.9472.0178.3582.2375.18
30%48.0339.9358.6743.9851.4544.5564.8848.00
10%28.7029.8640.3729.2838.3234.3046.7736.31
Srivastava [5]100%89.7485.9695.6087.8194.2693.1098.2593.68
70%79.9875.4792.5777.6690.9188.1590.6089.51
50%61.4359.7982.9760.6072.4077.9582.6275.07
30%49.5340.9358.9344.8251.6045.8464.3448.55
10%30.0031.0640.2830.5238.9535.7148.0037.26
Jordan [46]100%89.3485.2095.7187.2294.2293.8897.4394.05
70%79.5375.5992.5877.5191.6188.3490.5489.95
50%61.8458.8882.7760.3372.3078.3483.1075.20
30%49.3040.4160.0344.4152.0845.1265.0048.35
10%30.2430.3440.5030.2939.3735.3148.1137.23
Ours100%90.1086.1196.5288.1195.4094.3398.5994.87
70%80.4676.4092.8178.4391.7089.0091.8090.35
50%62.7160.2284.1061.4772.4778.6383.6375.55
30%49.7441.5060.3445.6252.6845.8665.4449.27
10%30.6631.3340.8531.0039.3736.1948.3137.78
Table 34. Processing time of the methods per image (unit: ms).
Table 34. Processing time of the methods per image (unit: ms).
MethodsProcessing Time
Image augmentation by PI-GAN45.92
Classification by PI-CNN51.85
Total97.77
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Batchuluun, G.; Nam, S.H.; Park, K.R. Deep Learning-Based Plant-Image Classification Using a Small Training Dataset. Mathematics 2022, 10, 3091. https://doi.org/10.3390/math10173091

AMA Style

Batchuluun G, Nam SH, Park KR. Deep Learning-Based Plant-Image Classification Using a Small Training Dataset. Mathematics. 2022; 10(17):3091. https://doi.org/10.3390/math10173091

Chicago/Turabian Style

Batchuluun, Ganbayar, Se Hyun Nam, and Kang Ryoung Park. 2022. "Deep Learning-Based Plant-Image Classification Using a Small Training Dataset" Mathematics 10, no. 17: 3091. https://doi.org/10.3390/math10173091

APA Style

Batchuluun, G., Nam, S. H., & Park, K. R. (2022). Deep Learning-Based Plant-Image Classification Using a Small Training Dataset. Mathematics, 10(17), 3091. https://doi.org/10.3390/math10173091

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop