A Novel Approach to Grade Cotton Aphid Damage Severity with Hyperspectral Index Reconstruction

Hu, Xiaohong; Qiao, Hongbo; Chen, Baogang; Si, Haiping

doi:10.3390/app12178760

by Xiaohong Hu *, Hongbo Qiao, Baogang Chen and Haiping Si

College of Information and Management Science, Henan Agricultural University, Zhengzhou 450002, China

* Author to whom correspondence should be addressed.

Appl. Sci. 2022, 12(17), 8760; https://doi.org/10.3390/app12178760

Submission received: 7 August 2022 / Revised: 24 August 2022 / Accepted: 29 August 2022 / Published: 31 August 2022

Abstract: As an important insect pest of cotton, aphids cause serious damage to cotton yield and quality worldwide and pose a significant risk of economic loss. Automatic detection of the pest damage level plays an important role in cotton field management. However, it is usually treated as a classification problem in machine learning, where the severity levels are taken as independent categories and the relationship between levels is not fully considered. To utilize the inherent relations among the different severity levels of cotton aphid damage, a novel approach based on spectral index reconstruction is proposed in this study. First, six types of initial spectral indices were reconstructed based on the healthy samples in the training set. Then, the severity sequences corresponding to the reconstructed initial spectral indices (RISIs) were sorted and compared with the ideal sequence. After the sequences most consistent with the ideal one were obtained, the ratio between the inter- and intra-level differences was calculated to select the sensitive RISI. The range of each severity level was then established from the thresholds between adjacent grades of the selected sensitive RISI, and these ranges were finally used to determine the severity level of cotton aphid damage. The results showed that the proposed approach achieved a grading performance of OA = 0.944, AA = 0.900, and Kappa coefficient = 0.928. Hence, the proposed approach based on hyperspectral index reconstruction is effective and has potential application in grading the aphid infestation severity of cotton.

Keywords: cotton aphid; data reconstruction; hyperspectral indices; grade relation

1. Introduction

Cotton is one of the dominant commercial crops in major cotton-producing countries and regions such as China, the United States, India, Pakistan, Uzbekistan, and West Africa [1,2]. It is not only a raw material for the textile industry, but also plays an important role in the chemical industry. The growth and production of cotton are significant to economic and social development. However, cotton disease infection is one of the main factors affecting the profitability and sustainability of agricultural management [3], so it is of great importance to monitor cotton health conditions in a timely manner and to detect the infection severity precisely. Among all cotton diseases and insect pests, cotton aphids have attracted increasing attention. The cotton aphid is one of the most destructive sucking pests in the world, and its population increases rapidly [4,5]. The infestation of cotton aphids causes damage through direct feeding, virus transmission, and honeydew contamination. Once aphid infestation occurs, the cotton leaves curl downward and take on a crinkled appearance. Heavy damage may decrease photosynthesis, resulting in stunted seedlings and yield losses. Insecticide application is an effective method of cotton aphid control; however, the overuse of insecticides can lead to control failure and even environmental pollution such as soil contamination. Therefore, it is urgent to monitor and control aphid infestation more effectively and efficiently so that prevention and control measures can be formulated to reduce economic losses.

The traditional ground-based survey of plant infestation usually suffers from high labor costs, low efficiency, and subjective human error, which makes it difficult to accurately estimate the infected area and severity at a large scale. The most common strategy in such surveys is visual inspection by experienced producers, who can identify subtle changes in plant phenotype such as plant color and leaf curl, and scout the infected area of the crop. In comparison, remote sensing technology can collect the spectral information of plant growth over a large area at a relatively low cost [6,7,8]. The stresses induced by pests and diseases affect photosynthesis and the physical structure of plants, which alters some characteristic bands among the different severity levels of pests and diseases [9,10,11,12]. Hence, the crop damage severity caused by diseases and insect pests can be identified with the sensitive wavelengths. On this basis, a spectral index (SI) built from the sensitive bands is closely related to the physiological and biochemical processes of crop infection.

A typical severity identification of crop infestation based on hyperspectral remote sensing technology mainly includes the steps of sensitivity feature selection and classification model determination. In the step of sensitivity feature selection, the features can be reflectance or transformed reflectance data (derivative, continuum removed, etc.) and the vegetation index [13], in which index construction is a simple and effective method and numerous indices have been formulated to detect the infection of plants by using remote sensing technologies [14]. Then, the extracted features are usually input into the classical machine learning-based classifier to determine the crop health status. The commonly used classification methods in the model determination step include random forest (RF) [15], support vector machine (SVM) [16], K-nearest neighbor (KNN) [17], etc. These classification models are usually trained on a labeled dataset and the labels of the test dataset are determined by supervised learning. For example, the iterative self-organizing data analysis techniques algorithm (ISODATA) was applied to recognize root rot infested areas of cotton [18]. Zhao Hengqian et al. [19] proposed an automatic crop disease severity classification method based on vegetation index normalization with six typical vegetation indices on cotton fields infected with root rot. Elhadi et al. [20] developed a phaeosphaeria leaf spot identification method based on random forest, which could be used for early disease detection in maize fields with the extracted six key features. Nagasubramanian et al. [21] used genetic algorithms (GA) to identify the optimal combination of six bands from 240 hyperspectral bands, and then classified the combination with SVM for the early-stage detection of charcoal rot disease in soybean stems. Mirik et al. [22] adopted the maximum likelihood algorithm to classify Landsat 5 Thematic Mapper (TM) images in Texas to discriminate the wheat streak mosaic virus. 
In particular, Feng Wei et al. [23] established an optimum dual-green vegetation index to improve the detection of wheat powdery mildew, which provided a new idea and analytical method for the spectral monitoring of wheat diseases. Their results also indicated that including the spectra of healthy wheat when constructing the dual-green vegetation index significantly improved the disease estimation precision compared with the optimal common vegetation index. Lili Luo et al. [24] constructed a sensitive index for measuring the spectral differences between healthy leaves and leaves infected by the maize dwarf mosaic virus (MDMV), and developed a classification model integrating linear discriminant analysis (LDA) with SVM to detect the damage severity caused by MDMV.

Despite the promising results of previous studies, spot-level hyperspectral measurements have rarely been used to detect cotton aphids, and the crops are usually inoculated with diseases or insects, which can make the infection severity of the crops relatively uniform. However, in a natural environment, crop diseases and pests often originate in one place and then spread to other regions. Moreover, it is challenging to analyze and reconstruct hyperspectral indices so that they provide valuable information on the physiological and biochemical processes of pest and disease damage in plants. In addition, the identification of pest and disease severity levels in crops mainly relies on classification models. Research has paid more attention to selecting an effective model by comparing different classification models, but little effort has been made to analyze the inherent relations among the different classes or grades. Each severity class is treated as an independent class, even though the classes correspond to specific, ordered disease severity ranges. Ignoring the inter-class associations affects the classification performance and increases the complexity of the inter-class comparison. Therefore, to address these issues, we constructed six typical spectral indices from the full spectral bands, which were then reconstructed based on a comparison with healthy samples. Furthermore, a novel approach to select sensitive spectral indices was proposed, and a corresponding method of setting thresholds between adjacent disease levels, instead of selecting a classification model, was established to identify the severity of cotton aphid damage.

2. Materials and Methods

2.1. Experimental Site and Sampling

The experiment was performed in the Korla region (41°44′59″ N, 85°48′30″ E) of Xinjiang, China, a typical cotton production area located at the southern foot of the Tianshan Mountains and on the northeastern edge of the Tarim Basin. It has a temperate continental arid climate, with a total annual sunshine duration of 2990 h, an annual average temperature of 11.4 °C, a minimum temperature of −28 °C, an average annual precipitation of 58.6 mm, and an annual maximum evaporation of 2788.2 mm. Cotton was planted on a large scale with a simple planting structure, and cotton aphids were the main cotton pests. The field experiments were conducted at the budding stage from 3 July to 11 July 2019. No disease or insect inoculum was applied to the cotton fields, and no pesticide was applied until after data collection. During this period, sampling plots were selected randomly for the measurement of canopy hyperspectral reflectance and the corresponding aphid damage grade.

2.2. Data Collection

2.2.1. Spectral Measurements

All spectral measurements were taken from a height of 50 cm above the canopy under a clear sky with minimal or no wind between 12:00 and 16:00 (Beijing local time) using a Field Spec HandHeld spectrometer. The spectral measurement range was 325 to 1075 nm with a resolution of 1.4 nm, and the canopy spectra were averaged over ten scans. Before and after each sample measurement, the sensor was calibrated for baseline reflectance using a white polytetrafluoroethylene panel. To reduce the impact of environmental conditions, measurements were taken at randomly selected sample points, each covering 4 m² (2 m × 2 m), and the average value was used as the representative spectral reflectance of the sample point. In this study, a total of 61 field-measured samples were obtained and used to verify the effectiveness of the proposed method. The samples were randomly split into training (75%) and testing (25%) datasets.

2.2.2. Disease Severity Assessment

The investigation method was carried out according to the National Standard Pesticide Guidelines for the field efficacy trials, Fungicides against cotton aphid (GB/T15799-2011), which stipulates the requirements for judging the disease level in Table 1. The disease progression of each plant was calculated according to the percentage of infected leaves. As shown in Table 1, each selected plant was categorized into one of the five levels of cotton aphid damage severity from Grade 0 (healthy) and Grade 1 (mild infestation) to Grade 4 (severe damage).

Cotton samples in our study were selected from each root quadrat according to the traditional five-point sampling method (in the central area and the surrounding four corners). In order to comprehensively consider the incidence and damage severity caused by cotton aphids at the plot scale, the disease index (DI) was calculated with Equation (1) [25,26] and used as a comprehensive index to evaluate the incidence degree of cotton aphids in this study.

$$DI = \frac{\sum (\text{Number of leaves of each grade} \times \text{Disease grade})}{\text{Maximum disease grade} \times \text{Total number of leaves}} \tag{1}$$

The disease index was divided into five grades (G0–G4): G0 for healthy (0), G1 for mild (0–25%), G2 for moderate (25–50%), G3 for severe (50–75%), and G4 for profound (>75%).
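As a rough illustration of Equation (1) and the grade ranges above, the following Python sketch computes a disease index from per-grade leaf counts and maps it to G0–G4. The function names, the input layout, and the example counts are hypothetical, and the assignment of boundary values (exactly 25%, 50%, or 75%) to the lower grade is an assumption, since the text does not say which grade a boundary value belongs to.

```python
def disease_index(leaf_counts):
    """Disease index following Equation (1).

    leaf_counts[g] is the number of leaves rated at grade g, for
    g = 0 .. maximum grade (a hypothetical input layout).
    """
    max_grade = len(leaf_counts) - 1
    total_leaves = sum(leaf_counts)
    if max_grade < 1 or total_leaves == 0:
        raise ValueError("need at least two grades and one leaf")
    # Numerator: sum over grades of (leaf count x grade value).
    weighted = sum(grade * n for grade, n in enumerate(leaf_counts))
    return weighted / (max_grade * total_leaves)


def di_to_grade(di):
    """Map a DI in [0, 1] to G0..G4.

    Assumption: a value exactly on a boundary (0.25, 0.50, 0.75)
    is assigned to the lower grade.
    """
    if not 0.0 <= di <= 1.0:
        raise ValueError("DI must lie in [0, 1]")
    if di == 0.0:
        return 0
    for grade, upper in enumerate((0.25, 0.50, 0.75), start=1):
        if di <= upper:
            return grade
    return 4


# Made-up example: 20 leaves, 10 healthy, 5 at grade 1, 3 at grade 2, 2 at grade 3.
counts = [10, 5, 3, 2, 0]
di = disease_index(counts)
print(f"DI = {di:.4f}, grade = G{di_to_grade(di)}")
```

With the made-up counts above, the weighted sum is 17 and the denominator is 4 × 20 = 80, so the sketch reports a DI of 0.2125, which falls in G1.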

2.3. Construction and Reconstruction of the Initial Spectral Indices

Under disease stress, the physiological and biochemical characteristics as well as the apparent morphology of crops vary, which in turn changes the spectral indices. Therefore, the spectral response of a crop can be regarded as a function of pigment, water, morphology, and structure changes [27,28,29]. An SI formulated with the sensitive bands can reflect the fluctuation of the spectral response and thus detect the disease in light of the pathological mechanism [30,31]. In this study, we constructed the SIs in two stages. The first stage was the construction of the initial SIs, where all possible band combinations of six types of hyperspectral SIs, comprising three two-band SIs and three three-band SIs, were constructed to simplify the spectral detection of cotton aphid damage. In the second stage, these initial SIs were reconstructed with the healthy samples to evaluate the cotton aphid damage effectively.

In the first stage, three types of two-band SIs, namely the differential spectral index (DSI), the ratio spectral index (RSI), and the normalized difference spectral index (NDSI), were calculated as the two-band initial indices, and three types of three-band SIs, namely the improved differential spectral index (IDSI), the improved ratio spectral index (IRSI), and the photochemical spectral index (PSI), were constructed as the three-band initial indices based on the raw spectral response over the full band range. These were formulated as follows:

$$DSI(\lambda_i, \lambda_j) = R_{\lambda_i} - R_{\lambda_j} \tag{2}$$

$$RSI(\lambda_i, \lambda_j) = \frac{R_{\lambda_i}}{R_{\lambda_j}} \tag{3}$$

$$NDSI(\lambda_i, \lambda_j) = \frac{R_{\lambda_i} - R_{\lambda_j}}{R_{\lambda_i} + R_{\lambda_j}} \tag{4}$$

$$IDSI(\lambda_i, \lambda_j, \lambda_k) = R_{\lambda_i} - R_{\lambda_j} + R_{\lambda_k} \tag{5}$$

$$IRSI(\lambda_i, \lambda_j, \lambda_k) = \frac{R_{\lambda_i}}{R_{\lambda_j} + R_{\lambda_k}} \tag{6}$$

$$PSI(\lambda_i, \lambda_j, \lambda_k) = \frac{R_{\lambda_i} - R_{\lambda_j}}{R_{\lambda_i} + R_{\lambda_k}} \tag{7}$$

In Equations (2)–(4), $\lambda_i, \lambda_j$ were all possible two-band combinations in the range of 325–1075 nm, and $R_{\lambda_i}$ and $R_{\lambda_j}$ were the reflectance values at $\lambda_i$ and $\lambda_j$, respectively. In Equations (5)–(7), $\lambda_i, \lambda_j, \lambda_k$ were all possible three-band combinations, and $R_{\lambda_i}$, $R_{\lambda_j}$, $R_{\lambda_k}$ were the reflectance values at $\lambda_i$, $\lambda_j$, $\lambda_k$, respectively.
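The six index definitions in Equations (2)–(7) can be sketched in Python as below. This is only an illustrative sketch: the spectrum is represented as a dictionary from wavelength (nm) to reflectance, the reflectance values are made up, and the helper names are hypothetical. Band combinations are enumerated as ordered tuples, which is an assumption motivated by the fact that the indices are not symmetric in their bands; bands with zero reflectance in a denominator are not handled.

```python
from itertools import permutations


def dsi(r, i, j):
    """Differential spectral index, Equation (2)."""
    return r[i] - r[j]


def rsi(r, i, j):
    """Ratio spectral index, Equation (3)."""
    return r[i] / r[j]


def ndsi(r, i, j):
    """Normalized difference spectral index, Equation (4)."""
    return (r[i] - r[j]) / (r[i] + r[j])


def idsi(r, i, j, k):
    """Improved differential spectral index, Equation (5)."""
    return r[i] - r[j] + r[k]


def irsi(r, i, j, k):
    """Improved ratio spectral index, Equation (6)."""
    return r[i] / (r[j] + r[k])


def psi(r, i, j, k):
    """Photochemical spectral index, Equation (7)."""
    return (r[i] - r[j]) / (r[i] + r[k])


def two_band_combinations(r):
    """All ordered wavelength pairs (assumption: order matters)."""
    return list(permutations(sorted(r), 2))


def three_band_combinations(r):
    """All ordered wavelength triples (assumption: order matters)."""
    return list(permutations(sorted(r), 3))


# Made-up canopy reflectance at a handful of wavelengths (nm).
spectrum = {643: 0.05, 656: 0.04, 712: 0.20, 732: 0.35, 878: 0.50}
print("RSI(656, 643)       =", rsi(spectrum, 656, 643))
print("IRSI(732, 878, 712) =", irsi(spectrum, 732, 878, 712))
print("two-band tuples     =", len(two_band_combinations(spectrum)))
```

At a 1 nm interval over several hundred bands, the number of three-band tuples grows cubically, so an exhaustive search of this kind would in practice need a vectorized or batched implementation rather than plain Python loops.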

Let $m$ be the number of training samples. After the initial SIs were constructed and calculated for each training sample, they were reconstructed based on the SI difference between the sample $x_i$ $(1 \leq i \leq m)$ and each healthy sample in the training dataset to highlight their possible difference. The RISI, defined as the mean absolute value of these differences, is given by Equation (8): the higher the RISI, the greater the difference from the healthy samples and the more serious the damage, and vice versa. To some extent, the influence of illumination, canopy, and leaf structure could be suppressed with the RISI.

$$RISI = \frac{1}{\mu} \sum_{t=1}^{\mu} \left| SI - SI_t \right| \tag{8}$$

where $SI$ is the initial SI of the sample $x_i$, $SI_t$ is the SI of the healthy sample $x_t$, and $\mu$ is the number of healthy samples in the training dataset.

According to Equation (8), the RISI sequence $a_1, a_2, \dots, a_m$ of the training samples can be obtained, where $a_i$ $(1 \leq i \leq m)$ represents the RISI of the training sample $x_i$. The sequence $a_1, a_2, \dots, a_m$ can be the sequence of the reconstructed DSI (RDSI), reconstructed RSI (RRSI), reconstructed NDSI (RNDSI), reconstructed IDSI (RIDSI), reconstructed IRSI (RIRSI), or reconstructed PSI (RPSI).
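The reconstruction in Equation (8) amounts to a mean absolute difference against the healthy training samples. A minimal Python sketch is given below; the function names and the example SI values are hypothetical. Reading Equation (8) literally, a healthy sample is also compared against itself (contributing a zero term), and the sketch follows that reading as an assumption.

```python
def risi(si_value, healthy_si_values):
    """Reconstructed index of one sample, following Equation (8):
    mean absolute difference to the SI of every healthy training sample."""
    if not healthy_si_values:
        raise ValueError("at least one healthy training sample is required")
    total = sum(abs(si_value - h) for h in healthy_si_values)
    return total / len(healthy_si_values)


def risi_sequence(si_values, grades):
    """RISI a_1..a_m for all training samples.

    si_values[i] is the initial SI of sample i and grades[i] its severity
    grade; grade 0 marks the healthy samples used as the reference set.
    """
    healthy = [s for s, g in zip(si_values, grades) if g == 0]
    return [risi(s, healthy) for s in si_values]


# Made-up initial SI values for four training samples (two healthy).
si_values = [0.50, 0.52, 0.60, 0.70]
grades = [0, 0, 1, 2]
for value, grade in zip(risi_sequence(si_values, grades), grades):
    print(f"grade {grade}: RISI = {value:.3f}")
```

For a test sample, the same `risi` function would be called with the healthy samples of the training set as the reference, so no healthy test samples are needed at grading time.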

2.4. Selection of the Sensitive RISI

Denote $s_1, s_2, \dots, s_m$ as the sequence $a_1, a_2, \dots, a_m$ sorted in ascending order, and let its corresponding cotton aphid severity levels be $g_1, g_2, \dots, g_m$ with $g_i \in \{0, 1, 2, 3, 4\}$. The RISI indicates the difference between a sample and all of the healthy samples in the training dataset. The lower the severity level of a sample, the closer it is to the healthy samples and the earlier it appears in the ideal level sequence, while a higher severity level places the sample later in the ideal sequence. Therefore, denote $m_0$, $m_1$, $m_2$, $m_3$, $m_4$ as the numbers of training samples at levels 0, 1, 2, 3, and 4, respectively; the ideal disease level sequence of $s_1, s_2, \dots, s_m$ can then be formulated as $0, 0, \dots, 0, 1, 1, \dots, 1, 2, 2, \dots, 2, 3, 3, \dots, 3, 4, 4, \dots, 4$, where there are $m_0$ samples with G0, followed by $m_1$ samples with G1, $m_2$ samples with G2, $m_3$ samples with G3, and $m_4$ samples with G4. Then, the consistency $KI$ between $g_1, g_2, \dots, g_m$ and the ideal disease level sequence is represented as follows:

$$KI = \frac{\rho_o - \rho_e}{1 - \rho_e} \tag{9}$$

where $\rho_o = \sum_{r=0}^{4} \frac{\psi_r}{m}$ and $\rho_e = \frac{\sum_{r=0}^{4} m_r^2}{m^2}$; $\psi_r$ is the number of samples with grade $r$ at the same position in both sequences; and $m$ is the number of samples in the training dataset. According to Equation (9), the KIs of all possible band combinations of RDSI, RRSI, RNDSI, RIDSI, RIRSI, and RPSI were calculated.
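To make Equation (9) concrete, the following Python sketch sorts the samples by RISI, compares the resulting grade sequence position by position with the ideal (sorted) grade sequence, and applies the kappa-style correction. The function name and example values are hypothetical. How ties between equal RISI values are ordered is not specified in the text; the sketch relies on Python's stable sort, which is an assumption.

```python
from collections import Counter


def consistency_ki(risi_values, grades):
    """Consistency KI of Equation (9) for one candidate index.

    risi_values[i] and grades[i] belong to training sample i.
    """
    m = len(grades)
    if m == 0 or len(risi_values) != m:
        raise ValueError("risi_values and grades must be non-empty and aligned")
    # Grade sequence g_1..g_m obtained by sorting samples by ascending RISI.
    order = sorted(range(m), key=lambda idx: risi_values[idx])
    observed = [grades[idx] for idx in order]
    # Ideal sequence: m_0 zeros, then m_1 ones, and so on.
    ideal = sorted(grades)
    # rho_o: share of positions where both sequences carry the same grade.
    rho_o = sum(1 for o, d in zip(observed, ideal) if o == d) / m
    # rho_e: sum of squared grade counts over m squared.
    rho_e = sum(n * n for n in Counter(grades).values()) / (m * m)
    if rho_e == 1.0:
        raise ValueError("KI is undefined when all samples share one grade")
    return (rho_o - rho_e) / (1.0 - rho_e)


grades = [0, 0, 1, 2, 3]
well_ordered = [0.01, 0.02, 0.09, 0.19, 0.30]   # RISI increases with grade
one_swap = [0.01, 0.09, 0.02, 0.19, 0.30]       # a G0 and a G1 sample swap places
print("KI (well ordered):", consistency_ki(well_ordered, grades))
print("KI (one swap):    ", round(consistency_ki(one_swap, grades), 3))
```

In the second example two of the five positions disagree with the ideal sequence, so $\rho_o = 0.6$ and $\rho_e = 7/25 = 0.28$, giving a KI of about 0.444, while the perfectly ordered index reaches the maximum of 1.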

After the calculation of the KIs for all RISIs, the sensitive band combination for each type of RISI was selected as the one with the maximum KI, that is, the band combination whose severity grade sequence was most similar to the ideal sequence was taken as the sensitive band combination of RDSI, RRSI, RNDSI, RIDSI, RIRSI, and RPSI, respectively. If the maximum KI was obtained for more than one band combination, the ratio between the inter- and intra-grade differences was used to choose among them, which is defined as:

$$\eta = \frac{\sum_{r=1}^{4} (\alpha_r - \beta_{r-1})}{\sum_{r=0}^{4} \sigma_r} \tag{10}$$

where $\alpha_r$ is the minimum RISI of grade $r$ $(0 \leq r \leq 4)$ in a sequence $s_1, s_2, \dots, s_m$ that reached the maximum KI, and $\beta_r$ and $\sigma_r$ are the maximum and the standard deviation of the RISI of grade $r$ in the sequence, respectively. The numerator of $\eta$ describes the boundary difference between adjacent grades: the larger the value, the stronger the discrimination. The denominator of $\eta$ denotes the intra-grade difference: the smaller the denominator, the stronger the discrimination. Therefore, if more than one band combination reached the maximum KI, the sensitive RISI with the largest $\eta$ was selected for RDSI, RRSI, RNDSI, RIDSI, RIRSI, and RPSI, respectively.
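The tie-breaking ratio in Equation (10) can be sketched as follows. The function name and sample values are hypothetical. The text does not state whether $\sigma_r$ is the population or the sample standard deviation; the sketch uses the population form (`statistics.pstdev`) as an assumption, which also stays defined when a grade has a single sample.

```python
from statistics import pstdev


def separation_ratio(risi_values, grades, n_grades=5):
    """Inter-/intra-grade ratio eta of Equation (10)."""
    by_grade = {
        r: [v for v, g in zip(risi_values, grades) if g == r]
        for r in range(n_grades)
    }
    if any(not values for values in by_grade.values()):
        raise ValueError("every grade needs at least one training sample")
    # Numerator: gap between the minimum of grade r (alpha_r) and the
    # maximum of grade r-1 (beta_{r-1}), summed over adjacent grades.
    gaps = sum(
        min(by_grade[r]) - max(by_grade[r - 1]) for r in range(1, n_grades)
    )
    # Denominator: sum of the within-grade standard deviations (sigma_r).
    spread = sum(pstdev(by_grade[r]) for r in range(n_grades))
    if spread == 0:
        raise ValueError("eta is undefined when all grades have zero spread")
    return gaps / spread


# Made-up RISI values: two samples per grade, well separated.
risi_values = [0.00, 0.02, 0.10, 0.12, 0.20, 0.22, 0.30, 0.32, 0.40, 0.42]
grades = [0, 0, 1, 1, 2, 2, 3, 3, 4, 4]
print("eta =", round(separation_ratio(risi_values, grades), 3))
```

In this example each adjacent gap is 0.08 and each within-grade standard deviation is 0.01, so $\eta = 0.32 / 0.05 = 6.4$. A candidate with wider gaps or tighter grades would score higher and be preferred among combinations that tie on KI.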

2.5. Cotton Aphid Severity Grading

Suppose that the optimal RISI sorted in ascending order is $o_1, o_2, \dots, o_m$, the minimum RISI of grade $r$ is $\alpha_r$, and the maximum RISI of grade $r-1$ is $\beta_{r-1}$. Then the division threshold $\theta_r$ between grades $r-1$ and $r$ is given by Equation (11):

$$\theta_r = \frac{\alpha_r + \beta_{r-1}}{2} \quad (1 \leq r \leq 4) \tag{11}$$

For each pair of adjacent grades, a sample whose optimal RISI is greater than or equal to the threshold $\theta_r$ is assigned to grade $r$; otherwise, it is assigned to grade $r-1$. The thresholds $\theta_1, \theta_2, \theta_3, \theta_4$ therefore divide the RISI into five intervals, as shown in Figure 1.

The cotton aphid severity of each test sample was graded based on the level division thresholds of the optimal band index. First, the initial SI of the test sample was calculated according to the selected optimal spectral index type and band combination, and it was then reconstructed with the healthy training sample set $H^{L}$ according to Equation (8). Finally, the RISI of the test sample was compared with the thresholds $\theta_1, \theta_2, \theta_3, \theta_4$ calculated with Equation (11) to determine the interval into which it fell, and thus the severity level of the cotton aphid damage was obtained.
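The threshold construction of Equation (11) and the interval lookup used for grading can be sketched in Python as below. The function names and the training values are hypothetical. The lookup assumes the thresholds are in ascending order and treats a value equal to a threshold as belonging to the higher grade, following the "greater than or equal to" rule stated in this section.

```python
from bisect import bisect_right


def grade_thresholds(risi_values, grades, n_grades=5):
    """Thresholds theta_1..theta_4 of Equation (11) from training data.

    Raises ValueError (from min/max) if a grade has no training sample.
    """
    thresholds = []
    for r in range(1, n_grades):
        alpha_r = min(v for v, g in zip(risi_values, grades) if g == r)
        beta_prev = max(v for v, g in zip(risi_values, grades) if g == r - 1)
        thresholds.append((alpha_r + beta_prev) / 2)
    return thresholds


def assign_grade(risi_value, thresholds):
    """Grade = number of thresholds that are <= the sample's RISI.

    Assumes thresholds are sorted in ascending order.
    """
    return bisect_right(thresholds, risi_value)


# Made-up training RISI values, two samples per grade.
train_risi = [0.00, 0.02, 0.10, 0.12, 0.20, 0.22, 0.30, 0.32, 0.40, 0.42]
train_grades = [0, 0, 1, 1, 2, 2, 3, 3, 4, 4]
thetas = grade_thresholds(train_risi, train_grades)
print("thresholds:", [round(t, 3) for t in thetas])

# Made-up test RISI values graded with the thresholds above.
for value in (0.01, 0.16, 0.50):
    print(f"RISI {value:.2f} -> G{assign_grade(value, thetas)}")
```

With the made-up training values, the thresholds fall midway between adjacent grades (0.06, 0.16, 0.26, 0.36), and the three test values are graded G0, G2, and G4. The same `assign_grade` call could be used with any ascending threshold list, for instance one read from a published table.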

3. Results

3.1. Evaluation Protocol

To evaluate the performance of the proposed approach, three commonly used evaluation criteria—overall accuracy (OA), average accuracy (AA), and kappa coefficient (Kc)—were adopted [32]. These are described in Equations (12)–(14) as shown below.

$$OA = \frac{\text{number of correctly classified samples}}{\text{total number of test samples}} \tag{12}$$

$$AA = \frac{\text{sum of class-wise accuracies}}{\text{number of classes}} \tag{13}$$

$$Kc = \frac{\text{total accuracy} - \text{random accuracy}}{1 - \text{random accuracy}} \tag{14}$$

Higher OA, AA, and Kc values represent a better performance of the approach. In this study, all of the computational analyses were performed on a laptop configured with an Intel Core i7-10510U CPU (four cores, eight threads) and 32 GB of RAM.
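The three criteria in Equations (12)–(14) can be computed from a confusion matrix, as in the Python sketch below. The function names and the example labels are hypothetical. Two details are assumptions rather than statements from the text: classes with no true samples in the test set are skipped when averaging the class-wise accuracies, and the random accuracy is taken as the usual chance agreement of Cohen's kappa (the sum over classes of row total times column total, divided by the squared sample count).

```python
def confusion_matrix(y_true, y_pred, n_classes=5):
    """cm[t][p] counts samples of true grade t predicted as grade p."""
    cm = [[0] * n_classes for _ in range(n_classes)]
    for t, p in zip(y_true, y_pred):
        cm[t][p] += 1
    return cm


def overall_accuracy(cm):
    """Equation (12): correctly graded samples over all samples."""
    n = sum(sum(row) for row in cm)
    return sum(cm[c][c] for c in range(len(cm))) / n


def average_accuracy(cm):
    """Equation (13): mean of the class-wise accuracies.

    Assumption: classes without any true sample are skipped.
    """
    per_class = [cm[c][c] / sum(cm[c]) for c in range(len(cm)) if sum(cm[c])]
    return sum(per_class) / len(per_class)


def kappa_coefficient(cm):
    """Equation (14), with random accuracy as Cohen's chance agreement."""
    k = len(cm)
    n = sum(sum(row) for row in cm)
    p_o = overall_accuracy(cm)
    row_totals = [sum(cm[c]) for c in range(k)]
    col_totals = [sum(cm[r][c] for r in range(k)) for c in range(k)]
    p_e = sum(row_totals[c] * col_totals[c] for c in range(k)) / (n * n)
    return (p_o - p_e) / (1.0 - p_e)


# Made-up test labels: one G3 sample is graded as G4.
y_true = [0, 0, 1, 1, 2, 2, 3, 3, 4, 4]
y_pred = [0, 0, 1, 1, 2, 2, 3, 4, 4, 4]
cm = confusion_matrix(y_true, y_pred)
print("OA =", round(overall_accuracy(cm), 3))
print("AA =", round(average_accuracy(cm), 3))
print("Kc =", round(kappa_coefficient(cm), 3))
```

In this made-up example nine of ten samples are graded correctly, so OA and AA are both 0.9; the chance agreement is 0.2, which gives a kappa coefficient of 0.875.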

3.2. KI for Two-Band Parameters

The KIs for all of the possible reconstructed two-band combinations of RDSI, RNDSI, and RRSI were analyzed at a 1 nm interval over the full range of 345–1075 nm; Figure 2 shows the KI contour maps. According to Figure 2, the band combinations with higher KI were extracted from the hotspot areas as the better spectral indices for grading the disease severity. Within the full wavelength range, there were large continuous hotspots in Figure 2a, whereas in Figure 2b,c the greatest KI values of 1 were confined to a small number of hotspots.

Among the three types of reconstructed two-band SIs, the maximum KI of RDSI was 0.935, which was reached by six pairs of red edge band combinations; RDSI (R702, R715) was taken as the sensitive RDSI of cotton aphid stress, with 702 and 715 nm both in the red edge region. For RNDSI and RRSI, the KI of most band combinations was below 0.7, while the KI of some band combinations in the 490–730 nm region reached the maximum of 1. The contour map of RNDSI was similar to that of RRSI. The number of band combinations whose KI reached 1 was 96 for RNDSI and 61 for RRSI. Because multiple band combinations reached the maximum KI, the band combinations with the largest $\eta$ according to Equation (10), RNDSI (R643, R656) and RRSI (R656, R643), were taken as the sensitive two-band indices of cotton aphid stress; RNDSI and RRSI had the same sensitive bands, 643 and 656 nm, which are in the red region. Moreover, RRSI (R656, R643) yielded the best results for both KI and $\eta$ among the possible two-band combinations of RDSI, RNDSI, and RRSI. The three contour maps all showed that the blue regions had the weakest correlation with KI.

3.3. KI for Three-Band Parameters

The 3D KI slice maps between the ideal damage level sequence $0, 0, \dots, 0, 1, 1, \dots, 1, 2, 2, \dots, 2, 3, 3, \dots, 3, 4, 4, \dots, 4$ and the damage level sequence $g_1, g_2, \dots, g_m$ obtained by sorting the three-band RISIs (RIDSI, RIRSI, and RPSI) in ascending order are shown in Figure 3, Figure 4 and Figure 5.

Among the three types of reconstructed three-band SIs, the maximum KI reached 1 for every type, and the number of band combinations whose KI reached 1 was 3162, 46,471, and 42,130 for RIDSI, RPSI, and RIRSI, respectively. The contour maps in Figure 3, Figure 4 and Figure 5 show that RIDSI had more band combinations with relatively high KI values than RPSI and RIRSI. According to the contour maps of RIDSI, RPSI, and RIRSI, the blue regions had a weaker correlation with DI, especially for RIDSI and RIRSI. According to Figure 4, the spectrum in the red edge and NIR regions clearly had a stronger correlation with DI for RIRSI. For the multiple band combinations whose KI reached the maximum value, the combinations with the largest $\eta$ according to Equation (10), RIDSI (R1051, R1049, R747), RPSI (R519, R438, R706), and RIRSI (R732, R878, R712), were taken as the sensitive band combinations for RIDSI, RPSI, and RIRSI; each includes red edge bands: 747 nm for RIDSI, 706 nm for RPSI, and 732 and 712 nm for RIRSI. This shows that the red edge region was strongly correlated with cotton aphid damage. Moreover, RIRSI (R732, R878, R712) yielded the best results for both KI and the inter- and intra-grade ratio $\eta$.

Among the six types of pest- and disease-specific indices tested, RIRSI (R732, R878, R712) had the highest KI and $\eta$, followed by RRSI (R656, R643). Therefore, RIRSI (R732, R878, R712) was selected as the sensitive spectral index band combination.

3.4. Adjacent Grade Thresholds for Sensitive RISIs

After RIRSI (R732, R878, R712) was selected as the most sensitive spectral index band combination from the six types of RISIs, the thresholds of the adjacent levels were calculated according to Equation (11). The thresholds of adjacent grades for the other five types of RISIs, namely the two-band indices RDSI (R702, R715), RNDSI (R643, R656), and RRSI (R656, R643) and the three-band indices RIDSI (R1051, R1049, R747) and RPSI (R519, R438, R706), were also calculated, as shown in Table 2, where $\theta_1, \theta_2, \theta_3, \theta_4$ represent the thresholds between G0 and G1, G1 and G2, G2 and G3, and G3 and G4, respectively. For each index, the threshold $\theta_i$ $(1 \leq i \leq 4)$ increased with $i$. Therefore, after reconstruction, the spectral index varied clearly with severity in the sensitive band combination, which made it easy to identify the disease level using the adjacent grade thresholds.

3.5. Grade Performance of Test Samples

The grading performance with the most sensitive band combination RIRSI (R732, R878, R712) was further tested along with RPSI (R519, R438, R706), RIDSI (R1051, R1049, R747), RNDSI (R643, R656), RRSI (R656, R643), and RDSI (R702, R715); the other five indices used the thresholds calculated according to Table 2. It is evident from Table 3 that the classification accuracy of cotton aphid severity with RIRSI (R732, R878, R712), chosen from six types of band indices, showed promising results, which were superior to all of the other five indices. The classification performance of RRSI (R656, R643) was essentially the same as RPSI (R519, R438, R706), while the performance for RIDSI (R1051, R1049, R747) was lower.

4. Discussion

With the virus transmission of aphids in a cotton field, ruinous damage of yield and cotton quality is inevitable, which causes huge economic loss and environmental pollution problems [3]. The automatic detection method of crop disease severity level has become increasingly important in the field of agricultural diseases and pests [2,7,10]. The crops have been treated with inoculum in most of the studies on crop diseases and pests [23,33], which has made it easier to detect the infected samples. In this study, the cotton field was in a natural environment, without any inoculum treatment, and the cotton aphids were the result of natural processes. Previous studies have explored the discrimination of cotton aphids at the leaf scale and showed a clear correlation between the leaf reflectance and cotton aphid DI [34]. This correlation provides a basis for the exploration of the automatic grading method of aphid damage on a larger scale. In this study, the natural environment and monitoring scale made the automatic grading method more challenging and valuable.

Traditionally, binary classifiers or threshold segmentation methods have been adopted to distinguish between healthy and infected samples, but not between multiple disease severity levels. Meanwhile, grading disease severity has been regarded as a multi-class classification problem in machine learning, in which the levels are assumed to be uncorrelated. However, the severity levels of crop diseases and insect pests are not independent classes, and disease severity classification based on existing machine learning classifiers usually has weak interpretability. The automatic disease severity grading approach proposed in this study could help to fill this gap. The proposed approach mined the inherent relations among the different severity levels with RISIs and calculated the level intervals, which gave it stronger interpretability than traditional multi-class classifiers, and the thresholds set between adjacent disease levels made it easier to determine the severity grades of diseases and pests. In grading the severity of cotton aphid damage, the damage severity could be divided into five ranges based on the sensitive RIRSI (R732, R878, R712), with 0.006, 0.018, 0.029, and 0.041 as the adjacent thresholds for interval division. When a multi-class classifier is used to grade cotton aphid damage, pairwise class discrimination is needed to optimize the classifier parameters, that is, 10 inter-class discriminations, and it is difficult to obtain the level ranges of high-dimensional data. In contrast, the proposed method only distinguished between adjacent levels, so the thresholds and level ranges could be acquired with five discriminations.

The large number of contiguous narrow bands used in disease detection results in data redundancy, which increases the difficulty of data processing [19]. It is therefore important to develop a method for selecting sensitive bands or spectral indices and to use a limited number of wavebands that are strongly correlated with crop diseases when grading the damage severity level. In this study, it was difficult to detect the cotton aphid severity level due to the large number of hyperspectral wavebands in the full range of 345–1075 nm. The complex problem of sensitive waveband selection was converted into distance ranking based on spectral indices reconstructed with healthy samples, which is a major difference between this study and other studies on the severity classification of disease and infection [14]. The proposed dimension reduction method by selecting the sensitive waveband was evaluated using three types of hyperspectral indices on all possible two- and three-band combinations. As shown in Table 3, among the sensitive band combinations selected from each type of spectral index, the minimum OA, AA, and Kc values were 77.8%, 72%, and 0.714, respectively, and the best performance was OA = 94.4%, AA = 90%, and Kc = 0.928. This strong grading performance shows that the proposed sensitive band selection method could be used to grade the damage severity of cotton aphids. Additionally, in [23,24], the difference between the healthy and infected samples was introduced in disease detection, which contributed to the improvement in the detection results.
In this study, the distance from the healthy samples was computed, which not only highlighted the difference between the infected and healthy samples, but also suppressed the growing conditions they shared, such as management practice, weather, and soil conditions. Thus, the distance between healthy cotton and cotton infected by cotton aphids has great potential to eliminate the influence of the growing conditions and improve the disease severity grading performance.

The relationship between NDSI, RSI, and aphid infestation severity was examined with a traditional linear regression method at the leaf scale in [34]. As shown in the contour maps of that study, NDSI of 600–1100 nm and 1400–2000 nm and RSI of 1850–2100 nm were “hotspots” with high coefficients, and NDSI (678, 1471) and RSI (1975, 1904) were determined from these sensitive regions. However, to be consistent with Landsat Thematic Mapper spectral bands 1–5, the 690–760 nm range was not analyzed in their study, although it is exactly the red edge region, which is closely related to many plant diseases and insect pests [23,24]. In contrast, as shown in Figure 3, Figure 4 and Figure 5, the sensitive spectral index RIRSI (R732, R878, R712) in our study was derived from the full range of 345–1075 nm at a 1 nm interval, and it includes the 732 and 712 nm bands in the red edge area. Figure 2b,c shows the contour maps of RNDSI and RRSI, which were based on the traditional NDSI and RSI. Both contour maps had a similar distribution of “hotspots”: the three edge regions (red edge, yellow edge, and blue edge) as well as 850–900 nm obtained greater KI values. The red areas showed the highest correlation, while the blue areas showed the lowest, which was in accordance with [34]. However, the optimal bands of RNDSI and RRSI selected in this study were not the same as those in [34], which might be due to the following reasons. (1) The authors in [34] regarded disease severity grading as a regression problem and their contour maps corresponded to the continuous disease index, while the contour maps in Figure 2 in our study corresponded to the discrete ideal sequence. (2) Compared to the leaf level hyperspectral measurements in their study, the spot level was adopted in this study, and the contour maps were at different scales. (3) In their study, the red edge regions were not considered.
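
The contour maps discussed above come from scoring every two-band combination. A minimal sketch of such an exhaustive search is given below. It assumes a Kendall-style rank agreement as a stand-in for the KI defined in Section 2.4, and an absolute distance from the healthy mean as the reconstruction; both are illustrative assumptions, and all function names and spectra are hypothetical.

```python
from itertools import combinations

def rank_agreement(values, grades):
    """Kendall-style agreement between index values and severity grades.

    Only sample pairs with different grades are compared; +1 means the index
    orders all such pairs like the grades, -1 means fully reversed.
    """
    concordance, pairs = 0, 0
    for (v1, g1), (v2, g2) in combinations(zip(values, grades), 2):
        if g1 == g2:
            continue
        pairs += 1
        product = (v1 - v2) * (g1 - g2)
        concordance += (product > 0) - (product < 0)
    return concordance / pairs if pairs else 0.0

def ndsi(r_a, r_b):
    """Normalized difference spectral index of two reflectances."""
    return (r_a - r_b) / (r_a + r_b)

def best_band_pair(spectra, grades, bands):
    """Score every band pair and return (score, (band_a, band_b)).

    spectra: list of dicts mapping band (nm) to reflectance; grade 0 = healthy.
    """
    best = None
    for a, b in combinations(bands, 2):
        index = [ndsi(s[a], s[b]) for s in spectra]
        healthy = [v for v, g in zip(index, grades) if g == 0]
        baseline = sum(healthy) / len(healthy)
        distance = [abs(v - baseline) for v in index]  # assumed reconstruction
        score = rank_agreement(distance, grades)
        if best is None or score > best[0]:
            best = (score, (a, b))
    return best

# Hypothetical spectra, for illustration only.
spectra = [
    {643: 0.050, 656: 0.050, 702: 0.100},
    {643: 0.052, 656: 0.052, 702: 0.104},
    {643: 0.050, 656: 0.055, 702: 0.110},
    {643: 0.050, 656: 0.060, 702: 0.100},
]
grades = [0, 0, 1, 2]
best = best_band_pair(spectra, grades, [643, 656, 702])
print(best)
```

With 1 nm sampling over 345–1075 nm, the same loop covers several hundred thousand band pairs, and plotting the score of each pair over the two band axes gives a contour map of the kind shown in Figure 2.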

After the spectral index reconstruction, the sensitive spectral indices RDSI (R702, R715), RNDSI (R643, R656), RRSI (R656, R643), RIDSI (R1051, R1049, R747), RPSI (R519, R438, R706), and RIRSI (R732, R878, R712) were selected from each type of RISI over all possible band combinations. RRSI (R656, R643) obtained the best performance among the two-band combinations, with OA = 88.9%, AA = 80%, and Kc = 0.855, and RIRSI (R732, R878, R712) showed the best performance overall, with OA = 94.4%, AA = 90%, and Kc = 0.928. This demonstrated that the ratio-type RISIs, including the improved ratio RISI, had a stronger correlation with the damage severity levels than the other types of RISIs. Additionally, the red edge is indicative of crop growth and nutrition status [23]. In this study, the reflectance of the red-edge region was found to be highly correlated with cotton aphid damage.

Although the method achieved good grading results on cotton aphids, the number of samples in this study was limited. In future research, a larger number of samples will be collected under more complex conditions, such as longer time periods, various regions and crops, and different diseases and pests, and more types of SIs will be used to further evaluate and improve the performance of the grading method.

5. Conclusions

In this study, a stress grading method based on spectral index reconstruction was proposed, which automatically selects one spectral index to reduce the dimension of contiguous narrow bands and sets the thresholds between adjacent levels with RISIs. Compared with traditional grading methods based on multi-class classifiers, it showed stronger interpretability. The performance was evaluated using six types of hyperspectral SIs on a cotton field infected with aphids at the plot scale, and the results demonstrated that the spectral index distance between healthy cotton and cotton infected by aphids had great potential in detecting the bands sensitive to cotton aphids. In addition, instead of the traditional pairwise class discrimination, disease severity grading can be simplified to a comparison between adjacent levels. Furthermore, the applicability of the proposed approach was established based on three types of two-band SIs and three types of three-band SIs, among which the three-band RIRSI (R732, R878, R712) achieved the best grading performance, and the red-edge region was found to be sensitive to cotton aphids. The proposed approach provides a new way to automatically grade diseases and pests through remote sensing technology.

Author Contributions

Conceptualization, X.H.; Methodology, X.H.; Validation, H.Q. and B.C.; Formal analysis, X.H. and H.S.; Investigation, H.Q.; Resources, H.Q.; Data curation, B.C.; Writing—original draft preparation, X.H.; Writing—review and editing, B.C.; Visualization, H.S.; Supervision, H.S.; Project administration, H.Q.; Funding acquisition, H.Q. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Joint Funds of the National Natural Science Foundation of China (U2003119).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

References

1. Prabhakar, M.; Prasad, Y.G.; Vennila, S.; Thirupathi, M.; Sreedevi, G.; Rao, G.R.; Venkateswarlu, B. Hyperspectral indices for assessing damage by the solenopsis mealybug (Hemiptera: Pseudococcidae) in cotton. Comput. Electron. Agric. 2013, 97, 61–70.
2. Ramos, A.P.M.; Gomes, F.D.G.; Pinheiro, M.M.F.; Furuya, D.E.G.; Gonçalvez, W.N.; Junior, J.M.; Michereff, M.F.F.; Blassioli-Moraes, M.C.; Borges, M.; Alaumann, R.A.; et al. Detecting the attack of the fall armyworm (Spodoptera frugiperda) in cotton plants with machine learning and spectral measurements. Precis. Agric. 2022, 23, 470–491.
3. Aggarwal, P.K.; Kalra, N.; Chander, S.; Pathak, H. InfoCrop a generic simulation model for assessment of crop yields, losses due to pests and environmental impact of agro-ecosystems in tropical environments Model description. Agric. Syst. 2006, 89, 1–25.
4. Arshad, M.; Suhail, A. Studying the sucking insect pests community in transgenic Bt cotton. Int. J. Agric. Biol. 2010, 12, 764–768.
5. Reisig, D.; Godfrey, L. Remotely sensing arthropod and nutrient stressed plants: A case study with nitrogen and cotton aphid (Hemiptera: Aphididae). Environ. Entomol. 2010, 39, 1255–1263.
6. Martinelli, F.; Scalenghe, R.; Davino, S.; Panno, S.; Scuderi, G.; Ruisi, P.; Villa, P.; Stroppiana, D.; Boschetti, M.; Goulart, L.R.; et al. Advanced methods of plant disease detection. A review. Agron. Sustain. Dev. 2015, 35, 1–25.
7. Zhang, J.; Huang, Y.; Pu, R.; Gonzalez-Moreno, P.; Yuan, L.; Wu, K.; Huang, W. Monitoring plant diseases and pests through remote sensing technology: A review. Comput. Electron. Agric. 2019, 165, 104943.
8. Mahlein, A.K. Plant disease detection by imaging sensors-parallels and specific demands for precision agriculture and plant phenotyping. Plant Dis. 2015, 100, 241–251.
9. Zhang, L.; Zhao, J.; Jia, K.; Li, X. Plant Spectral Discrimination Based on Phenological Features. Spectrosc. Spectr. Anal. 2015, 35, 2836–2840.
10. Li, W. Advances in application of hyperspectral remote sensing technology in monitoring agricultural pests and diseases. Mod. Agric. Sci. Technol. 2019, 14, 126–128.
11. Zhang, J.; Yuan, L.; Wang, J.; Luo, J.; Du, S.; Huang, W. Advances in remote sensing monitoring of crop diseases and insect pests. Trans. Chin. Soc. Agric. Eng. 2012, 28, 1–11.
12. Zhang, N.; Yang, G.; Zhao, C.; Zhang, J.; Yang, X.; Pan, Y.; Huang, W.; Xu, B.; Li, M. Progress and prospect of hyperspectral remote sensing for crop pests and diseases. J. Remote Sens. 2021, 25, 403–422.
13. Shi, Y.; Huang, W.J.; Gonzalez-Moreno, P.; Luke, B.; Dong, Y.Y.; Zheng, Q.; Ma, H.Q.; Liu, L.Y. Wavelet-based rust spectral feature set (WRSFs): A novel spectral feature set based on continuous wavelet transformation for tracking progressive host-pathogen interaction of yellow rust on wheat. Remote Sens. 2018, 10, 525.
14. Zhang, N.; Yang, G.J.; Pan, Y.C.; Yang, X.D.; Chen, L.P.; Zhao, C.J. A review of advanced technologies and development for hyperspectral-based plant disease detection in the past three decades. Remote Sens. 2020, 12, 3188.
15. Dhau, I.; Adam, E.; Mutanga, O.; Ayisi, K.; Abdel-Rahman, E.M.; Odindi, J.; Masocha, M. Testing the capability of spectral resolution of the new multispectral sensors on detecting the severity of grey leaf spot disease in maize crop. Geocarto Int. 2018, 33, 1223–1236.
16. Meng, R.; Lv, Z.G.; Yan, J.B.; Chen, G.S.; Zhao, F.; Zeng, L.L.; Xu, B.Y. Development of spectral disease indices for southern corn rust detection and severity classification. Remote Sens. 2020, 12, 3233.
17. Golhani, K.; Balasundram, S.K.; Vadamalai, G.; Pradhan, B. A review of neural networks in plant disease detection using hyperspectral data. Inf. Process. Agric. 2018, 5, 354–371.
18. Yang, C.; Odvody, G.N.; Fernandez, C.J.; Landivar, J.A.; Minzenmayer, R.R.; Nichols, R.L. Evaluating unsupervised and supervised image classification methods for mapping cotton root rot. Precis. Agric. 2015, 16, 201–215.
19. Zhao, H.; Yang, C.; Guo, W.; Zhang, L.; Zhang, D. Automatic Estimation of Crop Disease Severity Levels Based on Vegetation Index Normalization. Remote Sens. 2020, 12, 1930.
20. Elhadi, A.; Houtao, D.; John, O.; Abdel-Rahman, E.M.; Onisimo, M. Detecting the Early Stage of Phaeosphaeria Leaf Spot Infestations in Maize Crop Using In Situ Hyperspectral Data and Guided Regularized Random Forest Algorithm. J. Spectrosc. 2017, 2017, 691387.
21. Nagasubramanian, K.; Jones, S.; Sarkar, S.; Singh, A.K.; Singh, A.; Ganapathysubramanian, B. Hyperspectral band selection using genetic algorithm and support vector machines for early identification of charcoal rot disease in soybean stems. Plant Methods 2018, 14, 86.
22. Mirik, M.; Agrilife, T.; Vernon, F.; Price, J.A.; Agrilife, T. Satellite Remote Sensing of Wheat Infected by Wheat streak mosaic virus. Plant Dis. 2011, 95, 4–12.
23. Wei, F.; Shen, W.Y.; Li, H.; Duan, J.Z.; Guo, B.B.; Li, Y.X.; Wang, C.Y.; Guo, T.C. Improved remote sensing detection of wheat powdery mildew using dual-green vegetation indices. Precis. Agric. 2016, 17, 608–627.
24. Luo, L.; Chang, Q.; Wang, Q.; Huang, Y. Identification and Severity Monitoring of Maize Dwarf Mosaic Virus Infection Based on Hyperspectral Measurements. Remote Sens. 2021, 13, 4560.
25. Cai, C.J.; Ma, Z.H.; Wang, H.G.; Zhang, Y.P.; Huang, W.J. Comparison research of hyperspectral properties between near-ground and high altitude of wheat strip rust. Acta Phytopathol. Sin. 2007, 37, 77–82.
26. Huang, W.; Lamb, D.W.; Niu, Z.; Zhang, Y.; Liu, L.; Wang, J. Identification of yellow rust in wheat using in-situ spectral reflectance measurements and airborne hyperspectral imaging. Precis. Agric. 2007, 8, 187–197.
27. Mahlein, A.; Rumpf, T.; Welke, P.; Dehne, H.W.; Plumer, L.; Steiner, U.; Oerke, E.C. Development of spectral indices for detecting and identifying plant diseases. Remote Sens. Environ. 2013, 128, 21–30.
28. Zheng, Q.; Huang, W.; Cui, X.; Dong, Y.; Shi, Y.; Ma, H.; Liu, L. Identification of Wheat Yellow Rust Using Optimal Three-Band Spectral Indices in Different Growth Stages. Sensors 2018, 19, 35.
29. Liu, L.; Dong, Y.; Huang, W.; Du, X.; Ren, B.; Huang, L.; Zheng, Q.; Ma, H. A Disease Index for Efficiently Detecting Wheat Fusarium Head Blight Using Sentinel-2 Multispectral Imagery. IEEE Access 2020, 8, 52181–52191.
30. Ashourloo, D.; Mobasheri, M.R.; Huete, A. Developing two spectral disease indices for detection of wheat leaf rust (Puccinia triticina). Remote Sens. 2014, 6, 4723–4740.
31. Devadas, R.; Lamb, D.W.; Simpfendorfer, S.; Backhouse, D. Evaluating ten spectral vegetation indices for identifying rust infection in individual wheat leaves. Precis. Agric. 2008, 10, 459–470.
32. Vaddi, R.; Manoharan, P. CNN based hyperspectral image classification using unsupervised band selection and structure-preserving spatial features. Infrared Phys. Technol. 2020, 110, 103457.
33. An, G.; Xing, M.; He, B.; Kang, H.; Shang, J.; Liao, C.; Huang, X.; Zhang, H. Extraction of Areas of Rice False Smut Infection Using UAV Hyperspectral Data. Remote Sens. 2021, 13, 3185.
34. Chen, T.; Zeng, R.; Guo, W.; Hou, X.; Lan, Y.; Zhang, L. Detection of Stress in Cotton (Gossypium hirsutum L.) Caused by Aphids Using Leaf Level Hyperspectral Measurements. Sensors 2018, 18, 2798.

Figure 1. The threshold division of cotton aphid severity.

Figure 2. The contour map of KI for the two-band combinations (a) RDSI, (b) RNDSI, (c) RRSI.

Figure 3. The 3D KI slice map of the ideal sequence and damage level sequence on RIDSI. (a) Vertical slice, (b) horizontal slice, (c) best slice.

Figure 4. The 3D KI slice map of the ideal sequence and damage level sequence on RIRSI. (a) Vertical slice, (b) horizontal slice, (c) best slice.

Figure 5. The 3D KI slice map of the ideal sequence and damage level sequence on RPSI. (a) Vertical slice, (b) horizontal slice, (c) best slice.

Table 1. The grading criteria for cotton aphid infestation.

Grade	Grading Standard
0	Healthy
1	Few aphids scattered over the plant. Foliage free from crinkling with no yellowing symptoms
2	Crinkling and curling of few leaves in the upper portion of plant
3	The most severely damaged leaves curl up to half circle or above, showing an arc
4	The most severely damaged leaves are completely curled and spherical

Table 2. The thresholds of the sensitive RISIs and band combinations.

Type	Name	Band Combination	θ₁	θ₂	θ₃	θ₄
Two Dimensions	RDSI	(R₇₀₂, R₇₁₅)	0.006	0.019	0.039	0.061
	RNDSI	(R₆₄₃, R₆₅₆)	0.003	0.017	0.027	0.033
	RRSI	(R₆₅₆, R₆₄₃)	0.006	0.031	0.048	0.062
Three Dimensions	RIDSI	(R₁₀₅₁, R₁₀₄₉, R₇₄₇)	0.018	0.052	0.122	0.189
	RPSI	(R₅₁₉, R₄₃₈, R₇₀₆)	0.006	0.022	0.025	0.034
	RIRSI	(R₇₃₂, R₈₇₈, R₇₁₂)	0.006	0.018	0.029	0.041
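
The thresholds in Table 2 can be turned into a grade with a simple interval lookup. The sketch below uses the RIRSI (R732, R878, R712) thresholds from Table 2 and assumes that a value below θ₁ is grade 0, a value in [θ₁, θ₂) is grade 1, and so on up to grade 4 at or above θ₄. How a value exactly on a threshold is handled is an assumption here, and the function name is hypothetical.

```python
from bisect import bisect_right

# Thresholds θ1..θ4 of RIRSI (R732, R878, R712), taken from Table 2.
RIRSI_THRESHOLDS = (0.006, 0.018, 0.029, 0.041)

def grade_from_risi(value: float, thresholds=RIRSI_THRESHOLDS) -> int:
    """Return the severity grade (0 to 4) for a reconstructed index value.

    Assumption: the grade equals the number of thresholds that the value
    has reached, so a value equal to a threshold takes the higher grade.
    """
    return bisect_right(thresholds, value)

for value in (0.000, 0.010, 0.020, 0.030, 0.050):
    print(value, grade_from_risi(value))
```

Because the thresholds are ordered, each sample is compared only against the boundaries between adjacent grades, rather than against every pair of classes as in a multi-class classifier.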

Table 3. The grade performance of the sensitive RISIs and band combinations.

Type	Name	Band Combination	OA	AA	K_c
Two Dimensions	RDSI	(R₇₀₂, R₇₁₅)	0.778	0.720	0.714
	RNDSI	(R₆₄₃, R₆₅₆)	0.833	0.750	0.784
	RRSI	(R₆₅₆, R₆₄₃)	0.889	0.800	0.855
Three Dimensions	RIDSI	(R₁₀₅₁, R₁₀₄₉, R₇₄₇)	0.833	0.820	0.783
	RPSI	(R₅₁₉, R₄₃₈, R₇₀₆)	0.889	0.800	0.853
	RIRSI	(R₇₃₂, R₈₇₈, R₇₁₂)	0.944	0.900	0.928
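
For readers who want to reproduce the three measures in Table 3, the sketch below computes them from true and predicted grades. It assumes the usual definitions: OA is the fraction of correctly graded samples, AA is the mean of the per-grade accuracies, and K_c is Cohen's kappa. The evaluation protocol of this study is given in Section 3.1; the function name and labels below are hypothetical and are not the test samples of this study.

```python
from collections import Counter

def grading_metrics(y_true, y_pred):
    """Return (OA, AA, kappa) for two equally long sequences of grades."""
    n = len(y_true)
    true_counts = Counter(y_true)
    pred_counts = Counter(y_pred)
    hits = Counter(t for t, p in zip(y_true, y_pred) if t == p)

    oa = sum(hits.values()) / n
    # Per-grade accuracy, averaged over the grades present in y_true.
    aa = sum(hits[c] / true_counts[c] for c in true_counts) / len(true_counts)
    # Chance agreement from the marginal grade frequencies.
    expected = sum(true_counts[c] * pred_counts[c] for c in true_counts) / n ** 2
    # Note: undefined (division by zero) when expected agreement equals 1.
    kappa = (oa - expected) / (1 - expected)
    return oa, aa, kappa

# Hypothetical grades, for illustration only.
y_true = [0, 0, 1, 1, 2, 2]
y_pred = [0, 0, 1, 2, 2, 2]
oa, aa, kappa = grading_metrics(y_true, y_pred)
print(round(oa, 3), round(aa, 3), round(kappa, 3))
```

AA weights every grade equally, so it drops faster than OA when a grade with few samples is misgraded, which is one reason the two values differ in Table 3.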

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

