1. Introduction
Cotton is the main cash crop in China [
1], and different nitrogen application levels have a significant impact on the growth of cotton [
2,
3,
4]. The leaf area index (LAI) is one of the important indicators reflecting crop canopy structure and growth [
5,
6]. Monitoring LAI changes can provide a basis for variable cotton fertilization [
7,
8]. Therefore, rapid, accurate, and non-destructive monitoring of cotton LAI is of great significance for guiding crop fertilization. Traditional LAI monitoring mainly relies on manual sampling, which requires a large amount of labor, and involves time costs and lags; thus, it cannot meet the needs of real-time monitoring.
Remote sensing technology can realize timely, dynamic, and macro monitoring, and it has become an important means of monitoring crop growth information. In recent years, a large number of studies conducted both domestically and abroad have used remote sensing technology to investigate crop biomass [
9,
10,
11], chlorophyll content [
12,
13,
14], water and nitrogen content [
15,
16,
17,
18], and other physiological and biochemical parameters. In terms of monitoring crop LAI, a large number of studies have been carried out using remote sensing means, such as handheld spectrometers [
19], UAVs [
20], and satellites [
21,
22,
23]. Ground spectral monitoring has the advantages of being non-destructive and accurate; however, due to the limitations of the shooting range and instrument weight, near-Earth spectroscopy cannot achieve continuous and rapid monitoring at a spatial scale [
24]. In addition, studies have shown that satellite images have a certain potential in crop LAI monitoring [
25]; however, due to their image resolution of 10–60 m, they are mostly used for crop LAI monitoring at the forest or large regional scale [
26,
27]. UAV has fast and repeated capture capability in crop monitoring, and it has higher image resolution than satellite images [
28], making it more suitable for the precise monitoring of small plots. Some scholars have monitored the LAI of wheat [
29], rice [
30], corn [
31], and other crops [
32,
33] using spectral images obtained by UAV, and have obtained good results. In summary, based on the advantages of the UAV platform and the existing research, UAV has certain feasibility in crop growth monitoring. Therefore, this study uses UAV as a platform to carry out research on hyperspectral sensors.
UAVs can quickly acquire a large amount of hyperspectral data, which contain rich information, but they also have the problem of data redundancy. The hyperspectral reflectance and vegetation indices of the cotton canopy can be extracted from UAV hyperspectral images. Among them, the hyperspectral reflectance of plant canopy is the most direct response to vegetation characteristics. There has been a lot of research based on this, such as Zhang et al. [
29], who estimated the LAI of winter wheat using UAV hyperspectral images on the basis of spectral transformation and a variable selection method. Li et al. [
34], using the pretreated canopy spectral reflectance, effectively identified the characteristic wavelength to achieve the rapid estimation of nitrogen concentration in winter wheat leaves. Yang et al. [
35] used preprocessing feature screening and modeling of hyperspectral images obtained by UAV to monitor soil organic matter and soil total nitrogen in farmland. In summary, hyperspectral images acquired by UAV have more noise, whereas spectral transformation can effectively reduce image noise, and feature screening can achieve dimension reduction for hyperspectral data. Therefore, in this study, different spectral transformation methods and feature selection methods were selected to process the cotton canopy spectrum to improve the model accuracy. Feature screening after pretreatment directly reflects the vegetation characteristics; however, there are also problems, such as the instability of screening results and modeling results, and the poor quantitative analysis effect. Vegetation indices in quantifying plant growth through the difference between vegetation in different band ranges and soil backgrounds. Han et al. [
36] used the vegetation indices extracted from UAV multispectral images to invert the leaf surface number of winter wheat under different water treatment conditions. The number of multispectral bands was limited, and the accuracy of the vegetation index model constructed on the basis of multispectral bands was reduced to some extent. Yao et al. [
37] used UAV narrowband spectral images to construct the modified triangular vegetation index (MTVI), thereby improving the accuracy of the LAI monitoring model. In this study, higher-resolution hyperspectral images were selected to obtain more band information to reflect the physiological and biochemical information of cotton and achieve vegetation index optimization. However, when plants grow luxuriantly, vegetation indices will become saturated, whereby, with an increase in LAI, vegetation indices would remain unchanged or exhibit a small change trend. Therefore, in this study, spectral reflectance and vegetation indices were combined to establish a model to improve the accuracy of the cotton LAI monitoring model.
Machine learning algorithms are increasingly combined with remote sensing technology for crop growth monitoring due to their strong learning ability and ability to mine and understand deep information in data [
38,
39]. Most domestic and foreign scholars extracted vegetation indices from spectral information and used machine learning algorithms to improve the accuracy of the monitoring model [
27,
37,
38].
At present, LAI monitoring is typically conducted using the spectral data based on spectral vegetation index modeling calculation, and the plant canopy hyperspectral reflectance is the most direct response characteristic of vegetation, providing more detailed and more abundant information in comparison with multispectral vegetation indices. Furthermore, a reasonable spectrum transformation can partly eliminate the spectral data from background and noise. However, hyperspectral data also have multicollinearity, and machine learning can overcome the problem of collinearity between variables from different angles. Therefore, in order to improve the accuracy of the cotton LAI monitoring model, this study used different methods for spectral transformation and then screened sensitive bands combined with vegetation indices established using hyperspectral data after different methods of prediction processing. Two machine learning algorithms were used to build LAI monitoring models, and the best model was identified in order to provide a basis for accurate cotton field management and variable fertilization in Xinjiang.
4. Discussion
In this study, different growth period LAI changes and spectral responses were analyzed, and the results showed that with the increase of the growth period, LAI increased, while LAI of early-maturing varieties decreased at the floc opening stage and LAI of late-maturing varieties tended to remain unchanged, which was caused by the gradual cessation of vegetative growth and the drop of aging leaves at the later stage of crop growth. In terms of spectral response, LAI in the visible region was negatively correlated with canopy spectral reflectance, while LAI in the near-infrared region was positively correlated with canopy spectral reflectance, which is consistent with previous studies on winter wheat [
44], rice [
45], and rape [
32]. This is due to the spectral reflectance of vegetation. The difference in the 350–800 nm range is mainly due to the influence of chlorophyll and other pigments in plants, and the difference in the 800–1000 nm range is due to the scattering of plant cells and tissues. Cotton growing luxuriantly and multi-leaf superimposed radiation will produce high reflectance in the near-infrared band. Therefore, canopy spectra of different LAI values differ more significantly in the near-infrared region.
The original canopy spectrum is affected by the solar radiation flux, crop structure characteristics, and soil background conditions [
46]. Spectral pretreatment can reduce background noise information and effectively improve the accuracy of spectral information [
34]. Previous studies have shown that SNV can be used to eliminate the interference caused by light scattering and path length changes, and it can better predict the monitoring of the P and K content in tea [
47]. Rei et al. [
48] estimated the chlorophyll content after treating the original canopy spectrum with different methods, and the results showed that SNV and MSC did not show better performance. This is in contrast to this study’s results, which is potentially related to the spectral data in this study being obtained from UAV sensors, to atmospheric differences, to unmanned aerial vehicle (UAV) flight patterns and noise, or to the Rei blade clip being used to obtain spectral data, which did not require further correction.
The hyperspectral analysis includes two steps: characteristic band screening and regression modeling [
49]. In this study, LAI-sensitive bands were screened out using SPA and SFLA, and the results showed that the model based on SFLA had better performance. Ren et al. [
50] compared four band screening methods to grade black tea. Li et al. [
51] estimated soil arsenic content on the basis of hyperspectral data, obtaining similar results in this study. This is because, compared with the SPA algorithm, SFLA is a novel method of forwarding variable cyclic selection. The maximum projection vector wavelength is taken as the combination of candidate wavelengths, and the correction is made on this basis. In combination with the modeling method, acceptable prediction results are obtained on the spectrum, and better generalization performance can be obtained by reducing the data dimension, which has broad application potential. In the existing research models, band screening effectively reduced the data dimensionality, but the traditional linear regression modeling still had collinearity problems.
Artificial intelligence (AI) coupled with promising machine learning (ML) techniques well known from computer science is broadly affecting many aspects of various fields, including science and technology, industry, and even our day-to-day life [
52]. In recent years, in order to better realize the monitoring of cotton growth information, some scholars introduced machine vision, deep learning, and other technologies, which effectively improved the monitoring model accuracy [
11,
53]. In this study, the LAI monitoring model was established by comparing MSR and ELM algorithms, and the results showed that ELM was superior to MSR. The SNV-SFLA-ELM model has the best accuracy (R
2 = 0.7340, RMSE = 1.4494, and rRMSE = 23.40%; verification set R
2 = 0.7153, RMSE = 1.6796, and rRMSE = 26.32%), Yu et al. [
54] compared PLSR and ELM to construct a hyperspectral inversion model of nitrogen content in rice leaves, and the results showed that ELM had better performance. Chen et al. [
55] diagnosed the nitrogen content in the apple canopy on the basis of hyperspectral reflectance. Liu et al. [
56] estimated rice chlorophyll content on the basis of hyperspectral reflectance, and the ELM model had the best effect. Meanwhile, the above study results show that ELM has a small computation scale and good generalization, but its practical application is slow. In this study, the ELM model also showed good prediction ability, but it needs to be improved for future research and application.
The model established in this study can be used for monitoring LAI throughout the growth period of cotton, including different cotton varieties. Chen et al. [
57] established LAI monitoring models for cotton at different growth stages using multispectral data obtained by UAV, with R
2 = 0.65 and RMSE = 0.62, similar to the performance of the monitoring model established on the basis of vegetation indices in this study. Hyperspectral reflectance can directly reflect the geometric and physiological characteristics of vegetation, and vegetation indices can quantitatively describe plant growth through the difference of vegetation and soil background in different band ranges. They are homologous but have different characteristic emphases. Therefore, in order to improve the accuracy of the model, canopy spectral reflectance and vegetation indices were combined to construct the model in this study, and more sensors and modeling methods could be introduced to monitor the model construction in the future.
In summary, SNV pretreatment was used for spectral data and SFLA was used to screen sensitive bands, which could optimize model variables. However, the correlation between vegetation indices established by spectral reflectance after SNV pretreatment and LAI was higher, which could improve model accuracy. ELM can effectively resist noise and is more suitable for modeling remote sensing data. The SNV-SFLA-SNV-VIs model was the optimal model established in this study, with training set R2 = 0.9208, RMSE = 0.8216, and rRMSE = 12.89%, and validation set R2 = 0.9066, RMSE = 0.9590, and rRMSE = 15.72%. In order to determine the predictive ability of the model in different periods, the model was verified in different periods in this study. The results showed that the accuracy of the validation model based on a single period decreased, but the overall accuracy was high, while LAI that was too large or too small had a greater error in estimation. Thus, although this model has a broad application prospect in monitoring LAI during the whole growth period of cotton, future studies still need to add data sets on this basis to ensure the universality of the model. In addition, this study based on the optimal model of different N application levels of LAI estimation ability test, the results show that the model under different nitrogen levels estimate ability has significant differences. Under different nitrogen levels, the N4 interchange and N2 precision is relatively low and, combined with the model, may be related to the LAI distribution under different nitrogen treatment. The model for larger LAI will be underestimated, while for smaller LAI, overestimation will occur. The research on this part is relatively weak, which is an important issue to be paid attention to in the future model construction and optimization process.
In this study, different nitrogen treatments and different cotton varieties were established, but the method in this study was based on the spectral data of a cotton canopy in the same year at a specific site, which limits its prediction ability for other datasets or regions. Therefore, in order to optimize the SNV-SFLA-SNV-VIs model in terms of stability and accuracy, further datasets involving additional years, planting patterns, and regions need to be collected for model correction, in order to realize LAI estimation of cotton in Xinjiang by using a machine learning method in the future.
5. Conclusions
In this study, using the spectral data of cotton canopy height obtained by UAV, band combinations were screened using different pretreatment and band screening methods, and vegetation indices were constructed using hyperspectral data after different pretreatment methods were used to estimate the LAI of cotton throughout the growth period using MSR and ELM. The results showed that the canopy spectra of different LAI were significantly different in the 760–1000 nm range, and there was an obvious correlation between the canopy spectrum and LAI. By comparing methods under different pretreatments, it can be seen that the band screening based on SPA was too concentrated, resulting in information redundancy and incomplete information extraction. The sensitive bands screened by SFLA were evenly distributed. The vegetation index established based on the hyperspectral data pretreated by SNV had a higher correlation with LAI, and the DVI had the highest correlation. In comparing the results of the cotton LAI estimation model established using two modeling methods based on different modeling objects, ELM was superior to MSR. SNV-SFLA was the best monitoring model based on pretreated hyperspectral reflectance, and SNV was the best monitoring model based on vegetation indices. However, when combining hyperspectral reflectance with vegetation indices, the ELM model based on SNV-SFLA-SNV-VIs had the best effect among all models, with training set R2 = 0.9208, RMSE = 0.8216, and rRMSE = 12.89%, and validation set R2 = 0.9066, RMSE = 0.9590, and rRMSE = 15.72%. Therefore, the combination of both modeling objects could effectively improve the model’s accuracy.