1. Introduction
Grapes occupy an important position in global fruit production. According to the International Organisation of Vine and Wine (OIV), the vineyard area planted in the major grape-growing countries was about 7.33 million hm² in 2020, and China ranks third in the world with 783,000 hm² of grape-planted area [1,2]. With the expansion of planting scale and the rising production of specialty crops such as grapes, it is important to obtain the structural composition and planting area of the grape canopy quickly and accurately [3,4,5].
Remote sensing technology [6,7,8] provides strong support for obtaining spatial distribution information of land. Li et al. [9] monitored frost damage in wine grapes using satellite remote sensing, establishing a monitoring framework that integrates the spectral information of visible and infrared light. However, satellite remote sensing is costly and not suitable for farmers with small planting areas. For this reason, this paper adopts UAV remote sensing, with its low acquisition cost and high resolution [10,11], to study grape farmland information and address the low accuracy of identifying grape-planting areas in the field under complex environments. Han et al. [12] combined visible-light and UAV multispectral cameras to acquire images of lodged maize, extracted spectral information and texture features, screened potential eigenfactors, and constructed two logistic models based on the selected eigenfactors, which not only extracted the area of lodged maize but also predicted the probability of its occurrence. Valderrama-Landeros et al. [13] combined multispectral data with a normalized-vegetation-index algorithm to monitor mangrove forests in different states of health, which is important for the conservation of small mangrove forests. Lan et al. [14] applied UAV remote sensing to rice weed identification and achieved accurate discrimination of rice and weeds with a fully convolutional semantic segmentation method. These studies applied UAV remote sensing to agriculture with good results, which motivates this paper to apply UAV remote sensing to the identification of grape-planting structure and to address the low accuracy and difficult management of large-field grape-growing areas in complex environments [15,16,17].
Although multispectral UAVs have been applied to many crops [18,19], problems remain, especially in recognition accuracy, and applications to grapes are still few. Che'Ya et al. [20] used six bands (440, 560, 680, 710, 720, and 850 nm) of UAV multispectral images to distinguish weeds from crops. Ren et al. [21] first used the Relief-F method to obtain the weight factor of each band, then partitioned the entire band interval using a strategy based on inter-band correlation, and finally selected the band with the highest importance score in each subinterval to obtain the best bands of the image; however, their study focused mainly on the image bands themselves, with less consideration of image texture and vegetation indices. Shi et al. [22] used LiDAR and a multispectral camera to acquire data and extract texture features of temperate tree species, and showed that classification accuracy improved significantly when LiDAR 3D structural features were combined with texture features. Zhang et al. [23] analyzed 11 vegetation indices from multispectral orthophotos and, by comparing algorithms, concluded that vegetation-index information can be used for ground-cover classification and that combining multispectral bands with vegetation indices improves classification accuracy more than using spectral bands alone. Adding vegetation indices and optimal bands when recognizing targets can thus effectively enhance recognition accuracy [24,25,26]. Using integrated band-correlation features, Sun et al. [27] proposed a band-information-enhanced method for vineyard area recognition, which solved the problem of texture similarity among features in vineyard areas, but recognition accuracy remained low in areas with large differences in spectral information. To improve recognition accuracy, Kwan et al. [28] compared a deep-learning classification algorithm with nine traditional classification algorithms, concluded that SVM extracted vegetation more accurately than NDVI thresholding, and proposed adding a DSM to further improve vegetation extraction accuracy.
In summary, based on UAV remote sensing and DeepLabV3+, this paper extracts 24 texture features, 5 vegetation indices, and the spectral information of each band from field grape images, and comprehensively analyzes the weighting of each feature type's influence on the model [29,30,31,32] to select the most suitable scheme for extracting grape-planting structure. In addition, we improve the DeepLabV3+ deep semantic segmentation model to raise the extraction accuracy of grape-planting structures from UAV multispectral images: the input layer is restructured to accept multispectral images fused with a priori vegetation features of the grape field, and the activation function is modified to optimize the model [33,34,35]. This paper also explores the extraction accuracy of grape-field crops under different band combinations, providing a reference for future UAV multispectral applications in agriculture and ideas for agricultural informatization [36,37,38].
2. Materials and Methods
2.1. Study Area
The experimental site is located at Nanliang Farm, Xixia District, Yinchuan City, in the central Ningxia Hui Autonomous Region. The geographical coordinates are 106°9′30″–106°9′50″ E, 38°38′0″–38°38′10″ N, covering an area of about 13 km². The region has a distinctly temperate continental monsoon climate, with strong solar radiation, low rainfall, high daily evapotranspiration, and frequent dust storms. Winter is long and summer relatively short. The annual frost-free period is about 166 days, with the first frost usually in October and the last frost usually in April of the following year. Soil types in the study area are mainly sandy soils, sandy loam, and clay soils.
2.2. Image Acquisition and Dataset Construction
In this paper, a DJI Matrice 600 (M600) multirotor UAV carrying a multispectral camera was used to obtain spectral data, as shown in Figure 1. The camera covers three visible bands, 450 nm (blue), 555 nm (green), and 660 nm (red), and three red-edge/near-infrared bands, 710 nm (N1), 840 nm (N2), and 940 nm (N3). The UAV flight altitude was set at 75 m, with a forward overlap of 85%, a side overlap of 70%, and a ground sample distance of 4.68 cm/pixel.
The spectral remote sensing images of the six bands over the same test area were collected continuously in early September 2021. Camera performance was stable, and the UAV maintained a flight speed of about 8 m/s along a predetermined serpentine trajectory, with shooting points evenly distributed along the trajectory, as shown in Figure 2, ensuring that each test field was photographed within a short flight distance. The study area contains many trees; different sun elevations produce different shadows, and the crops themselves cast shadows, which change the texture and color characteristics of the crops and affect recognition. For this reason, the shooting time was set at 12:00 noon, and each flight lasted about 18 min. A total of 400–500 original multispectral remote sensing images were obtained for each test field.
2.3. Data Preprocessing
The test-area images were first radiometrically calibrated with a calibration plate and then checked, with unusable images rejected, in Pix4Dmapper (Pix4D, Lausanne, Switzerland). Because the images of the grape-growing areas had to be stitched together after shooting, problems such as overexposure and color distortion arose, which in turn would affect the accuracy of subsequent information extraction, as shown in Figure 3.
To avoid these problems, three histogram corrections (histogram equalization, histogram specification, and Esri correction) were applied to the stitched images. The results show that the Esri correction outperforms the other two methods: the corrected features have clearer boundaries and greater between-class variability, making them easier to distinguish.
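Histogram equalization, the first of the three corrections compared, can be sketched with NumPy alone; the function name and the single-band uint8 assumption below are illustrative, not from the paper.

```python
import numpy as np

def equalize_hist(img, nbins=256):
    """Histogram equalization for a single-band uint8 mosaic (NumPy only).

    Maps each grey level through the normalized cumulative histogram so
    that a low-contrast or over-exposed mosaic uses the full grey range.
    """
    hist, _ = np.histogram(img.ravel(), bins=nbins, range=(0, nbins))
    cdf = hist.cumsum().astype(np.float64)
    cdf /= cdf[-1]                                  # normalize to [0, 1]
    lut = np.round(cdf * (nbins - 1)).astype(np.uint8)
    return lut[img]                                 # apply the lookup table
```

Histogram specification works the same way, except the target lookup table is built from a well-exposed reference tile rather than from a uniform distribution.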
2.4. Factors Influencing the Selection of Characteristics of Grape Land
Extracting and analyzing the spectral information, texture features, and vegetation indices of the planting area, and screening the key feature parameters that can extract the vineyard structure, are essential for the information perception of the vineyard.
2.4.1. Best Band
When screening optimal band combinations for the large-field grape images, the preferred combination should contain a large amount of information while reducing the data dimensionality, yet still represent the original images efficiently. The basic spectral data of the large-field grape images were analyzed to obtain the standard deviation (SD) and mean of the grayscale of the six bands, as well as the correlation coefficient matrix between bands. SD is often used to measure the dispersion of image grayscale; a larger SD indicates a greater amount of effective information in the band.
To select the optimal bands from these basic spectral data, the optimum index factor (OIF) was introduced. The OIF jointly considers measures of information content, such as the SD and mean, and the correlation coefficients between bands, and can quickly and accurately describe the information quality of a band combination [39,40]. For a three-band combination it is calculated as

OIF = (SD1 + SD2 + SD3) / (|R12| + |R13| + |R23|),   (1)

where SDi is the grayscale standard deviation of band i and Rij is the correlation coefficient between bands i and j.
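The OIF, the sum of the three band standard deviations divided by the sum of the absolute pairwise correlation coefficients, can be evaluated exhaustively over every three-band combination; a NumPy sketch with illustrative function names:

```python
import itertools
import numpy as np

def oif(bands):
    """Optimum Index Factor of a 3-band combination.

    bands: list of three 2-D arrays (one per spectral band).
    Larger OIF = more variance (information) and less redundancy.
    """
    sds = [b.std() for b in bands]
    corrs = [abs(np.corrcoef(a.ravel(), b.ravel())[0, 1])
             for a, b in itertools.combinations(bands, 2)]
    return sum(sds) / sum(corrs)

def best_combination(all_bands):
    """Return the 3-band index combination with the highest OIF score."""
    combos = itertools.combinations(range(len(all_bands)), 3)
    return max(combos, key=lambda c: oif([all_bands[i] for i in c]))
```

For six bands this scores every three-band subset and returns the indices of the highest-scoring one, matching the exhaustive comparison reported in Section 3.1.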
2.4.2. Texture Characteristics
Texture is an arrangement property expressing periodic variation, or approximately repeated combinations, of certain regions in remote sensing images; it quantitatively portrays the degree of homogeneity and the internal detail of each category in the images. The large-field grape images acquired by the multispectral UAV provide grayscale information for the R, G, and B channels in the visible band, but the random noise between the R, G, and B channels is high and the information they contain overlaps. The HSI model can eliminate the influence of the intensity component in color images and significantly reduces the workload of image analysis and processing, which is conducive to the detection and analysis of color characteristics.
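The RGB-to-HSI conversion underlying this analysis can be sketched with the standard geometric formulas; the helper name and the [0, 1] input range are our assumptions.

```python
import numpy as np

def rgb_to_hsi(rgb):
    """Convert an (..., 3) RGB array in [0, 1] to the HSI model.

    I is the mean intensity, S the saturation (1 - min/I) and H the hue
    angle in radians. The H/S/I channels are far less correlated than
    R/G/B, which is why texture measures are computed on them.
    """
    r, g, b = rgb[..., 0], rgb[..., 1], rgb[..., 2]
    eps = 1e-9                                    # guard divisions by zero
    i = (r + g + b) / 3.0
    s = 1.0 - np.minimum(np.minimum(r, g), b) / (i + eps)
    num = 0.5 * ((r - g) + (r - b))
    den = np.sqrt((r - g) ** 2 + (r - b) * (g - b)) + eps
    theta = np.arccos(np.clip(num / den, -1.0, 1.0))
    h = np.where(b <= g, theta, 2 * np.pi - theta)  # hue angle
    return np.stack([h, s, i], axis=-1)
```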
The principal component analysis (PCA) [41] algorithm is used for feature dimensionality reduction. PCA is the classical algorithm for reducing the dimensionality of high-dimensional remote sensing data; its core idea is to find the optimal projection components under the criterion of maximum variance. The texture-feature data remain voluminous even after PCA; the five types of texture features screened by OIF and eliminated by probability statistics are shown in Table 1, from which three texture features with little information overlap and a large share of the component data were selected as the theoretical basis for the later classifier.
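A minimal NumPy version of the PCA projection described above (eigendecomposition of the covariance matrix, keeping the maximum-variance components); the function name is illustrative:

```python
import numpy as np

def pca_reduce(features, n_components=3):
    """Project texture features onto their top principal components.

    features: (n_samples, n_features) matrix of texture measures.
    Returns the centred data projected onto the eigenvectors of the
    covariance matrix with the largest eigenvalues.
    """
    X = features - features.mean(axis=0)          # centre each feature
    cov = np.cov(X, rowvar=False)
    eigvals, eigvecs = np.linalg.eigh(cov)        # ascending eigenvalues
    order = np.argsort(eigvals)[::-1][:n_components]
    return X @ eigvecs[:, order]                  # maximum-variance projection
```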
2.4.3. Vegetation Index
Visible-light vegetation indices for vegetation information extraction are constructed from the reflection and absorption characteristics of the crop canopy. However, in a large-field grape-growing area with rich crop types, large differences in vegetation cover, and a complex environment, it is difficult to ensure high classification accuracy with visible-light indices alone. Therefore, this paper uses the visible and near-infrared spectral information of the multispectral images to extract richer vegetation indices as the theoretical basis for subsequent model segmentation.
In this paper, four common vegetation indices, the Normalized Difference Vegetation Index (NDVI), Ratio Vegetation Index (RVI), Difference Vegetation Index (DVI), and Soil-Adjusted Vegetation Index (SAVI), were extracted [42], and the preferred indices were determined by comparison. The spectral information of the different crop types in the field grape images acquired by the UAV multispectral system was recorded as uncorrected digital number (DN) values. Before interpretation, the multispectral images were radiometrically calibrated, converting the recorded DN values into surface reflectance (SR), and the spectral data of the grape-growing areas were analyzed to obtain the spectral characteristic parameters of each category in the test area.
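The four indices can be computed directly from the calibrated surface-reflectance bands. A NumPy sketch, with SAVI's soil-adjustment factor L = 0.5 assumed (the paper does not state the value used):

```python
import numpy as np

def vegetation_indices(red, nir, L=0.5):
    """NDVI, RVI, DVI and SAVI from surface-reflectance arrays.

    red, nir: reflectance of the red and near-infrared bands.
    L is the SAVI soil-adjustment factor (0.5 is a common default).
    """
    eps = 1e-9                                  # avoid division by zero
    ndvi = (nir - red) / (nir + red + eps)
    rvi = nir / (red + eps)
    dvi = nir - red
    savi = (1 + L) * (nir - red) / (nir + red + L + eps)
    return {"NDVI": ndvi, "RVI": rvi, "DVI": dvi, "SAVI": savi}
```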
2.5. DeepLabV3+ Model and Improvements
2.5.1. DeepLabV3+ Model Building
DeepLabV3+ is an improvement on the DeepLabV3 deep learning model, which upsampled the output convolutional layer by a factor of 16. The network consists of an encoder and a decoder. The encoder extracts deep image features and is composed of a deep convolutional neural network (DCNN) and an atrous spatial pyramid pooling (ASPP) module: the DCNN extracts features from the input image, and the ASPP module refines the deep feature map by applying atrous (dilated) convolutions at four different sampling rates to extract multiscale information, mitigating the drawbacks of high-ratio upsampling. The decoder fuses shallow feature maps with the upsampled deep feature map, using the shallow features to recover location information lost in upsampling, and outputs the semantic segmentation prediction, as shown in Figure 4.
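The ASPP idea, parallel atrous convolutions at several rates whose outputs are concatenated and projected, can be sketched in PyTorch; the rates and channel sizes below are illustrative, not the model's actual configuration:

```python
import torch
import torch.nn as nn

class MiniASPP(nn.Module):
    """Sketch of ASPP: parallel atrous (dilated) convolutions.

    Each branch samples the feature map at a different rate, so the
    concatenated output mixes several receptive-field scales without
    further downsampling.
    """
    def __init__(self, in_ch, out_ch, rates=(1, 6, 12, 18)):
        super().__init__()
        self.branches = nn.ModuleList(
            nn.Conv2d(in_ch, out_ch, kernel_size=3, padding=r, dilation=r)
            for r in rates)
        # 1x1 projection back to out_ch channels after concatenation
        self.project = nn.Conv2d(out_ch * len(rates), out_ch, kernel_size=1)

    def forward(self, x):
        return self.project(torch.cat([b(x) for b in self.branches], dim=1))
```

With padding equal to the dilation rate, every branch preserves the spatial size, so the branches can be concatenated channel-wise.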
2.5.2. Model Training
The deep learning framework and server-related configurations used in this paper are shown in
Table 2.
The dataset was built by cutting the original images and labels of arbitrary size into 960 images of fixed size 256 × 256 pixels, and 5000 model input images of fixed size 256 × 256 were obtained through rotation, noise addition, mirroring, and other augmentations. The dataset was divided in a ratio of 8:2, yielding 4569 training images and 1142 test images. The model was trained with the momentum gradient descent algorithm under the mainstream PyTorch framework, with the main parameters set as follows: batch size 32, cross-entropy loss, momentum 0.9, learning-rate decay factor 0.1, and initial learning rate 0.0001.
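The fixed-size cropping step can be sketched as follows; the border handling (discarding partial tiles) is an assumption, since the paper does not specify its cropping details:

```python
import numpy as np

def tile_image(img, tile=256):
    """Cut an arbitrary-size image (H, W, C) into fixed 256 x 256 tiles.

    Tiles that would run past the border are discarded, matching the
    fixed-size input the segmentation model expects. A sketch; the
    paper's exact cropping/augmentation pipeline is not specified.
    """
    h, w = img.shape[:2]
    return [img[r:r + tile, c:c + tile]
            for r in range(0, h - tile + 1, tile)
            for c in range(0, w - tile + 1, tile)]
```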
2.5.3. Research on Information Extraction of Large-Field Grapes Based on Traditional Methods
The remote sensing images were processed with the above methods, and support vector machine (SVM), maximum likelihood (ML), random forest (RF), and ISODATA classifiers were constructed in ArcGIS software (Esri, Redlands, CA, USA). Information was extracted from the multispectral images of the large-field grapes, the optimal model was selected by comparing evaluation indexes such as overall accuracy (OA) and the kappa coefficient, and the result was compared with the improved DeepLabV3+ model.
2.5.4. Model Improvement
In this paper, we replace the loss function and change the input layer of the network, based on the DeepLabV3+ model, to obtain a better network for extracting large-field grape information. Firstly, the best bands selected from the different band combinations by the OIF algorithm are used as the input images for the model; secondly, the preferred vegetation index and texture features are incorporated into the model, the loss function is replaced, and the hyperparameters are tuned through repeated trials to optimize the network model.
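One common way to realize the input-layer change, widening the pretrained 3-channel first convolution so it accepts extra spectral channels, is sketched below; the mean-filter initialization of the new channels is our assumption, not necessarily the paper's scheme:

```python
import torch
import torch.nn as nn

def widen_first_conv(conv, in_channels=6):
    """Widen a pretrained 3-channel first conv to accept N spectral bands.

    Keeps the RGB filters and fills the extra NIR/index channels with the
    channel-mean of the RGB filters (a common initialization; the paper's
    exact scheme is not specified). Apply to e.g. the first conv of a
    torchvision DeepLabV3 backbone.
    """
    new = nn.Conv2d(in_channels, conv.out_channels,
                    kernel_size=conv.kernel_size, stride=conv.stride,
                    padding=conv.padding, bias=conv.bias is not None)
    with torch.no_grad():
        new.weight[:, :3] = conv.weight                      # keep RGB filters
        new.weight[:, 3:] = conv.weight.mean(dim=1, keepdim=True)
        if conv.bias is not None:
            new.bias.copy_(conv.bias)
    return new
```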
2.6. Evaluation Indicators
The classification accuracy of the scheme is evaluated based on the relative error of the confusion matrix and area.
2.6.1. Extraction of Test Set Area
The prediction results of the model on the validation set were imported into ArcGIS software and the grape-planting area was counted. The validation-set area obtained by combining field mapping and the photographed images was taken as the true area, and the relative error of the model's area extraction was calculated as

δ = |M − T| / T × 100%,   (2)

where M denotes the grape area predicted by the DeepLabV3+ model on the validation set, and the true value T denotes the grape acreage in the validation set obtained from field mapping.
2.6.2. Confusion Matrix
The confusion matrix is the most basic and intuitive way to measure the accuracy of a classification model. Many classification metrics are derived from it, such as OA, the kappa coefficient (kappa), precision, recall, and F1-score [43]; the mean intersection over union (MIoU) [44] and frequency-weighted IoU (FW-IoU) can also be computed from the confusion matrix. In this paper, these metrics are used to evaluate the semantic segmentation models.
(1) OA: The ratio of the number of correctly classified samples to the total number of validation samples for constructing the classifier.
(2) Kappa: The final judgment index based on producer accuracy (PA) and user accuracy (UA) and using the information of the whole error matrix, which can reflect the classification accuracy of the model comprehensively and accurately.
(3) FW-IOU: This is an improvement of MIoU, where each category is weighted according to its importance, which is derived from its frequency of occurrence.
(4) F1-score: This is the harmonic mean of the combined precision (P) and recall (R).
The formulas for these metrics are as follows. Assume that the dataset contains k + 1 classes, where class 0 denotes the background, and let p_ij denote the number of pixels that actually belong to class i but are predicted as class j. Then p_ii is the number of true-positive (TP) pixels of class i, and p_ji (j ≠ i) is the number of false-positive (FP) pixels. The mean intersection over union is

MIoU = [1 / (k + 1)] · Σ_{i=0}^{k} p_ii / (Σ_{j=0}^{k} p_ij + Σ_{j=0}^{k} p_ji − p_ii).

The closer the mean intersection ratio is to 1, the better the network segmentation; the closer it is to 0, the worse the network performs.
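All four matrix-derived metrics can be computed from a single confusion matrix. A NumPy sketch using the usual conventions (rows = true class, columns = predicted class); the function name is illustrative:

```python
import numpy as np

def segmentation_metrics(cm):
    """OA, kappa, MIoU and FW-IoU from a (k+1) x (k+1) confusion matrix.

    cm[i, j] = number of pixels of true class i predicted as class j.
    """
    cm = cm.astype(np.float64)
    total = cm.sum()
    oa = np.trace(cm) / total                        # overall accuracy
    pe = (cm.sum(axis=0) * cm.sum(axis=1)).sum() / total ** 2
    kappa = (oa - pe) / (1 - pe)                     # chance-corrected agreement
    iou = np.diag(cm) / (cm.sum(axis=0) + cm.sum(axis=1) - np.diag(cm))
    miou = iou.mean()                                # mean IoU over classes
    freq = cm.sum(axis=1) / total                    # class frequencies
    fwiou = (freq * iou).sum()                       # frequency-weighted IoU
    return {"OA": oa, "kappa": kappa, "MIoU": miou, "FW-IoU": fwiou}
```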
3. Results
3.1. Analysis of the Effect of Image Band Results
Statistical analysis of the study area gave the grayscale standard deviations of bands 1–6 (corresponding to B, G, R, N1, N2, and N3) as 12,568, 10,725, 6877, 14,508, 18,949, and 20,450, respectively. The standard deviation decreases from the blue band to the red band and then increases through the red-edge and near-infrared bands (N1, N2, and N3).
To take into account the standard deviation, inter-band correlation coefficients, mean values, and other information measures, the optimum index factor (OIF) of Equation (1) was computed in Python for each three-band combination. For the six-band multispectral images, the OIF values of 15 three-band combinations were calculated. The results are shown in Table 3; the band combinations 1, 2, 5 (B, G, N2) and 1, 2, 6 (B, G, N3) were finally selected as the best band combinations for the large-field grape images.
3.2. Characterization of Color and Texture
By studying the correlation coefficients and standard deviations of the different texture features, we lay the foundation for the subsequent extraction of the different crops in the farmland. Five categories of texture features with the greatest impact are selected as the basis for image classification, and the sample features of each category under the RGB and HSI color spaces are compared. There are obvious differences in the hue, saturation, and intensity of grapes, corn, greenhouses, etc. The results are shown in Figures 5 and 6.
Compared with the RGB color space, the internal texture and boundary features of the grapes are clearly defined in the HSI color space, and the shadows around the grapes caused by the sun elevation are eliminated. In the complex environment of the field grape test area, increasing the spatial resolution reduces the spectral differences within the same feature type to some extent, but the internal details of the images are significantly enhanced: the planting-plot boundaries of grapes, corn, and greenhouses are delineated more finely, and the canopy shape and geometric structure of the vegetation in the test area are characterized more clearly, which increases the accuracy and robustness of the texture features in the classification model.
The obtained results were then subjected to PCA feature dimensionality reduction, and the standard deviations and correlation coefficients among the components were obtained using ENVI software (Exelis Visual Information Solutions, Boulder, CO, USA), as shown in
Table 4.
The correlations between the components were screened through statistical analysis: component 1 has a low correlation with components 2 and 4 and a high correlation with components 3 and 5; the correlation between components 2 and 4 is around 0.8, which is high, while their correlations with components 3 and 5 are low; and component 3 is strongly correlated with component 5. The hue mean (H-Means), hue homogeneity (H-Hom), hue contrast (H-Con), hue low-pass (H-CLP), and saturation low-pass (S-CLP) were finally obtained as the five preferred texture classification features to support the improved algorithm and the accuracy of the data.
3.3. Vegetation Index Analysis
The spectral feature variability of plant canopy leaves was used to distinguish the planting information of different crops on multispectral images, and the four common vegetation indices of NDVI, RVI, DVI, and SAVI were extracted in ArcGIS software, as shown in
Figure 7 below.
As can be seen from Figure 7, NDVI can, to a certain extent, weaken external influences from the soil background, solar-elevation shadows, and atmospheric errors; it is sensitive to changes in the substratum of different crops, and its value ranges between −1 and 1. RVI is more sensitive to the growth state of vegetation, especially for crops with high coverage and good growth condition. SAVI and DVI are more sensitive to changes in the soil background but less sensitive to dense green vegetation. In summary, NDVI and RVI were chosen as the preferred vegetation indices for studying the characteristics of the vine-growing area, according to the actual conditions of the experimental site. The NDVI and RVI of the different features in the study area were extracted, as shown in Table 5.
From the vegetation-index maps, the NDVI values of maize are higher than those of grapes, owing to the higher reflectance of maize in the NIR band. From the statistical table of the ratio vegetation index of the features in the study area, the differences in RVI among grapes, maize, greenhouses, trees, and weeds make them easy to distinguish.
3.4. Research on Traditional Grape Land Information Extraction Methods
The field grape validation set was classified using SVM [45], ML [46], RF [47], and ISODATA, and the classification results are shown in Figure 8 and Table 6. The results show that field grape cultivation is complex, containing not only non-crop surfaces, such as barns and bare land, but also green vegetation, such as maize and weeds. The canopy density of the three types of green vegetation differs greatly, with maize the highest, grapes second, and weeds the smallest; uneven salinization in the study area also leads to large differences in grape growth.
For the UAV multispectral images of the field grape test area with complex agricultural information, the four common machine learning classification methods (SVM, ISODATA, ML, RF) were compared and their evaluation indexes calculated. As can be seen from the chart, the classification accuracy of SVM is better than that of the other three, with an OA of 76.03% and a kappa coefficient of 0.72, and the user accuracy for grape extraction reaches 97%. However, in terms of overall effect, all four traditional classification methods suffer from mixed pixels and cannot accurately extract grape information from large-field images.
3.5. Unimproved DeepLabV3+ Model in the Test Set Results
The unimproved DeepLabV3+ model exhibits mixed pixels and mismatched category edges at tile seams in the test set. The reason is that the overall remote sensing image is large and must be cropped into 256 × 256 tiles before being loaded into the model for training and prediction; when the unimproved model extracts the edge areas of each category poorly, the stitched test-set results show edge mismatches and similar artifacts. In addition, the extracted grape extent does not match the ground truth in some regions, and the error caused by shadows is not resolved, as shown in Figure 9.
4. Discussion
4.1. Effect of Spectral Information on Improving DeepLabV3+ Model
In this subsection, only the effect of the spectral information of different bands on the improved model is considered; images of the grape-growing areas with different band combinations are tested on the improved DeepLabV3+ model. Seven comparison experiments are set up: the visible RGB three-band set, the NIR three-band set, the RGB set with different NIR bands added, the full set of original bands, and the best transformed bands. The model prediction results are shown in Table 7, where B, G, and R denote the visible bands at 450, 555, and 660 nm, respectively, and N1, N2, and N3 denote the 710 nm, 840 nm, and 940 nm bands, respectively.
BGN3, RGB, RGB-N1, RGB-N2, RGB-N3, and RGB-N1N2N3 were compared experimentally, where BGN3 is the best band combination determined by the OIF algorithm and the band correlation coefficients. When a NIR band is added to the RGB bands, the classification accuracy of the DeepLabV3+ model improves to some extent. With the six-band RGB-N1N2N3 set, the overall accuracy and FW-IoU reach 79.09% and 76.79%, respectively; the OA of RGB-N1N2N3 on the validation set is higher than that of RGB, RGB-N1, RGB-N2, and RGB-N3. Meanwhile, the best combination obtained by data dimensionality reduction, BGN3 (1, 2, 6), achieves comparable accuracy with a much smaller data volume.
The specific extraction effect of the improved DeepLabV3+ model for grapes is shown in Table 8. The experimental comparison confirms that the band selection in the previous section is feasible and that the N3 band's spectral information has the greatest influence on the model. The experimental group BGN3 (1, 2, 6) gives the best grape extraction: the F1-score reaches 86.0% and the MIoU reaches the optimal value of 75.6%.
4.2. Effect of Texture, Vegetation Index, and DSM on Improved DeepLabV3+ Model
In this section, the mean of the first principal component in the HSI color space of the study-area image (H-Means) is used as the preferred texture feature of the large-field grape image, and NDVI is used as the preferred vegetation index. The DSM is then incorporated on this basis [43], and the improved DeepLabV3+ method is used to test the effects of texture, vegetation index, and DSM on the model's prediction results.
As shown in Table 9, when the first principal component H-Means in the HSI color space is added to the improved DeepLabV3+ with six-band spectral information, the overall accuracy (OA) improves by 5% and the FW-IoU by 5.88%; the texture features shared by the RGB and NIR bands are thus the biggest factor affecting the model's classification. Incorporating the vegetation index NDVI improves the OA by 0.49% and the FW-IoU by 0.19%; incorporating the DSM improves the OA by 2.9% and the FW-IoU by 0.37% on the validation set. The factors affecting the extraction accuracy of the field vine-planting structure are therefore, in order, texture feature H-Means > DSM > vegetation index NDVI.
Based on the above analysis, the final training set was imported into the improved DeepLabV3+ deep learning model for validation; its OA reached 87.48%, exceeding the best OA of 76.03% achieved by SVM among the traditional information extraction methods. The improved DeepLabV3+ model thus effectively improves the classification accuracy and robustness for field planting areas.
4.3. Test Set Area Extraction and Summary
The prediction results of the model on the validation set were imported into ArcGIS software, as shown in Figure 10, the grape-planting area was counted, and the relative error of the area extraction was calculated with Equation (2). The grape-planting area obtained from field mapping was T = 5475.08 m², the final predicted grape area was M = 5348.21 m², and the relative error of the extracted area was 1.9%, which again verifies the feasibility of the experimental scheme.
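The area count itself reduces to pixel counting times the ground area of one pixel; a sketch (the paper performs this step in ArcGIS rather than in code, and the function name is ours):

```python
import numpy as np

def predicted_area(mask, class_id, gsd_m=0.0468):
    """Planted area (m^2) of one class from the stitched prediction mask.

    Counts the pixels labelled class_id and multiplies by the ground
    area of one pixel; gsd_m is the 4.68 cm/pixel ground resolution
    quoted in Section 2.2.
    """
    return int((mask == class_id).sum()) * gsd_m ** 2
```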
In summary, compared with the traditional SVM, ML, RF, and ISODATA information extraction methods and with the unimproved DeepLabV3+ deep learning model, the DeepLabV3+ scheme based on spectral information + texture + vegetation index + DSM has clear advantages in all evaluation indexes. It can meet the accuracy requirements for recognizing large-field grape-planting areas and extracting the planting structure in complex situations, and can further support the management of missing plants in large-field grape areas.
5. Conclusions
(1) Aiming at the problems that traditional field grape information extraction methods are ineffective for field grapes in complex environments and that the DeepLabV3+ model confuses pixels and misclassifies some grapes as trees, an improved DeepLabV3+-based field grape information extraction scheme is proposed, providing an effective and feasible solution for interpreting the planting structure of Ningxia field grapes.
(2) The experimental results on the field grape dataset show that the best band combination for grape-growing areas is BGN3 (1, 2, 6); the main factor affecting the classification accuracy of large-field grape images is the texture feature shared by the RGB and NIR bands, and fusing the DSM into the model further improves classification accuracy. The DeepLabV3+ deep learning scheme based on spectral information + texture + vegetation index + DSM was finally determined. The OA of the improved scheme reaches 87.48%, 11.45 percentage points higher than the best traditional classification method (SVM), and the FW-IoU reaches the best accuracy of 83.23%. The scheme solves the mixed-pixel problem of the original model, improves the recognition accuracy of large-field grapes in complex environments, and achieves a relative error of the extracted area of 1.9%.
The improved DeepLabV3+ deep learning model based on UAV multispectral images proposed in this study solves the problem of collecting information on large-field grape plantation areas in complex environments and improves the recognition accuracy of grape plantation areas in such environments, meeting the requirements of, and laying the foundation for, the informatized management of grape-growing areas.