1. Introduction
Northwest China, especially Inner Mongolia, has low vegetation coverage. Therefore, the ecological problem of land desertification is particularly serious. To control the local fragile ecological environment, it is necessary to master the trend of land desertification and vegetation coverage, and the assessment of vegetation coverage has an important impact on the production and life of local residents and the protection of ecosystems [
1]. Owing to the vast area and scarce vegetation in Inner Mongolia, different vegetation coverage assessment methods have different advantages, disadvantages, and application scopes [
2], and the selection of an extraction method with high accuracy is crucial [
3].
As manual monitoring has many constraints in terms of time, area, and other issues, it is impossible to carry out large-scale and time-efficient monitoring of desert grassland, so remote sensing technology is widely used in the technical field of desert vegetation extraction, and there has been some progress. Among them, satellite remote sensing technology is often applied to vegetation cover estimation and has a relatively mature theoretical basis but, because of the low resolution of satellite remote sensing and other problems, the extraction of desert vegetation cannot meet the accuracy requirements [
4]. However, owing to the advantages of high resolution, rich geometric texture information, high timeliness, and low cost, UAV remote sensing technology effectively makes up for the shortcomings of traditional satellite remote sensing, and has gradually become an important way to monitor and extract vegetation information [
5]. Therefore, remote sensing technology has been applied to carry out more classification studies on plant communities, among which UAV remote sensing is the main platform to extract desert vegetation information by supervised classification methods and object-oriented classification methods [
6]. Moreover, with the application and development of unmanned aerial vehicles (UAV) in the field of remote sensing, the combination of UAV and remote sensing technology has been widely used in forestry and resource survey vegetation extraction [
7]. In recent years, light and small UAVs have been widely used in all walks of life, and the application of UAV technology to remote sensing has become a new development trend [
8]. Among them, constructing suitable vegetation indices in UAV visible light remote sensing images can reflect the surface vegetation condition more simply and effectively and is one of the main methods to extract vegetation information quickly from remote sensing images. Among them, many scholars have proposed a series of vegetation indices based on the visible light band using the characteristics of green vegetation in the visible light band [
9]. For example, Wang Meng et al. [
10] and Niu Yaxiao et al. [
11] combined UAV remote sensing data with different threshold determination methods to classify the vegetation coverage of crops. Li Bing et al. [
12] and Wang Zhenkun et al. [
13] used low-altitude remote sensing data acquired by unmanned aerial vehicles to extract crop cover and quickly classify vegetation, respectively. Stefan Puliti et al. [
14] and Shi Bo et al. [
15] applied UAV remote sensing technology to large-scale forest investigations and achieved good results. However, the accuracy of UAV visible light remote sensing images applied to desert vegetation extraction will be affected by factors such as vegetation shading and light intensity. This paper is dedicated to solving this problem.
The main research object of this paper is visible light remote sensing images captured by UAVs. As the visible light true color sensors carried by UAVs have only three bands of RGB [
16], the strong reflection characteristics of vegetation in the near-infrared band cannot be expressed. Among more than 150 vegetation index models [
17] published in the literature, this paper selects the EXG (excess green) index [
18], the MGRVI (modified green-red vegetation index) [
19], the NGRDI (normalized green-red difference index) [
20], the RGBVI (red-green-blue vegetation index) [
21], and the VDVI (visible-band difference vegetation index) [
22], all of which are constructed based on the RGB color space. Moreover, the contrast between vegetation and non-vegetation areas of these visible light vegetation indexes is obvious, and the vegetation information recognition effect is good, in which the vegetation area is bright white and the non-vegetation area is dark gray. Through the verification of many papers in related fields, the extraction accuracy can reach more than 75%, which has a certain reference value. Therefore, these vegetation indexes are selected as the reference objects for our experiment.
As UAV images are required to extract and classify vegetation information in desert areas, the influence of environmental factors in the desert on UAV images should be taken into account [
23]. To ensure the accuracy of supervised classification, the Compute ROI Separability tool should be used to calculate the separation degree between any categories when separating different ground objects. The separation degree is based on the Jeffries–Matusita distance and transition separation degree and is used to measure the separability between different categories [
24]. Its range is (0, 2). The greater the separation degree, the better the discrimination ability. If the separation degree is greater than 1.8, it is qualified; if it is greater than 1.9, it is accurate. Therefore, for each index model used as a reference in our experiments, the separation degree between different ground objects reached more than 1.9 during supervised classification. By analyzing the experimental results, it was found that vegetation shadow, UAV flight height, light intensity, and other factors affect the accuracy of vegetation extraction. In particular, vegetation shadows have a great impact on extraction accuracy, which may lead to false detections or missing detections. In existing studies, the application of the RGB color space in the RGB threshold method, the HSV (hue, saturation, value) discrimination method, and the RGB decision tree method is only used to calculate vegetation coverage from the perspective of color discrimination (especially green pixels), and these methods have certain limitations due to the changes in lighting environment [
25]. Meanwhile, in image processing, color components other than green also have different degrees of influence on vegetation coverage extraction [
26]. To solve this problem, a new method for vegetation extraction and shadow separation is proposed in this paper. The method is a Lab color-space-based L2AVI index model in which the
L channel in the Lab color space can be used to calculate the brightness of light pixels in the image, analyze the performance of shadows in the image, and avoid the interference of light environment changes on vegetation extraction. As the chlorophyll content of vegetation in desert areas is very small, the proportion of green components in the image can be enhanced by the channel and the influence of other color components can be avoided. Thus, the vegetation can be extracted and land desertification can be monitored with high accuracy.
5. Conclusions
Through the comparison of supervised classification results statistics, supervised classification results charts, and error analysis, the vegetation in the EXG vegetation index is highlighted, which can ensure the extraction accuracy of lush vegetation areas. However, owing to the large darkness of the shadow area, it is easy to encounter the problem of wrong detection and missing detection in areas with sparse vegetation growth. It is difficult to distinguish the vegetation shadow area and the sand area. Therefore, extracting vegetation information in areas with more shadows will lead to large errors.
In the MGRVI model, the regional divide between the vegetation and the land is clearer, and the light and dark details are good, so it is easier to obtain fine vegetation. However, the shadow model will have a higher brightness, which will also affect the overall precision of the division to a certain extent. For example, the shadow area in the Ordos research area is large, and the shadow color in the image taken by the UAV is dark, which is reflected in the MGRVI model. This results in the high brightness of the shadow model, which has a great impact on the accuracy. Therefore, the MGRVI can achieve the expected accuracy when collecting vegetation with less shadow. However, high-precision extraction cannot be guaranteed in areas with more vegetation shadows.
Compared with other RGB color space index models, the RGBVI model can better distinguish between shadow, land, and vegetation models, perfectly solving the method error caused by vegetation shadow in the MGRVI model. In the model of land, shadow, and vegetation, the color difference is larger, the contrast is more obvious, and the light and dark details are clearer, which is more conducive to the extraction of vegetation and the division of its features. However, the RGBVI is greatly affected by illumination, which means that good illumination conditions and suitable UAV flying altitudes are required to maintain the expected accuracy.
After supervised classification, it was found that the extraction effect of the NGRDI model was relatively ordinary, and the accuracy in several models was mid-range. The NGRDI had a certain effect in dividing the vegetation and vegetation shadow area, but it was not as significant as that in the RGBVI, the VDVI, or the L2AVI model.
Although the VDVI model has an accuracy of 91.79% and 97.07% in the Ulanqab area and the Ordos area, respectively, it is unable to classify the vegetation shadow in the Bayannur area because of the dark shade of the low shrub vegetation, so the vegetation information cannot be extracted stably in the areas with more and deeper vegetation shadow.
Meanwhile, it was found in the study that the flight altitude of the UAV directly determines the resolution of visible light images, i.e., the higher the flight altitude, the lower the resolution, such as for the extraction of desert vegetation, which usually requires the flight altitude to be controlled below 100 m. The flight speed of the drone is also the key to the image quality—too fast easily leads to blurred images. The lower the overlap rate, the better the stitching quality. The overlapping part should be minimized and the impact area should cover the whole study area. In addition, because the UAV visible images are based on RGB color space, the image information is controlled by only three bands, and relatively few vegetation indices can be constructed. In the future development trend of UAV, multispectral sensors can be used to increase the near-infrared band and then be combined with the Lab color space. In this thesis, more vegetation indices can be constructed to adapt to the extraction of vegetation information in different areas.
Through the control experiment, it can be concluded that the L2AVI proposed in this paper reached the highest accuracy, with the VDVI also obtaining high accuracy in the extraction of low plants or vegetation with less shadow. For the EXG, EGRDI, MGRVI, and RGBVI indices, the extraction accuracy is relatively general, and they cannot distinguish vegetation from non-vegetation. For the proposed L2AVI model, the index constructed through the Lab color space can avoid the traditional index model based on the RGB color space, which is unable to extract vegetation information stably and with high accuracy as a result of vegetation shadow, UAV flight height, and illumination conditions. The L2AVI model preserves the L channel, strengthens the a channel, and separates the b channel. It can effectively and stably classify and extract three ground feature elements (vegetation shadow, land, and vegetation information) in desert areas, and is not affected by the errors caused by illumination conditions and UAV flying altitude. Therefore, it is more suitable for the extraction of vegetation in desert areas.