1. Introduction
Tobacco, as a high-value crop, is an important source of income, employment opportunities, and tax revenue in both developed and developing countries. In the global tobacco-growing sector, China ranks first, accounting for 32% of the total production. The tobacco industry has a significant market size, and its tax revenue accounts for a large proportion of the national governmental fiscal revenue [
1]. Yunnan and Guizhou are the main flue-cured tobacco growing areas in China. Most of the tobacco planting areas in these regions are in plateau mountainous areas. The tobacco fields are broken and the planting is scattered in these regions, which means that there are considerable difficulties in the manual collection of statistics on tobacco production, making it difficult to meet the requirements of the need for fast and timely tobacco planting and monitoring [
2]. Timely grasp of the spatial distribution, planting area, growth, yield, and disaster losses of tobacco is important for accurate yield estimation that can assist enterprises in rationally arranging tobacco harvest, refined tobacco management, and for assisting government decision making.
Tobacco is mostly planted in the plateau areas of China, and its distribution is extensive and scattered. The efficiency of manual reporting is low and significantly affected by human factors. Remote sensing technology has advantages, such as large coverage, quick access to information, and low cost, thereby compensating for the shortcomings of human surveys, and has been widely used in estimating large-scale crop yield estimation and the gathering of planting area statistics. Remote sensing data sources range from medium resolution (Landsat) to high spatial resolution remote sensing satellite images (such as SPOT-5, China-Brazil Earth Resources Satellite 02 B, ZY-1 02C, ZY-3) [
3,
4,
5,
6,
7]. Working methods include the following: optical images and synthetic aperture radar (SAR) [
8], remote sensing platform research from high-altitude satellite remote sensing to low-altitude unmanned aerial vehicle (UAV) remote sensing [
9,
10,
11], monitoring method research from pixel feature-based index calculation using statistical methods to object-oriented depth and machine learning methods and identifying studies from regions to individual tobacco plants [
12,
13].
Optical data are easily affected by cloudy and rainy weather in mountainous areas, and effective information cannot easily be obtained. Moreover, the data acquisition cycle is long, which is not conducive to the dynamic monitoring of crop growth. SAR data comprise data captured throughout the day and is unaffected by clouds and rainy weather unaffected by cloudy and rainy weather. However, interference by mountainous terrain can easily interfere with the backscatter information of the SAR [
14,
15]. For the rapid acquisition of crop information in complex habitats in plateaus and mountainous areas, unmanned aerial vehicles (UAV) have a wide range of prospective applications. They possess the characteristics of high resolution, low cost, and have a short data acquisition cycle and are thus suitable for the extraction of crop information in the complex habitats of plateau areas [
16,
17,
18].
Guizhou Province lies in the low-latitude plateau area, with the total area of tobacco plantations being large; however, the spatial distribution is scattered. Here, the tobacco leaves are generally mixed or intercropped with other crops [
19,
20]. Therefore, it is difficult to accurately extract the planted area using large-scale low-to-medium remote sensing satellite image data; moreover, small plots are easily missed and misinterpret. In Southwestern China, it is difficult to obtain high spatial resolution satellite remote sensing images. UAV remote sensing has become the main means of monitoring tobacco-planting areas owing to its flexibility, high spatial resolution, and low cost. Object-oriented image analysis and deep learning are the main classification methods for remote sensing images with a high spatial resolution. The classification of various crops using deep learning has problems of complex models, large amounts of sample data, long training times, low portability, and low accuracy. There have been extensive studies on the application of UAV visible light remote sensing in agricultural information extraction, crop type identification, and growth monitoring [
21,
22,
23], and the use of plant protection UAV can also provide a reference for crop pest control [
24]. At the same time, the color index can identify crops and realize morphological feature extraction [
25]. It can use the OSTU threshold method to extract plants based on visible light images [
26]. Leaf polygons can also be extracted based on visible-light images to obtain average plant size information and number of plants [
27]. In addition to plant morphology, plant information can also be extracted by color index and other methods. The color stretches the images of different varieties of corn crops, with an extraction accuracy of 76–83%, but the impact of weeds on corn plant recognition has not been eliminated [
28]; however, characteristic information, such as the green index, can improve the accuracy of crop extraction to more than 90 and effectively estimate crop yield [
29]. There are relatively few studies on the monitoring of economic crops in the mountainous areas of the Southwest Plateau. Scattered crop planting distribution, complex growth environment, and large terrain fluctuations still exist in this area. Therefore, the characteristic economic crops need to be quickly and accurately identified [
30].
In view of this, considering the characteristics of broken crop planting plots and cloudy and foggy weather in karst mountain areas, the existing UAV remote sensing tobacco single plant recognition research has some shortcomings, such as high recognition accuracy, relatively complex calculation methods and complex processing process. The study focused on the tobacco-planting base in Beipanjiang Town, Zhenfeng County, Guizhou Province in the southwestern mountainous area of China as its research area, and the Phantom 4 Pro V2.0 drone was used to collect high-resolution orthophoto images (DOM). Tobacco plant information was extracted using methods such as color index and filter enhancement to obtain an efficient and scalable process for extracting tobacco plantation information effectively and provide refined data for monitoring crop planting in areas with complex terrain, and provide a decision-making basis for fine management of tobacco.
2. Materials and Methods
2.1. Overview of the Study Area
The research area has a complex environment and comprises a test and a verification area. Beipanjiang Town, Zhenfeng County, Guizhou Province, located in the mountainous area of southwest China, has a karst landform type. The altitude is 1324–1966.8 m, and the climatic conditions in the study area are suitable for tobacco cultivation. As shown in
Figure 1, the center point of the test area was 105°35′54.597″ E, 25°36′37.063″ N; the cultivated land has a high degree of fragmentation, and the large exposed rock area is large. The center point of the verification area is at 105°35′50.205″ E, 25°36′4.522″ N, and the land types mainly include corn, weeds, shrubs, and cement roads. The total amount of cultivated land resources in the area is insufficient, and the soil layer is shallow, discontinuous in distribution, and low in water retention capacity and drought resistance.
2.2. Data Collection and Preprocessing
The Phantom 4 Pro V2.0 is equipped with a 1-in 20-megapixel image sensor, a quad-rotor foldable body, a camera with 20-megapixel effective pixels, and a 1-in CMOS image sensor. The hovering accuracy is as follows: vertical ±0.1 m (when visual positioning is working normally); ±0.5 m (when the GPS positioning operates normally), horizontal: ±0.3 m (when the visual positioning is working function normally); and ±1.5 m (when the GPS positioning is functions normally). As the weather conditions and time of data collection are related to the change in the color difference of the image, the incident sunlight from 12:00 to 14:00 is perpendicular to the ground surface, which can reduce the shadow cast by crops in oblique sunlight. The study area was a karst mountainous area, and the image collection time was from 12:00 to 13:30 on 12 June 2021. The light conditions were sufficient, and the wind power was level 3, which met the safe operating conditions of UAV. On a sunny day, cloud cover in the mountains can be avoided. The height difference of the survey area is large, and the overall slope is 45°. Altizure was used to plan the data acquisition route. To ensure data accuracy, the course overlap was set at 85, the side overlap at 70, and design altitude at 90 to avoid data redundancy.
The Phantom 4 Pro V2.0 quadrotor UAV was used to obtain RGB true color images. The Pix4Dmapper software was used to stitch the high-resolution images of the UAV to generate digital orthophotos of the survey area. Pix4DMapper was used for, amongst others, orthophoto stitching, aerial photo screening, aerial three encryption and image overlap matching, internal orientation, beam method LAN adjustment calculation, and camera self-calibration to perform tilt correction and projection difference correction on UAV remote sensing images. Additionally, it was used to complete the calculation of the image mosaic, color leveling, and cropping to finally obtain the orthophoto image data, as shown in
Figure 2.
In the experiment, two UAV images of the area were selected as data: the test area and the verification area, as shown in
Figure 3. The size of the test area was 10,424 pixels × 12,082 pixels, the pixel size was 0.027. They were located northwest region of the Beipanjiang River, with the terrain comprising a hillside slope which gradually decreases from northwest to southeast. The planting area was irregular. Further, the tobacco slices were dark green in color, and the leaves were relatively large. Simultaneously, the test area was a tobacco-growing area in a complex scene, and included roads, shrubs, rocks, houses, and bare soil.
The size of the verification area was 13,720 pixels × 16,412 pixels, and the pixel size was 0.03. Relative to the south of the test area, the terrain gradually decreased from northeast to the southwest. The planting area was irregular, the color of tobacco slices was mostly cyan, and the leaves were small. The two images could be used to effectively extract the number of tobacco plants by visual inspection, thereby providing an effective reference for extracting the number of tobacco plants in the later stage. However, there was mutual occlusion between the tobacco and surrounding shrubs and rocks, and the tobacco planting boundary was unclear and the shrub and weed planting range overlapped, making it difficult for later extraction.
2.3. Data Feature Analysis
The types of features have different eigenvalues in the red (R), green (G), and blue bands (B) [
31]. To explore the difference between the target features of tobacco plants and surrounding features, this study set up a test area and an experimental area, located west of the Beipanjiang River, which provided a better foundation for testing the feasibility of the research method. The value range of 0–255 is used to construct color histogram by R, G and B channels, which is beneficial to obtain color information from invisible visible images. As can be seen from
Figure 4, the value of green band of TOBACCO RGB curve is between 165 and 215, much higher than that of red band, 135–195 and blue wave segment, 135–170, indicating that tobacco has an obvious indicator in G band. The low coincidence degree between the weed RGB curve band values and tobacco plants, between 125 and 180, was also one of the factors that interfered with tobacco plant extraction. There are obvious troughs in the RGB curves of rocks, in which B band intersects with R and G band. The b-band value of the maize RGB curve is between 80 and 155, while the b-band value of tobacco is between 165 and 215. The two are less confusing and have little interference to the correct extraction of tobacco plants.
3. Methodology
3.1. Technical Route
Combined with parameters, such as chromatic aberration, texture features and geometric size of visible light images from UAV, Excess Green Index (EXG), Normalized Green-red Difference Vegetation Index (NGRDI) and Excess Green Minus Excess Red Index (EXG-EXR) of tobacco plants were compared at the seedling stage. The applicability of the image was preserved, the frequency information of tobacco was retained, and the information of other objects in the image was suppressed. Subsequently, the image threshold method was used to conduct the best threshold segmentation of the image. Finally, the target identification of tobacco plants and the extraction of the number of plants were realized, as shown in
Figure 5.
3.2. Color Index
The current vegetation index research is mainly based on the visible light band and the near-infrared bands, such as the normalized vegetation index and the ratio vegetation index. There are many kinds of images and the remote sensing images required generally have a high acquisition cost and poor in effectiveness and spatial resolution. The vegetation indices based on visible light mainly include Excess Green Index (EXG) [
32,
33,
34], Normalized Green-Red Difference Vegetation Index (NGRDI), and Excess Green Minus Excess Red Index (EXG-EXR) [
35]. Woebbecke et al. [
36] found that the EXG index has been widely used in recent years and has high accuracy in distinguishing vegetation from soil. The types of images in the study area mainly include tobacco, weeds, shrubs, cement roads, retaining walls, and weed-proof cloth. The EXG was used to separate vegetation and non-vegetation. The calculation equation is explained below.
In the equation, G, R, and B denote the green, red, and blue channel bands, respectively. In ENVI5.5, the R, G, and B bands are calculated as b1, b2, and b3, respectively.
Image filtering calculation:
The image after the color index calculation contains shrubs, weeds, and other noises that interfere with the extraction of the target tobacco plant; therefore, the noise of the tobacco plant target recognition needs to be suppressed. Image filtering can suppress noise in the target image by ensuring the details of the image. In this study, high-pass filter [
37,
38], low-pass filter [
39], and directional filtering (directional) [
40,
41] were used to conduct comparative experiments to try to eliminate other crop noise of the image while overcoming the boundary effect, such that the gray value of the surface object can smoothly transition from the center to the edge and improve the accuracy of extraction of plant extraction.
3.2.1. High Pass Filter
The high-pass filtering process eliminates low-frequency components in the image while maintaining its high-frequency information of the image. It can be used to enhance edge information between different regions, that is, by using a transform kernel with a high central value (typically surrounded by negative weights). The default high-pass filter in ENVI uses a 3 × 3 transform kernel (the center value is “8”, and the outer pixel value is “−1”); therefore, the value of the high-pass filter transform kernel must be an odd number.
where D
0 is the distance from the frequency to the origin, and D(u, v) is the distance from the point (u, v) to the center of the frequency rectangle.
Assuming that the image size of the result of the calculation of the over-green index is M × N, and its Fourier transform has the same size, the center of the frequency rectangle is located at (M/2, N/2). The distance between the point and the center of the frequency rectangle is given by Equation (5), and the filtered image is expressed by Equation (6):
where M × N is the image size of the calculation result of the assumed green index, (u, v) = (M/2, N/2) represents the center of the frequency rectangle, and the distance between the point and the center of the frequency rectangle.
3.2.2. Low-Pass Filter
Low frequency filtering is used to save the low-frequency component information in the image, eliminating its high-frequency information. Low-pass filtering of ENVI is conducted using the IDL “SMOOTH” function on the selected target image. This function uses boxcar averaging, and the size of the box is determined by the size of the transform kernel; typically, the default transform kernel size is 3 × 3. However, when using the low-pass filter, it is easy to generate transitions to eliminate high-frequency components, resulting in blurred image edges.
where D
0 represents the radius of the passband, and the calculation method of D is the distance between two points.
3.2.3. Directional Filtering
Directional filtering involves the edge enhancement method. The purpose is to selectively enhance the image features of a specific direction component (for example, gradient), so the result shows that the output image has the same pixel value area of 0, and different pixel areas of value appear as lighter edges. The main steps to implement directional filtering are as follows: (1) select filters > convolutions > directional; (2) notably, the standard filtering in the convolution parameters dialog box needs to be adjusted, and the ENVI pass-through filtering needs to be in the text marked “Angle”. The desired direction (in degrees) must be entered in the box.
3.3. Threshold Segmentation
The main role of image segmentation is to distinguish between different areas with special implications in the image to achieve the optimal threshold, which has the characteristics of clear physical meaning and easy operation. The basic principles are as follows: to set different feature thresholds, divide the image pixels into several categories, set the original image to be f(x,y), find the feature value T in f(x,y) based on certain rules, and segment the image. Image after segmentation comprises two parts. Assuming that b0 = 0 (black), b1 = 1 (white), namely, the binarization of the image [
42,
43], the core idea states that when the threshold T results in the largest class variance between the target and the background, T is the best threshold for identifying and extracting the target objects.
where W
0 is the proportion of the target pixel point in the whole scene image; W
1 is the proportion of the background pixel point in the whole scene image; μ
0 is the average grayscale of the target object pixel; μ
1 is the average grayscale of the background pixel; μ is the total average gray level of the image; MN represents the size of the image; N
0 is the number of pixels with gray level is less than T; N
1 is the number of pixels with gray level is larger than T; and σ is the inter-class variance.
3.4. Accuracy Verification
To quantitatively evaluate the viability of this method—referring to the OTSU research method [
44,
45]—we defined the true positive (TP) as the number of correctly classified tobacco plants in the identification extraction results, and the false positive (FP) as the wrongly classified tobacco plants in the extraction results. False negative (FN) represents the tobacco plants omitted from the extraction results. The calculation equations of each evaluation index, namely, branching factor (BF), detection rate (DP), and integrity (QP) are
where BF increases with the increase in the number of misclassified tobacco plants; DP is the percentage of correctly classified tobacco plants; QP represents the quality of the extraction results of tobacco plants; and the higher the QP value, the better the extraction results. On the whole, the BP value is negatively correlated with the DP and QP values, the smaller the BP value, the better the DP and QP values, indicating the better the extraction effect [
46,
47,
48].
4. Results and Analysis
4.1. Exponential Image
Field investigation revealed that tobacco plants are often accompanied by rocks and weeds because of the complex growth environment, which increases the interference for the precise extraction of tobacco plants. To improve the separability of the soil and vegetation in ENVI5.5, the EXG, NGRDI, and EXG-EXR indices were independently calculated using the band calculation tool in ENVI5.5.
Table 1 shows that, under different environments, the calculation results of each index increased with the complexity of the growth environment of tobacco plants, and the separation of soil and vegetation in the calculation results of the NGRDI gradually decreased, with the results of the EXG-EXR decreasing gradually. Soil–vegetation separation is low, while the hypergreen index has higher extraction ability for tobacco plants in complex environments, and can better distinguish between soil, rocks, and weeds.
In the EXG image of the bare soil area, the tobacco plants were black, the soil background was gray, and weeds were bright white. The extraction effect of tobacco plants was good and distinguished the types of objects in the bare soil area. Normalized green and red tobacco plants and weeds in the image, calculated by NGRDI were gray, and the soil background was black. In the results of this index, tobacco plants and weeds were easily mixed; therefore, the phenomenon of wrong extraction occurred. There was also the mixed extraction of tobacco plants and weeds. In the EXG calculation result image with the rock as the background, the tobacco plants were gray and the rocks were black, and the difference between the two was clearer. In the image calculated by NGRDI, the rocks were scattered and bright white, tobacco plants were black, and soil was gray. Tobacco plants were gray, and rocks were black in the EXG-EXR calculation result image, but the extraction effect of tobacco at the seedling stage was poor. In the EXG calculation result image with weeds as the background, both tobacco and weeds were gray, and the soil was black. It is important to note that some parts of the NGRDI image resulted in a different color index. In the image calculated by the NGRDI, weeds and some tobacco were bright white and tobacco and soil were black, so the extraction effect was not ideal. In summary, the NGRDI is prone to the mixed extraction of weeds under the background of weeds, resulting in the wrong extraction of tobacco plants; EXG-EXR is easy to omit tobacco at the seedling stage under the rock background; EXG is more efficient in extracting tobacco under the background of weeds and rocks, and is more suitable for extracting tobacco plants in terrain broken areas.
The statistical sample characteristics using spss2.0 are shown in
Table 2. The gray-scale images have different responses to different targets on the ground. The DN values of tobacco, weeds and corn are positive, and the DN values of rock non-vegetation land are negative. It can be seen that by calculating the green index, the vegetated land and non-vegetated land were successfully separated.
According to the statistical characteristics of the four ground features in
Table 2, the DN values of tobacco range from 54 to 131, weeds range from 62 to 142, and corn ranges from 70 to 104. The DN values of the three overlap within the range of 70 to 104, which is obviously confusing, and is not conducive to the extraction of tobacco plants. The average DN value of weeds is 101.24, which is higher than that of tobacco and maize, and the DN value of tobacco is 95.74, which is close to the DN value of weeds is 101.24, indicating that tobacco and weeds have some differences in the statistical characteristics of DN value. The standard deviation of tobacco is 15.07, which is greater than the standard deviation of weeds and corn, indicating that the dispersion of tobacco values is too large. The study needs to further separate tobacco, weeds and corn
4.2. Image Filtering Enhancement
After calculating the EXG color index and verification areas, the information of tobacco plants could be preserved to a certain extent to suppress the noise from rocks and weeds, but it still contained some noise. To increase the accuracy of plant extraction, high-pass, low-pass, and directional filtering were selected for image filtering enhancement. In ENVI5.5, we selected the convolution “Convolution and Morphology” and the convolution rotation “High Pass Gaussian”; the default size of the transformation kernel was 3 × 3 filtering to eliminate the information of specific frequencies and enhance the image of the study area.
The image results after processing using different image filtering methods show that high-pass filtering cannot enhance relatively complete tobacco plant information and cannot distinguish the surrounding objects well. Direction filtering is slightly better than high-pass filtering for enhancing crop information, but the accuracy is low. Low-pass filtering can enhance the information of tobacco plants and has a good filtering effect with respect to the information regarding surrounding objects. After conducting low-pass filtering, the average value of the ExG image is 77, and the degree of dispersion of the DN value is reduced. For 62–76, the border of tobacco plants can be better preserved, and the main information of the image is effectively preserved. The study area before and after low-pass filtering showed that the DN values of rocks and weeds were considerably different, and tobacco plants could be better separated from soil and weeds. The DN value changed in the following manner: blue (soil, rocks) < gray (weeds) < green (tobacco plants).
4.3. Tobacco Plant Extraction
Based on the image obtained after low-pass filtering, the threshold t was set as the grayscale segmentation threshold of tobacco plants and other plants. We selected color slices in ENVI5.5 and, through multiple threshold parameter adjustments, set a threshold to separate tobacco plants from other background values, and masked to extract tobacco plots. The categories 62–76 in ENVI isolated and identified tobacco plants; therefore, the size was set according to the threshold of this category in ArcGIS. The grayscale segmentation results at
t = 65 were loaded into the ArcGIS 10.2. The area of all patches was 5336.23 m
2, the largest patch area was 8.79 m
2, the minimum area was 0.00078 m
2, and the average patch area was 0.012 m
2. In the threshold extraction process, the area of a single tobacco plant was used to segment larger patches and eliminate fine patches. As shown in
Figure 6, when the grayscale segmentation threshold was
t = 65, the plant area was
S = 5.18 dm
2, and 404 plants were missing (FN), 119 plants were incorrectly extracted (FP) in the study area, and the actual number of plants extracted was 57,594 (TP). The number of true tobacco plants (TP) calculated during the field investigation was 58,117, the calculated branching factor (BF) was 0.002, the detection rate (DP) was 99.3%, and completeness (QP) was 99.1%.
To further test the applicability of this research method, the Phantom 4 Pro V2.0 quadrotor drone was used to collect the verification images of the tobacco planting area in the tobacco planting base of Zhenfeng County at 2021-6-13T12:00-13:00. The conditions were clear with a wind force of 4. Tobacco plants were in the growing period, and weeds and rocks were scattered between the rows of the plants. The DN value calculated based on the ExG index ranged from 62 to 77, and distinguishing the tobacco plants from weeds in the verification area was difficult. The calculation results of ExG were processed via low-pass filter convolution in the test area; the DN value converged to 70–78, and the low-frequency information of tobacco plants was well preserved. When the grayscale segmentation threshold was t = 65 and the plant area was S = 5.18 dm2, 1685 plants were missing (FN) and 880 plants were incorrectly extracted (FP) in the test area, and the actual number of plants (TP) extracted was 71,256. Through field investigation, the TP was estimated to be 71469; the calculated BF was 0.012, the DP was 97.69%, and the QP was 96.53%.
The method proposed in this study was preliminarily verified in the experimental area and the verification area, which proves that the method is feasible for identifying and counting the number of tobacco plants in the summer growth period; it is better to calculate the EXG during the vigorous period of plant growth. According to the method proposed above, the image calculated by the EXG index was processed by low-pass filtering in the complex environment of the verification area. The extraction results of tobacco plants when the plant area was
S = 5.18 dm
2 are in
Table 3. According to the terrain and tobacco growing environment, the extraction results were divided into a flat area, slope area, bare rock area, and weed area. Because the larger rock was located at a higher altitude, it was easy to block the tobacco plants below the rock, and the drone image shows these partially blocked tobacco plants. This was because the large coverage area of the rock provided a clearer and more consistent background. Obviously, to determine the effect on the extraction of the tobacco plants, the weed area should be judged according to its size and the distance from the tobacco plants. When the weed area is large, the research should be eliminated by threshold segmentation. When the weed area is similar to the tobacco plant area, misinterpretation can easily occur; this can also occur when weeds and tobacco plants grow together. In general, the detection rates of this method in the test and experimental areas are 96.61 and 95.66%, respectively, which are reasonable extraction results. In the experimental area, the number of tobacco plants in the seedling stage was 809, which was too large; thus, the extraction accuracy of tobacco plants in the verification area was reduced.
It can be seen from the below tables as
Table 4 and
Table 5 that this method has a good extraction effect during the vigorous growth period of tobacco. The wrong extraction mainly occurs when the tobacco plant is in the seedling stage, its growth is not obvious, and it is covered by vegetation. The phenomenon of omission occurred in the vegetation sheltering, close to the shrub, and the morphological characteristics of tobacco plants were not obvious.
5. Discussion
(1) Applicability of methods. This study realizes the identification of tobacco plants and the calculation of the number of tobacco plants in the fragmented terrain at low cost and with high efficiency, and overcomes factors such as fragmented plots, scattered planting, and steep terrain. The feasibility of calculating the number of plants can provide basic data for the accurate yield estimation of a single tobacco plant. In this study, the characteristics of tobacco vegetation were enhanced by calculating the EXG, and the target characteristics were enhanced by low-pass filtering; the target detection rate of tobacco plant identification and counting in the terrain fragmentation area reached 96.61%, the integrity was 95.66%, and the branching factor was 0.012. This is good recognition accuracy; however, its accuracy is still susceptible to rocks, weeds, and other field backgrounds.
(2) Parameter selection. Mining suitable color features from tobacco images according to the existing RGB color system is an important step in the process of target recognition using tobacco color features. In addition, by analyzing the color changes of tobacco plants in different periods, a time series data set of tobacco plant growth can be constructed to analyze the different characteristics of tobacco plants over time. The identification and counting of tobacco plants based on time series remote sensing can also be applied to such issues as tobacco diseases and insect pests, fine management and other issues. Thus, future research on this is important.
(3) Analysis of the influencing factors. Tobacco plant plots in the study area had extensive rock coverage and many weeds, which increases the number of error extraction of this target crop. Factors in the planting of this target crop include weeds and tobacco plant canopy appearing as sheets, which confuses the identification of tobacco plants; larger target rocks may obscure tobacco, thereby reducing the actual number of tobacco plants. A ground investigation found that some tobacco plants could not be alive in the early stages. If they survive, farmers replant in a later period. Therefore, when the survey was conducted in June, some seedling-stage tobacco plants appeared to have increased the difficulty of identification.
(4) Restriction of terrain factors. Distortion in ultra-low-altitude UAV remote sensing images is mainly attributed to geometric distortions caused by sensors and terrain fluctuations [
46]. The terrain of the karst mountainous area is broken, and the tobacco planting plots are complex in composition and have large height differences. It is difficult for drones to fly at ultra-low altitude and close to the ground to collect high-resolution images. Using a fixed aerial height to collect the images, we found that tobacco plants are mainly distributed on the top of the mountain and the mountainside, and a phenomenon of target image information decline also occurs. Therefore, in future research, it is necessary to determine the effect of terrain factors on the information change rules in ultra-low-altitude crop targets and identify the multi-level visible light identification features and information attenuation rules for crops at different altitudes, which can effectively improve the quality of visible light remote sensing images.
(5) Method limitations. During our research, we found that the use of color indices and low-pass filtering can better achieve the target identification of tobacco plants, but this is easily disturbed by weeds. Based on the two-dimensional image features in the study, future research should consider introducing the three-dimensional structural features of tobacco plants into the identification method. To rectify the phenomena of missed detection and false detection in UAV images, morphological methods need to be considered further, and accurate extraction should be conducted based on the shape of tobacco. For weeds and crops with similar spectral characteristics, which are easily extracted by mistake, the “morphological-spectral” joint feature is used to eliminate the interference caused by similar spectra. Moreover, in this paper, the wrong mention of weeds and shrubs has not been solved reasonably well. In future studies, in-depth research is required to identify better methods of extracting target crops with high precision.
6. Conclusions
Our research explores the applicability and efficiency of using the method employing quadrotor UAV visible light images in precision agriculture monitoring in complex mountainous areas. The results showed that the quadrotor UAV is easy to operate, has low cost, can obtain highly accurate images of tobacco plots, and has unique advantages with respect to monitoring special economic crops in terrain-fragmented areas. The results show that the quadrotor UAV used is easy to operate, low in cost, highly accurate in identifying tobacco plants, and has unique advantages in monitoring special economic crops in fragmented terrain areas. At the same time, this method can monitor the growth of tobacco plants under different agricultural management methods, and can also provide a reference for agricultural refinement, field yield estimation, and modern agricultural management in karst mountainous areas.
The difference between this study and previous studies is that the method in proposed in this paper does not require a large number of samples, which is highly transferable, overcomes comprehensive factors, such as large terrain height differences and the uneven growth status of tobacco plants, and realizes the target detection of tobacco plants in the fragmented terrain. The test and verification areas accounted for 96.61% and 97.69% of the area, respectively. The method introduces low-pass filtering into the identification of crop plants, filters the information in the high-frequency part of the image, and retains the low-frequency information of tobacco plants in the image, while suppressing the slowly changing background and improving the accuracy of traditional color index identification of crop plants.
The traditional method of estimating tobacco yield by hand is not enough to scientifically support the development and refinement of tobacco growing statistics and improved production management. This study enables the accurate extraction of tobacco plants, which provides a reference method for tobacco refinement’s production. In future research, the registration and fusion of point cloud data and optical images should be completed based on the high-precision spatial position relationship, and the color feature information of the target image should be given to the image matching point cloud data to improve the accuracy of tobacco recognition. At the same time, the differences of color and morphological characteristics of tobacco in different growth stages should be considered when extracting tobacco plants.