Next Article in Journal
Artificial Intelligence Reinventing Materials Engineering: A Bibliometric Review
Previous Article in Journal
An Algorithm for Nutrient Mixing Optimization in Aquaponics
Previous Article in Special Issue
Extraction of Coal Mine Surface Collapse Information and Design of Comprehensive Management Model Based on Multi-Source Remote Sensing—Taking Zhaogu Mining Area as Example
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Extraction of Garlic in the North China Plain Using Multi-Feature Combinations from Active and Passive Time Series Data

1
College of Information Science and Engineering, Shandong Agricultural University, Tai’an 271018, China
2
National Engineering Laboratory of Efficient Utilization of Soil and Fertilizer Resources, College of Resources and Environment, Shandong Agricultural University, Tai’an 271018, China
*
Author to whom correspondence should be addressed.
Appl. Sci. 2024, 14(18), 8141; https://doi.org/10.3390/app14188141
Submission received: 25 June 2024 / Revised: 2 September 2024 / Accepted: 9 September 2024 / Published: 10 September 2024
(This article belongs to the Special Issue Intelligent Computing and Remote Sensing—2nd Edition)

Abstract

:
Garlic constitutes a significant small-scale agricultural commodity in China. A key factor influencing garlic prices is the planted area, which can be accurately and efficiently determined using remote sensing technology. However, the spectral characteristics of garlic and winter wheat are easily confused, and the widespread intercropping of these crops in the study area exacerbates this issue, leading to significant challenges in remote sensing image analysis. Additionally, remote sensing data are often affected by weather conditions, spatial resolution, and revisit frequency, which can result in delayed and inaccurate area extraction. In this study, historical data were utilized to restore Sentinel-2 remote sensing images, aimed at mitigating cloud and rain interference. Feature combinations were devised, incorporating two vegetation indices into a comprehensive time series, along with Sentinel-1 synthetic aperture radar (SAR) time series and other temporal datasets. Multiple classification combinations were employed to extract garlic within the study area, and the accuracy of the classification results was systematically analyzed. First, we used passive satellite imagery to extract winter crops (garlic, winter wheat, and others) with high accuracy. Second, we identified garlic by applying various combinations of time series features derived from both active and passive remote sensing data. Third, we evaluated the classification outcomes of various feature combinations to generate an optimal garlic cultivation distribution map for each region. Fourth, we developed a garlic fragmentation index to assess the impact of landscape fragmentation on garlic extraction accuracy. The findings reveal that: (1) Better results in garlic extraction can be achieved using active–passive time series remote sensing. The performance of the classification model can be further enhanced by incorporating short-wave infrared bands or spliced time series data into the classification features. (2) Examination of garlic cultivation fragmentation using the garlic fragmentation index aids in elucidating variations in accuracy across the study area’s six counties. (3) Comparative analysis with validation samples demonstrated superior garlic extraction outcomes from the six primary garlic-producing counties of the North China Plain in 2021, achieving an overall precision exceeding 90%. This study offers a practical exploration of target crop identification using multi-source remote sensing data in mixed cropping areas. The methodology presented here demonstrates the potential for efficient, cost-effective, and accurate garlic classification, which is crucial for improving garlic production management and optimizing agricultural practices. Moreover, this approach holds promise for broader applications, such as nationwide garlic mapping.

1. Introduction

As the global population expands and dietary patterns evolve, the cultivation area and production of garlic have steadily increased [1]. China consistently dominates in garlic production, exportation, and consumption, positioning garlic as a critical crop within the nation’s economy [2]. In recent years, garlic production has exhibited a steady increase, underscoring the significance of promptly accessing information regarding its planted area and distribution to inform assessments of garlic prices and food security. Conventional garlic planting area surveys rely on field sampling techniques, which are characterized by high costs, prolonged sampling periods, and challenges in providing timely information on garlic cultivation [3]. The maturation of the remote sensing industry offers a solution, enabling rapid, large-scale, and cost-effective monitoring of garlic cultivation through satellite remote sensing data [4,5,6]. In China’s primary garlic-producing regions, mixed cropping of garlic with other crops is prevalent, resulting in numerous mixed pixels in low spatial resolution remote sensing imagery. This phenomenon poses a challenge to accurately identifying garlic cultivation areas. Therefore, the utilization of high spatial resolution imagery emerges as the preferred approach for precisely delineating the spatial distribution of garlic within these key production zones [7,8,9]. Furthermore, the predominant mixed cropping of garlic with winter wheat and rape in the study area yields similar spectral signatures during specific stages of vegetation growth, leading to the manifestation of “foreign body in the same spectrum” in multispectral images [10]. Consequently, discriminating between garlic, rape, and winter wheat solely based on multispectral images at a particular stage of the vegetation growth cycle becomes inherently challenging, significantly impeding garlic identification accuracy.
The duration and diversity of time series data for agricultural crops are directly correlated with the complexity of ground vegetation types [11,12,13,14]. Leveraging remote sensing imagery provided by satellites to form time series data enables the rapid and precise extraction of crop types. Time series imagery has found widespread application in classifying both single and multiple crop types [15,16,17,18], predominantly by constructing the NDVI (normalized difference vegetation index) time series from remotely sensed optical imagery to serve as an indicator of crop phenology [19,20,21,22,23]. However, time series data typically encompass a significant period during which environmental factors are complex and variable, and the crop state undergoes changes. Therefore, relying solely on NDVI as an indicator may not be universally applicable across all environments. To establish a more robust basis for classification, scholars have explored alternative indices to mitigate environmental influences and thereby enhance classification accuracy. For instance, the Enhanced Vegetation Index (EVI) [24] and Modified SAVI (Modified Soil Adjusted Vegetation Index, MSAVI) [25] have been investigated. Through the integration of various indices to construct new time series data, it becomes feasible to more accurately reflect crop growth dynamics.
Although significant progress has been made in vegetation classification using time series data from either active or passive remote sensing images, both modalities possess distinct advantages and limitations. Obtaining high spatial and temporal resolution time series data exclusively from one type of remote sensing image remains challenging. Optical imagery, often utilized for constructing time series curves over the crop growth period, is susceptible to cloud cover and shadowing, diminishing its usability [26,27]. On the other hand, SAR is sensitive to environmental factors such as rain, snow, and atmospheric humidity, potentially compromising image quality under adverse weather conditions. Chen et al. demonstrated the potential of combining active and passive remote sensing for garlic research by extracting garlic using both stepwise and simultaneous approaches, yielding favorable results [28].
While prior classification studies have primarily focused on enhancing classification accuracy [29,30], insufficient attention has been devoted to exploring the factors influencing classification accuracy. However, investigating these factors may offer novel insights for improving classification accuracy. There is no doubt that ground crop cultivation characterized by good regularity and uniformity is advantageous for remote sensing image classification. However, due to the unique circumstances in our country, it is often challenging to achieve this ideal situation. There are differences in the cultivation patterns between garlic and winter wheat in the study area, with garlic cultivation being less regular and more dispersed. To accurately describe and understand the impact of garlic dispersion on classification accuracy, further investigation into the degree of garlic dispersion is warranted.
In recent years, the application of machine learning methods in remote sensing classification has seen rapid development. Among these methods, random forest (RF), support vector machine, extreme gradient boosting, decision tree, and deep learning are commonly employed. Random forest, in particular, has demonstrated excellent classification performance in remote sensing classification and has garnered widespread recognition from previous researchers [31,32].
While there exists a growing body of literature and practical experience in land cover classification using time series image sets [33,34], the focus of interest in this study lies in the identification and extraction of individual crops based on time series data. Unlike most past studies, the study area under consideration includes the presence of a same-season crop (winter wheat) with a phenological period very similar to the crop of interest, with mixing of the two being common. Distinguishing between these two types of crops using only optical time series data, strongly correlated with phenological characteristics, poses a significant challenge. Moreover, the complexity and diversity of vegetation types in the study area make it prone to the occurrence of the same spectral signature in remote sensing images. Consequently, achieving the desired extraction accuracy by using remote sensing images at a specific moment to differentiate crops with similar phenological characteristics is difficult [35,36]. Additionally, traditional classification methods based on time series data have limitations and are particularly challenging in distinguishing between different crops with similar phenological characteristics.
In this paper, we present a study aimed at extracting garlic in the study area through time series data, with a focus on the cultivation area and distribution of garlic, without making detailed distinctions between crops other than garlic. Situated in the eastern part of China, the study area predominantly features garlic and winter wheat as the main winter crops. The aims of this study were to address the following three research questions: (1) whether the superposition or splicing of temporal information can help improve the classification accuracy of garlic; (2) whether the superposition of optical data and Sentinel-1 SAR data can help identify garlic accurately; (3) whether the constructed garlic fragmentation index can effectively express the fragmentation of garlic distribution in different regions and help analyze the reasons for the differences in garlic extraction accuracy in different regions.

2. Data and Methodology

2.1. Study Area

The study area is situated within the North China Plain, spanning from 114° E to 121° E and from 32° N to 40° N. Characterized by low terrain, this region is conducive to the growth of agricultural crops, rendering it one of the primary grain-producing areas in China. Six primary garlic-producing regions were selected for this study: Jinxiang County, Laiwu District, Lanling County, Pizhou City, Qixian County, and Zhongmou County. These areas span a geographical range from 113.5° E to 118.5° E and 33.7° N to 36.6° N, covering a total area of 8597 km2. These regions experience a temperate monsoon continental climate and typically engage in a two-season crop rotation system. The predominant crops cultivated include garlic, winter wheat, vegetables, and peanuts. Among these, garlic cultivation encompasses 228,500 ha, representing 26.58% of the total area. Garlic planting commences in early October each year, with growth accelerating in April of the following year as temperatures rise, culminating in harvesting in June. Conversely, winter wheat is sown in mid-October, exhibiting slightly slower growth than garlic, and harvested in June as well. Mixed cropping of the two is widespread in the study area, with most areas featuring a blend of the two crops in smaller units. Given the study’s focus on garlic distribution and acreage, other ground cover types are not meticulously distinguished. Vegetation other than garlic (such as winter wheat, vegetables, peanuts, peppers, or other crops) is uniformly classified as “other”. The fertility period of garlic in the study area is detailed in Figure 1. The distribution of the study area is shown in Figure 2.

2.2. Data

2.2.1. Field Survey Data

Between 27 January 2023 and 3 February 2023, researchers conducted a week-long survey in the study area to comprehend garlic and winter wheat cultivation practices. The survey route commenced in Tai’an, continuing to Jinxiang, then to Qixian and Zhongmu, followed by Pizhou and Lanling, and concluding in Laiwu. Before departure, points of interest within each county of the study area were selected based on satellite remote sensing imagery and time series data, with the sampling route planned according to the distribution of these points. The counties in the study area are major garlic-producing regions in China, where garlic and winter wheat cover nearly all cultivated areas, occupying vast expanses of the landscape, with minimal field weeds. Small patches of vegetables and other crops adjacent to garlic fields present challenges to classification accuracy. Trees are mainly concentrated near mountains, towns, villages, and along the roads between fields. Vegetables and other vegetation are predominantly found around villages, with much of it grown in greenhouses.
During the survey, samples of various vegetation types were collected, with a particular focus on garlic and winter wheat. Given the vast expanse of the study area and the intricacy of its terrain, achieving uniform sample collection was challenging. Consequently, in addition to field sample points, visual samples were also selected. To ensure the accuracy of visual samples, high-resolution remote sensing images were initially employed to classify the sample categories, followed by integration with Sentinel-2 NDVI time series remote sensing images for final categorization. Subsequently, the accuracy of the data was verified via meticulous comparison of all samples with high-resolution remote sensing images. Ultimately, the samples utilized in this study were compiled from ground-truthing samples and visual samples.

2.2.2. Synthetic Aperture Radar Data

The Sentinel-1 (S1) data are obtained from the “COPERNICUS/S1_GRD” remote sensing image collection on Google Earth Engine (GEE). Within this study, the Sentinel-1 radar image is directly accessed via the GEE platform, having undergone official preprocessing procedures, including updating orbital metadata, removal of GRD boundary noise, elimination of thermal noise, radiometric calibration, and ortho-correction. The data mode employed for this study is the IW mode, which is well-suited for remote sensing studies of land surfaces, particularly with VV + VH polarization.

2.2.3. Remote Sensing Image Data

Utilizing GEE as a remote sensing big data processing platform enables direct access to Sentinel-2 and Sentinel-1 remote sensing images, circumventing the laborious storage and processing requirements on local computers, thanks to its cloud computing capability [37,38]. The Sentinel-2 (S2) remote sensing image utilized in this study is sourced from the Sentinel-2 image dataset on the GEE platform with product class 2A. This dataset undergoes ortho-correction and atmospheric apparent reflectivity processing by the GEE official, and the imagery retrieved from the cloud platform corresponds to the “COPERNICUS/S2_SR_HARMONIZED” product, representing surface reflectance data. In this investigation, five bands of Sentinel-2 remote sensing satellite data are employed, namely, red, green, blue, near-infrared, and short-wave infrared, with a ground resolution of 20 m for the short-wave infrared band and 10 m for all other bands.

2.3. Methodology

The technical methodology of the study, illustrated in Figure 3, encompasses the following steps: (1) conducting a field survey in conjunction with Sentinel-2 NDVI curves to characterize the growth dynamics of garlic and winter wheat; (2) utilizing Sentinel-2 NDVI time series data to extract crops during the winter season within the study area employing random forests; (3) identifying a specific time point within the vegetation growth cycle and plotting time series curves using two different vegetation indices before and after the identified time point, subsequently stitching them together to generate comprehensive crop time series data; (4) formulating diverse combinations of classification features based on time series data from primary passive remote sensing bands or indices, employing random forests to reclassify winter crops using several different feature combinations, and validating the results and accuracy; (5) conducting a comparative analysis of the classification accuracy and outcomes for each combination of classification features and determining the optimal combination of features; and (6) utilizing the constructed garlic fragmentation index to assess the fragmentation of the classification results in each region. These steps represent a systematic approach to enhance the accuracy and reliability of winter crop classification, with a particular focus on garlic, within the study area.

2.3.1. Sentinel-2 Time Series Curve Composition

The Sentinel-2 NDVI curves time series were obtained from October 2021 to June 2022 using the GEE Sentinel-2 dataset and combined with field survey data to accurately understand the climatic characteristics of garlic, winter wheat, and other crops. The Sentinel-2 NDVI was calculated using the following formula:
N D V I = ρ N I R ρ R E D ρ N I R + ρ R E D
where ρ N I R is the reflectance in the near-infrared band and ρ R E D is the reflectance in the red band.
Figure 4 shows that October corresponds to the sowing stage, while June marks the harvest period for both garlic and winter wheat, which are planted and harvested nearly simultaneously. Notably, the NDVI patterns of garlic and winter wheat exhibit similarity, with considerable overlap between the upper and lower boundaries of the two crops, posing challenges in their differentiation solely based on NDVI values from a single day. It is noteworthy that the overall changes in winter wheat were smoother relative to garlic, consistently displaying higher NDVI values than garlic throughout most of the growing season. Additionally, although both crops reach their peaks at nearly the same time, garlic demonstrates more rapid fluctuations in its rise and fall. These distinctive variations aid in facilitating a more accurate differentiation between the two crops.
Due to various factors such as weather conditions, obtaining high-quality time series data for all image elements can be challenging. To maximize data quality, this study employs a two-step optimization process for time series data. Firstly, monthly median values of each image element are used to generate time series data. Subsequently, a method utilizing historical data is employed to correct data anomalies within the vegetation growth period, thereby enhancing the data quality. This method identifies anomalies in each image value through harmonic regression analysis and replaces them with normal historical data, thereby improving the reliability of the time series data.
The method for splicing the time series data involves using MSAVI or SAVI to calculate the time series data for the study area during periods with low NDVI values. For periods with higher NDVI values, NDVI, GNDVI, or EVI are used instead. For instance, MSAVI time series data from October 2021 to January 2022 are combined with EVI time series data from February 2022 to June 2022 to create complete time series data, referred to as double index time series data (DITS). A time node between January and March was selected for splicing the DITS data based on NDVI time series data and field surveys. Several potential time nodes were tested to determine the optimal one, and through comparison of classification results using different time nodes, January was identified as the most suitable time node for DITS.
The formulas for MSAVI, SAVI, EVI, and GNDVI for Sentinel-2 are provided below, respectively:
M S A V I = 1 2 2 ρ N I R + 1 2 ρ N I R + 1 2 8 ρ N I R ρ R E D 2
S A V I = ρ N I R ρ R E D ρ N I R + ρ R E D + L 1 + L
E V I = ρ N I R ρ R E D ρ N I R + 6 ρ R E D 7.5 ρ B L U E + 1
G N D V I = ρ N I R ρ G R E E N ρ N I R + ρ G R E E N
where ρ N I R is the reflectance in the near infrared band, ρ R E D is the reflectance in the red band, ρ G R E E N is the reflectance in the green band, L is the soil conditioning coefficient, and ρ B L U E is the reflectance in the blue band.

2.3.2. Sentinel-1 Time Series Curve Composition

Figure 5 illustrates the Sentinel-1 time series curves of garlic and winter wheat based on 5000 sample pixels. Figure 5a presents the time series data of VH/VV for garlic and winter wheat, while Figure 5b displays the time series data of VH and VV for garlic and winter wheat. Analysis of Figure 5b indicates that the time series data of VV for garlic and winter wheat exhibit a similar pattern of change to the time series data of VH, with more significant differences observed around the month of April, particularly in the troughs and the timing of their occurrence. Conversely, examination of Figure 5a suggests that the trend of the time series data of VH/VV for garlic and winter wheat is similar, although the difference is significant from February to May. Notably, the peak for garlic occurs around February, whereas for winter wheat, it is around April.

2.3.3. The Random Forest Classifier

In this study, we utilized the random forest algorithm for classification. This method reduces the overfitting risk often associated with individual decision trees by constructing multiple trees and aggregating their predictions, resulting in strong robustness [39,40]. The algorithm effectively handles high-dimensional data, maintaining powerful classification performance, even with a large number of features, thus mitigating the curse of dimensionality [41]. Its random selection of samples and features further enhances its resilience to noise, improving the accuracy when dealing with noisy datasets. Although random forest is typically regarded as a “black box” model, it offers insights into feature importance, providing some level of interpretability in the decision-making process.
Through experimentation, we determined that optimal results were achieved by configuring the RF classifier with 200 trees. The application of RF to crop classification has demonstrated its efficacy, yielding favorable outcomes [42,43].

2.3.4. Garlic Extraction

Initially, winter crop distribution maps (WCMs) were derived using RF classification based on Sentinel-2 NDVI time series data obtained from the GEE cloud platform (see Section 2.3.1). Subsequently, the WCM was segmented based on different feature combinations to produce garlic distribution maps (GDMs) for each region.
A total of nine distinct feature combinations were utilized to classify the WCM using RF, resulting in the generation of garlic distribution maps (GDMs) for each region. The accuracy of the classifications obtained from all feature combinations was compared to determine the optimal combination for each county and to generate the planting distribution map and area. The classification feature combinations encompass various time series data, with each combination demonstrating a certain level of similarity. Comparing the classification accuracy of different feature combinations allows for the analysis of the importance of specific information.
In this study, we designate x-y (where x = me, mg, mn, se, sn, n, e and y = sm, s, m, swir) as a feature combination, where x represents the vegetation index time series data included in the combination, and y represents other spectral time series data and SAR time series data. We categorize vegetation index time series data, spectral band data, and microwave data as the three types of data considered in this study. Typically, spectral bands comprise the four bands of red, green, blue, and shortwave infrared (multi-bands), while the three microwave time series data of VH, VV, and VH/VV are collectively referred to as SAR. In the feature combination, x comprises two letters to denote that the vegetation index time series data of the combination are derived from the DITS, while one letter signifies the time series data of a single index. Each letter in y then represents a feature, where “swir” denotes the shortwave infrared band only. For example, the feature combination labeled “me-sm” includes time series data from the DITS of MSAVI and EVI and SAR time series data, as well as time series data from the red, green, blue, and shortwave infrared bands. In summary, the classification feature combinations may encompass time series data from MSAVI, SAVI, EVI, NDVI, GNDVI, SAR, and multi-bands. The timing data included in each feature combination is depicted in Table 1. It is noteworthy that in feature combinations containing two vegetation indices, the index positioned at a later point in the table is situated in the first half of the DITS, while the other is located in the second half.

2.3.5. Accuracy Verification

The validation samples for the WCM were obtained from field-collected samples and randomly selected points. To ensure objectivity in accuracy validation and to minimize subjectivity in sample selection, the visual validation samples for the GDM were generated from random points within the study area. The methodology used to determine the attributes of these random points was consistent with the approach used for determining the attributes of visual samples in the training set (see Section 2.2.1). The details of the GDM validation points, regarded as ground truth data, are presented in Table 2.
All classification results underwent validation by comparing the confusion matrix with the samples, a method of accuracy validation conducted at the pixel level and commonly employed in accuracy assessment. Classification accuracy was evaluated by comparing the validation predictions with the selected validation points and conducting a consistency test.

2.3.6. Garlic Fragmentation Analysis

To comprehensively analyze the factors contributing to variations in accuracy across different regions, this study leverages previous research findings [44,45] and integrates the characteristics of the study area to evaluate garlic fragmentation using the GFI. The GFI quantifies the fragmentation of garlic cultivation areas, with higher values indicating greater fragmentation [46]. The formula is provided below:
G F I = n / A
where GFI represents the garlic fragmentation index, n denotes the number of plots planted with garlic, and A is the total area planted with garlic in the sample area.
Ten square samples, each with a side length of one kilometer, were randomly selected from each county in the study area to assess the degree of garlic fragmentation. The fragmentation of the garlic crop in each county was evaluated based on the GFI of the samples. To more accurately reflect garlic fragmentation, the random area range of Laiwu and Pizhou was artificially constrained.

3. Results

3.1. Winter Crop Classification Results

In this study, NDVI time series data were utilized to extract winter crops, with previous studies confirming the reliability of the NDVI for vegetation extraction [47,48,49]. The area under crop cultivation in Pizhou is 1618.03 km2, constituting 77.62% of the county’s total area, making it the largest in the study area. In contrast, Laiwu has 611.78 km2 under crop cultivation, accounting for 35.18% of its total area, the smallest in the study area. Figure 6 illustrates the WCM for each county within the study area. The classification results indicate that vegetation distribution in the study area predominantly extends outward, centered on anthropogenic activity intensity. Further details regarding the WCM are provided in Table 3.

3.2. Extraction Accuracy

First, this study successfully extracted winter vegetation using remote sensing imagery, while all other vegetation types were categorized as “Other”. The results in Table 3 demonstrate high accuracy for winter vegetation classification models across all regions (all above 92% overall). The producer and user accuracies for winter crops were over 96% (except for Zhongmou County, which had a producer accuracy of 92%), providing a reliable foundation for further garlic classification.
Secondly, by classifying the WCM, we generated the GDM for each region. The classification models employed different combinations of classification features, and the resulting GDM classification accuracy is depicted in Table 4 and Table 5. The results indicate that all combinations of classification features yielded satisfactory classification results (overall accuracy consistently above 84%). Within each district, multiple feature combinations demonstrated the potential for achieving high classification accuracy. Among the various counties in the study area, Zhongmou and Laning counties attained higher accuracy levels, followed by Jinxiang and Laiwu counties, while Pizhou and Qi counties exhibited more average classification outcomes.
Based on Table 4, the Kappa of each classification feature across different regions exhibit a similar trend to the overall accuracy change, with all achieving optimal results in Laning County and lower accuracy in Pizhou and Qixian counties. In most cases, as the number of features within a combination of classification features decreases, both classification accuracy and Kappa coefficients decrease (except in Lanling County), with decreases of less than 10%.
Among the various feature combinations, the inclusion of DITS in the feature combination offers some advantages over the NDVI, a benefit reflected in most areas (except Pizhou). However, when replacing the NDVI with the EVI in the feature combinations, this advantage becomes negligible, as both the feature combinations with DITS added and E-SM obtained the best results in their respective regions, showing no significant advantage. Nevertheless, this does not discount the utility of DITS, as the feature combination with DITS added was able to achieve better producer accuracy versus user accuracy in extracting garlic in most regions.
As evidenced by the classification results of the feature combination n-m vs. n-, the addition of spectral information significantly enhances accuracy, with the Kappa improving by more than 1% and even more than 6% in most regions. Relative to N-SSWIR, N-SM contains three additional RGB bands, resulting in a relatively minor difference in accuracy across regions compared to that between N-R and N- (except for Jinxiang, where the former’s accuracy difference is less than half of the latter’s in all other regions). The shortwave infrared band contributes more significantly to improving classification accuracy compared to the RGB band.
N-sm includes more SAR time series data (VH, VV, and VH/VV) compared to n-m, leading to better accuracy in most areas. Except for a 3% decrease in the Kappa in Laning County, Laiwu and Pizhou improved by approximately 1%, and all other areas exceeded 2%. The integration of active and passive time series remote sensing data has a more pronounced effect on enhancing classification accuracy in most areas. Overall, e-sm demonstrates good stability and achieves favorable results in most regions with strong generalizability. DITS exhibits advantages in enhancing accuracy in certain areas, but different combinations of vegetation indices require further testing. Moreover, the combination of active and passive remote sensing time series data can significantly enhance accuracy and is applicable across various regions.

3.3. Garlic Extraction Results

Figure 7 illustrates the GDM classification outcomes for the study area, generated from the optimal feature combinations identified in the preceding section. According to these classification results, the total area under garlic cultivation amounts to 1845.11 km2. Garlic cultivation exhibits a dual characteristic of concentration and dispersion. Concentration is evident in the predominant cultivation of garlic in specific regions of the study area, while its cultivation is sparse or absent in other areas. The dispersed nature of garlic cultivation is exemplified by its frequent intercropping with winter wheat or other crops in regions where it is cultivated extensively, rather than being grown in isolation.
Among the six counties in the study area, Pizhou has the largest garlic cultivation area, covering 542.80 km2. Figure 7 indicates that garlic is cultivated across the entirety of Jinxiang and Qixian counties, whereas in other counties, garlic cultivation is more concentrated in specific areas. Additionally, other types of vegetation exhibit a more continuous spatial distribution.

3.4. Classification Contribution

Table 6 illustrates the ranking of classification contributions from the feature combinations that yielded the best classification results for each region. Due to the extensive number of features, only the top fifteen classification contributions are provided. The table highlights the significant contribution of SWIR to classification accuracy, with multiple SWIR features included (up to 56 features when considering all classified features). This finding aligns with previous observations regarding the role of SWIR features in enhancing accuracy. Notably, SAR features are scarcely represented in the table, except for Jinxiang and Pizhou, indicating limitations in using SAR features for crop species determination.
Furthermore, the table incorporates various vegetation indices, suggesting their applicability for garlic extraction across different regions. Among these, the EVI and MSAVI are more frequently observed, indicating their broader suitability for classification. The presence of different vegetation indices in distinct regions underscores the effectiveness of DITS in enhancing garlic extraction accuracy.

3.5. Garlic Fragmentation

Table 7 presents the fragmentation degree calculated based on the best classification results for each region. Overall, Lanling, Zhongmou, and Qixian exhibit lower fragmentation degrees. Although there is a certain correlation between the fragmentation index and classification accuracy, this relationship is not linear. When the difference of the fragmentation degree is more than 0.2, the area with the larger fragmentation degree can be predicted, and at this time the extracted classification accuracy will be reduced accordingly. However, when the change in the degree of fragmentation is at a relatively low level, the impact of the change in the degree of fragmentation on the classification results will be reduced.
As depicted in Table 7, regions with lower fragmentation demonstrate greater stability in both user accuracy and producer accuracy. Despite Laiwu district exhibiting higher overall accuracy, its garlic identification accuracy is lower due to a higher prevalence of other vegetation types. The validation points are more distributed across other vegetation types in this district, contributing to the inflated overall accuracy.

4. Discussion

4.1. Advantages of Data Preprocessing

The study area for this experiment encompasses a wide geographical expanse, with an east-west longitude difference of 7°, necessitating approximately three image strips from Sentinel-2 satellites to comprehensively cover the area. However, due to cloud cover effects, the intervals between usable image acquisitions from different strips may exceed ten days [50]. Optical imagery remains the primary reference for classification in this experiment, necessitating a hierarchical processing approach. Despite compiling time series data based on monthly mean values, anomalies persist in many pixel values [51]. To address this, harmonic analysis was employed to identify anomalies in the vegetation index relative to the previous two years’ data and rectify them, thereby reducing the impact of cloud or other noise artifacts and improving the reliability of the time series data. Experimental results confirm the efficacy of these methods in providing more dependable data for garlic classification.

4.2. Advantages of DITS

Previous studies on crop extraction from time series data have indeed affirmed that superior outcomes can be achieved by identifying crops using such data [52,53,54,55]. The diverse vegetation within the study area, coupled with the presence of winter wheat sharing spectral and phenological similarities with garlic, presented a formidable challenge for garlic extraction. As depicted in Figure 4, garlic and winter wheat exhibit considerable similarity in their NDVI during the phenological period, rendering it arduous to discern the two solely through remote sensing imagery on a given day during the growing season. Field surveys have indicated that the foliar index of garlic during the same period is lower than that of winter wheat, theoretically suggesting that the DITS of garlic should also be smaller than that of winter wheat. However, as evident from Figure 4, the NDVI of garlic and winter wheat closely align before January, with both exhibiting lower foliar indexes in the early growth stage, thereby complicating differentiation between the two using NDVI time series data at the onset to achieve the desired effect.
From the classification results, it is apparent that the inclusion of additional classification features enhances the accuracy of classification outcomes. This is evident from Table 4 and Table 6, where SWIR bands play a pivotal role in improving accuracy, likely attributed to the substantial difference in leaf water content between garlic and winter wheat from February to June [56,57]. Both the SAVI and MSAVI mitigate the influence of the vegetation-soil environment on the vegetation spectral characteristics in the early stages of crop growth [58,59], with the MSAVI exhibiting dynamic adaptability to changes in vegetation density and soil effects. Hence, employing the MSAVI as the first half of DITS during the early stages of crop growth could prove more beneficial in discriminating garlic from winter wheat, a hypothesis validated by the results in Table 5 and Table 6. The near-simultaneous maturity period of garlic and winter wheat, when crop density is higher and soil background reflection exerts less influence, renders NDVI values of both crops convergent, posing challenges to differentiation using the NDVI. Conversely, the EVI offers a distinct advantage in observing denser vegetation and is less susceptible to noise [60], thus utilizing the EVI as the latter half of DITS may facilitate better discrimination between garlic and winter wheat. Notably, as depicted in Table 5, utilizing time series data comprising the EVI combined with microwave and several single-band time series data yields improved results in several regions, underscoring the significance of the EVI in classification contributions, as highlighted in Table 6.

4.3. Advantages of Combining Active and Passive Remote Sensing

Optical images are frequently contaminated by clouds and their shadows, while SAR images serve as a dependable data source in nearly all weather conditions [61]. Processed microwave remote sensing data exhibit a sharp response mechanism to plant structure [62]; thus, the inclusion of polarized data would enhance the accuracy of feature classification. As depicted in Figure 5, a distinct disparity exists between the VH/VV of garlic and winter wheat, with the garlic VH/VV peak occurring earlier and being of larger magnitude compared to that of winter wheat. Therefore, VH/VV, by amalgamating the characteristics of VH and VV, can streamline computations and furnish additional information to differentiate between the two, thus aiding in garlic extraction. Although the addition of SAR features enhances classification accuracy, as stated by Veloso et al., observed backscatter comprises a blend of ground backscatter perturbed by soil moisture and surface roughness and vegetation backscatter altered by the 3D structure of vegetation [63]. Consequently, SAR features may not be efficacious in discerning the class of garlic. Moreover, given that the resolution of the SWIR band of Sentinel-2 is 20 m, which is relatively low compared to other visible light bands, it may impede the discrimination between garlic and winter wheat plots on the ground in most instances. Hence, the significance of SAR features in garlic extraction is underscored in this study, while the amalgamation of active and passive remote sensing techniques can enhance classification accuracy.

4.4. The Role of the Fragmentation Index

To assess the reliability of the GFI, the index was applied to multiple regions with varying environmental conditions to determine its consistency. Results showed that the GFI remained stable across different datasets, indicating its robustness. Additionally, previous studies [46] have successfully utilized similar indices, further supporting its reliability. However, it should be noted that the GFI may be influenced by factors such as data resolution and image quality. In areas with lower resolution or significant cloud cover, the accuracy of the index may be reduced.
Combining Table 5 and Table 6, it is evident that the fragmentation index holds predictive value for extraction outcomes, with less accurate results observed in areas exhibiting greater fragmentation. Thus, analyzing ground fragmentation allows for advance prediction of extraction accuracy. Additionally, examining garlic extraction results and analyzing actual planting fragmentation can inform garlic policy formulation, thereby reducing planting costs.

4.5. Summary and Future Directions

A comparison of classification results indicates that, except in Pizhou where the accuracy of the commonly used NDVI time series data was nearly equivalent to that of other features, the overall accuracy of NDVI fell short by more than three percentage points compared to the optimal feature combinations in other regions, with the gap exceeding five percentage points in most areas. Compared to previous studies, the method proposed in this paper achieves higher crop classification accuracy, with overall improvements ranging from 1% to 3% [64,65]. The extraction of garlic crop distribution and area within the study region marks a significant advancement in winter crop classification. This is particularly important because garlic, despite having a growth cycle similar to that of winter wheat, is often overlooked in studies focused on winter crop mapping [66,67].
In summary, the integration of time series data contributes to enhanced garlic classification accuracy, albeit with variations in performance across regions. While the spliced time series data improves classification accuracy in certain areas, its efficacy is diminished in a few other regions. Integration of optical data with Sentinel-1 SAR data aids accurate garlic identification [68,69], although its impact is less pronounced in regions with better classification results. Lastly, the newly devised garlic fragmentation index facilitates analysis of extraction accuracy disparities across regions.
However, this study has certain limitations and stochastic elements, which could serve as avenues for future research. A primary limitation lies in the reliance on pre-filtered remote sensing imagery for garlic classification. Despite yielding high accuracy results, this approach may still harbor errors that permit the filtration of a small number of garlic plots. Although this study judging anomalies via harmonic regression and patching time series data in the study area using previous data can effectively increase the validity of the data, this algorithm is not directly related to the surface data of the current year, and the integration of multi-sensor remote sensing data can be considered for study in the future, such as the STARFM algorithm. Additionally, green onions in the study area exhibit phenological characteristics similar to those of garlic. Although scallions are not commonly cultivated in the study area, their presence could potentially affect the accuracy of extraction.
When applying the classification method across different regions, several limitations arise. Firstly, varying agricultural practices and crop structures in different areas can pose significant challenges for accurate classification. The spectral characteristics of crops can differ regionally, necessitating the collection of additional samples to account for these variations. Secondly, when expanding the classification to a larger area, a more extensive and evenly distributed sample set is required, which often involves supplementary visual sampling, increasing the complexity of the process. Additionally, the increased scale leads to larger datasets, and processing these on platforms such as GEE may result in limitations such as memory constraints and extended computation times. To mitigate these issues, optimizing the algorithm is essential to prevent processing delays or timeouts. The garlic classification results from the six counties demonstrate that different regions may require distinct combinations of classification features to achieve optimal results. Therefore, when applying this method to different regions, it is advisable to first test the effectiveness of various classification features before proceeding with the classification. However, there also exists a robust set of features that provide high accuracy and can be utilized when precision requirements are less stringent.

5. Conclusions

In this study, we empirically demonstrated that combining active and passive remote sensing with multispectral time series data in the North China Plain enables the acquisition of garlic distribution and area. This method proves to be both economical and efficient, yielding high-quality results. By integrating active and passive imagery and utilizing time series data of red, green, blue, and short-wave infrared, we extracted six primary garlic production areas in the North China Plain using the random forest algorithm. These extractions were validated using ground sampling points and manually selected samples at a later stage. Our study identified key features conducive to enhancing classification results, highlighting the efficacy of combining active and passive remote sensing for improved accuracy, resulting in high classification accuracy (overall accuracy greater than 90%).
Furthermore, the garlic fragmentation index aids in elucidating disparities in extraction accuracy, offering insights into variations in garlic classification accuracy across different regions. The research data and platforms utilized in this study are freely accessible, holding significant potential for extension to broader processes, such as garlic extraction nationwide. Future endeavors could leverage higher-resolution satellites to further enhance accuracy.

Author Contributions

C.P. and C.D. conceived and designed the methodology; C.P. and B.G. performed the methodology; W.Z. and Y.C. collected the collection data; C.P. and W.W. analyzed the data; C.P. wrote the paper. All authors have read and agreed to the published version of the manuscript. All authors reviewed the manuscript.

Funding

This research was funded by Natural Science Foundation of Shandong Province grant number [ZR2021MD096].

Data Availability Statement

The raw data supporting the conclusions of this article will be made available by the authors on request.

Conflicts of Interest

The authors declare no conflict of interest.

Abbreviations

DITSDouble index time series Data
EVIEnhanced vegetation index
GDMGarlic distribution maps
GEEGoogle Earth Engine
GFIGarlic fragmentation index
GNDVIGreen normalized difference vegetation index
MSAVIModified soil-adjusted vegetation index
NDVINormalized Difference Vegetation Index
RFRandom forest
RGBSentinel-2’s red, green, and blue bands
SARSynthetic aperture radar
SAVISoil-adjusted vegetation index
SWIRShortwave infrared
VHVertical–horizontal polarization
VVVertical–vertical polarization
VV/VHThe ratio of vertical–vertical (VV) and vertical–horizontal (VH) polarization
WCMWinter crop distribution maps
me-smThe feature combination “me-sm” includes time series data from the DITS of MSAVI and EVI, SAR time series data, as well as time series data from the RGB and SWIR bands.
mg-smThe feature combination “mg-sm” includes time series data from the DITS of MSAVI and GNDVI, SAR time series data, as well as time series data from the RGB and SWIR bands.
mn-smThe feature combination “mn-sm” includes time series data from the DITS of MSAVI and NDVI, SAR time series data, as well as time series data from the RGB and SWIR bands.
sn-smThe feature combination “sn-sm” includes time series data from the DITS of SAVI and NDVI, SAR time series data, as well as time series data from the RGB and SWIR bands.
e-smThe feature combination “e-sm” includes EVI time series data, SAR time series data, as well as time series data from the RGB and SWIR bands.
n-srThe feature combination ‘n-sr’ includes NDVI, SAR, and Red time series data.
n-sThe feature combination “n-s” includes NDVI time series data, as well as SAR time series data.
n-The feature combination “n-” includes only NDVI time series data.
n-sswirThe feature combination ‘n-sswir’ includes NDVI, SAR, and SWIR time series data.

References

  1. Li, W.; Pan, J.; Peng, W.; Li, Y.; Li, C. Garlic Crops’ Mapping and Change Analysis in the Erhai Lake Basin Based on Google Earth Engine. Agronomy 2024, 14, 755. [Google Scholar] [CrossRef]
  2. Zhang, H.; Xiao, F.; He, W.; Chai, Z.; Ewe, H.-T. Multiyear Automated Mapping and Price Analysis of Garlic in Main Planting Areas of China Using Time-Series Remote Sensing Images. IEEE J. Sel. Top. Appl. Earth Observ. Remote Sens. 2022, 15, 5222–5233. [Google Scholar] [CrossRef]
  3. Zhang, H.; Kang, J.; Xu, X.; Zhang, L. Accessing the temporal and spectral features in crop type mapping using multi-temporal Sentinel-2 imagery: A case study of Yi’an County, Heilongjiang province, China. Comput. Electron. Agric. 2020, 176, 105618. [Google Scholar] [CrossRef]
  4. Wei, M.; Qiao, B.; Zhao, J.; Zuo, X. Application of remote sensing technology in crop estimation. In Proceedings of the 2018 IEEE 4th International Conference on Big Data Security on Cloud (BigDataSecurity), IEEE International Conference on High Performance and Smart Computing,(HPSC) and IEEE International Conference on Intelligent Data and Security (IDS), Omaha, NE, USA, 3–5 May 2018; pp. 252–257. [Google Scholar]
  5. Zhang, J.; Zhou, Q.; Shen, X.; Li, Y. Cloud detection in high-resolution remote sensing images using multi-features of ground objects. J. Geovis. Spat. Anal. 2019, 3, 14. [Google Scholar] [CrossRef]
  6. Song, X.-P.; Huang, W.; Hansen, M.C.; Potapov, P. An evaluation of Landsat, Sentinel-2, Sentinel-1 and MODIS data for crop type mapping. Sci. Remote Sens. 2021, 3, 100018. [Google Scholar] [CrossRef]
  7. Potgieter, A.; Apan, A.; Hammer, G.; Dunn, P. Early-season crop area estimates for winter crops in NE Australia using MODIS satellite imagery. ISPRS-J. Photogramm. Remote Sens. 2010, 65, 380–387. [Google Scholar] [CrossRef]
  8. Potgieter, A.B.; Lawson, K.; Huete, A.R. Determining crop acreage estimates for specific winter crops using shape attributes from sequential MODIS imagery. Int. J. Appl. Earth Obs. Geoinf. 2013, 23, 254–263. [Google Scholar] [CrossRef]
  9. Vintrou, E.; Ienco, D.; Bégué, A.; Teisseire, M. Data mining, a promising tool for large-area cropland mapping. IEEE J. Sel. Top. Appl. Earth Observ. Remote Sens. 2013, 6, 2132–2138. [Google Scholar] [CrossRef]
  10. Tian, H. Study on Mapping for Winter Wheat Distribution in China Based on Sentinel-1&2 Images. Ph.D. Thesis, University of Chinese Academy of Sciences in Partial Fulfillment of the Requirement, Beijing, China, 2019. [Google Scholar]
  11. Wardlow, B.D.; Egbert, S.L.; Kastens, J.H. Analysis of time-series MODIS 250 m vegetation index data for crop classification in the U.S. Central Great Plains. Remote Sens. Environ. 2007, 108, 290–310. [Google Scholar] [CrossRef]
  12. Ozdogan, M. The spatial distribution of crop types from MODIS data: Temporal unmixing using Independent Component Analysis. Remote Sens. Environ. 2010, 114, 1190–1204. [Google Scholar] [CrossRef]
  13. Fernandes, M.R.; Aguiar, F.C.; Silva, J.M.N.; Ferreira, M.T.; Pereira, J.M.C. Spectral discrimination of giant reed (Arundo donax L.): A seasonal study in riparian areas. ISPRS-J. Photogramm. Remote Sens. 2013, 80, 80–90. [Google Scholar] [CrossRef]
  14. Tscharntke, T.; Grass, I.; Wanger, T.C.; Westphal, C.; Batary, P. Beyond organic farming—Harnessing biodiversity-friendly landscapes. Trends Ecol. Evol. 2021, 36, 919–930. [Google Scholar] [CrossRef] [PubMed]
  15. Sun, C.; Fagherazzi, S.; Liu, Y. Classification mapping of salt marsh vegetation by flexible monthly NDVI time-series using Landsat imagery. Estuar. Coast. Shelf Sci. 2018, 213, 61–80. [Google Scholar] [CrossRef]
  16. Shao, Y.; Lunetta, R.S.; Wheeler, B.; Iiames, J.S.; Campbell, J.B. An evaluation of time-series smoothing algorithms for land-cover classifications using MODIS-NDVI multi-temporal data. Remote Sens. Environ. 2016, 174, 258–265. [Google Scholar] [CrossRef]
  17. Jia, K.; Liang, S.; Zhang, L.; Wei, X.; Yao, Y.; Xie, X. Forest cover classification using Landsat ETM+ data and time series MODIS NDVI data. Int. J. Appl. Earth Obs. Geoinf. 2014, 33, 32–38. [Google Scholar] [CrossRef]
  18. Ahmed, T.; Singh, D. Probability density functions based classification of MODIS NDVI time series data and monitoring of vegetation growth cycle. Adv. Space Res. 2020, 66, 873–886. [Google Scholar] [CrossRef]
  19. Arvor, D.; Jonathan, M.; Meirelles, M.S.P.; Dubreuil, V.; Durieux, L. Classification of MODIS EVI time series for crop mapping in the state of Mato Grosso, Brazil. Int. J. Remote Sens. 2011, 32, 7847–7871. [Google Scholar] [CrossRef]
  20. Jia, K.; Liang, S.; Wei, X.; Yao, Y.; Su, Y.; Jiang, B.; Wang, X. Land cover classification of Landsat data with phenological features extracted from time series MODIS NDVI data. Remote Sens. 2014, 6, 11518–11532. [Google Scholar] [CrossRef]
  21. Schmidt, M.; Udelhoven, T.; Gill, T.; Roder, A. Long term data fusion for a dense time series analysis with MODIS and Landsat imagery in an Australian Savanna. J. Appl. Remote Sens. 2012, 6, 18. [Google Scholar] [CrossRef]
  22. Sibanda, M.; Murwira, A. The use of multi-temporal MODIS images with ground data to distinguish cotton from maize and sorghum fields in smallholder agricultural landscapes of Southern Africa. Int. J. Remote Sens. 2012, 33, 4841–4855. [Google Scholar] [CrossRef]
  23. Sun, H.; Xu, A.; Lin, H.; Zhang, L.; Mei, Y. Winter wheat mapping using temporal signatures of MODIS vegetation index data. Int. J. Remote Sens. 2012, 33, 5026–5042. [Google Scholar] [CrossRef]
  24. Liu, H.Q.; Huete, A. A feedback based modification of the NDVI to minimize canopy background and atmospheric noise. IEEE Trans. Geosci. Remote Sens. 1995, 33, 457–465. [Google Scholar] [CrossRef]
  25. Qi, J.; Chehbouni, A.; Huete, A.R.; Kerr, Y.H.; Sorooshian, S. A modified soil adjusted vegetation index. Remote Sens. Environ. 1994, 48, 119–126. [Google Scholar] [CrossRef]
  26. Tian, H.; Li, W.; Wu, M.; Huang, N.; Li, G.; Li, X.; Niu, Z. Dynamic monitoring of the largest freshwater lake in China using a new water index derived from high spatiotemporal resolution Sentinel-1A data. Remote Sens. 2017, 9, 521. [Google Scholar] [CrossRef]
  27. Zhu, Z.; Woodcock, C.E. Object-based cloud and cloud shadow detection in Landsat imagery. Remote Sens. Environ. 2012, 118, 83–94. [Google Scholar] [CrossRef]
  28. Chen, Y.; Guo, Y.; Qiao, L.; Xia, H. Coupling optical and SAR imagery for automatic garlic mapping. Front. Sustain. Food Syst. 2022, 6, 1007568. [Google Scholar] [CrossRef]
  29. Howard, D.M.; Wylie, B.K. Annual crop type classification of the US Great Plains for 2000 to 2011. Photogramm. Eng. Remote Sens 2014, 80, 537–549. [Google Scholar] [CrossRef]
  30. Zhang, J.; Feng, L.; Yao, F. Improved maize cultivated area estimation over a large scale combining MODIS–EVI time series data and crop phenological information. ISPRS-J. Photogramm. Remote Sens. 2014, 94, 102–113. [Google Scholar] [CrossRef]
  31. Houborg, R.; McCabe, M.F. A hybrid training approach for leaf area index estimation via Cubist and random forests machine-learning. ISPRS-J. Photogramm. Remote Sens. 2018, 135, 173–188. [Google Scholar] [CrossRef]
  32. Rodriguez-Galiano, V.F.; Ghimire, B.; Rogan, J.; Chica-Olmo, M.; Rigol-Sanchez, J.P. An assessment of the effectiveness of a random forest classifier for land-cover classification. ISPRS-J. Photogramm. Remote Sens. 2012, 67, 93–104. [Google Scholar] [CrossRef]
  33. Sishodia, R.P.; Ray, R.L.; Singh, S.K. Applications of Remote Sensing in Precision Agriculture: A Review. Remote Sens. 2020, 12, 3136. [Google Scholar] [CrossRef]
  34. Peña, M.A.; Brenning, A. Assessing fruit-tree crop classification from Landsat-8 time series for the Maipo Valley, Chile. Remote Sens. Environ. 2015, 171, 234–244. [Google Scholar] [CrossRef]
  35. Jia, K.; Li, Q.; Tian, Y.; Wu, B.; Zhang, F.; Meng, J. Accuracy Improvement of Spectral Classification of Crop Using Microwave Backscatter Data. Spectrosc. Spectr. Anal. 2011, 31, 483–487. [Google Scholar]
  36. Jia, K.; Wu, B.; Li, Q. Crop classification using HJ satellite multispectral data in the North China Plain. J. Appl. Remote Sens. 2013, 7, 073576. [Google Scholar] [CrossRef]
  37. Jiang, Y.; Chen, B.; Huang, Y.; Cui, J.; Guo, Y. Crop Planting Area Extraction based on Google Earth Engine and NDVI Time SeriesDifference Index. J. Geo-Inf. Sci. 2021, 23, 938–947. [Google Scholar]
  38. Mou, X.; Li, H.; Huang, C.; Liu, Q.; Liu, G. Application progress of Google Earth Engine in land use and land cover remote sensinginformation extraction. Inst. Geogr. Sci. Nat. Resour. 2021, 33, 1–10. [Google Scholar]
  39. Tian, S.; Zhang, X.; Tian, J.; Sun, Q. Random Forest Classification of Wetland Landcovers from Multi-Sensor Data in the Arid Region of Xinjiang, China. Remote Sens. 2016, 8, 954. [Google Scholar] [CrossRef]
  40. Liu, J.; Feng, Q.; Gong, J.; Zhou, J.; Liang, J.; Li, Y. Winter wheat mapping using a random forest classifier combined with multi-temporal and multi-sensor data. Int. J. Digit. Earth 2017, 11, 783–802. [Google Scholar] [CrossRef]
  41. Belgiu, M.; Drăguţ, L. Random forest in remote sensing: A review of applications and future directions. ISPRS-J. Photogramm. Remote Sens. 2016, 114, 24–31. [Google Scholar] [CrossRef]
  42. Dong, C.; Zhao, G.; Qin, Y.; Wan, H. Area extraction and spatiotemporal characteristics of winter wheat-summer maize in Shandong Province using NDVI time series. PLoS ONE 2019, 14, e0226508. [Google Scholar] [CrossRef]
  43. Breiman, L. Random forests. Mach. Learn. 2001, 45, 5–32. [Google Scholar] [CrossRef]
  44. Li, X.; Ou, M.; Ma, X. Analysis on Impact of Fragmentation Based on LandscapeIndex to Cultivated Land Use Efficiency—A Case on LixiaheDistrict in Yangzhou City. J. Nat. Resour. 2011, 26, 1758–1767. [Google Scholar]
  45. Wang, X.; Bu, R.; Hu, Y.; Xiao, D. Analysis on landscape Fragment of Liaohe delta wetlands. Chin. J. Appl. Ecol. 1996, 7, 299–304. [Google Scholar]
  46. Chen, W.; Feng, X.; Ma, R.; Hong, Q. Method of Cultivated Land Fragmentation Evaluation and Empirical Research: A Case of Ningbo City in Zhejiang Province. China Land Sci. 2016, 30, 80–87. [Google Scholar]
  47. Dong, C.; Zhao, G. Influence of Time Series Data Quality on Land Cover Classification Accuracy. Remote Sens. Technol. Appl. 2020, 35, 558–566. [Google Scholar] [CrossRef]
  48. Guo, Y.; Xia, H.; Zhao, X.; Qiao, L.; Qin, Y. Estimate the Earliest Phenophase for Garlic Mapping Using Time Series Landsat 8/9 Images. Remote Sens. 2022, 14, 4476. [Google Scholar] [CrossRef]
  49. Cai, Y.; Guan, K.; Peng, J.; Wang, S.; Seifert, C.; Wardlow, B.; Li, Z. A high-performance and in-season classification system of field-level crop types using time-series Landsat data and a machine learning approach. Remote Sens. Environ. 2018, 210, 35–47. [Google Scholar] [CrossRef]
  50. Guo, Y.; Jia, X.; Paull, D.; Benediktsson, J.A. Nomination-favoured opinion pool for optical-SAR-synergistic rice mapping in face of weakened flooding signals. ISPRS-J. Photogramm. Remote Sens. 2019, 155, 187–205. [Google Scholar] [CrossRef]
  51. Cai, Y.; Li, X.; Zhang, M.; Lin, H. Mapping wetland using the object-based stacked generalization method based on multi-temporal optical and SAR data. Int. J. Appl. Earth Obs. Geoinf. 2020, 92, 102164. [Google Scholar] [CrossRef]
  52. Walker, J.J.; de Beurs, K.M.; Wynne, R.H. Dryland vegetation phenology across an elevation gradient in Arizona, USA, investigated with fused MODIS and Landsat data. Remote Sens. Environ. 2014, 144, 85–97. [Google Scholar] [CrossRef]
  53. Tian, H.; Wang, Y.; Chen, T.; Zhang, L.; Qin, Y. Early-Season Mapping of Winter Crops Using Sentinel-2 Optical Imagery. Remote Sens. 2021, 13, 3822. [Google Scholar] [CrossRef]
  54. Zhu, Z.; Fu, Y.; Woodcock, C.E.; Olofsson, P.; Vogelmann, J.E.; Holden, C.; Wang, M.; Dai, S.; Yu, Y. Including land cover change in analysis of greenness trends using all available Landsat 5, 7, and 8 images: A case study from Guangzhou, China (2000–2014). Remote Sens. Environ. 2016, 185, 243–257. [Google Scholar] [CrossRef]
  55. Wardlow, B.D.; Egbert, S.L. Large-area crop mapping using time-series MODIS 250 m NDVI data: An assessment for the U.S. Central Great Plains. Remote Sens. Environ. 2008, 112, 1096–1116. [Google Scholar] [CrossRef]
  56. Ghulam, A.; Li, Z.-L.; Qin, Q.; Yimit, H.; Wang, J. Estimating crop water stress with ETM+ NIR and SWIR data. Agric. For. Meteorol. 2008, 148, 1679–1695. [Google Scholar] [CrossRef]
  57. Xiao, X.; Boles, S.; Frolking, S.; Li, C.; Babu, J.Y.; Salas, W.; Moore, B., III. Mapping paddy rice agriculture in South and Southeast Asia using multi-temporal MODIS images. Remote Sens. Environ. 2006, 100, 95–113. [Google Scholar] [CrossRef]
  58. Guo, B.; Yang, F.; Fan, Y.; Han, B.; Chen, S.; Yang, W. Dynamic monitoring of soil salinization in Yellow River Delta utilizing MSAVI–SI feature space models with Landsat images. Environ. Earth Sci. 2019, 78, 1–10. [Google Scholar] [CrossRef]
  59. Huete, A.R. A soil-adjusted vegetation index (SAVI). Remote Sens. Environ. 1988, 25, 295–309. [Google Scholar] [CrossRef]
  60. Matsushita, B.; Yang, W.; Chen, J.; Onda, Y.; Qiu, G. Sensitivity of the enhanced vegetation index (EVI) and normalized difference vegetation index (NDVI) to topographic effects: A case study in high-density cypress forest. Sensors 2007, 7, 2636–2651. [Google Scholar] [CrossRef]
  61. Mascolo, L.; Lopez-Sanchez, J.M.; Vicente-Guijalba, F.; Nunziata, F.; Migliaccio, M.; Mazzarella, G. A complete procedure for crop phenology estimation with PolSAR data based on the complex Wishart classifier. IEEE Trans. Geosci. Remote Sens. 2016, 54, 6505–6515. [Google Scholar] [CrossRef]
  62. Wang, D.; Zhou, Q.; Chen, Z.; Liu, J. Research advances on crop identification using synthetic aperture radar. Trans. Chin. Soc. Agric. Eng. 2014, 30, 203–212. [Google Scholar]
  63. Veloso, A.; Mermoz, S.; Bouvet, A.; Le Toan, T.; Planells, M.; Dejoux, J.-F.; Ceschia, E. Understanding the temporal behavior of crops using Sentinel-1 and Sentinel-2-like data for agricultural applications. Remote Sens. Environ. 2017, 199, 415–426. [Google Scholar] [CrossRef]
  64. Yang, N.; Liu, D.; Feng, Q.; Xiong, Q.; Zhang, L.; Ren, T.; Zhao, Y.; Zhu, D.; Huang, J. Large-Scale Crop Mapping Based on Machine Learning and Parallel Computation with Grids. Remote Sens. 2019, 11, 1500. [Google Scholar] [CrossRef]
  65. Zhang, X.-W.; Liu, J.-F.; Qin, Z.; Qin, F. Winter wheat identification by integrating spectral and temporal information derived from multi-resolution remote sensing data. J. Integr. Agric. 2019, 18, 2628–2643. [Google Scholar] [CrossRef]
  66. Pan, Y.; Li, L.; Zhang, J.; Liang, S.; Zhu, X.; Sulla-Menashe, D. Winter wheat area estimation from MODIS-EVI time series data using the Crop Proportion Phenology Index. Remote Sens. Environ. 2012, 119, 232–242. [Google Scholar] [CrossRef]
  67. Tao, J.-B.; Wu, W.-B.; Zhou, Y.; Wang, Y.; Jiang, Y. Mapping winter wheat using phenological feature of peak before winter on the North China Plain based on time-series MODIS data. J. Integr. Agric. 2017, 16, 348–359. [Google Scholar] [CrossRef]
  68. Steinhausen, M.J.; Wagner, P.D.; Narasimhan, B.; Waske, B. Combining Sentinel-1 and Sentinel-2 data for improved land use and land cover mapping of monsoon regions. Int. J. Appl. Earth Obs. Geoinf. 2018, 73, 595–604. [Google Scholar] [CrossRef]
  69. Zhang, H.; Xu, R. Exploring the optimal integration levels between SAR and optical data for better urban land cover mapping in the Pearl River Delta. Int. J. Appl. Earth Obs. Geoinf. 2018, 64, 87–95. [Google Scholar] [CrossRef]
Figure 1. Diagram of garlic fertility period.
Figure 1. Diagram of garlic fertility period.
Applsci 14 08141 g001
Figure 2. Overview of the study area.
Figure 2. Overview of the study area.
Applsci 14 08141 g002
Figure 3. Workflow of garlic extraction based on active–passive remote sensing time series data.
Figure 3. Workflow of garlic extraction based on active–passive remote sensing time series data.
Applsci 14 08141 g003
Figure 4. The time series of Sentinel-2 NDVI for garlic and winter wheat. The curves in the figure represent the average NDVI values derived from the samples, while the upper and lower boundaries indicate the standard deviation.
Figure 4. The time series of Sentinel-2 NDVI for garlic and winter wheat. The curves in the figure represent the average NDVI values derived from the samples, while the upper and lower boundaries indicate the standard deviation.
Applsci 14 08141 g004
Figure 5. Time series of Sentinel-1 curves for garlic and winter wheat. (a) Time series data on the ratio of vertical–vertical (VV) and vertical–horizontal (VH) polarization in garlic and winter wheat. (b) Time series data of VV and VH polarization for garlic and winter wheat. The curves in the figure represent the average values derived from the samples.
Figure 5. Time series of Sentinel-1 curves for garlic and winter wheat. (a) Time series data on the ratio of vertical–vertical (VV) and vertical–horizontal (VH) polarization in garlic and winter wheat. (b) Time series data of VV and VH polarization for garlic and winter wheat. The curves in the figure represent the average values derived from the samples.
Applsci 14 08141 g005
Figure 6. Winter crop distribution maps. This figure illustrates the winter vegetation classification results for each county within the study area. Green indicates areas of winter vegetation, while gray represents other land cover types.
Figure 6. Winter crop distribution maps. This figure illustrates the winter vegetation classification results for each county within the study area. Green indicates areas of winter vegetation, while gray represents other land cover types.
Applsci 14 08141 g006aApplsci 14 08141 g006b
Figure 7. Garlic distribution maps. This figure illustrates the garlic classification results for each county within the study area. Blue indicates areas of garlic, while gray represents other land cover types.
Figure 7. Garlic distribution maps. This figure illustrates the garlic classification results for each county within the study area. Blue indicates areas of garlic, while gray represents other land cover types.
Applsci 14 08141 g007
Table 1. Combination of classification characteristics.
Table 1. Combination of classification characteristics.
Feature SetsMSAVISAVIEVIGNDVINDVISARMulti-Bands
① me-sm×××
② mg-sm×××
③ mn-sm×××
④ sn-sm×××
⑤ e-sm××××
⑥ n-sm××××
⑦ n-s×××××
⑧ n-××××××
⑨ n-sswir××××SWIR (only)
In the table, MSAVI, SAVI, EVI, GNDVI, and NDVI refer to the modified soil-adjusted vegetation index, soil-adjusted vegetation index, enhanced vegetation index, green normalized difference vegetation index, and normalized difference vegetation index, respectively. SAR represents time series data from vertical–vertical (VV) and vertical–horizontal (VH) polarizations, along with their ratio. Multi-bands refer to the time series data of the red, green, blue, and shortwave infrared bands. ‘SWIR (only)’ indicates that the feature combination n-sswir includes only the SWIR band from the four bands represented by multi-bands in the time series data. A ‘√’ indicates the presence of the corresponding categorical feature in the feature combination, while a ‘×’ denotes its absence.
Table 2. Garlic distribution map verification points by county.
Table 2. Garlic distribution map verification points by county.
RegionClass ResultsJinxiangLaiwuLanlingPizhouQixianZhongmou
Ground Truth (Pixels)Garlic321149210343236196
Other255355305349280285
All576504515692516481
Table 3. Results of accuracy validation for winter vegetation map.
Table 3. Results of accuracy validation for winter vegetation map.
RegionWinter CropsOtherGround Truth (Pixels)Overall
ResultsUser’s AccuracyProducer’s AccuracyUser’s AccuracyProducer’s AccuracyWinter CropsOtherOverall AccuracyKappa
Jinxiang99.9599.8699.3799.803804297997.850.961
Laiwu98.2797.9097.1392.121335133296.060.921
Lanling97.2696.2497.3195.725361410498.430.964
Pizhou98.6798.2199.9591.266487622194.900.903
Qixian97.6299.2896.2696.123016361294.500.893
Zhongmou97.2392.0396.5296.893648326392.820.878
Table 4. Tables of overall accuracy and Kappa coefficient for garlic extraction.
Table 4. Tables of overall accuracy and Kappa coefficient for garlic extraction.
RegionJinxiangLaiwuLanlingPizhouQixianZhongmou
ResultsOAKappaOAKappaOAKappaOAKappaOAKappaOAKappa
1.me-sr92.5085.1892.7285.8294.1688.3187.3270.9290.9584.0494.7291.07
2.mg-sr92.1284.4791.2882.7993.5787.1190.7082.3989.3981.3293.8389.57
3.mn-sr92.1284.4794.5483.4193.5887.1588.2677.8489.5581.5694.2790.28
4.sn-sr84.2484.7992.5485.3793.5587.1187.8176.9889.7481.8694.4990.65
5.e-sr92.5085.1893.4887.3192.8085.6787.8176.8489.9882.3995.3992.19
6.n-sr92.3184.7994.1682.4693.3786.7388.4178.1189.9482.2194.9591.43
7.n-r90.8081.7090.4181.4794.5589.0087.8177.0089.1780.9588.1680.13
8.n-89.4979.0786.5274.4090.6381.5287.7976.9284.0971.6077.4162.97
9.n-sswir89.3178.6489.3679.2890.8681.4888.2177.7087.8278.5388.8281.44
In the table, “OA” represents overall accuracy, with bold values indicating that this method ranks among the top three for accuracy in the corresponding region.
Table 5. Tables of user’s accuracy and producer’s accuracy for garlic extraction.
Table 5. Tables of user’s accuracy and producer’s accuracy for garlic extraction.
RegionJinxiangLaiwuLanlingPizhouQixianZhongmou
ResultsUAPAUAPAUAPAUAPAUAPAUAPA
1.me-sr91.7797.2087.5894.3794.6191.9195.5886.3292.2891.1094.5096.41
2.mg-sr91.4596.5784.9193.7592.8292.3894.4193.8690.2590.2592.5896.39
3.mn-sr91.4596.5785.8193.0193.6691.4393.0590.0191.7489.4193.9496.37
4.sn-sr91.5087.2087.1095.0793.2792.3893.0389.7790.6890.6894.5096.41
5.e-sr91.7797.2087.2695.8091.0892.3893.3390.0191.4590.6896.3996.89
6.n-sr91.5097.2085.0692.9193.2491.9192.7790.6490.7291.1095.4396.41
7.n-r89.1497.2082.4295.1196.591.9192.4589.4790.5689.4186.8594.87
8.n-88.3596.8878.1991.4991.7990.4892.1989.5081.5087.7179.2992.78
9.n-sswir87.8596.8981.0190.7893.3086.1992.7790.0187.1989.4189.8695.39
“UA” refers to user accuracy, and “PA” denotes producer accuracy, with bolded values indicating that this method ranks among the top three for accuracy in the corresponding region.
Table 6. Classification contribution scale.
Table 6. Classification contribution scale.
RankingJinxiangLaiwuLanlingPizhouQixianZhongmou
122/05 RED22/02 SWIR22/02 SWIR22/05 GNDVI22/03 SWIR22/03 EVI
222/05 GREEN22/02 BLUE22/02 BLUE21/10 MSAVI22/06 EVI22/02 BLUE
322/06 RED21/12 EVI22/03 SWIR22/06 GNDVI22/04 SWIR22/05 GREEN
422/04 VV22/05 RED22/06 NDVI22/04 GNDVI21/10 MSAVI21/12 EVI
522/06 GREEN22/02 GREEN22/03 BLUE21/11 MSAVI22/05 SWIR22/06 GREEN
622/05 BLUE21/11 EVI22/04 SWIR21/12 MSAVI22/06 SWIR22/02 GREEN
722/05 EVI22/03 SWIR21/10 NDVI22/02 GNDVI22/03 EVI22/02 SWIR
822/05 SWIR22/06 EVI22/03 GREEN22/03 GNDVI22/04 RED22/06 SWIR
922/06 BLUE22/06 RED22/04 RED22/01 MSAVI22/04 EVI22/03 GREEN
1022/06 SWIR22/02 RED22/02 GREEN21/10 VH/VV22/04 GREEN22/03 SWIR
1122/05 VV22/05 EVI22/06 SWIR21/11 VH/VV22/06 GREEN22/04 GREEN
1222/04 GREEN22/02 EVI22/05 SWIR22/03 VH/VV22/01 MSAVI22/03 BLUE
1322/04 RED22/04 SWIR22/06 RED21/12 VH/VV22/04 BLUE22/02 EVI
1422/04 VH22/03 GREEN22/03 RED22/04 VH/VV22/06 BLUE22/04 SWIR
1522/05 RED22/06 GREEN22/06 BLUE22/05 VH/VV22/05 GREEN22/04 EVI
In the table, RED, GREEN, BLUE, and SWIR represent different bands of Sentinel-2 remote sensing imagery. VV and VH refer to the horizontal–horizontal polarization and horizontal–vertical polarization of Sentinel-1, respectively. VH/VV denotes the ratio between the VH and VV polarizations.
Table 7. Table of garlic crushing by county.
Table 7. Table of garlic crushing by county.
RegionGFIUAExampleExample Coordinate
Jinxiang0.117591.77Applsci 14 08141 i001116°23′03″ E 34°57′36″ N
Laiwu0.826187.26Applsci 14 08141 i002117°28′57″ E 34°16′40″ N
Lanling0.100192.82Applsci 14 08141 i003118°05′41″ E 34°45′36″ N
Pizhou0.337894.41Applsci 14 08141 i004117°46′38″ E 34°57′36″ N
Qixian0.169192.28Applsci 14 08141 i005114°53′37″ E 34°32′05″ N
Zhongmou0.118096.39Applsci 14 08141 i006113°57′22″ E 34°50′24″ N
GFI in the table refers to garlic fragmentation and UA represents the consumer accuracy of garlic extraction.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Peng, C.; Gao, B.; Wang, W.; Zhu, W.; Chen, Y.; Dong, C. Extraction of Garlic in the North China Plain Using Multi-Feature Combinations from Active and Passive Time Series Data. Appl. Sci. 2024, 14, 8141. https://doi.org/10.3390/app14188141

AMA Style

Peng C, Gao B, Wang W, Zhu W, Chen Y, Dong C. Extraction of Garlic in the North China Plain Using Multi-Feature Combinations from Active and Passive Time Series Data. Applied Sciences. 2024; 14(18):8141. https://doi.org/10.3390/app14188141

Chicago/Turabian Style

Peng, Chuang, Binglong Gao, Wei Wang, Wenji Zhu, Yongqi Chen, and Chao Dong. 2024. "Extraction of Garlic in the North China Plain Using Multi-Feature Combinations from Active and Passive Time Series Data" Applied Sciences 14, no. 18: 8141. https://doi.org/10.3390/app14188141

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop