Next Article in Journal
MR-YOLO: An Improved YOLOv5 Network for Detecting Magnetic Ring Surface Defects
Previous Article in Journal
Emissions Production by Exhaust Gases of a Road Vehicle’s Starting Depending on a Road Gradient
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Landscape Classification System Based on RKM Clustering for Soil Survey UAV Images–Case Study of the Small Hilly Areas in Jurong City

College of Resources and Environmental Sciences, Nanjing Agricultural University, Nanjing 210095, China
*
Author to whom correspondence should be addressed.
Sensors 2022, 22(24), 9895; https://doi.org/10.3390/s22249895
Submission received: 18 October 2022 / Revised: 8 December 2022 / Accepted: 8 December 2022 / Published: 15 December 2022
(This article belongs to the Section Remote Sensors)

Abstract

:
With the advantages of high accuracy, low cost, and flexibility, Unmanned Aerial Vehicle (UAV) images are now widely used in the fields of land survey, crop monitoring, and soil property prediction. Since the distribution of soil and landscape are closely related, this study makes use of the advantages of UAV images to classify the landscape to build a landscape classification system for soil investigation. Firstly, land use, object, and topographic factor were selected as landscape factors based on soil-forming factors. Then, based on multispectral images and Digital Elevation Models (DEM) acquired by UAV, object-oriented classification of different landscape factors was carried out. Additionally, we selected 432 sample data and validation data from the field survey. Finally, the landscape factor classification results were superimposed to obtain the landscape unit applicable to the system classification. The landscape classification system oriented to the soil survey was constructed by clustering 11,897 landscape units through the rough K-mean clustering algorithm. Compared to K-mean clustering, the rough K-mean clustering was better, with a Silhouette Coefficient of 0.26247 significantly higher than that of K-mean clustering. From the classification results, it can be found that the overall classification results are somewhat fragmented, but the landscape boundaries at the small area scale are consistent with the actual situation and the fragmented small spots are less. Comparing the small number of landscape boundaries obtained from the actual survey, we can find that the landscape boundaries in the landscape classification map are generally consistent with the actual landscape boundaries. In addition, through the analysis of two soil profile data within a landscape category, we found that the identified soil type of soil formation conditions and the landscape factor type of the landscape category is approximately the same. Therefore, this landscape classification system can be effectively used for soil surveys, and this landscape classification system is important for soil surveys to carry out the selection of survey routes, the setting of profile points, and the determination of soil boundaries.

1. Introduction

Soil survey is a basic method for the field study of soils, through which we can understand the classification and distribution of soils. During soil surveys in the field, we need to dig up the soil and analyze and compare the soil profile pattern and the surrounding environment, which often requires many resources to excavate the soil profile, so we need a more convenient method to conduct soil surveys. Landscape and soil are correlated [1], and Nikiforova [2] proposed to create a system of classification of natural soil landscapes, considering natural soil is a self-sufficient system, but also as a subset of natural landscape systems, i.e., soil as a derived landscape element. We can consider that the boundary of soil and the boundary of landscape overlap under certain circumstances, so a classification of the landscape under certain conditions also implies a classification of soil. Nowadays, studies have been conducted to systematically classify soils at the soil series level through landscape variability [3,4,5,6,7], and relevant technical guidelines for soil surveys have been published [8] to guide practical soil surveys.
In recent years, Unmanned Aerial Vehicle (UAV) technology has developed rapidly, has the advantage of the flexibility and low cost, and has been widely used in soil and environmental resource investigation. Compared with satellite images, which have the disadvantage of errors due to cloud cover and atmospheric reflection, UAV images have advantages. A large number of studies [9,10] have been conducted using UAVs as piggyback platforms to predict soil properties with generally high prediction accuracy. On the one hand, traditional soil surveys are labor-intensive, while UAVs have lower costs compared to traditional soil surveys, and on the other hand, UAVs serve as a good piggyback platform, and the advanced sensors on board UAVs can be used to obtain multi-dimensional data that cannot be obtained by traditional soil surveys. Ma [11] achieved the identification of Seriphidium transiliense, Ceratocarpus arenarius, and bare ground by Red, Red Edge, and Near InfraRed (NIR) bands obtained from UAV imagery. Yan [12] used high-resolution UAV imagery for vegetation cover prediction. These predicted or identified indicators are within the scope of soil surveys, so it is necessary to introduce UAVs into soil surveys. In addition, the results of these studies are all highly accurate, which provides a basis for high-precision soil classification.
Some studies have used UAVs for studies such as landscape pattern analysis [13,14] and even landscape classification [15], but in these studies only the distribution of the landscape pattern was considered, focusing purely on a single attribute of topography or ground cover as the main component of the landscape. Therefore, these studies may only be applied to national land surveys but not to soil classification. In a landscape classification system oriented to soil classification, it is important to include not only factors that facilitate the differentiation of landscapes, but also factors that affect soil occurrence. There are many approaches to landscape classification, including decision trees for unsupervised classification [3,4,5], convolutional neural networks for supervised classification [16,17], Landscape Character Assessment (LCA) [18], and Parameter Configuration [19,20], all of which can be used for landscape classification. However, most of these methods rely heavily on personal experience in understanding and evaluating the landscape, and therefore cannot provide a quantitative description of the composition of the landscape. Some metrics can be used to describe the landscape by quantification. Harrison [19] proposes the use of geomorphic, a physical landform classification scheme, to visualize and characterize the parameter configuration landscape. However, these parameters can only be used to reflect differences between landscapes and cannot describe the composition of the landscape, which is unsuitable for soil surveys.
Therefore, this study aims to classify landscapes based on high-precision images from UAVs. The quantifiable components of the soil landscape (i.e., landscape factors) are selected and classified based on soil-forming factors and related soil classification studies. The landscape factors are then superimposed to obtain landscape units. Finally, all landscape units are clustered to obtain a soil survey-oriented landscape classification system. Based on the correlation between soil and landscape, the final results can provide a reference for determining soil types and boundaries, and also provide an important basis for route selection and profile point setting for field soil surveys.

2. Materials and Methods

2.1. Study Area

The study area is located in the Longshan Reservoir catchment and is northeast and southwest of Jurong City, Jiangsu Province, People’s Republic of China, with an area of 9.02 square kilometers, including 7.46 square kilometers of land area. The study area is in latitude 32°2′30.66″ N to 32°6′10.08″ N and longitude 119°13′29.55″ E to 119°16′50.69″ E (Figure 1). It has a north subtropical monsoon climate. Rainfall is abundant, with an annual precipitation of 1088.2 mm, and the annual average dryness is around 1.0. The average annual temperature is 15.6 °C, the average daily temperature above 10 °C for the crop growing period is 226 days on average, the frost-free period is 229 days, and the crop maturity system is biannual [21]. Crops are mainly rice, wheat, and corn. The southwest of the study area is plain, with absolute elevation ranging from 20 m to 70 m, and the plain area gradually increases in elevation from southeast to northwest. The northeast of the study area is hilly, and its easternmost part is GaoLi Mountain, with absolute elevation ranging from 50 m to 130 m. The topography of the study area is complex, with a large number of hills and terraces, and the development of gullies and valleys is between the hills and terraces with drastic elevation changes. The soil-forming parent material in the study area is mainly the lower Shu loess parent material, rocky weathering slope accumulation parent material, and alluvial parent material. The slopes and foothills below the middle of the stony hills are mostly rocky weathered slope deposit parent material, and the slopes are mostly in the range of 6° to 15°. The parent material type of the muddy hills, the hillocks, and the gully land is mainly XiaShu loess with deep loess layer. The parent material type of terraces is mainly alluvial parent material [3].

2.2. Data Collection and Data Pre-Processing

The experiment uses the DJI Phantom 4 Multispectral Edition as the flight platform. The UAV is equipped with an all-in-one spectral imaging system that integrates six cameras: visible, red, blue, green, red edge, and near infrared, which are responsible for visible and multispectral imaging, respectively. The UAV is also equipped with Real-Time Kinematic (RTK) and TimeSync time synchronization systems, which can achieve centimeter-level positioning. Among them, the camera resolution is about 2.12 million pixels, the focal length is 5.74 mm, and the spectral parameters are shown in Table 1.
This study was conducted between late December 2021 and early January 2022, with clear and cloudless weather from 10:00 to 14:00 on the same day. The flight altitude was 120 m, the heading overlap rate was 80%, the side overlap rate was 60%, the whole flight was automatic, and the whiteboard was taken between flights to obtain the corrected images from the UAV multispectral camera. At the same time, sample points were collected on the same day, the landscape of the sample points was observed and photographed, the coordinates of the sample points were located using Global Positioning System (GPS), and the information and coordinates of the sample points such as land use type, object type, and microtopography were recorded. A total of 432 sample points were collected, as shown in Figure 1, including 308 training samples. Twelve soil profile sample sites were collected between July 2022 and August 2022. The morphology of the soil profile was observed, and the soil genetic horizons were classified according to differences in properties such as compactness, color, and redox degree of the soil, while the soil-forming environment, GPS, and the morphological characteristics of soil genetic horizon were recorded. Morphological characteristics of the soil genetic horizon include indicators of genetic horizon thickness, boundary, color, root, texture, structure, nascent body, invasive body, etc.
DJI Terra software was used for stitching and radiometric correction of the UAV images, and five single-band images of blue, green, red, red edge, near infrared, and Digital Elevation Model (DEM) were obtained. In ENVI5.3, the five single-band images were band-synthesized and resampled to 0.1 m, and the DEM was also resampled to 0.1 m.

2.3. Selection and Classification of Landscape Factors

According to previous studies, landscape factors oriented to soil classification should include land use type, object type, and topography [3,4,5]. In Simensen’s [22] study on the correlation of landscape factors, topographies, ground cover, vegetation, topography, and soil were ranked in the top correlations, respectively. Some landscape classifications combine landforms/surface relief with land use types to classify landscapes [23,24]. In many bottom-up constructed landscape classification systems, the lowest landscape unit is composed of an overlay of topography, soil genotype, major vegetation community complex, and current land cover type [24]. In this study, three categories of land use type, object type, and topography were selected as landscape factors for this landscape classification system by combining the previous research results and research objectives.
The UAV hyperspectral images were subjected to the object-oriented classification of land use types and object types in eCognition Developer 9.0. Image segmentation was performed using multilevel segmentation, and the Estimation of Scale Parameter (ESP) scale evaluation tool was used to obtain the best segmentation effect scale for features [25], and the classifier algorithm was a random forest algorithm, and feature parameters were selected for land use type classification and object type classification concerning other studies [25,26,27,28]: the mean and standard deviation value of each band, normalized difference vegetation Index (NDVI), normalized difference water index (NDWI), red edge normalized difference vegetation index (REDNDVI), Length/Width, shape index, and gray level co-occurrence matrix (GLCM) to calculate the texture features [29], and the specific calculation formula are shown in Table 2.
Topography classification using the Geomorphons (GM) topography classification method [30], with the search radius of the search window set to 20 [31] and the flatness threshold set to 5° [32], can achieve better classification results. The lower search radius reflects the terrain at a small scale, i.e., microtopography. The slope reference includes flat (≤2°), gentle slope (2 to 6°), medium gentle slope (6° to 15°), medium slope (15° to 25°), and steep slope (≥25°) [32]. The slope direction is divided into 4 categories: north, east, south, and west. The relative elevation is divided into nine categories from 30 m to 100 m in a gradient of 10 m, plus 0–30 m and 100–130 m. The details are shown in Table 3 and Table 4.

2.4. Rough K-Means Clustering

Lingras [33,34] proposed the RKM algorithm, a clustering inspired by intervals that exploit the basic properties of the original rough set theory, namely the concepts of lower and upper approximation. The method treats each cluster as a rough set. Objects that can belong to a cluster with certainty are divided into the lower approximation set of that class of clusters, while data samples with uncertain affiliation are divided into the upper approximation set of multiple clusters. Thus, an object can belong to more than one cluster, and the objects divided into upper approximation sets can be regarded as “buffers” between different clusters. The clustering process is as follows.
Step 1: For each cluster and object, find the distance d and the threshold T.
Step 2: Use the classification criteria (R1 and R2) to classify the data objects into lower and higher estimates.
Step 3: Compute the new cluster centers according to the following expressions.
z i = { v A _ ( c i ) v | A _ ( c i ) |   i f [ A _ ( c i )   a n d   A ¯ ( c i ) A _ ( c i ) = ] v A ¯ ( c i ) A _ ( c i ) v | A ¯ ( c i ) A _ ( c i ) |   i f [ A _ ( c i ) =   a n d   A ¯ ( c i ) A _ ( c i ) ] w l × v A _ ( c i ) v | A _ ( c i ) | + w u × v A ¯ ( c i ) A _ ( c i ) v | A ¯ ( c i ) A _ ( c i ) |   i f [ A _ ( c i )   a n d   A ¯ ( c i ) A _ ( c i ) ]
w l + w u = 0   and   w l > w u ,   w l and w u correspond to the relative importance of the lower and upper approximations, respectively.
Step 4: Repeat steps Step 2 to Step 4 until the algorithm converges.

2.5. Kappa Coefficient

The Kappa coefficient is a measure of classification accuracy and was proposed by Cohan [35]. Its calculation formula is as follows.
Kappa = P 0 P e 1 P e
P 0 = i = 1 n P i i N
P 0 is the overall accuracy of the classification, P e expresses the probability that the classification result due to chance is consistent with the type of ground survey data, n is the number of types classified, N is the total number of samples, and P i i is the number of samples of type i that is correctly classified.
When the Kappa coefficient is closer to 1, it means that the classification effect is better, and the classification result is more consistent with the actual situation.

2.6. Silhouette Coefficient

Silhouette Coefficient (SC) for cluster validity analysis clearly shows not only the high similarity of sample points within clusters but also the high object dissimilarity between clusters [36]. The silhouette coefficient of object i is calculated as follows:
s ( i ) = b ( i ) a ( i ) m a x { a ( i ) , b ( i ) }
a ( i ) is the intra-cluster dissimilarity, the average distance from object i to other objects in the same cluster, and b ( i ) is the inter-cluster dissimilarity, the average distance from object i to all objects in some other cluster C.
If s ( i ) is close to 1, it means that object i is more reasonably clustered; if s ( i ) is close to −1, it means that object i is more deserving of classification into a certain cluster C; if s ( i ) is close to 0, it means that object i is on the boundary of two clusters. The Mean of the Silhouette Coefficient (MSC) can be expressed as:
S ¯ = 1 n i = 1 n s i

2.7. Determination of the Optimal Number of Clusters

The RKM clustering algorithm still clusters the dataset with a determined number of classes k and randomly selected initial clustering centers. However, usually, the clustering number k cannot be determined in advance, and the randomly selected initial clustering centers tend to make the clustering results unstable [37]. Therefore, the optimal number of clusters k needs to be determined. The algorithm for determining the optimal number of clusters in this paper is summarized as follows.
Step 1: Select the search radius [ k m i n , k m a x ] of the clustering number. k m i n = 20 and k m a x = 100 are taken in this paper according to the experimental purpose.
Step 2: Take all the numbers in the search radius [ k m i n , k m a x ] for the clustering number k   respectively
Step 3: Perform RKM clustering.
Step 4: Count the clustering results and compare the mean of Silhouette Coefficients S ¯ of different clustering numbers k .
Step 5: Output the best clustering number k .

3. Results and Analysis

3.1. Classification and Evaluation of Landscape Factors

Based on texture, geometric features, and spectral features for land use types and object types, the land use types and object types obtained from the field survey are used as training samples for object-oriented classification to obtain land use type maps and object type maps. The topography was classified into 9 categories by the GM topography classification method, and topographic classification maps were generated by DEM. Reclassify the DEM to get the DEM classification map. Reclassify the slope map and aspect map generated by DEM to get the slope classification map and aspect map.
The classification accuracy of land use type and object type as landscape factors significantly affect the classification accuracy of the landscape, and the confusion matrix accuracy evaluation and statistics were conducted on Arcmap10.8 based on the field survey sample points and classification results. Among them, the total classification accuracy of land use types was 93.38% with a Kappa coefficient of 0.9216, the total classification accuracy of object types was 91.18% with a Kappa coefficient of 0.8999, and the production accuracy and user accuracy of land use types and object types were mostly above 90%.
From the classification results, it can be seen that the main land use type in the study area is arable land, with 16.60% and 15.69% of the whole study area in paddy fields and dry land, respectively, and the main crops are rice, wheat, edible rape, and corn, which are consistent with the actual situation. In addition, a small amount of finely watered land exists near the greenhouses and construction sites for growing vegetables. According to the field conditions, most of the crops in the study area are planted twice a year, with winter wheat and rice grown in paddy fields, rape and wheat grown in drylands, and a significant portion of the drylands are planted with corn starting in summer. In addition to arable land, a large number of orchards, tea gardens, nurseries, and other gardens are used to grow cash crops in the study area, accounting for 13.53% of the total study area. Natural forests in the hills of the northeastern part of the study area and small patches of plantation forests planted in the central and southeastern plains of the study area make up the entire study area, accounting for 17.15% of the total study area. The wasteland consisted mainly of river floodplains, reedbeds near the watershed, and a small amount of long-abandoned arable land, covering 9.44% of the total study area. Finally, the remaining building land and water area accounted for 6.61% and 19.87% of the total study area.
In this study, it is possible to identify and classify oilseed rape finely distributed on the roadside or fieldside and watered land distributed near houses, which is important for the classification of fragmented landscapes. For example, some of the paddy fields in the study area are flooded in winter, and the water left on the surface of the paddy fields may be misclassified as water, and there may be a strip of water like a ridge in the paddy fields, so there is a need to merge the land use types and object types appropriately.

3.2. Landscape Classification

Since the purpose of this study is a landscape classification system oriented to soil survey, landscape differences of building sites and waters were not considered in the landscape classification, and all landscape factors were superimposed after removing building sites and waters for a total of 11,897 landscape units.
The optimal number of clusters was calculated to be 95, and the mean of Silhouette Coefficient S   ¯   = 0.26247. The landscape factors were clustered by the RKM algorithm, the results were shown in Figure 2, and the dominant landscape unit in the category was named the landscape category.The naming format was “topography–slope–slope direction–elevation–land use type—object type”, and water and building land are divided into 97 categories of landscape, as shown in Table 5.
The landscape classification results were analyzed for t9-s1-a4-d4-l7-o10, a landscape category with an area of 1468.64 ha, accounting for 1.63% of the total study area. It consists of 61 landscape units clustered together, mainly distributed in the plain in the southwest of the study area. The relative elevation of this landscape category ranges from 30 m to 50 m, the slope is between 0° and 6°, and the slope direction is mainly north and east. The main land use type is paddy field, and the object type is mainly paddy (Figure 3). Based on the two soil profile sample sites from the field soil survey (Figure 4), the soils of this landscape type can be identified to the subgroup level under the guidance of the Chinese Soil Taxonomy (CST) criteria. The names in order from soil order to subgroup are Anthrosols, Stagnic Anthrosols, Fe-Stagnic Anthrosols, and Ordinary Fe-Stagnic Anthrosols.
According to the profile record sheet, anthrostagnic epipedon and hydragric horizon were present at the profile sampling sites. Anthrostagnic epipedon was mainly caused by long-term tillage, and hydragric horizon was mainly caused by changes in soil groundwater level [38]. Its hydragric horizon also had a large number of Fe-Mn nodules and no gleyic features, indicating that the cultivated land was well irrigated, the soil was well drained, and the soil development was good. These diagnostic horizons and diagnostic features are mainly influenced by anthropogenic tillage and water table, which are inseparable from land use and topography. t9-s1-a4-d4-l7-o10 has clear boundaries and concentrated distribution, and when comparing the influence of each landscape factor on clustering, we found that land use and topography in this landscape category influence land use, and topography on the clustering was found to be dominant in this landscape category and had a large influence on the clustering results.
However, the dominant landscape factors are not the same among different landscape categories, for example, topography has the greatest influence on clustering in class t6-s3-a1-d5-l7-o8, and topography and land use jointly influence the clustering results in t9-s2-a3-d5-l3-o3. Therefore, the same landscape factor may have different effects on the clustering of different landscape types. Additionally, in the actual soil classification, paddy fields, as one of the land types most severely affected by man, have huge differences in physical and chemical properties from other soils due to long-term tillage and irrigation. Land use and crop type are often decisive in the classification of soils related to paddy fields, so the use of land as the main landscape factor affecting landscape classification in the t9-s2-a2-d5-l1-o1 landscape category is very consistent with the actual soil classification.

3.3. Landscape Outcome Assessment

In this study, 11,897 landscape units were clustered by RKM clustering, plus water and building sites were divided into 97 classes with the MSC S   ¯   = 0.26247. In addition, K-mean clustering was set as the control group, and the mean of the Silhouette Coefficient was calculated for the number of clusters from 20 to 100, and the results are shown in Figure 5.
The MSC of RKM clustering is higher than the MSC of K-mean clustering 0.16436 when the number of clusters is equal to the optimal number of clusters 95. In addition, the MSC of RKM clustering is higher than the highest MSC of K-mean clustering 0.16806 in the whole range. Therefore, the clustering effect of rough K-mean clustering is significantly higher than that of K-mean clustering.
From the final classification results, although the overall classification results are somewhat fragmented, the landscape boundaries at the small area scale are in line with the actual situation and the fragmented small spots are less. By comparing a small number of landscape boundaries obtained from the actual survey, the landscape boundaries in the landscape classification map are generally consistent with the actual landscape boundaries.

4. Discussion

Landscape boundaries and soil boundaries tend to overlap in general, and setting several inspection profiles within a particular landscape to understand soil variation within a landscape category can determine that the same soil is often present in the same landscape [39]. Therefore, in the actual soil survey, after classifying the landscape by UAV images or satellite images, a reasonable soil survey route is set based on the landscape classification map. Each route should contain as many landscape types as possible, and a small number of soil and check profiles should be taken in each landscape type [40]. If the soil types within the same landscape type are the same, it can be assumed that the boundary of the landscape is the boundary of the soil. Therefore, landscape classification is of great importance for soil investigation, and it has great advantages to obtain data in some inaccessible areas by UAV. Compared with remote sensing images, which are easily and costly affected by cloud cover and atmosphere, UAV images are not only more accurate and less affected by cloud cover and atmosphere, but also have greater advantages in terms of convenience and low cost of acquiring data [41].
During the actual soil survey, it can be found that soil changes are gradual and therefore the boundaries of landscape categories may be blurred. In some classifications, some objects are in between two different categories, and thus these objects cannot be classified effectively. Therefore, many classifications reflect the “fuzzy relationship” between types through the affiliation of fuzzy clusters [42,43]. However, this affiliation relationship cannot be shown on the graph in practical applications. In this study, rough sets are introduced into the landscape classification through rough clustering, where a landscape unit may belong to more than one landscape category. These landscape units can then be regarded as buffers between different landscape categories, i.e., asymptotic processes of soil properties in natural situations.

5. Conclusions

In this study, landscape classification in the Jurong district is achieved by RKM clustering with the advantage of convenience and low cost of high-precision UAV data. The results are in line with the actual situation and the experiment obtained the expected results. Compared with the K-mean clustering results, the clustering effect is better, which shows the superiority of UAV data and the RKM clustering algorithm. It provides a basis for soil survey and soil classification and further provides a reference meaning for landscape classification that can be widely used.

Author Contributions

Z.F. designed and implemented the whole experiment, collected and analyzed the data, and verified and evaluated the experimental results, W.L., F.Z. and C.Z. assisted in the experiment and data collection; J.P. was the initiator and supervisor of the project; J.P. and Z.L. were responsible for the supervision and evaluation of the entire experiment. All authors have read and agreed to the published version of the manuscript.

Funding

This project has been funded by the National Natural Science Foundation of China (41971057).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Conflicts of Interest

Informed consent was obtained from all subjects involved in the study.

References

  1. Chen, H.Z. On the principles and methods of soil mapping-an example of Xuanlang Guang sample area in Anhui Province. Chin. J. Soil Sci. 2002, 02, 86–89. [Google Scholar]
  2. Nikiforova, A.A.; Bastian, O.; Fleis, M.E.; Nyrtsov, M.V.; Khropov, A.G. Theoretical development of a natural soil-landscape classification system: An interdisciplinary approach. Catena 2019, 177, 238–245. [Google Scholar] [CrossRef]
  3. Wang, H.Q. Preliminary Study on the Soil Series Investigation Methods in Hilly Area by Using Soil-Landscape Models. Master’s Thesis, Nanjing Agricultural University, Nanjing, China, 2015. [Google Scholar]
  4. Zhang, P.Y. Study on Soil Series Investigation Method in Plain Agricultural Areas Based on Soil-Landscape Relationship. Master’s Thesis, Nanjing Agricultural University, Nanjing, China, 2019. [Google Scholar]
  5. Bai, H.R. Predictive Mapping of Stagnic Anthrosols Soil Series and Evaluation of Regional Soil Fertility in Plain Agricultural. Master’s Thesis, Nanjing Agricultural University, Nanjing, China, 2020. [Google Scholar]
  6. Nikiforova, A.A.; Fleis, M.E.; Kazantsev, N.N. Multi-scale soil-landscape maps as the basis of geographic information systems for soil melioration. IOP Conf. Ser. Earth Environ. Sci. 2019, 368, 012038. [Google Scholar] [CrossRef] [Green Version]
  7. Nikiforova, A.A.; Fleis, M.E.; Nyrtsov, M.V.; Kazantsev, N.N.; Kim, K.V.; Belyonova, N.K.; Kim, J.K. Problems of modern soil mapping and ways to solve them. Catena 2020, 195, 104885. [Google Scholar] [CrossRef]
  8. Pan, J.J.; Cao, L.D.; Li, Z.F. A Guideline for Soil Series Survey in Hilly Area; Phoenix Education Publishing, Ltd.: Nanjing, China, 2017. [Google Scholar]
  9. Zhu, Y.L.; Wang, D.Y.; Zhang, H.; Shi, P. Soil organic carbon content retrieved by UA V-borne high resolution spectrometer. Trans. Chin. Soc. Agric. Eng. 2021, 37, 66–72. [Google Scholar]
  10. Yang, N.; Cui, W.X.; Zhang, Z.T.; Zhang, J.R.; Chen, J.Y.; Du, R.Q.; Lao, C.C.; Zhou, Y.C. Soil salinity inversion at different depths using improved spectral index with UAV multispectral remote sensing. Trans. Chin. Soc. Agric. Eng. 2020, 36, 13–21. [Google Scholar]
  11. Ma, J.; Liu, W.H.; Jin, C.L.; Gong, K.; Liu, Z.B.; Li, Y.; Li, J.X.; Wang, S.J.; Lei, Y.X. Identification of Grassland Plants by UAV Multispectral Remote Sensing Based on CNN and SVM. Acta Agrestia Sin. 2022, 30, 3165–3174. [Google Scholar]
  12. Yan, G.; Li, L.; Coy, A.; Mu, X.; Zhou, H. Improving the estimation of fractional vegetation cover from UAV RGB imagery by colour unmixing. ISPRS J. Photogramm. Remote Sens. 2019, 158, 23–34. [Google Scholar] [CrossRef]
  13. Zhang, Z.H.; Liu, W.; Li, X.H.; Zhu, J.X.; Zhang, H.T.; Yang, D.; Xu, C.H.; Xu, X.L. The spatial distribution pattern of rock in rocky desertification area based on unmanned aerial vehicle imagery and object-oriented classification method. J. Geo-Inf. Sci. 2020, 22, 2436–2444. [Google Scholar]
  14. Gao, S.; Yuan, X.P.; Gan, S.; Yang, M.L.; Yuan, X.Y.; Luo, W.D. Fusion of UAV image and LiDAR point cloud to study the detection technology of mountain surface cover landscape characteristics. Bull. Surv. Mapp. 2022, 1, 110–115. [Google Scholar]
  15. Zhang, S.Y.; Li, Z.F.; Xu, F.; Pan, J.J.; Jiang, X.X.; Zhang, W.M. Optimization of estuary wetland landscape classification based on multi-temporal UAV images. Chin. J. Ecol. 2020, 39, 3174–3184. [Google Scholar]
  16. Hu, B. Deep Learning Image Feature Recognition Algorithm for Judgment on the Rationality of Landscape Planning and Design. Complexity 2021, 2021, 1–15. [Google Scholar] [CrossRef]
  17. Lv, J.J.; Ma, T.; Dong, Z.W.; Yao, Y.; Yuan, Z.H. Temporal and Spatial Analyses of the Landscape Pattern of Wuhan City Based on Remote Sensing Images. Int. J. Geo-Inf. 2018, 7, 340. [Google Scholar] [CrossRef] [Green Version]
  18. Jonatan, A.G. Propuesta metodológica para la identificación, caracterización y cualificación de los paisajes: La cuenca endorreica de Padul (Andalucía) como caso de estudio. Boletín Asoc. Geógrafos Españoles 2019, 80, 5. [Google Scholar]
  19. Harrison, K.R.; Ombuki-Berman, B.M.; Engelbrecht, A.P. Visualizing and Characterizing the Parameter Configuration Landscape of Differential Evolution using Physical Landform Classification. In Proceedings of the IEEE Symposium Series on Computational Intelligence, Canberra, ACT, Australia, 1–4 December 2020; pp. 2437–2444. [Google Scholar]
  20. Gao, B.; Huang, Q.; He, C.; Sun, Z.; Zhang, D. How does sprawl differ across cities in China? A multi-scale investigation using nighttime light and census data. Landsc. Urban Plan. 2016, 148, 89–98. [Google Scholar] [CrossRef]
  21. Lu, H.D. Landscape Classification at A Small Area for Soil Series Survey and Mapping. Master’s Thesis, Nanjing Agricultural University, Nanjing, China, 2014. [Google Scholar]
  22. Simensen, T.; Halvorsen, R.; Erikstad, L. Methods for landscape characterization and mapping: A systematic review. Land Use Policy 2018, 75, 557–569. [Google Scholar] [CrossRef]
  23. Chmielewski, T.J.; Butler, A.; Kułak, A.; Chmielewski, S. Landscape’s physiognomic structure: Conceptual development and practical applications. Landsc. Res. 2017, 43, 410–427. [Google Scholar] [CrossRef]
  24. Michalik-Śnieżek, M.; Chmielewski, S.; Chmielewski, T.J. An introduction to the classification of the physiognomic landscape types: Methodology and results of testing in the area of Kazimierz Landscape Park (Poland). Phys. Geogr. 2018, 40, 384–404. [Google Scholar] [CrossRef]
  25. Ma, H.R. Object-based Remote Sensing Image Classification of Forest Based on Multi-level Segmentation; Beijing Forestry University: Beijing, China, 2014. [Google Scholar]
  26. Zong, Y.; Li, Y.F.; Liu, H.Y. A Study of Coastal Wetland Vegetation Classification Based on Object-oriented Random Forest Method. J. Nanjing Norm. Univ. 2021, 21, 9. [Google Scholar]
  27. Li, Z.; Han, W.C.; Hu, H.Y.; Gao, X.; Wang, L.L.; Xiao, F.; Liu, W.C.; Guo, W.H.; Sun, D.F. Land use/cover classification based on combining spectral mixture analysis model and object-oriented method. Trans. Chin. Soc. Agric. Eng. 2021, 37, 225–233. [Google Scholar]
  28. Hu, J.M.; Dong, Z.Y.; Yang, X.Z. An object-oriented information extraction method for high-resolution remote sensing images. Geospat. Inf. 2021, 19, 10–13. [Google Scholar]
  29. Chen, M.L.; Dai, S.K. Analysis on Image Texture based on Gray-level Co-occurrence Matrix. Commun. Technol. 2012, 45, 108–111. [Google Scholar] [CrossRef]
  30. Jasiewicz, J.; Stepinski, T.F. Geomorphons-a pattern recognition approach to classification and mapping of landforms. Geomorphology 2013, 182, 147–156. [Google Scholar] [CrossRef]
  31. Ngunjiri, M.W.; Libohova, Z.; Owens, P.R.; Schulze, D.G. Landform pattern recognition and classification for predicting soil types of the Uasin Gishu Plateau, Kenya. Catena 2020, 188, 104390. [Google Scholar] [CrossRef]
  32. Wei, Y.C.; Zhao, M.F.; Zhu, C.D.; Zhang, X.X.; Pan, J.J. Predicting soil property in hilly regions by using landscape and multiscale micro-landform features. Chin. J. Appl. Ecol. 2022, 33, 467–476. [Google Scholar]
  33. Lingras, P.; Peters, G. Rough clustering. WIREs Data Min. Knowl. Discov. 2011, 1, 64–72. [Google Scholar] [CrossRef]
  34. Lingras, P.; Peters, G. Applying Rough Set Concepts to Clustering; Springer: London, UK, 2012. [Google Scholar]
  35. Cohen, J.A. Coefficient agreement for nominal scales. Educ. Psychol. Meas. 1960, 2, 37–46. [Google Scholar] [CrossRef]
  36. Zhang, Z.; Guo, X.J.; Zhang, K.P. Clustering Center Selection on K- means Clustering Algorithm. J. Jilin Univ. 2019, 37, 5. [Google Scholar]
  37. Zhou, S.B.; Xu, Z.Y.; Tang, X.Q. New method for determining optimal number of clusters in K-means clustering algorithm. Comput. Eng. Appl. 2010, 46, 27–31. [Google Scholar]
  38. Gong, Z.T. Chinese Soil Taxonomy; Science Press: Beijing, China, 1999. [Google Scholar]
  39. Wang, F. Regional Landscape Classification for Soil Series Survey in the Plain Farmland Area; Nanjing Agricultural University: Nanjing, China, 2015. [Google Scholar]
  40. Wang, H.Q.; Pan, J.J.; Yu, W.F.; Wang, W.Y.; Li, Z.F. The Interpretation of Landscape for Soil Series Survey in a Small Area. Chin. J. Soil Sci. 2015, 46, 257–264. [Google Scholar]
  41. Sun, G.; Huang, W.J.; Chen, P.F.; Gao, S.; Wen, X. Advances in UAV-based Multispectral Remote Sensing Applications. Trans. Chin. Soc. Agric. Mach. 2018, 49, 1–17. [Google Scholar]
  42. Yu, X.; Zhou, W.; He, H. A method of remote sensing image auto classification based on interval type-2 fuzzy c-means. In Proceedings of the IEEE International Conference on Fuzzy Systems, Beijing, China, 6–11 July 2014; pp. 223–228. [Google Scholar]
  43. Li, K.; Xu, J.; Zhao, T.; Liu, Z. A fuzzy spectral clustering algorithm for hyperspectral image classification. IET Image Process. 2021, 15, 2810–2817. [Google Scholar] [CrossRef]
Figure 1. Map of the study area and sampling points distribution.
Figure 1. Map of the study area and sampling points distribution.
Sensors 22 09895 g001
Figure 2. Map of landscape classification.
Figure 2. Map of landscape classification.
Sensors 22 09895 g002
Figure 3. Landscape factor composition of t9-s1-a4-d4-l7-o10 (a) topography (b) slope (c) aspect (d) dem (e) land use (f) object.
Figure 3. Landscape factor composition of t9-s1-a4-d4-l7-o10 (a) topography (b) slope (c) aspect (d) dem (e) land use (f) object.
Sensors 22 09895 g003
Figure 4. Section sample point diagram.
Figure 4. Section sample point diagram.
Sensors 22 09895 g004
Figure 5. Chart of the Mean of Silhouette Coefficient of RKM and K-mean.
Figure 5. Chart of the Mean of Silhouette Coefficient of RKM and K-mean.
Sensors 22 09895 g005
Table 1. Spectral parameters of DJI Phantom 4 Multispectral camera.
Table 1. Spectral parameters of DJI Phantom 4 Multispectral camera.
Serial Number of
Spectrum
Name of SpectrumCentral
Wavelength/nm
Full Width at Half Peak/nm
B 1 Blue45032
B 2 Green56032
B 3 Red65032
B 4 Red Edge73032
B 5 Near InfraRed84052
Table 2. Calculation formula of feature parameter.
Table 2. Calculation formula of feature parameter.
Feature CategoryFeature ParameterFormula
Spectrum featureMean B k ¯ = 1 n i = 1 n B k i
Standard Deviation σ L = 1 n i = 1 n ( B k i B k ¯ ) 2
Shape featureNDVI NDVI = B 5 B 3 B 5 + B 3
NDWI NDWI = B 2 B 5 B 2 + B 5
REDNDVI REDNDVI = B 5 B 4 B 5 + B 4
Geometric featureLength/Width γ = l w = e i g 1 ( S ) e i g 2 ( S )
Shape Index I = L 4 A
Texture featuresGLCM P ( i , j , d , θ ) = # { ( k , l ) , ( m , n ) ( M × N ) | f ( k , l ) = i , f ( m , n ) = j }
Notes:   k is the band, n is the number of pixels contained in the image object, B k i is the pixel value,   B k ¯ is the mean value of the band L i , l is the length of the outer wrapping rectangle of the object; w is the width of the outer wrapping rectangle of the object, L is the length of the boundary of the image object, A is the area, P ( i , j , d , θ ) is the pixel from the point ( k , l ) with grayness i in the image, and the statistics with its distance d = ( m k , n 1 ) of the point ( m , n ) with grayscale j at the same time, d is the relative distance expressed in terms of the number of pixels, and θ is the relative direction.
Table 3. Categories of terrain, slope, and aspect.
Table 3. Categories of terrain, slope, and aspect.
TopographySlopeAspect
CodeCategoryCodeCategorySlopeCodeCategory
1Flat1Flat≤2°1North
2Ridge2Gentle Slope2°–6°2East
3Spur3Gentle–Middle Slope6°–15°3South
4Hollow4Middle Slope15°∼25°4West
5Valley5Steep Slope≥25°
6Peak
7Shoulder
8Slope
9Footslope
10Pit
Table 4. Categories of elevation, land use, and object.
Table 4. Categories of elevation, land use, and object.
ElevationLand useObject
CodeElevationCodeCategoryCodeCategory
10–30 m1Paddy field1Paddy
230–40 m2Watered land2Vegetables
340–50 m3Dry land3Wheat
450–60 m4Orchard4Rape
560–70 m5Tea plantation5Maize
670–80 m6Nursery6Sesame
780–90 m7Woodland7Orchard
890–100 m8Wasteland8Tea
9100–130 m9Greenhouse9Nursery
10Building10Evergreen forest
11Water11Mixed forests
12Deciduous forest
13Wasteland
14Building
15Water
Table 5. Category of landscape factor.
Table 5. Category of landscape factor.
LandscapeAreaArea PercentageCodeLandscapeAreaArea PercentageCode
t1-s1-a1-d4-l3-o31426.9991.58%1t6-s2-a4-d9-l3-o325.58820.03%50
t2-s1-a1-d5-l3-o3324.390.36%2t6-s2-a4-d9-l7-o10812.63140.90%51
t2-s2-a3-d5-l1-o371.45060.08%3t6-s3-a1-d5-l7-o82000.8572.22%52
t3-s1-a1-d3-l1-o1979.57471.09%4t6-s3-a1-d5-l9-o2168.03050.19%53
t3-s1-a1-d5-l3-o5863.98570.96%5t6-s3-a1-d9-l1-o162.99140.07%54
t3-s1-a2-d7-l7-o10129.57760.14%6t6-s3-a4-d3-l3-o3820.54040.91%55
t3-s1-a3-d5-l3-o31008.9261.12%7t6-s4-a1-d2-l1-o1389.42150.43%56
t3-s1-a3-d7-l3-o515.98060.02%8t6-s4-a2-d2-l4-o7607.15630.67%57
t3-s1-a4-d5-l9-o249.67190.06%9t6-s4-a2-d5-l3-o3648.8720.72%58
t3-s2-a1-d6-l1-o1192.41460.21%10t6-s4-a2-d7-l7-o12563.29720.62%59
t3-s2-a4-d5-l1-o1361.84450.40%11t6-s4-a4-d4-l7-o131315.3311.46%60
t3-s3-a1-d9-l7-o12140.28210.16%12t6-s4-a4-d5-l1-o1344.09970.38%61
t3-s3-a4-d4-l7-o10972.65221.08%13t6-s4-a4-d8-l3-o5162.87130.18%62
t3-s3-a4-d9-l3-o539.31050.04%14t7-s1-a4-d3-l3-o3207.37020.23%63
t4-s1-a1-d2-l3-o3469.91330.52%15t7-s2-a1-d2-l7-o9350.62330.39%64
t4-s1-a4-d2-l7-o8471.49720.52%16t7-s2-a3-d2-l1-o343.02020.05%65
t4-s1-a4-d6-l7-o10376.42270.42%17t7-s2-a3-d9-l1-o124.09970.03%66
t4-s2-a1-d5-l1-o393.58830.10%18t7-s2-a4-d9-l7-o10330.66160.37%67
t5-s1-a3-d6-l3-o5119.1270.13%19t7-s3-a1-d5-l7-o10544.46190.60%68
t5-s1-a4-d3-l1-o363.5260.07%20t7-s3-a1-d7-l3-o549.70570.06%69
t5-s1-a4-d5-l7-o12277.94030.31%21t7-s3-a4-d9-l9-o250.73330.06%70
t5-s2-a1-d3-l3-o5381.39060.42%22t7-s4-a1-d2-l1-o1103.570.11%71
t5-s2-a1-d5-l1-o3127.97580.14%23t7-s4-a1-d4-l3-o3182.21740.20%72
t5-s2-a1-d5-l9-o277.36890.09%24t7-s4-a2-d3-l7-o13121.23640.13%73
t5-s2-a1-d7-l7-o12198.56370.22%25t7-s4-a3-d6-l3-o3168.79810.19%74
t5-s2-a4-d2-l3-o5772.30750.86%26t7-s4-a3-d6-l7-o10414.94930.46%75
t5-s2-a4-d2-l8-o13264.27670.29%27t7-s4-a4-d4-l1-o2194.71450.22%76
t5-s2-a4-d5-l1-o1107.14130.12%28t7-s4-a4-d4-l3-o761.25940.07%77
t5-s2-a4-d9-l1-o131.44180.03%29t8-s1-a1-d4-l7-o81251.5721.39%78
t5-s2-a4-d9-l7-o11115.14220.13%30t8-s2-a3-d2-l8-o13517.47680.57%79
t5-s3-a1-d5-l7-o10577.56420.64%31t8-s2-a4-d2-l3-o5567.85780.63%80
t5-s3-a1-d9-l3-o340.2250.04%32t9-s1-a1-d2-l1-o31216.0791.35%81
t5-s3-a4-d8-l3-o582.16360.09%33t9-s1-a1-d6-l7-o13299.91960.33%82
t5-s3-a4-d8-l9-o221.26270.02%34t9-s1-a4-d3-l1-o3380.92840.42%83
t5-s4-a1-d2-l7-o10289.99340.32%35t9-s1-a4-d4-l7-o101468.6441.63%84
t5-s4-a2-d3-l1-o179.88990.09%36t9-s1-a4-d9-l3-o527.40920.03%85
t5-s4-a3-d8-l7-o12247.39440.27%37t9-s2-a1-d2-l7-o12429.3990.48%86
t5-s4-a4-d4-l3-o3187.52250.21%38t9-s2-a2-d5-l1-o12456.6422.72%87
t5-s4-a4-d4-l7-o9783.6180.87%39t9-s2-a3-d5-l3-o31827.012.03%88
t6-s1-a1-d3-l3-o3377.74360.42%40t9-s2-a3-d8-l7-o11139.78570.15%89
t6-s1-a1-d9-l7-o10142.05890.16%41t9-s2-a4-d2-l1-o1832.8910.92%90
t6-s1-a4-d3-l1-o1862.93920.96%42t9-s3-a1-d7-l3-o521.21960.02%91
t6-s1-a4-d3-l7-o9765.050.85%43t9-s3-a2-d3-l3-o554.49720.06%92
t6-s1-a4-d6-l7-o131480.81.64%44t9-s3-a4-d5-l3-o564.57730.07%93
t6-s2-a1-d3-l7-o131339.791.49%45t9-s4-a2-d4-l7-o13107.56020.12%94
t6-s2-a1-d5-l1-o1508.48470.56%46t10-s1-a1-d4-l7-o9510.73560.57%95
t6-s2-a1-d5-l7-o8680.43790.75%47Water17925.7119.87%96
t6-s2-a1-d9-l3-o551.5570.06%48Building5963.9776.61%97
t6-s2-a4-d5-l3-o31297.761.44%49Buffer Zone23035.3525.54%98
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Fang, Z.; Lu, W.; Zhu, F.; Zhu, C.; Li, Z.; Pan, J. Landscape Classification System Based on RKM Clustering for Soil Survey UAV Images–Case Study of the Small Hilly Areas in Jurong City. Sensors 2022, 22, 9895. https://doi.org/10.3390/s22249895

AMA Style

Fang Z, Lu W, Zhu F, Zhu C, Li Z, Pan J. Landscape Classification System Based on RKM Clustering for Soil Survey UAV Images–Case Study of the Small Hilly Areas in Jurong City. Sensors. 2022; 22(24):9895. https://doi.org/10.3390/s22249895

Chicago/Turabian Style

Fang, Zihan, Wenhao Lu, Fubin Zhu, Changda Zhu, Zhaofu Li, and Jianjun Pan. 2022. "Landscape Classification System Based on RKM Clustering for Soil Survey UAV Images–Case Study of the Small Hilly Areas in Jurong City" Sensors 22, no. 24: 9895. https://doi.org/10.3390/s22249895

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop