Next Article in Journal
Landslide Susceptibility Evaluation Based on Potential Disaster Identification and Ensemble Learning
Next Article in Special Issue
Analysis of Landscape Characteristics and Influencing Factors of Residential Areas on the Qinghai–Tibet Plateau: A Case Study of Tibet, China
Previous Article in Journal
Learning Design Strategies in MOOCs for Physicians’ Training: A Scoping Review
Previous Article in Special Issue
Evaluating the Resources and Environmental Carrying Capacity in Laos Using a Three-Dimensional Tetrahedron Model
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Evaluation of Landslide Susceptibility Based on CF-SVM in Nujiang Prefecture

1
College of Earth Sciences, Yunnan University, Kunming 650500, China
2
Engineering Research Center of Domestic High-Resolution Satellite Remote Sensing Geology for Universities of Yunnan Province, Kunming 650500, China
*
Author to whom correspondence should be addressed.
Int. J. Environ. Res. Public Health 2022, 19(21), 14248; https://doi.org/10.3390/ijerph192114248
Submission received: 6 September 2022 / Revised: 27 October 2022 / Accepted: 28 October 2022 / Published: 31 October 2022
(This article belongs to the Special Issue Impacts of Human Activities and Climate Change on Landscape)

Abstract

:
At present, landslide susceptibility assessment (LSA) based on landslide characteristics in different areas is an effective measure for landslide management. Nujiang Prefecture in China has steep mountain slopes, a large amount of water and loose soil, and frequent landslide disasters, which have caused a large number of casualties and economic losses. This paper aims to understand the characteristics and formation mechanism of regional landslides through the evaluation of landslide susceptibility so as to provide relevant references and suggestions for spatial planning and disaster prevention and mitigation in Nujiang Prefecture. Based on the grid cell, this study selected 10 parameters, namely elevation, slope, aspect, lithology, proximity to faults, proximity to road, proximity to rivers, normalized difference vegetation index (NDVI), land-use type, and precipitation. Support vector machine (SVM), certainty factor method (CF), and deterministic coefficient method–support vector machine (CF-SVM) were used to evaluate the landslide susceptibility in Nujiang Prefecture. According to these three models, the study area was divided into five landslide susceptibility grades, including extremely high susceptibility, high susceptibility, moderate susceptibility, low susceptibility, and very low susceptibility. Receiver operating characteristic curve (ROC) was applied to verify the accuracy of the model. The results showed that CF model (ROC = 0.865), SVM model (ROC = 0.892), CF-SVM model (ROC = 0.925), and CF-SVM model showed better performance. Therefore, CF-SVM model results were selected for analysis. The study found that the characteristics of high and extremely high landslide-prone areas in Nujiang Prefecture have the following characteristics: intense human activities, large density of buildings and arable land, rich water resources, good economic development, perfect transportation facilities, and complex topography and landform. In addition, there is a finding inconsistent with our common sense that the distribution of landslide disasters in the study area does not decrease with the increase of NDVI value. This is because the Nujiang River basin is a high mountain canyon area with low rock strength, barren soil, and underdeveloped vegetation and root system. In an area with large slope, the probability of landslide disaster will increase with the increase of NDVI. The CF-SVM coupling model adopted in this study is a good first attempt in the study of landslide hazard susceptibility in Nujiang Prefecture.

1. Introduction

Landslide refers to the natural phenomenon in which the soil or rock mass on the slope slides down the slope as a whole or in a scattered manner along a certain weak surface or zone, caused by various natural processes and human activities under the action of gravity [1,2,3,4,5,6]. Whether caused by natural factors or human activities, landslides cause a great deal of economic losses and loss of life every year [7,8,9]. Extreme natural events such as landslides cannot be foreseen, but their risk can be reduced by taking precautions and early warning measures. Landslide susceptibility assessment aims to predict and analyze the spatial and temporal distribution and occurrence probability of landslide hazard [10]. In recent decades, the most appropriate way to prevent landslide events has been to conduct spatial assessment of landslide susceptibility [11,12]. Currently, a series of studies around the world has assessed landslide susceptibility in the context of multiple influencing factors. The results can provide an important decision-making basis for landslide hazard risk management, territorial space planning and layout, and landslide monitoring [13].
Major advances in computing power, remote sensing, and geographic information systems (GIS) have facilitated the development of landslide susceptibility maps. Various landslide susceptibility mapping models and methods have been proposed, including: (1) knowledge-based approaches such as analytic hierarchy process [14] and expert scoring method [15]; (2) data-driven approaches such as information content method [16], frequency ratio [17], certainty factor [18], index of entropy [19], and logistic regression analysis [20]; and (3) machine-learning methods (ML), such as decision tree [21,22], artificial neural network (ANN) [23], support vector machine (SVM) [24], and random forest [25]. The combination of GIS with data-driven methods or machine learning methods (ML) has been widely used for landslide susceptibility assessment using spatial and non-spatial data [17,23,24,25]. For example, Cheng, Y. et al. [26] used analytic hierarchy process to study highway tunnel landslides. Chen, F. et al. [16] used the information model for spatial susceptibility prediction and analysis of landslides. Ali, S. et al. [27] used GIS technology to draw the landslide susceptibility map along the China–Pakistan Economic Corridor (Karakoram Highway). Klose et al. [28] assessed landslide susceptibility in northwestern Central Europe using a landslide geographic information database and bivariate statistical analysis, while Niu, R.Q. et al. [29] performed a susceptibility evaluation using ML, and Wu et al. [30] used decision tree and support vector machine to evaluate the landslide susceptibility of Guojiaba Town in the Three Gorges Reservoir Area, China. Most of the above studies applied multiple single models to landslide susceptibility and did not consider coupling studies of different types of models.
A total of 6181 geological disasters occurred in China, including 4220 landslides, accounting for 68.27 percent of the total according to the 2019 National Geological Disaster Bulletin. Thus, we can see the seriousness of landslide geological disasters. Nujiang Prefecture in China is a key area for the prevention and control of landslide geological disasters. Because of its complex terrain, unique climate, and fragile ecological environment, landslides occur frequently, seriously affecting the local economic and social development [31,32]. At present, it is urgent to evaluate the susceptibility of landslide disasters in Nujiang Prefecture. Landslide susceptibility assessment (LSA) is a typical non-engineering measure and an effective method to mitigate, prevent, and manage landslide damage [8,33,34,35,36,37,38]. In this research area, some experts have carried out landslide disaster research. For example, He, J. et al. [39] analyzed landslide inducing factors in Lushui County, Nujiang Prefecture, from the aspects of stratigraphic lithology, geological structure, earthquake, and rainfall. Tan, S.C. et al. [40] designed and established the spatial database for meteorological forecasting and the early warning of slope geological disasters, which solved the limitation of the traditional database, which could not effectively manage spatial data. Hu, J. et al. [41] optimized the critical rainfall, critical rain intensity and prone zoning of precipitation-type geological disasters in Yunnan Province. Li, Y.M. et al. [42] used the CF to analyze the susceptibility of slope geological disasters, analyzed the influence degree of each factor on the occurrence of slope geological disasters through the susceptibility index, and drew the zoning map of slope geological disaster susceptibility. Yao, Y.W. et al. [43] used the distribution of geological disaster points to analyze the characteristics of geological disasters in Lanping County, Nujiang Prefecture, and put forward countermeasures for disaster prevention and control. Although some people have studied the landslide disaster in Nujiang Prefecture, there are few studies on its susceptibility carried out by using models. In the global scale, few people use mathematical statistics model and machine learning model coupling to study the landslide susceptibility.
At present, scholars use a variety of models to study landslide susceptibility. Among these models, CF and SVM models have been widely used and praised. Theoretically, CF is a mathematical statistical model, which has advantages in calculating CF values of various levels of influence factors. SVM is a quantitative and objective model obtained by various numerical calculations, which has great advantages in the study of small sample data.
The research shows that the coupling model can synthesize the advantages of each model and make up for the shortcomings of each model. In terms of evaluation accuracy and success rate, the model has obvious advantages over the single model. The hybrid application of the model can significantly improve the accuracy and reliability of the results. Therefore, this study selected two single models (CF model and SVM model) and one mixed model (CF-SVM) to assess landslide susceptibility through the analysis of landslide data in Nujiang Prefecture. The advantages and disadvantages of the deterministic coefficient model and support vector machine are complementary, which effectively improves the prediction accuracy of the model. This study provides a reference for the selection of regional landslide susceptibility evaluation model under similar geological conditions. It will help protect people’s lives and property, reduce disaster losses, and improve the efficiency of disaster prevention and mitigation. It can also provide scientific basis for government departments to formulate disaster prevention and mitigation measures, which has important application value.

2. Materials and Methods

2.1. Overview of the Study Area and Data Sources

2.1.1. Overview of the Study Area

In this study, Nujiang Prefecture was selected as the study area. Nujiang Prefecture is located in the longitudinal ridge and valley area of the Hengduan Mountains in the northwest of Yunnan Province. It is located at 25°33′–28°23′ E and 98°09′–99°39′ N. It includes the Lushui, Fugong, Gongshan, and Lanping Counties. The total area of the four counties is 14,703 km2 (Figure 1). Nujiang Prefecture is located in a unique plateau and mountainous environment, which is a typical deep-cutting zone of alpine valleys. More than 40 mountain peaks exceed 4000 m. It includes four mountain ranges and three rivers: Lika Mountain, Dulong River, Gaoligong Mountain, Nujiang River, Biluo Snow Mountain, Lancang River, and Yunling Mountain. The three rivers slope across the entire territory from north to south, forming a typical alpine and canyon landform. Due to plate collision and subduction, a series of deep fault zones were formed in Nujiang Prefecture. Affected by erosion and gravity, the rock mass is broken, loose materials are piled up, and the steep slope reclamation phenomenon in Nujiang Prefecture is obvious, leading to serious soil erosion and frequent geological disasters. Nujiang Prefecture has a subtropical mountain monsoon climate, and its valley areas present subtropical humid climate characteristics. The mountain ranges are alternately affected by the Qinghai–Tibet Plateau and the Bay of Bengal air currents, coupled with the large topographical and vertical climate differences.

2.1.2. Data Sources

Table 1 lists the landslide conditioning factors used in this study, together with their sources and scales. Among them, the data of historical disaster points were provided by the project team employed in this study, with a total of 561 landslide disaster points.

2.2. Models

The deterministic coefficient model can determine landslide susceptibility based on the relationship between past landslide points and hazard-inducing factors, that is, quantitatively reflecting the susceptibility interval of a certain hazard-inducing factor. However, it is impossible to reflect the contribution of this factor in landslide occurrence as a whole. In contrast, the SVM model is not prone to overfitting in case of limited samples, yields good performance in the classification process, and can characterize the degree of contribution of the evaluation factors. Therefore, coupling the two models allows utilizing the advantages of both; that is, the deterministic coefficient model is used for calculating the susceptibility of different classes between factors for classification of landslide data and non-landslide data, and the SVM model is used for training and prediction to improve the evaluation accuracy.

2.2.1. Deterministic Coefficient Model

The certainty factor method (CF) is a probability function belonging to the category of bivariate statistical analysis and can be used to analyze the susceptibility of disaster events according to various factors.
CF   = PPa PPs PPa   1 PPs       ,   PPa > PPs PPa PPs PPs   1 PPa       ,     PPa < PPs
PPa is the probability of geological disasters occurring in the evaluation factor A, and when applied, it is the ratio of the number of geological disasters existing in the evaluation factor A to the area of the factor A. PPs is the prior probability of geological disasters occurring in the whole study area, that is, the ratio of the number of geological disasters in the whole study area to the total study area. The variation range of CF is [−1, 1], and the positive value represents a high certainty of geological disaster occurrence. The negative value represents a low certainty of geological disaster occurrence. When the calculated result is close to 0, it means that the factor cannot determine whether the given area is prone to geological disasters.

2.2.2. SVM Model

Support vector machine (SVM) is a binary classification model, which is a classification prediction model developed on the basis of statistical principles [9]. This method is widely used in various fields. SVM is more reasonable and effective than other learning methods in solving small sample, high dimensional, and non-linear problems. The basic principle is to find an optimal hyperplane, which can not only correctly divide the two types of sample points but also maximize the geometric interval from the nearest sample point to the plane. SVM is suitable for small samples and nonlinear and high-dimensional space problems, which can highlight its unique advantages and can be combined with other machine learning to jointly analyze problems. In this study, the training set is set as T = x 1 , y 1 x 2 , y 2 , , x n , y n , where x is the input vector, and x 1 ~ x n represents elevation, aspect, slope, lithology, proximity to faults, proximity to road, proximity to river, NDVI, precipitation, and land-use type, respectively. In the formula, y 0 ,   1 , 1 and 0 denote landslide and non-landslide, respectively. The goal of SVM classification is to find an optimal separating hyperplane that can be distinguished between landslides and non-landslides in the above training set. The equation of the separating hyperplane is: w x + b = 0 , and w is called the vector, and b is called the intercept. The prediction accuracy of SVM depends on the choice of kernel function. There are four types of kernel functions commonly used: linear kernel function, polynomial kernel function, radial basis kernel function, and Sigmoid kernel function. RBF is widely used in landslide susceptibility prediction. Its advantages are its fewer parameters, strong flexibility, and good performance. Therefore, this study adopts RBF kernel function to build support vector machine model, as shown in Formula (2).
K   x i x = exp γ x i x 2
where x is the input vector, and γ is the gamma parameter.

2.2.3. CF-SVM Model

When selecting training samples, most studies often take a certain number of landslide data as samples for training and the rest as test samples [34]. These research methods only consider the contribution of each evaluation factor to landslide formation, that is, only analyze the corresponding relationship between evaluation factor and sample without analyzing the stable slope (negative sample), so the selection of sample is often one-sided, and it is difficult to give an in-depth explanation of the mechanism of landslide formation. On the basis of summarizing relevant research, this study adopts the method of positive sample and negative sample for sample selection. The specific process is as follows: Firstly, the CF model is used to partition landslide susceptibility, and then, non-landslide samples are selected from the very-low- and low-susceptibility areas, making the overall sample more reasonable and more authoritative.
The input variables of the model were 561 landslide and 561 non-landslide raster cells, among which landslide raster cells were the known landslide cataloguing information above, and non-landslide cells were mainly collected and acquired in the very-low-susceptibility area and the low-susceptibility area in the whole Nujiang Prefecture in a random way. After that, the above 561 landslide and 561 non-landslide grid cells were randomly divided into two parts, 70% of which were used for SVM model training and 30% of which were used for SVM model testing. By applying the above trained and tested SVM model to the CF values of 11 evaluation factors, the spatial distribution of landslide susceptibility in Nujiang Prefecture could be obtained.

2.3. Selection and Grading of Evaluation Factors and Sampling Strategy

2.3.1. Selection of Evaluation Factors

The influencing factors of geological disasters were divided into two categories: internal leading factors and external environmental triggering factors [44]. In this study, based on the collected historical disaster, relevant literature, geological conditions of the study area, landslide formation conditions, and development characteristics, the elevation, aspect, slope, proximity to river, lithology, proximity to faults, normalized difference vegetation index (NDVI), proximity to roads, land-use type, and precipitation were used as evaluation factors to construct an evaluation index system for landslide susceptibility assessment.

2.3.2. Correlation Analysis of Evaluation Factors

When there are multiple collinearities between the evaluation factors, the model becomes complicated, and the prediction accuracy of the model decreases. To avoid this, SPSS software was used to analyze the correlation of each evaluation factor. If the absolute value of the correlation coefficient is greater than 0.3, it means that there is a strong correlation between the factors; otherwise, the correlation is weak. Results of the correlation analysis are shown in Table 2. Only the soil type factor exceeded 0.3, and the absolute values of the correlation coefficients among the other evaluation factors were all less than 0.3, indicating that the correlation between the factors except for the soil type is weak and can be used for landslide susceptibility analysis.

2.3.3. Grading of Evaluation Factors

The evaluation factors were classified according to the geological environment of the study area and the spatial distribution characteristics of landslides. The evaluation factors were divided into two types: continuous and discrete. The CF values of each factor is shown in Table 3,the density of disaster point at different levels of each factor is shown in Figure 2, the single-factor grading diagram is shown in Figure 3.

Land-Use Type

Different land-use patterns have different impacts on geological disasters. Unplanned land-use patterns destroy the natural environment and aggravate the occurrence of geological disasters.Based on the 2017 land-use data and Google Earth images from 2020, the land-use data-type distribution map of Nujiang in 2020 was obtained in this study by visual interpretation and correction according to first-level classification standards (Figure 2a). By using the deterministic coefficient model, CF values of unused land, grassland, construction land, cultivated land, forest land, and water bodies were calculated (Table 3).The density of disaster point at different levels of each factor is shown in Figure 3a.

Elevation

Although elevation does not directly affect the occurrence of landslides [32], different elevations lead to different factors, such as rainfall, temperature, soil type, vegetation type, and intensity of human activities. Nujiang Prefecture has high mountains, deep valleys, steep slopes, rapid waters, and complex topography and landforms with an elevation difference of more than 4000 m. According to different vertical climatic zones, ArcGIS was used to classify the elevations (Figure 2b), and the CF values of different elevation zones were calculated. It can be seen from Table 2 that when the elevation is less than 1900 m, the certainty coefficient value is close to 1, indicating that within this range, landslide disasters are extremely prone to occur. Superimposing the elevation classification map with the vector of residential areas in Nujiang Prefecture revealed that areas with an elevation of less than 1900 m are densely distributed, human engineering activities such as steep slope reclamation and slope excavation are more frequent, and the damage to the original natural environment is more serious. Most of the water systems are distributed in areas with lower altitudes, which are more likely to cause landslide disasters. In contrast, when the elevation is more than 1900 m, as the altitude increases, the CF value and the possibility of geological disasters gradually decrease.The density of disaster point at different levels of each factor is shown in Figure 3b.

Slope

Slope is a key factor affecting the stability of an area. Areas with large slopes are more likely to witness landslides, while areas with flat terrains and small slopes are less likely to have geological disasters. The study area has high mountains and steep slopes, with a slope range of 0.35–88.38°, which was divided into six grades: 0–10°, 10–20°, 20–30°, 30–40°, 40–50°, and >50° (Figure 2c). The CF model was used to calculate the certainty coefficient of each slope grade, and the relationship between the slope and the occurrence of landslide disasters in the study area was analyzed. The results are shown in Table 3. The certainty coefficient of slope within 10–30° is large, indicating the higher possibility of geological disasters in this area.The density of disaster point at different levels of each factor is shown in Figure 3c.

Aspect

Sunny slopes and shaded slopes receive different solar radiation intensities, which affect the vegetation growth, vegetation types, rainfall, and soil moisture [41,42,43], thereby acting as one of the evaluation factors of landslide disasters. The text was based on ArcGIS10.7 and Nujiang Prefecture DEM data, and the aspect raster map was calculated according to a previously defined method [45,46] (Figure 2d). The aspect was divided into plane, north, northeast, east, southeast, south, southwest, west, and northwest. For the nine levels, the CF value of each slope direction classification area was calculated using the deterministic coefficient model, and the results are shown in Table 3.The density of disaster point at different levels of each factor is shown in Figure 3d.

Proximity to Rivers

Rivers are an important factor in the development of geological disasters. Deforestation in the slopes and erosion at the river banks result in an increase in the empty area. In addition, the water flow increases the moisture content of the soil and increases the weight and softens the rock and soil, thereby reducing the stability of the slope and increasing the probability of geological disasters. In this study, the data for the Nujiang River system were extracted using the hydrological analysis function in the ArcGIS software, and the multi-ring buffer area was used in the field analysis to establish the river buffer area at 200 m intervals. The buffer area was divided into seven intervals by using the CF model (Figure 2e), and the relationship between geological hazard susceptibility and distance to rivers was analyzed (Table 3). The results showed that the greater the distance from the rivers, the smaller the CF value. The CF value is greater than 0 for distances of 0–600 m, indicating that the distance from the water system plays a vital role in the occurrence of geological disasters. When the distance from the water system is greater than 600 m, the CF value is less than 0, which shows that the influence of the water system on geological disasters is relatively weak at such distances.The density of disaster point at different levels of each factor is shown in Figure 3e.

Lithology

Different rock and soil bodies have different lithologies: their structural, physical, and chemical properties and their ability to resist erosion and weathering vary. Therefore, the probability of occurrence of geological disasters also varies. In this study, stratigraphic groups were used as the basic unit to divide the rock and soil mass in Nujiang Prefecture into four groups: soft rock, hard rock, harder rock, and loose rock. The vector data were converted to raster data by using ArcGIS to obtain the Nujiang State lithology classification map shown in Figure 2f. Next, using the CF model, the influence of stratum lithology on the occurrence and development of geological disasters was quantitatively analyzed. The density of disaster point at different levels of each factor is shown in Figure 3f.The results (Table 3) revealed that the harder rock group has the largest certainty coefficient value, indicating that geological disasters are prone to occur in soft and hard interbedded rock formations, such as quartz sandstone, sandstone, shale-intercalated limestone, slate-intercalated basalt, and limestone-interbedded slate, because soft rock interbeds with hard rock, and soft rock becomes a natural sliding bed, creating favorable conditions for the occurrence of geological disasters, while loose rocks (such as sandy clay and sandy gravel) have poor stability and cannot form slopes with large slopes. Hard rocks are resistant to weathering and are not easily eroded; thus, the slopes have good stability and are not prone to geological disasters.

Proximity to Faults

Fault structure is one of the indispensable evaluation factors. During the formation of a fault zone, the rock and soil get divided, destroying the integrity and continuity of the rock formation and affecting the stability of the slope. Broken rock mass and loose deposits can easily lead to geological disasters. The faults in Nujiang Prefecture are clustered; the main faults are distributed from north to south, and the other major faults are mainly located along the north-south direction, with less distribution in the east-west direction. The fault zones in the region are densely distributed and structurally developed. In this study, the multi-ring buffer tool was used to divide the intervals with the catastrophic point ratio and the point density curve mutation points in each grading interval as a reference(Figure 3g), a fault buffer with a distance of 400 m was established (Figure 2g), and the certainty coefficient of each interval was calculated (Table 3). The results showed that the greater the distance from the fault zone, the smaller the certainty coefficient. The CF value is larger in the interval of 0–800 m, indicating that geological disasters are more likely to occur. When the distance from the fault zone exceeds 2000 m, the CF value is less than 0. Thus, the fault structure has little influence on the occurrence of geological disasters at such increased distances.

Proximity to Road

The transportation network in Nujiang Prefecture is developing rapidly with the construction of numerous bridges and tunnels. The excavation of slopes and blasting in engineering construction activities disturb the rock and soil mass AND destroy the stability of the slope, and the rock becomes loose and fragile, thus causing geological disasters. According to the road distribution map of Nujiang Prefecture, a buffer zone with a 200 m radius was established (Figure 2h); the deterministic coefficient values of the buffer zones are shown in Table 3. The density of disaster point at different levels of each factor is shown in Figure 3h.

Precipitation

Rainfall is usually considered as the most vital evaluation factor for the susceptibility evaluation of geological disasters. According to statistical analysis, most landslides in Nujiang Prefecture are rainstorm-type landslides. The influence of rainfall on geological disasters is mainly due to the following aspects: water erosion on the surface leading to soil erosion, a large amount of rainwater infiltration, softening rock strata, increasing slope weight, decreasing slope stability, and induced slope slip. The Kriging interpolation method based on the GIS spatial analysis function was used to perform spatial interpolation of the rainfall data of 11 meteorological stations in Nujiang Prefecture. The natural breakpoint method was used to divide the rainfall data into five grades (Table 3), and the classification of annual average precipitation in Nujiang Prefecture was obtained (Figure 2i). The density of disaster point at different levels of each factor(Figure 3i).

NDVI

Normalized difference vegetation index (NDVI) was first proposed by Rouse et al. in the 1970s [46,47,48]. Because NDVI can better reflect vegetation growth and vegetation coverage, it has been used by numerous scholars. The larger the NDVI value, the larger the vegetation coverage [44,47,48]. In the present study, the MODIS data for 2019 with a spatial resolution of 250 m was selected, the vegetation coverage was extracted, and the outliers were removed using the ENVI5.3 software. The NDVI values were converted to 0–1 by using the fuzzy membership tool in the ArcGIS overlay analysis tool. The natural breakpoint method was used to divide normalized NDVI values into five grades (Figure 2j), and the CF values within the range of different NDVI grades were determined (Table 3).The density of disaster point at different levels of each factor is shown in Figure 3j. NDVI reflects the quantitative relationship between landslide disaster and vegetation density. NDVI can be calculated by the near-infrared band IR and infrared band R obtained from satellite images, as follows:
NDVI =   IR R IR + R
In Equation (5), the value range of NDVI is −0.26 to 0.80, and it is divided into five levels by using the equal spacing method: which are <0, 0–0.2, 0.2–0.4, 0.4–0.6, and >0.6.
The results show that except for interference areas such as water bodies and clouds, there is a trend that the better the vegetation coverage, the larger the CF value. This is contrary to our common conclusion, but it is not a calculation error; instead, it is because Nujiang Prefecture is a typical landform of the southwest alpine valley area with high mountains and steep slopes, poor soil, underdeveloped root systems, and unprotected vegetation. When the vegetation is destroyed, the impact force generated causes the slope to move and displace, thereby increasing the susceptibility to geological disasters. Therefore, in the southwest alpine and valley area, good vegetation coverage is not necessarily conducive for reducing the occurrence of geological disasters. On the contrary, in areas with larger slopes, the greater the NDVI value, the more likely the occurrence of geological disasters.

2.3.4. Sampling Strategy of Modeling Samples

Before modeling, the positive and negative samples in the study area must be sampled. The positive sample is the landslide point in the study area, and the negative sample is the non-landslide point. The selection of the negative sample is very important for the construction of the model. Because the specific location of the landslide-prone area cannot be accurately determined before the prediction, selection of non-landslide points in the landslide-prone area must be avoided; this will help maintain the prediction accuracy of the model. Therefore, in this study, the CF value of each factor was calculated first, and then, the sum of the CF values of all the factors under each grid was calculated to obtain the susceptibility index based on the CF model. A quick evaluation was performed, and finally, by using the natural breakpoint method, landslide susceptibility was divided into five grades: low-prone area, less-prone area, medium-prone area, higher-prone area, and highly prone area. By using the CF model as the a priori model, non-landslide points were randomly selected in areas other than the high-prone area so as to ensure the accuracy of the selection of non-landslide points considering the uncertainty and spatial correlation of landslide-prone area. A total of 561 high probability non-landslide points were selected in the study area. The combination of the existing 561 landslide points and the 561 high-probability non-landslide points selected using the CF model was used as the training and test datasets for the modeling. Among them, 70% of the data was used as the training set and 30% as a test set.The process is shown in Figure 4.

2.4. Model Construction and Application

The CF value of each factor calculated using the CF model was used as the classification data of the SVM model. By using the selected classification data, the appropriate parameters were selected, and the model was trained. Finally, the trained model was used to perform predictions for the entire study area, and the evaluation results of disaster susceptibility were obtained.
After the modeling samples were selected, the model for the study area was constructed. The overall idea of evaluating the susceptibility of the study area is as follows. The CF formula was used to calculate the susceptibility of each factor as the classification data of the SVM model. Then, the selected classification data were used to select appropriate parameters to train the model. Finally, the trained model was used to predict landslide susceptibility for the entire study area.
Using GIS as a platform, the CF value of each index factor was calculated under different state grading by using the CF model. Next, the resolution of the 10 factor layers was unified to 30 m, and the CF values of the factors were added with equal weights. Finally, The landslide susceptibility index of Nujiang Prefecture was reclassified using ArcGIS. As shown in Figure 5, Nujiang Prefecture was divided into areas with five levels of susceptibility: extremely high susceptibility, high susceptibility, medium susceptibility, low susceptibility, and very low susceptibility.
ArcGIS was used to extract the extremely low-prone areas and low-prone areas in the CF model results. Then, 561 non-landslide points and 561 landslide points in the extremely low- and low-prone areas were selected, and the landslide and non-landslide unit spatial data were obtained. Next, the data were normalized. The spatial data of landslide and non-landslide units were divided into 70% and 30% for the training set and test set, respectively. The model was developed using SPSS Modeler18; then, the data were inputted into the SVM model for training and testing. To study the accuracy of the model, four kernel functions, namely linear kernel function, polynomial kernel function, radial basis kernel function, and Sigmoid kernel function, were used for training and testing. The one yielding the highest accuracy was selected: radial basis kernel function. Next, the spatial normalized data for the grid unit of Nujiang Prefecture were inputted into the trained model to obtain the Nujiang Prefecture landslide susceptibility index. Finally, ArcGIS was used to reclassify the landslide susceptibility index, as shown in Figure 6. Accordingly, Nujiang Prefecture was divided into areas with five levels of susceptibility: extremely high susceptibility, high susceptibility, medium susceptibility, low susceptibility, and very low susceptibility.
The four kernel functions are as follows:
(1) Linear kernel function:
K   x , x i = x i x  
(2) Polynomial kernel function:
K   x , x i = x i x + 1 p
(3) Radial basis kernel function:
K   x , x i = exp   γ x i x 2
(4) Sigmoid kernel function:
K   x , x i = tan v   x i x + c

3. Results

3.1. Factor Importance

The importance of index factors reflects the influence degree of different index factors on regional landslide susceptibility. Therefore, calculating and analyzing the importance of each index factor can provide a guiding basis for landslide disaster management. The predictive ability of the indicator factors used in this study is shown in Figure 7. The results obtained by the CF-SVM model show that the elevation (0.301) of landslide adjusting factors is the highest, followed by land use (0.289), proximity to road (0.171), proximity to fault (0.072), proximity to rivers (0.061), NDVI (0.034), slope (0.021), precipitation (0.020), aspect (0.019), and lithology (0.012).

3.2. Landslide Susceptibility Maps

Through ArcGIS10.6 software, the trained model was used to calculate the landslide susceptibility index (LSI), as shown in Figure 6. It can be seen from Figure 6 that the probability of landslide in the whole study area is −1~1. The CF susceptibility index ranged from −0.6997 to 0.4867. The prevalence index of SVM ranged from −0.7236 to 0.6788. The susceptibility index of CF-SVM ranged from −0.9104 to 0.7543. Using the natural discontinuity point classification method in ArcGIS, LSI value is divided into five easy levels: extremely high, high, medium, low, and extremely low, as shown in Figure 7. According to Figure 7, extremely high, high, medium, low, and extremely low levels of CF accounted for 10.03%, 19.60%, 26.02%, 27.24%, and 17.12%, respectively. SVM was 7.67%, 18.22%, 26.29%, 28.00%, and 19.82%; CF-SVM accounted for 7.09%, 16.57%, 10.10%, 30.11%, and 35.63%, respectively.

3.2.1. Evaluation Results of Susceptibility Based on CF Model

The CF value of each index factor under different state grading was calculated and then inputted into each factor layer; then, the ArcGIS raster calculator was used for overlapping with equal weights to obtain the Nujiang prefecture susceptibility index map. The results show that the extremely high- and high-prone areas cover 4355.67 km2, which is only 25.89% of the total area, but include 532 landslide points, accounting for approximately 94.83% of the total number of geological disasters, and the density of disaster points is as high as 0.1221/km2, which is extremely low. The proportion of disaster points in low-prone areas is only 0.36%.

3.2.2. The Susceptibility Evaluation Results Based on the SVM Model

The spatial normalized data of the grid unit in Nujiang Prefecture was inputted into the trained SVM model to obtain the susceptibility index of the grid unit in Nujiang Prefecture. The natural breakpoint method was then used to determine the landslide susceptibility of the grid unit in Nujiang Prefecture. The evaluation index was reclassified to obtain the susceptibility index map of Nujiang Prefecture. The results show that the extremely high- and high-prone areas cover 3806.92 km2, accounting for only 23.66% of the total area, but include 472 landslide points, accounting for approximately 84.13% of the total number of geological disasters, and the density of disaster points is as high as 0.1239/km2, which is extremely low. Further, the proportion of disaster points in low-prone areas is only 1.25%.

3.2.3. Evaluation Results of Susceptibility Based on the Coupling of CF and SVM

The proportion of disasters in high-risk areas can reflect the scientific nature of model evaluation. It is more convenient for government departments to include more disaster units in high-risk areas. In this study, the GIS field calculator was used to determine the area and proportion of qualitative disaster susceptibility grades as well as the number of geological disaster points in each grade and their proportions and density (Table 4, Table 5 and Table 6). The disaster densities in the extremely high- and high-prone areas of CF, SVM, and CF + SVM are 0.1221, 0.1239, and 0.1351 disasters/km2, respectively. The results show that the extremely high- and high-risk areas evaluated by CF + SVM have a higher proportion of landslide disaster, which is more suitable for the practical application of landslide susceptibility in Nujiang Prefecture. The CF + SVM model performed better than the individual CF and SVM models. The results obtained using the CF + SVM model showed that the areas with extremely high vulnerability to geological disasters in Nujiang Prefecture are mainly distributed along the banks of the Dulong River, Nujiang River, Lancang River, and their tributaries as well as along roads at all levels. The analysis result is consistent with the actual geological hazard distribution characteristics in the study area. The extremely high- and high-prone areas cover 3479.05 km2, accounting for only 23.66% of the total area, but include 470 landslide points, accounting for 83.77% of the total number of geological disasters, and the density of disaster points is as high as 0.1351/km2, indicating the susceptibility to geological disasters. The higher the number, the greater the probability of the total number of geological disasters. The CF and SVM models also showed similar effects. In summary, Nujiang Prefecture should strengthen the prevention and control of geological disasters in extremely high- and high-prone areas in the future.

3.3. Test and Comparison of Models

Area under curve (AUC) is defined as the area under the ROC curve; the value ranges between 0.5 and 1. The larger the AUC value, the higher the prediction accuracy of the model. Figure 8 shows that the AUC values of the CF, SVM, and CF + SVM models are 0.865, 0.892, and 0.925, respectively, indicating that the three models have high accuracy and that the coupled model yields higher accuracy than the individual models. Thus, the coupling of CF and SVM models is more suitable for the assessment of landslide susceptibility.

4. Discussion

Most of the historical slope geological disaster sites in Nujiang Prefecture are located in the highly susceptible areas, mainly distributed along rivers and roads in Nujiang Prefecture. The results are in good agreement with the actual occurrence of slope geological disasters, indicating that the selected sensitivity factors and evaluation models are reasonable. These results are consistent with those reported in other studies assessing landslide susceptibility [39,40,42,49,50,51].
Currently, landslide researchers have applied various machine learning methods to different areas with different results. Even within a single region, different models, such as logistic regression and support vector machines, may produce different results due to weighted differences, which in turn are related to their probability distribution functions. These differences stem in part from the choice of model and uncertainty in the input data. Today, many works focus only on the application of a single model to susceptibility assessment (e.g., [52,53,54,55,56]). In this study, statistical models and machine learning methods are coupled. The SVM algorithm is enhanced by creating, using, and testing an integrated CF-SVM model. The results show that the proposed model provides higher prediction accuracy than SVM algorithm. It performs better than a single model. Based on the training and validation datasets, the model successfully distinguishes the landslide-prone areas in the study area. Our results support previous studies showing that coupled models can significantly reduce overfitting and noise problems in the modeling process [13,23,54,57,58,59,60,61,62]. The novelty of our method is that we consider the combination of statistical models and machine learning models, which can perform well in solving the problem of poor performance of a single model.
To improve land management and distribution policies, it is essential to designate landslide-prone areas. Machine learning algorithms are widely used in landslide susceptibility mapping. The main objective is to study and capture the nonlinear relationship between landslide events and their conditional parameters. However, there are still some shortcomings: (1) The selection and analysis of evaluation factors are insufficient. Due to the complex geological structure of the study area, there may be obvious correlation between landslide disaster influencing factors, or the control factors and influencing factors of screening factors are insufficient, which may lead to the decrease of model accuracy. (2) Insufficient normalization of evaluation factors: There are often differences in attributes and dimensions between influencing factors and controlling factors, which will lead to the loss of important disaster evaluation factors. (3) The sample set construction is insufficient; the negative sample selection method is especially very important. Whether the data set cleaning is in place, negative sample selection rules, and the imbalance of positive and negative sample ratio will directly affect the evaluation accuracy of the model. The coupling method of CF and SVM proposed in this paper selects factors after multivariate collinearity diagnosis, emphasizing the importance of data cleaning. Quantization and normalization of the impact factors, complete deletion of null values, replacement of noise values, interpolation outliers and other cleaning work for the original data set, and standard selection of negative samples can ensure that each feature has the same impact on the evaluation results so as to ensure the accuracy of the results. This research method effectively solves the above problems and improves the evaluation accuracy of landslide susceptibility.
The occurrence of landslide is a complex process involving geology, mechanics, meteorology and hydrology, cartography, and other fields of knowledge. Only using the model to evaluate the landslide susceptibility can only roughly predict the range of landslide occurrence, but it is not very accurate. In addition, the number of models selected in this paper is limited, and multi-model optimization comparison can be further carried out to obtain more accurate results. Due to the accelerated urbanization and intense human engineering activities in Nujiang Prefecture, China, in recent years, it is recommended to conduct a vulnerability evaluation every three years with sufficient data on landslide disaster sites and compare the evaluation results to provide a basis for local disaster prevention and reduction and reasonable land planning.

5. Conclusions

Taking Nujiang Prefecture as the study area and by analyzing the data, 10 evaluation factors were selected. Landslide susceptibility was evaluated using CF, SVM, deterministic coefficient, and SVM coupling (CF-SVM). The following conclusions were drawn:
(1) Landslides are common in China’s Nujiang Prefecture, where they cause severe damage to roads, buildings, and other infrastructure. Moreover, future losses are likely to grow as the economy grows. The Chinese government and departments at all levels in Nujiang Prefecture are concerned about the possible loss of life caused by the landslide. To address this issue, Chinese policymakers and policymakers need to better understand where landslides are likely to occur. The accurate landslide sensitivity map provided in this paper can help them select suitable sites for infrastructure development.
(2) In this paper, the prediction accuracy of support vector machine and CF model and their combination in the study area is obtained. The AUC of CF, SVM, and CF-SVM models were 0.865, 0.892, and 0.925, respectively, indicating that the prediction accuracy of the CF-SVM model was the highest, and the model was more suitable for landslide hazard susceptibility evaluation in the study area. This study solves the problem that a single model cannot effectively evaluate the susceptibility of landslide disaster and provides a new idea for the study of landslide disaster in Nujiang Prefecture. It provides an important decision-making basis for disaster prevention and reduction, territorial space planning, and dynamic monitoring in Nujiang Prefecture.

Author Contributions

Conceptualization, Y.L.; methodology, Y.L.; validation, Y.L., X.D. and P.J.; formal analysis, X.D., P.J., Y.Y. and W.J.; resources, Y.Y. and Z.Z.; data curation, Y.L., X.D., Z.Z., W.J. and Y.Y.; writing—original draft preparation, Y.L. and X.D.; writing—review and editing, Y.L., X.D., P.J., Y.Y. and W.J.; visualization, X.D.; supervision, Y.L. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the National Natural Science Foundation of China (Grant No.41161070); The key Project of Yunnan Science and Technology Department and Yunnan University Joint Fund (Grant No.2019 FY003017); and Geological Survey Project of China Geological Survey (Grant No.DD 201905 45; Project of China Geological Survey (DD20221824).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Acknowledgments

The authors extend their sincere thanks to the Earth Observation Team, BigeMap Download, China National Center of Surveying and Mapping, and geospatial Data Cloud, the research platform that provides vector data and image data. We acknowledge all people who contributed to the data collection and processing, as well as the constructive and insightful comments by the editor and anonymous reviewers.

Conflicts of Interest

Declare The authors declare no conflict of interest.

References

  1. Ahmed, B.; Dewan, A. Application of Bivariate and Multivariate Statistical Techniques in Landslide Susceptibility Modeling in Chittagong City Corporation, Bangladesh. Remote Sens. 2017, 9, 304. [Google Scholar] [CrossRef] [Green Version]
  2. Chang, Z.; Du, Z.; Zhang, F.; Huang, F.; Chen, J.; Li, W.; Guo, Z. Landslide Susceptibility Prediction Based on Remote Sensing Images and GIS: Comparisons of Supervised and Unsupervised Machine Learning Models. Remote Sens. 2020, 12, 502. [Google Scholar] [CrossRef] [Green Version]
  3. Chen, W.; Shahabi, H.; Shirzadi, A.; Hong, H.; Akgun, A.; Tian, Y.; Liu, J.; Zhu, A.X.; Li, S. Novel hybrid artificial intelligence approach of bivariate statistical-methods-based kernel logistic regression classifier for landslide susceptibility modeling. Bull. Eng. Geol. Environ. 2019, 78, 4397–4419. [Google Scholar] [CrossRef]
  4. Hussain, M.A.; Chen, Z.; Zheng, Y.; Shoaib, M.; Shah, S.U.; Ali, N.; Afzal, Z. Landslide Susceptibility Mapping Using Machine Learning Algorithm Validated by Persistent Scatterer In-SAR Technique. Sensors 2022, 22, 3119. [Google Scholar] [CrossRef]
  5. Oh, H.-J.; Pradhan, B. Application of a neuro-fuzzy model to landslide-susceptibility mapping for shallow landslides in a tropical hilly area. Comput. Geosci. 2011, 37, 1264–1276. [Google Scholar] [CrossRef]
  6. Sabokbar, H.F.; Roodposhti, M.S.; Tazik, E. Landslide susceptibility mapping using geographically-weighted principal component analysis [Geomorphology, 226, 1 December 2014, 15–24]. Geomorphology 2015, 234, 192. [Google Scholar] [CrossRef]
  7. Shahabi, H.; Hashim, M. Landslide susceptibility mapping using GIS-based statistical models and Remote sensing data in tropical environment. Sci. Rep. 2015, 5, 9899. [Google Scholar] [CrossRef] [Green Version]
  8. Wang, Z.; Brenning, A. Active-Learning Approaches for Landslide Mapping Using Support Vector Machines. Remote Sens. 2021, 13, 2588. [Google Scholar] [CrossRef]
  9. Zhou, C.; Yin, K.; Cao, Y.; Li, Y. Landslide Susceptibility Assessment by Applying the Coupling Method of Radial Basis Neural Network and Adaboost:A Case Study from the Three Gorges Reservoir Area. Earth Sci. 2020, 45, 1865–1876. [Google Scholar]
  10. Huang, F.M. Landslide Displacement Prediction and Susceptibility Evaluation Based on 3S and Artificial Intelligence. Ph.D. Thesis, China University of Geosciences, Wuhan, China, 2017. [Google Scholar]
  11. Nor Diana, M.I.; Muhamad, N.; Taha, M.R.; Osman, A.; Alam, M.M. Social Vulnerability Assessment for Landslide Haz-ards in Malaysia: A Systematic Review Study. Land 2021, 10, 315. [Google Scholar] [CrossRef]
  12. Ram, P.; Gupta, V. Landslide hazard, vulnerability, and risk assessment (HVRA), Mussoorie township, lesser himalaya, India. Environ. Dev. Sustain. 2022, 24, 473–501. [Google Scholar] [CrossRef]
  13. Guo, Z.Z.; Shi, Y.; Huang, F.M.; Fan, X.M.; Huang, J.S. Landslide susceptibility zonation method based on C5.0 decision tree and K-means cluster algorithms to improve the efficiency of risk management. Geosci. Front. 2021, 12, 101249. [Google Scholar] [CrossRef]
  14. Hu, Z.Y.; Luo, W.Q.; Yan, E.C.; Tan, Y.S. Evaluation of geological hazard susceptibility in Yingshan County based on im-proved analytic hierarchy process. Saf. Environ. Eng. 2018, 25, 28–32. [Google Scholar] [CrossRef]
  15. Cao, P.Y.; Hu, S.; Yang, D.D.; Cao, M.M. Risk assessment of Xi’an geological disaster based on fuzzy hierarchy analysis. J. Arid. Land Resour. Environ. 2017, 31, 136–142. [Google Scholar] [CrossRef]
  16. Chen, F.; Cai, C.; Li, X.S.; Sun, T.; Qian, Q. Landslide susceptibility evaluation based on information content and neural net-work model. Chin. J. Rock Mech. Eng. 2020, 39, 2859–2870. [Google Scholar] [CrossRef]
  17. Ozdemir, A.; Altural, T. A comparative study of frequency ratio, weights of evidence and logistic regression methods for landslide susceptibility mapping: Sultan Mountains, SW Turkey. J. Asian Earth Sci. 2013, 64, 180–197. [Google Scholar] [CrossRef]
  18. Chen, W.; Li, Y. GIS-based evaluation of landslide susceptibility using hybrid computational intelligence models. Catena 2020, 195, 104777. [Google Scholar] [CrossRef]
  19. Hong, H.; Chen, W.; Xu, C.; Youssef, A.M.; Pradhan, B.; Dieu Tien, B. Rainfall-induced landslide susceptibility assessment at the Chongren area (China) using frequency ratio, certainty factor, and index of entropy. Geocarto Int. 2017, 32, 139–154. [Google Scholar] [CrossRef]
  20. Sun, D.L.; Xu, J.H.; Wen, H.J.; Wang, D.Z. Assessment of landslide susceptibility mapping based on Bayesian hyperparameter optimization: A comparison between logistic regression and random forest. Eng. Geol. 2021, 281, 105972. [Google Scholar] [CrossRef]
  21. Huang, F.M.; Shi, Y.; Ouyang, W.P.; Hong, A.Y.; Zeng, Z.Q.; Xu, F.G. Landslide susceptibility prediction based on evidence weight and Chi-square automatic interactive detection decision tree. J. Civ. Environ. Eng. (Chin. Engl.) 2022, 44, 16–28. [Google Scholar]
  22. Wang, G.; Lei, X.; Chen, W.; Shahabi, H.; Shirzadi, A. Hybrid Computational Intelligence Methods for Landslide Suscep-tibility Mapping. Symmetry 2020, 12, 325. [Google Scholar] [CrossRef] [Green Version]
  23. Dong Van, D.; Jaafari, A.; Bayat, M.; Mafi-Gholami, D.; Qi, C.; Moayedi, H.; Tran Van, P.; Hai-Bang, L.; Tien-Thinh, L.; Phan Trong, T.; et al. A spatially explicit deep learning neural network model for the prediction of landslide susceptibility. Catena 2020, 188, 104451. [Google Scholar] [CrossRef]
  24. Tehrany, M.S.; Pradhan, B.; Mansor, S.; Ahmad, N. Flood susceptibility assessment using GIS-based support vector machine model with different kernel types. Catena 2015, 125, 91–101. [Google Scholar] [CrossRef]
  25. Chen, W.; Xie, X.; Peng, J.; Shahabi, H.; Hong, H.; Dieu Tien, B.; Duan, Z.; Li, S.; Zhu, A.X. GIS-based landslide susceptibility evaluation using a novel hybrid integration approach of bivariate statistical based random forest method. Catena 2018, 164, 135–149. [Google Scholar] [CrossRef]
  26. Chen, Y.; Liu, Z.B.; Liu, S.Y.; Cai, G.J.; Tong, L.Y. Construction risk identification of long span shallow buried highway tunnel based on analytic hierarchy process. Chin. J. Geotech. Eng. 2011, 33, 198–202. [Google Scholar]
  27. Ali, S.; Biermanns, P.; Haider, R.; Reicherter, K. Landslide susceptibility mapping by using a geographic information system (GIS) along the China-Pakistan Economic Corridor (Karakoram Highway), Pakistan. Nat. Hazards Earth Syst. Sci. 2019, 19, 999–1022. [Google Scholar] [CrossRef] [Green Version]
  28. Klose, M.; Gruber, D.; Damm, B.; Gerold, G. Spatial databases and GIS as tools for regional landslide susceptibility modeling. Z. Fur Geomorphol. 2014, 58, 1–36. [Google Scholar] [CrossRef]
  29. Niu, R.; Peng, L.; Ye, R.; Wu, X. Landslide Susceptibility Assessment Based on Rough Sets and Support Vector Machine. J. Jilin Univ. Earth Sci. Ed. 2012, 42, 430–439. [Google Scholar]
  30. Wu, X.; Ren, F.; Niu, R. Landslide susceptibility assessment using object mapping units, decision tree, and support vector machine models in the Three Gorges of China. Environ. Earth Sci. 2014, 71, 4725–4738. [Google Scholar] [CrossRef]
  31. Tie, Y.B.; Ruan, C.F.; Yang, S.; Zeng, B.; Zhang, H.B.; Li, F.Q.; Sun, J.F.; He, Z.Q. Characteristics and formation mechanism of the "May 25" rainstorm induced geological disasters in Gongshan County, Yunnan Province. Bull. Soil Water Conserv. 2021, 41, 10–15. [Google Scholar] [CrossRef]
  32. Hao, X.Z.; Li, Y.M. Analysis of impoverishment factors and countermeasures in Nujiang Prefecture. J. Yunnan Univ. Nat. Sci. Ed. 2007, 29, 166–172. [Google Scholar]
  33. Shen, D.; Shi, Z.; Peng, M.; Zhang, L.; Zhu, Y. Preliminary analysis of a rainfall-induced landslide hazard chain in Enshi City, Hubei Province, China in July 2020. Landslides 2021, 18, 509–512. [Google Scholar] [CrossRef]
  34. Huang, F.; Zhang, J.; Zhou, C.; Wang, Y.; Huang, J.; Zhu, L. A deep learning algorithm using a fully connected sparse autoencoder neural network for landslide susceptibility prediction. Landslides 2020, 17, 217–229. [Google Scholar] [CrossRef]
  35. Chen, W.; Chen, X.; Peng, J.; Panahi, M.; Lee, S. Landslide susceptibility modeling based on ANFIS with teach-ing-learning-based optimization and Satin bowerbird optimizer. Geosci. Front. 2021, 12, 93–107. [Google Scholar] [CrossRef]
  36. Pandey, V.K.; Pourghasemi, H.R.; Sharma, M.C. Landslide susceptibility mapping using maximum entropy and support vector machine models along the highway corridor, Garhwal Himalaya. Geocarto Int. 2020, 35, 168–187. [Google Scholar] [CrossRef]
  37. Wang, H.B.; Liu, G.J.; Xu, W.Y.; Wang, G.H. GIS-based landslide hazard assessment: An overview. Prog. Phys. Geogr. Earth Environ. 2005, 29, 548–567. [Google Scholar] [CrossRef]
  38. Yong, C.; Dong, J.; Fei, G.; Bin, T.; Tao, Z.; Hao, F.; Li, W.; Qinghua, Z. Review of landslide susceptibility assessment based on knowledge mapping. Stoch. Environ. Res. Risk Assess. 2022, 36, 2399–2417. [Google Scholar] [CrossRef]
  39. He, J.; Li, B. Analysis of main inducing factors of landslide in Lushui County, Nujiang Prefecture. J. Kunming Univ. Sci. Technol. Sci. Technol. 2005, 30, 1–4. [Google Scholar]
  40. Tan, S.C.; Jin, Y.Z.; Hu, X.G.; Jiang, S.D.; Feng, L. Design and establishment of spatial database for meteorological forecast and early warning of slope geological hazards. J. Earth 2012, 33, 812–818. [Google Scholar]
  41. Hu, J.; Min, Y.; Li, H.H.; Xu, Y.J.; Li, C.; Li, X. Study on optimization of fine meteorological risk warning model for geological hazards in Yunnan Province. Sci. Disaster 2016, 31, 110–115. [Google Scholar]
  42. Li, Y.M.; Xie, Y.Y.; Jiang, D.M.; Duan, Y.P. Study on the sensitivity of environmental factors for geological hazards in Nujiang Prefecture. Res. Soil Water Conversat. 2018, 25, 300–305. [Google Scholar] [CrossRef]
  43. Yao, Y.W.; Liu, W.; Fang, N. Characteristics and prevention countermeasures of geological disasters in Lanping County, Nujiang Prefecture, Yunnan Province. Yunnan Geol. 2019, 38, 483–488. [Google Scholar]
  44. Dikshit, A.; Sarkar, R.; Pradhan, B.; Acharya, S.; Alamri, A.M. Spatial Landslide Risk Assessment at Phuentsholing, Bhutan. Geosciences 2020, 10, 131. [Google Scholar] [CrossRef] [Green Version]
  45. Chen, L.; Li, L.; Wu, F.; Xu, Y. Evaluation of the Geological Hazard Vulnerability in the Beiliu City Based on GIS and In-formation Value Model. Earth Environ. 2020, 48, 471–479. [Google Scholar]
  46. Li, Y.; Chen, L.; Yin, K.; Zhang, Y.; Gui, L. Quantitative risk analysis of the hazard chain triggered by a landslide and the generated tsunami in the Three Gorges Reservoir area. Landslides 2021, 18, 667–680. [Google Scholar] [CrossRef]
  47. Bruzon, A.G.; Arrogante-Funes, P.; Arrogante-Funes, F.; Martin-Gonzalez, F.; Novillo, C.J.; Fernandez, R.R.; Vazquez-Jimenez, R.; Alarcon-Paredes, A.; Alonso-Silverio, G.A.; Cantu-Ramirez, C.A.; et al. Landslide Susceptibility Assessment Using an AutoML Framework. Int. J. Environ. Res. Public Health 2021, 18, 10971. [Google Scholar] [CrossRef]
  48. Yang, X.; Liu, R.; Yang, M.; Chen, J.J.; Liu, T.Q.; Yang, Y.T.; Chen, W.; Wang, Y.T. Incorporating Landslide Spatial In-formation and Correlated Features among Conditioning Factors for Landslide Susceptibility Mapping. Remote Sens. 2021, 13, 2166. [Google Scholar] [CrossRef]
  49. Bayat, M.; Ghorbanpour, M.; Zare, R.; Jaafari, A.; Pham, B.T. Application of artificial neural networks for predicting tree survival and mortality in the Hyrcanian forest of Iran. Comput. Electron. Agric. 2019, 164, 104929. [Google Scholar] [CrossRef]
  50. Bui, D.T.; Shahabi, H.; Omidvar, E.; Shirzadi, A.; Geertsema, M.; Clague, J.J.; Khosravi, K.; Pradhan, B.; Pham, B.T.; Chapi, K.; et al. Shallow Landslide Prediction Using a Novel Hybrid Functional Machine Learning Algorithm. Remote Sens. 2019, 11, 931. [Google Scholar] [CrossRef] [Green Version]
  51. Hong, H.Y.; Liu, J.Z.; Zhu, A.X.; Shahabi, H.; Pham, B.T.; Chen, W.; Pradhan, B.; Bui, D.T. A novel hybrid integration model using support vector machines and random subspace for weather-triggered landslide susceptibility assessment in the Wuning area (China). Environ. Earth Sci. 2017, 76, 652. [Google Scholar] [CrossRef]
  52. Hong, H.Y.; Liu, J.Z.; Bui, D.T.; Pradhan, B.; Acharya, T.D.; Pham, B.T.; Zhu, A.X.; Chen, W.; Bin Ahmad, B. Landslide susceptibility mapping using J48 Decision Tree with AdaBoost, Bagging and Rotation Forest ensembles in the Guangchang area (China). Catena 2018, 163, 399–413. [Google Scholar] [CrossRef]
  53. Hong, H.Y.; Pradhan, B.; Xu, C.; Tien Bui, D. Spatial prediction of landslide hazard at the Yihuang area (China) using two-class kernel logistic regression, alternating decision tree and support vector machines. Catena 2015, 133, 266–281. [Google Scholar] [CrossRef]
  54. Liu, J.K.; Shih, P.T.Y. Topographic Correction of Wind-Driven Rainfall for Landslide Analysis in Central Taiwan with Validation from Aerial and Satellite Optical Images. Remote Sens. 2013, 5, 2571–2589. [Google Scholar] [CrossRef] [Green Version]
  55. Shahabi, H.; Hashim, M.; Bin Ahmad, B. Remote sensing and GIS-based landslide susceptibility mapping using frequency ratio, logistic regression, and fuzzy logic methods at the central Zab basin, Iran. Environ. Earth Sci. 2015, 73, 8647–8668. [Google Scholar] [CrossRef]
  56. Shirzadi, A.; Shahabi, H.; Chapi, K.; Bui, D.T.; Pham, B.T.; Shahedi, K.; Bin Ahmad, B. A comparative study between popular statistical and machine learning methods for simulating volume of landslides. Catena 2017, 157, 213–226. [Google Scholar] [CrossRef]
  57. Bui, D.T.; Shirzadi, A.; Shahabi, H.; Geertsema, M.; Omidvar, E.; Clague, J.J.; Pham, B.T.; Dou, J.; Asl, D.T.; Bin Ahmad, B.; et al. New Ensemble Models for Shallow Landslide Susceptibility Modeling in a Semi-Arid Watershed. Forests 2019, 10, 743. [Google Scholar] [CrossRef] [Green Version]
  58. Nhu, V.H.; Shirzadi, A.; Shahabi, H.; Singh, S.K.; Al-Ansari, N.; Clague, J.J.; Jaafari, A.; Chen, W.; Miraki, S.; Dou, J.; et al. Shallow Landslide Susceptibility Mapping: A Comparison between Logistic Model Tree, Logistic Regression, Naive Bayes Tree, Artificial Neural Network, and Support Vector Machine Algorithms. Int. J. Environ. Res. Public Health 2020, 17, 2749. [Google Scholar] [CrossRef] [Green Version]
  59. Pham, B.T.; Bui, D.T.; Prakash, I.; Dholakia, M.B. Hybrid integration of Multilayer Perceptron Neural Networks and ma-chine learning ensembles for landslide susceptibility assessment at Himalayan area (India) using GIS. Catena 2017, 149, 52–63. [Google Scholar] [CrossRef]
  60. Pham, B.T.; Prakash, I.; Singh, S.K.; Shirzadi, A.; Shahabi, H.; Tran, T.T.G.; Buig, D.T. Landslide susceptibility modeling using Reduced Error Pruning Trees and different ensemble techniques: Hybrid machine learning approaches. Catena 2019, 175, 203–218. [Google Scholar] [CrossRef]
  61. Zhou, X.Z.; Wen, H.J.; Li, Z.W.; Zhang, H.; Zhang, W.G. An interpretable model for the susceptibility of rainfall-induced shallow landslides based on SHAP and XGBoost. Geocarto Int. 2022, 1–27. [Google Scholar] [CrossRef]
  62. Devkota, K.C.; Regmi, A.D.; Pourghasemi, H.R.; Yoshida, K.; Pradhan, B.; Ryu, I.C.; Dhital, M.R.; Althuwaynee, O.F. Landslide susceptibility mapping using certainty factor, index of entropy and logistic regression models in GIS and their comparison at Mugling-Narayanghat road section in Nepal Himalaya. Nat. Hazards 2013, 65, 135–165. [Google Scholar] [CrossRef]
Figure 1. Location map of the study area.
Figure 1. Location map of the study area.
Ijerph 19 14248 g001
Figure 2. Single-factor grading diagram: (a) land-use type; (b) elevation; (c) slope; (d) aspect; (e) proximity to rivers; (f) lithology; (g) proximity to faults; (h) proximity to road; (i) precipitation; (j) NDVI.
Figure 2. Single-factor grading diagram: (a) land-use type; (b) elevation; (c) slope; (d) aspect; (e) proximity to rivers; (f) lithology; (g) proximity to faults; (h) proximity to road; (i) precipitation; (j) NDVI.
Ijerph 19 14248 g002aIjerph 19 14248 g002bIjerph 19 14248 g002c
Figure 3. The density of disaster point at different levels of each factor: (a) land-use type; (b) elevation; (c) slope; (d) aspect; (e) proximity to rivers; (f) lithology; (g) proximity to faults; (h) proximity to road; (i) precipitation; (j) NDVI.
Figure 3. The density of disaster point at different levels of each factor: (a) land-use type; (b) elevation; (c) slope; (d) aspect; (e) proximity to rivers; (f) lithology; (g) proximity to faults; (h) proximity to road; (i) precipitation; (j) NDVI.
Ijerph 19 14248 g003aIjerph 19 14248 g003bIjerph 19 14248 g003c
Figure 4. Graphical description of the methodological approach pursued CF-SVM-based LSM.
Figure 4. Graphical description of the methodological approach pursued CF-SVM-based LSM.
Ijerph 19 14248 g004
Figure 5. Slide susceptibility index diagram.
Figure 5. Slide susceptibility index diagram.
Ijerph 19 14248 g005
Figure 6. Evaluation grade of landslide susceptibility of each model.
Figure 6. Evaluation grade of landslide susceptibility of each model.
Ijerph 19 14248 g006
Figure 7. Factor importance.
Figure 7. Factor importance.
Ijerph 19 14248 g007
Figure 8. ROC curve.
Figure 8. ROC curve.
Ijerph 19 14248 g008
Table 1. Source and scales for the landslide conditioning factors used in this study.
Table 1. Source and scales for the landslide conditioning factors used in this study.
Conditioning FactorSourceScaleClassification Method
ElevationDEM was derived from ASTER GDEM data of the Geospatial Data Cloud (http://www.gscloud.cn/ (accessed on 25 January 2022).)30 × 30 mManual
AspectManual
SlopeManual
LithologyGeological map provided by Nujiang State Land and Land Bureau.1:250,000Lithological units
Proximity to faultsEqual interval
Proximity to riversThe National Basic Geographic Information Database (https://wwwngcc.cn/ (accessed on 12 November 2021).)--Equal interval
Proximity to road--Equal interval
NDVIThe normalized difference vegetation index (NDVI) data were obtained from NASA (https://www.nasa.gov/ (accessed on 5 November 2021).)250 × 250 mNatural breaks
PrecipitationThe meteorological data were procured from the Nujiang Meteorological Bureau and Water Bureau.--Natural breaks
Land-use typeThe land-use type data were carried out based on the Landsat 8 OLI/TIRS data of Geospatial Data Cloud (http://www.gscloud.cn/ (accessed on 25 December 2020).)The interpretation accuracy reached 94.17%.Land-cover unit
Table 2. Correlation values of each factor.
Table 2. Correlation values of each factor.
FactorElevationAspectSlopeLithologyProximity to FaultsProximity to RiversProximity to RoadNDVIPrecipitationLand-Use TypeAgrotype
Elevation1.000
Aspect0.0701.000
Slope−0.095−0.0461.000
Lithology0.4130.078−0.0371.000
Proximity to faults0.0540.031−0.0190.1701.000
Proximity to rivers0.0740.0780.0360.109−0.0591.000
Proximity to road0.1390.0920.0070.1090.0630.0341.000
NDVI0.2160.048−0.1320.092−0.045−0.001−0.0081.000
Precipitation−0.160−0.1380.082−0.3090.0260.105−0.066−0.0771.000
Land-use type−0.065−0.0090.193−0.052−0.0200.1280.032−0.1070.0921.000
Agrotype0.371−0.009−0.1050.0650.2500.1310.0480.1840.0680.1211.000
Table 3. CF values of each factor.
Table 3. CF values of each factor.
FactorClassificationAreaNumber of Landslide PointspapsCF
Slope0–10°731.0079300.04100.03820.0731
10–20°2240.54281160.05180.03820.2735
20–30°3820.08151870.04900.03820.2293
30–40°4425.6031530.03460.0382−0.0973
40–50°2680.7157630.02350.0382−0.3933
>50°805.0491120.01490.0382−0.6186
AspectPlane22.225500.00000.0382−1.0000
North1886.0004400.02120.0382−0.4538
Northeast1702.0764680.04000.03820.0467
East1852.4251000.05400.03820.3048
Southeast1806.8571750.04150.03820.0840
South1907.3826560.02940.0382−0.2375
Southwest1849.5159660.03570.0382−0.0671
West1862.0028960.05160.03820.2703
Northwest1814.5143600.03310.0382−0.1379
Elevation<1200253.4256250.09860.03820.6375
1200–1600695.91511180.16960.03820.8057
1600–1900985.47661620.16440.03820.7984
1900–24002467.76221870.07580.03820.5162
2400–30004342.7025670.01540.0382−0.6050
3000–36003874.663820.00050.0382−0.9870
>36002083.054200.00000.0382−1.0000
Proximity to rivers0–2001610.7471940.05840.03820.3599
200–4001539.69211190.07730.03820.5264
400–6001476.4599840.05690.03820.3424
600–8001412.6364900.06370.03820.4170
800–10001348.4016660.04890.03820.2292
1000–12001273.5279510.04000.03820.0491
>12006041.535570.00940.0382−0.7599
LithologyWeak rock group7073.29442300.03250.0382−0.1528
Hard rock group4066.3621480.03640.0382−0.0479
Harder rock group3545.01871830.05160.03820.2712
Loose rock group18.324900.00000.0382−1.0000
Proximity to faults<4001764.5994940.05330.03820.2950
400–8001569.76561190.07580.03820.5164
800–12001359.7416840.06180.03820.3975
1200–16001274.3034900.07060.03820.4780
1600–2000999.1170660.06610.03820.4392
2000–2400844.9938510.06040.03820.3824
>24006890.4792570.00830.0382−0.7897
NDVIPoor vegetation cover289.265500.00000.0382−1.0000
Average vegetation coverage980.5989210.02140.0382−0.4483
Good vegetation coverage3216.54311020.03170.0382−0.1744
Excellent vegetation cover5367.60632450.04560.03820.1706
Very excellent vegetation cover4832.63261930.03990.03820.0464
Proximity to
road
0–2001753.14962750.15690.03820.7868
200–4001560.81151000.06410.03820.4205
400–6001390.5945450.03240.0382−0.1570
600–8001249.3899330.02640.0382−0.3161
800–10001111.7763290.02610.0382−0.3248
1000–12001087.8549310.02850.0382−0.2606
1200–1400863.1207180.02090.0382−0.4631
>14005686.3026300.00530.0382−0.8663
Land-use typeConstruction land137.15371060.77290.03820.9883
Plowland1056.79532780.26310.03820.8889
Grassland1206.970230.00250.0382−0.9372
Forest10918.00861660.01520.0382−0.6108
Unutilized land666.054980.01200.0382−0.6935
Water body718.017300.00000.0382−1.0000
Precipitation852–10214331.93672070.04780.03820.2095
1021–11523233.56861310.04050.03820.0605
1152–12992430.3624500.02060.0382−0.4705
1299–14512797.7581300.04650.03820.2284
1451–16561909.3743430.02250.0382−0.4192
Table 4. Statistical table of CF model susceptibility.
Table 4. Statistical table of CF model susceptibility.
Degree of SusceptibilityArea (km2)Ratio (%)Number of Disasters (Unit)Ratio%Disaster Point Density (Unit/km2)
Very low susceptibility2516.500017.12%0.00000.00000.0000
Low susceptibility4005.790027.24%2.00000.36%0.0005
Moderate susceptibility3825.040026.02%27.00004.81%0.0071
High susceptibility2881.430019.60%150.000026.74%0.0521
Very high susceptibility1474.240010.03%382.000068.09%0.2591
Total14,703.00001.0000561.00001.0000
Table 5. Statistical table of SVM model susceptibility.
Table 5. Statistical table of SVM model susceptibility.
Degree of SusceptibilityArea (km2)Ratio (%)Number of Disasters (Unit)Ratio%Disaster Point Density (Unit/km2)
Very low susceptibility2914.320019.82%0.00000.00000.0000
Low susceptibility4116.110028.00%7.00001.25%0.0017
Moderate susceptibility3865.650026.29%82.000014.62%0.0212
High susceptibility2678.860018.22%173.000030.84%0.0646
Very high susceptibility1128.06007.67%299.000053.30%0.2651
Total10,473.001.00005611.0000
Table 6. Statistical table of CF + SVM model susceptibility.
Table 6. Statistical table of CF + SVM model susceptibility.
Degree of SusceptibilityArea (km2)Ratio (%)Number of Disasters (Unit)Ratio%Disaster Point Density (Unit/km2)
Very low susceptibility5238.3700235.63%0.00000.00000.0000
Low susceptibility4427.6266230.11%223.92%0.0049
Moderate susceptibility1557.9451210.10%6912.30%0.0442
High susceptibility2435.8321216.57%115427.45%0.0632
Very high susceptibility1043.226127.09%31656.33%0.3029
Total10,473.001.00005611.0000
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Li, Y.; Deng, X.; Ji, P.; Yang, Y.; Jiang, W.; Zhao, Z. Evaluation of Landslide Susceptibility Based on CF-SVM in Nujiang Prefecture. Int. J. Environ. Res. Public Health 2022, 19, 14248. https://doi.org/10.3390/ijerph192114248

AMA Style

Li Y, Deng X, Ji P, Yang Y, Jiang W, Zhao Z. Evaluation of Landslide Susceptibility Based on CF-SVM in Nujiang Prefecture. International Journal of Environmental Research and Public Health. 2022; 19(21):14248. https://doi.org/10.3390/ijerph192114248

Chicago/Turabian Style

Li, Yimin, Xuanlun Deng, Peikun Ji, Yiming Yang, Wenxue Jiang, and Zhifang Zhao. 2022. "Evaluation of Landslide Susceptibility Based on CF-SVM in Nujiang Prefecture" International Journal of Environmental Research and Public Health 19, no. 21: 14248. https://doi.org/10.3390/ijerph192114248

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop