A Comparative Assessment of Machine Learning Models for Landslide Susceptibility Mapping in the Rugged Terrain of Northern Pakistan

Shahzad, Naeem; Ding, Xiaoli; Abbas, Sawaid

doi:10.3390/app12052280

Open AccessArticle

A Comparative Assessment of Machine Learning Models for Landslide Susceptibility Mapping in the Rugged Terrain of Northern Pakistan

by

Naeem Shahzad

^1,2,*

,

Xiaoli Ding

^1,2

and

Sawaid Abbas

^1,3,*

¹

Department of Land Surveying and Geo-Informatics, The Hong Kong Polytechnic University, Hong Kong SAR, China

²

Research Institute for Land and Space, The Hong Kong Polytechnic University, Hong Kong SAR, China

³

Remote Sensing, GIS and Climatic Research Lab (RSGCRL), National Center of GIS and Space Applications, University of the Punjab, Lahore 54590, Pakistan

^*

Authors to whom correspondence should be addressed.

Appl. Sci. 2022, 12(5), 2280; https://doi.org/10.3390/app12052280

Submission received: 25 December 2021 / Revised: 12 February 2022 / Accepted: 15 February 2022 / Published: 22 February 2022

(This article belongs to the Section Earth Sciences)

Download

Browse Figures

Versions Notes

Abstract

:

This study investigated the performances of different techniques, including random forest (RF), support vector machine (SVM), maximum entropy (maxENT), gradient-boosting machine (GBM), and logistic regression (LR), for landslide susceptibility mapping (LSM) in the rugged terrain of northern Pakistan. Initially, a landslide inventory of 200 samples was produced along with an additional 200 samples indicating nonlandslide areas and divided into training (70%) and validation (30%) groups using a stratified loop-based random sampling approach. Then, a geospatial database of 12 possible landslide influencing factors (LIFs) was generated, including elevation, slope, aspect, topographic wetness index (TWI), topographic position index (TPI), distance to drainage, distance to fault, distance to road, normalized difference vegetation index (NDVI), rainfall, land cover/land use (LCLU), and a geological map of the study area. None of the LIFs were redundant for the modeling, as indicated by the multicollinearity test (tolerance > 0.1) and information gain ratio (IGR > 0). We extended the evaluation measures of each algorithm from area-under-the-curve (AUC) analysis to the calculation of performance overall (POA) with the help of precision, recall, F1 score, accuracy (ACC), and Matthew’s correlation coefficient (MCC). The results showed that the SVM was the most promising model (AUC = 0.969, POA = 2669) for the LSM, followed by RF (AUC = 0.967, POA = 2656), GBM (AUC = 0.967, POA = 2623), maxENT (AUC = 0.872, POA = 1761), and LR (AUC = 0.836, POA = 1299). It is important to note that the SVM, RF, and GBM were the top performers, with almost similar accuracy. Thus, each of these could be equally effective for LSM and can be used for risk reduction and mitigation measures in the rugged terrain of Pakistan and other regions with similar topography.

Keywords:

machine learning; landslide susceptibility; random forest; support vector machine; gradient-boosting machine; maximum entropy; rugged terrain; Pakistan

Graphical Abstract

1. Introduction

Landslides, in comparison to other natural disasters, are one of the most fatal geological hazards. Under the influence of gravity, these events occur when internal resistive forces of a mass of rock, debris, or earth weaken the slope stability [1]. Factors that influence landslides often include intensive rainfall, volcanic eruption, land use/land cover (LULC), earthquakes, geological evolution, irrigation, and climatic variations [2,3]. Brabb [4] suggested that losses due to landslides could be controlled and minimized by 90% if proper attention is given to investigating their trigger factors and identifying high-risk areas. Guzzetti et al. [5] found that only 1% of all the Earth’s continents have been evaluated for landslide hazards, out of which 17% are attributed to landslides [6].

Researchers have proposed various landslide susceptibility mapping (LSM) approaches using satellite remote sensing (SRS) and geographic information systems (GIS) to identify landslide-prone areas [7,8,9,10,11] by analyzing and evaluating the probability of landslide occurrences [5]. These mapping approaches are divided into two categories, i.e., qualitative and quantitative [1,12]. The qualitative approaches mostly rely on the level of an expert’s knowledge, whereas the quantitative methods are based on probabilistic/statistical theories or models [13]. Both approaches have their limitations, for instance, the misinterpretation of the expert may lower the accuracy in the qualitative methods, whereas the use of poor, low-precision, and inaccurate data may decrease the accuracy in the quantitative methods [14]. Based on historic events, these prediction and mapping approaches can identify and locate the probability of occurrences of such events in a particular region with similar environmental, geophysical, geological, and other related conditions [15]. Although research on LSM is not capable of stopping landslide hazards totally due to its complex nature, highly precise LSM can help in reducing the impacts of this hazard by developing effective mitigation policies and measures [16,17]. In order to generate high-precision susceptibility maps, machine learning (ML) and data-mining approaches have attracted a number of researchers, particularly since 2005, and the development of ML-based quantitative methods has proven to provide accurate and reliable results of landslide susceptibility [18]. Based on the assumption of certain geological, geotechnical, and climatic conditions exhibiting similar patterns of landslide [19], these methods are considered as most effective in solving the nonlinear nature of geospatial problems.

Numerous studies have discussed the advantages of different ML methods including logistic regression (LR) [20], artificial neural network (ANN) [21], support vector machine (SVM) [10,21,22], decision trees (DT) [22], maximum entropy (maxENT) [11], random forest (RF) [23,24,25,26,27], and boosted trees (BT) [28]. A comprehensive review of the architecture and working mechanisms of popular ML methods can be found in Merghadi et al. [29]. Several efforts have been made to implement, investigate and compare these ML methods in different geographical settings [10,15,22,23,24,25,30,31]. For example, Merghadi et al. [32] investigated and compared the performances of RF, gradient-boosting machine (GBM), SVM, neural network (NNET), and LR in Mila Basin, Algeria. The authors of this study found that the GBM and RF methods provided good results, with areas under the curve (AUCs) of 0.897 and 0.895, respectively. On the other hand, a similar assessment was conducted by Wang et al. [33] in Shexian County, China, using the RF, SVM, GBM, multilayer perceptron (MLP), and LR models and found that RF and SVM provided the results with the best AUCs of 0.821 and 0.803, respectively. Recently, Qing et al. [34] implemented multiple ML models including generalized linear models (GLM), SVM, Naïve Bayes (NB), and other tree-based models to study the susceptibility of a debris flow along the China–Pakistan Karakoram Highway. The authors adopted different modeling strategies based on the watershed and catchment boundaries in the peripheries of the highway and found that the SVM performed best (AUC = 0.96). Ali et al. [35] assessed the prediction accuracies and performances of NB and RF by comparing their results with a fuzzy decision-making trial and evaluation laboratory approach combined with the analytic network process approach (FDEMATEL–ANP) in Kysuca river basin, Slovakia. RF was found to be an optimal model with an AUC value of 0.954 in the region. As a conclusion to the studies published so far, the accuracies of ML algorithms vary under different conditions of topography, geological setting, and climate [20]. Regardless of the number of ML methods, there is ‘no rule of thumb’ and ‘no free lunch’ [36] regarding which method or technique is the most suitable for landslide susceptibility due to the high level of uncertainty behind the processes and the varying topographical and climatic conditions of areas [37]. Therefore, it is necessary to study landslide dynamics and proneness by testing these algorithms for different site conditions for effective risk management and planning.

In the current study, we adopted five ML algorithms, RF, SVM, maxENT, GBM, and LR, to study the landslide susceptibility at the regional level in deep canyons and rugged terrain of northern Pakistan, where slopes vary up to 89 degrees. The selection of these five ML models was based on several reasons. First, these ML models constitute limited studies at a regional level and two of them (i.e., GBM and maxENT) have never been explored for LSM at this scale. Second, unlike other ML models, these models do not need a large amount of distributed sample data. Third, considering the working mechanisms of these models, it is expected that these models are suitable to achieve a high accuracy of landslide modeling at this scale. The diverse and sudden variation in the slopes and the presence of active thrust and strike slip faults in the area gives rise to frequent landslide events, especially after the October 2005 earthquake [38], and no significant ML-based approaches have been conducted in this region for LSM. Therefore, this study is the first of its kind in the area with the main objectives of investigating and highlighting the important influencing factors and assessing the performances of ML algorithms. Therefore, it is believed that this research study could provide a significant contribution to the scientific community which can be further used to delineate, identify, and understand the landslide risk and hotspots for planning and decision making in this hazardous region and other regions with similar topography.

2. Materials and Methods

In this research study, five ML techniques (RF, SVM, maxENT, GBM, and LR) were implemented to generate landslide susceptibility maps and to evaluate their performances in the region. The overall adopted methodology in this research comprised five main steps: (i) preparation of landslide inventory; (ii) preparation of landslide influencing factors; (iii) evaluation of the selected landslide influencing factors; (iv) tuning, training, and modeling through the selected models using training data; and (v) performance evaluation of the modeled results using validation data. A workflow of this methodology is summarized in Figure 1 and a step-by-step discussion is elaborated in later sections. The data were prepared in ArcMap, whereas modeling and evaluation were performed in R v3.6.3 (The R Foundation for Statistical Computing, Vienna, Austria).

2.1. Description of the Study Area

The northern areas of Pakistan are known for various natural hazards such as earthquakes, landslides, and floods [39]. One of the worst earthquakes in October 2005 (Mw = 7.6) left behind more than 80,000 deaths and a huge loss to infrastructure [40] and gave rise to slope failure events. This single event destabilized a large number of slopes in this part of the country. The central part of the study area under investigation constitutes the Hazara Division of the Khyber Pakhtunkhwa province of Pakistan, including part of Azad Jammu and Kashmir (AJK) in the east, Gilgit in the north, and Islamabad (the capital city of Pakistan) in the south. Geographically, the study area extends between 72°30′30.16″ E, 33°42′48.92″ N and 74°11′46.64″ E, 35°51′28.29″ N with an elevation range from 211 to 8558 m (Figure 2).

With an area of more than 34,000 sq·km, the study area covers several mountain hills and tourist places with a diverse variety of land features including rivers, major faults, snow-covered peaks, and sparse to medium-dense forests [41]. Geologically, the formation of the region ranges from Precambrian to Quaternary, which includes metasediments and metamorphic and other consolidated materials. The area is comprised of two main zones, i.e., the Harman seismic zone (HSZ) and the Indus Kohistan seismic zone (IKSZ). The study area is comprised of four main units, i.e., (i) the Jijal complex, (ii) Kamila amphibolite with para and ortho amphibolite plutonic rocks, (iii) Kohistan batholith, and (iv) metavolcanic/metasedimentary groups [42]. The major thrust faults in the study area include the main mantle thrust (MMT) and the main Karakorum thrust (MKT). The minor faults such as the Kohistan fault are in depositional contact, i.e., contact separating the Higher Himalayan Crystalline (HHC) with two faults making cross-sectional lines [43]. The study area is part of the upper Indus basin and feeds two of the major dams in Pakistan, i.e., the Tarbela and Mangla Dams. The main highway in the study area, Karakoram Highway (KKH), which is part of the China–Pakistan Economic Corridor (CPEC), mostly follows the Indus River and has been facing a severe problem of a large number of unstable slopes, degradation, and landslides due to a variety of factors such as steep slopes, geology, and rainfall.

The study area can be broadly divided into two climate zones, i.e., the southern zone and the northern zone. Geographically, the southern zone extends from Islamabad/Haripur up to the Dasu area and exhibits a high amount (approximately 1000 mm/year) of rainfall, with maximum rainfall during monsoon season. The northern zone, which extends from Dasu to Chilas and Gilgit, experiences lower rainfall (approximately 220 mm/year) as compared to the southern zone. One of the reasons for this sharp change in the climatic condition is the sudden change in the topography and terrain orientation in the area where the Indus valley turns into sharp bends. The changing patterns in terms of average rainfall and temperature (minimum and maximum) are shown in Figure 3.

2.2. Data Acquisition and Data Preparation

A Landsat-8 OLI image on 27 October 2017 and a Shuttle Radar Topographic Mission (SRTM) Digital Elevation Model (DEM) were obtained from United States Geological Survey (USGS) Earth Resources Observation and Science (EROS) data center. The data were preprocessed to develop a land-cover/land-use (LCLU) map; normalized difference vegetation index (NDVI); and subsequent topographic parameters such as slope, aspect, topographic wetness index (TWI), and topographic position index (TPI). Geological layers and fault lines were extracted from the maps published by the Geological Survey of Pakistan in 1993 and 1982, which were further modified from [44]. Other toponyms, such as catchments, roads, and drainages, were extracted from topographic sheets developed by the Survey of Pakistan (SoP). Due to limited ground weather stations in the area, time-averaged monthly precipitation data product (GPM_3IMERGM) of Integrated MultiSatellite Retrievals for the Global Precipitation Measurement (GPM) mission (IMERG) for the year 2017 was processed to generate the annual precipitation layer.

2.3. Preparation of Landslide Inventory

In prediction and susceptibility modeling, it is typically assumed that an event that occurred in the past can occur again in the future at the same place or other places under similar conditions. Therefore, a landslide inventory is imperative for assessing the link between past events and landslide influencing factors. However, the records of such events (i.e., landslide inventories) are often limited to events that occurred at large scales in accessible areas, lacking the details of small-scale events in inaccessible regions [5,9]. To overcome this limitation of the inventory in the study area, a database (inventory) of past landslides, including rockfall, topples, slides, and debris flow events, was developed by integrating the historic records from Frontier Works Organization (FWO), Geological Survey of Pakistan, and from the databases published in [45,46,47]. Google Earth (GE) images were used to validate geospatial locations and hotspots of these events. Figure 4 represents an example of the landslide event that occurred near the Dasu area, which can be easily interpreted on the GE image shown in Figure 4b.

With an inventory of 200 landslide events, an equal number of hotspots for nonlandslides were also generated for modeling. Considering the geographic locations of the landslide events, the 200 nonlandslide locations were randomly selected, particularly in the areas with dense forest, water bodies, peaks with permanent snow, and settlements in relatively plainer areas (slope angle < 3°), as implemented by [35]. Thus, a total of 400 samples were used to train and validate the landslide susceptibility maps. These samples were divided into training (70%) and validation (30%) data sets using a stratified loop-based random sampling technique. Of the data, 70% (280 samples) were used for training and model fitting and the remaining 30% (120 samples) of the data were used to validate the results of the models. Figure 2 shows the spatial distribution of these training and validation samples.

2.4. Selection of Landslide Influencing Factors (LIFs)

The accuracy of susceptibility maps is highly affected by LIFs. There are no universal criteria for selecting conditional factors due to the complex nature of the geoenvironment in different areas [25]. The selection of these LIFs largely depends on the climatic, geological, and topographical condition of the area; therefore, considering the topography, LCLU, and geological conditions of the study area, a total of 12 LIFs (Table 1, Figure 5) including elevation, slope, aspect, TWI, TPI, distance to drainage, distance to fault, distance to road, NDVI, rainfall, LCLU, and geological layers were prepared. All the influencing factors were resampled at a standard pixel size of 30 m. The details of the chosen influencing factors are discussed in Table 1 and their spatial patterns and distribution are presented in Figure 5.

2.5. Evaluation of Landslide Influencing Factors

The accuracy of the model output strongly depends on the quality of influencing factors, which require prior knowledge of the terrain. Adding a large number of influencing factors into the model does not always ensure the reliability and accuracy of the modeled results and can introduce abnormalities in modeling processes. Therefore, it is necessary to evaluate the incorporated factors before using them for susceptibility modeling. For this purpose, in this study, all 12 selected influencing factors were scanned through the two tests, i.e., information gain ratio (IGR) [52] and multicollinearity [53]. The IGR test, as provided in Equation (1), is a well-known feature selection method in ML, which is used to assess the contribution of factors, whereas the multicollinearity test was performed to evaluate the interassociation (the linear relationship between two or more factors) [54].

IGR (E_{x, a}) = \frac{IG}{IV}

(1)

where information gain is IG and the intrinsic information is IV. Higher values of IGR represent the higher contribution of a particular factor for modeling and vice versa. For the multicollinearity test, variance inflation factor (VIF) and tolerance (TOL) were calculated using Equation (2).

{VIF}_{i} = \frac{1}{1 - R_{i}^{2}} = \frac{1}{TOL} for i = 1, 2, \dots, k

(2)

where

R_{i}^{2}

is the coefficient of determination of the explanatory variables. A VIF value greater than 10 or a TOL value of less than 0.1 is considered to have a serious multicollinearity problem [55]. Figure 6 shows the obtained IGR, TOL, and VIF values of each influencing factor and it is found that none of the factors exhibit a collinearity problem. To calculate VIF and TOL, we used ‘car’ and ‘VIF’ packages, whereas for the IGR test, ‘FSelector’ package was used.

2.6. Machine Learning Algorithms and Susceptibility Modeling

ML has proven to be the most effective approach which incorporates the best use of past events to map and model susceptibility with higher accuracies. In this study, the selected ML models were carefully tuned and trained to generate landslide susceptibility maps of the area with higher accuracy. The processing mechanisms and the framework of these selected models are elaborated in detail in the following subsections.

2.6.1. Random Forest (RF)

RF is an ensemble learning method that works on the principle of generating a large number of random trees to minimize the risk of overfitting [56]. RF does not require rescaling and resampling the data, as this model can handle the missing values very easily by building a DT. A DT, being a binary model, is constructed by dividing input data into an increasing number of homogenous nodes. The number of trees (ntree) and number of variables used to split each node (mtry) are the two main parameters used to tune and train the RF models. Considering the size and type of the sample data, the optimal values of these parameters can be found with the help of different methods [57] to achieve higher performance in modeling. In this study, a grid-based search method [58], a widely accepted approach for parametrization, was utilized for tuning the models. The modeling was performed through the ‘caret’ and ‘randomForest’ packages in R.

2.6.2. Support Vector Machine (SVM)

The SVM is considered as one of the best-performing algorithms and uses different kernels to quantify the similarity of two observations in feature space. Typical examples of the kernels used in SVM are linear (LN), polynomial (PL), radial-based function (RBF), and sigmoid (SIG). The performances of using these kernels highly affect the modeling process and its results [7]. Considering the spread and complexity of the sample data, the RBF kernel was adopted in this study. The RBF kernel in SVM is influenced by hyperplane parameters responsible for the smoothness of the decision boundary in various ways [59]. To increase the efficiency of the SVM, these parameters were first evaluated using 10-fold repeated cross-validation for tuning and training process. We used the ‘caret’ and ‘e1071’ R packages for the SVM modeling.

2.6.3. Maximum Entropy (maxENT)

Maximum entropy has been successfully applied to study species distribution and related research [60]. The Bayesian approach of maxENT modeling analyzes the available data for predicting distribution based on responses from influencing factors without relying on the sample size. Maximum entropy is a statistical learning approach that works on the principle of entropy and attempts to fit the model based on given data. Therefore, it also requires parametrization like other ML algorithms to avoid overfitting problems. The parametrization is done before generating the final susceptibility map. Other than the number of background samples, the parameterization and model tuning mainly involve a tradeoff factor known as the regularization parameter [61]. After several iterations with the help of already defined training and validation datasets, the best fit of the model was found with the regularization parameter of 1.0 and background samples of 8000 in this study. We used the ‘dismo’ package for modeling through the maxENT.

2.6.4. Gradient-boosting Machine (GBM)

This powerful ensemble learning model uses a combination of different multiple trees [28]. In recent years, GBM has proven to be more successful in different domains and is being considered as a leading method for prediction modeling [24,29]. As compared to the RF technique, this boosted tree model builds successive trees in which each tree reduces errors from the previous tree to improve the performance of the model in an iterative manner [62]. Among the different parameters, number of trees, shrinkage, and interaction depth are the most important in this model. Number of trees is required to minimize the loss function in the model, learning rate (shrinkage) is required to control the complexity and contribution of trees in the model, whereas interaction depth defines the number of splits in each tree [63]. To find the best fit of the model, a grid-based search with 10-fold repeated cross-validation approach was adopted to find the optimal values of these parameters used in training and modeling processes. For modeling through GBM, we used the caret’ and ‘gbm’ R packages.

2.6.5. Logistic Regression (LR)

LR is one of the most widely used approaches and has been very famous for providing better results with limited computer resources [57]. This GLM works on the principle of linear regression theory and can be an effective tool to solve probabilistic problems with the help of the logitic function [64]. This logistic function defines a relationship between sample data, their probabilities, and their dependencies on the influencing factors, as provided in Equation (3).

P (x) = \frac{e^{y}}{1 + e^{y}}

(3)

where ‘

P (x)

’ is the occurrence probability and ‘y’ corresponds to the fitting function with the multiple influencing factors, which can be understood with the help of the expression presented in Equation (4).

y = β_{0} + β_{1} x_{1} + β_{2} x_{2} + \dots + β_{n} x_{n}

(4)

where

β_{0}, β_{1},

and

x_{n}

are model intercept, coefficient of regression, and influencing factors, respectively. The LR model is known for its ability to find the optimal fitting functions among training samples and LIFs; therefore, to achieve accuracy through this model, the incorporated LIFs must not attain any collinearity issue between them. To achieve LSM results through LR, we used the ‘caret ‘and ‘nnet’ R packages.

The details of the model tuning and parametrization processes for each ML algorithm are given in the Supplementary Materials (Figures S1–S5).

2.7. Performance Evaluation of Models

Model validation and performance evaluation are the main steps in ML [30]. Assessment using training data reflects the level of model fitness, whereas evaluation using validation data is an unbiased assessment of the model fit [21]. In this study, the modeling results were first evaluated using a receiver operating curve (ROC) followed by validation through five performance evaluation indices, i.e., precision, recall, F1 score, accuracy (ACC), and Matthew’s correlation coefficient (MCC), with the help of the ‘caret’ and ‘ModelMetrics’ packages. The statistical descriptions of these indices are shown in Table 2. All of these indices depend upon the four parameters, i.e., true positive (TP), true negative (TN), false positive (FP), and false negative (FN) [23].

The results obtained from the evaluation measures helped rank the models for their performances. However, there was a chance that one model would perform well in some indices but give a poor performance in others. Therefore, in this study, we incorporated a combined index to quantify the overall performance of the models through the performance overall (POA) [65] by taking the sum of the calculated indices, i.e., accuracy, F1 score, and MCC (Equation (5)).

{Performance}_{overall} (POA) = ACC + MCC + F 1 - score

(5)

The model with the highest magnitude of POA represented the best model. Moreover, a decreasing trend in the POA values corresponded to the performance ranking of the models for LSM.

3. Results

This study presents a comparative assessment of five ML models for LSM in the rugged terrain of northern Pakistan. In order to gather the objectives of this study, the contribution and importance of the LIFs to the modeling were first assessed, providing modeling results and an evaluation of their performance, comparison, and accuracy based on the conventional assessment approach in the machine learning framework.

3.1. Importance and Contribution of Landslide Influencing Factors (LIFs)

The independent importance of an influencing factor is one of the most significant analyses in ML. The role of the influencing factors and their contribution to modeling plays a vital role in achieving reliable results. In this study, the factor importance in regard to all of the five models was first assessed using the varImp function of the ‘caret’ package. This function calculates and documents the mean decrease (scaled) of accuracy in ‘class-specific’ manners. The details of the contribution of these factors in each model are shown in Figure 7a. It can be observed in Figure 7a that during modeling through the RF and GBM approaches, LCLU was the most significant contributing factor. On the other hand, elevation took the lead in the SVM and LR models, and aspect was the most significant contributing factor in the maxENT model. The varying nature of the factor contribution in the different models reflects the working behavior of these models. Therefore, keeping in view the contribution of these factors, each model provided different results in terms of prediction (Figure 8).

However, considering this inconsistency of the factors contributing to each model, the need to assess the relative importance and overall contribution of these incorporated factors was important. Therefore, in this study, we also incorporated a technique presented by Breiman et al. [66], which works on the principle of RF to assess the importance of factors using MeanDecreaseAccuracy and MeanDecreaseGini in the ‘caret’ and ‘randomForest’ libraries in R. MeanDecreaseAccuracy rescales the class-specific importance scores using ‘standard error’, employing their mean. On the other hand, MeanDecreaseGini works on the principle of splitting a variable through the average gain, which works on a local decision function to select the best split. This importance analysis counts the number of observations against each factor in the data. The importance of each factor calculated through this approach is shown in Figure 7b. The results showed that LCLU, NDVI, elevation, slope and aspect, distance to drainage, distance to road, and distance to faults exhibited high importance for landslide susceptibility, compared with the other predictors such as rainfall, geology, TPI, and TWI in the area. This ranking and importance contribution analysis agrees with the performances of the models in terms of their accuracies, as described in the performance evaluation section.

3.2. Landslide Susceptibility Maps (LSMs)

The prediction results of the modeling are presented in Figure 8, which highlights the higher vulnerability of the study area for landslide susceptibility. For interpretation and analysis purposes, the value ranges of the modeled landslide susceptibility maps (0 to 1) were classified into five different classes. The division of the ranges was purely based on Jenke’s natural break approach [67] and has been adopted by a number of researchers in the field [35]. The value ranges for each of these five classes in each model is shown in Figure 8. The class with the lowest susceptibility range was classified as conditionally stable (very low susceptibility), whereas the other ranges were classified as low-, medium-, high-, and very-high-susceptibility classes in the map [15]. The selected LIFs were then analyzed for their contribution to the modeling to identify the major causative factor for this hazard, leading to a comprehensive discussion of the model performances, comparisons, and validations.

3.3. Performance Evaluation of Models and Analysis of the Results

The most important factor involved in the evaluation of the models was the analysis based on the area under the curve (AUC), which represents a measure of the separability with values ranges from 0 (lower) to 1 (higher) [68]. Higher values of AUC reflect the highest performance. A model is considered reliable if the AUC values are higher than 0.7 [69]. The AUC values of these models calculated using validation data are shown in Figure 9.

All the investigated ML models produced results with AUC values above 83%, which highlights the reliability of these models in providing good results in the study area. However, the variations in AUC values among all the models was relatively high, with AUCs of 0.969 in SVM, 0.967 in RF, 0.872 in maxENT, 0.967 in GBM, and 0.836 in LR (Figure 9). Keeping in view the assessment and validation of the models’ output, it is not recommended to investigate using only one parameter (i.e., AUC) for ML approaches, as this may not reflect the true prediction ability of models with varying and closer values. Therefore, the model results were further scanned through the other evaluation procedures based on the calculation of indices, and the results of these indices-based analyses are provided in Figure 10a. Figure 10a shows that across all five ML algorithms, the results of RF and SVM were found to be consistent in terms of providing higher performance accuracies than the rest of the models. The achieved values of these indices for ACC, precision, recall, F1 score, and MCC in RF were 0.87, 0.75, 0.98, 0.85, and 2655, respectively, whereas in SVM these values were 0.87, 0.77, 0.96, 0.85, and 2667.99, respectively (Figure 10a). However, in addition to SVM and RF as the top two models, GBM was also one of the leading models, providing results with higher accuracies, and it can be seen in Figure 10a that the difference between the performances of this model and the performances of SVM and RF was almost negligible, with ACC 0.86, precision 0.77, recall 0.94, F1 score 0.84, and MCC 2621.99. Therefore, it can be stated that no significant performance difference was found between these three models (RF, SVM, and GBM). LR and maxENT provided an averaged result with an ACC of above 0.65; a recall value of above 0.86; and the lowest values in precision, F1 score, and MCC (Figure 10a). To evaluate the best performer, the POA was calculated, which indicated that the SVM (POA = 2669) was the best model and placed RF (POA = 2656) and GBM (POA = 2623) second and third, respectively (Figure 10b). The models in this study were trained with the best possible configurations; therefore, the performance rankings of some of the models in this study followed the results of previous research [34]. However, the importance of the GBM model for LSM cannot be neglected, as it stands with SVM and RF in terms of the evaluation measures.

4. Discussions

LSM provides a basis for risk management and planning; therefore, highly accurate LSM is required for effective mitigation measures. Due to the complex nature of this hazard, it is quite challenging to produce high-quality and accurate maps at a regional scale in rugged mountains. In comparison to the conventional approaches, ML methods are efficient in solving geotechnical problems related to landslides. With a number of different modeling algorithms, the prediction results obtained by applying a single algorithm for mapping and modeling the susceptibility cannot be reliable, as most of the ML algorithms provide different performances in different areas at different scales, which further depends on the topography, climate, LCLU, and other natural and anthropogenic factors. Therefore, it is important to investigate, understand, and evaluate the differences in the outputs of various ML models to select and identify an optimal model [70]. Highlighting the various issues with the LSM during the modeling processes, Lee et al. [10] suggested putting a great effort into the selection and evaluation of the influencing factors. Therefore, in the present research study, as the very first stage, the twelve different landslide influencing factors selected (Figure 5a–l) were scanned through two different tests based on IGR and multicollinearity prior to modeling the susceptibility. Then, the contribution analysis was performed to assess the relative importance of the LIFs, ensuring the models were carefully tuned and trained by finding the optimal corresponding parameters in five different ML models: RF, SVM, maxENT, GBM, and LR. After this, a comparative assessment of the obtained modeled results was performed by calculating the AUC and confusion matrix, and they were ranked on the basis of their performances by calculating the POA.

4.1. Evaluation of LIFs and Modeling Criteria

The results of the IGR test indicated that with an IGR > 0, all of the selected LIFs were important and would not introduce any problem in the modeling process (Figure 6). The results of the multicollinearity test showed that none of the LIFs had a collinearity problem (Figure 6). In addition to the other factors affecting the accuracy of the models discussed above, the model results were also influenced by the training data in terms of the number of samples, which played a vital role in tuning the models. As a rule of thumb, Jain et al. [71] found that a minimum of 10 × d × C samples are required to tune models with ‘C’ classes in d-dimensional problems. Therefore, in this study, the number of training samples used (280) was above the minimum requirement, and these were further used to tune and train the models to achieve training accuracies of 0.866, 0.854, 0.760, 0.892, and 0.707 for RF, SVM, maxENT, GBM, and LR, respectively.

The modeling results of RF, SVM, maxENT, GBM, and LR, obtained using training data, were classified into five different classes. The spatial distributions of the landslide susceptibility classes shown in Figure 8 present the varying nature of the results in different models. Therefore, to check the similarities in the prediction results of the models in the area, a cross-correlation between the model outputs using validation data was performed, and it was found that RF–SVM, RF–GBM, and SVM–GBM had the most-correlated models in this study, with correlation values of 0.993, 0.981, and 0.985, respectively (Figure 11a). This verifies that the three models, RF, SVM, and GBM, performed very well, with approximately similar accuracy in the area.

4.2. Performances of Susceptibility Modeling

Figure 8 and Figure 12a–e show that most of the predicted landslide zones follow a drainage network in the region and correspond to the high and very-high susceptibility classes along with this network. Aspect-based analysis conducted using SVM results showed that these high and very-high classes are mostly attributed to the slopes facing NE to WSW through E and S. These aspects are considered as warmer aspects in the region and can be correlated with climatic conditions. This distribution of the susceptible classes modeled using the SVM approach with reference to slope aspect can be seen in Figure 11b. Similar results of the landslide abundance along such aspects were also produced by Koukis et al. [72] and Tsangaratos and Ilia [14]. The variations in climatic conditions along these aspects can be attributed to some other factors [73]. For example, these aspects mainly constitute relatively mid-level values of NDVI with sparse vegetation and experience a large and intense portion of direct rainfall, especially on the ESE–SW direction. The other slopes along the W to NE through the NW, normally known as cooler aspects, mainly constitute dense forest and snow at relatively higher altitudes. As shown in Figure 7b, the contribution of LCLU, NDVI, slope aspect, and elevation plays a vital role in the abundance of such a hazard; therefore, the prediction results of susceptibility and aspect-based analysis coincide mainly with the existence of these hazards. As the area is comprised of a diverse topography, with sudden changes in terrain, slopes, and aspect angles at some places (southern and northern zones), the contribution of other factors in the modeling cannot be neglected in this region. With this diverse topography, it can be highlighted that landslides in the area are mostly a cumulative effect of all the above mentioned factors.

The modeled results showed a very good agreement with validation data during the AUC analysis. In this study, the performance analysis of the modeled results showed that all five models were reliable in achieving good results in the area, with AUC values above 0.83. The better performances of RF, SVM, and GBM were observed in the scanning results of the models using indices-based evaluation measures (Figure 10a,b). Wang et al. [65] concluded that if the difference between the accuracies of the model using training and validation data is higher, then there is a possibility of overfitting the models during the tuning and training process. In this study, we used validation data to acquire ACC values of 0.87, 0.87, 0.73, 0.86, and 0.68 for the RF, SVM, maxENT, GBM, and LR models, respectively, which corresponds to minimal differences between the accuracies obtained using the training data; therefore, the results obtained in this study showed fine-tuning, training, and modeling with higher reliability. From POA values of 2656 in RF, 2669 in SVM, 2623 in GBM, 1761 in maxent, and 1299 in LR (Figure 10b), we concluded the higher performance of the SVM model in this study followed by RF, GBM, maxENT, and LR.

SVM models are considered to provide the best solution to a nonlinear process such as landslides, which makes this model best-suited to correctly predict the probability of landslide events by separating them with the help of a hyperplane [7]. Brenning [8] highlighted the ability of the SVM model to produce smooth prediction surfaces when compared with the tree-based models, with the presence of spatial artefacts in his results. Comparing results generated from LR and SVM, Yao et al. [74] found that SVM produced a higher prediction accuracy. Similar findings of outperformance and the production of smoothed results in the SVM model have been reported by Goetz et al. [75] and Yilmaz [76] when performing a comparison with NN, DT, and LR models. Similar to the findings of the SVM model’s outperformance in this study, Qing et al. [34] also reported that SVM performed the best among several other ML approaches for debris flow susceptibility along with the China–Pakistan Karakoram highway. However, the effectiveness of the RF model over SVM in different studies has also been discussed in [14,34]. For example, Youssef and Pourghasemi [77] compared SVM and RF with several other machine learning techniques (MLTs) in Saudi Arabia and found that the RF model was the best performer in the region for LSM. Similar to this finding, Tsangaratos and Ilia [14] observed a slight difference between the prediction rates of RF and SVM when applied to the north-west (NW) Peloponnese, Greece, and concluded the better performance of the RF model compared to SVM. Similar results with a minute difference in both of these models are observed by Wang et al. [33] in Shexian County, China. With the varying nature of the accuracy and performance of SVM and RF in different studies, we infer that comparative to other MLTs, the performance results of SVM and RF are quite close, with minimal differences (Figure 9 and Figure 10), and that both of these models can be considered as equivalent in providing reliable results in the area. The similarity in AUC values (0.967) of RF and GBM was due to the ensemble natures of these models, with decent interpretability. This finding is in agreement with the results from recent studies [24,78]. Considering the adhering similarity in prediction abilities, normally GBM offers a relatively higher number of sensitive parametrizations than RF, which makes it difficult to implement in a larger area [32]. Therefore, with the slight difference in POA values (about 1.24%) in this study between these two models, RF was found to be more reliable than GBM. Although maxENT has been successfully applied to species distribution modeling, a comparison of its performance with other ML models has not been fully investigated, particularly for LSM. However, some efforts have been made to compare the performance of this model [11,79,80,81]. For example, Park [11] compared the performance of the maxENT with the LR model by producing landslide susceptibility maps of Boeun, Korea and found the effectiveness of maxENT over LR. Similar to the findings of Park [11], Felicísimo et al. [81] found that this model performed better than LR in Deba Valley, Northern Spain. In contrast to the other outperforming models, LR yields the lowest accuracy, and our findings regarding the relatively poor performance of LR are also supported by other related studies [15,32].

4.3. Realtime Validation of Modeled Results

Some recent landslide events in the area also helped to validate the results of modeled landslide susceptibility. For example, eight landslide events occurred along Mansehra–Naran–Jalkhad (MNJ) Road on 26 July 2019, resulting in the destruction of several houses and a road blockage for days, with some causalities [82]. Another landslide event occurred on 15 July 2019 along the Muzaffarabad–Abbottabad road, resulting in the blockage of the road for two days. Figure 12f shows the Google Earth image of the landslide section (black polygon) along MNJ road near Jared town. The modeled outputs along this region from all five models are shown in Figure 12a–e, and it can be seen that this landslide section has been modeled and categorized as mostly in the high- and very-high-susceptibility classes in all the investigated models.

5. Conclusions

Landslides, being the most critical natural hazard, are a very serious threat to human life, infrastructure, and natural resources across the globe. The identification of an accurate and stable ML model for predicting landslide susceptibility is often a very difficult and demanding task, especially when applied to a region with rugged terrain. In this study, we implemented five state-of-the-art ML algorithms, i.e., RF, SVM, maxENT, GBM, and LR, to investigate their performances for LSM in part of northern Pakistan. The study indicated that:

The 12 selected influencing factors (i.e., elevation, slope, aspect, TWI, TPI, distance to drainage, distance to fault, distance to road, NDVI, rainfall, LCLU, and geological layer) were useful when scanned through the IGR test, and none of the LIFs had a multicollinearity problem.
The extracted 70% of the sample data used for the tuning and training of the models showed a good agreement with the tuning parameters, which helped in achieving higher accuracies during the training process.
Apart from predicting large-scale landslides, a visual analysis (Figure 12) of modeled susceptibility maps indicated that all five models were also able to predict very-small-scale landslides, which can provide a better understanding to identify landslide risks in the region.
The descriptive analysis of the LIF contribution showed that the area lying in the elevation range of 505 to 3895 m, with NE–WSW facing slopes, near the drainage networks, with LCLU other than snow, dense vegetation, cultivated land, and water bodies could be attributed directly to the slope failure manifestation in the study area, which can experience a relatively higher chance of landslide occurrences in the future.
During the validation process, using 30% of the samples, the results showed that SVM (AUC = 0.969, POA = 2669), RF (AUC = 0.967, POA = 2656), and GBM (AUC = 0.967, POA = 2623) were the top three performers, leaving behind maxENT (AUC = 0.872, POA = 1761) and LR (AUC = 0.836, POA = 1299). The validation through the calculation of other parameters (precision, recall, ACC, F1 score, and MCC) showed a similar trend in terms of the performance sequence of the models.
Exhibiting minimal differences in terms of all the evaluated parameters with SVM and RF, the study also found the better performance of the GBM model for LSM in the region.
SVM > RF > GBM > maxENT > LR represents the overall performance ranking with decreasing POA values.
Conclusively, these modeled results can be helpful in understanding and assessing the scope of landslide problems in this area and can also provide help in risk reduction and mitigation measures in this region as well as in other parts of the globe with similar topographical, climatic, and geological conditions.

Supplementary Materials

The following are available online at https://www.mdpi.com/article/10.3390/app12052280/s1, Figure S1: Changing trends in mtry against ntree = 800 in random forest, Figure S2: Optimization of cost and sigma values for SVM, Figure S3: Changing trends in parametric values of GBM, Figure S4: Changing trends in parametric values of maxENT for different feature types, Figure S5: Changing trends in parametric values of decay for LR model.

Author Contributions

All authors contributed to the manuscript and discussed the results. This research was supervised by X.D.; conceptualization, N.S. and X.D.; data curation and methodology, N.S.; formal analysis, N.S. and S.A.; funding acquisition, X.D. All authors have read and agreed to the published version of the manuscript.

Funding

The first author was supported by a Hong Kong Ph.D. Fellowship of the Research Grants Council (RGC) of Hong Kong. The work was supported by the RGC (Grants Numbers: PolyU152164/18E and PolyU152233/19E); the Research Institute for Sustainable Urban Development (RISUD); and the Innovation and Technology Fund.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data and other related scripts that support the findings of this study are available on demand from the corresponding author.

Acknowledgments

The authors wish to acknowledge the Research Grants Council (RGC) of Hong Kong, the Research Institute for Sustainable Urban Development (RISUD) of the Hong Kong Polytechnic University, and the Innovation and Technology Fund (ITF) for their support in this research. We greatly appreciate the editor and the reviewers for their constructive suggestions and insightful comments that helped us greatly to improve this manuscript.

Conflicts of Interest

The authors declare no conflict of interest.

References

Corominas, J.; van Westen, C.; Frattini, P.; Cascini, L.; Malet, J.P.; Fotopoulou, S.; Catani, F.; Van Den Eeckhaut, M.; Mavrouli, O.; Agliardi, F.; et al. Recommendations for the quantitative analysis of landslide risk. Bull. Eng. Geol. Environ. 2014, 73, 209–263. [Google Scholar] [CrossRef]
Mondini, A.C.; Guzzetti, F.; Chang, K.-T.; Monserrat, O.; Martha, T.R.; Manconi, A. Landslide failures detection and mapping using Synthetic Aperture Radar: Past, present and future. Earth Sci. Rev. 2021, 216, 103574. [Google Scholar] [CrossRef]
Segoni, S.; Tofani, V.; Rosi, A.; Catani, F.; Casagli, N. Combination of Rainfall Thresholds and Susceptibility Maps for Dynamic Landslide Hazard Assessment at Regional Scale. Front. Earth Sci. 2018, 6, 85. [Google Scholar] [CrossRef] [Green Version]
Brabb, E.E. Proposal for Worldwide Landslide Hazard Maps. In Proceedings of the Seventh International Conference and Field Workshop on Landslides in Czech and Slovak Republics, Czech Republic and Slovakia, 28 August–15 September 1993; Rotterdam: Brookfield, VT, USA, 1993; pp. 15–27. [Google Scholar]
Guzzetti, F.; Mondini, A.C.; Cardinali, M.; Fiorucci, F.; Santangelo, M.; Chang, K.-T. Landslide inventory maps: New tools for an old problem. Earth-Sci. Rev. 2012, 112, 42–66. [Google Scholar] [CrossRef] [Green Version]
Ma, Z.; Mei, G.; Piccialli, F. Machine learning for landslides prevention: A survey. Neural Comput. Appl. 2020, 33, 10881–10907. [Google Scholar] [CrossRef]
Ballabio, C.; Sterlacchini, S. Support Vector Machines for Landslide Susceptibility Mapping: The Staffora River Basin Case Study, Italy. Math. Geosci. 2012, 44, 47–70. [Google Scholar] [CrossRef]
Brenning, A. Spatial prediction models for landslide hazards: Review, comparison and evaluation. Nat. Hazards Earth Syst. Sci. 2005, 5, 853–862. [Google Scholar] [CrossRef]
Chen, S.; Miao, Z.; Wu, L.; He, Y. Application of an Incomplete Landslide Inventory and One Class Classifier to Earthquake-Induced Landslide Susceptibility Mapping. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2020, 13, 1649–1660. [Google Scholar] [CrossRef]
Lee, S.; Hong, S.-M.; Jung, H.-S. A Support Vector Machine for Landslide Susceptibility Mapping in Gangwon Province, Korea. Sustainability 2017, 9, 48. [Google Scholar] [CrossRef] [Green Version]
Park, N.W. Using maximum entropy modeling for landslide susceptibility mapping with multiple geoenvironmental data sets. Environ. Earth Sci. 2015, 73, 937–949. [Google Scholar] [CrossRef]
Fell, R.; Corominas, J.; Bonnard, C.; Cascini, L.; Leroi, E.; Savage, W.Z. Guidelines for landslide susceptibility, hazard and risk zoning for land use planning. Eng. Geol. 2008, 102, 85–98. [Google Scholar] [CrossRef] [Green Version]
Xue, L.; Xiaoli, L.; Jinggang, L.; Qiuliang, W.; Wulin, L.; Lifen, Z. Factor analysis of earthquake-induced geological disasters of the M7. 0 Lushan earthquake in China. Geod. Geodyn. 2013, 4, 22–29. [Google Scholar] [CrossRef] [Green Version]
Tsangaratos, P.; Ilia, I. Chapter 24-Applying Machine Learning Algorithms in Landslide Susceptibility Assessments. In Handbook of Neural Computation; Samui, P., Sekhar, S., Balas, V.E., Eds.; Academic Press: Cambridge, MA, USA, 2017; pp. 433–457. ISBN 978-0-12-811318-9. [Google Scholar]
Kavzoglu, T.; Colkesen, I.; Sahin, E.K. Machine Learning Techniques in Landslide Susceptibility Mapping: A Survey and a Case Study. In Landslides: Theory, Practice and Modelling; Advances in Natural and Technological Hazards Research 50; Pradhan, S.P., Vishal, V., Singh, T.N., Eds.; Springer International Publishing: Berlin/Heidelberg, Germany, 2019; pp. 283–301. ISBN 9783319773766. [Google Scholar]
Iovine, G.; Cohen, D. Advanced methods in landslide modelling. Nat. Hazards 2014, 73, 1–4. [Google Scholar] [CrossRef] [Green Version]
Choi, K.Y.; Cheung, R.W.M. Landslide disaster prevention and mitigation through works in Hong Kong. J. Rock Mech. Geotech. Eng. 2013, 5, 354–365. [Google Scholar] [CrossRef] [Green Version]
Pourghasemi, H.R.; Teimoori Yansari, Z.; Panagos, P.; Pradhan, B. Analysis and evaluation of landslide susceptibility: A review on articles published during 2005–2016 (periods of 2005–2012 and 2013–2016). Arab. J. Geosci. 2018, 11, 193. [Google Scholar] [CrossRef]
Guzzetti, F.; Carrara, A.; Cardinali, M.; Reichenbach, P. Landslide hazard evaluation: A review of current techniques and their application in a multi-scale study, Central Italy. Geomorphology 1999, 31, 181–216. [Google Scholar] [CrossRef]
Lombardo, L.; Mai, P.M. Presenting logistic regression-based landslide susceptibility results. Eng. Geol. 2018, 244, 14–24. [Google Scholar] [CrossRef]
Tien Bui, D.; Tuan, T.A.; Klempe, H.; Pradhan, B.; Revhaug, I. Spatial prediction models for shallow landslide hazards: A comparative assessment of the efficacy of support vector machines, artificial neural networks, kernel logistic regression, and logistic model tree. Landslides 2016, 13, 361–378. [Google Scholar] [CrossRef]
Pradhan, B. A comparative study on the predictive ability of the decision tree, support vector machine and neuro-fuzzy models in landslide susceptibility mapping using GIS. Comput. Geosci. 2013, 51, 350–365. [Google Scholar] [CrossRef]
Dou, J.; Yunus, A.P.; Tien Bui, D.; Merghadi, A.; Sahana, M.; Zhu, Z.; Chen, C.-W.; Khosravi, K.; Yang, Y.; Pham, B.T. Assessment of advanced random forest and decision tree algorithms for modeling rainfall-induced landslide susceptibility in the Izu-Oshima Volcanic Island, Japan. Sci. Total Environ. 2019, 662, 332–346. [Google Scholar] [CrossRef]
Sahin, E.K. Assessing the predictive capability of ensemble tree methods for landslide susceptibility mapping using XGBoost, gradient boosting machine, and random forest. SN Appl. Sci. 2020, 2, 1308. [Google Scholar] [CrossRef]
Hong, H.; Pourghasemi, H.R.; Pourtaghi, Z.S. Landslide susceptibility assessment in Lianhua County (China): A comparison between a random forest data mining technique and bivariate and multivariate statistical models. Geomorphology 2016, 259, 105–118. [Google Scholar] [CrossRef]
Kosicki, J.Z. Should topographic metrics be considered when predicting species density of birds on a large geographical scale? A case of Random Forest approach. Ecol. Modell. 2017, 349, 76–85. [Google Scholar] [CrossRef]
Kosicki, J.Z. Generalised Additive Models and Random Forest Approach as effective methods for predictive species density and functional species richness. Environ. Ecol. Stat. 2020, 27, 273–292. [Google Scholar] [CrossRef]
Friedman, J.H. Greedy function approximation: A gradient boosting machine. Ann. Stat. 2001, 29, 1189–1232. [Google Scholar] [CrossRef]
Merghadi, A.; Yunus, A.P.; Dou, J.; Whiteley, J.; ThaiPham, B.; Bui, D.T.; Avtar, R.; Abderrahmane, B. Machine learning methods for landslide susceptibility studies: A comparative overview of algorithm performance. Earth-Sci. Rev. 2020, 207, 103225. [Google Scholar] [CrossRef]
Arabameri, A.; Saha, S.; Roy, J.; Chen, W.; Blaschke, T.; Bui, D.T. Landslide susceptibility evaluation and management using different machine learning methods in the Gallicash River Watershed, Iran. Remote Sens. 2020, 12, 475. [Google Scholar] [CrossRef] [Green Version]
Xing, Y.; Yue, J.; Guo, Z.; Chen, Y.; Hu, J.; Travé, A. Large-Scale Landslide Susceptibility Mapping Using an Integrated Machine Learning Model: A Case Study in the Lvliang Mountains of China. Front. Earth Sci. 2021, 9, 622. [Google Scholar] [CrossRef]
Merghadi, A.; Abderrahmane, B.; Tien Bui, D. Landslide susceptibility assessment at Mila basin (Algeria): A comparative assessment of prediction capability of advanced machine learning methods. ISPRS Int. J. Geo-Inf. 2018, 7, 268. [Google Scholar] [CrossRef] [Green Version]
Wang, Z.; Liu, Q.; Liu, Y. Mapping landslide susceptibility using machine learning algorithms and GIS: A case study in Shexian county, Anhui province, China. Symmetry 2020, 12, 1954. [Google Scholar] [CrossRef]
Qing, F.; Zhao, Y.; Meng, X.; Su, X.; Qi, T.; Yue, D. Application of machine learning to debris flow susceptibility mapping along the China-Pakistan Karakoram Highway. Remote Sens. 2020, 12, 2933. [Google Scholar] [CrossRef]
Ali, S.A.; Parvin, F.; Vojteková, J.; Costache, R.; Linh, N.T.T.; Pham, Q.B.; Vojtek, M.; Gigović, L.; Ahmad, A.; Ghorbani, M.A. GIS-based landslide susceptibility modeling: A comparison between fuzzy multi-criteria and machine learning algorithms. Geosci. Front. 2021, 12, 857–876. [Google Scholar] [CrossRef]
Wolpert, D.H. The Lack of a Priori Distinctions between Learning Algorithms. Neural Comput. 1996, 8, 1341–1390. [Google Scholar] [CrossRef]
Ray, R.L. Remote Sensing Approaches and Related Techniques to Map and Study Landslides; Lazzari, M., Ed.; IntechOpen: Rijeka, Croatia, 2020; Chapter 2; ISBN 9781789858242. [Google Scholar]
Tariq, S.; Gomes, C. Landslide Environment in Pakistan after the Earthquake-2005: Information Revisited to Develop Safety Guidelines for Minimizing Future Impacts. J. Geogr. Nat. Disasters 2017, 7, 1–11. [Google Scholar] [CrossRef]
Rafiq, L.; Blaschke, T. Disaster risk and vulnerability in Pakistan at a district level. Geomat. Nat. Hazards Risk 2012, 3, 324–341. [Google Scholar] [CrossRef]
International Federation of Red Cross and Crescent Societies. World Disasters Report; IFRC: Geneva, Switzerland, 2003. [Google Scholar]
Ahmed, M.F.; Rogers, J.D.; Ismail, E.H. A regional level preliminary landslide susceptibility study of the upper Indus river basin. Eur. J. Remote Sens. 2014, 47, 343–373. [Google Scholar] [CrossRef] [Green Version]
Ali, S.; Haider, R.; Abbas, W.; Basharat, M.; Reicherter, K. Empirical assessment of rockfall and debris flow risk along the Karakoram Highway, Pakistan. Nat. Hazards 2021, 106, 2437–2460. [Google Scholar] [CrossRef]
Faisal, S.; Larson, K.P.; King, J.; Cottle, J.M. Rifting, subduction and collisional records from pluton petrogenesis and geochronology in the Hindu Kush, NW Pakistan. Gondwana Res. 2016, 35, 286–304. [Google Scholar] [CrossRef] [Green Version]
Pêcher, A.; Seeber, L.; Guillot, S.; Jouanne, F.; Kausar, A.; Latif, M.; Majid, A.; Mahéo, G.; Mugnier, J.L.; Rolland, Y.; et al. Stress field evolution in the northwest Himalayan syntaxis, northern Pakistan. Tectonics 2008, 27. [Google Scholar] [CrossRef]
Froude, M.J.; Petley, D.N. Global fatal landslide occurrence from 2004 to 2016. Nat. Hazards Earth Syst. Sci. 2018, 18, 2161–2181. [Google Scholar] [CrossRef] [Green Version]
Kirschbaum, D. Global Catalog of Rainfall-Triggered Landslides for Spatial and Temporal Hazard Characterization. In Landslide Science for a Safer Geoenvironment; Sassa, K., Canuti, P., Yin, Y., Eds.; Springer International Publishing: Cham, Switzerland, 2014; pp. 809–814. ISBN 9783319050508. [Google Scholar]
Kirschbaum, D.; Stanley, T.; Zhou, Y. Spatial and temporal analysis of a global landslide catalog. Geomorphology 2015, 249, 4–15. [Google Scholar] [CrossRef]
Ågren, A.M.; Lidberg, W.; Strömgren, M.; Ogilvie, J.; Arp, P.A. Evaluating digital terrain indices for soil wetness mapping-a Swedish case study. Hydrol. Earth Syst. Sci. 2014, 18, 3623–3634. [Google Scholar] [CrossRef] [Green Version]
Encyclopedia of GIS; Shekhar, S.; Xiong, H.; Zhou, X. (Eds.) Springer International Publishing: Cham, Switzerland, 2017; ISBN 9783319178844. [Google Scholar]
Rouse, J.W.; Haas, R.H.; Schell, J.A.; Deeering, D. Monitoring vegetation systems in the Great Plains with ERTS (Earth Resources Technology Satellite). In Proceedings of the Third Earth Resources Technology Satellite-1, Washington, DC, USA, 10–14 December 1973. [Google Scholar]
Blaschke, T. Object based image analysis for remote sensing. ISPRS J. Photogramm. Remote Sens. 2010, 65, 2–16. [Google Scholar] [CrossRef] [Green Version]
Quinlan, J.R. Induction of decision trees. Mach. Learn. 1986, 1, 81–106. [Google Scholar] [CrossRef] [Green Version]
Menard, S.W. Applied Logistic Regression Analysis; Sage Publications: Thousand Oaks, CA, USA, 1995; ISBN 0803957572. [Google Scholar]
Alin, A. Multicollinearity. Wiley Interdiscip. Rev. Comput. Stat. 2010, 2, 370–374. [Google Scholar] [CrossRef]
Haitovsky, Y. Multicollinearity in Regression Analysis Comment. Rev. Econ. Stat. 1969, 51, 465. [Google Scholar] [CrossRef]
Breiman, L. Random forests. Mach. Learn. 2001, 45, 5–32. [Google Scholar] [CrossRef] [Green Version]
James, G.; Witten, D.; Hastie, T.; Tibshirani, R. An Introduction to Statistical Learning, Springer Texts; Springers: New York, NY, USA, 2013; ISBN 9780387781884. [Google Scholar]
Kavzoglu, T.; Colkesen, I. A kernel functions analysis for support vector machines for land cover classification. Int. J. Appl. Earth Obs. Geoinf. 2009, 11, 352–359. [Google Scholar] [CrossRef]
Xu, C.; Dai, F.; Xu, X.; Lee, Y.H. GIS-based support vector machine modeling of earthquake-triggered landslide susceptibility in the Jianjiang River watershed, China. Geomorphology 2012, 145–146, 70–80. [Google Scholar] [CrossRef]
Phillips, S.J.; Anderson, R.P.; Schapire, R.E. Maximum entropy modeling of species geographic distributions. Ecol. Modell. 2006, 190, 231–259. [Google Scholar] [CrossRef] [Green Version]
Elith, J.; Phillips, S.J.; Hastie, T.; Dudík, M.; Chee, Y.E.; Yates, C.J. A statistical explanation of MaxEnt for ecologists. Divers. Distrib. 2011, 17, 43–57. [Google Scholar] [CrossRef]
Micheletti, N.; Foresti, L.; Robert, S.; Leuenberger, M.; Pedrazzini, A.; Jaboyedoff, M.; Kanevski, M. Machine Learning Feature Selection Methods for Landslide Susceptibility Mapping. Math. Geosci. 2014, 46, 33–57. [Google Scholar] [CrossRef] [Green Version]
França, S.; Cabral, H.N. Predicting fish species richness in estuaries: Which modelling technique to use? Environ. Model. Softw. 2015, 66, 17–26. [Google Scholar] [CrossRef]
McCullagh, P.; Nelder, J.A. Generalized Linear Models (Monographs on Statistics and Applied Probability), 2nd ed.; Chapman and Hall/CRC: Boca Raton, FL, USA, 1989; ISBN 0412317605. [Google Scholar]
Wang, H.; Zhang, L.; Yin, K.; Luo, H.; Li, J. Landslide identification using machine learning. Geosci. Front. 2020. [Google Scholar] [CrossRef]
Breiman, L.; Friedman, J.H.; Olshen, R.A.; Stone, C.J. Classification and Regression Trees; Chapman and Hall/CRC: Boca Raton, FL, USA, 1984; ISBN 9781351460491. [Google Scholar]
Jenks, G.F. The Data Model Concept in Statistical Mapping. Int. Yearb. Cartogr. 1967, 7, 186–190. [Google Scholar]
Flach, P. ROC Analysis. In Encyclopedia of Machine Learning; Springer: Berlin/Heidelberg, Germany, 2011; pp. 869–875. [Google Scholar]
Swets, J.A. Measuring the accuracy of diagnostic systems. Science 1988, 240, 1285–1293. [Google Scholar] [CrossRef] [Green Version]
Brenning, A. Statistical geocomputing combining R and SAGA: The example of landslide susceptibility analysis with generalized additive models. Hambg. Beiträge Zur Phys. Geogr. Und Landsch. 2008, 19, 410. [Google Scholar]
Jain, A.K.; Duin, R.P.W.; Mao, J. Statistical pattern recognition: A review. IEEE Trans. Pattern Anal. Mach. Intell. 2000. [Google Scholar] [CrossRef] [Green Version]
Koukis, G.; Rozos, D.; Hadzinakos, I. Relationship between rainfall and landslides in the formations of Achaia County, Greece. In Proceedings of the International Symposium on Engineering Geology and the Environment, Athens, Greece, 23–27 June 1997. [Google Scholar]
Dai, F.C.; Lee, C.F.; Ngai, Y.Y. Landslide risk assessment and management: An overview. Eng. Geol. 2002, 64, 65–87. [Google Scholar] [CrossRef]
Yao, X.; Tham, L.G.; Dai, F.C. Landslide susceptibility mapping based on Support Vector Machine: A case study on natural slopes of Hong Kong, China. Geomorphology 2008, 101, 572–582. [Google Scholar] [CrossRef]
Goetz, J.N.; Brenning, A.; Petschko, H.; Leopold, P. Evaluating machine learning and statistical prediction techniques for landslide susceptibility modeling. Comput. Geosci. 2015, 81, 1–11. [Google Scholar] [CrossRef]
Yilmaz, I. Comparison of landslide susceptibility mapping methodologies for Koyulhisar, Turkey: Conditional probability, logistic regression, artificial neural networks, and support vector machine. Environ. Earth Sci. 2010, 61, 821–836. [Google Scholar] [CrossRef]
Youssef, A.M.; Pourghasemi, H.R. Landslide susceptibility mapping using machine learning algorithms and comparison of their performance at Abha Basin, Asir Region, Saudi Arabia. Geosci. Front. 2021, 12, 639–655. [Google Scholar] [CrossRef]
Kalantar, B.; Ueda, N.; Saeidi, V.; Ahmadi, K.; Halin, A.A.; Shabani, F. Landslide susceptibility mapping: Machine and ensemble learning based on remote sensing big data. Remote Sens. 2020, 12, 1737. [Google Scholar] [CrossRef]
Raso, E.; Di Martire, D.; Cevasco, A.; Calcaterra, D.; Scarpellini, P.; Firpo, M. Evaluation of Prediction Capability of the MaxEnt and Frequency Ratio Methods for Landslide Susceptibility in the Vernazza Catchment (Cinque Terre, Italy). In Applied Geology: Approaches to Future Resource Management; De Maio, M., Tiwari, A.K., Eds.; Springer International Publishing: Cham, Switzerland, 2020; pp. 299–316. ISBN 978-3-030-43953-8. [Google Scholar]
Pandey, V.K.; Pourghasemi, H.R.; Sharma, M.C. Landslide susceptibility mapping using maximum entropy and support vector machine models along the highway corridor, Garhwal Himalaya. Geocarto Int. 2018, 35, 168–187. [Google Scholar] [CrossRef]
Felicísimo, Á.M.; Cuartero, A.; Remondo, J.; Quirós, E. Mapping landslide susceptibility with logistic regression, multiple adaptive regression splines, classification and regression trees, and maximum entropy methods: A comparative study. Landslides 2013, 10, 175–189. [Google Scholar] [CrossRef]
Dawn Six Killed as Heavy Rains Pound KP 2019. 2019. Available online: https://www.dawn.com/news/1496462. (accessed on 20 May 2020).

Figure 1. The methodological framework of modeling and evaluation strategy.

Figure 2. Location of the study area and landslide inventory showing training and validation data.

Figure 3. Monthly average temperature and precipitation (a) northern sone (b) southern zone.

Figure 4. Photographs of events: (a) landslide along KKH, (b) GE visualization of this event.

Figure 5. Maps of the Landslide Influencing Factors: (a) elevation, (b) slope, (c) aspect, (d) TWI, (e) TPI, (f) distance to drainage, (g) distance to fault, (h) distance to road, (i) NDVI, (j) rainfall, (k) LCLU, and (l) geology.

Figure 6. Evaluation parameters of landslide influencing factors.

Figure 7. LIF importance analysis: (a) contribution to each model, (b) relative importance and overall contribution.

Figure 8. Landslide susceptibility maps: (a) RF, (b) SVM, (c) maxent, (d) GBM, and (e) LR.

Figure 9. ROC curves of modeled results using validation data.

Figure 10. Performance evaluation and comparison: (a) ACC, precision, recall, F1 Score, and MCC values; (b) POA of the models.

Figure 11. Model comparison: (a) correlation between models, (b) spider graph of SVM vs. slope aspect.

Figure 12. Recent landslide (black rectangle) near Kaghan: (a) RF, (b) SVM, (c) maxENT, (d) GBM, (e) LR, (f) GE image.

Table 1. Characteristic details of LIFs.

S. No.	Influencing Factor	Input Data	Description
1	Elevation	SRTM DEM	Digital elevation of the terrain surface. Values vary between 279 and 5934 m (Figure 5a).
2	Slope	SRTM DEM	A most crucial parameter that has a direct influence on slope failure and susceptibility. Values are up to 89 degrees (Figure 5b).
3	Aspect	SRTM DEM	The exposure of the slope to conditions such as sunlight, temperature, and winds. An important causative factor to susceptibility (Figure 5c).
4	TWI	SRTM DEM	TWI [48] is an index used to quantify topographic control of hydrological processes. Values vary from −3.7 to 16.43 (Figure 5d).
5	TPI	SRTM DEM	TPI [48] is an index that reflects the morphology of the topography (Figure 5e).
6	Distance to drainage	Topographic Sheet	Rivers and natural streams play an important role in slope failure due to the accumulation of water in the surrounding surface and subsurface. The proximity layer was generated based on Euclidean distance [49]. Values vary from 0 to 50,000 m (Figure 5f).
7	Distance to fault	Geological Map	The strength of rocks/soil decreases with the presence of faults and lineaments. The fault lines were extracted from input datasets and the proximity layer was calculated based on Euclidean distance [49]. Values vary from 0 to 7250 m (Figure 5g).
8	Distance to road	Topographic Sheet	To evaluate the effects of road engineering, this proximity layer was calculated based on Euclidean distance [49]. Values vary from 0 to 30,000 m (Figure 5h).
9	NDVI	Landsat-8 OLI	NDVI [50] illustrates the density and spread of vegetation in contrast with the non-vegetated land. It is an important factor that exploits the relationship between landslide occurrence and vegetation cover density. Values range between −1 and +1 (Figure 5i).
10	Rainfall	Topographic Rainfall Mission (TRMM)/GPM	Monthly averaged data product for the year 2017 was downloaded and used for the assessment with other factors. As discussed in Section 2.1 on the study area, the two climatic zones (north and south) can easily be identified in this rainfall map (Figure 5j).
11	LCLU	Landsat-8 OLI	The object-based image analysis (OBIA) technique [51] was adopted to extract 8 LCLU classes with an accuracy of 86% (Figure 5k).
12	Geological layer	Geological Map	Fourteen different units of geology were identified. The distribution and details of these units can be found in Figure 5l.

Table 2. Characteristic details of index-based evaluation measures.

Index	Statistical Definition	Usage
Precision	$\frac{TP}{(TP + FP)}$	This evaluates the fraction of TP samples among all predicted positive samples.
Recall	$\frac{TP}{TP + FN}$	This quantifies the fraction of TP samples among all real positive samples in the data.
F1 score	$\frac{2 * PPV * SST}{PPV + SST}$	This harmonic mean of precision and recall provides a value between 0 (worst) and 1 (best).
ACC	$\frac{(TP + TN)}{(TP + TN + FP + FN)}$	Accuracy is the quantification of percentage samples for accurately predicted data in inventory/catalogue.
MCC	$\frac{TP * TN - FP * FN}{\sqrt{(TP + FP) (TP + FN) (TN + FP) (TN + FN)}}$	The most comprehensive index provides values between −1 (disagreement between sample and prediction) and 1 (as a perfect prediction with respect to samples).

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Shahzad, N.; Ding, X.; Abbas, S. A Comparative Assessment of Machine Learning Models for Landslide Susceptibility Mapping in the Rugged Terrain of Northern Pakistan. Appl. Sci. 2022, 12, 2280. https://doi.org/10.3390/app12052280

AMA Style

Shahzad N, Ding X, Abbas S. A Comparative Assessment of Machine Learning Models for Landslide Susceptibility Mapping in the Rugged Terrain of Northern Pakistan. Applied Sciences. 2022; 12(5):2280. https://doi.org/10.3390/app12052280

Chicago/Turabian Style

Shahzad, Naeem, Xiaoli Ding, and Sawaid Abbas. 2022. "A Comparative Assessment of Machine Learning Models for Landslide Susceptibility Mapping in the Rugged Terrain of Northern Pakistan" Applied Sciences 12, no. 5: 2280. https://doi.org/10.3390/app12052280

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

A Comparative Assessment of Machine Learning Models for Landslide Susceptibility Mapping in the Rugged Terrain of Northern Pakistan

Abstract

1. Introduction

2. Materials and Methods

2.1. Description of the Study Area

2.2. Data Acquisition and Data Preparation

2.3. Preparation of Landslide Inventory

2.4. Selection of Landslide Influencing Factors (LIFs)

2.5. Evaluation of Landslide Influencing Factors

2.6. Machine Learning Algorithms and Susceptibility Modeling

2.6.1. Random Forest (RF)

2.6.2. Support Vector Machine (SVM)

2.6.3. Maximum Entropy (maxENT)

2.6.4. Gradient-boosting Machine (GBM)

2.6.5. Logistic Regression (LR)

2.7. Performance Evaluation of Models

3. Results

3.1. Importance and Contribution of Landslide Influencing Factors (LIFs)

3.2. Landslide Susceptibility Maps (LSMs)

3.3. Performance Evaluation of Models and Analysis of the Results

4. Discussions

4.1. Evaluation of LIFs and Modeling Criteria

4.2. Performances of Susceptibility Modeling

4.3. Realtime Validation of Modeled Results

5. Conclusions

Supplementary Materials

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI