Next Article in Journal
Deep Seasonal Network for Remote Sensing Imagery Classification of Multi-Temporal Sentinel-2 Data
Next Article in Special Issue
Deformation Behavior and Reactivation Mechanism of the Dandu Ancient Landslide Triggered by Seasonal Rainfall: A Case Study from the East Tibetan Plateau, China
Previous Article in Journal
Black Carbon in a City of the Atacama Desert before and after the Start of the COVID-19 Lockdown: Ground Measurements and MERRA-2 Reanalysis
Previous Article in Special Issue
Landslide Susceptibility Analysis on the Vicinity of Bogotá-Villavicencio Road (Eastern Cordillera of the Colombian Andes)
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Deep Learning and Machine Learning Models for Landslide Susceptibility Mapping with Remote Sensing Data

1
School of Computer Science, China University of Geosciences, Wuhan 430074, China
2
Ningbo Alatu Digital Science and Technology Corporation Limited, Ningbo 315000, China
3
School of Geography and Information Engineering, China University of Geosciences, Wuhan 430074, China
4
Badong National Observation and Research Station of Geohazards, China University of Geosciences, Wuhan 430074, China
*
Author to whom correspondence should be addressed.
Remote Sens. 2023, 15(19), 4703; https://doi.org/10.3390/rs15194703
Submission received: 30 August 2023 / Revised: 23 September 2023 / Accepted: 23 September 2023 / Published: 26 September 2023

Abstract

:
Karakoram Highway (KKH) is an international route connecting South Asia with Central Asia and China that holds socio-economic and strategic significance. However, KKH has extreme geological conditions that make it prone and vulnerable to natural disasters, primarily landslides, posing a threat to its routine activities. In this context, the study provides an updated inventory of landslides in the area with precisely measured slope deformation (Vslope), utilizing the SBAS-InSAR (small baseline subset interferometric synthetic aperture radar) and PS-InSAR (persistent scatterer interferometric synthetic aperture radar) technology. By processing Sentinel-1 data from June 2021 to June 2023, utilizing the InSAR technique, a total of 571 landslides were identified and classified based on government reports and field investigations. A total of 24 new prospective landslides were identified, and some existing landslides were redefined. This updated landslide inventory was then utilized to create a landslide susceptibility model, which investigated the link between landslide occurrences and the causal variables. Deep learning (DL) and machine learning (ML) models, including convolutional neural networks (CNN 2D), recurrent neural networks (RNNs), random forest (RF), and extreme gradient boosting (XGBoost), are employed. The inventory was split into 70% for training and 30% for testing the models, and fifteen landslide causative factors were used for the susceptibility mapping. To compare the accuracy of the models, the area under the curve (AUC) of the receiver operating characteristic (ROC) was used. The CNN 2D technique demonstrated superior performance in creating the landslide susceptibility map (LSM) for KKH. The enhanced LSM provides a prospective modeling approach for hazard prevention and serves as a conceptual reference for routine management of the KKH for risk assessment and mitigation.

1. Introduction

Landslides, one of the most common natural disasters, prevalent in mountainous regions worldwide, pose significant threats to the ecosystem [1]. Landslides are accounted as the downhill movement of debris, soil, and rocks under the force of gravity and can be classified based on the materials involved (mud, rock, soil, or debris) and their movement type (topple flow or slide) [2]. The factors leading to landslides are a combination of tectonics, geomorphology, and climate change, which culminate in a critical slope evolution [3,4]. Other triggering factors contribute to landslides depending on the specific features of the area. Natural variables such as rainfall, rapid snowmelt, earthquakes, and anthropogenic activities, e.g., habitation construction, irrigation, etc., can play a role in the occurrence of landslides [5]. While landslides are often regarded as a natural process, their occurrence mostly has been influenced by anthropogenic activity [6]. In recent years, exponentially growing populations, a surge in infrastructure development, and settlement growth in developing countries’ mountainous regions have increased the probability of landslides, leading to an alarming increase in landslide-related fatalities [7].
The “China-Pakistan Economic Corridor” (CPEC), a significant project under the “One Belt and One Road” initiative, is centered around connecting Pakistan and China via the Karakoram Highway (KKH). The KKH was constructed from 1974 to 1978 and commenced operation in 1979. The highway encompasses most of the route of the CPEC. However, this vital route faces challenges due to the high mountainous terrain with overflowing loose debris and heavy rainfall, triggering frequent and severe geological catastrophes such as glacier debris flows, rock falls, landslides, debris and soil slippage, and avalanches [8]. Determining landslide probabilities along the KKH is a complex process influenced by limited data availability, technical limitations, and harsh environments. Since its completion, the reputation of the KKH has been marred by various geohazards [9]. Specifically, earth-induced landslides in 2005 caused considerable damage to the highway [10]. Enormous rockslides and rock avalanches have occurred, with over 115 incidents reported since 1987 [11]. Moreover, in 2010, a landslide blocked the Hunza River, inundating 19 km of the highway with a loss of 20 lives and damaging 350 houses [12]. The geological conditions along the KKH pose additional challenges, including fragile and weathered rock masses, varying climates, low and high terrains, diverse stratigraphy, and local variations in tectonic motion. Due to these factors, the study region has become a geohazard laboratory. Enhancing precise LSM along the KKH to mitigate the risks posed by these natural hazards is imperative.
Recently, remote sensing (RS) and geographic information systems (GIS) technology have made remarkable technological progress. The utilization of GIS spatial analysis tools and remote-sensing-derived data has enhanced the effectiveness of landslide susceptibility mapping for accurate assessment. Here, comprehensive landslide inventory data and knowledge of landslide conditioning factors are crucial for both data-driven spatial modeling and knowledge-based approaches [13]. Researchers have conducted numerous studies using bivariate analyses to quantify the spatial correlations between landslides and specific factors that influence their dispersion [14,15,16,17]. Several other studies have applied knowledge-based spatial approaches to produce natural risk vulnerability maps, fuzzy logic models [18,19], the analytical hierarchy process (AHP) [20], and the evidential belief function [21], as well as data-driven spatial approaches such as support vector machines [22,23,24], logistic regression methods [24,25], artificial neural network (ANN) models [26,27,28], alternating decision tree (ADTree) [29], principal component analysis (PCA) [30], deep belief network (DBN) [31], decision tree [25,32], superposable neural networks [33], and naïve Bayes [34]. Expertise-based models often encounter challenges due to their reliance on expert opinions, which can introduce biases [35,36].
The primary strengths of probabilistic and ML approaches lie in their objective statistical foundation, consistency, capacity for precisely analyzing the factors influencing landslide development, and capacity building for updates. In this perspective, researchers are continuously seeking new and relatively more robust algorithms that can generalize across different spatial scales [37,38]. Deep learning algorithms, which are specifically developed for large datasets but have seen limited application thus far, need to be implemented and evaluated in this context. Currently, deep learning models, particularly recurrent neural networks and convolutional neural networks, have demonstrated remarkable success across various applications, making them well suited for handling big data [39]. RNNs, like other DL models, comprise a loss function, learnable parameters, and layers [40]. On the other hand, CNNs differ from RNNs as they include convolutional and pooling layers and focus solely on the current input data, while RNNs consider both the earlier provided inputs and present input data [41]. CNNs have proven effective in tasks like semantic segmentation and object detection [7]. Conversely, RNNs show superior performance in tasks such as image recognition, characterization, and sequential data analysis, including time series spatial data [42]. Despite the acceptable results achieved by CNNs and RNNs in various domains, their true efficiency and capabilities in landslide modeling and large-scale landslide susceptibility mapping (LSM) on big data have not been thoroughly analyzed [13]. A few deep learning models have been utilized for natural hazard vulnerability mapping, containing landslide susceptibility mapping and flash floods [43,44,45]. However, these studies have separately employed different deep learning models, and their relative proficiency has not been evaluated yet.
In recent years, interferometric synthetic aperture radar (InSAR) methods have acquired universal approval and usage as tools for landslide monitoring and mapping. Over the past two decades, the RS technique, particularly In-SAR, has demonstrated substantial possibility across different fields, including the study of landslide deformation [46] and groundwater extraction [47]. PS-InSAR proves useful in automatic slow-moving landslide mapping using a spatial statistical technique, the detection of particular landslides and the delineation of extended unstable regions, redefining of the limits of historical landslides, the detection of landslides using a multitemporal analysis of SAR imagery, and the verification of the terrain elements causing slope deformation [48]. In areas prone to frequent and rapid large landslides, RS provides a solution through surveys and advanced detection methods [49]. These techniques can greatly aid in assessing and creating landslide inventory maps. Various methods of InSAR have been effectively used in mapping slope displacement, including that in [50], the assessment of land displacement places identified by using SBAS-InSAR [51], the D-InSAR technique for landslide observing and land deformation [51,52], the coherence pixel technique [53], the SqeeInSAR approach to measuring surface motion [51], interferometric point target analysis [54], the use of StaMPS to evaluate the displacement in a high-vegetation region [55,56], and the PSInSAR method to compute the movement of landslides. These approaches are related to detecting and mapping landslide events, as mentioned in [54,57,58].
In this study, a combination of optical RS analysis and the InSAR technique is utilized to identify landslides and create an updated landslide inventory. The main goals are as follows: (1) mapping all types of landslides along the KKH and estimating displacement maps to identify new landslides, identify unstable places, and redefine the boundaries of previously identified landslides based on the deformation model; (2) generating a landslide susceptibility map using state-of-the-art ML and deep learning (DL) models, including random forest, XGBoost, recurrent neural networks, and convolutional neural networks; (3) comparing the performance of these advanced ML and DL models in terms of landslide susceptibility; (4) assessing the significance and relationships of environmental and anthropogenic factors influencing landslides and their role in evaluating landslide susceptibility in the study area; and (5) determining the most accurate susceptible model reliant on precision and AUC value. Despite the fact that the KKH faces significant landslide threats every summer, previous research has not adequately addressed the issue. Therefore, the landslide susceptibility map produced in this study will aid urban planning and disaster reduction efforts in the area. Moreover, the final InSAR-based landslide inventory will assist in tracking risky areas to minimize future hazards and fatalities. It is imperative to highlight that no previous studies have applied RNNs and CNNs for LSM at KKH. As the first study to utilize and compare these ML and DL models for LSM in this region, it will substantially contribute to the scientific literature.

2. Materials and Methods

2.1. Study Area and Geological Settings

The KKH in northern Pakistan is significant as part of the CPEC but is prone to frequent disruptions caused by various geological and hydro-climatological hazards. The study area was focused along a 263 km section of the KKH, passing through different districts of Gilgit Baltistan. A 5 km buffer around this section, covering an area of 3320 km2 (Figure 1), was examined for the study. The terrain in the study area is rugged, with elevations ranging from 822 to 5545 m above mean sea level. The area experiences mild summers and harsh winters, and the yearly rainfall varies from 120 to 130 mm. The minimum and maximum temperatures are −21 °C and 16 °C, respectively, according to data from the Meteorological Department of Pakistan (https://www.pmd.gov.pk, accessed on 15 March 2023).
The lithology in the research area is significant in triggering landslides. The rocks of the area are primarily of Mesozoic and Paleozoic age. Based on the geological map produced by Searle et al. [59], the research area comprises a diverse range of rocks, including sedimentary, volcano-sedimentary, igneous, volcanic, and metamorphic rocks (Figure 2). These rocks are further stratified into various types, such as greenschist, siliciclastic, carbonates, basalt, andesite, granite, gabbro, and others. The Chalt schists, kilk formation, Quaternary sediments, deformed Misgar slates, and Gujhal dolomite are the most significant among the area’s lithologic formations. All of these lithologies have been tectonically affected and have contributed to slope destabilization along the highway [9]. Over time, the lithologies exposed along the KKH have undergone weathering and weakening due to anthropogenic, hydro-climatic, and seismic events, resulting in significant landslides and surface distortion. Structurally, the area is sophisticated because it lies in a convergent boundary, specifically the Main Karakoram Thrust. This structural setting adds to the susceptibility of the area to geological hazards like landslides.

2.2. Datasets

The Alaska Satellite Facility (ASF) datasets provide an ALOS-PALSAR DEM (digital elevation model) with a resolution of 12.5 m, which was accessed from https://search.asf.alaska.edu/ (accessed on 10 February 2023). Additionally, Sentinel-2 images with a resolution of 10 m were derived from the Copernicus dataset (https://scihub.copernicus.eu, accessed on 10 February 2023) to produce a landcover map for the study area. Geological maps and fault lines for the research area were processed using the ArcGIS software 10.8 to understand the geological features [59,60]. To assess the relationship between rainfall and landslide events, annual precipitation data were obtained from the GIOVANNI online database system (https://giovanni.gsfc.nasa.gov/giovanni/, accessed on 15 March 2023), as rainfall and landslides are found to be directly proportionally related. For this study, two years (June 2021 to June 2023) of C-band Sentinel-1A SAR dataset imagery was obtained from the ASF (search.asf.alaska.edu) online system. The dataset contains scenes in descending and ascending tracks, as presented in Table 1. Figure 3 depicts the technical route of the research.

2.3. Updated Landslide Inventory

Creating landslide inventory maps is a crucial step in LSM [61]. These maps provide essential information about the landscape’s locations and types of landslides, serving as a foundation for predicting future landslides [1]. Landslide inventory maps are generated for a diversity of reasons, including identifying the type and location of landslides in a particular region; showing the impact of a single landslide-triggering incident, such as a rapid snowmelt incident, an intense rainfall event, or an earthquake; emphasizing the quantity of mass movements; calculating the frequency area statistics of slope failures; and providing relevant data to build landslide risk models or susceptibility models [62].
In this study, a total of 571 landslides were mapped using various techniques, including SBAS-InSAR and PS-InSAR, past studies [12,60,63], Frontier Works Organization road clearance logs, optical imagery analysis, Google Earth, and fieldwork in the study area. The landslide inventory, on the other hand, was developed through the visual interpretation of Sentinel-2 images with 10 m resolution (2022) and by using Google Earth, and it was checked using previous documents and a field evaluation of the study region. The inventory map was categorized into eight categories based on the material displacement, comprising 99 scree, 113 rockslides, 28 rock falls, 20 rock avalanches, 20 debris slides, 271 debris flows, 7 debris falls, and 13 complex slides (Figure 4).
The inventory map is applied to verify the identification of landslides through InSAR in the study area. Following the InSAR analysis, potential landslides are recognized based on their high displacement velocity, and these newly detected landslides are then incorporated into the updated version of the inventory map.

2.4. Landslide Conditioning Factors (LCFs)

The process of achieving high accuracy in the landslide susceptibility model and predicting vulnerable areas heavily relies on carefully selecting and preparing the Landslide Conditioning Factors (LCFs) database [64]. There are no universal standards for selecting independent variables for LSM [8,65,66]. The LCFs were chosen in this study based on information gathered from the relevant literature, data specific to the study area, and field investigations. Fifteen LCFs were selected for the current study, including slope, aspect, topographic wetness index (TWI), distance to roads, lithology, distance to rivers, roughness, distance to faults, curvature, precipitation, plan curvature, soil, profile curvature, elevation, and landcover (Table 2). Thematic layers with a spatial resolution of 12.5 × 12.5 m pixel size were prepared (Figure 5 and Figure 6), all using the WGS84 Datum, UTM-Zone 43 coordinate system.

2.5. Multicollinearity Assessment

In landslide susceptibility prediction, selecting the right variables is a critical step that significantly impacts the model’s performance. To improve the accuracy of hazard mapping, it is essential to identify and choose appropriate variables for inclusion in the model [67]. One of the main challenges in variable selection is dealing with multicollinearity, which may arise due to the improper use of redundant or highly correlated factors [68]. To address the issue of multicollinearity, tolerance and variance inflation factors (VIF) are calculated. These measures help to identify closely and linearly related variables in a regression model, which could potentially lead to a reduction in the model’s performance [69]. Through the standard equations for VIF and tolerance, a thorough evaluation of the degree of multicollinearity in the data is conducted. By identifying and removing variables with high levels of multicollinearity, the precision of landslide susceptibility models can be enhanced, and the most significant variables contributing to the prediction of landslide hazards can be identified more accurately.
T O L = 1 R j 2
V I F = 1 1 R j 2
where R j 2 represents the regression value of j on various factors. Thus, multicollinearity problems usually happen if the TOL value is <0.10 and the variance inflation factors value is >10 [70].

2.6. Machine Learning Models

The modeling process included fitting, identifying, and developing an ML model.
The grid unit was used as the model unit, with a spatial resolution of 12.5 m for both the DEM and RS data, and all evaluation factors are recalculated at this level.
The model includes 15 conditioning factors and a landslide target variable (1 indicating landslide and 0 indicating non-landslide), with each row producing an object.
Each column illustrates an object’s characteristic, and it is modified into a two-dimensional matrix for training (70%, 2138 samples) and testing (30%, 917 samples).
The models are constructed using training data, and predictions are made using test data. The landslide vulnerability index maps are produced by combining the prediction values for each model unit in each group. The results of the four models are transferred to a geographic information system. Landslide vulnerability is classified into five categories using Jenks natural breaks [71]: very low, low, moderate, high, and very high. The four models are tested using the ROC curve and the AUROC curve.

2.6.1. RF

Random forest (RF) is a well-known homogeneous bagging-based ensemble model developed by Breiman in 2001 [72]. It consists of multiple decision trees, making it an ensemble learning technique that aggregates the outputs of these trees to produce a classification [72,73,74]. The mechanism of the RF model can be outlined as follows:
I.
Using the bootstrap approach, it creates numerous decision trees to randomly choose fresh sample sets from the initial training dataset with substitution.
II.
At each resampling, a set of features is randomly selected, and the decision trees are built based on this subset of features.
III.
The generated trees are combined into an RF, which is then used to categorize new data.
The RF method exhibits robustness against missing, unbalanced, and multicollinear data and is capable of handling high-dimensional data [75,76]. One of its primary advantages is its resistance to overfitting, even when a large number of random forest trees are grown. Additionally, there is no need to rescale, transform, or modify the data when applying the RF algorithm. In this research, the landslide susceptibility model was developed using the “randomForest” package in R 4.0.2 software [16]. After numerous attempts, the number of trees (ntree) was set to 500, and the mtry parameter was set to 6. To minimize the fluctuation of the model findings and to limit overfitting, RF was conducted using a 10-fold cross-validation technique. The hyperparameters used in the RF model are listed in Table 3.

2.6.2. XGBoost

The XGBoost supervised classification model is built on the gradient tree boosting algorithm [77,78], which is a powerful machine learning model designed by Chen and Guestrin in 2016 [79]. This model creates consecutive decision trees using the estimated residuals or errors from the preceding tree rather than integrating separate trees. This approach allows the algorithm to focus on samples with higher uncertainty, improving its performance. XGBoost offers several advantages, including scalability for various use cases with low computing resource essentials, fast processing speed, efficient handling of sparse data, and smooth integration [80]. The algorithm utilizes a loss function with an additional regularization term to smooth the final learned weights and prevent overfitting [79]. It also employs first- and second-order gradient statistics to optimize the loss function [81]. While XGBoost shares some parameters with other tree-based models, it involves additional hyperparameters to control the overfitting concern, enhance precision, and mitigate forecasting variance [82]. This study develops the landslide susceptibility model using the “XGBoost” package in R 4.0.2 software, which provides powerful capabilities for classification tasks [83]. In this research, three general parameters were chosen for us to alter in the XGBoost algorithm for LSM application: nrounds (the maximum number of boosting repetitions), subsample (the subsample ratio of the training instance), and colsample_bytree (the subsample ratio of columns while constructing each tree). The hyperparameters used in XGBoost model are listed in Table 4. The key points and usability of machine learning models are shown in Table 5.

2.7. Deep Learning Models

2.7.1. CNN-2D

As a supervised DL approach, the CNN excels in achieving high predictive performance in fields such as image and speech recognition. It accomplishes this by hierarchically composing simple local features into complex models. A typical CNN comprises one or more convolutional layers, fully associated layers, and max pooling layers, enabling it to classify and extract features from high-dimensional data [84]. In the context of landslide susceptibility mapping (LSM), the landslide occurrence potential in each grid cell is influenced by multiple factors. Each grid cell possesses a unique set of characteristic values that illustrate the likelihood of a landslide event. We must initialize the process to apply the CNN for LSM by transforming the 1D input grid cell containing various characteristic attributes into a 2D matrix.
We compared the number of landslide-impacting variables with the number of characteristic values for every variable in this study. The largest of these two integers was then chosen to define the size of the related 2D grid. For example, if the research region has 9 lithological classes and 15 landslide influencing factors, we create 9 × 9 matrices for each grid cell. In the matrix, each column vector represents an attribute value, and the element at the corresponding position is assigned the value of 1. In contrast, other elements in the vector are assigned the value of 0. Some of the predictive parameters utilized in the present research for CNN models are provided in Figure 7 and Table 6.

2.7.2. RNN

An RNN (recurrent neural network) has the ability to capture dynamic information in sequential data by creating connections between hidden layer nodes at different time steps. Unlike other neural networks, RNNs can effectively leverage sequential data. In traditional neural networks, all inputs are treated as self-reliant entities. However, in the RNN technique, each unit is linked to other units in the hidden layer at various time intervals, allowing data to be propagated from one layer to the next in the network [85]. This characteristic of RNNs is achieved through the concept of “loop feedback,” where information is shared throughout the RNN. A simple RNN is typically executed using Jordan or Elman network architectures. At time step t, let x t , y t , and h t represent the input vector, the output vector, and the hidden state vector, respectively. By utilizing these elements, we can acquire:
h t = σ W h x t + U h h t 1 + b h
y t = σ W y h t + b y
where σ (⋅) is the training sample sequence’s loss function, U and W are variable matrices, and b is the appropriate bias vector.
RNN is particularly adept at manipulating sequential inputs through its recurrent hidden states. Hence, the accurate visualization of data is crucial in realizing the forecasting capability of RNNs. This portion presents the data visualization method for landslide susceptibility mapping using RNNs, as illustrated in Figure 8. The parameters used in the RNN model are listed in Table 7.
First, each LCF is treated as a single-band image, and all variable categories are assembled. Subsequently, these variable categories are then ordered in a decreasing sequence of relevance. Hence, each pixel can be transformed into a sequential sample based on its importance level. This approach ensures that the most significant variables are fed to the RNN framework first, while the less crucial variables are sent to the model last. Because of the recurrent nature of the RNN, essential data contributing to landslide occurrences are reserved and passed to the next hidden state. This retention of important information is advantageous for the final LSM. By organizing the data representation in this manner, we can effectively leverage the sequential processing capabilities of the RNN to improve the precision of the LSM. The key points and usability of DL models are in Table 8.

2.8. InSAR

The InSAR technique has proven to be highly valuable for the early identification of landslides due to its weather independence, wide monitoring coverage, and high accuracy. Among the various InSAR techniques, the small baseline subset (SBAS) is particularly useful for identifying slow-moving deformations with millimeter-level precision by utilizing a stack of SAR interferograms [86]. Additionally, the PS-InSAR and SBAS-InSAR techniques are utilized to evaluate the deformation in susceptible regions as generated by the models. This research collects and evaluates imagery from the Sentinel-1A IW sensor with a temporal resolution of 12 days.

2.8.1. SBAS-InSAR Processing

The SBAS-InSAR technology is used in this part to verify the LSM along the KKH. The basic data analysis chart, shown in Figure 3, incorporates the preprocessing of data, interferometric creation, phase unwrapping, refinement and re-flattening assessment, and displacement estimations.
The computation of time and spatial baselines between all Sentinel-1 picture pairs is part of data preparation. Following clipping and registration, the DEM data are utilized to finish image authorization, and the proportional conjunction that meets a particular threshold is chosen to generate a differential SAR interferogram set [87]. This investigation used a 30 m resolution SRTM-DEM to construct interferograms. The super primary image utilized comes from the images acquired on 15 July 2022, and 720 interferometric image pairings were created.
The key phase of SBAS-InSAR manipulation is an inversion, and the displacement computation is highly dependent on the investigation of inversion findings. The displacement rate and residual topography are estimated in the first inversion, and the input interferogram is optimized in the second unwrapping [88]. The second inversion expands upon the first by employing low-pass and high-pass filtering to calculate and eliminate the atmospheric phase, allowing for more accurate final displacement estimates and, ultimately, geocoding to determine the displacement rate dispersion in the research region. To avoid the effects of unwrapping inaccuracies, the line-of-sight displacement velocity was computed using a coherence threshold of 0.3 for SBAS-InSAR analysis [89].

2.8.2. PS-InSAR Processing

The PS-InSAR process analyzes the uniformity of the phase and amplitude using multitemporal SAR images wrapped around the same area, which determines the pixels that are not as caused by spatiotemporal decorrelation and then defines specific displacement information on each component of the phase, which must be conjunctly assessed and modeled to remove discrepancies [90,91].
The data preparation analysis steps (Figure 3) for importing SLC data with accurate paths comprise the following:
Acquiring imagery: Images with the same rotations are obtained, and both slave and master images are selected. The master images of the research area are obtained first, followed by the selection of slave images that overlap the same region.
Coregistration and examination: A specific area of interest is coregistered and evaluated. Various measures such as atmospheric phase screen (APS), track errors, and other factors are corrected and measured.
Phase constancy assessment: The phase constancy of the acquired data is evaluated. The pixels are projected to exhibit similar amplitudes and reduced phase distributions for these acquisitions. Absolute amplitude levels are not a significant concern in terms of manipulation disturbances.
Amplitude stability index (ASI): The ASI is used to choose persistent scatterers (PS) in the SARPROZ software (2023) procedure. PS points with ASI values greater than 0.7 are selected. This constraint parameter ensures that only a limited number of PS points are considered, which is necessary for accurate atmospheric phase screen computation.
Reference network and linear model: A reference network is built by linking PS points using Delaunay triangulation. The extracted linear model is removed, including residual height and linear deformation velocities. The APS is analyzed using an inverse network from the phase residual, and a single point of reference is defined to estimate the object’s velocity.
Multi-image sparse point (MISP) processing: Second-order PS points are chosen using the criteria of ASI > 0.6 in this step. Thicker PS points are obtained at this phase. To eliminate APS, identical parameters and reference points used for APS estimates are applied.
Geocoding and visualization: Google Earth is used to geocode and map the PS points. The landslide susceptibility map only includes PS points with a coherence of 0.60, indicating their reliability [92].

3. Results

3.1. Multicollinearity Analysis

This research assumed that there are no significant linear associations among the landslide causative variables that can negatively impact the susceptibility models. A multicollinearity analysis is conducted on the 15 landslide conditioning variables, and the outcomes are presented in Table 9. The rainfall variable has the highest VIF score of 4.892, while the lowest VIF score of 1.017 is observed for Aspect. The TOL values ranged from 0.204 to 0.982.

3.2. Landslide Susceptibility Mapping

The landslide susceptibility maps are generated using deep learning and machine learning models: CNN 2D, RNN, XGBoost, and RF (Figure 9). The experiment showed that the CNN 2D model had the best performance among the four models. The LSMs were classified into five categories using the Jenks natural break [71] technique in ArcGIS.
The precision of the landslide susceptibility maps was assessed using the confusion matrix proposed by [32]. Table 10 shows the performance of the CNN 2D, RNN, XGBoost, and RF models during the training stage. The outcomes revealed that the CNN 2D model has a high precision rate of 0.836 in the study region. Further validation was performed using the ROC (receiver operating characteristic) method [93], which plots “sensitivity” vs. “specificity” for different cut-off estimates. However, to understand the model’s performance, the ROC’s AUC was used [94]. The AUC results for the CNN 2D, RNN, XGBoost, and RF models are 82.56%, 79.43%, 76.04%, and 75.37%, respectively (Figure 10).

3.3. Landslide Mapping Based on Deformation Velocity

A thorough landslide inventory was organized for the identification and analysis of landslides along KKH. This inventory was built by combining data from several sources, including displacement values acquired from both descending and ascending data gathered from PS-InSAR and SBAS-InSAR observation.
In this study, the InSAR analysis successfully detected the majority of previously mapped landslides. Moreover, based on the PS and SBAS data, several new landslides with significant deformation velocity were identified, along with the boundaries, which were calibrated through fieldwork. The PS and SBAS techniques were applied to derive displacement rates along the time series in a one-dimensional LOS direction using a set of highly coherent interferograms with small spatial and temporal baselines [95,96].

3.3.1. PS-InSAR Results

Surface deformation on the Earth’s surface was calculated using a temporal coherence threshold of 0.7 for PS-InSAR analysis. A total of 324,747 PS/DS target points were obtained, representing the LOS deformation values ranging from −92.37 to 72.28 mm/year. These values were converted into Vslope using the transformation formula (5), resulting in a total of 212,373 points. The maximum slope displacement velocity was determined, with an appropriate threshold set between 0 and −20 mm/year (Figure 11). It was ascertained that barren terrain has a higher concentration of PS points than forested regions.
V s l o p e = V L O S c o s
where VLOS is displacement and Ø is the incident angle.
As a result, a total of 15 potential landslides were identified and detected based on the PS-InSAR-identified displacement velocity. Among the 15 landslides detected using PS-InSAR, 10 were exclusively identified using ascending Sentinel-1 datasets, while 5 were specifically detected using descending Sentinel-1 datasets (Figure 12). This observation demonstrates that the combination of descending and ascending datasets can overcome the constraints of acquiring data from a single scanning posture, improving the landslide detection process.
In the identification and detection of landslides, the PS-InSAR identifies displacement data and characteristics from images and field photographs, which are found valuable when used in integration. Most of these landslides are concealed by PS-InSAR-detected coherent targets (Figure 13). However, some landslides may not strictly meet the criteria of higher deformation velocity to be identified as active landslides, as they might experience lower rates of deformation due to steeper slopes or being in an actual inactive state.

3.3.2. SBAS-InSAR Results

In the SBAS-InSAR processing, the LOS displacement velocity (VLos) was determined using a coherent threshold of 0.3. The slope orientation velocity (Vslope) was then derived from the satellite LOS data, showing only unidirectional displacement. Since landslides and Earth’s surface deformations mostly happen over steep land, Vslope is an essential constituent used to forecast landslide evolution. The SBAS-InSAR results indicate that the displacement velocity along the LOS ranged from −81.89 to 75.40 mm/year (as depicted in Figure 14).
The preliminary delineation of landslide boundaries was conducted by combining the displacement velocity along the slope from both descending and ascending datasets with visual analysis of optical RS images and field assessments. This involves referring to areas with relatively high deformation velocity, topographic characteristics obtained from the digital elevation model (DEM), and features observed in the optical images.
As a result of the SBAS-InSAR analysis, a total of 9 potential landslides were detected and identified. Additionally, 547 landslides were represented through a combination of information from the literature [12,60,66] and field observations (Figure 15). Among the 9 SBAS-InSAR-identified landslides, the ascending Sentinel-1 dataset identified 6, and 3 were specifically identified using the descending Sentinel-1 dataset.
Figure 15 provides comparative examples of identified landslides, most of which are wrapped by SBAS-InSAR-identified coherent targets. The analysis of the landslides revealed that approximately 90% of them are associated with Quaternary deposits, the Hunza plutonic unit, southern Karakorum Metamorphic Complex, Permian massive limestone, and Chilas Complex formations. Their delineation was achieved through field investigations, analysis of optical RS images, and references to the existing literature (Figure 16).
The majority of KKH’s landscape is barren, with nearly 90 percent of the landslides occurring in non-vegetated zones. While SBAS-InSAR and PS-InSAR methods may have limitations in vegetation-covered areas, they still apply to more than 60% of the land devoid of vegetation. This suggests that vegetation plays a critical role in controlling slope stability in the region, consistent with previous research findings [60,97,98].

4. Discussion

The current study utilized RS techniques, such as optical RS and InSAR, for risk assessment and landslide mapping along the KKH [63,99,100]. This study took benefit of the multi-azimuth interpretation provided by the descending and ascending Sentinel-1 dataset, allowing for more extensive monitoring of surface displacement. The PS-InSAR and SBAS-InSAR techniques effectively captured regions with high deformation rates in most areas along the KKH. Additionally, the comprehensive landslide inventory presented in this study includes the latest landslides, ensuring the database is up to date and its valuable information. By processing Sentinel-1 data from June 2021 to June 2023, utilizing the InSAR technique, 24 new prospective landslides were identified, and some existing landslides were redefined. This updated landslide inventory was then utilized to create a landslide susceptibility model, which investigated the link between landslide occurrences and the causal variables. By combining the findings from PS-InSAR, SBAS-InSAR, and field investigations, the inventory was updated with landslides that have the potential for future failure and pose risks for the region, contributing to improved landslide susceptibility mapping.
The selected landslide influencing factors were used to construct CNN 2D and RNN architectures for comparison with the XGBoost and RF methods. The LSMs were validated and compared based on the AUROC curve and accuracy. CNN is known for its ability to efficiently obtain spatial data using weight sharing and local connections, making it a promising method for landslide modeling [101]. Earlier research has shown that combining CNNs with additional statistical approaches can produce better accuracy in landslide susceptibility modeling than using CNNs alone [102]. According to [103], CNN models are an improved tool for landslide modeling due to their substantial outcomes and higher accuracy rate in spatial landslide forecasting. The outcomes also reveal that both DL and traditional ML algorithms give excellent precision in a variety of sectors, such as landslide assessment and earth science studies throughout the world [104], which is in line with our findings, which showed that the ROC for the four models varies from 82.56 to 75.37%.
In the subsequent experiments, the proposed CNN and RNN models demonstrated enhanced predictive capability compared to the popular XGBoost and RF classifiers. Specifically, CNN-2D attained the highest AUC value of 0.825 on the validation set, indicating its effectiveness in improving prediction performance and its potential as a potential approach for future research. Various statistical and machine learning approaches have been compared and applied for landslide spatial forecast in areas, including AHP and Scoops 3D [105], frequency ratio (FR) and weight of evidence [106], the weighted overlay technique and AHP [9], random forest [63], support vector classification (SVC) [107], and XGBoost [100], but DL techniques, such as CNNs, provide powerful improvements by automatically exploring representations from raw data, making them valuable in various fields, including landslide susceptibility assessments. The experimental findings highlighted that CNN-2D outperformed the traditional DL approach of RNNs and the classical XGBoost and RF ML techniques. Furthermore, the suggested data representation techniques offer an innovative approach to handling raw landslide data. By exploiting the power of DL methods and combining them with other approaches, there is enormous potential to advance landslide susceptibility analysis in the future. The proposed 2D CNN structure includes convolutional max pooling layers and a dropout layer. Overfitting is a common issue when utilizing a 2D CNN in LSM. To address overfitting, each convolution layer is subsequently followed by a dropout layer, which temporarily discards NN units during the training process of the CNN based on a certain probability. This helps to improve classification accuracies and enhance the model’s generalization capability.
Different LCFs influence landslide triggers and relate to each other, making the selection of appropriate variables crucial for building an accurate landslide susceptibility model. The aim is to construct models with reduced noise and greater forecast ability. Before analyzing landslide susceptibility, it is essential to evaluate the forecasting potential of each contributing factor. To attain this, efforts are made to select the most relevant and impactful factors. Multicollinearity analysis is employed to evaluate correlations between the LCFs. In this study, 15 landslide conditioning variables were chosen as independent factors for evaluating landslide susceptibility, and the results are presented in Table 3. The variance inflation factor (VIF) was used to test the multicollinearity between these factors. Among the selected factors, rainfall had the highest VIF score of 4.892, while aspect had the lowest VIF score of 1.017. The tolerance (TOL) values ranged from 0.204 to 0.982. The outcomes revealed that there is no significant multicollinearity among the chosen variables, allowing all variables to be integrated into the models. It is worth noting that landslides can still occur in areas with significant vegetation due to rainfall and other external forces.
Despite the beneficial effect of the selected factors in evaluating landslide susceptibility, the current research could have been more effective if certain factors were adhered to. One major factor is data availability. This research relied on limited data diversity with a focus on historical landslide data, which limited the comprehensive analysis [60]. Another factor is that the input data resolution remains unpredictable during the data preparation phase, which has been a prevalent issue in past investigations [108,109]. Terrain condition factors were derived from a 12.5 m resolution DEM, while variables related to geological conditions were based on a 1:500,000 scale geological map. All factor layers were resampled at a 12.5 m resolution in ArcGIS 10.8 software to ensure data availability and computational convenience. The analysis of model performance in this study indicates that resampling processing was feasible. Secondly, because of restricted data availability, we examined several types of landslides with varying triggering conditions throughout a given time. While some investigators have previously explored this approach, a separate investigation of distinct types of landslides is more in line with the practical and current state factors [110,111].

5. Conclusions

The PS-InSAR and SBAS-InSAR techniques and multi-track ascending and descending Sentinel-1 SAR datasets were used to measure surface displacement velocity along the KKH. An updated and comprehensive landslide inventory was created by combining field surveys, image analysis, and a literature evaluation, identifying 571 landslides, including 24 newly detected active landslides and 547 landslides from previous records. To predict landslide susceptibility along KKH, two well-known deep learning (CNN-2D and RNN) and machine learning (XGBoost and RF) algorithms were utilized and compared. The CNN-2D algorithm demonstrated superior performance with an AUC of 82.56, outperforming RNN, XGBoost, and RF in terms of AUC, ROC, predictive power, and accuracy. The landslide susceptibility maps generated by these models can serve as valuable tools for decision-makers, land use planners, and various non-governmental and governmental organizations involved in resource and disaster management, infrastructure development, and human activity in the study area. In the future, studies can explore improved deep learning and machine learning architectures for landslide susceptibility mapping to improve accuracy and predictive capabilities utilizing this research as a baseline.
The current study is limited because of the absence of geotechnical and geophysical data. It is suggested that the developed dataset be used in future studies to improve algorithm prediction potency, create more precise LSM, and discover correlations between landslide incidence and these new geo-environmental variables. It is also advised to combine DL algorithms with metaheuristic techniques to optimize model parameters and boost algorithm prediction capabilities.

Author Contributions

Conceptualization, M.A.H.; Methodology, M.A.H.; Software, M.A.H.; Validation, Y.Z. (Yulong Zhou); Formal analysis, Y.Z. (Yulong Zhou); Investigation, Y.Z. (Ying Zheng); Resources, Y.Z. (Ying Zheng); Writing—original draft, M.A.H.; Writing—review & editing, H.D.; Visualization, H.D.; Supervision, Z.C.; Project administration, Z.C.; Funding acquisition, Z.C. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by National Natural Science Foundation of China, grant number No. 41871305; National key R & D program of China, grant number No.2017YFC0602204; Fundamental Research Funds for the Central Universities, China University of Geosciences (Wuhan), grant number No. CUGQY1945; Opening Fund of Key Laboratory of Geological Survey and Evaluation of Ministry of Education; and Fundamental Research Funds for the Central Universities, grant number No. GLAB2019ZR02.

Data Availability Statement

The data presented in the study are available upon request from the first and corresponding authors. The data are not publicly available due to the thesis that is being prepared using these data.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Zhou, X.; Wen, H.; Zhang, Y.; Xu, J.; Zhang, W. Landslide susceptibility mapping using hybrid random forest with GeoDetector and RFE for factor optimization. Geosci. Front. 2021, 12, 101211. [Google Scholar] [CrossRef]
  2. Ngo, P.T.T.; Panahi, M.; Khosravi, K.; Ghorbanzadeh, O.; Kariminejad, N.; Cerda, A.; Lee, S. Evaluation of deep learning algorithms for national scale landslide susceptibility mapping of Iran. Geosci. Front. 2021, 12, 505–519. [Google Scholar]
  3. Chen, N.; Tian, S.; Wang, F.; Shi, P.; Liu, L.; Xiao, M.; Liu, E.; Tang, W.; Rahman, M.; Somos-Valenzuela, M. Multi-wing butterfly effects on catastrophic rockslides. Geosci. Front. 2023, 14, 101627. [Google Scholar] [CrossRef]
  4. Berrocal, J.; Espinosa, A.; Galdos, J. Seismological and geological aspects of the Mantaro landslide in Peru. Nature 1978, 275, 533–536. [Google Scholar] [CrossRef]
  5. Wilde, M.; Günther, A.; Reichenbach, P.; Malet, J.-P.; Hervás, J. Pan-European landslide susceptibility mapping: ELSUS Version 2. J. Maps 2018, 14, 97–104. [Google Scholar] [CrossRef]
  6. Lima, P.; Steger, S.; Glade, T.; Tilch, N.; Schwarz, L.; Kociu, A. Landslide susceptibility mapping at national scale: A first attempt for Austria. In Proceedings of the Advancing Culture of Living with Landslides: Volume 2 Advances in Landslide Science; Springer: Ljubljana, Slovenia, 2017; pp. 943–951. [Google Scholar]
  7. Ghorbanzadeh, O.; Blaschke, T.; Gholamnia, K.; Meena, S.R.; Tiede, D.; Aryal, J. Evaluation of different machine learning methods and deep-learning convolutional neural networks for landslide detection. Remote Sens. 2019, 11, 196. [Google Scholar] [CrossRef]
  8. Hussain, M.A.; Chen, Z.; Zheng, Y.; Shoaib, M.; Shah, S.U.; Ali, N.; Afzal, Z. Landslide susceptibility mapping using machine learning algorithm validated by persistent scatterer In-SAR technique. Sensors 2022, 22, 3119. [Google Scholar] [CrossRef]
  9. Ali, S.; Biermanns, P.; Haider, R.; Reicherter, K. Landslide susceptibility mapping by using a geographic information system (GIS) along the China–Pakistan Economic Corridor (Karakoram Highway), Pakistan. Nat. Hazards Earth Syst. Sci. 2019, 19, 999–1022. [Google Scholar] [CrossRef]
  10. Sato, H.P.; Hasegawa, H.; Fujiwara, S.; Tobita, M.; Koarai, M.; Une, H.; Iwahashi, J. Interpretation of landslide distribution triggered by the 2005 Northern Pakistan earthquake using SPOT 5 imagery. Landslides 2007, 4, 113–122. [Google Scholar] [CrossRef]
  11. Hewitt, K. Catastrophic landslides and their effects on the Upper Indus streams, Karakoram Himalaya, northern Pakistan. Geomorphology 1998, 26, 47–80. [Google Scholar] [CrossRef]
  12. Abbas, H.; Hussain, D.; Khan, G.; ul Hassan, S.N.; Kulsoom, I.; Hussain, S. Landslide inventory and landslide susceptibility mapping for china pakistan economic corridor (CPEC)’s main route (Karakorum Highway). J. Appl. Emerg. Sci. 2021, 11, 18–30. [Google Scholar]
  13. Ghorbanzadeh, O.; Blaschke, T.; Aryal, J.; Gholaminia, K. A new GIS-based technique using an adaptive neuro-fuzzy inference system for land subsidence susceptibility mapping. J. Spat. Sci. 2020, 65, 401–418. [Google Scholar] [CrossRef]
  14. Sestras, P.; Bilașco, Ș.; Roșca, S.; Naș, S.; Bondrea, M.; Gâlgău, R.; Vereș, I.; Sălăgean, T.; Spalević, V.; Cîmpeanu, S. Landslides susceptibility assessment based on GIS statistical bivariate analysis in the hills surrounding a metropolitan area. Sustainability 2019, 11, 1362. [Google Scholar] [CrossRef]
  15. Ding, Q.; Chen, W.; Hong, H. Application of frequency ratio, weights of evidence and evidential belief function models in landslide susceptibility mapping. Geocarto Int. 2017, 32, 619–639. [Google Scholar] [CrossRef]
  16. Youssef, A.M.; Pourghasemi, H.R. Landslide susceptibility mapping using machine learning algorithms and comparison of their performance at Abha Basin, Asir Region, Saudi Arabia. Geosci. Front. 2021, 12, 639–655. [Google Scholar] [CrossRef]
  17. Constantin, M.; Bednarik, M.; Jurchescu, M.C.; Vlaicu, M. Landslide susceptibility assessment using the bivariate statistical analysis and the index of entropy in the Sibiciu Basin (Romania). Environ. Earth Sci. 2011, 63, 397–406. [Google Scholar] [CrossRef]
  18. Chen, W.; Panahi, M.; Tsangaratos, P.; Shahabi, H.; Ilia, I.; Panahi, S.; Li, S.; Jaafari, A.; Ahmad, B.B. Applying population-based evolutionary algorithms and a neuro-fuzzy system for modeling landslide susceptibility. Catena 2019, 172, 212–231. [Google Scholar] [CrossRef]
  19. Ali, S.A.; Parvin, F.; Vojteková, J.; Costache, R.; Linh, N.T.T.; Pham, Q.B.; Vojtek, M.; Gigović, L.; Ahmad, A.; Ghorbani, M.A. GIS-based landslide susceptibility modeling: A comparison between fuzzy multi-criteria and machine learning algorithms. Geosci. Front. 2021, 12, 857–876. [Google Scholar] [CrossRef]
  20. Jena, R.; Pradhan, B.; Beydoun, G.; Sofyan, H.; Affan, M. Integrated model for earthquake risk assessment using neural network and analytic hierarchy process: Aceh province, Indonesia. Geosci. Front. 2020, 11, 613–634. [Google Scholar] [CrossRef]
  21. Bui, D.T.; Pradhan, B.; Lofman, O.; Revhaug, I.; Dick, O.B. Spatial prediction of landslide hazards in Hoa Binh province (Vietnam): A comparative assessment of the efficacy of evidential belief functions and fuzzy logic models. Catena 2012, 96, 28–40. [Google Scholar]
  22. Balogun, A.-L.; Rezaie, F.; Pham, Q.B.; Gigović, L.; Drobnjak, S.; Aina, Y.A.; Panahi, M.; Yekeen, S.T.; Lee, S. Spatial prediction of landslide susceptibility in western Serbia using hybrid support vector regression (SVR) with GWO, BAT and COA algorithms. Geosci. Front. 2021, 12, 101104. [Google Scholar] [CrossRef]
  23. Achour, Y.; Pourghasemi, H.R. How do machine learning techniques help in increasing accuracy of landslide susceptibility maps? Geosci. Front. 2020, 11, 871–883. [Google Scholar] [CrossRef]
  24. Wang, H.; Zhang, L.; Yin, K.; Luo, H.; Li, J. Landslide identification using machine learning. Geosci. Front. 2021, 12, 351–364. [Google Scholar] [CrossRef]
  25. Tien Bui, D.; Tuan, T.A.; Klempe, H.; Pradhan, B.; Revhaug, I. Spatial prediction models for shallow landslide hazards: A comparative assessment of the efficacy of support vector machines, artificial neural networks, kernel logistic regression, and logistic model tree. Landslides 2016, 13, 361–378. [Google Scholar] [CrossRef]
  26. Pradhan, B.; Lee, S. Regional landslide susceptibility analysis using back-propagation neural network model at Cameron Highland, Malaysia. Landslides 2010, 7, 13–30. [Google Scholar] [CrossRef]
  27. Elkadiri, R.; Sultan, M.; Youssef, A.M.; Elbayoumi, T.; Chase, R.; Bulkhi, A.B.; Al-Katheeri, M.M. A remote sensing-based approach for debris-flow susceptibility assessment using artificial neural networks and logistic regression modeling. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2014, 7, 4818–4835. [Google Scholar] [CrossRef]
  28. Gorsevski, P.V.; Brown, M.K.; Panter, K.; Onasch, C.M.; Simic, A.; Snyder, J. Landslide detection and susceptibility mapping using LiDAR and an artificial neural network approach: A case study in the Cuyahoga Valley National Park, Ohio. Landslides 2016, 13, 467–484. [Google Scholar] [CrossRef]
  29. Arabameri, A.; Chen, W.; Loche, M.; Zhao, X.; Li, Y.; Lombardo, L.; Cerda, A.; Pradhan, B.; Bui, D.T. Comparison of machine learning models for gully erosion susceptibility mapping. Geosci. Front. 2020, 11, 1609–1620. [Google Scholar] [CrossRef]
  30. Sabokbar, H.F.; Roodposhti, M.S.; Tazik, E. Landslide susceptibility mapping using geographically-weighted principal component analysis. Geomorphology 2014, 226, 15–24. [Google Scholar] [CrossRef]
  31. Xiong, Y.; Zhou, Y.; Wang, F.; Wang, S.; Wang, J.; Ji, J.; Wang, Z. Landslide susceptibility mapping using ant colony optimization strategy and deep belief network in Jiuzhaigou Region. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2021, 14, 11042–11057. [Google Scholar] [CrossRef]
  32. Kavzoglu, T.; Sahin, E.K.; Colkesen, I. Landslide susceptibility mapping using GIS-based multi-criteria decision analysis, support vector machines, and logistic regression. Landslides 2014, 11, 425–439. [Google Scholar] [CrossRef]
  33. Youssef, K.; Shao, K.; Moon, S.; Bouchard, L.-S. Landslide susceptibility modeling by interpretable neural network. Commun. Earth Environ. 2023, 4, 162. [Google Scholar] [CrossRef]
  34. Tsangaratos, P.; Ilia, I. Comparison of a logistic regression and Naïve Bayes classifier in landslide susceptibility assessments: The influence of models complexity and training dataset size. Catena 2016, 145, 164–179. [Google Scholar] [CrossRef]
  35. Khosravi, K.; Sartaj, M.; Tsai, F.T.-C.; Singh, V.P.; Kazakis, N.; Melesse, A.M.; Prakash, I.; Bui, D.T.; Pham, B.T. A comparison study of DRASTIC methods with various objective methods for groundwater vulnerability assessment. Sci. Total Environ. 2018, 642, 1032–1049. [Google Scholar] [CrossRef]
  36. Hussain, M.A.; Chen, Z.; Wang, R.; Shoaib, M. PS-InSAR-based validated landslide susceptibility mapping along Karakorum Highway, Pakistan. Remote Sens. 2021, 13, 4129. [Google Scholar] [CrossRef]
  37. Wang, Y.; Hong, H.; Chen, W.; Li, S.; Panahi, M.; Khosravi, K.; Shirzadi, A.; Shahabi, H.; Panahi, S.; Costache, R. Flood susceptibility mapping in Dingnan County (China) using adaptive neuro-fuzzy inference system with biogeography based optimization and imperialistic competitive algorithm. J. Environ. Manag. 2019, 247, 712–729. [Google Scholar] [CrossRef]
  38. Chen, W.; Hong, H.; Panahi, M.; Shahabi, H.; Wang, Y.; Shirzadi, A.; Pirasteh, S.; Alesheikh, A.A.; Khosravi, K.; Panahi, S. Spatial prediction of landslide susceptibility using gis-based data mining techniques of anfis with whale optimization algorithm (woa) and grey wolf optimizer (gwo). Appl. Sci. 2019, 9, 3755. [Google Scholar] [CrossRef]
  39. Wang, Y.; Fang, Z.; Wang, M.; Peng, L.; Hong, H. Comparative study of landslide susceptibility mapping with different recurrent neural networks. Comput. Geosci. 2020, 138, 104445. [Google Scholar] [CrossRef]
  40. Zaremba, W.; Sutskever, I.; Vinyals, O. Recurrent neural network regularization. arXiv 2014, arXiv:1409.2329. [Google Scholar]
  41. Krizhevsky, A.; Sutskever, I.; Hinton, G.E. Imagenet classification with deep convolutional neural networks. Adv. Neural Inf. Process. Syst. 2012, 25, 1097–1105. [Google Scholar] [CrossRef]
  42. Du, Y.; Wang, W.; Wang, L. Hierarchical recurrent neural network for skeleton based action recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Boston, MA, USA, 7–12 June 2015; pp. 1110–1118. [Google Scholar]
  43. Wang, Y.; Fang, Z.; Hong, H.; Peng, L. Flood susceptibility mapping using convolutional neural network frameworks. J. Hydrol. 2020, 582, 124482. [Google Scholar] [CrossRef]
  44. Bui, D.T.; Hoang, N.-D.; Martínez-Álvarez, F.; Ngo, P.-T.T.; Hoa, P.V.; Pham, T.D.; Samui, P.; Costache, R. A novel deep learning neural network approach for predicting flash flood susceptibility: A case study at a high frequency tropical storm area. Sci. Total Environ. 2020, 701, 134413. [Google Scholar]
  45. Pradhan, B.; Lee, S.; Dikshit, A.; Kim, H. Spatial flood susceptibility mapping using an explainable artificial intelligence (XAI) model. Geosci. Front. 2023, 14, 101625. [Google Scholar] [CrossRef]
  46. Shi, X.; Liao, M.; Li, M.; Zhang, L.; Cunningham, C. Wide-area landslide deformation mapping with multi-path ALOS PALSAR data stacks: A case study of three gorges area, China. Remote Sens. 2016, 8, 136. [Google Scholar] [CrossRef]
  47. Hussain, M.A.; Chen, Z.; Shoaib, M.; Shah, S.U.; Khan, J.; Ying, Z. Sentinel-1A for monitoring land subsidence of coastal city of Pakistan using Persistent Scatterers In-SAR technique. Sci. Rep. 2022, 12, 5294. [Google Scholar] [CrossRef]
  48. Scaioni, M.; Longoni, L.; Melillo, V.; Papini, M. Remote sensing for landslide investigations: An overview of recent achievements and perspectives. Remote Sens. 2014, 6, 9600–9652. [Google Scholar] [CrossRef]
  49. Schlögel, R.; Doubre, C.; Malet, J.-P.; Masson, F. Landslide deformation monitoring with ALOS/PALSAR imagery: A D-InSAR geomorphological interpretation method. Geomorphology 2015, 231, 314–330. [Google Scholar] [CrossRef]
  50. Dai, C.; Li, W.; Wang, D.; Lu, H.; Xu, Q.; Jian, J. Active landslide detection based on Sentinel-1 data and InSAR technology in Zhouqu county, Gansu province, Northwest China. J. Earth Sci. 2021, 32, 1092–1103. [Google Scholar] [CrossRef]
  51. Ferretti, A.; Fumagalli, A.; Novali, F.; Prati, C.; Rocca, F.; Rucci, A. A new algorithm for processing interferometric data-stacks: SqueeSAR. IEEE Trans. Geosci. Remote Sens. 2011, 49, 3460–3470. [Google Scholar] [CrossRef]
  52. Ali, M.; Shahzad, M.I.; Nazeer, M.; Kazmi, J.H. Estimation of surface deformation due to Pasni earthquake using SAR interferometry. Int. Arch. Photogramm. Remote Sens. Spat. Inf. Sci. 2018, 42, 23–29. [Google Scholar] [CrossRef]
  53. Mora, O.; Mallorqui, J.J.; Broquetas, A. Linear and nonlinear terrain deformation maps from a reduced set of interferometric SAR images. IEEE Trans. Geosci. Remote Sens. 2003, 41, 2243–2253. [Google Scholar] [CrossRef]
  54. Strozzi, T.; Wegmuller, U.; Keusen, H.R.; Graf, K.; Wiesmann, A. Analysis of the terrain displacement along a funicular by SAR interferometry. IEEE Geosci. Remote Sens. Lett. 2006, 3, 15–18. [Google Scholar] [CrossRef]
  55. Jiaxuan, H.; Mowen, X.; Atkinson, P. Dynamic susceptibility mapping of slow-moving landslides using PSInSAR. Int. J. Remote Sens. 2020, 41, 7509–7529. [Google Scholar] [CrossRef]
  56. Ciampalini, A.; Raspini, F.; Lagomarsino, D.; Catani, F.; Casagli, N. Landslide susceptibility map refinement using PSInSAR data. Remote Sens. Environ. 2016, 184, 302–315. [Google Scholar] [CrossRef]
  57. Lu, P.; Stumpf, A.; Kerle, N.; Casagli, N. Object-oriented change detection for landslide rapid mapping. IEEE Geosci. Remote Sens. Lett. 2011, 8, 701–705. [Google Scholar] [CrossRef]
  58. Righini, G.; Pancioli, V.; Casagli, N. Updating landslide inventory maps using Persistent Scatterer Interferometry (PSI). Int. J. Remote Sens. 2012, 33, 2068–2096. [Google Scholar] [CrossRef]
  59. Searle, M.; Khan, M.A.; Fraser, J.; Gough, S.; Jan, M.Q. The tectonic evolution of the Kohistan-Karakoram collision belt along the Karakoram Highway transect, north Pakistan. Tectonics 1999, 18, 929–949. [Google Scholar] [CrossRef]
  60. Su, X.-J.; Zhang, Y.; Meng, X.-M.; Yue, D.-X.; Ma, J.-H.; Guo, F.-Y.; Zhou, Z.-Q.; Rehman, M.U.; Khalid, Z.; Chen, G. Landslide mapping and analysis along the China-Pakistan Karakoram Highway based on SBAS-InSAR detection in 2017. J. Mt. Sci. 2021, 18, 2540–2564. [Google Scholar] [CrossRef]
  61. Guzzetti, F.; Carrara, A.; Cardinali, M.; Reichenbach, P. Landslide hazard evaluation: A review of current techniques and their application in a multi-scale study, Central Italy. Geomorphology 1999, 31, 181–216. [Google Scholar] [CrossRef]
  62. Galli, M.; Ardizzone, F.; Cardinali, M.; Guzzetti, F.; Reichenbach, P. Comparing landslide inventory maps. Geomorphology 2008, 94, 268–289. [Google Scholar] [CrossRef]
  63. Hussain, M.A.; Chen, Z.; Kalsoom, I.; Asghar, A.; Shoaib, M. Landslide susceptibility mapping using machine learning algorithm: A case study along Karakoram Highway (KKH), Pakistan. J. Indian Soc. Remote Sens. 2022, 50, 849–866. [Google Scholar] [CrossRef]
  64. Zeng, T.; Wu, L.; Peduto, D.; Glade, T.; Hayakawa, Y.S.; Yin, K. Ensemble learning framework for landslide susceptibility mapping: Different basic classifier and ensemble strategy. Geosci. Front. 2023, 101645. [Google Scholar] [CrossRef]
  65. Zhao, F.; Meng, X.; Zhang, Y.; Chen, G.; Su, X.; Yue, D. Landslide susceptibility mapping of karakorum highway combined with the application of SBAS-InSAR technology. Sensors 2019, 19, 2685. [Google Scholar] [CrossRef] [PubMed]
  66. Hussain, M.A.; Chen, Z.; Wang, R.; Shah, S.U.; Shoaib, M.; Ali, N.; Xu, D.; Ma, C. Landslide susceptibility mapping using machine learning algorithm. Civ. Eng. J 2022, 8, 209–224. [Google Scholar] [CrossRef]
  67. Pradhan, B.; Seeni, M.I.; Nampak, H. Integration of LiDAR and QuickBird data for automatic landslide detection using object-based analysis and random forests. Laser Scanning Appl. Landslide Assess. 2017, 69–81. [Google Scholar] [CrossRef]
  68. Pourghasemi, H.R.; Gokceoglu, C. Spatial Modeling in GIS and R for Earth and Environmental Sciences; Elsevier: Amsterdam, The Netherlands, 2019. [Google Scholar]
  69. Hong, H.; Liu, J.; Zhu, A.-X. Modeling landslide susceptibility using LogitBoost alternating decision trees and forest by penalizing attributes with the bagging ensemble. Sci. Total Environ. 2020, 718, 137231. [Google Scholar] [CrossRef] [PubMed]
  70. Arabameri, A.; Saha, S.; Mukherjee, K.; Blaschke, T.; Chen, W.; Ngo, P.T.T.; Band, S.S. Modeling spatial flood using novel ensemble artificial intelligence approaches in northern Iran. Remote Sens. 2020, 12, 3423. [Google Scholar] [CrossRef]
  71. Jenks, G.F.; Caspall, F.C. Error on choroplethic maps: Definition, measurement, reduction. Ann. Assoc. Am. Geogr. 1971, 61, 217–244. [Google Scholar] [CrossRef]
  72. Breiman, L. Random forests. Mach. Learn. 2001, 45, 5–32. [Google Scholar] [CrossRef]
  73. Micheletti, N.; Foresti, L.; Robert, S.; Leuenberger, M.; Pedrazzini, A.; Jaboyedoff, M.; Kanevski, M. Machine learning feature selection methods for landslide susceptibility mapping. Math. Geosci. 2014, 46, 33–57. [Google Scholar] [CrossRef]
  74. Calle, M.L.; Urrea, V. Stability of Random Forest importance measures. Brief. Bioinform. 2011, 12, 86–89. [Google Scholar] [CrossRef] [PubMed]
  75. Park, S.; Kim, J. Landslide susceptibility mapping based on random forest and boosted regression tree models, and a comparison of their performance. Appl. Sci. 2019, 9, 942. [Google Scholar] [CrossRef]
  76. Joshi, A.; Pradhan, B.; Chakraborty, S.; Behera, M.D. Winter wheat yield prediction in the conterminous United States using solar-induced chlorophyll fluorescence data and XGBoost and random forest algorithm. Ecol. Inform. 2023, 77, 102194. [Google Scholar] [CrossRef]
  77. Bentéjac, C.; Csörgő, A.; Martínez-Muñoz, G. A comparative analysis of gradient boosting algorithms. Artif. Intell. Rev. 2021, 54, 1937–1967. [Google Scholar] [CrossRef]
  78. Ma, B.; Meng, F.; Yan, G.; Yan, H.; Chai, B.; Song, F. Diagnostic classification of cancers using extreme gradient boosting algorithm and multi-omics data. Comput. Biol. Med. 2020, 121, 103761. [Google Scholar] [CrossRef]
  79. Chen, T.; Guestrin, C. Xgboost: A scalable tree boosting system. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, San Francisco, CA, USA, 13–17 August 2016; pp. 785–794. [Google Scholar]
  80. LeDell, E.; Poirier, S. H2o automl: Scalable automatic machine learning. In Proceedings of the AutoML Workshop at ICML, Vienna, Austria, 17 July 2020. [Google Scholar]
  81. Zhang, W.; Wu, C.; Zhong, H.; Li, Y.; Wang, L. Prediction of undrained shear strength using extreme gradient boosting and random forest based on Bayesian optimization. Geosci. Front. 2021, 12, 469–477. [Google Scholar] [CrossRef]
  82. Merghadi, A.; Yunus, A.P.; Dou, J.; Whiteley, J.; ThaiPham, B.; Bui, D.T.; Avtar, R.; Abderrahmane, B. Machine learning methods for landslide susceptibility studies: A comparative overview of algorithm performance. Earth-Sci. Rev. 2020, 207, 103225. [Google Scholar] [CrossRef]
  83. Akinci, H.; Zeybek, M.; Dogan, S. Evaluation of landslide susceptibility of Şavşat District of Artvin Province (Turkey) using machine learning techniques. In Landslides; IntechOpen: London, UK, 2021. [Google Scholar]
  84. Shin, H.-C.; Roth, H.R.; Gao, M.; Lu, L.; Xu, Z.; Nogues, I.; Yao, J.; Mollura, D.; Summers, R.M. Deep convolutional neural networks for computer-aided detection: CNN architectures, dataset characteristics and transfer learning. IEEE Trans. Med. Imaging 2016, 35, 1285–1298. [Google Scholar] [CrossRef]
  85. Xu, S.; Niu, R. Displacement prediction of Baijiabao landslide based on empirical mode decomposition and long short-term memory neural network in Three Gorges area, China. Comput. Geosci. 2018, 111, 87–96. [Google Scholar] [CrossRef]
  86. Hu, B.; Wang, H.-S.; Sun, Y.-L.; Hou, J.-G.; Liang, J. Long-term land subsidence monitoring of Beijing (China) using the small baseline subset (SBAS) technique. Remote Sens. 2014, 6, 3648–3661. [Google Scholar] [CrossRef]
  87. Oliver-Cabrera, T.; Jones, C.E.; Yunjun, Z.; Simard, M. InSAR phase unwrapping error correction for rapid repeat measurements of water level change in wetlands. IEEE Trans. Geosci. Remote Sens. 2021, 60, 1–15. [Google Scholar] [CrossRef]
  88. Xia, Z.; Motagh, M.; Li, T.; Roessner, S. The June 2020 Aniangzhai landslide in Sichuan Province, Southwest China: Slope instability analysis from radar and optical satellite remote sensing data. Landslides 2022, 19, 313–329. [Google Scholar] [CrossRef]
  89. Sataer, G.; Sultan, M.; Emil, M.K.; Yellich, J.A.; Palaseanu-Lovejoy, M.; Becker, R.; Gebremichael, E.; Abdelmohsen, K. Remote sensing application for landslide detection, monitoring along Eastern Lake Michigan (Miami Park, MI). Remote Sens. 2022, 14, 3474. [Google Scholar] [CrossRef]
  90. Zhou, C.; Cao, Y.; Yin, K.; Wang, Y.; Shi, X.; Catani, F.; Ahmed, B. Landslide characterization applying sentinel-1 images and InSAR technique: The muyubao landslide in the three Gorges Reservoir Area, China. Remote Sens. 2020, 12, 3385. [Google Scholar] [CrossRef]
  91. Hussain, M.A.; Chen, Z.; Zheng, Y.; Shoaib, M.; Ma, J.; Ahmad, I.; Asghar, A.; Khan, J. PS-InSAR Based Monitoring of Land Subsidence by Groundwater Extraction for Lahore Metropolitan City, Pakistan. Remote Sens. 2022, 14, 3950. [Google Scholar] [CrossRef]
  92. Khan, J.; Ren, X.; Hussain, M.A.; Jan, M.Q. Monitoring land subsidence using PS-InSAR technique in Rawalpindi and islamabad, Pakistan. Remote Sens. 2022, 14, 3722. [Google Scholar] [CrossRef]
  93. Bui, D.T.; Ngo, P.-T.T.; Pham, T.D.; Jaafari, A.; Minh, N.Q.; Hoa, P.V.; Samui, P. A novel hybrid approach based on a swarm intelligence optimized extreme learning machine for flash flood susceptibility mapping. Catena 2019, 179, 184–196. [Google Scholar] [CrossRef]
  94. Song, Y.; Niu, R.; Xu, S.; Ye, R.; Peng, L.; Guo, T.; Li, S.; Chen, T. Landslide susceptibility mapping based on weighted gradient boosting decision tree in Wanzhou section of the Three Gorges Reservoir Area (China). ISPRS Int. J. Geo-Inf. 2018, 8, 4. [Google Scholar] [CrossRef]
  95. Berardino, P.; Fornaro, G.; Lanari, R.; Sansosti, E. A new algorithm for surface deformation monitoring based on small baseline differential SAR interferograms. IEEE Trans. Geosci. Remote Sens. 2002, 40, 2375–2383. [Google Scholar] [CrossRef]
  96. Lanari, R.; Mora, O.; Manunta, M.; Mallorquí, J.J.; Berardino, P.; Sansosti, E. A small-baseline approach for investigating deformations on full-resolution differential SAR interferograms. IEEE Trans. Geosci. Remote Sens. 2004, 42, 1377–1386. [Google Scholar] [CrossRef]
  97. Bacha, A.S.; Shafique, M.; van der Werff, H. Landslide inventory and susceptibility modelling using geospatial tools, in Hunza-Nagar valley, northern Pakistan. J. Mt. Sci. 2018, 15, 1354–1370. [Google Scholar] [CrossRef]
  98. Rehman, M.U.; Zhang, Y.; Meng, X.; Su, X.; Catani, F.; Rehman, G.; Yue, D.; Khalid, Z.; Ahmad, S.; Ahmad, I. Analysis of landslide movements using interferometric synthetic aperture radar: A case study in Hunza-Nagar Valley, Pakistan. Remote Sens. 2020, 12, 2054. [Google Scholar] [CrossRef]
  99. Hussain, S.; Pan, B.; Afzal, Z.; Ali, M.; Zhang, X.; Shi, X. Landslide detection and inventory updating using the time-series InSAR approach along the Karakoram Highway, Northern Pakistan. Sci. Rep. 2023, 13, 1–19. [Google Scholar] [CrossRef] [PubMed]
  100. Kulsoom, I.; Hua, W.; Hussain, S.; Chen, Q.; Khan, G.; Shihao, D. SBAS-InSAR based validated landslide susceptibility mapping along the Karakoram Highway: A case study of Gilgit-Baltistan, Pakistan. Sci. Rep. 2023, 13, 3344. [Google Scholar] [CrossRef]
  101. Nhu, V.-H.; Shirzadi, A.; Shahabi, H.; Chen, W.; Clague, J.J.; Geertsema, M.; Jaafari, A.; Avand, M.; Miraki, S.; Talebpour Asl, D. Shallow landslide susceptibility mapping by random forest base classifier and its ensembles in a semi-arid region of Iran. Forests 2020, 11, 421. [Google Scholar] [CrossRef]
  102. Fang, Z.; Wang, Y.; Peng, L.; Hong, H. Integration of convolutional neural network and conventional machine learning classifiers for landslide susceptibility mapping. Comput. Geosci. 2020, 139, 104470. [Google Scholar] [CrossRef]
  103. Ding, A.; Zhang, Q.; Zhou, X.; Dai, B. Automatic recognition of landslide based on CNN and texture change detection. In Proceedings of the 2016 31st Youth Academic Annual Conference of Chinese Association of Automation (YAC), Wuhan, China, 11–13 November 2016; pp. 444–448. [Google Scholar]
  104. Karantanellis, E.; Marinos, V.; Vassilakis, E.; Hölbling, D. Evaluation of machine learning algorithms for object-based mapping of landslide zones using UAV data. Geosciences 2021, 11, 305. [Google Scholar] [CrossRef]
  105. Rashid, B.; Iqbal, J.; Su, L.-J. Landslide susceptibility analysis of Karakoram highway using analytical hierarchy process and scoops 3D. J. Mt. Sci. 2020, 17, 1596–1612. [Google Scholar] [CrossRef]
  106. Hussain, M.L.; Shafique, M.; Bacha, A.S.; Chen, X.-Q.; Chen, H.-Y. Landslide inventory and susceptibility assessment using multiple statistical approaches along the Karakoram highway, northern Pakistan. J. Mt. Sci. 2021, 18, 583–598. [Google Scholar] [CrossRef]
  107. Qing, F.; Zhao, Y.; Meng, X.; Su, X.; Qi, T.; Yue, D. Application of machine learning to debris flow susceptibility mapping along the China–Pakistan Karakoram Highway. Remote Sens. 2020, 12, 2933. [Google Scholar] [CrossRef]
  108. Yi, Y.; Zhang, Z.; Zhang, W.; Jia, H.; Zhang, J. Landslide susceptibility mapping using multiscale sampling strategy and convolutional neural network: A case study in Jiuzhaigou region. Catena 2020, 195, 104851. [Google Scholar] [CrossRef]
  109. Sun, D.; Wen, H.; Wang, D.; Xu, J. A random forest model of landslide susceptibility mapping based on hyperparameter optimization using Bayes algorithm. Geomorphology 2020, 362, 107201. [Google Scholar] [CrossRef]
  110. Bui, D.T.; Tsangaratos, P.; Nguyen, V.-T.; Van Liem, N.; Trinh, P.T. Comparing the prediction performance of a Deep Learning Neural Network model with conventional machine learning models in landslide susceptibility assessment. Catena 2020, 188, 104426. [Google Scholar] [CrossRef]
  111. Peethambaran, B.; Anbalagan, R.; Kanungo, D.; Goswami, A.; Shihabudheen, K. A comparative evaluation of supervised machine learning algorithms for township level landslide susceptibility zonation in parts of Indian Himalayas. Catena 2020, 195, 104751. [Google Scholar] [CrossRef]
Figure 1. Location of the study area. (a) Pakistan, (b) Study area in red outline.
Figure 1. Location of the study area. (a) Pakistan, (b) Study area in red outline.
Remotesensing 15 04703 g001
Figure 2. Regional geological map of the study region, which depicts the fault lines (MKT and MMT) and Geological units in the research area, where CC is Chilas Complex (Mafic and Ultra-mafic rocks), Gm is Gilgit complex metasedimentary rocks, Cv is Chalt group, Pm stands for Permian massive Limestone, HPU is a Hunza plutonic unit, Ka stands for Komila amphibolite Complex, Sg is Sumayar Leucogranite, KB is Kohistan Batholiths, SSm is Shyok Suture Melange, Q is Quaternary deposits, SKm stands for southern Karakoram complex, Pz is Palaeozoic Metasedimentary rocks, Sv is Kohistan Arc sequence, Tr is Triassic massive dolomite and limestone, and Y stands for Yasin group.
Figure 2. Regional geological map of the study region, which depicts the fault lines (MKT and MMT) and Geological units in the research area, where CC is Chilas Complex (Mafic and Ultra-mafic rocks), Gm is Gilgit complex metasedimentary rocks, Cv is Chalt group, Pm stands for Permian massive Limestone, HPU is a Hunza plutonic unit, Ka stands for Komila amphibolite Complex, Sg is Sumayar Leucogranite, KB is Kohistan Batholiths, SSm is Shyok Suture Melange, Q is Quaternary deposits, SKm stands for southern Karakoram complex, Pz is Palaeozoic Metasedimentary rocks, Sv is Kohistan Arc sequence, Tr is Triassic massive dolomite and limestone, and Y stands for Yasin group.
Remotesensing 15 04703 g002
Figure 3. Technical route applied in the research.
Figure 3. Technical route applied in the research.
Remotesensing 15 04703 g003
Figure 4. Landslide inventory categorization derived from movement along KKH (black line) with various colors.
Figure 4. Landslide inventory categorization derived from movement along KKH (black line) with various colors.
Remotesensing 15 04703 g004
Figure 5. LCFs used in the study area.
Figure 5. LCFs used in the study area.
Remotesensing 15 04703 g005
Figure 6. LCFs used in the study area.
Figure 6. LCFs used in the study area.
Remotesensing 15 04703 g006
Figure 7. The structure of CNN-2D is illustrated in a schematic figure.
Figure 7. The structure of CNN-2D is illustrated in a schematic figure.
Remotesensing 15 04703 g007
Figure 8. Data representation for RNN.
Figure 8. Data representation for RNN.
Remotesensing 15 04703 g008
Figure 9. LSM using CNN 2D, RNN, XGBoost, and RF models.
Figure 9. LSM using CNN 2D, RNN, XGBoost, and RF models.
Remotesensing 15 04703 g009
Figure 10. ROC plots of CNN 2D, RNN, XGBoost, and RF models.
Figure 10. ROC plots of CNN 2D, RNN, XGBoost, and RF models.
Remotesensing 15 04703 g010
Figure 11. Displacement velocity along landslides and slope measured by using PS-InSAR applying the ascending and descending orbit data.
Figure 11. Displacement velocity along landslides and slope measured by using PS-InSAR applying the ascending and descending orbit data.
Remotesensing 15 04703 g011
Figure 12. Landslide distribution identified through multi-track Sentinel-1 datasets based on PS-InSAR. LA and LD: the landslide identified from ascending and descending Sentinel-1 datasets.
Figure 12. Landslide distribution identified through multi-track Sentinel-1 datasets based on PS-InSAR. LA and LD: the landslide identified from ascending and descending Sentinel-1 datasets.
Remotesensing 15 04703 g012
Figure 13. PS-InSAR displacement observation and optical image assessment of identified and interpreted landslides.
Figure 13. PS-InSAR displacement observation and optical image assessment of identified and interpreted landslides.
Remotesensing 15 04703 g013
Figure 14. Displacement velocity along landslides and slope measured by using SBAS-InSAR applying the ascending and descending orbit data.
Figure 14. Displacement velocity along landslides and slope measured by using SBAS-InSAR applying the ascending and descending orbit data.
Remotesensing 15 04703 g014
Figure 15. Landslide distribution identified through multi-track Sentinel-1 datasets based on SBAS-InSAR. LA and LD: the landslide detected by using ascending and descending Sentinel-1 dataset.
Figure 15. Landslide distribution identified through multi-track Sentinel-1 datasets based on SBAS-InSAR. LA and LD: the landslide detected by using ascending and descending Sentinel-1 dataset.
Remotesensing 15 04703 g015
Figure 16. SBAS-InSAR displacement observation and the optical image assessment of identified and interpreted landslides.
Figure 16. SBAS-InSAR displacement observation and the optical image assessment of identified and interpreted landslides.
Remotesensing 15 04703 g016
Table 1. Datasets used in PS-InSAR and SBAS-InSAR analysis.
Table 1. Datasets used in PS-InSAR and SBAS-InSAR analysis.
Data InformationAscendingDescending
Product typeSentinel 1 SLC
Acquisition modeIW
PolarizationVV
Wavelength (m)0.056
No of images6360
Time periodJune 2021–June 2023
Frame114473
Track 100107
Coverage (km2)250
Incident angleHorizontal (~45°) to vertical (~23°)
Azimuth resolution and range (m)5 × 20
Table 2. List of landslide causative variables used in the research.
Table 2. List of landslide causative variables used in the research.
S.NOVariablesSourcesDescription/Extraction
1Aspect, Elevation, Slope, Curvature, Plan Curvature, Profile Curvature, TWI, Distance to River, Roughness, Digital Elevation Model ALOS-PALSAR-DEM
(https://search.asf.alaska.edu, accessed on 10 February 2023)
2Lithology, Distance to Fault, Distance to roadGeological MapGeological Survey of Pakistan
3LandcoverSentinel-2 imagesLand use/Landcover
(https://earthexplorer.usgs.gov/, accessed on 10 February 2023)
4SoilSoil mapFood and Agricultural Organization (FAO) website
(http://www.fao.org/soils-portal/data-hub/soil-maps-and-databases/en/, accessed on 10 March 2023)
5PrecipitationGIOVANNI(https://giovanni.gsfc.nasa.gov/giovanni/, accessed on 15 March 2023)
Table 3. List of parameters used in random forest model.
Table 3. List of parameters used in random forest model.
ParametersValues
Node size14
mtry06
ntree500
Table 4. List of parameters used in extreme gradient boosting model.
Table 4. List of parameters used in extreme gradient boosting model.
ParametersValues
nround210
subsample1
colsample_bytree0.75
max_depth6
gamma0.01
eta0.05
Table 5. The key points of RF and XGBoost models.
Table 5. The key points of RF and XGBoost models.
RFXGBoost
Bagging ensemble methodBoosting ensemble method
Bagging-based algorithm where only a subset of features are selected at random to build a forest or collection of decision trees Gradient boosting employs gradient descent algorithm to minimize errors in sequential models
Reduce risk overfittingRegularization for avoiding overfitting
Maintain precision when a large proportion of data is missing.Efficient handling of missing data
Time-consuming process Less time-consuming process
Table 6. List of parameters used in CNN-2D model.
Table 6. List of parameters used in CNN-2D model.
ParametersValues
Batch size8
Epoch250
Dropout0.5
Learn rate0.002
Activation functionReLU
OptimizerAdam
Table 7. List of parameters used in RNN model.
Table 7. List of parameters used in RNN model.
ParametersValues
Batch size64
Epoch50
Dropout0
Learn rate0.001
OptimizerAdam
Table 8. The key points of CNN-2D and RNN models.
Table 8. The key points of CNN-2D and RNN models.
CNN 2DRNN
BasicsMost popular type of neural networksMost advanced and complex neural network
Structural layoutStructure is based on multiple layers of nodes including one or two conventional layersInformation flows in different direction, which gives it its self-learning feature and memory
Spatial recognition YesNo
Recurrent connection NoYes
DrawbackLarge training data required Slow and complex training and gradient concern
Table 9. Multicollinearity assessment of the LCFs.
Table 9. Multicollinearity assessment of the LCFs.
S.No.VariablesCollinearity Statistics
TOLVIF
1Aspect0.9821.017
2Landcover0.8261.209
3Rainfall0.2044.892
4Geology0.5491.821
5TWI0.6551.524
6Soil0.2843.509
7Slope0.5311.881
8Distance to Fault0.9791.021
9Distance to Road0.7831.275
10Distance to River0.6411.432
11Roughness0.6371.542
12Elevation0.3752.662
13Profile Curvature0.8001.249
14Plan Curvature0.9221.084
15Curvature0.7791.282
Table 10. Confusion matrix of CNN 2D, RNN, XGBoost, and RF models.
Table 10. Confusion matrix of CNN 2D, RNN, XGBoost, and RF models.
ModelsObservationPredictedPrecision
NoYes
CNN 2DNo465383.61
Yes127872
RNNNo384983.24
Yes135876
Extreme Gradient BoostingNo415583.01
Yes132870
Random ForestNo365882.24
Yes137876
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Hussain, M.A.; Chen, Z.; Zheng, Y.; Zhou, Y.; Daud, H. Deep Learning and Machine Learning Models for Landslide Susceptibility Mapping with Remote Sensing Data. Remote Sens. 2023, 15, 4703. https://doi.org/10.3390/rs15194703

AMA Style

Hussain MA, Chen Z, Zheng Y, Zhou Y, Daud H. Deep Learning and Machine Learning Models for Landslide Susceptibility Mapping with Remote Sensing Data. Remote Sensing. 2023; 15(19):4703. https://doi.org/10.3390/rs15194703

Chicago/Turabian Style

Hussain, Muhammad Afaq, Zhanlong Chen, Ying Zheng, Yulong Zhou, and Hamza Daud. 2023. "Deep Learning and Machine Learning Models for Landslide Susceptibility Mapping with Remote Sensing Data" Remote Sensing 15, no. 19: 4703. https://doi.org/10.3390/rs15194703

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop