Advancing Landslide Susceptibility Mapping in the Medea Region Using a Hybrid Metaheuristic ANFIS Approach

Debiche, Fatiha; Benbouras, Mohammed Amin; Petrisor, Alexandru-Ionut; Baba Ali, Lyes Mohamed; Leghouchi, Abdelghani

doi:10.3390/land13060889

by Fatiha Debiche ¹, Mohammed Amin Benbouras ¹, Alexandru-Ionut Petrisor ²,³,⁴,⁵,*, Lyes Mohamed Baba Ali ⁶ and Abdelghani Leghouchi ⁷

¹ Structure and Materials Department, University of Science and Technology Houari Boumediene, Algiers 16024, Algeria

² Doctoral School of Urban Planning, Ion Mincu University of Architecture and Urbanism, 10014 Bucharest, Romania

³ Department of Architecture, Faculty of Architecture and Urban Planning, Technical University of Moldova, 2004 Chisinau, Moldova

⁴ National Institute for Research and Development in Constructions, Urbanism and Sustainable Spatial Development URBAN-INCERC, 21652 Bucharest, Romania

⁵ National Institute for Research and Development in Tourism, 50741 Bucharest, Romania

⁶ Faculty of Earth Sciences, Geography and Territorial Planning, University of Science and Technology Houari Boumediene, Algiers 16024, Algeria

⁷ Civil Engineering Department, Mohammed Seddik Benyahia University, Jijel 18000, Algeria

* Author to whom correspondence should be addressed.

Land 2024, 13(6), 889; https://doi.org/10.3390/land13060889

Submission received: 12 April 2024 / Revised: 16 June 2024 / Accepted: 19 June 2024 / Published: 19 June 2024

(This article belongs to the Special Issue Remote Sensing Application in Landslide Detection and Assessment)

Abstract

Landslides pose significant risks to human lives and infrastructure. The Medea region in Algeria is particularly susceptible to these destructive events, which result in substantial economic losses. Despite this vulnerability, a comprehensive landslide map for this region is lacking. This study aims to develop a novel hybrid metaheuristic model for the spatial prediction of landslide susceptibility in Medea, combining the Adaptive Neuro-Fuzzy Inference System (ANFIS) with four novel optimization algorithms: Genetic Algorithm (GA), Particle Swarm Optimization (PSO), Harris Hawks Optimization (HHO), and Salp Swarm Algorithm (SSA). The modeling phase was initiated by using a database comprising 160 landslide occurrences, derived from Google Earth imagery and field surveys, and eight conditioning factors (lithology, slope, elevation, distance to stream, land cover, precipitation, slope aspect, and distance to road). Afterward, the Gamma Test (GT) method was used to optimize the selection of input variables. Subsequently, the optimal inputs were modeled using hybrid metaheuristic ANFIS techniques, and their performance was evaluated using four relevant statistical indicators. The comparative assessment demonstrated the superior predictive capabilities of the ANFIS-HHO model compared to the other models. These results facilitated the creation of an accurate susceptibility map, aiding land use managers and decision-makers in effectively mitigating landslide hazards in the study region and in similar regions across the world.

Keywords: Adaptive Neuro-Fuzzy Inference System; hybrid metaheuristic optimization algorithms; landslide susceptibility; geographical information system; K cross-validation approach

1. Introduction

Landslides are a multifarious phenomenon, representing a substantial global hazard and endangering human lives, infrastructure, and transportation networks [1,2]. Annually, slope failures incur substantial economic losses, both directly and indirectly, potentially accounting for losses of millions of dollars [3,4]. Over the past two decades, researchers have increasingly recognized a worldwide landslide susceptibility map as a valuable tool for identifying areas vulnerable to landslide risk in urban or rural settings [3]. Such maps are particularly essential in the Medea region, where the specific geological, climatic, and physiographic conditions make it prone to landslide occurrences. Moreover, human activities indirectly contribute to landslide risks due to significant population growth, prompting the government to expand the road network and urbanize the region, thereby placing more infrastructures in landslide-prone areas. The construction of such an infrastructure typically involves excavation and tunneling activities, disrupting the natural equilibrium of soils and potentially inducing slope instability [1,4]. The landslide that occurred in January 2014 within the Jebel El Ouahch Tunnel in Constantine stands as a prominent example of such cases, directly linked to tunneling activities [1]. As a consequence, the tunnel was closed, leading to a modification in the alignment of the A1 highway corridor [1]. Hence, a landslide susceptibility map is a crucial tool in landslide hazard management, providing valuable information for local governments in order to develop master plans and derive solutions aimed at mitigating the catastrophic consequences of landslides. Such maps facilitate the development of appropriate planning and decision-making tools to address landslide risks effectively [3,5,6].

Since the 1970s, numerous approaches have been used to develop methodologies for predicting landslide susceptibility and producing corresponding maps [7,8]. Such maps serve as valuable tools for identifying regions vulnerable to landslide hazards. The approaches employed for landslide susceptibility mapping can be categorized into three main types: qualitative, quantitative, and semi-quantitative methods [9,10]. Qualitative methods are straightforward approaches that primarily rely on direct field measurements and on the knowledge and experience of experts. Quantitative methods, on the other hand, are regarded as rigorous and objective approaches that use statistical and mathematical techniques to analyze data [10]. Quantitative methods are used to mitigate the subjectivity inherent to landslide susceptibility assessments by integrating geotechnical and statistical models. Furthermore, new hybrid methods have been introduced in the literature. These methods merge qualitative and quantitative approaches to assess the importance of the input parameters in generating landslide hazard maps, and are commonly called semi-quantitative methods [11]. The ease of use and effectiveness of these three families of methods have rendered them popular and valuable, owing to their straightforward representation of the dependent variable (i.e., landslide susceptibility) and independent variables (i.e., its drivers) [12,13]. To the best of the authors’ knowledge, landslide inventories and heuristic methods stand out as the most frequently employed qualitative approaches [14]. The primary drawback of qualitative methods lies in their subjectivity, which stems from ranking landslide predisposing factors according to individual expertise [14].

Similarly, quantitative methods can be categorized into several subclasses, including statistical, deterministic, and machine learning approaches [10]. Deterministic methods, also called geotechnical methods, have seen extensive applications in the literature. These approaches rely on geotechnical parameters determined on-site, coupled with the engineering principles of slope instability, typically expressed through a safety factor. However, deterministic methods tend to overlook climatic and anthropogenic factors [14]. The primary limitation of deterministic methods is their reliance on comprehensive geotechnical and hydrological data, which can be challenging to gather for large areas [14]. Moreover, these methods are generally applicable only to mapping small areas [15]. Typically, statistical methods aim to predict the relationship between historical landslides and their drivers through bivariate or multivariate techniques [10]. Logistic regression, weight of evidence, and analytical hierarchy process methods are among the statistical approaches commonly employed for modeling landslide susceptibility. However, a key criticism of statistical methods is their requirement for the drivers to follow a normal distribution, which may not always be true under real world conditions. Additionally, these methods inherently assume linearity [16] and rely on simplified assumptions, such as linear behavior or production heuristics, which can limit the effectiveness of statistical methods in modeling complex nonlinear phenomena [17,18].

To address these limitations, machine learning models have been proposed for landslide susceptibility mapping. These models leverage sophisticated algorithms to model the intricate nonlinear relationships by analyzing the conditioning factors of both landslide and non-landslide locations [10]. Machine learning is an algorithmic approach that iteratively learns from the available data to uncover underlying relationships or hidden patterns, thereby constructing accurate analytical models [10,19,20,21]. So far, numerous researchers have employed machine learning methods for modeling landslide susceptibility, with the most commonly utilized approaches in the literature including Artificial Neural Networks [14,22,23,24,25,26,27,28,29,30,31,32,33,34,35,36,37,38], Support Vector Machines [22,26,27,30,39,40,41,42,43,44,45,46], Decision Trees [23,39,41,42,43,47,48,49,50,51,52], Random Forest [30,40,51,52,53,54,55,56,57,58], Adaptive Neuro-Fuzzy Inference System [41,59,60,61,62,63,64,65], and Deep Neural Network [66].

However, certain limitations have been mentioned in the literature with respect to traditional machine learning models. The process of selecting the optimal model can be time-consuming, and these models are prone to issues such as overfitting and underfitting. Moreover, the convergence of algorithms during the training phase relies heavily on the initial complex values, leading to slow training speeds [16]. Recent studies have primarily focused on enhancing traditional machine learning methods through the use of hybrid metaheuristic algorithms [67,68]. These approaches serve as a prevalent solution to address the challenges encountered by machine learning algorithms. The primary advantage of integrating metaheuristic algorithms with machine learning methods is the enhancement of convergence during the learning phase toward the optimal solution. However, despite this benefit, hybrid machine learning methods have received relatively little attention in the context of landslide susceptibility mapping.

Building upon the background information provided, the current research aims to develop a novel advanced hybrid machine learning model designed to assess landslide susceptibility effectively. This model is subsequently integrated into a geographical information system (GIS), resulting in the generation of an accurate map highlighting landslide-prone areas. The proposed map serves as a valuable tool for decision-makers and land use managers in the Medea area, which is known to be highly vulnerable to landslides, helping them to mitigate landslide hazards, and the approach has the potential to be used in other regions of the world. The study contributes novel hybrid metaheuristic machine learning methods, which combine ANFIS with the Genetic Algorithm, Particle Swarm Optimization, Harris Hawks Optimization, and Salp Swarm Algorithm, to enhance landslide susceptibility mapping accuracy. Furthermore, K-fold cross-validation has been employed to ensure that the models generalize well and are less prone to overfitting and underfitting. Additionally, the research fills a significant gap in the literature by focusing on the understudied region of Medea and providing valuable insights into landslide susceptibility in this vulnerable area.

2. Materials and Methods

2.1. Case Study

The study area is located in the central portion of the Tellian Atlas and is characterized by high altitude and rugged terrain enclosing some fairly fertile plains that gradually fade into the borders of the high steppe plains, forming a series of rolling hills. Such a strategic position has made Medea a major transit zone and a link between the Tell and the Sahara, on one hand, and between the Eastern and Western High Plateaus, on the other. The province of Medea is located 88 km southwest of the capital, Algiers (see Figure 1). It covers an area of 8775.65 km² and is bounded by longitudes of 2°07′54″ and 3°42′13″ east and latitudes of 35°25′28″ and 36°30′02″ north. The total population of the province is estimated at 861,204 inhabitants (2011), with a density of 99 inhabitants per km². The province of Medea, consisting of 64 municipalities, has important thermal springs and tourist sites. It has a total agricultural area of around 773,541 hectares and adequately watered terrain. Additionally, pastoral activity is practiced over more than 200,000 hectares of pasture located in the southern zone of the province. Medea has a semi-continental Mediterranean climate: cold and humid in winter, temperate in spring, and hot and dry in summer.

From a geological standpoint, previous studies have demonstrated that the region of Medea exhibits heterogeneous characteristics, which is the reason for selecting it as the study area. Towards the northern part of the Wilaya (region), sedimentary terrains from the Senonian, Cenomanian, and Albian periods prevail. This area is notably rugged, featuring mountains cloaked in forests, where Chiffa shales and cargneule gypsums are commonly found. Rich copper deposits are also present in this region, contributing to its raw materials, particularly evident in the Mouzaia Mines (currently known as Tamezguida) located in the northwest. Additionally, a scree of sandstone formations of Upper Miocene can be found in Djebel Nador. Moving towards the southwestern part of the province, marls from the Cretaceous period and eruptive rocks like basaltic tuffs dominate the landscape. Recent alluvial deposits are observed along the wadis, while gravelly alluvium characterizes the plateaus of Ben Slimane. In the southern region of the province, gypsum clays, red clays, and sandstones are prevalent. Published studies have identified lithostratigraphic formations, including post-nappe Neogene deposits, as well as Quaternary terrains that unconformably overlay the Albian and Cenomanian formations.

Past landslide locations provide valuable insights into the spatial patterns of landslide occurrences, aiding in landslide susceptibility zoning. They offer crucial information about past landslide behaviors and their relationship to causative factors. Therefore, creating a comprehensive landslide inventory is a critical step in a landslide susceptibility assessment. Many researchers generate landslide inventories using high-resolution remote sensing data or aerial photograph interpretation [69,70,71]. Given the frequent activation of landslides during the rainy season in this area, Google Earth imagery was used to capture these rainfall-triggered landslides [69,70,71].

In this study, the landslide inventory map was prepared by digitizing Google Earth imagery, with the locations subsequently verified in the field. The data encompass events that occurred between 2009 and 2023, resulting in a total of 160 delineated landslides, which were converted into raster format. Of these landslides, 80% were assigned to the training dataset and the remaining 20% to the validation dataset (see Figure 2). Fieldwork and analysis revealed that most landslides are concentrated in the northern part of the Medea Wilaya. This can be explained by two primary factors. First, torrential rainfall, particularly during tropical storms, acts as a key trigger in the north. Second, the region’s northern terrain features steep slopes, further contributing to landslide susceptibility. These slopes become susceptible to activation due to soil saturation, which reduces the geo-mechanical strength parameters of the soil.

2.2. Overview of the Methodology

To present the most appropriate model for assessing landslide susceptibility and developing the target map, the methodology followed these steps:

Detection of Historical Landslide Locations and Identification of Conditioning Factors: Using Google Earth images and multiple field surveys, 160 landslide sites were identified in the study area. Additionally, eight drivers were selected based on the literature, expert knowledge, and characteristics of the area.
Construction of Database: A database was constructed containing logical scales of landslide conditioning factors and target values.
Optimal Input Variable Selection: The Gamma Test technique (GT) was employed to analyze the database and select the optimal input variables.
Modeling with Hybrid Machine Learning Methods: The selected optimal inputs were modeled using novel hybrid machine learning methods: ANFIS-GA, ANFIS-PSO, ANFIS-HHO, and ANFIS-SSA. Landslide locations were divided into 128 landslide training sites (80%) and 32 landslide validation sites (20%).
Integrating metaheuristic algorithms with ANFIS: Metaheuristic algorithms (GA, PSO, HHO, and SSA) were used in conjunction with ANFIS to enhance the accuracy and reliability of landslide susceptibility assessments. This hybrid approach enabled us to effectively capture the complex relationships between inputs and target (landslide susceptibility), resulting in a more precise and robust susceptibility map of the study area.
Selection of the Most Appropriate Model: Various statistical performance indicators such as sensitivity, specificity, precision, accuracy, and the Pearson correlation coefficient (R) were used to determine the most suitable model for predicting landslide susceptibility among the proposed models.
Evaluation of Predictive Capability: The predictive capability of the optimal model was assessed using K-fold cross-validation, with K = 5 to address underfitting and overfitting issues. This approach helps ensure that the models are better generalized and less prone to overfitting and underfitting.
Integration into ArcGIS and Target Map Preparation: The most appropriate model was integrated into ArcGIS, and the optimal input maps were divided into 30 × 30 m cells using the fishnet tool, which is available in ArcToolbox. The ANFIS parameters of the best model were applied to analyze the optimal input layers and provide landslide susceptibility classes for each cell, resulting in the preparation of the landslide susceptibility map for the study area.

The methodology used to identify the most suitable model and create the target map is thoroughly explained in Figure 3.
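The data partition and evaluation steps listed above can be sketched in a few lines. This is a minimal illustration under stated assumptions, not the authors' implementation: the function names (`split_indices`, `k_fold_indices`, `classification_metrics`), the random shuffling, and the seed are hypothetical, and only NumPy is used. The metric formulas are the standard confusion-matrix definitions of sensitivity, specificity, precision, and accuracy.

```python
import numpy as np


def split_indices(n_samples, train_fraction=0.8, seed=0):
    """Shuffle sample indices and split them into training and validation sets."""
    rng = np.random.default_rng(seed)
    order = rng.permutation(n_samples)
    n_train = int(round(train_fraction * n_samples))
    return order[:n_train], order[n_train:]


def k_fold_indices(indices, k=5):
    """Yield (train, test) index arrays for K-fold cross-validation."""
    folds = np.array_split(np.asarray(indices), k)
    for i in range(k):
        test = folds[i]
        train = np.concatenate([folds[j] for j in range(k) if j != i])
        yield train, test


def classification_metrics(y_true, y_pred):
    """Confusion-matrix metrics for binary labels (1 = landslide, 0 = non-landslide).

    Assumes every denominator is non-zero, i.e. both classes are present
    and at least one positive prediction is made.
    """
    y_true = np.asarray(y_true).astype(bool)
    y_pred = np.asarray(y_pred).astype(bool)
    tp = int(np.sum(y_true & y_pred))
    tn = int(np.sum(~y_true & ~y_pred))
    fp = int(np.sum(~y_true & y_pred))
    fn = int(np.sum(y_true & ~y_pred))
    return {
        "sensitivity": tp / (tp + fn),
        "specificity": tn / (tn + fp),
        "precision": tp / (tp + fp),
        "accuracy": (tp + tn) / (tp + tn + fp + fn),
    }


# 160 inventoried landslides split 80/20, as described in the text.
train_idx, valid_idx = split_indices(160)
print(len(train_idx), len(valid_idx))  # 128 32
```

With K = 5, `k_fold_indices(train_idx)` yields five train/test partitions, so each training landslide is used for testing exactly once.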

2.3. Thematic Map Layers

Initially, Google Earth images were used to identify potential landslide areas. Subsequently, a series of field trips were conducted to validate the presence of landslides; assess their characteristics, including size, shape, and movement type; conduct site diagnostics; and ascertain the activity status (active, dormant, etc.) of failed slopes. A total of 160 landslide sites were identified, digitized, and rasterized in a GIS with a grid size of 30 × 30 m. This grid size was determined by the resolution of the digital elevation model (DEM) obtained from the Shuttle Radar Topography Mission (SRTM) database, which was used to generate the slope, aspect, and elevation maps. The SRTM 30 digital elevation model is widely used for various geospatial analyses due to its effectiveness and significance, as is well documented in the literature [72]. A hydrographic network map was derived from the DEM using the hydrology toolbox in ArcGIS, and this map was used to create the distance-to-stream map using the “near” tool available in ArcToolbox. Furthermore, the land cover map was derived from the 2022 Sentinel-2 land cover product (10 × 10 m grid) established by the Environmental Systems Research Institute and classified into six classes.

The lithology reflects the physical and mechanical properties of geological formations covering the surface. The lithological map was constructed based on the geological map service of Algeria at a scale of 1:50,000. The precipitation map is classified based on data from the National Agency for Hydraulic Resources of Algeria at a scale of 1:500,000. Finally, the road map was extracted from the open-source Google Maps Road data available on Google Earth, and this map was used to create the distances to roads using the “near” tool available in ArcToolbox. Elevation, slope, and aspect had the same resolution as the DEM file, whereas lithology, land cover, and precipitation maps were converted into raster format files with the same resolution (30 × 30 m). Similarly, the distance to streams and distance to roads were converted into raster format files with a 30 m × 30 m resolution.

It is important to note that machine learning modeling can yield the optimal results when a comprehensive set of input variables is considered. However, it is crucial to include only those inputs that have a significant influence on estimating the target value [15]. For this study, the selected effective drivers fall into three main classes: geological, geomorphic, and vegetation factors, which serve as inputs in the modeling process. Eight landslide conditioning factors were chosen based on a review of the landslide literature, expert knowledge, and the specific characteristics of the study area [1,8,14,15]. These factors are lithology, elevation, slope, land cover, distance to stream, precipitation, slope aspect, and distance to road. Subsequently, the Gamma Test method was employed for the optimal input selection (further explained in Section 2.6). Maps depicting landslide causative factors in the study area are presented in Figures 4–11.

Lithology (Figure 4) refers to the diverse geological formations present in the study area, characterized by their heterogeneity from a geotechnical perspective. It is recognized as one of the primary drivers of landslides. Landslide susceptibility is significantly influenced by lithology, which directly controls the geo-mechanical properties of soil and rock. For example, formations with weaker mechanical properties, such as clay, marl, or highly weathered rocks, are more prone to failure, leading to a higher risk of landslides. Conversely, formations with high shear strength parameters, such as granite or basalt, exhibit a greater resistance to slope failure [1,14,15].
Precipitation (Figure 5) plays a critical role in landslide susceptibility by saturating slopes, increasing the pore water pressure, inducing hydraulic erosion, triggering preexisting instability, and interacting with other drivers [73].
Elevation (Figure 6) is also considered an important driver of landslide occurrence. Often, elevation is indirectly linked to other drivers such as slope, erosion, precipitation, soil thickness, and land use. Additionally, it helps to classify the local terrain morphology by identifying areas of high and low elevation [1,14,15].
Slope (Figure 7) is recognized as the most important driver of landslides [74]. At the local scale, it affects the pore pressure levels. On a larger scale, it regulates the regional hydraulic continuity and is considered the primary driver for GIS-based mapping. The slope angle typically correlates with slope movement and relates closely to the driving forces; higher slope angles indicate greater susceptibility to landslides [1,14,15].
Land cover (Figure 8) is another influential driver of landslide susceptibility, in which vegetation plays a significant role in reducing soil erosion. Varying vegetation in a region is a key factor that notably influences slope stability. Vegetation contributes to enhancing the mechanical soil parameters through root strengthening, which can decrease the occurrence of landslides [1,14,15].
Distance to stream (Figure 9) refers to the proximity of a site to a stream and is also a crucial driver of slope stability. Streams can adversely affect site stability by eroding the slope or saturating the lower part of the slope, leading to an increase in the water levels and a decrease in the soil mechanical parameters [1,14,15].
Distance to the road (Figure 10) has been identified as a significant driver of landslide occurrence by previous studies. Road and highway construction, particularly in mountain areas, where they are built alongside slopes, can disrupt the natural equilibrium of soils, potentially leading to slope instabilities. Conversely, a greater distance from roads reduces the load on both the topography and the toe of the slope, thereby decreasing the likelihood of landslides. Consequently, the road network can be considered a key factor in landslide occurrence [1,14,15].
Slope aspect (Figure 11) is considered by several researchers as an indispensable driver of landslide susceptibility modeling, because it is related to the primary rainfall direction, wind influence, and exposure of slopes to sunlight [1,14,15]. The aspect data for the study area are summarized in Table 1, showing the surface area and percentage for each directional category. The data indicate that south-facing slopes (S) dominate the study area, accounting for 19.19% of the total surface area, followed by north-facing (N) and east-facing (E) slopes, representing 15.93% and 12.16% of the area, respectively. Northwest-facing (NW) slopes have the least coverage (8.74%).
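As an illustration of how aspect angles map to the directional categories summarized in Table 1, the sketch below bins aspect values (degrees clockwise from north) into eight 45° compass sectors. The sector boundaries and the use of negative values for flat cells follow a common GIS convention and are assumptions here; the text does not state the exact class limits used in the study, and `aspect_class` is a hypothetical helper.

```python
import numpy as np

# Eight compass sectors, each 45 degrees wide, starting with north centred on 0 degrees.
DIRECTIONS = np.array(["N", "NE", "E", "SE", "S", "SW", "W", "NW"])


def aspect_class(aspect_deg):
    """Map aspect angles in degrees to compass classes; negative values mean flat."""
    a = np.asarray(aspect_deg, dtype=float)
    # Shift by half a sector so that north covers 337.5-360 and 0-22.5 degrees.
    idx = np.floor(((a % 360.0) + 22.5) / 45.0).astype(int) % 8
    return np.where(a < 0, "Flat", DIRECTIONS[idx])


print(aspect_class([0, 90, 180, 270, 315, -1]).tolist())
# ['N', 'E', 'S', 'W', 'NW', 'Flat']
```

Counting the cells in each class and dividing by the total number of cells gives the kind of per-direction percentages reported in Table 1.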

2.4. ANFIS

ANFIS adopts a hybrid approach by combining an adaptive Artificial Neural Network (ANN) with a Fuzzy Inference System (FIS). Introduced by Jang in 1993 [75], ANFIS has consistently demonstrated its performance in addressing nonlinear problems across various studies. Through the training of input–output data, the ANFIS mechanism identifies the optimal parameters for membership functions (MFs). The ANFIS architecture, illustrated in Figure 12, consists of five layers. In the initial layer, known as fuzzification, the inputs in each node (j) undergo transformation into fuzzy membership functions through an activation function μ. This function can take various forms, such as triangular, trapezoidal, sigmoidal, Gaussian, etc., as described by Equations (1) and (2).

Q_{j}^{1} = \mu_{A_{j}}(x), \quad j = 1, 2

(1)

Q_{j}^{1} = \mu_{B_{j-2}}(y), \quad j = 3, 4

(2)

where x and y are the inputs, Q_{j}^{1} is the membership function, and A_{j} and B_{j} are the membership values of μ. In the current study, the Gaussian membership function presented in Equation (3) is used:

\mu(x) = \exp\left[-\frac{(x - c_{j})^{2}}{2 \sigma_{j}^{2}}\right]

(3)

The parameters c_{j} and σ_{j}, which are the mean and standard deviation of the Gaussian curve, respectively, serve as the premise parameters for the membership functions. In layer 2, the weights (w_{k}) for these membership functions are computed using Equation (4), which determines the firing strength of the rules.

Q_{k}^{2} = w_{k} = \mu_{A_{j}}(x) \times \mu_{B_{j}}(y), \quad k = 1, \dots, 4 \text{ and } j = 1, 2

(4)

Layer 3 normalizes the firing strengths using Equation (5).

Q_{j}^{3} = \bar{w}_{j} = \frac{w_{j}}{\sum_{k = 1}^{4} w_{k}}, \quad j = 1, \dots, 4

(5)

In layer 4, known as the defuzzification layer, the output for each node j is evaluated using Equation (6), where p_{j}, q_{j}, and r_{j} are the consequent parameters of the rule output function f_{j}.

Q_{j}^{4} = \bar{w}_{j} f_{j} = \bar{w}_{j} (p_{j} x + q_{j} y + r_{j}), \quad j = 1, \dots, 4

(6)

The final layer calculates the overall output in a single node by summing up all the incoming signals, as described by Equation (7).

Q^{5} = \sum_{j = 1}^{4} \bar{w}_{j} f_{j} = \frac{\sum_{j = 1}^{4} w_{j} f_{j}}{\sum_{j = 1}^{4} w_{j}}

(7)
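The five layers can be traced in a short forward pass. The sketch below is illustrative only: it assumes two inputs with two Gaussian membership functions each and a grid partition in which every membership function of x is paired with every membership function of y to form the four rules. The function names and the parameter layout are hypothetical, not taken from the study.

```python
import numpy as np


def gaussian_mf(x, c, sigma):
    """Gaussian membership function of Equation (3)."""
    return np.exp(-((x - c) ** 2) / (2.0 * sigma ** 2))


def anfis_forward(x, y, premise_x, premise_y, consequents):
    """Forward pass of a two-input, four-rule first-order Sugeno ANFIS.

    premise_x, premise_y: two (c, sigma) pairs per input (premise parameters).
    consequents: array of shape (4, 3) holding (p, q, r) for each rule.
    """
    # Layer 1: fuzzification of each input.
    mu_a = np.array([gaussian_mf(x, c, s) for c, s in premise_x])
    mu_b = np.array([gaussian_mf(y, c, s) for c, s in premise_y])
    # Layer 2: firing strength of each rule (product of memberships).
    w = np.outer(mu_a, mu_b).ravel()
    # Layer 3: normalized firing strengths.
    w_bar = w / w.sum()
    # Layer 4: weighted linear rule outputs f_j = p_j x + q_j y + r_j.
    p, q, r = np.asarray(consequents, dtype=float).T
    f = p * x + q * y + r
    # Layer 5: overall output as the sum of all incoming signals.
    return float(np.sum(w_bar * f))


premise_x = [(0.0, 1.0), (1.0, 1.0)]
premise_y = [(0.0, 1.0), (1.0, 1.0)]
consequents = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [1.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
print(anfis_forward(0.3, 0.7, premise_x, premise_y, consequents))
```

In a hybrid scheme such as the one described in this paper, the premise parameters (c, σ) and consequent parameters (p, q, r) are the quantities a metaheuristic would tune to minimize the training error.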

2.5. Metaheuristic Algorithms

2.5.1. Genetic Algorithms (GAs)

Devised by John Holland in 1970 [76], Genetic Algorithms are inspired by natural selection and genetic processes. Population renewal depends on the success of the fittest individuals within the species. Starting from an initial population of encoded points, GAs employ three operators: crossover and mutation, which explore the search space by generating new potential solutions, and selection, which guides the population toward the optimal solution to a problem. The decision to terminate the Genetic Algorithm’s execution depends on various criteria tailored to specific problem types. Commonly employed criteria include the convergence of the mean fitness within a population, the time available for a single GA execution, and the maximum number of generations or function evaluations. Because the initial population is generated randomly, a single run may explore only a limited part of the solution domain and may drift away from the optimum during the search. Consequently, GA execution must be repeated multiple times with different sets of random starting points to enhance the probability of detecting the optimal solution.
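The interplay of selection, crossover, and mutation can be sketched with a small real-coded GA. This is a toy illustration under assumptions, not the configuration used in the study: binary tournament selection, arithmetic crossover, Gaussian mutation, elitism, and all rate values are choices made here for brevity. In an ANFIS-GA hybrid, `fitness` would be the training error as a function of the ANFIS parameters; a simple sphere function stands in for it below.

```python
import numpy as np


def genetic_algorithm(fitness, dim, bounds, pop_size=30, generations=100,
                      crossover_rate=0.9, mutation_rate=0.1, seed=0):
    """Minimize `fitness` over a box with a simple real-coded GA (illustrative)."""
    rng = np.random.default_rng(seed)
    low, high = bounds
    pop = rng.uniform(low, high, size=(pop_size, dim))
    best, best_cost = None, np.inf
    for _ in range(generations):
        cost = np.array([fitness(ind) for ind in pop])
        i_best = int(np.argmin(cost))
        if cost[i_best] < best_cost:
            best, best_cost = pop[i_best].copy(), float(cost[i_best])
        # Selection: binary tournament, the fitter of two random individuals wins.
        a = rng.integers(pop_size, size=pop_size)
        b = rng.integers(pop_size, size=pop_size)
        parents = pop[np.where(cost[a] < cost[b], a, b)]
        # Crossover: arithmetic blend of consecutive parent pairs.
        children = parents.copy()
        for i in range(0, pop_size - 1, 2):
            if rng.random() < crossover_rate:
                alpha = rng.random()
                children[i] = alpha * parents[i] + (1 - alpha) * parents[i + 1]
                children[i + 1] = alpha * parents[i + 1] + (1 - alpha) * parents[i]
        # Mutation: Gaussian perturbation applied gene by gene.
        mask = rng.random(children.shape) < mutation_rate
        noise = rng.normal(0.0, 0.1 * (high - low), children.shape)
        pop = np.clip(children + mask * noise, low, high)
        # Elitism (an addition in this sketch): keep the best solution found so far.
        pop[0] = best
    return best, best_cost


best, best_cost = genetic_algorithm(lambda v: float(np.sum(v ** 2)), dim=2,
                                    bounds=(-5.0, 5.0))
print(best, best_cost)
```

Changing `seed` restarts the search from a different random population, which corresponds to the repeated executions recommended above.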

2.5.2. Particle Swarm Optimization (PSO)

Introduced by Kennedy and Eberhart in 1995 [77], the Particle Swarm Optimization (PSO) algorithm is inspired by the social behavior and dynamic interactions observed in nature, such as those among insects, birds, and fish. It integrates individual experiences with social interactions, aiming to obtain an optimal solution. This is achieved by considering the best position encountered by a particle along with that of its neighbors. A predefined fitness function assesses the performance of each particle, quantifying its effectiveness in addressing the optimization problem.

The update equations for Particle Swarm Optimization (PSO) are as follows:

V_{i}^{t + 1} = w V_{i}^{t} + c_{1} r_{1} (p_{best} - X_{i}^{t}) + c_{2} r_{2} (g_{best} - X_{i}^{t})

(8)

X_{i}^{t + 1} = X_{i}^{t} + V_{i}^{t + 1}

(9)

where t is the current iteration, i denotes the ith solution, X_{i}^{t} is the position of the ith solution in the tth iteration, w is the inertia weight, c₁ and c₂ are the acceleration coefficients, r₁ and r₂ are random values, p_{best} is the best solution that the ith particle has obtained so far, g_{best} is the best position of the total swarm, and V_{i}^{t + 1} is the velocity of the ith solution in the (t + 1)th iteration.
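A single PSO iteration following Equations (8) and (9), in the standard form where both attraction terms are added, can be sketched as below. The default values of w, c₁, and c₂ and the choice to draw r₁ and r₂ independently for each particle and dimension are assumptions made for illustration; `pso_step` is a hypothetical helper.

```python
import numpy as np


def pso_step(X, V, p_best, g_best, w=0.7, c1=1.5, c2=1.5, rng=None):
    """One velocity and position update for a swarm.

    X, V, p_best: arrays of shape (n_particles, dim); g_best: array of shape (dim,).
    """
    rng = np.random.default_rng() if rng is None else rng
    r1 = rng.random(X.shape)  # random factors in [0, 1)
    r2 = rng.random(X.shape)
    # Equation (8): inertia + attraction to personal best + attraction to global best.
    V_new = w * V + c1 * r1 * (p_best - X) + c2 * r2 * (g_best - X)
    # Equation (9): move each particle along its new velocity.
    X_new = X + V_new
    return X_new, V_new


X = np.zeros((3, 2))
V = np.ones((3, 2))
X_new, V_new = pso_step(X, V, p_best=X.copy(), g_best=np.zeros(2),
                        rng=np.random.default_rng(0))
print(V_new[0])  # only the inertia term remains when X equals both bests
```

After each step, a full optimizer would re-evaluate the fitness function and refresh `p_best` and `g_best` before the next update.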

2.5.3. Harris Hawks Optimization (HHO)

Developed by Heidari et al. [78], Harris Hawks Optimization (HHO) is an algorithm rooted in swarm intelligence that has been extensively utilized in recent years for tackling intricate nonlinear optimization problems. The HHO mechanism emulates the hunting actions of hawks seeking their prey. The HHO process unfolds in two primary phases: exploration and exploitation.

In the exploration phase, an initial population of Harris Hawks {X1, X2, …, Xn} is created randomly to track and detect the prey (rabbit) within the feasible space. According to Equation (10), each hawk updates its position using one of two strategies, which are selected with equal chance through the random value q.

X(t + 1) = \begin{cases} X_{rand}(t) - r_{1} \left| X_{rand}(t) - 2 r_{2} X(t) \right| & \text{if } q \geq 0.5 \\ \left( X_{rabbit}(t) - X_{m}(t) \right) - r_{3} \left( LB + r_{4} (UB - LB) \right) & \text{if } q < 0.5 \end{cases}

(10)

where X(t) and X(t + 1) are the positions of the Harris Hawk at iterations t and t + 1, respectively; X_{rand}(t) is a randomly selected Harris Hawk among all available individuals; X_{rabbit}(t) is the position of the prey; q, r₁, r₂, r₃, and r₄ are random values varying between 0 and 1; and LB and UB are the lower and upper bounds, respectively. Equation (11) gives the mean position X_{m}(t) of the total number N of Harris Hawks.

X_{m}(t) = \frac{1}{N} \sum_{i = 1}^{N} X_{i}(t)

(11)

During the hunting process, the escaping energy E of the prey decreases, and the transition from the exploration to the exploitation phase can be expressed by Equation (12).

E = 2 E₀ (1 − t/T)

(12)

where E₀ is the initial energy of the prey, ranging between −1 and 1, t is the current iteration, and T is the maximum number of iterations.

Exploitation is the second phase, which aims to refine the locally found solution. The prey, once attacked, tries to escape. Depending on the Hawks’ chasing behavior, four scenarios can occur: Soft Besiege, Hard Besiege, Soft Besiege with Progressive Rapid Dive, and Hard Besiege with Progressive Rapid Dive.
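The exploration step and the energy decay of Equations (10) to (12) can be sketched in a few lines of Python. This is a minimal sketch that follows the equations as printed in this section; the function names, the list-based representation of positions, and the uniform random sampling are assumptions made for illustration, and no clipping to the bounds LB and UB is applied.

```python
import random


def escaping_energy(e0, t, t_max):
    """Equation (12): the prey's energy decays linearly with the iteration count."""
    return 2.0 * e0 * (1.0 - t / t_max)


def mean_position(hawks):
    """Equation (11): component-wise mean of all hawk positions."""
    n = len(hawks)
    return [sum(h[j] for h in hawks) / n for j in range(len(hawks[0]))]


def explore(hawk, hawks, rabbit, lb, ub, rng=random):
    """Equation (10) as printed in this section: q selects one of two rules."""
    q, r1, r2, r3 = (rng.random() for _ in range(4))
    if q >= 0.5:
        # Perch relative to a randomly selected member of the population.
        x_rand = rng.choice(hawks)
        return [xr - r1 * (xr - 2.0 * r2 * x) for xr, x in zip(x_rand, hawk)]
    # Perch relative to the prey and the mean position of the group.
    x_a = mean_position(hawks)
    return [xb - xa - r3 * (u - l) for xb, xa, u, l in zip(rabbit, x_a, ub, lb)]
```

For example, `escaping_energy(1.0, 0, 100)` gives the maximum magnitude 2.0 at the first iteration, and the value reaches 0.0 at the last iteration, which is what drives the gradual shift from exploration to exploitation.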

2.5.4. Salp Swarm Algorithm (SSA)

The Salp Swarm Algorithm (SSA), introduced by Mirjalili in 2017 [79], mimics the behavior of salps during navigation and food source exploration in seas and oceans. In the SSA, an initial population is randomly generated and split into two groups: leaders and followers. Leader salps, positioned at the top of the salp chain, direct the swarm towards the designated food location (the target), while the other salps follow the leaders during the aggregation phase.

The position of the leading salp, denoted as x_{j}^{1}, is updated according to Equation (13).

x_{j}^{1} = \begin{cases} F_{j} + c_{1} ((ub_{j} - lb_{j}) c_{2} + lb_{j}), & c_{3} \geq 0 \\ F_{j} - c_{1} ((ub_{j} - lb_{j}) c_{2} + lb_{j}), & c_{3} < 0 \end{cases}

(13)

where F_j represents the best food solution in the jth dimension, ub_j and lb_j represent the upper and lower bounds in the jth dimension, respectively, c₂ and c₃ are random numbers ranging between 0 and 1, and c₁ is a parameter that varies according to Equation (14):

c_{1} = 2 e^{- (\frac{4 t}{T})^{2}}

(14)

where t and T represent the current iteration and the maximum number of iterations, respectively. Finally, the position of the followers x_{j}^{i} is updated according to Equation (15).

x_{j}^{i} = \frac{1}{2} (x_{j}^{i} + x_{j}^{i - 1})

(15)

where i ≥ 2 and x_{j}^{i} represents the position of the ith follower in the jth dimension.
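To make Equations (13) to (15) concrete, the sketch below implements the three SSA update rules for a single dimension j. The function names are hypothetical, and the random numbers c2 and c3 are passed in explicitly so that the behavior can be checked by hand. The branch threshold on c3 is exposed as a parameter with a default of 0, following the equation as printed; treating it as adjustable is an assumption of this sketch.

```python
import math


def c1_coefficient(t, t_max):
    """Equation (14): decreases from 2 towards 0 as the iterations progress."""
    return 2.0 * math.exp(-((4.0 * t / t_max) ** 2))


def update_leader(food_j, ub_j, lb_j, c1, c2, c3, threshold=0.0):
    """Equation (13) for one dimension j of the leading salp.

    threshold=0.0 follows the equation as printed; exposing it as a
    parameter is an assumption made for illustration.
    """
    step = c1 * ((ub_j - lb_j) * c2 + lb_j)
    return food_j + step if c3 >= threshold else food_j - step


def update_follower(x_i, x_prev):
    """Equation (15): a follower moves to the midpoint with its predecessor."""
    return 0.5 * (x_i + x_prev)
```

With F_j = 1, bounds [0, 10], c1 = 2, and c2 = 0.5, the step is 2 × (10 × 0.5 + 0) = 10, so the leader moves to 11 when the first branch is taken and to −9 when the second is taken.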

2.6. Gamma Test

The Gamma Test (GT) is a mathematical method for estimating the variance of noise, that is, the Mean Squared Error (MSE) that can be reached without overfitting the model. It is particularly useful for evaluating the nonlinear correlation between two random variables, the independent variable X and the dependent variable Y. The GT method was initially utilized by Koncar [80] and Stefánsson et al. [81], and since then, numerous studies have been conducted to further develop it [82]. Here, only key information about the GT is provided; for a deeper understanding of the method, it is recommended to read the cited references.

By adhering to the essential conditions outlined in previous studies, the variance of noise, denoted by Γ, can be estimated from the intercept of the regression line of γ(k) on δ(k), where 1 ≤ k ≤ p. γ(k) and δ(k) are computed as follows:

δ (k) = \frac{1}{M} \sum_{i = 1}^{M} {|x_{N [i, k]} - x_{i}|}^{2}

(16)

γ (k) = \frac{1}{2 M} \sum_{i = 1}^{M} {|y_{N [i, k]} - y_{i}|}^{2}

(17)

where x_{N[i,k]} is the kth nearest neighbor of x_i, y_{N[i,k]} is the corresponding target, M is the number of samples, and p is the number of nearest neighbors considered. To compute Γ, a least squares line must be fitted to the p points (δ(k), γ(k)). The intercept of this regression line is the Gamma statistic, Γ. Previous research has shown that Γ is useful for choosing the optimal input parameters: a small value of Γ indicates that the input set yields a better fit. Another important parameter for evaluating an input set is V_ratio, computed as follows:

V_{r a t i o} = \frac{Γ}{σ^{2} (y)}

(18)

where σ^{2}(y) is the output variance. In summary, in the present study, the optimal input combination is identified based on the lowest values of Γ and V_ratio.
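The computation described by Equations (16) to (18) can be sketched as follows. This is a minimal, brute-force illustration, not the implementation used in this study: the function name `gamma_test`, the default of 10 nearest neighbors, the Euclidean distance, and the use of the population variance for σ²(y) are all assumptions.

```python
def gamma_test(xs, ys, p=10):
    """Return (Gamma, V_ratio) for inputs xs (list of vectors) and targets ys.

    p is the number of nearest neighbours used for the regression; the
    default of 10 is an illustrative assumption. Requires p < len(xs).
    """
    m = len(xs)
    # For every sample, sort all other samples by squared Euclidean distance.
    neighbours = []
    for i in range(m):
        dists = sorted(
            (sum((a - b) ** 2 for a, b in zip(xs[i], xs[j])), j)
            for j in range(m) if j != i
        )
        neighbours.append(dists[:p])

    # Equations (16) and (17): mean squared input and (half) output distances.
    deltas = [sum(neighbours[i][k][0] for i in range(m)) / m for k in range(p)]
    gammas = [
        sum((ys[neighbours[i][k][1]] - ys[i]) ** 2 for i in range(m)) / (2 * m)
        for k in range(p)
    ]

    # Least-squares line gamma = slope * delta + intercept; Gamma is the intercept.
    mean_d, mean_g = sum(deltas) / p, sum(gammas) / p
    sxx = sum((d - mean_d) ** 2 for d in deltas)
    sxy = sum((d - mean_d) * (g - mean_g) for d, g in zip(deltas, gammas))
    gamma_stat = mean_g - (sxy / sxx) * mean_d

    # Equation (18): normalise by the (population) variance of the output.
    mean_y = sum(ys) / m
    var_y = sum((y - mean_y) ** 2 for y in ys) / m
    return gamma_stat, gamma_stat / var_y
```

On noise-free data such as y = x sampled at evenly spaced points, γ(k) is exactly half of δ(k), so the fitted line passes through the origin and both Γ and V_ratio are close to zero. Comparing input sets then amounts to calling the function once per candidate combination and keeping the one with the lowest values.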

2.7. Statistical Indicators

The estimation accuracy of the suggested models was evaluated using various statistical indicators and graphical approaches. The statistical indicators employed in this study include sensitivity, specificity, precision, accuracy, and Pearson correlation coefficient (R), listed in Table 2. The optimal model is determined based on the highest values of these statistical indicators, where

TP: True Positive, for correctly predicted event values.
FP: False Positive, for incorrectly predicted event values.
TN: True Negative, for correctly predicted no-event values.
FN: False Negative, for incorrectly predicted no-event values.
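
The four confusion-matrix indicators can be computed directly from these counts. The sketch below uses the standard textbook definitions, which are assumed to match those listed in Table 2; the Pearson correlation coefficient is omitted because it is computed from the predicted and observed values rather than from the counts. The function name and the example counts are hypothetical.

```python
def classification_metrics(tp, fp, tn, fn):
    """Return the four confusion-matrix indicators as fractions in [0, 1]."""
    return {
        "sensitivity": tp / (tp + fn),  # share of events that were detected
        "specificity": tn / (tn + fp),  # share of no-events that were detected
        "precision": tp / (tp + fp),    # share of predicted events that were real
        "accuracy": (tp + tn) / (tp + fp + tn + fn),
    }


# Hypothetical counts for 128 samples, chosen only for illustration.
metrics = classification_metrics(tp=49, fp=1, tn=78, fn=0)
```

These counts are not taken from the article. They give a sensitivity of 100%, a specificity of about 98.73%, a precision of 98%, and an accuracy of about 99.22%.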

3. Results

3.1. Database Compilation

Compiling the database is a critical step in assessing the relationship between landslide occurrence and its causal factors. Typically, landslide occurrences tend to be more frequent under certain conditions, as observed in previous incidents. Therefore, identifying the distribution of past landslides is crucial for a comprehensive study. In this research, Google Earth images were used to locate landslide sites using the Historical Imagery tool, which enables the visualization of changes over time in the study area map. This approach is valuable for monitoring suspicious sites over time and identifying potential landslide areas. Following this, field surveys were conducted to validate the suspected sites; assess the sizes and shapes of landslides; determine the movement types; conduct site diagnoses; and characterize the activity level (active, dormant, etc.) of failed slopes.

To assess the performance of hybrid machine learning methods, various thematic layers representing several factors influencing landslides were prepared. The selection of drivers depended on data availability, information gathered from field surveys, and specific characteristics of the study area. These factors were digitized, organized, and rasterized using Geographic Information System (GIS) software, specifically ArcGIS 10.8, to integrate them into the proposed model and generate the final map.

The adopted input factors are lithology, elevation, slope, land cover, distance to stream, precipitation, slope aspect, and distance to road. The lithology layer (X1) was created using data from a 1:50,000-scale geologic map and categorized into four distinct groups according to their landslide susceptibility, as illustrated in Table 3. The elevation, slope, and slope aspect (X8) maps were derived from the Shuttle Radar Topography Mission (SRTM) database. The elevation values (X2) ranged from 265 to 1800 m and were classified into six categories as follows: (1) 265–400 m, (2) 400–600 m, (3) 600–800 m, (4) 800–1000 m, (5) 1000–1200 m, and (6) >1200 m. The slope layer (X3) was created using the “slope function” in ArcGIS 10.8 and categorized into four sets as follows: (1) 0–5°, (2) 5–12.5°, (3) 12.5–25°, and (4) 25–90°. The land cover layer (X4) was categorized into five classes: (1) residential, (2) uncultivated, (3) cultivated, (4) grassland, and (5) forest. Precipitation (X7) was classified into five classes as follows: (1) 100–200 mm, (2) 200–300 mm, (3) 300–400 mm, (4) 400–500 mm, and (5) 600–800 mm. Finally, distance to stream (X5) and distance to road (X6) were classified into six sets as follows: (1) 0–25 m, (2) 25–50 m, (3) 50–100 m, (4) 100–200 m, (5) 200–300 m, and (6) >300 m. These classifications are based on the analysis of the susceptibility of each factor class to landslide occurrences, drawing on findings from previous landslide studies.
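
Outside a GIS environment, this kind of reclassification of a continuous factor into ordered classes can be sketched with a sorted list of class limits. The example below is only an illustration of the idea and not the ArcGIS workflow used in the study; the constant and function names are hypothetical, and assigning a value that falls exactly on a limit to the upper class is an assumption, since the text does not state how boundaries are handled.

```python
from bisect import bisect_right

# Upper class limits taken from the text; the last class is open-ended.
ELEVATION_BREAKS_M = [400, 600, 800, 1000, 1200]
SLOPE_BREAKS_DEG = [5, 12.5, 25]
DISTANCE_BREAKS_M = [25, 50, 100, 200, 300]


def class_index(value, breaks):
    """Return the 1-based class of value.

    A value equal to a break is placed in the upper class (assumption).
    """
    return bisect_right(breaks, value) + 1
```

For instance, `class_index(10, SLOPE_BREAKS_DEG)` returns 2 (the 5–12.5° class) and `class_index(1300, ELEVATION_BREAKS_M)` returns 6 (the >1200 m class).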

3.2. Correlation between Inputs and Target

The statistical relationship between landslide susceptibility and input parameters was examined using SPSS software version 20.0. Table 4 displays the correlation matrix, which provides a descriptive overview of the data distribution together with the Pearson correlation coefficient (R) and its significance level, both between landslide susceptibility and the inputs and among the inputs themselves. The results revealed significance levels below 0.05 for X1, X2, X3, X7, and X8, indicating statistical significance in these correlations. According to Smith’s classification (1986) [83], landslide susceptibility exhibits a consistent correlation with the input parameters, except for X4, X5, and X6, which demonstrate poor correlation. This suggests a complex nonlinear relationship that necessitates advanced machine learning techniques for accurate modeling.

3.3. Optimal Input Selection Using the Gamma Test (GT)

In this section, the impact of each input on landslide susceptibility was evaluated by constructing eight different combinations of the input factors (X1, X2, X3, X4, X5, X6, X7, and X8), as outlined in Table 5. The first combination includes all eight parameters (referred to as the initial set). Similarly, the second combination consists of seven input factors (All-X1), excluding the lithology parameter; the seventh combination includes all inputs except precipitation (X7) and so forth for the remaining combinations, as detailed in Table 5. The results of the GT analysis reveal that factors X1, X3, X5, X7, and X8 have a significant influence on the output. These five input factors were selected on the basis of the gamma statistic (Γ) and V_ratio. The findings indicate that the combination of lithology, slope, distance to stream, precipitation, and slope aspect exhibited the lowest values of the gamma statistic. Consequently, this set was identified as the optimal combination of input variables for modeling landslide susceptibility, as determined by the Gamma Test method.

3.4. Landslide Susceptibility Classification through the Hybrid Metaheuristic Method

To determine the optimal machine learning model, the study employed a two-step approach: first, selecting influential input parameters based on the literature recommendations, and second, identifying the best machine learning methods. Initially, eight factors were chosen, and the Gamma Test method was then applied to identify the optimal inputs. Subsequently, five statistical measures were used to assess and compare the performances of various models during both the training and validation phases. The results, including sensitivity, specificity, precision, accuracy, and Pearson correlation coefficient (R), are presented in Table 6.

The dataset was split into two parts: 80% for training and 20% for validation, comprising 128 samples for training and 32 for validation. As shown in Table 6, the landslide susceptibility modeled using various hybrid metaheuristic methods exhibited the following ranges of performance metrics during the training phase: sensitivity (95.83–100%), specificity (94.94–98.73%), precision (92.16–98%), accuracy (96.09–99.22%), and Pearson correlation coefficient (R) (92–99.21%). Similarly, during the validation phase, the ranges were sensitivity (85.71–100%), specificity (95.45–100%), precision (88.89–100%), accuracy (93.75–100%), and Pearson correlation coefficient (R) (87.26–99.97%). The results show that the ANFIS-HHO model, trained with the optimal inputs identified by the GT method and optimized using the HHO algorithm, produced the most accurate predictions. This model exhibited the highest sensitivity (100%/100%), specificity (98.734%/100%), precision (98%/100%), accuracy (99.22%/100%), and Pearson correlation coefficient (R) (99.21%/99.97%) in the training/validation phases. Additionally, the ANFIS-SSA model performed satisfactorily and was ranked as the second-best model. On the other hand, the ANFIS-GA model yielded the weakest results in predicting landslide susceptibility. In terms of the performance hierarchy of the hybrid metaheuristic machine learning models during the training and validation phases, the order is ANFIS-HHO, ANFIS-SSA, ANFIS-PSO, and ANFIS-GA.

3.5. Evaluating the Best-Fitted Model Using the K-Fold Cross-Validation Approach

The evaluation of the predictive capability of the optimal ANFIS-HHO model involved the effective use of a five-fold cross-validation approach. Notably, previous studies focusing on predicting landslide susceptibility often assessed their models based on a single split, which limited the verification of their models’ ability to address overfitting and underfitting issues. Figure 13 illustrates the performance measures of the optimal ANFIS-HHO model using five-fold cross-validation with validation data for each split.

The results clearly demonstrate the efficacy of the ANFIS-HHO model, with correlation coefficients ranging between 0.972 and 0.994 for validation data across the five splits. This substantiates the predictive capability of the optimal ANFIS-HHO model to not only learn from existing data but also generalize well to novel validation data, effectively overcoming the overfitting and underfitting challenges.
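
The way a five-fold scheme partitions the 160 samples can be sketched as follows. The article does not describe how the folds were formed, so the shuffling, the fixed seed, and the function name `k_fold_indices` are assumptions made only to illustrate the procedure; no stratification by class is attempted.

```python
import random


def k_fold_indices(n_samples, k=5, seed=0):
    """Shuffle sample indices and yield (train, validation) index lists per fold."""
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)
    # Deal the shuffled indices into k folds of (almost) equal size.
    folds = [indices[i::k] for i in range(k)]
    for i in range(k):
        validation = folds[i]
        train = [idx for j, fold in enumerate(folds) if j != i for idx in fold]
        yield train, validation
```

With 160 samples and k = 5, each split holds 128 training and 32 validation samples, and every sample is used for validation exactly once across the five splits.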

3.6. Landslide Susceptibility Mapping

The landslide modeling process was conducted using four hybrid metaheuristic machine learning methods based on the training dataset. The performance of each model was assessed using five statistical indicators, revealing ANFIS-HHO as the most suitable model. To generate landslide susceptibility maps, susceptibility indices were computed for all pixels in the study area using a 30 × 30 m grid size. The Fishnet Tool in ArcToolbox facilitated this step. Subsequently, the ANFIS-HHO model was integrated into ArcGIS software to classify the susceptibility indices of each pixel based on the optimal input layers.

Figure 14 illustrates the resulting landslide susceptibility maps, showing three susceptibility classes: low, moderate, and high. The distribution of susceptibility classes reveals that 48.39% of the study area exhibits low susceptibility to landslides, while 22.31% and 29.29% have moderate and high susceptibilities, respectively.

The map illustrates an increasing susceptibility from plateau surfaces to streams, primarily influenced by slope angle. Plateau surfaces demonstrate low susceptibility due to factors such as lithological characteristics (e.g., hard rock), distance from streams, low precipitation, and gentle slopes. Moderate susceptibility is observed in ravines and convex slope ruptures delineating plateaus, indicative of lithological alteration zones (passage from a hard layer to soft layer). Conversely, high susceptibility zones are characterized by soft soil lithology, sparse vegetation cover, steep slopes, high precipitation, and proximity to streams.

3.7. Comparison between Our Model and the Models Proposed by the Literature

To assess the effectiveness of the proposed ANFIS-HHO model, a comparative study was conducted involving several empirical models from the literature predicting landslide susceptibility, as outlined in Table 7. The comparison was based on classification accuracy, sensitivity, and specificity, crucial indicators for evaluating prediction accuracy, where values close to 100 represent the best model. The results of the comparative study revealed that our proposed ANFIS-HHO model outperformed the others, demonstrating the highest classification accuracy, sensitivity, and specificity with values of 99.22, 100, and 98.734, respectively.

Following our model, the random forest model proposed by Dou et al. [52] ranked second, providing acceptable accuracy. The performance hierarchy of the machine learning models in our study was as follows: Dou et al. [52], Benbouras [8], Kavzoglu et al. [39], Tien Bui et al. [26], Aghdam et al. [59], Dao et al. [52], and Yeon et al. [50]. The effectiveness of our suggested model is attributed to the metaheuristic hybrid machine learning method, which automates the training process and achieves a better performance and optimum results in a short period of time.

4. Discussion

4.1. Significance of the Results

In our current research, we aimed to contribute to the landslide research community by enhancing the performance of landslide susceptibility models. To the authors’ knowledge, the quality of these models heavily relies on the chosen method, and the current study focuses on exploring the effectiveness of novel hybrid metaheuristic machine learning methods. Furthermore, despite Medea Wilaya being highly vulnerable to landslides, the existing literature lacks a comprehensive landslide map. To address these gaps, the efficacy of four metaheuristic algorithms combined with the Adaptive Neuro-Fuzzy Inference System (ANFIS) method was examined for a landslide susceptibility assessment. These algorithms include Genetic Algorithm (ANFIS-GA), Particle Swarm Optimization (ANFIS-PSO), Harris Hawks Optimization (ANFIS-HHO), and Salp Swarm Algorithm (ANFIS-SSA). It is worth noting that the use of hybrid metaheuristic machine learning methods in landslide assessment is relatively rare, and this study is the first to apply them to the study area.

Our findings highlight that the ANFIS-HHO model emerged as the most suitable model, exhibiting higher values of sensitivity (100/100), specificity (98.734/100), precision (98/100), accuracy (99.22/100), and Pearson correlation coefficient (R) (99.21/99.97) during both the training and validation phases compared to the other models. Furthermore, we evaluated the newly developed model using the K-fold cross-validation method, demonstrating its ability to generalize to new data without overfitting or underfitting and its superior precision compared to the other proposed empirical models in the literature.

Our results hold significant importance for landslide research and hazard assessment. By demonstrating the effectiveness of hybrid metaheuristic machine learning methods in improving landslide susceptibility models, we provide valuable insights for researchers and practitioners. The identification of the ANFIS-HHO model as the most suitable for landslide prediction underscores the potential of these advanced techniques in addressing complex geological phenomena. The model is based on an optimal hybrid metaheuristic ANFIS-HHO model assessed by the K-fold cross-validation approach. This approach ensures that our model’s performance is evaluated using different subsets of the dataset, minimizing the risk of overfitting and enhancing its generalization. This combination shows a rigorous optimization for accurate predictions, effectively overcoming the overfitting and underfitting challenges. It includes essential input parameters for landslide susceptibility, such as lithology, slope, elevation, distance to stream, land cover, precipitation, slope aspect, and distance to road. Moreover, the integration of our improved ANFIS-HHO model into GIS software facilitates the generation of accurate landslide susceptibility maps, enabling decision-makers and land use managers to implement effective risk management strategies.

4.2. Inner Validation of the Results

We believe that our proposed methodology, which combines hybrid machine learning techniques with GIS tools, offers a straightforward approach that can be replicated in other regions facing similar challenges. Historically, by the beginning of the 20th century, microzoning maps and machine learning methods gained widespread usage in northern countries, aiding decision-makers and land use managers in various applications [84]. Today, the imperative to leverage these tools for developing up-to-date microzoning maps extends to several countries in the south, reflecting their significance in addressing contemporary challenges [84]. In this context, landslide susceptibility maps hold considerable importance [85], serving as critical resources for informed decision-making and risk management.

4.3. External Validation of the Results

The results of the current study demonstrate a significant enhancement in the performance of the landslide model through the utilization of hybrid machine learning methods. Compared to traditional methods, the metaheuristic hybrid HHO-ANFIS method yielded highly significant results. Furthermore, ANFIS-HHO outperformed the proposed models in the literature, showing a 9.22% improvement over the ANFIS and DNN proposed by Aghdam et al. [59] and Dao et al. [52], respectively, 4.81% improvement over SVM proposed by Kavzoglu et al. [39], 4.14% improvement over PSOGSA-ANN proposed by Benbouras [8], and 3.63% improvement over Random Forest proposed by Dou et al. [52]. These findings are consistent with expectations, as hybrid machine learning techniques, whether employed for prediction or classification tasks, can mitigate bias and variance while averting issues such as overfitting and underfitting, thus enhancing the predictive capability of traditional methods.

4.4. Importance of the Results

The significance of our results lies in their potential to inform landslide risk management strategies and land use planning efforts in landslide-prone regions. By accurately assessing the landslide susceptibility and generating detailed susceptibility maps, our methodology empowers decision-makers to implement proactive measures to mitigate the impact of landslides on human lives and infrastructures. Furthermore, our study highlights the efficacy of hybrid machine learning techniques in enhancing traditional landslide modeling approaches, paving the way for future research and practical applications in similar contexts worldwide.

4.5. Study Limitations and Future Directions

The main advantage of hybrid metaheuristic machine learning methods, in contrast to traditional approaches, lies in their ability to automate training processes and achieve superior performance and optimal results within shorter timeframes. Moreover, these methods can combine various algorithms, harnessing the strengths of each to create methodologies that are more adaptable than conventional machine learning techniques. However, it is crucial to acknowledge several limitations. The most important one is the relatively small sample size used in this study, which may impact the precision of the landslide susceptibility map. This limitation could hinder the model’s capacity to generalize to novel conditions or scenarios not accounted for during the training phase. Additionally, researchers often rely on extensive and diverse datasets compiled from various sources to bolster learning outcomes. Thus, future studies could benefit from incorporating data gathered across multiple countries to enrich the learning process and enhance the model performance. Furthermore, the implementation of the proposed models may pose other challenges in future research. Results are typically presented in complex matrices computed using transfer functions, which may prove cumbersome to use in subsequent cases, particularly considering the need to integrate the model with external programs such as ArcGIS, as demonstrated in this study.

5. Conclusions

This study explored the efficacy of previously unused advanced hybrid machine learning methods in generating a reliable model for assessing landslide susceptibility in Medea Wilaya, which is known to be highly vulnerable to landslides. To achieve this objective, historical landslide locations were identified using Google Earth images, and multiple field surveys were performed. The Gamma Test method was then employed for optimal input selection, revealing that lithology, slope, distance to stream, precipitation, and slope aspect constitute the optimal input set. Following this, four metaheuristic algorithms, namely GA, PSO, HHO, and SSA, were combined with the ANFIS method and applied to model the selected optimal input set. The accuracy of the proposed models was assessed using five statistical indicators. The comparative assessments highlighted the superior accuracy of the ANFIS-HHO model, which exhibited the best performance in terms of sensitivity (100/100), specificity (98.734/100), precision (98/100), accuracy (99.22/100), and Pearson correlation coefficient (R) (99.21/99.97) during both the training and validation phases compared to the other models. Additionally, the predictive capability of the ANFIS-HHO model was evaluated using five-fold cross-validation, which yielded consistently high correlation coefficients (0.972 to 0.994) across validation data splits, indicating the absence of overfitting or underfitting issues. Our proposed model is therefore an optimal hybrid metaheuristic ANFIS-HHO model assessed by the K-fold cross-validation approach, which indicates rigorous optimization for accurate predictions.

Comparative analyses with the proposed models in the literature confirmed the significant improvement of our proposed ANFIS-HHO model. Finally, the proposed model was integrated into GIS software to produce an accurate map depicting landslide-prone areas. This map can serve as a valuable tool for decision-makers and land use managers in mitigating landslide hazards in the Medea region.

Theoretically, our study contributes to expanding our knowledge by providing an in-depth insight into the application of advanced machine learning techniques in landslide susceptibility assessments. By investigating the efficacy of hybrid metaheuristic algorithms combined with the Adaptive Neuro-Fuzzy Inference System (ANFIS) method, we contribute to the body of literature on landslide modeling methodologies.

Methodologically, our study presents a novel approach to landslide susceptibility modeling by integrating multiple advanced techniques and methodologies. The utilization of hybrid metaheuristic algorithms represents an innovative methodological advancement, offering a robust framework for improving the predictive accuracy and reliability. Moreover, our research demonstrates the feasibility of integrating machine learning models with Geographic Information Systems (GIS) software for practical applications in hazard mapping and risk management. This methodological innovation has implications beyond landslide research and can be adapted to other geospatial modeling applications, contributing to interdisciplinary approaches in geological–geotechnical risks.

Finally, the current research has the potential to inform decision-making processes and improve disaster preparedness efforts in landslide-prone regions. Moreover, the methodologies developed in this study can serve as a foundation for future research endeavors in the broader field of natural hazard assessment and mitigation.

Author Contributions

Conceptualization, F.D., M.A.B. and A.-I.P.; methodology, F.D., M.A.B., A.-I.P., L.M.B.A. and A.L.; software, M.A.B. and A.L.; validation, F.D. and A.-I.P.; formal analysis, M.A.B. and L.M.B.A.; investigation, F.D., M.A.B., A.-I.P., L.M.B.A. and A.L.; data curation, M.A.B.; writing—original draft preparation, F.D., M.A.B. and A.-I.P.; writing—review and editing, F.D., M.A.B., A.-I.P., L.M.B.A. and A.L.; visualization, L.M.B.A. and A.L.; supervision, F.D. and A.-I.P.; project administration, F.D. and A.-I.P.; funding acquisition, F.D., M.A.B., A.-I.P., L.M.B.A. and A.L. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

Data are available by request.

Conflicts of Interest

The authors declare no conflicts of interest.

References

Achour, Y.; Pourghasemi, H.R. How Do Machine Learning Techniques Help in Increasing Accuracy of Landslide Susceptibility Maps? Geosci. Front. 2020, 11, 871–883. [Google Scholar] [CrossRef]
Cemiloglu, A.; Zhu, L.; Mohammednour, A.B.; Azarafza, M.; Nanehkaran, Y.A. Landslide Susceptibility Assessment for Maragheh County, Iran, Using the Logistic Regression Algorithm. Land 2023, 12, 1397. [Google Scholar] [CrossRef]
Nourani, V.; Pradhan, B.; Ghaffari, H.; Sharifi, S.S. Landslide Susceptibility Mapping at Zonouz Plain, Iran Using Genetic Programming and Comparison with Frequency Ratio, Logistic Regression, and Artificial Neural Network Models. Nat. Hazards 2014, 71, 523–547. [Google Scholar] [CrossRef]
Nhu, V.-H.; Mohammadi, A.; Shahabi, H.; Ahmad, B.B.; Al-Ansari, N.; Shirzadi, A.; Clague, J.J.; Jaafari, A.; Chen, W.; Nguyen, H. Landslide Susceptibility Mapping Using Machine Learning Algorithms and Remote Sensing Data in a Tropical Environment. Int. J. Environ. Res. Public Health 2020, 17, 4933. [Google Scholar] [CrossRef]
Debiche, F.; Kettab, R.M.; Benbouras, M.A.; Benbellil, B.; Djerbal, L.; Petrisor, A.-I. Use of GIS systems to analyze soil compressibility, swelling and bearing capacity under superficial foundations in Algiers region, Algeria. Urbanism. Arhitectura. Constr. 2018, 9, 357–370. [Google Scholar]
Kadavi, P.R.; Lee, C.-W.; Lee, S. Application of Ensemble-Based Machine Learning Models to Landslide Susceptibility Mapping. Remote Sens. 2018, 10, 1252. [Google Scholar] [CrossRef]
Pardeshi, S.D.; Autade, S.E.; Pardeshi, S.S. Landslide Hazard Assessment: Recent Trends and Techniques. SpringerPlus 2013, 2, 523. [Google Scholar] [CrossRef]
Benbouras, M.A. Hybrid Meta-Heuristic Machine Learning Methods Applied to Landslide Susceptibility Mapping in the Sahel-Algiers. Int. J. Sediment Res. 2022, 37, 601–618. [Google Scholar] [CrossRef]
Brenning, A. Spatial Prediction Models for Landslide Hazards: Review, Comparison and Evaluation. Nat. Hazards Earth Syst. Sci. 2005, 5, 853–862. [Google Scholar] [CrossRef]
Kavzoglu, T.; Colkesen, I.; Sahin, E.K. Machine Learning Techniques in Landslide Susceptibility Mapping: A Survey and a Case Study. In Landslides: Theory, Practice and Modelling; Advances in Natural and Technological Hazards Research; Pradhan, S.P., Vishal, V., Singh, T.N., Eds.; Springer International Publishing: Cham, Switzerland, 2019; pp. 283–301. ISBN 978-3-319-77377-3. [Google Scholar]
Hadmoko, D.S.; Lavigne, F.; Samodra, G. Application of a Semiquantitative and GIS-Based Statistical Model to Landslide Susceptibility Zonation in Kayangan Catchment, Java, Indonesia. Nat. Hazards 2017, 87, 437–468. [Google Scholar] [CrossRef]
Bai, S.B.; Wang, J.; Thiebes, B.; Cheng, C.; Chang, Z.Y. Susceptibility Assessments of the Wenchuan Earthquake-Triggered Landslides in Longnan Using Logistic Regression. Environ. Earth Sci. 2014, 71, 731–743.
Chen, C.-W.; Chen, H.; Wei, L.-W.; Lin, G.-W.; Iida, T.; Yamada, R. Evaluating the Susceptibility of Landslide Landforms in Japan Using Slope Stability Analysis: A Case Study of the 2016 Kumamoto Earthquake. Landslides 2017, 14, 1793–1801.
Caniani, D.; Pascale, S.; Sdao, F.; Sole, A. Neural Networks and Landslide Susceptibility: A Case Study of the Urban Area of Potenza. Nat. Hazards 2008, 45, 55–72.
Ayalew, L.; Yamagishi, H. The Application of GIS-Based Logistic Regression for Landslide Susceptibility Mapping in the Kakuda-Yahiko Mountains, Central Japan. Geomorphology 2005, 65, 15–31.
Huang, F.; Yin, K.; Huang, J.; Gui, L.; Wang, P. Landslide Susceptibility Mapping Based on Self-Organizing-Map Network and Extreme Learning Machine. Eng. Geol. 2017, 223, 11–22.
Benbouras, M.A.; Kettab, R.M.; Zedira, H.; Petrisor, A.-I.; Debiche, F. Dry Density in Relation to Other Geotechnical Proprieties of Algiers Clay. Rev. Şcolii Dr. Urban. 2017, 2, 5–14.
Benbouras, M.A.; Sadoudi, L.; Leghouchi, A. Prediction of the Resilient Modulus of Subgrade Soil Using Machine-Learning Techniques. Urbanism. Arhitectura. Constr. 2025, 16, 1–14.
Amin, B. Predicting Shear Stress Parameters in Consolidated Drained Conditions Using Artificial Intelligence Methods. Basic Appl. Sci.-Sci. J. King Faisal Univ. 2021, 22, 1–7.
Bioud, N.E.-I.; Laid, I.O.; Benbouras, M.A. Estimating the Fundamental Period of Infilled RC Frame Structures via Deep Learning. Urbanism. Arhitectura. Constr. 2023, 14, 59–80.
Alioua, S.; Arab, A.; Benbouras, M.A.; Leghouchi, A. Modeling Static Liquefaction Susceptibility of Saturated Clayey Sand Using Advanced Machine-Learning Techniques. Transp. Infrastruct. Geotech. 2024, 12, 1–30.
Yilmaz, I. Comparison of Landslide Susceptibility Mapping Methodologies for Koyulhisar, Turkey: Conditional Probability, Logistic Regression, Artificial Neural Networks, and Support Vector Machine. Environ. Earth Sci. 2010, 61, 821–836.
Wang, L.-J.; Guo, M.; Sawada, K.; Lin, J.; Zhang, J. A Comparative Study of Landslide Susceptibility Maps Using Logistic Regression, Frequency Ratio, Decision Tree, Weights of Evidence and Artificial Neural Network. Geosci. J. 2016, 20, 117–136.
Yilmaz, I. Landslide Susceptibility Mapping Using Frequency Ratio, Logistic Regression, Artificial Neural Networks and Their Comparison: A Case Study from Kat Landslides (Tokat—Turkey). Comput. Geosci. 2009, 35, 1125–1138.
Lee, S.; Ryu, J.-H.; Lee, M.-J.; Won, J.-S. Use of an Artificial Neural Network for Analysis of the Susceptibility to Landslides at Boun, Korea. Environ. Geol. 2003, 44, 820–833.
Tien Bui, D.; Tuan, T.A.; Klempe, H.; Pradhan, B.; Revhaug, I. Spatial Prediction Models for Shallow Landslide Hazards: A Comparative Assessment of the Efficacy of Support Vector Machines, Artificial Neural Networks, Kernel Logistic Regression, and Logistic Model Tree. Landslides 2016, 13, 361–378.
Xu, C.; Shen, L.; Wang, G. Soft Computing in Assessment of Earthquake-Triggered Landslide Susceptibility. Environ. Earth Sci. 2016, 75, 767.
Gómez, H.; Kavzoglu, T. Assessment of Shallow Landslide Susceptibility Using Artificial Neural Networks in Jabonosa River Basin, Venezuela. Eng. Geol. 2005, 78, 11–27.
Alimohammadlou, Y.; Najafi, A.; Gokceoglu, C. Estimation of Rainfall-Induced Landslides Using ANN and Fuzzy Clustering Methods: A Case Study in Saeen Slope, Azerbaijan Province, Iran. CATENA 2014, 120, 149–162.
Were, K.; Bui, D.T.; Dick, Ø.B.; Singh, B.R. A Comparative Assessment of Support Vector Regression, Artificial Neural Networks, and Random Forests for Predicting and Mapping Soil Organic Carbon Stocks across an Afromontane Landscape. Ecol. Indic. 2015, 52, 394–403.
Lee, S.; Ryu, J.-H.; Won, J.-S.; Park, H.-J. Determination and Application of the Weights for Landslide Susceptibility Mapping Using an Artificial Neural Network. Eng. Geol. 2004, 71, 289–302.
Melchiorre, C.; Matteucci, M.; Azzoni, A.; Zanchi, A. Artificial Neural Networks and Cluster Analysis in Landslide Susceptibility Zonation. Geomorphology 2008, 94, 379–400.
Neaupane, K.M.; Achet, S.H. Use of Backpropagation Neural Network for Landslide Monitoring: A Case Study in the Higher Himalaya. Eng. Geol. 2004, 74, 213–226.
Yilmaz, I. The Effect of the Sampling Strategies on the Landslide Susceptibility Mapping by Conditional Probability and Artificial Neural Networks. Environ. Earth Sci. 2010, 60, 505–519.
Pradhan, B.; Lee, S. Delineation of Landslide Hazard Areas on Penang Island, Malaysia, by Using Frequency Ratio, Logistic Regression, and Artificial Neural Network Models. Environ. Earth Sci. 2010, 60, 1037–1054.
Lee, S.; Evangelista, D.G. Earthquake-Induced Landslide-Susceptibility Mapping Using an Artificial Neural Network. Nat. Hazards Earth Syst. Sci. 2006, 6, 687–695.
Pradhan, B.; Buchroithner, M.F. Comparison and Validation of Landslide Susceptibility Maps Using an Artificial Neural Network Model for Three Test Areas in Malaysia. Environ. Eng. Geosci. 2010, 16, 107–126.
Zare, M.; Pourghasemi, H.R.; Vafakhah, M.; Pradhan, B. Landslide Susceptibility Mapping at Vaz Watershed (Iran) Using an Artificial Neural Network Model: A Comparison between Multilayer Perceptron (MLP) and Radial Basic Function (RBF) Algorithms. Arab. J. Geosci. 2013, 6, 2873–2888.
Kavzoglu, T.; Kutlug Sahin, E.; Colkesen, I. An Assessment of Multivariate and Bivariate Approaches in Landslide Susceptibility Mapping: A Case Study of Duzkoy District. Nat. Hazards 2015, 76, 471–496.
Goetz, J.N.; Brenning, A.; Petschko, H.; Leopold, P. Evaluating Machine Learning and Statistical Prediction Techniques for Landslide Susceptibility Modeling. Comput. Geosci. 2015, 81, 1–11.
Pradhan, B. A Comparative Study on the Predictive Ability of the Decision Tree, Support Vector Machine and Neuro-Fuzzy Models in Landslide Susceptibility Mapping Using GIS. Comput. Geosci. 2013, 51, 350–365.
Hong, H.; Pradhan, B.; Xu, C.; Tien Bui, D. Spatial Prediction of Landslide Hazard at the Yihuang Area (China) Using Two-Class Kernel Logistic Regression, Alternating Decision Tree and Support Vector Machines. CATENA 2015, 133, 266–281.
Marjanović, M.; Kovačević, M.; Bajat, B.; Mihalić Arbanas, S.; Abolmasov, B. Landslide Assessment of the Strača Basin (Croatia) Using Machine Learning Algorithms. Acta Geotech. Slov. 2011, 8, 45–55.
Yao, X.; Tham, L.G.; Dai, F.C. Landslide Susceptibility Mapping Based on Support Vector Machine: A Case Study on Natural Slopes of Hong Kong, China. Geomorphology 2008, 101, 572–582.
Hong, H.; Pradhan, B.; Jebur, M.N.; Bui, D.T.; Xu, C.; Akgun, A. Spatial Prediction of Landslide Hazard at the Luxi Area (China) Using Support Vector Machines. Environ. Earth Sci. 2015, 75, 40.
Ballabio, C.; Sterlacchini, S. Support Vector Machines for Landslide Susceptibility Mapping: The Staffora River Basin Case Study, Italy. Math. Geosci. 2012, 44, 47–70.
Felicísimo, Á.M.; Cuartero, A.; Remondo, J.; Quirós, E. Mapping Landslide Susceptibility with Logistic Regression, Multiple Adaptive Regression Splines, Classification and Regression Trees, and Maximum Entropy Methods: A Comparative Study. Landslides 2013, 10, 175–189.
Tien Bui, D.; Ho, T.C.; Revhaug, I.; Pradhan, B.; Nguyen, D.B. Landslide Susceptibility Mapping Along the National Road 32 of Vietnam Using GIS-Based J48 Decision Tree Classifier and Its Ensembles. In Cartography from Pole to Pole: Selected Contributions to the XXVIth International Conference of the ICA, Dresden 2013; Lecture Notes in Geoinformation and Cartography; Buchroithner, M., Prechtel, N., Burghardt, D., Eds.; Springer: Berlin/Heidelberg, Germany, 2014; pp. 303–317. ISBN 978-3-642-32618-9.
Saito, H.; Nakayama, D.; Matsuyama, H. Comparison of Landslide Susceptibility Based on a Decision-Tree Model and Actual Landslide Occurrence: The Akaishi Mountains, Japan. Geomorphology 2009, 109, 108–121.
Yeon, Y.-K.; Han, J.-G.; Ryu, K.H. Landslide Susceptibility Mapping in Injae, Korea, Using a Decision Tree. Eng. Geol. 2010, 116, 274–283.
Zhang, K.; Wu, X.; Niu, R.; Yang, K.; Zhao, L. The Assessment of Landslide Susceptibility Mapping Using Random Forest and Decision Tree Methods in the Three Gorges Reservoir Area, China. Environ. Earth Sci. 2017, 76, 405.
Dou, J.; Yunus, A.P.; Tien Bui, D.; Merghadi, A.; Sahana, M.; Zhu, Z.; Chen, C.-W.; Khosravi, K.; Yang, Y.; Pham, B.T. Assessment of Advanced Random Forest and Decision Tree Algorithms for Modeling Rainfall-Induced Landslide Susceptibility in the Izu-Oshima Volcanic Island, Japan. Sci. Total Environ. 2019, 662, 332–346.
Trigila, A.; Iadanza, C.; Esposito, C.; Scarascia-Mugnozza, G. Comparison of Logistic Regression and Random Forests Techniques for Shallow Landslide Susceptibility Assessment in Giampilieri (NE Sicily, Italy). Geomorphology 2015, 249, 119–136.
Rahmati, O.; Falah, F.; Naghibi, S.A.; Biggs, T.; Soltani, M.; Deo, R.C.; Cerdà, A.; Mohammadi, F.; Tien Bui, D. Land Subsidence Modelling Using Tree-Based Machine Learning Algorithms. Sci. Total Environ. 2019, 672, 239–252.
Catani, F.; Lagomarsino, D.; Segoni, S.; Tofani, V. Landslide Susceptibility Estimation by Random Forests Technique: Sensitivity and Scaling Issues. Nat. Hazards Earth Syst. Sci. 2013, 13, 2815–2831.
Ließ, M.; Glaser, B.; Huwe, B. Functional Soil-Landscape Modelling to Estimate Slope Stability in a Steep Andean Mountain Forest Region. Geomorphology 2011, 132, 287–299.
Stumpf, A.; Kerle, N. Object-Oriented Mapping of Landslides Using Random Forests. Remote Sens. Environ. 2011, 115, 2564–2577.
Chen, W.; Xie, X.; Wang, J.; Pradhan, B.; Hong, H.; Bui, D.T.; Duan, Z.; Ma, J. A Comparative Study of Logistic Model Tree, Random Forest, and Classification and Regression Tree Models for Spatial Prediction of Landslide Susceptibility. CATENA 2017, 151, 147–160.
Aghdam, I.N.; Varzandeh, M.H.M.; Pradhan, B. Landslide Susceptibility Mapping Using an Ensemble Statistical Index (Wi) and Adaptive Neuro-Fuzzy Inference System (ANFIS) Model at Alborz Mountains (Iran). Environ. Earth Sci. 2016, 75, 553.
Pradhan, B.; Sezer, E.A.; Gokceoglu, C.; Buchroithner, M.F. Landslide Susceptibility Mapping by Neuro-Fuzzy Approach in a Landslide-Prone Area (Cameron Highlands, Malaysia). IEEE Trans. Geosci. Remote Sens. 2010, 48, 4164–4177.
Choi, J.; Lee, Y.K.; Lee, M.J.; Kim, K.; Park, Y.; Kim, S.; Goo, S.; Cho, M.; Sim, J.; Won, J.S. Landslide Susceptibility Mapping by Using an Adaptive Neuro-Fuzzy Inference System (ANFIS). In Proceedings of the 2011 IEEE International Geoscience and Remote Sensing Symposium, Vancouver, BC, Canada, 24–29 July 2011; pp. 1989–1992.
Jaafari, A.; Rezaeian, J.; Omrani, M.S.O. Spatial Prediction of Slope Failures in Support of Forestry Operations Safety. Croat. J. For. Eng. J. Theory Appl. For. Eng. 2017, 38, 107–118.
Vahidnia, M.H.; Alesheikh, A.A.; Alimohammadi, A.; Hosseinali, F. A GIS-Based Neuro-Fuzzy Procedure for Integrating Knowledge and Data in Landslide Susceptibility Mapping. Comput. Geosci. 2010, 36, 1101–1114.
Sezer, E.A.; Pradhan, B.; Gokceoglu, C. Manifestation of an Adaptive Neuro-Fuzzy Model on Landslide Susceptibility Mapping: Klang Valley, Malaysia. Expert. Syst. Appl. 2011, 38, 8208–8219.
Tien Bui, D.; Pradhan, B.; Lofman, O.; Revhaug, I.; Dick, O.B. Landslide Susceptibility Mapping at Hoa Binh Province (Vietnam) Using an Adaptive Neuro-Fuzzy Inference System and GIS. Comput. Geosci. 2012, 45, 199–211.
Dao, D.V.; Jaafari, A.; Bayat, M.; Mafi-Gholami, D.; Qi, C.; Moayedi, H.; Phong, T.V.; Ly, H.-B.; Le, T.-T.; Trinh, P.T.; et al. A Spatially Explicit Deep Learning Neural Network Model for the Prediction of Landslide Susceptibility. CATENA 2020, 188, 104451.
Gupta, M.; Prakash, S.; Ghani, S. Enhancing Predictive Accuracy: A Comprehensive Study of Optimized Machine Learning Models for Ultimate Load-Carrying Capacity Prediction in SCFST Columns. Asian J. Civ. Eng. 2024, 25, 3081–3098.
Moayedi, H.; Canatalay, P.J.; Ahmadi Dehrashid, A.; Cifci, M.A.; Salari, M.; Le, B.N. Multilayer Perceptron and Their Comparison with Two Nature-Inspired Hybrid Techniques of Biogeography-Based Optimization (BBO) and Backtracking Search Algorithm (BSA) for Assessment of Landslide Susceptibility. Land 2023, 12, 242.
Bui, D.T.; Tsangaratos, P.; Nguyen, V.-T.; Liem, N.V.; Trinh, P.T. Comparing the Prediction Performance of a Deep Learning Neural Network Model with Conventional Machine Learning Models in Landslide Susceptibility Assessment. CATENA 2020, 188, 104426.
Thai Pham, B.; Shirzadi, A.; Shahabi, H.; Omidvar, E.; Singh, S.K.; Sahana, M.; Talebpour Asl, D.; Bin Ahmad, B.; Kim Quoc, N.; Lee, S. Landslide Susceptibility Assessment by Novel Hybrid Machine Learning Algorithms. Sustainability 2019, 11, 4386.
Marjanović, M.; Kovačević, M.; Bajat, B.; Voženílek, V. Landslide Susceptibility Assessment Using SVM Machine Learning Algorithm. Eng. Geol. 2011, 123, 225–234.
Koukouvelas, I.K.; Zygouri, V.; Nikolakopoulos, K.; Verroios, S. Treatise on the Tectonic Geomorphology of Active Faults: The Significance of Using a Universal Digital Elevation Model. J. Struct. Geol. 2018, 116, 241–252.
Smith, H.G.; Neverman, A.J.; Betts, H.; Spiekermann, R. The Influence of Spatial Patterns in Rainfall on Shallow Landslides. Geomorphology 2023, 437, 108795.
Kavzoglu, T.; Kutlug Sahin, E.; Colkesen, I. Selecting Optimal Conditioning Factors in Shallow Translational Landslide Susceptibility Mapping Using Genetic Algorithm. Eng. Geol. 2015, 192, 101–112.
Jang, J.-S.R. ANFIS: Adaptive-Network-Based Fuzzy Inference System. IEEE Trans. Syst. Man Cybern. 1993, 23, 665–685.
Mohan, S.; Vijayalakshmi, D.P. Genetic Algorithm Applications in Water Resources. ISH J. Hydraul. Eng. 2009, 15, 97–128.
Kennedy, J.; Eberhart, R. Particle Swarm Optimization. In Proceedings of the ICNN’95—International Conference on Neural Networks, Perth, WA, Australia, 27 November–1 December 1995; Volume 4, pp. 1942–1948.
Heidari, A.A.; Mirjalili, S.; Faris, H.; Aljarah, I.; Mafarja, M.; Chen, H. Harris Hawks Optimization: Algorithm and Applications. Future Gener. Comput. Syst. 2019, 97, 849–872.
Mirjalili, S.; Gandomi, A.H.; Mirjalili, S.Z.; Saremi, S.; Faris, H.; Mirjalili, S.M. Salp Swarm Algorithm: A Bio-Inspired Optimizer for Engineering Design Problems. Adv. Eng. Softw. 2017, 114, 163–191.
Koncar, N. Optimisation Methodologies for Direct Inverse Neurocontrol. Ph.D. Thesis, University of London, London, UK, 1997.
Stefánsson, A.; Končar, N.; Jones, A.J. A Note on the Gamma Test. Neural Comput. Appl. 1997, 5, 131–133.
Amin Benbouras, M.; Petrisor, A.-I. Prediction of Swelling Index Using Advanced Machine Learning Techniques for Cohesive Soils. Appl. Sci. 2021, 11, 536.
Smith, O.S. Covariance between Line per Se and Testcross Performance. Crop Sci. 1986, 26, 540–543.
Semahi, S.; Benbouras, M.A.; Mahar, W.A.; Zemmouri, N.; Attia, S. Development of Spatial Distribution Maps for Energy Demand and Thermal Comfort Estimation in Algeria. Sustainability 2020, 12, 6066.
Benbouras, M.A.; Kettab, R.M.; Debiche, F.; Lagaguine, M.; Mechaala, A.; Bourezak, C.; Petrişor, A.-I. Use of Geotechnical and Geographical Information Systems to Analyze Seismic Risk in Algiers Area. Rev. Şcolii Dr. Urban. 2018, 3, 11–24.

Figure 1. Geographical location of the study area.

Figure 2. The landslide inventory map of the study area.

Figure 3. Flowchart of the key steps of the research methodology for mapping areas susceptible to landslide.

Figure 4. Lithological map of the study area (source: geological map service of Algeria at the scale of 1:50,000).

Figure 5. Precipitation map of the study area (source: National Agency for Hydraulic Resources of Algeria at the scale of 1:500,000).

Figure 6. Elevation map of the study area (source: DEM generated from the SRTM database).

Figure 7. Slope map of the study area (source: DEM generated from the SRTM database).

Figure 8. Land cover map of the study area (source: Sentinel-2 satellite imagery).

Figure 9. Hydrographic network map of the study area (source: DEM generated from the SRTM database).

Figure 10. Road network map of the study area (source: Google Maps Road available on Google Earth).

Figure 11. Slope aspect map of the study area (source: DEM generated from the SRTM database).

Figure 12. Architecture of the ANFIS model.

Figure 13. Performance measures of the ANFIS-HHO model using the K-fold cross-validation with K = 5.

Figure 14. Landslide susceptibility map for the study area based on the ANFIS-HHO model.

Table 1. Slope aspect distribution.

Aspect	Surface Area (km²)	Percentage (%)
E	1067.24	12.16
SE	969.43	11.05
S	1683.89	19.19
SW	954.53	10.88
W	1051.25	11.98
NW	766.81	8.74
N	1397.69	15.93
NE	884.82	10.08

Table 2. Statistical indicators of the model fit quality used in the current study.

Statistical Indicators	Equations
Sensitivity (%)	$\text{Sensitivity} = \frac{TP}{TP + FN} \times 100$	(19)
Specificity (%)	$\text{Specificity} = \frac{TN}{TN + FP} \times 100$	(20)
Precision (%)	$\text{Precision} = \frac{TP}{TP + FP} \times 100$	(21)
Accuracy (%)	$\text{Accuracy} = \frac{TN + TP}{TN + TP + FP + FN} \times 100$	(22)
Pearson Correlation Coefficient (R)	$R = \frac{\sum_{i=1}^{N} (Y_{tar,i} - \bar{Y}_{tar}) (Y_{out,i} - \bar{Y}_{out})}{\sqrt{\sum_{i=1}^{N} (Y_{tar,i} - \bar{Y}_{tar})^{2} \sum_{i=1}^{N} (Y_{out,i} - \bar{Y}_{out})^{2}}}, \quad -1 \le R \le 1$	(23)
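
The indicators in Table 2 can be sketched in a few lines of Python. This is a minimal illustration, not the authors' implementation: the function names are hypothetical, and the confusion-matrix counts in the usage lines are illustrative values that are not reported in the paper.

```python
import math


def classification_metrics(tp, tn, fp, fn):
    """Return the four percentage indicators of Eqs. (19)-(22).

    tp, tn, fp, fn are the counts of true positives, true negatives,
    false positives and false negatives. No zero-division guard is
    included, so every denominator must be non-zero.
    """
    return {
        "sensitivity": 100.0 * tp / (tp + fn),
        "specificity": 100.0 * tn / (tn + fp),
        "precision": 100.0 * tp / (tp + fp),
        "accuracy": 100.0 * (tp + tn) / (tp + tn + fp + fn),
    }


def pearson_r(y_tar, y_out):
    """Pearson correlation between targets and model outputs, Eq. (23)."""
    mean_tar = sum(y_tar) / len(y_tar)
    mean_out = sum(y_out) / len(y_out)
    cov = sum((t - mean_tar) * (o - mean_out) for t, o in zip(y_tar, y_out))
    ss_tar = sum((t - mean_tar) ** 2 for t in y_tar)
    ss_out = sum((o - mean_out) ** 2 for o in y_out)
    return cov / math.sqrt(ss_tar * ss_out)


# Illustrative counts only (hypothetical, not taken from the paper).
metrics = classification_metrics(tp=49, tn=78, fp=1, fn=0)
r = pearson_r([0, 1, 0, 1], [0.1, 0.9, 0.2, 0.8])
```

With a single false positive, sensitivity stays at 100% while specificity, precision and accuracy each drop slightly, which shows why the indicators are reported together rather than relying on accuracy alone.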

Table 3. Parameters used and their subdivision into classes.

Parameter	Subdivision	Classes
Lithology (X1)	Weakly landslide-prone formations	1
	Moderately landslide-prone formations	2
	Landslide-prone formations	3
	Highly landslide-prone formations	4
Elevation (X2)	265–400 m	1
	400–600 m	2
	600–800 m	3
	800–1000 m	4
	1000–1200 m	5
	>1200 m	6
Slope (X3)	0–5°	1
	5–12.5°	2
	12.5–25°	3
	25–90°	4
Land cover (X4)	Residential	1
	Uncultivated	2
	Cultivated	3
	Grassland	4
	Forest	5
Distance to stream (X5)	0–25 m	1
	25–50 m	2
	50–100 m	3
	100–200 m	4
	200–300 m	5
	>300 m	6
Distance to road (X6)	0–25 m	1
	25–50 m	2
	50–100 m	3
	100–200 m	4
	200–300 m	5
	>300 m	6
Precipitation (X7)	100–200 mm	1
	200–300 mm	2
	300–400 mm	3
	400–500 mm	4
	600–800 mm	5
Slope aspect (X8)	East	1
	Southeast	2
	South	3
	Southwest	4
	West	5
	Northwest	6
	North	7
	Northeast	8
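
The reclassification of continuous factors into the ordinal classes of Table 3 can be sketched as a simple bin lookup. This is a hedged illustration: the dictionary keys and the `to_class` helper are hypothetical names, and the handling of values that fall exactly on a class boundary is an assumption, since the table does not say which class owns the boundary. Precipitation is left out because its class limits are not contiguous in the table.

```python
from bisect import bisect_left

# Upper class limits transcribed from Table 3. The last class of each
# factor is treated as open-ended (everything above the final limit).
UPPER_LIMITS = {
    "elevation_m": [400, 600, 800, 1000, 1200],      # classes 1-6
    "slope_deg": [5, 12.5, 25],                      # classes 1-4
    "distance_to_stream_m": [25, 50, 100, 200, 300],  # classes 1-6
    "distance_to_road_m": [25, 50, 100, 200, 300],    # classes 1-6
}


def to_class(value, upper_limits):
    """Map a continuous value to its 1-based class index.

    Assumption: a value exactly equal to a limit belongs to the lower
    class (e.g. a slope of 5 degrees is class 1).
    """
    return bisect_left(upper_limits, value) + 1


# Example: a cell at 720 m elevation, 14 degrees slope, 150 m from a stream.
cell_classes = (
    to_class(720, UPPER_LIMITS["elevation_m"]),
    to_class(14, UPPER_LIMITS["slope_deg"]),
    to_class(150, UPPER_LIMITS["distance_to_stream_m"]),
)
```

Swapping `bisect_left` for `bisect_right` would assign boundary values to the upper class instead; either convention works as long as it is applied consistently across all layers.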

Table 4. Matrix of the correlations between the conditioning factors (X1–X8) and the target (Y) (**: correlation is significant at the 0.01 level; *: correlation is significant at the 0.05 level).

		X1	X2	X3	X4	X5	X6	X7	X8	Y
X1	Correlation Coefficient	1.000	−0.316 **	0.324 **	−0.022	−0.061	0.074	−0.471 **	0.211 **	−0.560 **
X1	Significance (2-tailed)	.	0.000	0.000	0.781	0.447	0.354	0.000	0.007	0.000
X2	Correlation Coefficient	−0.316 **	1.000	−0.081	0.190 *	0.204 **	−0.164 *	0.217 **	−0.024	0.279 **
X2	Significance (2-tailed)	0.000	.	0.307	0.016	0.010	0.038	0.006	0.763	0.000
X3	Correlation Coefficient	0.324 **	−0.081	1.000	0.060	0.117	−0.012	−0.488 **	0.299 **	−0.590 **
X3	Significance (2-tailed)	0.000	0.307	.	0.454	0.139	0.881	0.000	0.000	0.000
X4	Correlation Coefficient	−0.022	0.190 *	0.060	1.000	−0.031	0.072	−0.092	0.125	−0.027
X4	Significance (2-tailed)	0.781	0.016	0.454	.	0.698	0.363	0.245	0.115	0.732
X5	Correlation Coefficient	−0.061	0.204 **	0.117	−0.031	1.000	−0.144	−0.050	−0.048	−0.042
X5	Significance (2-tailed)	0.447	0.010	0.139	0.698	.	0.069	0.534	0.544	0.597
X6	Correlation Coefficient	0.074	−0.164 *	−0.012	0.072	−0.144	1.000	0.014	−0.021	−0.046
X6	Significance (2-tailed)	0.354	0.038	0.881	0.363	0.069	.	0.856	0.796	0.567
X7	Correlation Coefficient	−0.471 **	0.217 **	−0.488 **	−0.092	−0.050	0.014	1.000	−0.356 **	0.671 **
X7	Significance (2-tailed)	0.000	0.006	0.000	0.245	0.534	0.856	.	0.000	0.000
X8	Correlation Coefficient	0.211 **	−0.024	0.299 **	0.125	−0.048	−0.021	−0.356 **	1.000	−0.379 **
X8	Significance (2-tailed)	0.007	0.763	0.000	0.115	0.544	0.796	0.000	.	0.000
Y	Correlation Coefficient	−0.560 **	0.279 **	−0.590 **	−0.027	−0.042	−0.046	0.671 **	−0.379 **	1.000
Y	Significance (2-tailed)	0.000	0.000	0.000	0.732	0.597	0.567	0.000	0.000	.
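
The caption of Table 4 does not name the coefficient it reports. Because the inputs are ordinal class codes, a rank correlation such as Spearman's rho is one plausible choice, and the sketch below computes it in plain Python under that assumption. The function names and the toy data are hypothetical, and the two-tailed significance test is not reproduced here.

```python
import math


def average_ranks(values):
    """Rank values from 1..n, giving tied values the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of items tied with the item at position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1  # sorted positions i..j share one rank
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def spearman_rho(x, y):
    """Spearman's rho: the Pearson correlation of the tie-averaged ranks."""
    rx, ry = average_ranks(x), average_ranks(y)
    mx, my = sum(rx) / len(rx), sum(ry) / len(ry)
    cov = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    sx = math.sqrt(sum((a - mx) ** 2 for a in rx))
    sy = math.sqrt(sum((b - my) ** 2 for b in ry))
    return cov / (sx * sy)


# Toy data (hypothetical): slope class per sample and a 0/1 landslide label.
slope_class = [1, 2, 2, 3, 4, 4, 1, 3]
landslide = [0, 0, 1, 1, 1, 1, 0, 0]
rho = spearman_rho(slope_class, landslide)
```

Averaging the ranks of tied values matters here, since class-coded factors with four to eight levels produce many ties among 160 samples.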

Table 5. Optimal input variable nomination using GT.

Input Parameters	Gamma Test Statistic (Γ)	V_ratio
All	0.1097	0.4681
All-X1	0.1438	0.6135
All-X2	0.1040	0.4436
All-X3	0.1212	0.5173
All-X4	0.0990	0.4225
All-X5	0.114	0.487
All-X6	0.1093	0.4663
All-X7	0.1167	0.4980
All-X8	0.1630	0.6954
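
One way to read Table 5 is to compare each leave-one-out Γ with the Γ of the full input set: under the usual interpretation of the Gamma Test, a rise in Γ after removing a factor suggests that the factor carries useful information, while a drop suggests that it mostly adds noise. The sketch below applies that reading to the values in the table. The function name and the sign-based cut-off are assumptions, and this section does not state which inputs the authors finally retained.

```python
# Gamma statistics transcribed from Table 5. "All" is the full input set;
# each other entry is the run with that single factor removed.
GAMMA_ALL = 0.1097
GAMMA_WITHOUT = {
    "X1": 0.1438, "X2": 0.1040, "X3": 0.1212, "X4": 0.0990,
    "X5": 0.114, "X6": 0.1093, "X7": 0.1167, "X8": 0.1630,
}


def rank_by_gamma_increase(gamma_all, gamma_without):
    """Sort factors by Gamma(without factor) - Gamma(all), largest first."""
    deltas = {
        name: round(gamma - gamma_all, 4)
        for name, gamma in gamma_without.items()
    }
    return sorted(deltas.items(), key=lambda item: item[1], reverse=True)


ranking = rank_by_gamma_increase(GAMMA_ALL, GAMMA_WITHOUT)

# Assumed cut-off: keep a factor only if removing it raises Gamma.
helpful = [name for name, delta in ranking if delta > 0]
for name, delta in ranking:
    print(f"{name}: delta Gamma = {delta:+.4f}")
```

The V_ratio column follows the same ordering because it is Γ divided by the variance of the output, which is a constant for a fixed data set (about 0.234 for the values in the table).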

Table 6. Performance of the proposed models in the training and validation phases (all indicators in %).

	Sensitivity	Specificity	Precision	Accuracy	R
Training
ANFIS-GA	100	94.937	92.16	96.83	92
ANFIS-PSO	95.83	96.25	93.88	96.09	92.99
ANFIS-HHO	100	98.734	98	99.22	99.21
ANFIS-SSA	100	97.56	95.83	98.44	97.14
Validation
ANFIS-GA	100	95.833	88.89	96.875	92.31
ANFIS-PSO	100	95.45	90.909	96.875	93.6
ANFIS-HHO	100	100	100	100	99.97
ANFIS-SSA	85.71	100	100	93.75	87.26

Table 7. A comparison between our ANFIS-HHO model and models proposed in the literature.

Authors	Database	Best Methods	Classification Accuracy (%)	Sensitivity (%)	Specificity (%)
Tien Bui et al. [26]	72	Artificial Neural Networks	90.2	80.25	94.55
Kavzoglu et al. [39]	39	Support Vector Machines	94.434	-	-
Yeon et al. [50]	600	Decision Trees	89.26	-	-
Dou et al. [52]	44	Random Forest	94.6	96.1	93.0
Aghdam et al. [59]	1292	ANFIS	90	-	-
Dao et al. [66]	217	Deep Neural Network	90.1	-	-
Benbouras [8]	78	PSOGSA-ANN	95.1	95.1	97.5
Our study	160	ANFIS-HHO	99.22	100	98.734

© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
