Long Short-Term Memory Based Subsurface Drainage Control for Rainfall-Induced Landslide Prevention

Biniyaz, Aynaz; Azmoon, Behnam; Sun, Ye; Liu, Zhen

doi:10.3390/geosciences12020064

Open AccessArticle

Long Short-Term Memory Based Subsurface Drainage Control for Rainfall-Induced Landslide Prevention

¹

Department of Civil, Environmental and Geospatial Engineering, Michigan Technological University, Houghton, MI 49931, USA

²

Department of Mechanical and Aerospace Engineering, University of Virginia, Charlottesville, VA 22904, USA

^*

Author to whom correspondence should be addressed.

Geosciences 2022, 12(2), 64; https://doi.org/10.3390/geosciences12020064

Submission received: 23 November 2021 / Revised: 18 January 2022 / Accepted: 26 January 2022 / Published: 30 January 2022

(This article belongs to the Collection New Advances in Geotechnical Engineering)

Download

Browse Figures

Versions Notes

Abstract

:

Subsurface drainage has been widely accepted to mitigate the hazard of landslides in areas prone to flooding. Specifically, the use of drainage wells with pumping systems has been recognized as an effective short-term solution to lower the groundwater table. However, this method has not been well considered for long-term purposes due to potentially high labor costs. This study aims to investigate the idea of an autonomous pumping system for subsurface drainage by leveraging conventional geotechnical engineering solutions and a deep learning technique—Long-Short Term Memory (LSTM)—to establish a geotechnical cyber-physical system for rainfall-induced landslide prevention. For this purpose, a typical soil slope equipped with three pumps was considered in a computer simulation. Forty-eight cases of rainfall events with a wide range of varieties in duration, total rainfall depths, and different rainfall patterns were generated. For each rainfall event, transient seepage analysis was performed using newly proposed Python code to obtain the corresponding pump’s flow rate data. A policy of water pumping for maintaining groundwater at a desired level was assigned to the pumps to generate the data. The LSTM takes rainfall event data as the input and predicts the required pump’s flow rate. The results from the trained model were validated using evaluation metrics of root mean square error (RMSE), mean absolute error (MAE), and R². The R²-scores of 0.958, 0.962, and 0.954 for the predicted flow rates of the three pumps exhibited high accuracy of the predictions using the trained LSTM model. This study is intended to make a pioneering step toward reaching an autonomous pumping system and lowering the operational costs in controlling geosystems.

Keywords:

landslide prevention; disaster resilience; geotechnical cyber-physical systems; long short-term memory; deep learning; transient flow

1. Introduction

Natural disasters cause severe human and economic losses all over the world [1]. Landslides account for approximately 5% of natural disasters and result in economic losses of $1.6 to $3.2 billion in the United States and approximately 25–50 deaths annually [2]. Researchers have conducted landslide susceptibility assessments in which landslide susceptibility is defined as the probability of landslide occurrence in an area considering various geoenvironmental conditions, and rainfall has been identified as one of the main factors influencing landslide occurrence [3,4,5,6,7]. The 2010 Maierato landslide [8], the 2014 Oso landslide [9], the 2018 Mengdong landslides [10], and annually occurring landslides in Rize, Turkey [11] are examples of rainfall-induced landslides. The frequency, duration, and intensity of rainfall events have been affected by climate change in recent decades, leading to an increase in landslides [12,13,14,15]. Intense and prolonged precipitation can cause a rise in the groundwater table, which decreases the stability of the slope. Additionally, the rain infiltrating the slope increases the degree of saturation and the unit weight of soil. The resultant increase in soil saturation also causes a decrease in the matric suction and shear strength of the unsaturated soil [16,17,18,19], which can adversely impact the safety of geosystems such as slopes. The magnitude and rate of the impact on the soil properties and the safety factor of a slope depend on the intensity and the duration of the rain [20,21]. Slope failures induced by rainfall events and the above mechanisms can cause massive economic losses and fatalities [2]; for example, [22] reported that monsoon flooding and landslides in the 2019 summer impacted more than 7 million people in Nepal, India, and Bangladesh.

To mitigate landslide hazards in areas prone to flooding, drainage has been widely used as a stabilization method [23,24,25,26]. Lowering the groundwater table using subsurface drainage systems increases slope stability. The lowered water table reduces the driving forces, which increases the factor of safety (FS). Most drainage systems are designed to use the gravity flow to reduce costs. In some cases, a pumping system might be necessary to remove water from deep drainage wells due to an inadequate gravity force [27].

Deep wells equipped with pumps and well-point systems are two pumping-based techniques that can lower the groundwater level in shallow and deep excavations, respectively [28]. For example, deep wells equipped with pumps were utilized as a primary solution to stabilize a slope near Genoa, Italy, in 1987 [29,30]. Such systems are also applicable in mining projects to reduce the risk of liquefaction of mine tailings due to earthquakes [31]. The operation of a well-point system requires a specialist operator to tune the flow valves to prevent the water level from falling below the top of the screen, as any air entering the system might cause a pump malfunction. Regular checks may also be required in pumping-based techniques to control seepage in sites subject to flooding [28,32]. The high operational cost of such systems is the primary factor preventing pumping systems from developing into long-term solutions for dewatering [28].

To overcome such problems, the goal of this study is to investigate the idea of an autonomous pumping system for subsurface drainage by leveraging conventional geotechnical engineering solutions and deep learning techniques to establish a geotechnical cyber-physical system (CPS) for rainfall-induced landslide prevention. To the best of our knowledge, this study is the first to explore the application of deep learning techniques in practical geotechnical CPSs. Specifically, we propose to use the Long Short-Term Memory (LSTM) recurrent network to learn the control policy of water pumping from the observed data during rainfall events so the trained LSTM can be used to run pumps for desired control outcomes with the predicted pump’s flow rate. A virtual environment was created to generate the data for training and testing of the proposed LSTM model, which has been validated with commercial software. Accordingly, Python code was developed to implement a transient seepage model for a typical soil slope equipped with three pumps and subjected to rainfall events. The proposed numerical simulation framework was integrated with deep learning, which cannot be achieved with commercially available software for transient seepage analysis. The LSTM model was then trained to take the rainfall data as input and predict the required pump’s flow rate data.

The proposed method can identify unnoticed knowledge in observed data that can enable autonomous water pumping in geosystems toward data-driven resilience. The proposed system can significantly reduce operational costs and improve safety by replacing human-controlled and managed geosystems. The main scientific contributions can be summarized as follows:

Geotechnical CPS: This study is the first to leverage LSTMs for subsurface drainage control for landslide prevention as a concept. The established geotechnical CPS in this study can be used for establishing autonomous pumping systems to increase safety and to reduce the operational costs of geosystems.
Data-driven disaster resilience: The proposed method can learn from data and experiences by integrating deep learning techniques into geotechnical engineering solutions, which is among the first to control an infrastructure system involving complex physics in geomaterials and to provide unique interventions to hazard prevention toward data-driven disaster resilience.
Physics-based model for data generation: We establish a new physics-based model for the transient seepage analysis of geosystems considering precipitation and pumping, which is needed to stimulate the behavior of such CPSs for improved drainage and landslide prevention. The developed model is written in Python as a free and open-source framework. This new model is then employed to generate data for deep learning (i.e., training and testing) and utilized as the independent testing environment to validate predictions for controlling groundwater. Integration with other deep learning algorithms is an advantage of the proposed model compared with the commercially available software for transient seepage analysis. Additionally, the governing equation and the auxiliary equations applied in this study can be easily modified to allow for more complex seepage analysis.

The rest of the paper is organized as follows: Section 2 discusses the data generation approaches; Section 3 introduces the background of LSTM, then explains the proposed model, required data preprocessing, and evaluation criteria; Section 4 presents the results and discussions; and, finally, Section 5 concludes the study.

2. Data Acquisition

To generate the data for training and testing of the proposed LSTM model, a transient seepage model was developed to simulate a lab-scale geosystem equipped with three pumps and subjected to rainfall events. In our previous study, we established a similar physical model that seamlessly coupled seepage and slope stability analysis to better understand the influence of water level changes on the safety status of slopes [33]. The computational framework of the physical model was cross-validated by commercial software in our previous study [33].

The goal of this study is to explore integrating the physical geosystems and the man-made interventions with a deep learning algorithm (i.e., LSTM) to learn the policy of water pumping from historical data and to establish an autonomous water pumping system for long-term use. Landslides are often the results of extreme weather events, as explained in Section 1. The data directly influencing slope stability for landslide prevention include the following four categories: (1) the geomaterial characteristics of the slope, mainly the associated unsaturated and saturated soil parameters; (2) the geometry of the slope and the pumps; (3) rainfall data and detailed information on pumps including each pump’s flow rate policy, location, and initial water table; and (4) time variables. Thus, we first present the data generation in Section 2.

2.1. Generation of Rainfall Data

The design of rainfall events requires the determination of three basic parameters: (1) duration, (2) depth of precipitation, and (3) rain intensity. According to German guidelines, DVWK (Deutschen Verbandes für Wasserwirtschaft und Kulturbau (DVWK)), there are four possible intensity distribution patterns for rainfall events, as shown in Figure 1 [34]. In the first typical rainfall pattern, Figure 1a, the rainfall intensity is constant over time. Figure 1b shows the rainfall distribution with a descending pattern. In this type of distribution, maximum rainfall intensity occurs at the onset of precipitation. In the third type of distribution shown in Figure 1c, the maximum intensity occurs in the middle of the rainfall event duration, similar to a normal distribution. In the last type in Figure 1d, the rainfall intensity increases over time and reaches the maximum intensity at the end of the precipitation.

Forty-eight sets of rainfall data with a wide range of varieties in duration, total precipitation depth, and patterns were generated. Three values, (i.e., 10 min, 15 min, and 20 min, were selected for the rainfall duration. Four values were considered for the total rainfall depth: 10 mm, 15 mm, 20 mm, and 25 mm. Generated rainfall events typically have one of the four typical patterns described in Figure 1a–d, which will be referred to as types “a”, “b”, “c”, and “d”, respectively. It is noted that the maximum rainfall intensity in all cases was less than the saturated hydraulic conductivity of the chosen soil to prevent runoff, allowing all precipitation to infiltrate the soil [35].

2.2. Pump Flow Rate Data Acquired from Transient Seepage Analysis

2.2.1. Governing Equation for the Transient Seepage Model

A transient seepage analysis was performed to obtain the corresponding pump’s flow rate for each rainfall event. The governing equation for a transient saturated-unsaturated seepage model was obtained by modifying the Richards Equation [36],

S \frac{\partial (h + z)}{\partial t} = K \times \nabla (\nabla (h + z)) + q_{H},

(1)

where

h

is the pressure head with the unit of meter,

z

is the elevation head with the unit of meter,

q_{H}

is the sink/source term representing the applied flux at the boundaries with the unit of meter per second, and

S

and

K

are defined based on the saturation degree as follows,

{\begin{cases} Saturated Flow \to K = K_{s}, S = S_{s} \\ Unsaturated Flow \to K = K_{s} K_{r}, S = S_{c} \end{cases},

(2)

where, for the saturated flow,

S_{s}

is the specific storage of saturated flow with the unit of 1/m, and

K_{s}

is the saturated hydraulic conductivity with the unit of meter per second; for the unsaturated flow,

S_{c}

is the specific moisture content with the unit of 1/m and

K_{s} K_{r}

is the unsaturated hydraulic conductivity, in which

K_{r}

is the relative hydraulic conductivity. Relative hydraulic conductivity defines the way in which the hydraulic conductivity changes with the degree of effective saturation (

S_{e}

). The following equation proposed by van Genuchten [37] was adopted for formulating

K_{r}

:

K_{r} = S_{e}^{b} {(1 - {(1 - S_{e}^{1 / a})}^{a})}^{2},

(3)

S_{e} = {[1 + {(\frac{ψ}{P_{0}})}^{\frac{1}{1 - a}}]}^{- a},

(4)

where

a

,

b

, and

P_{0}

(with the unit of Pascal) are fitting parameters derived from the Soil-Water Characteristic Curve (SWCC),

S_{e}

is the effective saturation degree, and

ψ

(with the unit of Pascal) is the matric suction [17]. The matric suction is calculated using

ψ = γ_{w} h

, where

γ_{w}

is the unit weight of water.

The specific moisture content for unsaturated flow (

S_{c}

) defines the rate of change in the water content per unit change of the negative water head [36].

S_{c}

is the derivative of volumetric water content

θ

with respect to

h

:

S_{c} = | \frac{\partial θ}{\partial h} | = n \frac{\partial S_{e}}{\partial h} = \frac{n a}{h (a - 1)} {(1 + {(\frac{ψ}{P_{0}})}^{\frac{1}{1 - a}})}^{- a - 1} {(\frac{ψ}{P_{0}})}^{\frac{1}{1 - a}},

(5)

where

n

is the soil porosity.

Specific storage of the saturated flow

S_{s}

is the volume of water released from a unit volume of aquifer per unit decline in the hydraulic head.

S_{s}

is a function of the compressibility values of soil and water and the soil porosity [38]:

S_{s} = \frac{1}{V_{t}} \frac{d V_{w}}{d h} = ρ_{w} g (C_{s} + n C_{w}),

(6)

where

ρ_{w}

is the density of water with the SI unit of

kg / m^{3}

,

g

is the gravitational acceleration,

C_{s}

is the soil compressibility with the SI unit of

{ms}^{2} / kg

, and

C_{w}

is the water compressibility with the SI unit of

{ms}^{2} / kg

.

2.2.2. Geometry and Boundary Conditions

As shown in Figure 2, a typical slope profile consisting of three pumps, was used in the analyses. The slope geometry, boundary conditions, and location of pumps were determined from the lab-dimensional model. It was assumed that there was no input flux other than rainfall. Pumps were modeled as a sinkhole. The diameter of the sinkhole for all of the three pumps was 2 inches. The initial groundwater table is set to the level of 6 inches.

The no-flux boundary condition was applied to the left and right sides of the slope above the groundwater table (i.e., DF and GA) and along the bottom of the slope (i.e., FG). This type of boundary condition was formulated using a Neumann boundary condition as Equation (7),

Neumann BC : - \nabla (h + z) \cdot \vec{n} = 0 on Γ_{(DF, FG, GA)} for t > 0,

(7)

The slope’s surface boundaries (i.e., AB, BC, and CD) were set to the influx boundary conditions to simulate rainfall infiltration at the slope’s surface. Water ponding was not considered for the slope’s surface since the rain intensities (

I_{r}

) were less than the soil infiltration capacity,

Neumann BC : - \nabla (h + z) \cdot \vec{n} = I_{r} on Γ_{(AB, BC, CD)} for t > 0,

(8)

The outflux boundary condition was adopted for the sinkhole boundaries of the pumps. The definition of flux is the discharge per unit area per unit time (i.e., m/s). Based on this definition, the Neumann boundary condition for the pumps was formulated as Equation (9):

Neumann BC : - \nabla (h + z) \cdot \vec{n} = α \frac{Q_{p}}{2 π r} on Γ_{(Pump 1, Pump 2, Pump 3)} for t > 0,

(9)

where

Q_{p}

is the full capacity of the pumps, which was considered to be 1 gpm (gallon per minute),

r

is the radius of the sinkhole for the pumps, and

α

is a value between 0 and 1 that was defined based on a simple policy for each pump to keep the water level at the target level. The target level in this model is the initial water level (i.e., 6″). The policies for pumps 1, 2, and 3 were formulated according to the range of water head at points P₁, P₂, and P₃ (see Figure 2), which are presented in Equations (10)–(12), respectively:

Pump 1 : \begin{array}{l} If h (P_{1}) = 0 m \to α = 0 \\ If 0 < h (P_{1}) < 0.4 m \to α = h (P_{1}) \times 2.5, \\ If h (P_{1}) \geq 0.4 m \to α = 1 \end{array}

(10)

Pump 2 : \begin{array}{l} If h (P_{2}) = 0 m \to α = 0 \\ If 0 < h (P_{2}) < 0.1 m \to α = h (P_{2}) \times 10, \\ If h (P_{2}) \geq 0.1 m \to α = 1 \end{array}

(11)

Pump 3 : \begin{array}{l} If h (P_{3}) = 0 m \to α = 0 \\ If 0 < h (P_{3}) < 0.15 m \to α = h (P_{3}) \times 6.67, \\ If h (P_{3}) \geq 0.15 m \to α = 1 \end{array}

(12)

where

h (P_{1})

,

h (P_{1})

, and

h (P_{3})

are the water head values at points P₁, P₂, and P₃, respectively. The purpose of this study is to prove that a control policy such as that used by an experienced engineer to control the drainage of a geosystem can be learned. If successful, we can utilize existing control data or engineers’ experiences to generate an autonomous subsurface drainage control system. To support proof of concept, the above policy and a clear mathematical formulation were adopted.

2.2.3. Soil Properties

Soil properties for the transient saturated-unsaturated flow model are presented in Table 1. Figure 3 represents the SWCC and the relationship between the relative hydraulic conductivity and the effective saturation for the chosen sandy soil. It is noted that the values for the specific storage (

S_{s}

) for saturated flow were assumed based on the ranges provided by [38] for sandy soils. To focus on the proof of the concept in this study, sandy soil was selected in this study, considering the relatively short seepage analysis time and low computational cost. Proving this concept with coarse-grained soils can also confirm the applicability of this method to other soils as long as the governing physics principles behind the transient seepage analysis do not change significantly.

2.2.4. Numerical Implementation

The seepage analysis was implemented using the DOLFIN package, a Python interface of FEniCS. FEniCS is a finite element analysis platform to solve nonlinear partial differential equations (PDEs). The procedure for solving the governing equation for the transient seepage analysis was summarized in Figure 4. The required input parameters to solve the PDE included unsaturated soil characteristics (

a, b, P_{0}, n

); saturated specific storage (

S_{s}

); saturated hydraulic conductivity (

K_{s}

); slope geometry; the geometry of pumps; the location of points P₁, P₂, and P₃, boundary conditions including the rain intensity and the pumps’ flow rate policy, the initial groundwater table, and time variables (total time

T = 120 \min

, and time interval

Δ t = 1 \min

). A total time of 120 min was considered in the seepage analysis to ensure that the groundwater reaches its initial condition at the end of pumping. A mesh of 3-node Lagrangian elements was generated in the computational domain of the slope (

Ω

). Boundary conditions were applied to the subdomain as described in Section 2.2.2. The initial boundary condition was specified as the solution to the PDE at

t = 0

. Auxiliary equations for the unsaturated hydraulic conductivity and effective saturation were defined. Next, the PDE for the governing equation was reformulated as a finite element variational problem. For each time step, the boundary conditions for the slope’s surface and the pumps were updated based on the rain intensity and water head of points P₁, P₂, and P₃ at the time

t_{i}

. Solving the PDE gives the total water head distribution at the time

t_{i}

_, which was utilized to update the pumps’ flow rate at the time

t_{i + 1}

. The output of the seepage analysis is the flow rate of pumps during and after the rainfall event.

Figure 5 shows the output of the seepage analysis for pumps’ flow rate in a typical rainfall event. The seepage analysis was performed for all generated rainfall events (48 sets). Since the flow rates of all the three pumps reached zero at the end of each set, the datasets from all 48 sets were combined to produce a time series of the pumps’ flow rate and rainfall. The time series of rainfall shown in Figure 6 was used as the input to the LSTM model, and the corresponding flow rates for pumps 1, 2, and 3 shown in Figure 7 were defined as the output.

3. Methodology

In the past decades, machine learning techniques have been applied to autonomous systems because of their ability to discover knowledge from observed data [39]. These techniques have been widely used because they do not require predefined input/output relationships [40]. The LSTM is a type of recurrent neural network (RNN) that has a memory cell to store information and dependencies long term [41]. The LSTM has shown promising results in time-series prediction tasks such as traffic flow prediction [42], travel time prediction [43], predicting water table depth in an agricultural area [44], flood forecasting [45], rainfall prediction [46], prediction of water levels in sewer systems [47], steam flow forecasting of small rivers [48], and prediction of displacement in multifactor-induced landslides [49]. Besides, the LSTM performs better when capturing various data patterns than the traditional neural network models [50,51]. Thus, the deep learning model for the proposed autonomous water pumping system was established with the LSTM.

3.1. Background: Long Short-Term Memory

The LSTM was initially proposed by [41]. The LSTM is a special kind of RNN that transfers historical information and long-term dependencies through time sequences [52]. Figure 8 shows the general structure of an RNN, which includes three layers: the input, hidden, and output layers. The output of each time step depends on the current time step’s input and the output from the previous time step. This dependency is transferred by the hidden layer units connected through time. In an RNN unit, the recurrent hidden state at time step

t

(i.e.,

h_{t}

) is updated using Equation (13).

h_{t} = \tanh (U \cdot X_{t} + W \cdot h_{t - 1} + b_{h}),

(13)

where

X_{t}

is the input at time-step

t

,

U

is the weight matrix between the input and the hidden layer,

W

is the weight matrix between hidden units,

h_{t - 1}

is the hidden state in the previous time-step,

b_{h}

is the bias vector of the hidden layer, and

\tanh

is a hyperbolic tangent function that transforms the input to a value between −1 and 1. The output of the time step

t

(i.e.,

y_{t}

) is obtained from Equation (14).

y_{t} = V \cdot h_{t} + b_{y},

(14)

where

V

is the weight matrix between the hidden layer and the output layer and

b_{y}

is the bias vector of the output layer. Although the RNN is a rigorous tool in time series prediction, it could not preserve the dependencies for a long time [52,53].

The LSTM has a memory cell that helps maintain long-term dependencies, as illustrated in Figure 9. Three gates control this memory cell: input gate, forget gate, and output gate [43,54]. The state of the three gates is updated in each time step using the following equations Equations (15)–(17).

The updated state of the input gate at the time

t

:

I_{t} = σ (U_{X}^{I} \cdot X_{t} + W_{h}^{I} \cdot h_{t - 1} + b_{I}),

(15)

The updated state of the forget gate at the time

t

:

F_{t} = σ (U_{X}^{F} \cdot X_{t} + W_{h}^{F} \cdot h_{t - 1} + b_{F}),

(16)

The updated state of the output gate at the time

t

:

O_{t} = σ (U_{X}^{O} \cdot X_{t} + W_{h}^{O} \cdot h_{t - 1} + b_{O}),

(17)

where

U_{X}^{I}

,

U_{X}^{F}

, and

U_{X}^{O}

are the weight matrices of the input, forget, and output gates for the input (

X_{t}

), respectively;

W_{h}^{I}

,

W_{h}^{F}

, and

W_{h}^{O}

are the weight matrices of the three gates for

h_{t - 1}

in which

h_{t - 1}

is the hidden state in the previous time-step,

t - 1

;

b_{I}

,

b_{F}

, and

b_{O}

are the bias vectors of the three gates;

σ

is the sigmoid activation function, which transforms the input to a value between 0 and 1 [43].

The memory cell is updated each time step by the forget gate and the input gate, enabling the cell to store information for a long time without a gradient vanishing problem. The output of the memory cell

C_{t}

is calculated using Equation (18). The forget gate is responsible for forgetting irrelevant information from the past [52,54].

C_{t} = (I_{t} \otimes {\tilde{C}}_{t}) + (F_{t} \otimes C_{t - 1}),

(18)

where

C_{t - 1}

is the previous state of the memory cell and

{\tilde{C}}_{t}

is the candidate input for the memory cell at a time

t

:

{\tilde{C}}_{t} = \tanh (U_{X}^{C} \cdot X_{t} + W_{h}^{C} \cdot h_{t - 1} + b_{C}),

(19)

where

U_{X}^{C}

and

W_{h}^{C}

are the weight matrices of the memory cell for

X_{t}

and

h_{t - 1}

, respectively, and

b_{C}

is the bias vector of the memory cell [43].

The hidden layer’s state at time t (

h_{t}

) is calculated with Equation (20):

h_{t} = O_{t} \otimes \tanh (C_{t}) .

(20)

The output of the LSTM unit is obtained with Equation (21):

y_{t} = V \cdot h_{t} + b_{y},

(21)

where

V

is the weight matrix between the hidden layer and the output layer and

b_{y}

is the bias vector of the output layer [43].

3.2. Proposed LSTM Model

In the practical water pumping system for rainfall-induced landslide prevention, the flow rate of pumps is the operating parameter that workers can manually control. Thus, this study aims to predict the corresponding flow rate of pumps for the given rainfall data to realize the autonomous water pumping system and provide timely interventions. The pumps’ flow rates depend on the information given for the rainfall intensity information provided. The LSTM network was constructed using the Keras deep learning library (Version 2.2.4) installed with the Tensorflow backend. It is a common practice in the literature to divide the total data into the training and testing sets, with a ratio of 70–80 percent for training and 20–30 percent for testing [55,56,57]. The data for 36 sets of rainfall events (36 × 121 = 4356 rows), including different patterns for the rainfalls and the corresponding data for the pumps’ flow rates (75% of the dataset), were used to train the proposed LSTM model. Then, the model’s performance on the remaining 25% of the dataset, including 12 sets of rainfall events (12 × 121 = 1452 rows) with a new pattern and the corresponding data for the pumps’ flow rates, was validated. Since this problem is a regression task, the mean absolute error (MAE) was used as a loss function and is calculated using Equation (22):

MAE = \frac{1}{n} \sum_{i = 1}^{n} | y_{i} - {\hat{y}}_{i} |,

(22)

where

n

is the total number of data,

y_{i}

is the actual value, and

{\hat{y}}_{i}

is the predicted value for the pump’s flow rate.

3.3. Data Preprocessing

3.3.1. Scaling

Datasets including variables in different ranges and units might harm the learning ability of the model [44]. Scaling the dataset can improve the model’s performance in the training process [58]. The Min-Max normalization method was applied to each feature separately, which scaled the data values linearly between [0, 1] as follows:

X_{n o r m o l i z e d} = \frac{x - x_{\min}}{x_{\max} - x_{\min}},

(23)

where

x_{\min}

and

x_{\max}

are the minimum and maximum values of features on their current scale [59].

3.3.2. Transform the Dataset into a Supervised Learning Problem

This step is required to use the LSTM algorithm. The purpose is to train the LSTM model to learn the dependencies in the dataset and map the output to the given input. To frame the dataset as a supervised learning problem, the input and the output of the model were defined for each time step. In this study, the input is the rainfall intensity in the next 120 min (

I_{r_{t}}, I_{r_{t + 1}}, \dots, I_{r_{t + 120}}

) and the expected output is the flow rates of pump 1 (i.e.,

q_{1_{t}}, q_{1_{t + 1}}, \dots, q_{1_{t + 120}}

), pump 2 (i.e.,

q_{2_{t}}, q_{2_{t + 1}}, \dots, q_{2_{t + 120}}

), and pump 3 (i.e.,

q_{3_{t}}, q_{3_{t + 1}}, \dots, q_{3_{t + 120}}

) in the next 120 min.

3.4. Model Evaluation Metrics

The predictions were evaluated using three performance metrics, the root mean square error (RMSE), the mean absolute error (MAE), and the coefficient of determination (R²). The RMSE and MAE are useful evaluation metrics for regression problems. The RMSE (Equation (24)) and MAE (Equation (22)) take a value between

[0, + \infty]

and have the same unit as the variable of interest. Values of RMSE and MAE close to 0 indicate a model’s good performance in prediction [60]. The R²-score, also called goodness of fit, measures how well a model can predict the desired variable and fit the observed data [61]. The R² (Equation (25)) gives a unitless score between 0 and 1 in which the score of 1 represents the optimal forecast model [62].

RMSE = {[\frac{1}{n} \sum_{i = 1}^{n} {(y_{i} - {\hat{y}}_{i})}^{2}]}^{\frac{1}{2}},

(24)

R^{2} = 1 - \frac{\sum_{i = 1}^{n} {(y_{i} - {\hat{y}}_{i})}^{2}}{\sum_{i = 1}^{n} {(y_{i} - \bar{y})}^{2}},

(25)

where

n

is the total number of data and

\bar{y}

is the mean value of the observed data.

4. Results and Discussions

In this section, the results generated in the search for the optimal LSTM architecture are presented. Then, the predictions obtained from the optimal architecture for the testing set are validated. Finally, the influence of rainfall patterns, which could affect the generalization of the method, is discussed.

4.1. Search for Optimal LSTM Architecture and Hyperparameters

The optimal LSTM architecture was achieved by optimizing the LSTM hyperparameters. LSTM hyperparameters include the numbers of LSTM layers, units in each LSTM layer, fully connected layers (or dense layers), units in each dense layer, and epochs, as well as the batch size and the optimization learning algorithm. The number of epochs defines the number of times the algorithm completes the learning process on the entire dataset (training and testing sets). The dataset is usually divided into a number of batches to complete one epoch. The batch size refers to the number of training samples used in a single batch. By optimizing the results of the evaluation metrics, the number of epochs and the batch size for all models were determined to be 300 and 50, respectively. It is noted that each LSTM model was trained using an adaptive optimization learning algorithm, (i.e., Adam), with a learning rate of 0.001.

Hyperparameters were chosen from a set of selected candidate values, as shown in Table 2. In the experimented models, the number of LSTM layers was set to be 1, 2, or 3 layers, and the number of units for an LSTM layer were selected from values of 100, 50, 30, and 20. The models have one or two dense layers. In the sequence of dense layers, the number of units for the last dense layer represents the number of outputs, or predictions. In the case of two dense layers, 500 was chosen as the number of units was chosen to be 500 and 363 was selected for the first and second dense layers, respectively. In the case of one dense layer, the number of units was set the same as the output dimension of 363. The output of this model includes 120 min (the next 121time steps) of flow rates for three pumps (3 × 121 = 363 data points).

Models with the hyperparameters mentioned above were evaluated using performance metrics of RMSE and R² as presented in Table 3. The optimal hyperparameters for the proposed model were achieved by evaluating the RMSE and R² scores. It can be seen from Table 3 that increasing the LSTM layers to two or three did not improve the RMSE or R² scores. Thus, the number of LSTM layers was optimized to one. For the unit size of the LSTM layer, it was found that a unit number smaller than 50 affected the learning ability of the model. For 20 and 30 units, the recorded R² scores are 0.949 and 0.932, respectively, which are smaller than R² = 0.962 for 50 units. A higher number of units for the LSTM layer (i.e., 100) did not improve the model’s accuracy. Based on the evaluation results, 50 was determined the optimal value for the number of units of the LSTM layer. With an R² score of 0.962, the model with only one dense layer showed better performance than two dense layers which had an R² score of 0.952.

A summary of the optimized hyperparameters is reported in Table 4. The optimal architecture comprises one LSTM layer with 50 units and a fully connected layer (or dense layer) at the top of the LSTM layer. The number of units for the dense layer is 363, the same as the output size. The epoch and the batch size selected for this model were 300 and 50, respectively. The proposed LSTM model has the best performance in the evaluation metrics: RMSE = 0.015 GPM and R² = 0.962.

4.2. Training and Testing with the Optimal LSTM Architecture

The model was trained with the optimal architecture introduced in Table 4, and the trained model was then validated on the testing set. Figure 10 shows the learning curves of the training and the testing sets for the optimal LSTM architecture. The MAE score, which was used as a loss function for both training and testing sets, decreases to the point of stability as the number of epochs increases. Additionally, there is a small gap between the training and testing loss curves. Meeting these two criteria indicates a good fit for the model.

The predictions of flow rates for pumps for the testing sets with the trained model were validated using the results of seepage analysis. Figure 11a–c compares the predicted flow rates from the proposed LSTM model with the values from the seepage analysis for pumps 1, 2, and 3, respectively. The input for both the LSTM model and the seepage analysis is the rainfall data, and the output is the pump’s flow rate. In the seepage analysis, the pump flow rate was obtained based on the predefined pumping policy for each pump, while the proposed LSTM model could learn the hidden pumping policies for each pump within the training dataset. Figure 11a–c supports the idea that the trained model can predict the required pumps’ flow rate for the testing set to keep the water table at the target level in response to the rainfall events. The proposed LSTM model can closely map the input data (i.e., set of rainfall data) to the expected output data (i.e., corresponding pumps’ flow rate) without requiring conducting a seepage analysis with prescribed pumping policies. An evaluation of the predictions, which is presented in Table 5, also confirmed this finding.

From the proposed model, predictions regarding the flow rate of pumps were evaluated using the performance metrics of R², RMSE, and MAE. Regardless of the output’s unit, values of R² close to 1 represent high prediction accuracy. The R² score of the flow rate predictions for pumps 1, 2, and 3 are 0.958, 0.962, and 0.954, respectively. The achieved high values of the R² score indicate how well the proposed LSTM model fits the observed data from the seepage analysis.

Unlike the R², RMSE and MAE carry the same unit as the predicted values for the pump flow rates in this study. The RMSE is the square root of the average squared differences between the predicted values from the proposed LSTM model and the actual values from the seepage analysis. The RMSE of the flow rate predictions for pumps 1, 2, and 3 are 0.007, 0.021, and 0.014, respectively. The RMSE values close to 0 represent a better prediction. Also, taking the range of a pump’s flow rate into consideration can help interpret the RMSE values. The range of observed flow rates of pumps 1, 2, and 3 from the seepage analysis in the testing set is 0–0.24 GPM, 0–0.65 GPM, and 0–0.37 GPM, respectively. Thus, the RMSE of the predicted flow rates for the three pumps is relatively small compared with the range of changes in the flow rates of the pumps.

The MAE is the average of differences between the predicted values from the proposed LSTM model and the actual values from the seepage analysis. The MAE of the flow rate predictions for pumps 1, 2, and 3 are 0.003, 0.009, and 0.006, respectively. Values of MAE are very small, which means the model can predict the pump’s flow rate with high accuracy. The evaluation of the predictions using various measurements verifies the promising performance of the proposed LSTM model in learning the pumping policy and predicting a pump’s flow rate.

4.3. Discussion 1: Application of the LSTM Model in Controlling the Groundwater

This subsection discusses different approaches for employing the proposed LSTM model in controlling groundwater level as an intervention method for the geotechnical CPSs studied in this paper. For this purpose, the predicted flow rates from the LSTM model were applied to the pumps in the slope to control the groundwater table. A seepage analysis was then conducted to obtain the groundwater levels at the three points of P₁, P₂, and P₃ during the rainfall events.

The pump flow rates for the 12 events in the testing set can be continuously forecasted using the LSTM model trained with 36 events. In the method discussed in previous sections, any error present in the initial prediction will be carried forward to subsequent predictions. While this method is flexible and easy to implement, the accuracy of predictions will be reduced in later predictions. To reduce accumulated errors, pump flow rates for each rainfall event can be predicted independently. In this method, a separate LSTM model with the optimal architecture, introduced in Section 4.1, was created to forecast the pump flow rate for each event. After forecasting the pump flow rate for one event, the actual data for that event was added to the training set to reduce the accumulated error for the flow rate predictions of the next rainfall event. By moving forward, the size of the training set increased, which improved the model’s performance on the next event’s predictions.

There are two options for applying the predicted pump flow rate in the seepage analysis to control the groundwater. One option is to reset the groundwater table to the initial condition at the beginning of each rainfall event. While this option can help evaluate the pump flow rate for each rainfall event separately, it cannot represent the real-world application. Alternatively, the groundwater level reset could be omitted during the 12 rainfall events to simulate reality.

Based on the above descriptions, three approaches were investigated for employing the proposed LSTM model by combining the available options for predicting the flow rates of pumps and applying them in the seepage analysis to control the groundwater level. In the first approach, pump flow rates for 12 rainfall events in the testing set were continuously predicted using the LSTM model trained with 36 events. Then, the groundwater table was reset to the initial condition at the beginning of each event to apply the pump flow rate in the seepage analysis. In the second approach, pump flow rates were predicted similar to the first approach. Then, the predicted flow rates were continuously implemented in the seepage analysis without resetting the groundwater condition at the beginning of each event. In the third approach, the training set for the LSTM model was recalibrated after forecasting the pump flow rate for each rainfall event to reduce the accumulated error in predictions. Then, the predicted values were continuously applied to the pumps during the rainfall events to control groundwater level.

Figure 12a–c displays the groundwater level from the three approaches at points of P₁, P₂, and P₃, respectively. To evaluate the application of the LSTM model, the groundwater levels from the three approaches were compared with the groundwater levels obtained with the prescribed pumping policies corresponding to the rainfall events at the points mentioned above.

A comparison of the groundwater levels using the prescribed pumping policies and approach 1 for employing the LSTM model shows that approach 1 has a discontinuity in control of the water table between the rainfall events. Despite positive results during the rainfall events at the three points of P₁, P₂, and P₃, this approach cannot represent real conditions since the groundwater level was reset to the initial condition at the beginning of each event. An advantage of this approach is that the predictions for each event can be independently evaluated from the other events.

Another comparison of the groundwater levels using the prescribed pumping policies and approach 2 for employing the LSTM model provides insight into the accumulated error in the predictions. While this approach is easy to implement, the accuracy of results will be reduced in time. To improve the accuracy of results, approach 3 was suggested for employing the LSTM model. The results for approach 3 demonstrate this approach can boost the performance of the LSTM model in controlling the groundwater level at each of the three points (P₁, P₂, and P₃).

To better evaluate the proposed three approaches for employing the LSTM model, the differences between the groundwater level from each approach and the prescribed pumping policies are shown in Figure 13. The groundwater level using the prescribed pumping policies performs as a benchmark for evaluation. Figure 13a–c displays the differences at points P₁, P₂, and P₃, respectively.

The maximum difference from the benchmark for approach 1 at points of P₁, P₂, and P₃ are 13.3%, 16.0%, and 17.0%, respectively. This approach shows a difference of less than 10% in the groundwater level for most of the rainfall events. The average difference of the results for approach 1 from the benchmark is approximately 4.6% at all of the three points.

Approach 2 shows greater differences compared with the other two approaches. The differences from the benchmark for approach 2 at points P₁, P₂, and P₃ increase up to 34.4%, 37.5%, and 40.7%, respectively. The average difference of the results for this approach from the benchmark is approximately 12.7% at all of the three points. The results for approach 3 demonstrate noticeably lower values of differences from the benchmark. The average difference of the results for this approach from the benchmark is approximately 3.5% at all of the three points.

The above discussions conclude the LSTM model has an indisputable impact on controlling the groundwater table in the slope. The evaluation of the groundwater level from different approaches of applying the predictions indicates that the model’s performance may decrease during long-sequential rainfall events. Thus, it is worthwhile to evaluate the predicted pump flow rates in controlling the groundwater table, in addition to the model evaluation metrics such as R², RMSE, and MAE.

4.4. Discussion 2: Influence of Rainfall Patterns

This subsection is intended to evaluate the generalizability of the proposed LSTM model. A generalized model means the model is capable of performing well on an unseen dataset. Since a small set of data with limited typical rainfall patterns was used in this study to train the LSTM model, it is beneficial to prove the model can be applied to real-world rainfall events with more complex patterns. Thus, the influence of the rainfall patterns on the accuracy of the model predictions was investigated. For this purpose, two rainfall datasets with different numbers of patterns were considered, as shown in Figure 14a,b. Both datasets include 24 rainfall events with the same variation in rainfall duration and depth. However, dataset 2 comprises more rainfall patterns types than dataset 1. The rainfall events in dataset 1 consist of only two types of patterns (i.e., types “a” and “d”), while the rainfall events in dataset 2 consist of four types of patterns (i.e., types “a”, “b”, “c”, and “d”). Figure 15a,b show the pumps’ flow rates for pumps 1, 2, and 3 corresponding to the rainfall data generated in Figure 14a,b, respectively.

An LSTM model was trained with the first 12 rainfall events and the corresponding pump flow rates for both datasets. Then, the model was validated with the next 12 rainfall events and their corresponding pump flow rates. The rainfall events in the testing set have the same type of pattern (“d”) in datasets 1 and 2.

Table 6 presents the evaluation results of the model trained with datasets 1 and 2. The evaluation metrics of RMSE and MAE for dataset 2 are lower than the values for dataset 1 in all three pumps. The R² score can better demonstrate the difference between the accuracy of the pump flow rate predictions for dataset 1 and dataset 2. The R² values for pumps 1, 2, and 3 in dataset 1 were 0.891, 0.922, and 0.910, respectively. By increasing the number of rainfall patterns for training the LSTM model from 1 in dataset 1 to 3 in dataset 2, the R² values for pumps 1, 2, and 3 were improved to 0.913, 0.935, and 0.925.

Additionally, comparing the comparison of the evaluation metrics for dataset 2 in Table 6 with the corresponding values in Table 5, it was found that the model trained with 36 rainfall events including the same types of patterns as dataset 2 has a better performance in predicting the pump’s flow rate. This means that the model trained with a dataset consisting of a larger number of rainfall events and pattern types improves the model’s performance in the prediction of the pump flow rates for a given new type of pattern.

4.5. Discussion 3: Limitation and Applicability of the Proposed Model

This subsection discusses the main limitations and application of the proposed LSTM model. They can be summarized as follows.

Field conditions including more complex rainfall patterns and the pondering effect of precipitations were excluded from the numerical simulation for the data generation. Therefore, future studies can include actual measured rainfall data to improve the performance of the proposed LSTM model in capturing more complicated patterns.
Ideal pump behaviors were assumed in the numerical simulation of the model, though the pump performance may vary depending on field conditions. Further experimental studies using the lab-scale geosystem described in Section 2.2.2 can help understand such limitations.
This study employs a numerical simulation of a lab-scale geosystem for generating flow rate data to focus on the proof of the concept. With the same concept and framework, more complicated cases, such as field measurements from real-world slopes, can also be employed to train LSTM, as long as the core physics underlying the transient seepage analysis remains the same. Additional calibration and data processing may be required before applying the proposed model to data from field measurements.

5. Conclusions

This study is the first to investigate the ability of the LSTM deep learning algorithm to learn water pumping policy and the dependencies between the rain intensity and corresponding pump flow rates to establish a geotechnical CPS for rainfall-induced landslide prevention. The data for rainfall intensity was generated by a combination of different durations, depths, and patterns. The corresponding pump flow rate data was obtained by conducting the transient seepage analysis to satisfy desired drainage policies during rainfall events. The proposed LSTM model successfully predicted the multi-step ahead of the pump flow rate based on the rainfall intensity data given to the model as input. The major findings of this study are as follows:

Evaluation metrics of RMSE, MAE, and R² showed a promising performance of the proposed LSTM model in predicting pump flow rates and learning the prescribed pumping policies. The R² values of 0.958, 0.962, and 0.954 for the pump flow rate predictions indicated high accuracy of the results.
An assessment of the groundwater table after applying the pump flow rates demonstrated the model’s performance could drop during long-sequential rainfall events. To avoid accumulated error in long-sequential rainfall events, it is suggested to use a separate LSTM model to predict the corresponding pump’s flow rate for each rainfall event in the testing set.
An evaluation of the influence of the rainfall patterns demonstrated that the performance of the proposed LSTM model can be improved by adding more and new rainfall patterns.
As long as the nature (i.e., physics) underlying the transient seepage analysis is the same, the proposed method can be applied to real measurements of pump flow rates. It is noted that this may require additional calibration and data processing for the raw data.
This study also presents a numerical framework written in Python to perform transient seepage analysis for a geosystem equipped with three pumps and subjected to rainfall events. Two main advantages of this framework are its integration with deep learning algorithms and its flexibility for considering more realistic field conditions.

Future studies can include more complex field conditions and rainfall patterns for generating training data, which can help to improve the performance of the proposed model.

Author Contributions

Conceptualization and methodology, Z.L. and A.B.; software and validation, A.B. and B.A.; writing—original draft preparation, A.B.; writing—review and editing, Y.S. and Z.L.; supervision and project administration, Z.L. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the National Science Foundation Grant No. 1742656 from the Geotechnical Engineering and Materials Program (now part of CMMI ECI). This work also used the Extreme Science and Engineering Discovery Environment (XSEDE), which is supported by the National Science Foundation grant number ACI-1548562.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Data will be available upon request.

Conflicts of Interest

The authors declare no conflict of interest.

References

Salvati, P.; Petrucci, O.; Rossi, M.; Bianchi, C.; Pasqua, A.A.; Guzzetti, F. Gender, age and circumstances analysis of flood and landslide fatalities in Italy. Sci. Total Environ. 2018, 610, 867–879. [Google Scholar] [CrossRef] [PubMed]
Schuster, R.L.; Highland, L. Socioeconomic and Environmental Impacts of Landslides in the Western Hemisphere; United States Geological Survey: Reston, VA, USA, 2001.
Azarafza, M.; Ghazifard, A.; Akgün, H.; Asghari-Kaljahi, E. Landslide susceptibility assessment of South Pars Special Zone, southwest Iran. Environ. Earth Sci. 2018, 77, 1–29. [Google Scholar] [CrossRef]
Ahmad, H.; Ningsheng, C.; Rahman, M.; Islam, M.M.; Pourghasemi, H.R.; Hussain, S.F.; Habumugisha, J.M.; Liu, E.; Zheng, H.; Ni, H. Geohazards Susceptibility Assessment along the Upper Indus Basin Using Four Machine Learning and Statistical Models. ISPRS Int. J. Geo-Inf. 2021, 10, 315. [Google Scholar] [CrossRef]
Chen, W.; Chen, X.; Peng, J.; Panahi, M.; Lee, S. Landslide susceptibility modeling based on ANFIS with teaching-learning-based optimization and Satin bowerbird optimizer. Geosci. Front. 2021, 12, 93–107. [Google Scholar] [CrossRef]
Nanehkaran, Y.A.; Mao, Y.; Azarafza, M.; Kockar, M.K.; Zhu, H.-H. Fuzzy-based multiple decision method for landslide susceptibility and hazard assessment: A case study of Tabriz, Iran. Geomech. Eng. 2021, 24, 407–418. [Google Scholar]
Guzzetti, F.; Reichenbach, P.; Ardizzone, F.; Cardinali, M.; Galli, M. Estimating the quality of landslide susceptibility models. Geomorphology 2006, 81, 166–184. [Google Scholar] [CrossRef]
Conte, E.; Pugliese, L.; Troncone, A. Post-failure analysis of the Maierato landslide using the material point method. Eng. Geol. 2020, 277, 105788. [Google Scholar] [CrossRef]
Kargar, P.; Osouli, A.; Stark, T.D. 3D analysis of 2014 Oso landslide. Eng. Geol. 2021, 287, 106100. [Google Scholar] [CrossRef]
Yang, H.; Yang, T.; Zhang, S.; Zhao, F.; Hu, K.; Jiang, Y. Rainfall-induced landslides and debris flows in Mengdong Town, Yunnan Province, China. Landslides 2020, 17, 931–941. [Google Scholar] [CrossRef]
Uyeturk, C.E.; Huvaj, N.; Bayraktaroglu, H.; Huseyinpasaoglu, M. Geotechnical characteristics of residual soils in rainfall-triggered landslides in Rize, Turkey. Eng. Geol. 2020, 264, 105318. [Google Scholar] [CrossRef]
Kirschbaum, D.; Kapnick, S.; Stanley, T.; Pascale, S. Changes in extreme precipitation and landslides over High Mountain Asia. Geophys. Res. Lett. 2020, 47, e2019GL085347. [Google Scholar] [CrossRef]
Jakob, M.; Lambert, S. Climate change effects on landslides along the southwest coast of British Columbia. Geomorphology 2009, 107, 275–284. [Google Scholar] [CrossRef]
Kristo, C.; Rahardjo, H.; Satyanaga, A. Effect of variations in rainfall intensity on slope stability in Singapore. Int. Soil Water Conserv. Res. 2017, 5, 258–264. [Google Scholar] [CrossRef]
Pham, B.T.; Jaafari, A.; Nguyen-Thoi, T.; Van Phong, T.; Nguyen, H.D.; Satyam, N.; Masroor, M.; Rehman, S.; Sajjad, H.; Sahana, M. Ensemble machine learning models based on Reduced Error Pruning Tree for prediction of rainfall-induced landslides. Int. J. Digit. Earth 2021, 14, 575–596. [Google Scholar] [CrossRef]
Sun, D.-M.; Zang, Y.-G.; Semprich, S. Effects of airflow induced by rainfall infiltration on unsaturated soil slope stability. Transp. Porous Media 2015, 107, 821–841. [Google Scholar] [CrossRef]
Cho, S.E. Stability analysis of unsaturated soil slopes considering water-air flow caused by rainfall infiltration. Eng. Geol. 2016, 211, 184–197. [Google Scholar] [CrossRef]
Alsubal, S.; Sapari, N.; Harahap, S. The Rise of groundwater due to rainfall and the control of landslide by zero-energy groundwater withdrawal system. Int. J. Eng. Technol. 2018, 7, 921–926. [Google Scholar] [CrossRef] [Green Version]
Su, Z.; Wang, G.; Wang, Y.; Luo, X.; Zhang, H. Numerical simulation of dynamic catastrophe of slope instability in three Gorges reservoir area based on FEM and SPH method. Nat. Hazards 2021, 1–16. [Google Scholar] [CrossRef]
Ng, C.W.W.; Shi, Q. A numerical investigation of the stability of unsaturated soil slopes subjected to transient seepage. Comput. Geotech. 1998, 22, 1–28. [Google Scholar] [CrossRef]
Wang, J.-g.; Liang, B. Affection of rainfall factor to seepage and stability of loess slope. J. Water Resour. Water Eng. 2010, 21, 42–45. [Google Scholar]
Merzdorf, J. Climate Change Could Trigger More Landslides in High Mountain Asia; NASA’s Goddard Space Flight Center: Greenbelt, MD, USA, 2020.
Nicholson, P.G. Soil Improvement and Ground Modification Methods; Butterworth-Heinemann: Oxford, UK, 2014. [Google Scholar]
Turner, A.K.; Schuster, R.L. Landslides: Investigation and Mitigation; Special Report 247; Transportation Research Board national academy Press: Washington, DC, USA, 1996. [Google Scholar]
Dai, F.; Lee, C.; Ngai, Y.Y. Landslide risk assessment and management: An overview. Eng. Geol. 2002, 64, 65–87. [Google Scholar] [CrossRef]
Urciuoli, G.; Pirone, M. Subsurface drainage for slope stabilization. In Landslide Science and Practice; Springer: Berlin/Heidelberg, Germany, 2013; pp. 577–585. [Google Scholar]
Holtz, R.D.; Schuster, R.L. Landslides: Investigation and Mitigation. Transp. Res. Board Spec. Rep. 1996, 247, 439–473. [Google Scholar]
Cashman, P.M.; Preene, M. Groundwater Lowering in Construction: A Practical Guide; CRC Press: Boca Raton, FL, USA, 2001. [Google Scholar]
Olcese, A.; Vescovo, C.; Boni, S.; Giusti, G. Stabilisation of a landslide with submerged motor-driven pumps. In Proceedings of Slope stability engineering developments and applications. In Proceedings of the International Conference on Slope Stability, Isle of Wight, UK, 15–18 April 1991; pp. 321–326. [Google Scholar]
Forrester, K. Subsurface Drainage for Slope Stabilization; ASCE Press: Reston, VA, USA, 2001. [Google Scholar]
Mitchell, R.J.; Madsen, J.D.; Crawford, T.W. Hydraulic stabilization of earth structures. Can. Geotech. J. 1984, 21, 116–124. [Google Scholar] [CrossRef]
Woodward, J. An Introduction to Geotechnical Processes; CRC Press: Boca Raton, FL, USA, 2005. [Google Scholar]
Biniyaz, A.; Azmoon, B.; Liu, Z. Coupled transient saturated–unsaturated seepage and limit equilibrium analysis for slopes: Influence of rapid water level changes. Acta Geotech. 2021, 1–18. [Google Scholar] [CrossRef]
Wartalska, K.; Kaźmierczak, B.; Nowakowska, M.; Kotowski, A. Analysis of hyetographs for drainage system modeling. Water 2020, 12, 149. [Google Scholar] [CrossRef] [Green Version]
Chen, Y.-L.; Liu, G.-Y.; Li, N.; Du, X.; Wang, S.-R.; Azzam, R. Stability evaluation of slope subjected to seismic effect combined with consequent rainfall. Eng. Geol. 2020, 266, 105461. [Google Scholar] [CrossRef]
Liu, Z.L. Multiphysics in Porous Materials. In Multiphysics in Porous Materials; Springer: Berlin/Heidelberg, Germany, 2018; pp. 29–34. [Google Scholar]
van Genuchten, M.T. A closed-form equation for predicting the hydraulic conductivity of unsaturated soils 1. Soil Sci. Soc. Am. J. 1980, 44, 892–898. [Google Scholar] [CrossRef] [Green Version]
Sethi, R.; Di Molfetta, A. Groundwater Engineering: A Technical Approach to Hydrogeology, Contaminant Transport and Groundwater Remediation; Springer International Publishing: Cham, Switzerland, 2019. [Google Scholar]
Xu, X.; He, H.; Zhao, D.; Sun, S.; Busoniu, L.; Yang, S.X. Machine Learning with Applications to Autonomous Systems; Hindawi: London, UK, 2015. [Google Scholar]
Wei, Z.-L.; Lü, Q.; Sun, H.-y.; Shang, Y.-Q. Estimating the rainfall threshold of a deep-seated landslide by integrating models for predicting the groundwater level and stability analysis of the slope. Eng. Geol. 2019, 253, 14–26. [Google Scholar] [CrossRef]
Hochreiter, S.; Schmidhuber, J. Long short-term memory. Neural Comput. 1997, 9, 1735–1780. [Google Scholar] [CrossRef]
Tian, Y.; Zhang, K.; Li, J.; Lin, X.; Yang, B. LSTM-based traffic flow prediction with missing data. Neurocomputing 2018, 318, 297–305. [Google Scholar] [CrossRef]
Duan, Y.; Lv, Y.; Wang, F.-Y. Travel time prediction with LSTM neural network. In Proceedings of the 2016 IEEE 19th International Conference on Intelligent Transportation Systems (ITSC), Rio de Janeiro, Brazil, 1–4 November 2016; pp. 1053–1058. [Google Scholar]
Zhang, J.; Zhu, Y.; Zhang, X.; Ye, M.; Yang, J. Developing a Long Short-Term Memory (LSTM) based model for predicting water table depth in agricultural areas. J. Hydrol. 2018, 561, 918–929. [Google Scholar] [CrossRef]
Le, X.-H.; Ho, H.V.; Lee, G.; Jung, S. Application of long short-term memory (LSTM) neural network for flood forecasting. Water 2019, 11, 1387. [Google Scholar] [CrossRef] [Green Version]
Kaneko, R.; Nakayoshi, M.; Onomura, S. Rainfall Prediction by a Recurrent Neural Network Algorithm LSTM Learning Surface Observation Data. AGU Fall Meet. Abstr. 2019, 2019, GC43D-1354. [Google Scholar]
Zhang, D.; Holland, E.S.; Lindholm, G.; Ratnaweera, H. Enhancing operation of a sewage pumping station for inter catchment wastewater transfer by using deep learning and hydraulic model. arXiv 2018, arXiv:1811.06367. [Google Scholar]
Hu, Y.; Yan, L.; Hang, T.; Feng, J. Stream-Flow Forecasting of Small Rivers Based on LSTM. arXiv 2020, arXiv:2001.05681. [Google Scholar]
Xie, P.; Zhou, A.; Chai, B. The application of long short-term memory (LSTM) method on displacement prediction of multifactor-induced landslides. IEEE Access 2019, 7, 54305–54311. [Google Scholar] [CrossRef]
Yunpeng, L.; Di, H.; Junpeng, B.; Yong, Q. Multi-step ahead time series forecasting for different data patterns based on LSTM recurrent neural network. In Proceedings of the 2017 14th Web Information Systems and Applications Conference (WISA), Liuzhou, China, 11–12 November 2017; pp. 305–310. [Google Scholar]
Crivellari, A.; Beinat, E. LSTM-based deep learning model for predicting individual mobility traces of short-term foreign tourists. Sustainability 2020, 12, 349. [Google Scholar] [CrossRef] [Green Version]
Chung, J.; Gulcehre, C.; Cho, K.; Bengio, Y. Empirical evaluation of gated recurrent neural networks on sequence modeling. arXiv 2014, arXiv:1412.3555. [Google Scholar]
LeCun, Y.; Bengio, Y.; Hinton, G. Deep learning. Nature 2015, 521, 436–444. [Google Scholar] [CrossRef]
Ismail, A.A.; Wood, T.; Bravo, H.C. Improving Long-Horizon Forecasts with Expectation-Biased LSTM Networks. arXiv 2018, arXiv:1804.06776. [Google Scholar]
Nguyen, Q.H.; Ly, H.-B.; Ho, L.S.; Al-Ansari, N.; Le, H.V.; Tran, V.Q.; Prakash, I.; Pham, B.T. Influence of data splitting on performance of machine learning models in prediction of shear strength of soil. Math. Probl. Eng. 2021, 2021. [Google Scholar] [CrossRef]
Bui, D.T.; Pradhan, B.; Lofman, O.; Revhaug, I.; Dick, O.B. Landslide susceptibility mapping at Hoa Binh province (Vietnam) using an adaptive neuro-fuzzy inference system and GIS. Comput. Geosci. 2012, 45, 199–211. [Google Scholar]
Vasu, N.N.; Lee, S.-R. A hybrid feature selection algorithm integrating an extreme learning machine for landslide susceptibility modeling of Mt. Woomyeon, South Korea. Geomorphology 2016, 263, 50–70. [Google Scholar] [CrossRef]
Jin, J.; Li, M.; Jin, L. Data normalization to accelerate training for linear neural net to predict tropical cyclone tracks. Math. Probl. Eng. 2015, 2015. [Google Scholar] [CrossRef] [Green Version]
Patro, S.; Sahu, K.K. Normalization: A preprocessing stage. arXiv 2015, arXiv:1503.06462. [Google Scholar] [CrossRef]
Chai, T.; Draxler, R.R. Root mean square error (RMSE) or mean absolute error (MAE)?—Arguments against avoiding RMSE in the literature. Geosci. Model Dev. 2014, 7, 1247–1250. [Google Scholar] [CrossRef] [Green Version]
Dong, M.; Wu, H.; Hu, H.; Azzam, R.; Zhang, L.; Zheng, Z.; Gong, X. Deformation Prediction of Unstable Slopes Based on Real-Time Monitoring and DeepAR Model. Sensors 2021, 21, 14. [Google Scholar] [CrossRef]
Zhang, D. A coefficient of determination for generalized linear models. Am. Stat. 2017, 71, 310–316. [Google Scholar] [CrossRef]

Figure 1. Typical rainfall intensity distributions with (a) constant intensity over time, (b) maximum intensity at the beginning of precipitation, (c) maximum intensity in the middle of precipitation, and (d) maximum intensity at the end of precipitation.

Figure 2. Cross-section of soil slope equipped with pumps subjected to rainfall (Units: Inch).

Figure 3. SWCC and relative hydraulic conductivity

K_{r}

, of the soil.

Figure 3. SWCC and relative hydraulic conductivity

K_{r}

, of the soil.

Figure 4. Flowchart of the written python code using DOLFIN package to solve the governing equation for transient seepage analysis.

Figure 5. Pump flow rates as the output of the seepage analysis in a typical generated rainfall event (duration = 15 min, rainfall depth = 15 mm, and pattern “d”).

Figure 6. Generated time series of rainfall.

Figure 7. Time series of flow rates for pumps 1, 2, and 3 corresponding to the generated rainfall data.

Figure 8. The general structure of the recurrent neural network (RNN).

Figure 9. The structure of the LSTM unit.

Figure 10. Loss of the training and testing sets for the optimal LSTM architecture.

Figure 11. Comparison of pumps’ flow rate from the seepage analysis and predicted values using the LSTM model for (a) pump 1, (b) pump 2, and (c) pump 3.

Figure 12. Comparison of the groundwater level using the three approaches of applying the proposed LSTM model and the predefined pumping policies at (a) point P₁, (b) point P₂, and (c) point P₃.

Figure 13. Differences of groundwater levels using the three approaches of employing the LSTM model from the groundwater level using the prescribed pumping policies (benchmark) at (a) point P₁, (b) point P₂, and (c) point P₃.

Figure 14. Two rainfall datasets with the different number of patterns (a) dataset 1 consists of rainfall events with pattern types of “a” and “d” and (b) dataset 2 consists of rainfall events with pattern types of “a”, “b”, “c”, and “d”.

Figure 15. Time series of flow rate for pumps 1, 2, and 3 corresponding to the generated rainfall data in (a) dataset 1 shown in Figure 14a and (b) dataset 2 shown in Figure 14b.

Table 1. Soil properties for the transient saturated-unsaturated flow model.

Model Input	Definition	Sand
$K_{s}$	Saturated hydraulic conductivity (m/s)	$3 \times 10^{- 4}$
$S_{s}$	Saturated specific storage (1/m)	$1 \times 10^{- 4}$
$a$	Empirical parameter	0.6
$b$	Empirical parameter	0.5
$P_{0}$	Empirical parameter (Pa)	1200
$n$	Porosity	0.32

Table 2. Selected candidate values for hyperparameters of the LSTM model.

Hyperparameter	Candidate Values
Number of LSTM layers	1, 2, 3
LSTM unit sizes	100, 50, 30, 20
Number of dense layers	1, 2
Dense unit sizes	500, 363

Table 3. Experimented LSTM models with selected hyperparameters.

Number of LSTM Layers	Number of Units for LSTM Layers	Number of Fully Connected Layers (Dense)	Number of Units for Dense Layers	Batch Size	Epochs	Optimization Algorithm, Learning Rate	RMSE (GPM)	R²
1	100	1	363	50	300	Adam, 0.001	0.017	0.954
1	50	1	363	50	300	Adam, 0.001	0.015	0.962
1	30	1	363	50	300	Adam, 0.001	0.018	0.946
1	20	1	363	50	300	Adam, 0.001	0.018	0.945
1	50	2	500, 363	50	300	Adam, 0.001	0.017	0.952
2	50, 50	1	363	50	300	Adam, 0.001	0.018	0.950
2	50, 50	2	500, 363	50	300	Adam, 0.001	0.018	0.946
3	50, 50, 50	1	363	50	300	Adam, 0.001	0.019	0.943
3	50, 50, 50	2	500, 363	50	300	Adam, 0.001	0.019	0.943

Table 4. Proposed architecture for the optimal LSTM model.

Model Hyperparameters	Optimal Value
Number of the LSTM layers	1
LSTM unit size	50
Number of the dense layers	1
Dense unit size	363
Epochs	300
Batch size	50
Optimization algorithm, learning rate	Adam, 0.001

Table 5. Performance metrics of the proposed model on the testing set.

Predictions	RMSE (GPM)	MAE (GPM)	R²
Pump 1	0.007	0.003	0.958
Pump 2	0.021	0.009	0.962
Pump 3	0.014	0.006	0.954

Table 6. Evaluation of the model trained with dataset 1 and dataset 2.

Predictions	RMSE (GPM)		MAE (GPM)		R²
	Dataset 1	Dataset 2	Dataset 1	Dataset 2	Dataset 1	Dataset 2
Pump 1	0.013	0.012	0.005	0.005	0.891	0.913
Pump 2	0.033	0.030	0.014	0.012	0.922	0.935
Pump 3	0.020	0.018	0.008	0.007	0.910	0.925

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Biniyaz, A.; Azmoon, B.; Sun, Y.; Liu, Z. Long Short-Term Memory Based Subsurface Drainage Control for Rainfall-Induced Landslide Prevention. Geosciences 2022, 12, 64. https://doi.org/10.3390/geosciences12020064

AMA Style

Biniyaz A, Azmoon B, Sun Y, Liu Z. Long Short-Term Memory Based Subsurface Drainage Control for Rainfall-Induced Landslide Prevention. Geosciences. 2022; 12(2):64. https://doi.org/10.3390/geosciences12020064

Chicago/Turabian Style

Biniyaz, Aynaz, Behnam Azmoon, Ye Sun, and Zhen Liu. 2022. "Long Short-Term Memory Based Subsurface Drainage Control for Rainfall-Induced Landslide Prevention" Geosciences 12, no. 2: 64. https://doi.org/10.3390/geosciences12020064

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Long Short-Term Memory Based Subsurface Drainage Control for Rainfall-Induced Landslide Prevention

Abstract

1. Introduction

2. Data Acquisition

2.1. Generation of Rainfall Data

2.2. Pump Flow Rate Data Acquired from Transient Seepage Analysis

2.2.1. Governing Equation for the Transient Seepage Model

2.2.2. Geometry and Boundary Conditions

2.2.3. Soil Properties

2.2.4. Numerical Implementation

3. Methodology

3.1. Background: Long Short-Term Memory

3.2. Proposed LSTM Model

3.3. Data Preprocessing

3.3.1. Scaling

3.3.2. Transform the Dataset into a Supervised Learning Problem

3.4. Model Evaluation Metrics

4. Results and Discussions

4.1. Search for Optimal LSTM Architecture and Hyperparameters

4.2. Training and Testing with the Optimal LSTM Architecture

4.3. Discussion 1: Application of the LSTM Model in Controlling the Groundwater

4.4. Discussion 2: Influence of Rainfall Patterns

4.5. Discussion 3: Limitation and Applicability of the Proposed Model

5. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI