Prediction of Dam Deformation Using SSA-LSTM Model Based on Empirical Mode Decomposition Method and Wavelet Threshold Noise Reduction

Zhang, Caiyi; Fu, Shuyan; Ou, Bin; Liu, Zhenyu; Hu, Mengfan

doi:10.3390/w14213380

Open AccessArticle

Prediction of Dam Deformation Using SSA-LSTM Model Based on Empirical Mode Decomposition Method and Wavelet Threshold Noise Reduction

by

Caiyi Zhang

,

Shuyan Fu

,

Bin Ou

^*,

Zhenyu Liu

and

Mengfan Hu

College of Water Conservancy, Yunnan Agricultural University, Kunming 650201, China

^*

Author to whom correspondence should be addressed.

Water 2022, 14(21), 3380; https://doi.org/10.3390/w14213380

Submission received: 30 September 2022 / Revised: 19 October 2022 / Accepted: 21 October 2022 / Published: 25 October 2022

(This article belongs to the Special Issue Safety Monitoring and Management of Reservoir and Dams)

Download

Browse Figures

Versions Notes

Abstract

:

The deformation monitoring information of concrete dams contains some high-frequency components, and the high-frequency components are strongly nonlinear, which reduces the accuracy of dam deformation prediction. In order to solve such problems, this paper proposes a concrete dam deformation monitoring model based on empirical mode decomposition (EMD) combined with wavelet threshold noise reduction and sparrow search algorithm (SSA) optimization of long short-term memory network (LSTM). The model uses EMD combined with wavelet threshold to decompose and denoise the measured deformation data. On this basis, the LSTM model based on SSA optimization is used to mine the nonlinear function relationship between the reconstructed monitoring data and various influencing factors. The engineering example is analyzed and compared with the prediction results of LSTM model and PSO-SVM model. The results show that the mean absolute error (MAE) and root mean square error (RMSE) of the model are 0.05345 and 0.06358, with the complex correlation coefficient R² of 0.9533 being closer to 1 and a better fit than the other two models. This can effectively mine the relationship in the measured deformation data, and reduce the influence of high-frequency components on the dam prediction accuracy.

Keywords:

concrete dams; prediction model; empirical modal decomposition method; wavelet threshold; sparrow search algorithm; long short-term memory

1. Introduction

China is a large country of water conservancy construction, with the number and scale of existing dams among the world’s leading. To ensure the safe operation of dams is of great significance to maintain the safety of life and property of the public and regional stability, and a dam failure would cause huge losses. For example, the ‘9–8’ dam collapse in Xianfen County, Shanxi in 2008 caused 277 deaths, 4 missing persons, 33 injuries, and direct economic losses of RMB 96.19 million, while in 2018, the Nakuru County dam in Kenya collapsed, killing 48 people. Deformation is an important monitoring quantity that reflects the comprehensive safety state of a dam. The construction of a high-precision deformation monitoring model can represent the evolution of the structural properties of a dam, quantitatively interpret the role of the main influencing factors and predict the operation of the dam, and evaluate the dam properties accordingly [1,2,3].

The dam deformation monitoring model can be divided into statistical model, deterministic model, and hybrid models depending on the modeling approach [4,5]. As statistical models are easy to implement, they are widely used, with the development of computer technology such as gray theory [5,6], neural network model [7,8,9], support vector machine model [10,11,12], random forest theory [13,14], and many other methods which are widely used in dam deformation monitoring model [15,16], which greatly improves the calculation speed and prediction accuracy of the monitoring model. However, these models have certain shortcomings. The random forest model has uncertainty in the empirical selection of parameters and runs relatively slowly, which can lead to situations such as poor classification in small data processing [17], while neural network models are prone to overfitting and local optima when running. SVM models are suitable for problems such as small samples and nonlinearities, but the prediction performance of SVM models is strongly influenced by the selection of kernel parameters [18]. In order to make up for the shortcomings of these models, relevant personnel have proposed deep learning methods (such as CNN [19], RNN [20], DBN [21]) and applied them to dam deformation monitoring. By studying the intrinsic laws and representation levels of monitoring data, the model can improve the prediction accuracy of complex nonlinear problems to a certain extent. Among them, the recurrent neural network is one of the more commonly used deep learning methods. The recurrent neural network (RNN) is designed based on the recursive nature of sequence data and is a feedback type of neural network. However, due to the disappearance of the gradient of RNN, the time series cannot exist for too long, so it can only have short-term memory and cannot support long-term memory, so the relevant personnel proposed the long short-term memory network (LSTM) [22,23,24].

Long short-term memory networks (LSTMs) are derivatives that solve the problem of RNN gradient decay. The LSTM network combines short-term and long-term memory with gating, which solves the gradient decay problem to a certain extent. Ou Bin et al. [25] proposed a concrete dam deformation prediction model based on LSTM and verified that the model has good prediction accuracy and iteration rate in practical engineering. Wang et al. [26] proposed a prediction model combining ALO-LSTM and characteristic attention mechanism based on the existing earth dam seepage pressure prediction model, and quantitatively analyzed the degree of influence of each influence factor in the seepage pressure effect quantity. Dasan Yang et al. [27] proposed an attention mechanism-based LSTM concrete dam deformation prediction method with Adma optimization algorithm to improve the learning accuracy and speed of LSTM, and verified the feasibility of the model in practical engineering. Liu et al. [28] proposed a coupled long-term displacement prediction model for arch dams based on long and short memory networks. Principal component analysis (PCA) and moving average (MA) methods were used to reduce the dimensionality of the input variables, which are combined with LSTM to realize two coupled prediction models, LSTM-PCA and LSTM-MA, respectively. Affected by the measurement accuracy of the instrument and the inherent noise associated with the information acquisition module of the monitoring system, it is inevitable that there are certain random errors in the monitoring data that cannot be explained by environmental factors. In order to maximally eliminate the interference of unfavorable factors such as inherent noise and random errors, wavelet analysis [29,30], singular value decomposition [31], variational mode decomposition (VMD) [32,33], empirical mode decomposition (EMD) [34,35], etc., have been applied to the noise reduction of dam deformation monitoring data.

In view of the adverse effect of noise components in the measured deformation data on the modeling accuracy, this paper proposes a signal denoising method that combines empirical mode decomposition (EMD) and wavelet threshold method to decompose and reconstruct the data to make the unstable dam. The monitoring data are stabilized to reduce the influence of noise in the measured deformation. In addition, in order to avoid local optima when the model algorithm performs deformation prediction, the Sparrow Search Algorithm (SSA) is used to optimize long- and short-term memory networks (LSTM), perform parameter optimization, and a concrete dam deformation prediction model based on the Sparrow Search Algorithm which optimized LSTM is constructed. When applied to a certain engineering practice, the prediction accuracy and calculation speed of the model were analyzed and compared.

2. Selection of Statistical Models for Dam Deformation Prediction

Dam deformation is the displacement vector sum of the plastic and elastic deformation of the concrete dam and bedrock under load. The displacement vector generated in the dam body can be decomposed into water pressure component

δ_{H}

, temperature component

δ_{T}

, and time-dependent component

δ_{θ}

δ = δ_{H} + δ_{T} + δ_{θ}

(1)

The factor selection of the water pressure component

δ_{H}

, for the reason that the load of water pressure

p_{c}

is nonlinear changes. However, the

p_{c}

is with a curvilinear relationship of

H

, from which it can be deduced that there is a relationship between

δ_{H}

and

H

, which is linear with

H, H^{2}, H^{3}, H^{4}

. Moreover, the

δ_{2 H}

and

δ_{2 H}

have the same relationship:

δ_{H} = \sum_{i = 1}^{4} a_{1 i} H^{i}

(2)

where

a_{i}

is structure coefficient;

H

is water depth value before the dam.

According to analysis of the dam deformation monitoring data, temperature is one of the main factors affecting the arch dam. After years of normal operation of a concrete dam, the hydration heat of the poured concrete has been completely dispersed, and the temperature inside the dam has reached a quasi-stable temperature field. At this time, the dam temperature is only influenced by the boundary temperature. Assume that water temperature and air temperature are harmonic motion; deformation and concrete temperature are linear relationship. Therefore, multi-period harmonics are chosen as the factor:

δ_{T} (t) = \sum_{i = 1}^{m_{3}} (b_{1 i} \sin \frac{2 π i t}{365} + b_{2 i} \cos \frac{2 π i t}{365})

(3)

where

m_{3}

is 1 or 2,

i

is the annual cycle,

t

is the cumulative monitoring days.

Time-dependent component

δ_{θ}

, which in general variation law of mathematical expression is a functional relationship when arch dam is of normal operation. For concrete arch dams, it can be considered that the main factor of influencing the time-displacement is the viscous flow of the dam concrete and the dam base rock. The linear combination of

θ

and

\ln θ

can better describe the time-displacement caused by the rheological properties of the arch dam materials:

δ_{θ} = c_{1} θ + c_{2} \ln θ

(4)

In summary, the time-varying forecast model for arch dam deformation can be expressed as:

\begin{array}{l} δ = δ_{H} (t) + δ_{T} (t) + δ_{θ} (t) \\ = \sum_{i = 1}^{4} a_{1 i} H^{i} + \sum_{i = 1}^{m_{3}} (b_{1 i} \sin \frac{2 π i t}{365} + b_{2 i} \cos \frac{2 π i t}{365}) + c_{1} θ + c_{2} \ln θ \end{array}

(5)

3. Wavelet Threshold-EMD-Based Data Noise Reduction Method

3.1. Empirical Mode Decomposition

Empirical Modal Decomposition Method is a noise reduction method for nonlinear and non-stationary data [36]. This method does not require manual setting of parameters. The original data can be quickly decomposed into high-to-low modal components, namely, Intrinsic Mode Functions (IMF) and Residual (Res) [37]. The specific decomposition and reconstruction steps are as follows:

For a given data sequence

x (t)

, select the extreme value points in the data, and use the cubic spline difference function to form the upper and lower envelopes for the maximum point and the minimum point. Calculate the mean value of the upper and lower envelopes

m_{1} (t)

, subtract

m_{1} (t)

from

x (t)

, and get a new time series denoted as

h_{1} (t)

:

h_{1} (t) = x (t) - m_{1} (t)

(6)

However, it is difficult to solve the upper and lower package routes in reality, so it is necessary to use the spline difference function for fitting, and new extreme points will be generated during the fitting process. So it needs to go through the screening cycle until a certain stopping criterion is reached to the end. That is, take

h_{1} (t)

as a new

x (t)

and loop the above operation k times to get

h_{1 k} (t)

which is:

h_{1 k} (t) = h_{1 (k - 1)} (t) - m_{1} (t)

(7)

where

h_{1 (k - 1)} (t)

is the screening result of

k - 1

times; at this time

h_{1 k}

is the first IMF component, denoted as

c_{1} (t)

:

c_{1} (t) = h_{1 k} (t)

(8)

Subtract

c_{1} (t)

from given data

x (t)

to get residual

r_{1} (t)

:

r_{1} (t) = x (t) - c_{1} (t)

(9)

Continue to repeat the above steps until the residual is less than the preset error, or the residual is monotonic. at this point, the EMD decomposition ends which is:

r_{1} (t) = x (t) - c_{1} (t)

(10)

In order to make the frequency and amplitude of the components obtained from the decomposition have a certain practical significance, it can faithfully reflect the volatility characteristics of the original sequence. In the process of decomposition screening, the standard deviation sum of adjacent screening results is 0.3. That is, the

S D

equation is:

S D = \sum_{k = 1}^{T} \frac{{| h_{1 (k - 1)} (t) - h_{1 k} (t) |}^{t}}{h_{1 (k - 1)}^{2} (t)}

(11)

3.2. Wavelet Threshold Noise Reduction

The basic principle of wavelet threshold denoising is that after the signal is transformed by a wavelet, the corresponding wavelet coefficients will be generated; select the appropriate threshold value, keep the wavelet coefficients larger than the threshold value, and remove the wavelet coefficients smaller than the threshold value [38]. The basic steps are as follows: (1) Select a wavelet with the number of layers N for decomposition. (2) Thresholding the decomposition coefficients of each layer. (3) Wavelet reconstruction according to the wavelet coefficients after deactivation. The wavelet threshold affects the selection of key parameters for noise reduction. Selecting appropriate parameters will achieve a good noise reduction effect. The main influencing factors are as follows:

Decomposition layer: When wavelet decomposition is performed on the original data, the higher the number of decomposition layers, the better the noise reduction effect, but the signal is more likely to be distorted. The best result is achieved when the number of layers is chosen to be 3.

Basic wavelet function: For real signals, the basic wavelet selection usually considers factors such as support length, symmetry, trailing moments, smoothness, and similarity. For one-dimensional signals such as audio signals, dB wavelets, and symbol wavelets are usually selected.

Threshold: Threshold selection is more important for wavelet threshold noise reduction. Common threshold selection methods include unbiased risk estimation threshold, minimax threshold, fixed threshold, and heuristic threshold. This paper uses a fixed threshold:

λ = δ \sqrt{2 \log N}

(12)

At present, the commonly used threshold functions mainly include hard threshold function and soft threshold function, and the two threshold functions have their own advantages and disadvantages. Although the hard threshold function can well preserve the edge characteristics of the signal, it will cause a certain degree of distortion in the process of signal processing. The soft threshold function to remove noise after the signal is much smoother. Therefore, the soft threshold function is used in this paper.

Soft threshold function:

{\hat{w}}_{t h r} = {\begin{matrix} \begin{matrix} [sgn (w)] (| w | - t h r) & | w | \geq t h r \end{matrix} \\ \begin{matrix} 0 & | w | < t h r \end{matrix} \end{matrix}

(13)

where

w_{t h r}

is the wavelet coefficient after wavelet transformation;

λ

is the threshold value;

{\hat{w}}_{t h r}

c is the wavelet coefficient denoised by the threshold value.

Considering that the EMD method and the wavelet threshold method have their own advantages and disadvantages in data preprocessing, and the two methods can complement each other, for the high-frequency components decomposed by EMD, the wavelet threshold can be used for noise reduction, which effectively reduces the distortion of the signal. Therefore, a noise reduction method based on EMD combined with wavelet threshold is constructed based on the above principles.

4. The Prediction Model Based on Optimized LSTM Model with Sparrow Search Algorithm

4.1. Sparrow Search Algorithm

Jiankai xue [39] proposed a new intelligent optimization algorithm which is the Sparrow Search Algorithm (SSA) in 2020. This algorithm is mainly inspired by the foraging behavior of sparrows, and has the characteristics of strong merit-seeking ability and fast convergence. The Sparrow Search Algorithm avoids the algorithm falling into local optimum by constructing the corresponding fitness function, which makes the identity of individual sparrows, and the position change dynamically. In the simulation experiment, we need to use virtual sparrows for food hunting, and the population

X

consisting of n sparrows can be expressed in the following form:

X = [\begin{matrix} x_{1, 1} & x_{1, 2} & \dots & x_{1, d} \\ x_{2, 1} & x_{2, 2} & \dots & x_{2, d} \\ ⋮ & ⋮ & ⋮ & ⋮ \\ x_{n, 1} & x_{n, 2} & \dots & x_{n, d} \end{matrix}]

(14)

where

d

is the dimension of the sparrow population, n is the number of sparrows, x is the sparrow individual.

The fitness values of all sparrows can be expressed in the following form:

F_{X} = [\begin{matrix} \begin{matrix} f ([\begin{matrix} x_{1, 1} & x_{1, 2} & \dots & x_{1, d} \end{matrix}]) \\ f ([\begin{matrix} x_{2, 1} & x_{2, 2} & \dots & x_{2, d} \end{matrix}]) \end{matrix} \\ ⋮ \\ ⋮ \\ f ([\begin{matrix} x_{n, 1} & x_{1 n 2} & \dots & x_{n, d} \end{matrix}]) \end{matrix}]

(15)

where

F_{X}

is the adaptability matrix, and

f

is the adaptability value. Sparrows with better adaptability values (discoverers) will prefer to guide foraging direction and range when foraging in groups, due to the greater foraging range of discoverers.

During each iteration of the Sparrow Search Algorithm, the location update of the discoverer is described as follows:

x_{i, j}^{t + 1} = {\begin{matrix} x_{i, j}^{t} \exp (\frac{- i}{α i t e r_{\max}}), R_{2} < S T \\ x_{i, j}^{t} + Q L, R_{2} \geq S T \end{matrix}

(16)

where

i t e r

is the current iteration factor;

i t e r_{\max}

is the constant with the highest number of iterations;

x_{i, j}^{t}

x_{i, j}^{t}

is the value of the j dimension of the i sparrow at the t iteration,

j = 1, 2, 3, \dots, d

;

α \in (0, 1]

;

R_{2} = [0, 1]

is the alarm value;

S T \in [0.5, 1]

is the safety threshold value; Q is the random numbers that obey the normal distribution, and L is a 1 × d order matrix.

When

R_{2} < S T

, and there is no danger around at this point, the discoverer can continue to search for food.

When

R_{2} \geq S T

, and there is danger around at this time, the discoverer needs to take cover to ensure safety quickly.

However, for sparrow populations, once the discoverer searches for a quality food source, the joiner will recognize it and fly near it to grab food during foraging. At the same time, some joiners will always watch the discoverer and are ready to fight for food. Thus, the location update plan result from the joiner is as follows:

x_{i, j}^{t + 1} = {\begin{matrix} Q \exp (\frac{X_{w o r s t} - X_{i, j}^{t}}{i^{2}}), i > \frac{n}{2} \\ x_{p}^{t + 1} + | x_{i, j}^{t} - x_{p}^{t + 1} | A^{+} L, other \end{matrix}

(17)

where

x_{p}

is the producer optimal location;

X_{w o r s t}

is the worst position; A is the 1 × d order matrix, and any element is randomly assigned a value of 1 or −1. When

i > \frac{n}{2}

, it indicates that the i joiner failed to grab the food and needs to forage again for it.

In conclusion, assuming 10–20% of sparrows are aware of the danger, the initial position random generation rule is:

x_{i, j}^{t + 1} = {\begin{matrix} x_{b e s t}^{t} + λ | x_{i, j}^{t} - x_{b e s t}^{t} |, f_{i} > f_{g} \\ x_{i, j}^{t} + k \frac{| x_{i, j}^{t} - x_{w o r s t}^{t} |}{f_{i} - f_{w} + ε}, f_{i} = f_{g} \end{matrix}

(18)

where

λ

is the step control function, obeying a normal distribution with mean 0 and variance 1;

f_{i}

is the current sparrow adaptation values;

f_{g}

is the global optimal adaptation value;

x_{b e s t}

is the global best position;

k

is the direction of movement;

ε

is the minimum constant. When

f_{i} > f_{g}

, sparrows are more dangerous at the fringes of the population. When

f_{i} = f_{g}

, aware of the danger, the sparrow moves closer to other sparrows for safety in order to avoid it.

4.2. LSTM Neural Networks

LSTM is a special form of the traditional recurrent neural network (RNN), which was proposed by Sepp Hochreiter in 1977 for the gradient vanishing problem of RNN. Compared with RNN, which has only one state h in the hidden layer, LSTM adds a unit state c to the original structure of RNN. In order to make long-term preservation of short-term input, LSTM proposes to store and update the cell state in the form of “gate”. The concepts of input gate, forget gate, and output gate are proposed, and the long-term retention of information is finally realized through the control of the three gates as shown in Figure 1 and Figure 2.

The forget gate controls the proportion of the unit state from the previous moment saved to the current moment. The formula sigmoid can be obtained by the activation function:

f_{t} = δ (W_{f} \times (h_{t - 1}, x_{t}) + b_{f})

(19)

where

W_{f}

is the weight matrix of the forgotten gate;

b_{f}

is the offset items;

δ

is the sigmoid stimulus function;

x_{i}

is the current input, and

h_{t - 1}

is the hidden output in the previous moment.

The input gate controls the proportion of the network state saved to the cell state at the current moment. The following formula is:

i_{t} = δ (W_{i} \times (h_{t - 1}, x_{t}) + b_{i})

(20)

\tilde{C_{t}} = \tanh (W_{c} \times (h_{t - 1}, x_{t}) + b_{c})

(21)

where

W_{i}

is the weight matrix of the sigmoid layer;

b_{i}

is the bias term for the input gate sigmoid layer;

W_{c}

is the weight matrix of the input gate tanh layer, and

b_{c}

is the bias term for output gate tanh layer.

The expression after cell state update is:

C_{t} = f_{t} C_{t - 1} + i_{t} a_{t}

(22)

The output gate can extract valid information from the current cell state to be used in a new hidden layer. The mathematical expression of the output gate is:

o_{t} = σ (W_{o} \times (h_{t - 1}, x_{t}) + b_{o})

(23)

where

W_{o}

is the weight matrix of the output gate, and

b_{o}

is the bias term for the output gate.

The final output of LSTM is:

h_{t} = o_{t} \tanh C_{t}

(24)

5. Construction of the Model

Affected by the measurement accuracy of the instrument and the inherent noise associated with the information acquisition module of the monitoring system, it is inevitable that there are certain random errors in the monitoring data that cannot be explained by environmental factors, which will lead to prediction accuracy when building a dam deformation monitoring model. Therefore, it is necessary to perform prediction preprocessing on the data. In this paper, the combined noise reduction approach based on EMD combined with wavelet threshold is used to decompose and reconstruct the original data, and combined with the Sparrow Search Algorithm (SSA) to optimize the LSTM model, a concrete dam deformation monitoring model based on empirical mode decomposition (EMD) combined with wavelet threshold noise reduction, and combined with the Sparrow Search Algorithm (SSA) to optimize the long short-term memory network (LSTM) was constructed. The specific process is shown in Figure 3 below:

Step 1: The monitored raw data are decomposed by EMD, and the IMF components obtained from the decomposition are distributed from high to low. Perform wavelet threshold noise reduction on high-frequency IMF components, reconstruct the high-frequency IMF components after noise reduction and low-frequency IMF components, and obtain the data after noise reduction;
Step 2: Initialize and normalize the denoised data. Determine the following parameters: length of LSTM time window, number of hidden layer cells, sparrow population size and the number of iterations. Subsequently, initial safety threshold and sparrow position;
Step 3: Use the predicted value of the LSTM algorithm and the root mean square of the sample data to determine the fitness value of each sparrow;
Step 4: Update the sparrow position, get a new fitness value, and search for the optimal position of the population and the global optimal value;
Step 5: Perform iterations, determine whether the maximum number of iterations is reached, and obtain the optimal individual solution. Stop the iteration if the maximum value is reached and determine the optimal parameters of the LSTM. If not, repeat the loop step;
Step 6: Substitute the obtained LSTM parameters into the training grid to make predictions.

6. Case Study

6.1. Factsheet

A hydropower station is located in southwestern Yunnan Province, China. The dam is a concrete double-bend arch dam with a crest elevation of 1245 m. The maximum dam height is 292 m, the dam crest length is 992.74 m, the arch crown beam top width is 13 m, and the arch crown beam bottom width is 69.49 m. Considering the low frequency of manual observation, measurement points were selected from the automatic measurement points for comparative analysis. Among them, C4-A22-PL-05 measurement points are highly reliable, with fewer missing or jumping data and fewer instrument failures, which can provide longer and more accurate deformation monitoring data. A point is selected at the dam body as an observation point, and the upstream and downstream water levels and temperatures corresponding to this measurement point are shown in Figure 4, Figure 5 and Figure 6 below.

6.2. Data Noise Reduction Based on EMD Combined with Wavelet Threshold

Since the wavelet threshold noise reduction in the data processing, although the results are smoother, but in the extreme points will still be more blurred, and the traditional EMD method will remove the high-frequency components, the use of low-frequency components for reconstruction in a certain degree will make the signal distortion. Therefore, this paper proposes a noise reduction method based on EMD combined with wavelet threshold. First, the original monitoring data of the dam is decomposed by EMD, the high-frequency IMF components obtained from the decomposition are denoised by the wavelet threshold, and the low-frequency components obtained by the EMD decomposition are reconstructed. This method effectively avoids the signal distortion caused to some extent by the traditional EMD method. In this paper, different methods are used to denoise the data. The analysis results and the IMF components are shown in Figure 7 below:

It can be seen from Figure 7 and Figure 8 that after the dam deformation signal is processed based on EMD combined with the wavelet threshold noise reduction method, it is decomposed into five components, which are arranged in order from top to bottom by frequency. Using EMD combined with wavelet threshold noise reduction, the burrs near the extreme points are significantly reduced, the data at the peaks and valleys are smoother, and the amplitude of the signals at the peaks and valleys is well preserved. Compared with the wavelet threshold noise reduction effect, the denoising effect is very ideal, and can well maintain the basic shape of the original data of the dam deformation.

6.3. Model Analysis

There are many factors affecting the deformation of arch dams, such as the time factor, the water pressure factor, and the temperature, therefore, the choice of model parameters is more important. For the model input 10 vectors, where the timing factor is a combination of linear

θ

and

\ln θ

, taken

(θ - θ_{0}), (\ln θ - \ln θ_{0})

. The temperature factor is taken

\sin \frac{2 π t}{365} - \sin \frac{2 π t_{0}}{365}

,

\cos \frac{2 π t}{365} - \cos \frac{2 π t_{0}}{365}

,

\sin \frac{4 π t}{365} - \sin \frac{4 π t_{0}}{365}

,

\cos \frac{4 π t}{365} - \cos \frac{4 π t_{0}}{365}

. The dam is a concrete hyperbolic arch dam, therefore, the water pressure factor is taken as

(H - H_{0})

,

{(H - H_{0})}^{2}

,

{(H - H_{0})}^{3}

,

{(H - H_{0})}^{4}

. After selecting the impact factor and the effect dose, in order to eliminate the influence of the dimension and the magnitude difference between the impact factor as too large, the impact factor and the effect dose are normalized. The nodes of the input layer and output layer of the LSTM model in this paper are 1 and 9, respectively, the number of units in the two LSTM hidden layers are 100 and 50, respectively, and the maximum number of iterations is set to 50, respectively. Select the training set and test set, use the training set data to make the model fully learn the deformation law of the dam, and use the trained prediction model to predict the deformation of the prediction set data.

In this paper, the monitoring data of measurement point C4-A22-PL-05 from 2 February 2014 to 16 June 2015 were selected for parameter searching optimization of SSA-LSTM. The SSA algorithm optimizes the parameters of the LSTM network, which are the number of hidden neurons and the learning rate, respectively, and takes the root mean square error of the monitored real data and the predicted data as the fitness function. At the same time, the sparrow population was set to 10, the number of iterations was 50, the search dimension was 4, the number of neurons m was set to (1, 100), and the learning rate was (0.0001, 0.01). After SSA algorithm optimization, the number of hidden neurons and the learning rate of the two layers were (69, 25, 0.00745). The dam deformation data from the C4-A22-PL-05 measuring point from 17 June 2015 to 13 April 2016 are used as the training set, and the dam monitoring data from 13 April 2016 to November 2016 are used as the test set to carry out testing. In order to verify the validity of the model, different models LSTM, and PSO-SVM are used to construct the corresponding prediction models. The prediction results of the three models are shown in Table 1. The adaptation curve is shown in Figure 9. The prediction curves and model residuals are shown in Figure 10 and Figure 11 below.

The correlation results in Table 1 show that the complex correlation coefficient R²: SSA-LSTM model has a value of 0.9533, LSTM model has a value of 0.9036, and PSO-SVM model has a value of 0.885 at this measurement point. It can be seen that for the selected measurement point C4-A22-PL-05, SSA-LSTM > LSTM > PSO-SVM, where the SSA-LSTM model had the highest actual fit.

Root mean square error (RMSE): the value of SSA-LSTM model at this measuring point is 0.06358, the value of LSTM model is 0.0913, and the value of PSO-SVM model is 0.1293. It can be seen that for the selected C4-A22-PL-05 measuring point, SSA-LSTM < LSTM < PSO-SVM, and the root mean square error of SSA-LSTM model is the smallest. Among them, the SSA-LSTM model is reduced by 2.7% compared with the LSTM model and 6.5% compared with the PSO-SVM model.

Mean absolute error (MAE): the value of SSA-LSTM model at this measuring point is 0.05345, the value of LSTM model is 0.07611, and the value of PSO-SVM model is 0.09564. It can be seen that for the selected C4-A22-PL-05 measuring point, SSA-LSTM < LSTM < PSO-SVM, and the average absolute error of SSA-LSTM model is the smallest. Among them, the SSA-LSTM model is reduced by 1.9% compared with the LSTM model and 4.2% compared with the PSO-SVM model.

As can be seen from Figure 10, the prediction model constructed by the three algorithms of SSA-LSTM, LSTM, and PSO-SVM is generally consistent with the actual displacement change process. In comparison, the prediction results of SSA-LSTM algorithm are closer to the measured deformation than those of LSTM and PSO-SVM prediction models. It can be seen from Figure 11 that the residual of the SSA-LSYM model has no obvious change rule and its range of variation is significantly smaller. Other models are increasing with time residuals. It can be seen from Figure 10 and Figure 11 that the SSA-LSTM model fits better than the other two models, and its residual variation range fluctuates less, indicating that the model can more accurately represent the complex nonlinear function relationship between the impact factor and the dam deformation.

7. Conclusions

This paper proposes a noise reduction method based on EMD combined with wavelet threshold, using the EDM method to decompose the original monitoring data of the dam, and applying wavelet threshold noise reduction to the decomposed high- frequency IMF components. The high-frequency IMF components after noise reduction are obtained, and the low-frequency IMF components obtained by decomposition are combined for reconstruction. A prediction model is constructed from the denoised data, which improves the prediction accuracy of the SAA-LSTM model.
This paper uses the Sparrow Search Algorithm to optimize the long short-term memory (LSTM), and uses the good stability, convergence speed, scalability, and robustness of the Sparrow Search Algorithm to perform grid training and parameter optimization of the LSTM. The global optimal location and fitness values are updated, and the optimized LSTM model is optimized in terms of the number of hidden layer nodes and learning rate using grid search to effectively mine the complex functional relationship between the dam deformation and its influence factors.
The two deformation prediction models of LSVM and PSO-SVM are compared by using the example verification analysis. Compared with the other three models, the multiple correlation coefficient R² of the SSA-LSTM model is 0.9533, which is closer to 1 and has better fitting accuracy. The mean absolute error and root mean square error are 0.05345 and 0.06358, which are smaller than the other two models. It can be seen that the prediction accuracy and convergence speed of the SSA-LSTM model have been significantly improved, which provides a new method for high-precision prediction of dam deformation and is more suitable for practical engineering.

Author Contributions

Conceptualization, C.Z. and B.O.; validation, S.F., Z.L. and M.H.; writing—original draft preparation, C.Z.; methodology, C.Z.; funding, B.O. and S.F. All authors have read and agreed to the published version of the manuscript.

Funding

This work is supported by the National Natural Science Foundation of China (No. 52069029).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Original data are available upon reasonable request.

Conflicts of Interest

The authors declare no conflict of interest.

References

Wei, W.; Lou, S.Y.; Xu, F. A combined displacement prediction model for concrete arch dams based on monitoring time-series decomposition and reconfiguration. Eng. Sci. Technol. 2022, 54, 1–13. [Google Scholar] [CrossRef]
Li, Y.; Bao, T.; Shu, X. A Hybrid Model Integrating Principal Component Analysis, Fuzzy C-Means, and Gaussian Process Regression for Dam Deformation Prediction. Arab. J. Sci. Eng. 2020, 46, 4293–4306. [Google Scholar] [CrossRef]
Kang, F.; Liu, J.; Li, J. Concrete dam deformation prediction model for health monitoring based on extreme learning machine. Struct. Control. Health Monit. 2017, 24, e1997. [Google Scholar] [CrossRef]
Fan, Z.; Cui, W.; Chen, M. IPSO-RVM based dam safety warning model. J. Chang. Acad. Sci. 2016, 33, 48–51. [Google Scholar] [CrossRef]
Huang, W.; Fan, Z. Application of Gray Self-Memory Model in Prediction of Ice Flood Water Level of the Yellow River. Yellow River 2013, 35, 3–5. [Google Scholar]
Yang, S.; Chen, T. Research on electrical main wiring fault monitoring model for water conservancy projects based on grey system theory. Hydropower Technol. 2020, 51, 88–95. [Google Scholar] [CrossRef]
Lin, Z. Research on Deformation Prediction Model of Earth-rock Dam Based on BP Neural Network. People’s Pearl River 2020, 41, 74–78. [Google Scholar]
Chen, Z.; Xiong, X.; You, Y. Variational modal decomposition and long-short time neural networks for dam deformation prediction. Mapp. Sci. 2021, 46, 34–42. [Google Scholar] [CrossRef]
Shao, N.; Yu, Z.W. Application of wavelet neural networks in dam deformation forecasting. Urban Surv. 2018, 4, 156–157. [Google Scholar]
Wei, B.; Liu, B.; Xu, F. Hybrid model for multi-measurement point deformation monitoring of concrete arch dams incorporating PSO-SVM. J. Wuhan Univ. Inf. Sci. Ed. 2021, 1–14. [Google Scholar]
Huai, Z.; Zhe, X.; Zhi, P. Performance improvement method of support vector machine-based model monitoring dam safety. Struct. Control. Health Monit. 2016, 23, 252–266. [Google Scholar] [CrossRef]
Li, M.; Pan, J.; Lui, Y. A Deformation Prediction Model of High Arch Dams in the Initial Operation Period Based on PSR-SVM-IGWO. Math. Probl. Eng. 2021, 2021, 8487997. [Google Scholar] [CrossRef]
Luo, H.; Guo, S.; Bao, W. Random Forest Model and Application for Arch Dam Deformation Monitoring and Prediction. South-to-North Water Divers. Water Conserv. Technol. 2016, 14, 116–121. [Google Scholar] [CrossRef]
Li, X.; Wen, Z.; Su, H. An approach using random forest intelligent algorithm to construct a monitoring model for dam safety. Eng. Comput. 2019, 37, 39–56. [Google Scholar] [CrossRef]
Vadiati, M.; Rajabi Yami, Z.; Eskandari, E.; Nakhaei, M.; Kisi, O. Application of artificial intelligence models for prediction of groundwater level fluctuations: Case study (Tehran-Karaj alluvial aquifer). Environ. Monit. Assess. 2022, 194, 619. [Google Scholar] [CrossRef] [PubMed]
Samani, S.; Vadiati, M.; Azizi, F.; Zamani, E.; Kisi, O. Groundwater Level Simulation Using Soft Computing Methods with Emphasis on Major Meteorological Components. Water Resour. Manag. 2022, 36, 3627–3647. [Google Scholar] [CrossRef]
Yi, Z.; Su, H.; Yang, L. Combined modeling method of random forest and swordfish optimization for deformation monitoring model of concrete dam. Hydropower Energy Sci. 2021, 39, 106–109. [Google Scholar]
Wei, B.; Yuan, D.; Bin, X. Deformation prediction model of concrete dam based on optimal correlation vector machine based on chicken swarm algorithm. Water Conserv. Hydropower Technol. 2020, 51, 98–105. [Google Scholar] [CrossRef]
Zhi, P.; Zen, G.; Hai, D. Self CNN-based time series stream forecasting. Electron. Lett. 2016, 52, 1857–1858. [Google Scholar] [CrossRef]
Min, K.; Ki, M.; Pa, R.; Huh, K. RNN-Based Path Prediction of Obstacle Vehicles with Deep Ensemble. IEEE Trans. Veh. Technol. 2019, 68, 10252–10256. [Google Scholar] [CrossRef]
Hu, C.; Pei, H.; Si, X. A Prognostic Model Based on DBN and Diffusion Process for Degrading Bearing. IEEE Trans. Ind. Electron. 2019, 67, 8767–8777. [Google Scholar] [CrossRef]
Li, Y.; Bao, T.; Gong, J. The prediction of dam displacement time series using STL, extra-trees, and stacked LSTM neural network. IEEE Access 2020, 8, 94440–94452. [Google Scholar] [CrossRef]
Zhang, J.; Cao, X.; Xie, J. An Improved Long Short-Term Memory Model for Dam Displacement Prediction. Math. Probl. Eng. 2019, 2019, 6792189. [Google Scholar] [CrossRef] [Green Version]
Wang, S.; Yang, B.; Chen, H.; Fang, W.; Yu, T. LSTM-Based Deformation Prediction Model of the Embankment Dam of the Danjiangkou Hydropower Station. Water 2022, 14, 2464. [Google Scholar] [CrossRef]
Ou, B.; Wu, B.; Yuan, J. Deformation prediction model of concrete dam based on LSTM. Prog. Water Conserv. Hydropower Sci. Technol. 2022, 42, 21–26. [Google Scholar] [CrossRef]
Wang, X.; Li, K.; Zhang, Z. Seepage pressure prediction model of earth-rock dam coupled with ALO-LSTM and feature attention mechanism. J. Hydraul. Eng. 2022, 53, 403–412. [Google Scholar] [CrossRef]
Yang, D.; Gu, C.; Zhu, Y. A Concrete Dam Deformation Prediction Method Based on LSTM With Attention Mechanism. IEEE Access 2020, 8, 185177–185186. [Google Scholar] [CrossRef]
Liu, W.; Pan, J.; Ren, Y.; Wu, Z.; Wang, J. Coupling prediction model for long-term displacements of arch dams based on long short-term memory network. Struct. Control Health Monit. 2020, 27, e2548. [Google Scholar] [CrossRef]
Luo, D.; Zheng, D. Wavelet analysis and ARMA prediction model for dam deformation. J. Water Resour. Water Transp. Eng. 2016, 3, 70–75. [Google Scholar] [CrossRef]
Su, H.; Li, X.; Yang, B.; Wen, Z. Wavelet support vector machine-based prediction model of dam deformation. Mech. Syst. Signal Process. 2018, 110, 412–427. [Google Scholar] [CrossRef]
Yang, G.; Fan, Z.; Fu, C. Research on outlier identification techniques for dam safety monitoring data based on singular spectrum analysis. Hydroelectricity 2021, 47, 125–129. [Google Scholar] [CrossRef]
Zhang, J.; Heng, Y. Concrete dam deformation prediction model based on VMD-PE-CNN. Hydropower Technol. 2022, 1–11. (In Chinese) [Google Scholar]
Hu, H.; Zhang, J.; Li, T. A Novel Hybrid Decompose-Ensemble Strategy with a VMD-BPNN Approach for Daily Streamflow Estimating. Water Resour. Manag. 2021, 35, 5119–5138. [Google Scholar] [CrossRef]
Liu, S.; Xu, J.; Ju, B. Dam deformation prediction based on EMD and RBF neural networks. Mapp. Bull. 2019, 8, 88–91. [Google Scholar] [CrossRef]
Chen, Y. Back analysis of permeability coefficient of earth-rock dam based on EMD-RVM. IOP Conf. Ser. Earth Environ. Sci. 2020, 560, 12095. [Google Scholar] [CrossRef]
Sheng, J.; Teng, F.; Chen, D.; Qian, Q. Dam deformation prediction model and application based on EMD decomposition method. Water Conserv. Hydropower Technol. 2017, 48, 41–44. [Google Scholar] [CrossRef]
Xu, X.; Zhang, P.; Jian, J. Research on dam deformation prediction based on EMD-PSO-ELM algorithm. Softw. Guide 2020, 19, 1–5. [Google Scholar]
Xin, D.; Huan, Y.; Cai, Q. Ultra-short-term wind power prediction based on wavelet threshold noise reduction and BP neural network. World Sci. Technol. Res. Dev. 2011, 33, 1006–1010. [Google Scholar] [CrossRef]
Xue, J. Research and Application of a Novel Swarm Intelligence Optimization Technique; Donghua University: Shanghai, China, 2020. [Google Scholar]

Figure 1. RNN cell structure.

Figure 2. LSTM cell structure.

Figure 3. Concrete arch dam prediction process based on EMD combined with wavelet threshold noise reduction coupled with SSA-LSTM.

Figure 4. C4-A22-PL-05 measurement point downstream water level.

Figure 5. C4–A22–PL–05 measurement point upstream water level.

Figure 6. C4–A22–PL–05 measuring point air temperature.

Figure 7. C4–A22–PL–05 noise reduction comparison of measurement point.

Figure 8. IMF components after noise reduction based on EMD combined with wavelet thresholds. (a) IMF1 components after noise reduction based on EMD combined with wavelet thresholds; (b) IMF2 components after noise reduction based on EMD combined with wavelet thresholds; (c) IMF3 components after noise reduction based on EMD combined with wavelet thresholds; (d) IMF4 components after noise reduction based on EMD combined with wavelet thresholds; (e) IMF5 components after noise reduction based on EMD combined with wavelet thresholds.

Figure 9. Adaptability curves.

Figure 10. C4–A22–PL–05 measurement point prediction curve.

Figure 11. C4–A22–PL–05 measurement point prediction residuals.

Table 1. Prediction results of each prediction model.

Measuring Points	Predictive Models	RMSE/mm	MAE/mm²	MSE/mm	R²
C4-A22-PL-05	SSA-LSTM	0.06358	0.05345	0.00404	0.9533
	LSTM	0.0913	0.07611	0.00835	0.9036
	PSO-SVM	0.1293	0.09564	0.01096	0.8852

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Zhang, C.; Fu, S.; Ou, B.; Liu, Z.; Hu, M. Prediction of Dam Deformation Using SSA-LSTM Model Based on Empirical Mode Decomposition Method and Wavelet Threshold Noise Reduction. Water 2022, 14, 3380. https://doi.org/10.3390/w14213380

AMA Style

Zhang C, Fu S, Ou B, Liu Z, Hu M. Prediction of Dam Deformation Using SSA-LSTM Model Based on Empirical Mode Decomposition Method and Wavelet Threshold Noise Reduction. Water. 2022; 14(21):3380. https://doi.org/10.3390/w14213380

Chicago/Turabian Style

Zhang, Caiyi, Shuyan Fu, Bin Ou, Zhenyu Liu, and Mengfan Hu. 2022. "Prediction of Dam Deformation Using SSA-LSTM Model Based on Empirical Mode Decomposition Method and Wavelet Threshold Noise Reduction" Water 14, no. 21: 3380. https://doi.org/10.3390/w14213380

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Prediction of Dam Deformation Using SSA-LSTM Model Based on Empirical Mode Decomposition Method and Wavelet Threshold Noise Reduction

Abstract

1. Introduction

2. Selection of Statistical Models for Dam Deformation Prediction

3. Wavelet Threshold-EMD-Based Data Noise Reduction Method

3.1. Empirical Mode Decomposition

3.2. Wavelet Threshold Noise Reduction

4. The Prediction Model Based on Optimized LSTM Model with Sparrow Search Algorithm

4.1. Sparrow Search Algorithm

4.2. LSTM Neural Networks

5. Construction of the Model

6. Case Study

6.1. Factsheet

6.2. Data Noise Reduction Based on EMD Combined with Wavelet Threshold

6.3. Model Analysis

7. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI