Study on the Optimization and Stability of Machine Learning Runoff Prediction Models in the Karst Area

Mo, Chongxun; Liu, Guangming; Lei, Xingbi; Zhang, Mingshan; Ruan, Yuli; Lai, Shufeng; Xing, Zhenxiang

doi:10.3390/app12104979

Open AccessArticle

Study on the Optimization and Stability of Machine Learning Runoff Prediction Models in the Karst Area

by

Chongxun Mo

^1,2,3,

Guangming Liu

^1,2,3,

Xingbi Lei

^1,2,3,*

,

Mingshan Zhang

⁴,

Yuli Ruan

^1,2,3,

Shufeng Lai

^1,2,3 and

Zhenxiang Xing

^1,2,3,5

¹

College of Architecture and Civil Engineering, Guangxi University, Nanning 530004, China

²

Guangxi Provincial Engineering Research Center of Water Security and Intelligent Control for Karst Region, Guangxi University, Nanning 530004, China

³

Key Laboratory of Disaster Prevention and Structural Safety of the Ministry of Education, College of Civil Engineering and Architecture, Guangxi University, Nanning 530004, China

⁴

Shandong Hydrological and Water Resources Bureau of the Yellow River Conservancy Commission, Dongying 257000, China

⁵

School of Water Conservancy and Civil Engineering, Northeast Agricultural University, Harbin 150030, China

^*

Author to whom correspondence should be addressed.

Appl. Sci. 2022, 12(10), 4979; https://doi.org/10.3390/app12104979

Submission received: 7 April 2022 / Revised: 6 May 2022 / Accepted: 10 May 2022 / Published: 14 May 2022

Download

Browse Figures

Versions Notes

Abstract

:

Runoff prediction plays an extremely important role in flood prevention, mitigation, and the efficient use of water resources. Machine learning runoff prediction models have become popular due to their high computational efficiency. To select a model with a better runoff simulation and to validate the stability of the model, the following studies were done. Firstly, the support vector machine Model (SVM), the Elman Neural Network Model (ENN), and the multi-model mean model (MMM) were used for the runoff prediction, with the monthly runoff data from 1963–2007 recorded by the Pingtang hydrological station in the Chengbi River Karst Basin, China. Secondly, the comprehensive rating index method was applied to select the best model. Thirdly, the indicators of the hydrologic alteration–range of variability approach (IHA-RVA) was introduced to measure the model stability with different data structure inputs. According to the comprehensive rating index method, the SVM model outperformed the other models and was the best runoff prediction model with a score of 0.53. The overall change of the optimal model was 10.52%, which was in high stability.

Keywords:

support vector machine; Elman neural network; multi-model mean model; model stability; runoff prediction; Chengbi River Karst Basin

1. Introduction

Accurate runoff prediction plays a critical role in the efficient and optimal allocation of water resources. Effective runoff prediction can be used for flood control, drought resistance, power generation, and irrigation [1,2,3,4]. In recent years, runoff prediction has received much attention. These studies focus on the exploration and application of neural network models, and the development and performance improvement of physical models. Gauche et al. [5] proposed two multi-timescale Long Short-Term Memory architectures that jointly predict multiple timescales within one mode, which can process different input variables at different timescales. An Adaptive Neural Fuzzy Inference System (ANFIS) model was developed to determine the discharge coefficient of the labyrinth spillway and thus predict runoff [6]. A Bayesian geostatistical method was used to interpolate hydrological data with different spatial support, and the model was evaluated by predicting the annual runoff in the watershed around Vos, Norway [7]. It was found that combining the point and surface data produced a higher prediction accuracy compared to using only one data type. In addition, to describe the nonhomogeneous characteristics of runoff series, a novel prediction model was established based on a nonhomogeneous Markov chain (NHMC-RPM) [8]. Furthermore, Excel solver was a promising way to reduce the problems of the parameter estimation of the nonlinear Muskingum routing models [9]. A reverse flood routing approach was developed based on the Muskingum model, and results indicated that the proposed methodology could substantially (up to almost 82%) improve the comparison with the observed inflows [10]. It has been shown that the Bayesian network-cluster model provides a suitable approach for predicting pollutant transport in natural rivers [11].

With the continuous development of machine learning, machine learning methods have been widely used in runoff predictions because of their excellent performance and relatively high applicability. For instance, the Artificial Neural network model and Long Short-Term Memory network model were applied to simulate the rainfall-runoff process according to flood events from 1971 to 2013 in the Fen River basin, monitored by 14 rainfall stations and one hydrologic station [12]. The results show that the two networks are both suitable for rainfall-runoff models and are better than conceptual and physical-based models. Liang et al. [13] proposed a support vector regression hydrologic uncertainty processor (SVR-HUP) model to predict the monthly, flood season, and annual runoff volumes. It was demonstrated that an SVR-HUP can predict the long-term runoff accurately and quantify the prediction uncertainty reasonably. A study of the daily runoff simulation was carried out based on the SVM-Copula model [14], and the results showed that the model can maintain the statistical characteristics of the original daily runoff series well, which means it could be a new way of random simulation in the case of insufficient data. In 2020, a Support Vector Machine–Artificial Flora (SVM-AF) hybrid model was used for the prediction of the daily streamflow [15]. The results showed that the proposed hybrid SVM-AF model outperformed the wavelet support vector machine model and the Bayesian support vector machine model in effectively predicting the flow and daily discharge. In the arid watershed, Samantaray and Sahoo [16] applied the Back Propagation Neural Network (BPNN), the Feed Forward Back Propagation Neural Network (FFBPNN), and the Cascade Forward Back Propagation Neural Network CFBPNN) algorithms for the prediction of the runoff. The results showed that the BPNN model is more suitable for predicting the connection between rainfall and runoff.

As different evaluation criteria were chosen, it is evident that the best model identified by each criterion is different. Due to the model-dependence, a scientific method is a key issue in runoff prediction. The SWAT, SWAT-ANN, and SWAT-MLP were applied to the runoff prediction. Analysis in Lv et al. [17] showed that the Soil and Water Assessment Tool–Whale Optimization Algorithm model was the best, as it was evaluated with four indicators (R2, RMSE, NSE, RE). According to Sibtain et al. [18], five statistical indexes were employed, which contain RMSE, MAE, MAPE, and MSE and R2 to evaluate the performance of eight runoff prediction models, and the results showed that complete ensemble empirical decomposition with the additive noise-variation mode decomposition–support vector machine (CEEMDAN-VMD-SVM) was the best model. Additionally, multiple linear regression (MLR), the back propagation neural network (BPNN), the Elman neural network (ENN), the particle swarm optimization–support vector machine for regression (PSO-SVR), and a coupling model composed of BPNN, ENN, and PSO-SVR were constructed to predict the annual runoff. The coupling model was considered to have the best predictive performance after the evaluation of the four metrics [19].

Although the previous study on runoff prediction has achieved certain results, there are still several problems, as follows. At present, there is no unified standard and method for the prediction model selection, so a more scientific and reasonable evaluation method for model selection is very necessary. Moreover, the stability of the model prediction performance has not been considered in previous studies, and how to test the stability of the model is still a major challenge for current research. Generally, most of the previous studies focused on runoff in non-karst areas, but the prediction models based on artificial intelligence and whether they can be applicable for runoff prediction analysis in karst areas needs to be further explored. Therefore, the objective of this study is to construct a suitable monthly runoff prediction model, establish a scientific evaluation system for selecting the model, and test the stability of the optimal model. For these purposes, the following studies will be conducted: firstly, the support vector machine model (SVM), the Elman Neural Network Model (ENN), and multi-model mean model (MMM) will be developed for the runoff prediction. Then, a scoring evaluation system with five indicators will be established to select the best model. Finally, the stability of the models will be tested by the IHA-RVA method with different data structure inputs.

2. Data and Methods

In order to better reflect the research idea of this paper, the specific flow chart is as follows in Figure 1.

2.1. Study Area and Data

The Chengbi River Karst Basin, located in Baise City, northwest of Guangxi, which covers an area of 2087 km² and contains 1121 km² of Karst area [20], was selected for this study. The basin has a subtropical monsoon climate, with a mild climate and abundant precipitation, which accounts for about 87% of the total precipitation during the flood season. The upper part of the basin is a typical karst peak forest area, with many karst features such as volute rivers, water caves, and skylights; the lower part is hilly, with high forest cover and dense vegetation [21]. There is a Chengbi River Reservoir in the basin, which is located 7 km upstream from the outlet of the basin. As the water source of drinking water of Baise city, Chengbi River Reservoir also has many roles such as power generation, flood control, and irrigation of farmland. Therefore, developing a suitable runoff prediction model with high accuracy is essential and significant for water resource allocation and sustainable economic growth in the Chengbi River Karst Basin area. There are 12 telemetric rainfall stations, 2 hydrological stations, and 1 meteorological station in the Chengbi River Karst Basin. Due to historical reasons, only the Pingtang and Bashou stations have collected representative long-duration data. The Pingtang station was selected as a representative station, and the monthly runoff data of the station for a total of 55 years from 1963 to 2017 were used for the study and analysis, which was provided by the water resources management department. Moreover, the runoff depth was obtained by calculating the total runoff volume of the specified section and dividing it by the rainfall catchment area above the section. The approximate location of the Chengbi River Karst Basin is shown in Figure 2.

2.2. Methods

2.2.1. Model for Prediction

Support Vector Machine

Suppose the training set of the SVM is

{X_{i}, Y_{i}}_{i}^{N}

, where

X_{i}

is the input variable,

Y_{i}

is the expected value, and

N

is the number of training sets. The SVM regression function is determined according to statistical learning theory [22] as follows.

f (x) = w_{i} ϕ_{i} (x) + b

(1)

where

ϕ_{i}

is the nonlinear transfer function that enables the mapping projection of the input variables from the low-dimensional space to the high-dimensional feature space,

w_{i}

is the representative weight vector, and

b

is the bias.

r (C) = C \frac{1}{N} \sum_{i = 1}^{N} L_{ε} (d_{i}, y_{i}) + \frac{1}{2} ‖ w ‖^{2}

(2)

L_{ε} (d, y) = {\begin{array}{r} | d - y | - ε, | d - y | \geq ε \\ 0, | d - y | < ε \end{array}

(3)

In Equation (2), the first part of the right-hand side of the equation is the empirical risk, which can be found by Equation (4) below. The loss value of the insensitive loss function is 0 when within the allowed range [23]. The regularization constant

C

can determine the criticality of the empirical risk in the optimization of real-world problems. The larger the value of

C

, the higher the corresponding percentage of empirical risk. The error tolerance

ε

denotes the approximate accuracy during training,

ξ

and

ξ^{*}

are positive relaxation variables, thus Equation (4) can be transferred to a constrained form as follows.

M i n i m i z e : \frac{1}{2} ‖ w ‖^{2} + C (\sum_{i}^{N} (ξ_{i} + ξ_{i}^{*}))

(4)

S u b j e c t t o {\begin{array}{l} \sum_{i = 1}^{N} (α_{i} - α_{i}^{*}) = 0 \\ α_{i}, α_{i}^{*} \in [0, C], i = 1, 2, 3, \dots, N \end{array}

(5)

To solve the optimization problem of Equation (4), the Lagrangian function is introduced as:

\begin{array}{l} L = \frac{1}{2} ‖ w ‖^{2} + C (\sum_{i = 1}^{N} (ξ_{i} + ξ_{i}^{*})) - \sum_{i = 1}^{N} α_{i} (w_{i} ϕ (x_{i}) + b - d_{i} + ε + ξ_{i}) \\ - \sum_{i = 1}^{N} α_{i}^{*} (d_{i} + w_{i} ϕ (x) - b + ε_{i}^{*}) - \sum_{i = 1}^{N} (β_{i} ξ_{i} + β_{i}^{*} ξ_{i}^{*}) \end{array}

(6)

In Equation (6), the minimum values of

a

,

b

, and

c

need to be calculated, while the maximum values of

d

and

f

need to be calculated. By adding the Karush–Kuhn–Tucker condition to the regression algorithm the equation is transformed into a double Lagrangian form,

\begin{array}{l} v (α_{i}, α_{i}^{*}) = \sum_{i = 1}^{N} d_{i} (α_{i} - α_{i}^{*}) - ε \sum_{i = 1}^{N} (α_{i} + α_{i}^{*}) \\ - \frac{1}{2} \sum_{i = 1, j = 1}^{N} (α_{i} - α_{i}^{*}) (α_{j} - α_{j}^{*}) K (x_{i}, x_{j}) \end{array}

(7)

Subject to {\begin{array}{l} \sum_{i = 1}^{N} (α_{i} - α_{i}^{*}) = 0 \\ α_{i}, α_{i}^{*} \in [0, C], i = 1, 2, 3, \dots, N \end{array}

(8)

where:

α_{i}

and

α_{i}^{*}

satisfy the equation. The calculation method of the optimal expected weight vector of the regression hyperplane is:

w^{*} = \sum_{i = 1}^{N} (α_{i} - α_{i}^{*}) K (x, x_{i})

(9)

f (x, α, α^{*}) = \sum_{i = 1}^{N} (α_{i} - α_{i}^{*}) K (x, x_{i}) + b

(10)

The kernel function in the above equation is expressed as

K (x_{i}, x_{j}) = φ (x_{i}) * φ (x_{j})

(11)

The kernel function is the inner product of the high-dimensional feature space, and the Mercer condition is considered when choosing the kernel function, i.e., choosing a semi-positive definite function. The kernel functions have more types, and the four main types of kernel functions commonly used in SVM are as follows [24].

(1): Linear kernel function:

K (x_{i}, x_{j}) = x_{i}^{T} x_{j}

(12)

(2): Sigmoid kernel function:

K (x_{i}, x_{j}) = \tanh (v x_{i}^{T} x_{j} + c)

(13)

(3): Gaussian kernel function:

K (x_{i}, x_{j}) = \exp (- \frac{{| x_{i} - x_{j} |}^{2}}{2 σ^{2}})

(14)

(4): Polynomial kernel functions:

K (x_{i}, x_{j}) = {(x_{i}^{T} x_{j} + c)}^{d}

(15)

The Gaussian kernel function performs best when compared to other kernels in terms of prediction error rate and rejection rate (i.e., correct prediction rate). Therefore, we chose the Gaussian kernel function as the kernel function of the SVM model in this study. The structure of the SVM model is schematically shown in Figure 3.

According to Izmailov et al. [25], when Gaussian kernel function is chosen as the kernel function, the parameter

g = 1 / 2 σ^{2}

that comes with the function has an impact on the speed of training and prediction. Specifically, the smaller the value of

g

, the more the number of support vectors. The franker the training period, the lower the training accuracy, and the worse the model prediction; the larger the value of

g

, the fewer the number of support vectors, the easier the model is to over-train. The larger the value of

C

, the lower the tolerance to error and the easier it is to overfit, while the smaller the value of

C

, the higher the tolerance to error and the easier it is to underfit. Therefore, it is important to determine the appropriate penalty parameter

C

and kernel function

g

. In this paper, the penalty factor and kernel function parameters in the support vector machine are optimally selected using a genetic algorithm, due to the advantages of simplicity, access, global parallelism, and good resistance to interference.

Elman Neural Network

The Elman neural network was established by Jeffrey and has four layers of network structure, that is the input layer, hidden layer, context layer, and output layer [26]. The function of the input layer is for passing the signal, the output layer is used for linear weighting, and the hidden layer is for passing the signal. In addition, Elman neural networks have nodes connected to the hidden layer and the context layer (or association layer, context layer, state layer) that receive feedback signals and can be considered as a one-step delay operator with a short-term memory function [27]. This property makes it sensitive to historical states. The network’s own ability to handle dynamic information is improved to accomplish time-varying adaptation, enhance global stability, and enable dynamic modeling. The hidden layer transfer function generally uses the nonlinear function, Sigmoid function, while the output layer and the correlation layer are linear functions. Therefore, compared with the forward neural network, Elman neural network is more suitable for predicting time series values. The structure of the Elman neural network is shown in Figure 4.

The calculation process of the model can be expressed as follows:

y_{q}^{(t)} = f (\sum_{j = 1}^{L} x_{j}^{(t)} ω_{q, j})

(16)

x_{j}^{(t)} = f (\sum_{i = 1}^{L} u_{i}^{(t)} w_{j, i} + \sum_{k = 1}^{L} c_{k}^{(t)} w_{j, k})

(17)

c_{k}^{(t)} = x_{j}^{(t - 1)}

(18)

where

y_{q}^{(t)}

is the output at time

t

of the output layer,

x_{j}^{(t)}

is the output at time

t

of the hidden layer,

c_{k}^{(t)}

is the output at time t of the context layer,

w_{q, j}

is the connection weight function of the output layer,

w_{j, i}

is the connection weight function of the hidden layer,

w_{j, k}

is the connection weight function of the context layer.

Multi-Model Mean Model

The principle of MMM is that the results are obtained by averaging the results of multiple model predictions. The MMM models are widely used because of the ease of operation and the ability to balance data differences among models. The MMM model is a simple way to reduce biases in individual model outputs [28]. MMM models have a wide range of applications in climate modeling. In Sandor‘s study, Nash-Sutcliffe efficiency coefficients indicated that MMM outperformed individual models in 92.3% of cases [29].

2.2.2. Model Selection Method

Firstly, five metrics, Nash efficiency coefficient (NSE), mean absolute percentage error (MAPE), root mean squared error (RMSE), mean absolute error (MAE), and qualification rate (QR), were chosen to compare the performance of the models. The focus of each indicator is different, so the first five indicators can be used for the initial screening model. The NSE is capable of evaluating the predictive power of a model concerning measuring the degree to which the simulated process value is close to the observed value. The closer the value is to 1, the better the predictive power of the model. MAPE can evaluate the analysis for the accuracy of predicting the smooth component in the runoff series, ideally the value of which is 1. RMSE is applied to measure the degree of variability of the data, and the smaller the value, the better the accuracy of the model [30]. The range of MAE is [0, +∞), and zero indicates that the predicted value exactly matches the true value. QR is the ratio of the number of qualified predicted flow values to the total number of flow values. The formulas for calculating the above-mentioned indexes are as follows.

N S E = 1 - \frac{\sum_{i = 1}^{N} {(y_{i} - y_{i}^{*})}^{2}}{\sum_{i = 1}^{N} {(y_{i} - \bar{y})}^{2}}

(19)

M A P E = \frac{1}{n} \sum_{i = 1}^{N} | \frac{y_{i} - y_{i}^{*}}{y_{i}} | \times 100 %

(20)

R M S E = \sqrt{\frac{1}{n} \sum_{i = 1}^{N} {(y_{i} - y_{i}^{*})}^{2}}

(21)

M A E = \frac{1}{n} \sum_{i = 1}^{N} | y_{i} - y^{*} |

(22)

Q R = \frac{k}{l} \times 100 %

(23)

where

y_{i}

is the measured value,

y_{i}^{*}

is the simulated value,

\bar{y}

is the average of the observed value, and

n

is the total number of time series.

Considering that individual models may exhibit better performance in one aspect, but cannot win across the board, a more integrated approach is needed. The composite rating index method (M_R) is based on the ranking of each rating index, which is a method to determine their consistency and obtain a composite ranking [31].

M_{R} = 1 - \frac{1}{n m} \sum_{i = 1}^{n} r_{i}

(24)

where

m

is the number of models,

n

is the number of indicators to be evaluated, and

r_{i}

is the ranking of the simulation capability of each model in each assessment metric (i = 1, 2, 3). The value

r_{i} = 1

indicates a model with the strongest simulation capability. It can be seen that 0 ≤ M_R < 1. The value

M_{R}

is closer to 1, indicating the stronger comprehensive simulation capability of the model.

2.2.3. Stability Test Method

Hydrological series can be divided into two types according to their organization: the sequential structure and the monthly structure. Many articles have investigated the prediction of time-series data with the traditional sequential series. The runoff series can be reconstructed as twelve series of the same month, from January to December, and this new data structure, called monthly series [32], is relatively rare in research. In this paper, the monthly series is utilized to test the stability of the model. The two data structure forms are shown in Figure 5.

Next, IHA-RVA was introduced to measure the stability of the model to analyze the influence of different data structures on the stability of the model. Since there are 33 indicators in the original formula and only five here, including NSE, MAPE, RMSE, MAE, and QR, a slight change was made to the formula in this study. The expression of IHA-RVA is given by the following equation [33].

D_{i} = | \frac{N m - N_{ε}}{N_{ε}} | \times 100 %

(25)

D_{0} = (\frac{1}{n} {\sum_{i = 1}^{5} D_{i}}^{2})^{0.5}

(26)

where

D_{i}

denotes the degree of change of each indicator,

D_{0}

denotes the overall degree of change,

N_{m}

denotes the value before the change of its indicator, and

N_{ε}

denotes the value of its indicator after the change. When

D_{i}

and

D_{0}

are between about 0% and 33%, it is the low degree of change, which also means that the model is highly stable. When

D_{i}

and

D_{0}

are between 33% and 67%, it is a moderate change, which means that the model is moderately stable. When

D_{i}

and

D_{0}

are between 67% and 100%, it is a high change, which means that the model is stable at a low level.

3. Results

3.1. Prediction Results

The continuous monthly runoff data were input into the model to gain the runoff prediction results from different models, as shown in Figure 6. To better analyze the eigenvalues of each model, the relevant data were calculated and plotted in Table 1.

For the average value, the observed monthly average runoff depth is 67.25 mm. The monthly average runoff depth simulated by the MMM model is 65.75 mm, which is the closest to the observed value, proving that the MMM model shows a promising prediction in this regard. The monthly average runoff depth predicted by the SVM model and ENN model is 61.36 mm and 70.15 mm, respectively. It can be seen that the predicted values of the SVM model and ENN model are different from the actual observed values, and the prediction results are not as outstanding as the MMM model.

The minimum value of the measured runoff was 1.51 mm, while the minimum values of the runoff predicted by the model are negative. In terms of minimum prediction, the SVM model, ENN model, and MMM model need to be improved to ensure that the data are all positive.

To further analyze the maximum monthly runoff, the runoff data graphs were extracted and the details are shown in Figure 6. For the maximum monthly runoff, the observed maximum monthly runoff is 440.57 mm. As shown in Figure 7, the maximum runoff predicted by the SVM, MMM, and ENN models were 253.28 mm, 264.18 mm, and 234.41 mm, respectively, with absolute errors of 187.29 mm, 176.39 mm, and 206.16 mm, obtained at sequence numbers of 128, 103, and 127. The predicted value of the SVM model is closest to the observed value, indicating a better prediction. In terms of time, the maximum monthly runoff predicted by the SVM model and the observed maximum monthly runoff is the same month, again indicating that the SVM model predicts better. The maximum monthly runoff predicted by the ENN model and the maximum observed monthly runoff occurred 25 months apart. In this respect, ENN had the worst prediction.

Meanwhile, the maximum monthly runoff predicted by the MMM model differs from the observed maximum monthly runoff by only one month on the time scale.

To further analyze the prediction performance, the absolute error (AE) was calculated, and the results are shown in Figure 8, Figure 9 and Figure 10. A positive AE value means that the predicted value is higher than the observed value, which means that the predicted runoff value is larger than the actual one. A negative AE value means that the predicted value is lower than the observed value, which means that the predicted amount of water is smaller than the actual amount. For the SVM model, 64 of the AE values were positive, accounting for 48.48% of the total data, while the ENN model and the MMM model had more than half of the positive AEs, reaching 59.85% and 56.06%, respectively. A positive AE value indicates that the predicted runoff depth is on the high side. The larger the predicted flow depth is, the safer it is for engineering designs (such as reservoir projects and bridge projects). From the above engineering safety point of view, the ENN model embodies a good performance.

A box plot of absolute errors was drawn in order to further reveal comparisons among the SVM model, ENN model, and MMM model in Figure 11. Thus, a positive absolute error denotes an overestimation of an observed value. Correspondingly, a negative absolute error indicates an overestimation of an observed value. A box plot is also used to show the dispersion of a set of data [34]. It is mainly used to reflect the characteristics of the original data distribution, but also to compare the characteristics of the distribution of multiple groups of data. In terms of the median, the SVM model has the best prediction with a median absolute error of 0 mm, the ENN model has a median absolute error of 5 mm, and the MMM model has a median absolute error of 4 mm. In terms of the fluctuation range, the normal range of absolute error for the SVM model is −72 mm to 65 mm, the normal range of absolute error for the ENN model is −72 mm to 104 mm, and the normal range of absolute error for the MMM model is −74 mm to 83 mm. The SVM model has the least fluctuation in absolute error.

The prediction accuracy of the SVM model, ENN model, and MMM model was further evaluated using the Taylor Diagram in Figure 12. The Taylor diagram was plotted based on the geometric relationship among the correlation coefficient (equal to the root of R²), standard deviation, and RMSE [34]. The Taylor diagram accurately and efficiently reflects the performance of the prediction model, with a closer proximity to observations indicating better predictions [35]. All three metrics indicate that the SVM model predictions were close to the observed values. Overall, the SVM model was closest to the observed values in comparison with the ENN model and MMM model.

Through the above analysis of the prediction performance of each model, it is concluded that each model has its own advantages and disadvantages, and it is impossible to directly determine which model is better, so further research is needed to make a better choice.

3.2. Model Selection Results

Firstly, the model evaluation is performed using different metrics, as shown in Table 2. For NSE, the MMM model outperforms the other two models, with an extremely narrow advantage over the SVM, with a difference of only 0.01. For MAPE, the SVM model outperforms the other two models, with a MAPE of 102.18%, which is significantly better than those of the ENN model (160%) and the MMM model (120%). For RMSE, the MMM model has the best prediction effect, which is 2.87 mm lower than ENN’s RMSE and slightly superior to SVM’s RMSE by 0.55 mm. Regarding the MAE values, the SVM model has the best prediction ability, and the overall error for all data is significantly smaller than the prediction errors of ENN and MMM. In terms of QR, the predictive power of SVM is much higher than ENN and MMM, being 0.12 and 0.06 higher than both, respectively. In conclusion, the simulation effect of ENN is the worst by all aspects of evaluation, while further calculations and discussion on the simulation effect of SVM and MMM are needed to find out which is better.

Secondly, the comprehensive rating index method was applied to rank the evaluation indexes of different models, and the calculation results of

r_{i}

are shown in Table 3. Based on the above scoring rules, the final scoring results of the three models were calculated and shown in Table 4. The results show that the SVM model is the best runoff simulation model with a score of 0.53, the MMM model is the next best with a score of 0.47, and the ENN model has the worst simulation with a score of only 0. Therefore, combining these results, the SVM model has the highest evaluation score among the three models and is the model with the best simulation effect.

3.3. Optimal Model Stability

First, we changed the data structure to build a new model. The results of the model preference in the previous step show that SVM is the best model among the three runoff simulation models. By changing the data structure, the new SVM model was employed with twelve bunches of streamflow data after naming it SVM (M). The new model prediction results are shown in Figure 13.

The stability of the model was measured by measuring the degree of variation in five indicators. The degrees of model variability derived by the IHA-RVA method are shown in Table 5. From the table, we can find that five indicators have changed simultaneously. Among them, two indicators show that the model constructed with the new data structure has a better prediction, namely MAPE and QR, where MAPE is reduced by 11.52% from the original 102.18% and QR is changed from 0.54 to 0.56. The other three indicators show that the original data structure can perform a better runoff prediction, and the initial SVM model has a better NSE value than the new one. The NSE value of the initial SVM model is 0.08 higher than that of the new data, while both RMSE and MAE are lower than that of the original model. Overall, the original model structure facilitates a better runoff prediction. In addition, the variability D_i of all five indicators is less than 33%, and the overall variability D₀ of the model is only 10.52%, indicating that the model is of low variability (a highly stable state).

4. Discussion

4.1. Similarities and Differences

The SVM model has a better performance than the ENN model, which is consistent with the results of the previous study. According to the model evaluation results, the performance of the SVM simulation is higher than that of Elman. In a study for the time series prediction, it appeared that prediction results were equally well for the SVM model, whereas the Elman neural network could not be used to predict these series as well [35]. In a study in other areas, such as cyber security posture, the SVM model also showed more advanced predictions than the ENN model. In terms of error, the RMSE and MAE values of SVM were lower than ENN [36]. In these studies mentioned above, SVM showed better predictions than ENN, whether it was a time series prediction or a network safety period prediction.

The runoff results predicted by the MMM model did not show an optimal prediction, which is inconsistent with the findings of most scholars, who generally agree that the MMM model has a significant advantage in balancing the internal variability of the data. As stated by Mishra et al. [37], MMM is a useful way of extracting first-order climate change projection information from the CMIP5 model. However, according to the scoring results of this study, the MMM model outperforms the ENN model in all metrics, while the overall scoring results are inferior to the SVM model, indicating that the prediction results of the MMM are not always better than those of a single model. Analysis of the results shows that the data of the MMM model are derived from both the SVM model and the MMM model, and although the MMM model can balance the data differences between the models, it does not completely reduce the runoff errors. As shown in Figure 6, the prediction error patterns of the SVM model and the ENN model are similar in the same period (both are overestimated or underestimated), thus the performance of the MMM model is in between the two models. In this study, it is reasonable for the prediction results of the MMM model to be between the best SVM model and the worst ENN model.

The monthly data structure-based model did not outperform the sequential data structure-based model, which is different from other scholars’ studies. According to Zhao et al. [32], the monthly predictive structure can effectively extract the instinctive hydrological information that is more easily learned by the predictive model than the traditional sequential predictive structure. The analysis may be due to differences in model types leading to different results. In Zhao’s study, the IGWO-GRU model was applied to the runoff prediction, while the SVM model is applied in this paper. In addition, the runoff data in this study are based on karst basins, and the subsurface geological conditions are extremely different from those in non-karst areas. Karst has a continuous influence on runoff. Therefore, under the sequential structure, this may make the SVM model more effective in capturing the influence pattern of karst. Based on these two points, it is also reasonable that the sequential data structure-based model is superior to the monthly data structure-based model in runoff prediction.

4.2. Policy Recommendations

The above studies show that the data structure can have a degree of impact on model performance. In this study, the traditional sequential structure model is superior to the monthly structure model. This result also broadens the ideas for runoff prediction. Water resource managers can consider choosing different data structures for runoff prediction and select the best model to manage water resources more scientifically and effectively.

4.3. Innovation, Limitations, and Further Research

A method for accurately quantifying and qualitatively measuring the stability of the model was developed. The stability of the model was measured for the first time by changing the data structure and introducing the quantitative calculations of IHA-RVA, which has also been used in previous studies, but has never been quantified for the stability of runoff prediction models and has only been applied to measure the degree of variability in hydrological conditions [33]. In this study, the overall variability of the model indicators is 10.52%, which is a low variability, meaning that the model is highly stable.

All three models showed negative values when predicting low values, and further model improvements are needed in this area to ensure that all data are positive.

Due to the data, the results and conclusions of this study cannot be directly transferred to other watersheds, and more watersheds are needed to obtain more generalized results. Furthermore, in future studies, replication and validation in more watersheds could be attempted.

The stability analysis of the model is the key to the preferred model. However, the factors affecting the stability of the model are complex and diverse. This study attempts to analyze the stability of the model only from the perspective of data structure, and the stability of the model performance needs to be carried out in depth. Due to the influence of climate change and human activities, the change of runoff becomes more complicated. However, the prediction of runoff by machine learning models cannot fully consider the influence of climate change and human activities. Therefore, a runoff prediction model that can fully combine the factors of climate and human activities needs to be further proposed.

5. Conclusions

In this study, the Chengbi River Karst Basin in Guangxi was used as the research object. The main findings of this study are as follows: The three models for runoff prediction in the karst areas are applicable, and there is room for improvement in prediction accuracy. Among the three models (SVM model, ENN model, and MMM model), the SVM model had the best runoff prediction, the MMM had the second-best runoff prediction, and the ENN model has the worst prediction. It is worth noting that although the data structure has an effect on the model, the degree of influence is limited, indicating that the SVM model has good reliability. By changing the data structure, it was found that the overall metrics of the SVM model changed by 10.52% and were highly stable. At the same time, different data structures can be considered for runoff prediction and simulation, to achieve better prediction results.

Based on these findings, further research direction is needed into, for example, exploring more machine learning models for runoff prediction in karst areas. Furthermore, in addition to the multi-model mean model, the performance of other coupled models can be further studied, and the relationship between the data structure and model performance is worth exploring in depth.

Author Contributions

Conceptualization, C.M., G.L., X.L. and Y.R.; data curation, G.L.; funding acquisition, C.M., Y.R. and Z.X.; methodology, G.L., X.L. and M.Z.; project administration, C.M. and S.L.; resources, C.M.; software, M.Z.; writing—original draft, G.L. and M.Z.; writing—review and editing, X.L. and Y.R. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the National Natural Science Foundation of China (51969004, 51979038), the Guangxi Natural Science Foundation of China (2017GXNSFAA198361), and the Innovation Project of Guangxi Graduate Education (YCBZ2019022).

Institutional Review Board Statement

Not applicable. This study did not involve humans or animals.

Informed Consent Statement

Not applicable. The study did not involve humans.

Data Availability Statement

Some data that support the findings of this study are available from the corresponding author upon reasonable request.

Conflicts of Interest

The authors declare no conflict of interest.

References

Sun, X.; Li, Z.; Tian, Q. Assessment of hydrological drought based on nonstationary runoff data. Hydrol. Res. 2020, 51, 894–910. [Google Scholar] [CrossRef]
Ma, R.-y.; Du, Y.; Li, K. Study on flood control risk of flood control engineering system based on the clustering of measured data. Clust. Comput.—J. Netw. Softw. Tools Appl. 2019, 22, S6541–S6549. [Google Scholar] [CrossRef]
Jiang, Z.; Li, R.; Li, A.; Ji, C. Runoff forecast uncertainty considered load adjustment model of cascade hydropower stations and its application. Energy 2018, 158, 693–708. [Google Scholar] [CrossRef]
Bai, Y.; Fernald, A.; Tidwell, V.; Gunda, T. Reduced and Earlier Snowmelt Runoff Impacts Traditional Irrigation Systems. J. Contemp. Water Res. Educ. 2019, 168, 10–28. [Google Scholar] [CrossRef]
Gauch, M.; Kratzert, F.; Klotz, D.; Nearing, G.; Lin, J.; Hochreiter, S. Rainfall-runoff prediction at multiple timescales with a single Long Short-Term Memory network. Hydrol. Earth Syst. Sci. 2021, 25, 2045–2062. [Google Scholar] [CrossRef]
Hosseini, K.; Nodoushan, E.J.; Barati, R.; Shahheydari, H. Optimal design of labyrinth spillways using meta-heuristic algorithms. Ksce J. Civ. Eng. 2016, 20, 468–477. [Google Scholar] [CrossRef]
Roksvag, T.; Steinsland, I.; Engeland, K. A two-field geostatistical model combining point and areal observations-A case study of annual runoff predictions in the Voss area. J. R. Stat. Soc. Ser. C—Appl. Stat. 2021, 70, 934–960. [Google Scholar] [CrossRef]
Li, W.; Wang, X.; Pang, S.; Guo, H. A Runoff Prediction Model Based on Nonhomogeneous Markov Chain. Water Resour. Manag. 2022, 36, 1431–1442. [Google Scholar] [CrossRef]
Barati, R. Application of excel solver for parameter estimation of the nonlinear Muskingum models. Ksce J. Civ. Eng. 2013, 17, 1139–1148. [Google Scholar] [CrossRef]
Badfar, M.; Barati, R.; Dogan, E.; Tayfur, G. Reverse Flood Routing in Rivers Using Linear and Nonlinear Muskingum Models. J. Hydrol. Eng. 2021, 26, 04021018. [Google Scholar] [CrossRef]
Alizadeh, M.J.; Shahheydari, H.; Kavianpour, M.R.; Shamloo, H.; Barati, R. Prediction of longitudinal dispersion coefficient in natural rivers using a cluster-based Bayesian network. Environ. Earth Sci. 2017, 76, 86. [Google Scholar] [CrossRef]
Hu, C.; Wu, Q.; Li, H.; Jian, S.; Li, N.; Lou, Z. Deep Learning with a Long Short-Term Memory Networks Approach for Rainfall-Runoff Simulation. Water 2018, 10, 1543. [Google Scholar] [CrossRef] [Green Version]
Liang, Z.M.; Li, Y.J.; Hu, Y.M.; Li, B.Q.; Wang, J. A data-driven SVR model for long-term runoff prediction and uncertainty analysis based on the Bayesian framework. Theor. Appl. Climatol. 2018, 133, 137–149. [Google Scholar] [CrossRef]
Wang, P.; Zhang, J.; Wang, M.; Liang, Y.; Li, J. Stochastic simulation of daily runoff in the middle reaches of the Yangtze river based on SVM-Copula model. Syst. Sci. Control. Eng. 2019, 7, 452–459. [Google Scholar] [CrossRef]
Dehghani, R.; Poudeh, H.T.; Younesi, H.; Shahinejad, B. Daily streamflow prediction using support vector machine-artificial flora (SVM-AF) hybrid model. Acta Geophys. 2020, 68, 1763–1778. [Google Scholar] [CrossRef]
Samantaray, S.; Sahoo, A. Prediction of runoff using BPNN, FFBPNN, CFBPNN algorithm in arid watershed: A case study. Int. J. Knowl.—Based Intell. Eng. Syst. 2020, 24, 243–251. [Google Scholar] [CrossRef]
Lv, Z.; Zuo, J.; Rodriguez, D. Predicting of Runoff Using an Optimized SWAT-ANN: A Case Study. J. Hydrol.—Reg. Stud. 2020, 29, 100688. [Google Scholar] [CrossRef]
Sibtain, M.; Li, X.; Nabi, G.; Azam, M.I.; Bashir, H. Development of a Three-Stage Hybrid Model by Utilizing a Two-Stage Signal Decomposition Methodology and Machine Learning Approach to Predict Monthly Runoff at Swat River Basin, Pakistan. Discret. Dyn. Nat. Soc. 2020, 2020, 7345676. [Google Scholar] [CrossRef]
Song, P.; Liu, W.; Sun, J.; Wang, C.; Kong, L.; Nong, Z.; Lei, X.; Wang, H. Annual Runoff Forecasting Based on Multi-Model Information Fusion and Residual Error Correction in the Ganjiang River Basin. Water 2020, 12, 2086. [Google Scholar] [CrossRef]
Mo, C.; Ruan, Y.; He, J.; Jin, J.; Liu, P.; Sun, G. Frequency analysis of precipitation extremes under climate change. Int. J. Climatol. 2019, 39, 1373–1387. [Google Scholar] [CrossRef]
Mo, C.; Ruan, Y.; Xiao, X.; Lan, H.; Jin, J. Impact of climate change and human activities on the baseflow in a typical karst basin, Southwest China. Ecol. Indic. 2021, 126, 107628. [Google Scholar] [CrossRef]
Cortes, C.; Vapnik, V. Support-Vector Networks. Mach. Learn. 1995, 20, 273–297. [Google Scholar] [CrossRef]
Meng, E. Streamflow Forecast Research of a Modified Hybrid Model M-EMDSVM. Master’s Thesis, Xi’an University of Technology, Xi’an, China, 2018. [Google Scholar]
Lei, C. The Runoff Forecasting of Heihe River Base on SARIMA and SVR Hybrid Model. Master’s Thesis, Lanzhou University, Lanzhou, China, 2018. [Google Scholar]
Izmailov, R.; Vapnik, V.; Vashist, A.; IEEE. Multidimensional Splines with Infinite Number of Knots as SVM Kernels. In Proceedings of the International Joint Conference on Neural Networks (IJCNN), Dallas, TX, USA, 4–9 August 2013. [Google Scholar]
Wang, M.; Wu, J.; Guo, J.; Su, L.; An, B. Multi-Point Prediction of Aircraft Motion Trajectory Based on GA-Elman-Regularization Neural Network. Integr. Ferroelectr. 2020, 210, 115–126. [Google Scholar] [CrossRef]
Shao, Y.; Lin, B.; Liu, Y. Rainfall-runoff Simulation Based on Runoff Classification Using Dynamic Artificial Neural Networks. Sci. Geogr. Sin. 2012, 32, 74–80. [Google Scholar]
Harrison, M.S.J.; Palmer, M.; Richardson, T.N.; Buizza, D.S.; Petroliagis, R. Joint ensembles from the UKMO and ECMWF models. Ecmwf Webgroup 1995, 2, 61–120. [Google Scholar]
Sandor, R.; Ehrhardt, F.; Grace, P.; Recous, S.; Smith, P.; Snow, V.; Soussana, J.-F.; Basso, B.; Bhatia, A.; Brilli, L.; et al. Ensemble modelling of carbon fluxes in grasslands and croplands. Field Crops Res. 2020, 252, 35–52. [Google Scholar] [CrossRef]
Yue, Z.; Ai, P.; Xiong, C.; Song, Y.; Hong, M.; Yu, J. Mid- and long-term runoff forecasting based on improved deep belief networks model. J. Hydroelectr. Eng. 2020, 39, 33–46. [Google Scholar]
Jiang, S.; Jiang, Z.; Li, W.; Shen, Y. Evaluation of the Extreme Temperature and Its Trend in China Simulated by CMIP5 Models. Clim. Change Res. 2017, 13, 11–24. [Google Scholar] [CrossRef]
Zhao, X.; Lv, H.; Wei, Y.; Lv, S.; Zhu, X. Streamflow Forecasting via Two Types of Predictive Structure-Based Gated Recurrent Unit Models. Water 2021, 13, 91. [Google Scholar] [CrossRef]
Richter, B.D.; Baumgartner, J.V.; Powell, J.; Braun, D.P. A method for assessing hydrologic alteration within ecosystems. Conserv. Biol. 1996, 10, 1163–1174. [Google Scholar] [CrossRef] [Green Version]
Fabio, D.N.; Abba, S.I.; Pham, B.Q.; Towfiqul Islam, A.R.M.; Talukdar, S.; Francesco, G. Groundwater level forecasting in Northern Bangladesh using nonlinear autoregressive exogenous (NARX) and extreme learning machine (ELM) neural networks. Arab. J. Geosci. 2022, 15, 647. [Google Scholar] [CrossRef]
Thiessen, U.; van Brakel, R.; de Weijer, A.P.; Melssen, W.J.; Buydens, L.M.C. Using support vector machines for time series prediction. Chemom. Intell. Lab. Syst. 2003, 69, 35–49. [Google Scholar] [CrossRef]
Ke, G. Combination Model of Network Security Situation Prediction Based on Cooperative Games. J. Syst. Simul. 2017, 29, 1153–1159. [Google Scholar]
Mishra, S.K.; Sahany, S.; Salunke, P.; Kang, I.-S.; Jain, S. Fidelity of CMIP5 multi-model mean in assessing Indian monsoon simulations. NPJ Clim. Atmos. Sci. 2018, 1, 39. [Google Scholar] [CrossRef]

Figure 1. Flowchart of the research.

Figure 2. Chengbi River karst basin.

Figure 3. Structure of the support vector machine (SVM).

Figure 4. Structure of the Elman neural network (ENN).

Figure 5. Data structure.

Figure 6. Prediction results of SVM model, ENN model, and MMM model.

Figure 7. The monthly runoff maximum.

Figure 8. AE of SVM model.

Figure 9. AE of ENN model.

Figure 10. AE of MMM model.

Figure 11. Box plot of Absolute Errors.

Figure 12. Taylor Diagram.

Figure 13. Prediction results of SVM (M) models.

Table 1. Monthly runoff characteristic values.

Model	Average Value (mm)	Minimum Value (mm)	Maximum Value (mm)
Observed model	67.25	1.51	440.57
SVM model	61.36	−4.18	253.28
ENN model	70.15	−12.69	264.18
MMM model	65.75	−2.23	234.41

Table 2. Evaluation of model performance results through five metrics.

Model	NSE (mm)	MAPE	RMSE	MAE (mm)	QR
SVM model	0.607	102.18%	52.89	34.31	0.54
Elman model	0.572	160.07%	55.21	38.54	0.42
MMM model	0.615	123.95%	52.34	35.34	0.48

Table 3. The ranking of the three models under five indicators (

r_{i}

).

Table 3. The ranking of the three models under five indicators (

r_{i}

).

Model	NSE	MAPE	RMSE	MAE	QR
SVM model	2	1	2	1	1
Elman model	3	3	3	3	3
MMM model	1	2	1	2	2

Table 4. Results of the combined assessment of the three models.

Method	SVM Model	ENN Model	MMM Model
M_R	0.53	0.00	0.47

Table 5. Model stability analysis results.

Indicator	N_m	N_ε	D_i	D₀
NSE	0.61	0.53	15.11%	10.52%
MAPE	102.18%	90.66%	12.71%
RMSE	52.89	58.00	8.81%
MAE	34.31	37.44	8.36%
QR	0.54	0.56	4.05%

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Mo, C.; Liu, G.; Lei, X.; Zhang, M.; Ruan, Y.; Lai, S.; Xing, Z. Study on the Optimization and Stability of Machine Learning Runoff Prediction Models in the Karst Area. Appl. Sci. 2022, 12, 4979. https://doi.org/10.3390/app12104979

AMA Style

Mo C, Liu G, Lei X, Zhang M, Ruan Y, Lai S, Xing Z. Study on the Optimization and Stability of Machine Learning Runoff Prediction Models in the Karst Area. Applied Sciences. 2022; 12(10):4979. https://doi.org/10.3390/app12104979

Chicago/Turabian Style

Mo, Chongxun, Guangming Liu, Xingbi Lei, Mingshan Zhang, Yuli Ruan, Shufeng Lai, and Zhenxiang Xing. 2022. "Study on the Optimization and Stability of Machine Learning Runoff Prediction Models in the Karst Area" Applied Sciences 12, no. 10: 4979. https://doi.org/10.3390/app12104979

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Study on the Optimization and Stability of Machine Learning Runoff Prediction Models in the Karst Area

Abstract

1. Introduction

2. Data and Methods

2.1. Study Area and Data

2.2. Methods

2.2.1. Model for Prediction

Support Vector Machine

Elman Neural Network

Multi-Model Mean Model

2.2.2. Model Selection Method

2.2.3. Stability Test Method

3. Results

3.1. Prediction Results

3.2. Model Selection Results

3.3. Optimal Model Stability

4. Discussion

4.1. Similarities and Differences

4.2. Policy Recommendations

4.3. Innovation, Limitations, and Further Research

5. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI