1. Introduction
Flow controller and flow regulator structures play important roles in flow distribution in irrigation systems and the success rate of irrigation systems depends on the performance of these structures [
1]. The maximum discharge condition is the main criteria for designing irrigation systems, while in practice, in the majority of cases, the maximum discharge is not observed. So, in such systems, flow controller and flow regulator structures are used for providing gravity irrigation conditions and for setting the water level at required water levels [
2]. Gates are the most common types of structures that are commonly utilized in irrigation networks [
3]. In addition, flow discharge measurement is another important application of gates in these systems, and estimation of flow discharge under gates can be classified as one of the most important issues in hydraulic engineering [
3]. In other words, the precision of flow discharge information has a direct influence on operational management and water-saving policies [
4]. According to gate applications all over the world, the hydraulic performance of gates has been studied by many researchers. Gibson [
5] and Henry [
6] carried out the earliest studies on gates. Henry [
6] presented the slice gate discharge coefficient (
Cd) in the form of a graph considering a dimensionless form of effective parameters in both submerged and free-flow conditions. The study by Henry [
6] is the basis of some other studies and some other
Cd equations. After that, Rajaratnam and Subramanya [
7], Rajaratnam and Subramanya [
8] and Swamee [
9] proposed new relationships for determining
Cd in both submerged and free-flow conditions.
Radial gates alongside slice gates are the most common types of gates. The required force for the opening of gates is large, so hydraulic engineers use radial gates instead of other types of gates [
10]. Because of the special shape of radial gates and existing cylindrical shells, entering water pressure passes through the axis of the gate, and pressure forces do not create a torque around it [
10]. Bijankhan et al. [
11] used dimensional analysis to analyze both submerged and free-flow conditions in a radial gate. In their study, the incomplete self-similarity concept and Buckingham theorem were used to propose a new dimensional equation for
Cd. Salmasi, Nouri, and Abraham [
10] used different types of sills under a radial gate and studied the influence of sills on the radial gate
Cd. Circular, semicircular, rectangular, triangular, and trapezoidal sills were used in the study. Two different locations of sills were studied and outcomes revealed that sills could have both negative and positive effects on
Cd. Bijankhan et al. [
12] presented an analytical equation for distinguishing free-flow and submerged flow conditions and used the experimental data of Buyalski [
13] for verification of the presented equation. Clemmens et al. [
14] provided a new method to calibrate both the free-flow and submerged flow of radial gates. The provided method utilizes the momentum equation and energy equation on the downstream side and upstream side of the structure, respectively. Zheng et al. [
15] established a new model using the least square method to calibrate the discharge in radial gates. In order to perform this, four discharge methods obtained from the energy equation and dimensionless analysis were used. Outcomes revealed that by using the presented parameter identification model, the accuracy of four methods increased and the value of maximum mean relative error decreased from 34.26% to 3.54%.
Recently, soft computing techniques and/or artificial intelligence (AI)-based models have been used for the estimation
Cd of hydraulic structures [
16,
17,
18]. Due to some negative issues in common methods, including a large quantity of influencing parameters and their interactions, a large number of assumptions, complexity of the solutions, high uncertainty, etc., these models can be utilized as a direct solution to the problem [
19,
20]. Salmasi and Abraham [
21] used genetic programming for identifying the discharge coefficient for inclined slide gates. They conducted a series of laboratory experiments and used experimental results to produce new equations using genetic programming. Aydin and Kayisli [
22] developed a neuro-fuzzy (ANFIS) model to predict the
Cd of labyrinth side weirs. They used 285 experimental data to test the developed model and the outcomes of the research revealed that the ANFIS could be effectively used in practice. Piano Key weirs’
Cd was investigated by Majedi-Asl et al. [
23]. For this purpose, SVM (support vector machines) and the GEP (gene expression programming) models were used. They reported that the GEP model results are in a good agreement with the experimental data [
23]. In another study, the discharge coefficient of oblique sluice gates was assessed using some different techniques and the ANN (artificial neural network) approach was introduced as the most accurate model [
24]. Roushangar et al. [
25] used the hybrid Grey Wolf Optimization-based Kernel-depended Extreme Learning Machine (GWO-KELM) approach for the prediction of the
Cd of submerged radial gates. The results of their studies showed that the GWO-KNL approach has a good ability to estimate the
Cd of the radial gate under different submergence ratios. Salazar et al. [
26] used the ANN and Finite Element methods (FEM) for estimation of the
Cd of radial-gated spillway of the Oliana dam in Spain. The results of their investigations showed that the FEM method can be more useful in analyzing radial-gated spillway. Rady [
27] applied the ANNs modeling method to investigate the discharge coefficient of the vertical and inclined sluice gates under free and submerged flow conditions. The results of his study indicated that the ANNs are powerful tools for modeling flow rates below both types of sluice gates within an accuracy of ±5%. Al-Talib and Kattab [
28] studied the
Cd using SPSS and ANN and compared them. Their results gave a good agreement compared with experimental data and they presented an equation to calculate the
Cd. Sauida [
29] simulated the relative energy loss downstream of the multi-sluice gate using ANN under submerged flow conditions and developed an empirical prediction equation using statistical Multiple Linear Regression (MLR). The results of his study showed that the ANN is more accurate than the MLR and can be utilized to determine the optimal multi-gate operation scenario for multi-vent regulators.
For the determination of radial gates’ Cd, simultaneously solving some nonlinear equations and some complex graphs is generally needed. Furthermore, a large number of equations for gates Cd are based on some assumptions that sometimes lead to outcomes which do not match reality. Hence, introducing alternative methods for predicting Cd of radial gates with good accuracy is essential. It is evident from the existing literature that the AI-based models are easy to apply and they produce accurate predictions. As alternatives to methods which solve the governing equations depending on boundary conditions, AI-based approaches can efficiently predict the Cd. Although the AI-based techniques have been widely used to estimate Cd of other hydraulic structures such as weirs; however, very few studies used these techniques in prediction of radial gates’ Cd. Here three AI-based models, namely, the Group Method of Data Handling (GMDH), Multivariate Adaptive Regression Splines (MARS) and Gaussian Process (GP) are employed to predict Cd in radial gates in both submerged and free discharge conditions. The existing experimental data in the literature are used as model inputs. The outcomes of the AI-based models are also compared with those of the linear and nonlinear regression-based models.
2. Materials and Methods
Figure 1 shows a schema of the radial gate in both free (
Figure 1a) and submerged flow conditions (
Figure 1b). As seen from the figure,
Y0 = upstream water depth,
W = gate opening height,
h = trunnion-pin height,
R = gate radius,
Y1 = flow depth at vena contracta under free-flow conditions, and
Yt = downstream water depth in submerged flow condition. As discussed above, the prediction of radial gates’
Cd is the main aim of the presented work. In order to perform this, the reported experimental results by Buyalski [
13] were utilized for the models’ training and testing.
The discharge coefficient of the radial gate (
Cd) in both free-flow and submerged flow conditions is a function of flow properties and gate geometry. Upstream water depth and tailwater depth belong to flow properties and gate radius, trunnion–pin height, gate opening height, and gate width (
B) belong to gate geometry. Therefore, for free-flow and submerged flow conditions, the dependency of radial gate
Cd on the effective parameters is as in the Equations (1) and (2):
Considering sections 0 (upstream of the radial gate) and 1 (immediately downstream of the radial gate, see the location of
Y1 in
Figure 1a) and assuming rectangular channel, the following equations can be written using energy equilibrium as:
where
Y1 and
V1 are flow depth and flow velocity in
Section 1,
Y0 and
V0 are flow depth and flow velocity in section 0, g is the gravity acceleration and
E1 and
E0 are the specific energy in two mentioned sections. By combination of continuity equation and Equation (4), it can be written as:
where
Q is discharge flowing under the radial gate. As it can be seen, Equation (5) is not a direct result of Equation (4) and the continuity equation. In Equation (5),
V0 is ignored by considering an assumption as the reservoir of the upstream side of the gate is large, it is presented for measuring the discharge of a complete orifice. This equation was firstly used by Henry [
6] for the gates. The
Cd is used to correct these assumptions. Henry [
6] presented the sluice gate
Cd in the form of a graph by considering the dimensionless form of effective parameters in both submerged and free-flow conditions. In the graph, discharge coefficient has been plotted against
Y0/
W for free-flow condition and for submerged flow condition, discharge coefficient has been plotted against
Y0/
W and
YT/
W. In the present study, Equation (5) was applied for the calculation of the discharge coefficient using the experimental data of Buyalski [
13]. 2536
Cd in different conditions were obtained. These data were utilized for training and testing of Multiple Linear Regression (MLR), Nonlinear Regression (NLR), Multivariate Adaptive Regression Splines (MARS), GMDH, and Gaussian Process (GP) regression models.
2.1. Experimental Data by Buyalski [13]
Buyalski [
13] employed an extensive experimental study for the determination of the
Cd of radial gates in both submerged and free-flow conditions. He experimentally modeled a real canal ant its radial gates (the Tehama–Colusa canal) with a scale of 1:6. Experiments were carried out in a wide rectangular flume 3.05 m. In the study by Buyalski [
13], gate-opening height (
W), upstream water depth (
Y0), tailwater depth in submerged flow condition (
YT), trunnion–pin height (
h), and types of gate sill were considered as study parameters. The gate width (
B) and the gate radius (
R) were constant and their corresponding values were 0.711 m and 0.702 m, respectively.
Cd algorithms were developed by Buyalski [
13] from a single-gate hydraulic laboratory model and they are applicable when the number of gates are varied from 1 to 5. The results of this experimental study are the basis of the majority of radial gates’
Cd equations. In order to determine
Cd t of radial gates, 2536 experimental data were used.
Table 1 shows the studied parameters by Buyalski [
13] and their range.
2.2. Methods
2.2.1. Multiple Linear Regression (MLR)
MLR is a process based on least square technique used for developing the linear relationship between independent (input) and dependent (output) variables. The commonly used equation for MLR is (Equation (6)):
where
x1,
x2, …,
xn are independent variables and
c0,
c1 …,
cn are regression coefficients.
2.2.2. Nonlinear Regression (NLR)
In comparison to MLR, that is always used for simple issues, NLR is chosen for complex issues. In the present study, for the creation of the NLR, XLSTA software was used. In the common NLR equation (Equation (7)), The output variable is
Cd and the input variables are:
x1,
x2, …,
xn.
2.2.3. Multivariate Adaptive Regression Splines (MARS)
The MARS method uses a non–parametric approach that does not include any assumption about the independent and dependent data set of the relationship. The MARS system uses sub domains for input space and for a specified sub domain, a linear regression equation is considered. A boundary value between knots is called a sub domain. The fitted linear regression is defined as the base functions (BFs). The following forms are given by the BFs (Equation (8)), where
x is a separate parameter, and
k is a boundary value. The last mathematical expression of the used MARS technique for the desired phenomenon is (Equation (9)):
where
y is a predicted output parameter by the function f(
x). S
0 is the value of a constant and
n is the quantity of BFs. The coefficient multiplied in BFs is S
n. β
n indicates the BFs. Mathematical modeling is sufficient to complete two phases using the MARS process. The BFs that were established in the previous phase and have no major impact on increasing the accuracy of the model are pruned on the basis of a criterion called Generalized Cross-Validation (GCV). In other words, the structure of the derived MARS model was adopted by GCV. The GCV is defined by Equation (10), where
N is the data quantity, and
C(
H) is the penalty for complexity that is increased by the quantity of BFs. In Equation (10), for each BF, d indicates a penalty number and
H is the quantity of BFs acquired by the MARS process [
30,
31].
2.2.4. Group Method of Data Handling (GMDH)
GMDH is a self–adapting approach that gradually maps any complicated system including a dynamic relationship between inputs and output. Ivakhnenko [
32] first suggested this approach. Each pair of inputs is added to a neuron in the GMDH technique. As seen below, the governing equation is a quadratic polynomial (Equation (11)). Where
and
(
i = 1 … 5) are coefficients and
xi and
xj are input pairs. From the Volterra sequence, the definition of GMDH provided stated that all the system can be calculated utilizing an infinite polynomial by Equation (12). The discrete form of Volterra series is presented as Kolmogorov–Gabor polynomial [
33,
34].
2.2.5. Gaussian Process Regression (GP)
According to Neal [
35], Gaussian Processes (GP) may be termed as a natural generalization of the Gaussian distribution and here, vector is the mean and matrix serves as the covariance [
36]. GP regression is an expedient approach of nonparametric regression owing to its theoretical simplicity as well as its worthy generalization capability and it provides an output which is probabilistic [
37]. A major assumption in GP is y which can be identified by
y ~ f(x) + ξ, where
ξ ~
N(0,σ
2). Here, the symbol ~ shows sampling. In regression of GP, for each input
x there is an associated random variable f(
x) and
x is the value of the stochastic f function at that location. In this study, the assumption is that the measurement error
ξ is normal independent and it is identically distributed, with a zero mean value (μ(x) = 0), a σ
2 of variance and f(
x) provided by the GP on
χ identified by
k. That is,
where
, and
I = identity matrix. For a provided test data
vector, the predictive distribution of the output
is Gaussian, where
Assume that the training data are
n and test data are
, then
matrix of covariances evaluated by the all pairs of training and test data sets are represented by
, and similarly, this holds true for the
,
and
; where
X and
Y show the vector of the training data and corresponding labels
. To produce a semi–definite covariance matrix
K which is positive, where
, a quantified covariance function is needed. The term kernel function which is utilized in SVM as well as the covariance function that has been utilized in regression of GP are equivalent. If we know the noise degree
, and the kernel, it would be ample having Equations (14) and (15) for the inference sake. It is a pre-requisite for the user throughout the training procedure of GP regression, to pick out an opposite covariance function, the parameters of it as well as the degree of noise. In the GP regression’s case with Gaussian noise involving a fixed value, a GP model can be accomplished by the employment of Bayesian inference, i.e., by maximizing the marginal likelihood. This provides the minimization of the negative log–posterior:
To discover the hyperparameters, the partial derivative of Equation (16) is obtained with respect to
and k, and it can be minimized by gradient descent. Kuss [
37] has reported a detailed account of the GP regression as well as different covariance functions.
2.2.6. Details of Kernel Function
The design of SVM and GP-based regression methods involves the kernel function (KF) idea. Different types of kernels have been discussed in the related literature [
30,
38,
39]. Four most frequently used KFs: a polynomial kernel function (
), radial basis kernel (
) and the Pearson VII kernel function
, where
,
, and
are specific kernel parameters.
2.3. Parameters for Performance Appraisal
For accuracy assessment of the implemented models, six different types of statistical parameters were considered. The Correlation Coefficient (CC), Root Mean Square Error (RMSE), Willmott’s Index (WI), Legates and McCabe’s Index (LMI), Mean Absolute Error (MAE), efficiency of the Nash Sutcliffe model (NS), Normalized Root Mean Square Error (NRMSE), and Root Mean Square Relative Error (RMSRE) were used for assessing the training and testing phases of prediction models [
40,
41,
42,
43,
44,
45,
46,
47,
48,
49]. It is possible to quantify the six performance assessment parameters used in this research using Equations (17)–(23).
Correlation Coefficient (CC): The correlation coefficient is a statistical measure of the degree of the association between two variables or groups. The range of CC is between –1.0 and 1.0. The equation of the CC is listed as follows [
20]:
Root Mean Square Error (RMSE): RMSE is a standard method of calculating a model’s error in predicting quantitative data. The range of the RMSE is 0 to ∞. Its equation is as follows [
20]:
Willmott’s Index (WI): Willmott [
46] suggested a standardized measure of model prediction error called the index of agreement (WI), which ranges from 0 to 1. The ratio of the mean square error to the potential error is represented by the index of agreement. A value of 1 indicates a perfect match, whereas a value of 0 shows no agreement at all. The index of agreement may identify additive and proportional differences between observed and predicted means and variances; however, due to squared differences, WI is extremely sensitive to extreme values.
Legates and McCabe’s Index (LMI): Legates and McCabe Jr [
40] proposed a new goodness-of-fit measure to evaluate the performance of developed model. The range of the LMI is from 0 to 1. LMI is very sensitive to outlier values.
Mean Absolute Error (MAE): MAE of prediction is defined as the absolute difference between the predicted and experimental (observed) values of relative coefficient of discharge, per residue [
20]. Where summation is carried out for all residues and
N is the total number of observations.
Nash–Sutcliffe efficiency index (NS): It is a widely used and potentially reliable statistic for assessing the goodness of fit of regression and soft computing models [
38]. The range of the NS is—∞ to 1.
Normalized Root Mean Square Error (NRMSE): It relates the RMSE to the observed range of the variable. Thus, the NRMSE can be interpreted as a fraction of the overall range that is typically resolved by the model [
20].
where
and
are, respectively, the observed and the predicted values,
is the average of observed values and
N is the quantity of observations
4. Conclusions
Radial gates are widely used for agricultural water management, flood controlling, etc. The development of usable models and equations for the prediction of their discharge coefficient is essential for these structures. In the current study, results of comprehensive research by Buyalski [
13] on the radial gates discharge coefficient were used to train and test classical regression and soft computing-based approaches. The whole effective parameters on the radial gates discharge coefficient such as gate opening height (
W), upstream water depth (
Y0), tailwater depth (
Y0), and trunnion–pin height (
h) have been studied by Buyalski [
13]. The non-dimension form of the effective parameters was used as inputs to the prediction models.
In the present study, the GP-, MARS-, GMDH-, NLR-, and LR-based models were used in estimating the discharge coefficient of radial gates for both free-flow and submerged flow conditions. In total, 400 and 2136 observations were used for model development and testing for the free-flow submerged flow conditions, respectively. The outcomes suggest that: 1. In the free-flow condition, the GP_RBF based model with the CC value of 0.9413, RMSE value of 0.0190, WI value of 0.9707, NS value of 0.8806, LMI value of 0.6658, NRMSE value of 0.0305, MAE value of 0.0153, and RMSRE value of 0.0291 is the most precise model among the considered models in the test phase, 2. In the submerged flow condition, the GP_PUK model outperforms the other applied models in the testing stage with the CC value of 0.9961, RMSE value of 0.0132, WI value of 0.9981, NS value of 0.9922, LMI value of 0.9215, NRMSE value of 0.0428, MAE value of 0.0097, and RMSRE value of 0.0920. For both free and submerged flow conditions, GP–PUK and GP–RBF perform superior to the other applied models in discharge coefficient prediction. MARS follows the GP models accuracy and it outperforms the GMDH models in both training and testing phases. Comparison of LR and NLR reveals that the NLR works better than the LR in the prediction of the discharge coefficient in free-flow condition whereas in submerged condition, the LR model is superior to the NLR-based model. Overall, the GP based models are performing superior to the other applied models in discharge coefficient prediction for free-flow and submerged conditions. The GP provides a universal and practical approach in learning using kernel machines. This method has advantages to other empirical methods due to its solid statistical learning foundation with Gaussian Processes. Thus, interpretability of model predictions and flexibility in model selection and model setting are possible in this method.
The existence methods reported by previous studies for the prediction of discharge coefficient in radial gates are based on some assumptions. Additionally, some of previous studies have developed complex algorithms and complex graphs to extract Cd for radial gates. The current study presented new simple and accurate models that can be used in radial gates design. To strengthen this study, more experimental data could be used in testing the GP method. In future studies, the GP models could be compared with other soft computing–based regression methods such as random forest, model tree, and genetic programming.
The presented study indicated the usefulness of the GP regression method for predicting the discharge coefficient which is essential in water resource management (WRM) such as flood controlling. Recently, use of MODIS data in WRM issues has been common and satellite data have enough quality with the advanced technology. For future work, MODIS data could be merged with the developed soft computing–based models in order to improve their accuracy in predicting the discharge coefficient which is essential for controlling water by providing beneficial information to decision makers and managers.