Next Article in Journal
Comments on Uncertainty in Groundwater Governance in the Volcanic Canary Islands, Spain
Previous Article in Journal
Welfare Effects of Water Variability in Agriculture. Insights from a Multimarket Model
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Sensitivity and Interaction Analysis Based on Sobol’ Method and Its Application in a Distributed Flood Forecasting Model

1
State Key Laboratory of Water Resources and Hydropower Engineering Science, Wuhan University, No.8 Donghu South Road, Wuhan 430072, China
2
Hubei Collaborative Innovation Center for Water Resources Security, Wuhan University, Wuhan 430072, China
*
Authors to whom correspondence should be addressed.
Water 2015, 7(6), 2924-2951; https://doi.org/10.3390/w7062924
Submission received: 28 February 2015 / Revised: 8 June 2015 / Accepted: 8 June 2015 / Published: 17 June 2015

Abstract

:
Sensitivity analysis is a fundamental approach to identify the most significant and sensitive parameters, helping us to understand complex hydrological models, particularly for time-consuming distributed flood forecasting models based on complicated theory with numerous parameters. Based on Sobol’ method, this study compared the sensitivity and interactions of distributed flood forecasting model parameters with and without accounting for correlation. Four objective functions: (1) Nash–Sutcliffe efficiency (ENS); (2) water balance coefficient (WB); (3) peak discharge efficiency (EP); and (4) time to peak efficiency (ETP) were implemented to the Liuxihe model with hourly rainfall-runoff data collected in the Nanhua Creek catchment, Pearl River, China. Contrastive results for the sensitivity and interaction analysis were also illustrated among small, medium, and large flood magnitudes. Results demonstrated that the choice of objective functions had no effect on the sensitivity classification, while it had great influence on the sensitivity ranking for both uncorrelated and correlated cases. The Liuxihe model behaved and responded uniquely to various flood conditions. The results also indicated that the pairwise parameters interactions revealed a non-ignorable contribution to the model output variance. Parameters with high first or total order sensitivity indices presented a corresponding high second order sensitivity indices and correlation coefficients with other parameters. Without considering parameter correlations, the variance contributions of highly sensitive parameters might be underestimated and those of normally sensitive parameters might be overestimated. This research laid a basic foundation to improve the understanding of complex model behavior.

1. Introduction

A distributed hydrological model is an essential tool and has been widely used in hydrology and water resources management due to its good ability in simulating temporal and spatial variation of hydrological variables [1,2]. In practice, the performance of a hydrological model is highly correlated to the setting of model parameters. Because of the existing phenomenon called “equifinality” [3,4], i.e., different parameter sets resulting in the same model simulations, there is large uncertainty in the application of hydrological models. Various selection of parameters will induce a large variety of simulation results; on the other hand, considering that the distributed hydrological models always contain complex structures with large number of parameters, the optimization choice of parameters is considered to be a difficult and time-consuming task.
However, the sensitivity analysis (SA), which involves testing the relative impact of model parameters on the runoff prediction, has been proved to be an efficient way to identify the parameters in hydrological models and has been applied in many studies [5,6]. Some studies have stated that SA can improve parameter calibration efficiency, avoid over-parameterization [7], reduce model uncertainty [8,9,10], aid in understanding model structure [11,12], and generate more accurate model outputs [6,13].
In general, SA methods can be broadly grouped into two types: local methods and global methods [6,7,14,15]. The local methods only make one parameter change at one time, while the other parameters are kept as constant [7,16]. The advantage of these methods is that only a small computation is needed during their application, while the drawback is that they cannot determine the interaction between parameters, thus may result in a large bias of SA. On the contrary, the global methods overcome the drawbacks of local methods by changing all of the model parameters at the same time [6,7,14,15]. This approach can consider the linear or nonlinear interaction between model parameters, and it is suggested to be more appropriate to complex and physically-based distributed hydrological models than local methods. Many global methods have been proposed and used, such as Regional Sensitivity Analysis (RSA) [17], Parameter Estimation Software (PEST) [18], Analysis of Variance (ANOVA) [14,19], Fourier amplitude sensitivity test (FAST) [15,20], Sobol’ method [6,21], etc. Among these methods, Sobol’ is proved to be one of the most robust and effective tools to describe individual and cooperative sensitivities [6,14,15,20,21,22,23], and it is widely applied not only in hydrological models [14,21,23,24,25], but also in environmental models [26], chemical models [27], biological models [28], ecological models [29], etc.
Tang et al. [14] and Gan et al. [15] compared Sobol’ method with other methods to analyze the sensitivities of the SAC-SMA model. Tang et al. [14] conducted a contradistinctive study between Sobol’ method and three other methods for two watersheds with two evaluation metrics including non-transformed root mean square error (RMSE) and transformed RMSE (TRMSE). Gan et al. [15] also compared Sobol’ method with eight other methods for first, second, and total order sensitivities based on the mean absolute error (MAE) of the simulated and observed flow measurement, in the South Branch Potomac River basin in the U.S. Reusser et al. [20] examined the dominant parameter and parameter interaction of TOPMODEL and a complex WaSiM-ETH model by the comparison of the Sobol’ method with FAST and extend FAST. Zhan et al. [6] proposed a highly efficient integrated sensitivity analysis tool on the basis of Sobol’ method for a DTVGM model in the Huaihe River Basin, China. The findings revealed the parameters’ sensitivities varied with regard to three different evaluation criteria, namely, water balance coefficient, Nash–Sutcliffe coefficient, and correlation coefficient. More recently, Zhang et al. [23] improved the understanding of the SWAT model through Sobol’ sensitivity analysis method at the Yichun River Basin in China, by exploring model performance based on four metrics—RMSE, Nash–Sutcliffe, runoff coefficient error, and slope of the flow duration curve error under dry, moderate, and wet climate conditions.
Previous studies have validated the robustness and ease of implementation of Sobol’ method to generate global sensitivity analysis (including first, second, and total order) in numerous hydrological models. Little attention has been paid to the independent assumption of parameters in Sobol’ method. Most researchers [6,14,15,23] did not consider the correlation underlying the parameters, even though great second order sensitivity indices were displayed. For this case, a generalized Sobol’ method [30] that considers correlation is more suitable for correlated parameters. Besides, as Wagener et al. [24] pointed out the significance of multiple objective functions during the model performance assessment, researchers realized the importance of reducing the uncertainty of sensitivity analysis by multi-metrics, various characterized catchments, and climate conditions. However, most distributed models were carried out with daily or monthly data [6,15,23], seldom with hourly data. In addition, even though multiple objective functions were strengthened, little attention was paid to the peak discharge timing and magnitude.
In this study, the Morris screening method and the extended [31] and generalized [30] Sobol’ method for uncorrelated and correlated parameters are implemented to the distributed flood forecasting model named the Liuxihe model by four objective indicators that can explicitly characterize the flood processes: (1) Nash–Sutcliffe efficiency (ENS); (2) water balance coefficient (WB); (3) peak discharge efficiency (EP); and (4) time to peak efficiency (ETP). Different types of flood magnitude (small, medium, and large) are considered to compare the sensitivity and interaction of parameters in the Liuxihe model. The Nanhua Creek catchment, which is located in the Pearl River basin, China, is selected as the study area, and the hourly rainfall-runoff data is used.

2. Methodology

The sensitivity and interaction analysis based on Sobol’ method can be carried out by the following steps: (1) screening ineffective parameters out by Morris method [5]; (2) identifying the parameters correlations through Generalized Likelihood Uncertainty Estimation (GLUE) [3,4]; (3) sampling parameter sets with and without considering the identified parameters correlations; (4) evaluation of the sensitivity indices of model correlated and uncorrelated parameters.

2.1. Morris Screening

Morris [5] proposed a “one-at-a-time” (OAT) design to screen the most influential parameters easily and effectively with a small amount of computation, which is now named the Morris screening method or Elementary Effect (EE) method. Although defined as a local SA method, the Morris screening method is broadly taken as a preliminary step to separate out the negligible parameters and decrease the huge computation demand for dozens of model parameters [6,32,33,34]. The detail of the Morris screening method is given below.
Let a model with n parameters Θ = ( θ 1 , θ 2 , , θ n ) varying across p selected levels in the n-dimensional hypercube space, and corresponding model output f ( Θ ) . For a specified parameter set Θ , the elementary effect of the ith parameter can be estimated by merely changing the ith parameter with a step Δ i while the rest of the parameters are kept unchanged. This process can be commonly defined as:
d i ( Θ ) = f ( θ 1 , , θ i 1 , θ i + Δ i , θ i + 1 , , θ n ) f ( θ 1 , , θ i 1 , θ i , θ i + 1 , , θ n ) Δ i
where Δ i is a value in the set { 1 / ( p 1 ) , 2 / ( p 1 ) , , 1 1 / ( p 1 ) }; p is the number of levels; and ( θ 1 , , θ i 1 , θ i , θ i + 1 , , θ n ) and ( θ 1 , , θ i 1 , θ i + Δ i , θ i + 1 , , θ n ) are a random and the designed sample set, respectively, both of them within the predefined n-dimensional hypercube space.
Two sensitivity measures are proposed to investigate the parameters [5]: μ and σ , which evaluate the overall effect of each parameter on the model output, and the parameters’ higher order effects caused by nonlinear or interactive relationship between parameters, respectively [35]. As suggested by Morris, R randomly sampled different trajectories are preferred to assess the elementary effects. Statistically speaking, μ and σ are the mean and the standard deviation of the R × (n + 1) elementary effects. When the model is non-monotonic, Campolongo et al. [35] improved the estimation of μ by μ * with the absolute value of d i ( Θ ) as:
μ i = 1 R k = 1 R d i ( Θ k )
μ i * = 1 R k = 1 R | d i ( Θ k ) |
σ i = 1 R 1 k = 1 R [ d i ( Θ k ) μ i ] 2
A large μ i * underlines the fact that the ith parameter is a relatively influential parameter and a large σ i underlines a strong nonlinear or interactive relationship between the ith parameter and the others [6].

2.2. Sobol’ Method for Uncorrelated Parameters

Sobol’ method is one of the widely used global sensitivity analysis methods based on variance decomposition [6,14,21,22]. Assume that the model parameters are independent or uncorrelated, the total variance of model output is composed of the variance from individual parameters, and the variances from cooperative parameters. The proportion of variance resulting from individual and cooperative parameters to the total variance is applied to estimate the first order and interactive sensitivity indices of parameters. The variance decomposition can be expressed as:
V ( y ) = i = 1 n V i + i j n n V i j + + i n n V 1 n
where V(y) denotes the total variance of model output; Vi denotes the first order contribution of the ith model parameter; Vij denotes the pairwise interactive effect of the ith and jth parameters; and n denotes the number of model parameters. Obviously, the first, second, and total order of sensitivity indices can be respectively expressed as:
Si = Vi/V(y)
Sij = Vij/V(y)
STi = 1 − V~i/V(y)
where V~i denotes the contribution of all except the ith parameter. The variances in Equation (5) can be estimated by approximate Monte Carlo numerical integrations [14]. The computation requirement of the original Sobol’ method to evaluate the first and total order sensitivity indices is m × (2n + 1), where m denotes size of sample sets, and the second order sensitivity indice cannot be calculated in such a case with tremendous model runs. Saltelli [31] proposed an extended method that generated more robust and satisfactory sensitivity outputs by double calculations of the first, second, and total order sensitivity indices with m × (2n + 2) model runs. The estimation for Vi, Vij, V~I, and V(y) is described as follows:
E ^ 1 2 ( y ) = 1 m r = 1 m [ f ( Θ r 1 ) f ( Θ r 2 )]
V ^ 1 ( y ) = 1 m r = 1 m [ f ( Θ r 1 ) ] 2 [ 1 m r = 1 m f ( Θ r 1 ) ] 2
V ^ i = 1 m r = 1 m [ f ( Θ r 1 ) f ( Θ ~ i 2 )] E ^ 1 2 ( y )    o r    1 m r = 1 m [ f ( Θ ~ i 1 ) f ( Θ r 2 )] E ^ 1 2 ( y )
E ^ 2 2 ( y ) = 1 m r = 1 m [ f ( Θ ~ i 1 ) f ( Θ ~ i 2 )]
V ^ 2 ( y ) = 1 m r = 1 m [ f ( Θ ~ i 1 ) ] 2 [ 1 m r = 1 m f ( Θ ~ i 1 ) ] 2
V ^ i j c = 1 m r = 1 m [ f ( Θ ~ i 1 ) f ( Θ ~ j 2 )]    E ^ 2 2 ( y )    o r    1 m r = 1 m [ f ( Θ ~ j 1 ) f ( Θ ~ i 2 )] E ^ 2 2 ( y )
V ^ i j = V ^ i j c V ^ i V ^ j
E ^ T 2 ( y ) = [ 1 m r = 1 m f ( Θ r 2 ) ] 2
V ^ T ( y ) = 1 m r = 1 m [ f ( Θ r 2 ) ] 2 E ^ T 2 ( y )
V ^ ~ i = 1 m r = 1 m [ f ( Θ r 1 ) f ( Θ ~ i 1 )] E ^ T 2 ( y )    o r    1 m r = 1 m [ f ( Θ ~ i 2 ) f ( Θ r 2 )] E ^ T 2 ( y )
where m denotes the sample size; E ^ 1 ( y ) , E ^ 2 ( y ) and E ^ T ( y ) are different estimated means of model outputs for first, second, and total order, respectively; V ^ 1 ( y ) , V ^ 2 ( y ) , and V ^ T ( y ) are estimated variances of model outputs for first, second, and total order, respectively; V ^ i , V ^ i j , and V ^ ~ i are estimated variances of model outputs results from the ith, ith and jth, and all except the ith parameters, respectively. The symbols Θ r 1 and Θ r 2 present sampled parameters selected from sample 1 and 2, respectively. The symbol Θ ~ i 1 presents all the sampled parameters selected from sample 1 except the ith one from sample 2. The combinations of sampled parameters input from sample 1 and sample 2 produce double times calculation of model outputs (see Equations (11), (14), and (18)). Equations (9)–(11) can be used to estimate the first order sensitivity indices according to Equation (6). Similarly, Equations (12)–(14) and Equations (16)–(18) can be used to estimate the second and total order sensitivity indices according to Equations (6) and (7), respectively.
Before the sensitivity analysis is implemented, the sampling scheme must be carefully selected since it has an important effect on the convergence efficiency of Monte Carlo approximation. As suggested by Zhan et al. [6], Tang et al. [14], and Saltelli [31], the Latin Hypercube sampling (LHS) [7,21,36,37], based on Monte Carlo, which generates an efficient estimation of model output through a stratified sampling method, is adopted in this study.

2.3. Sobol’ Method for Correlated Parameters

The original Sobol’ method can obtain good efficiency in the model of independent or uncorrelated parameters, which might be unsuitable for complex hydrological models with high nonlinearity and interaction. Kucherenko et al. [30] proposed a new method based on Sobol’ method to estimate the global sensitivity indices with dependent parameters, and made it appropriate for complex hydrological models.

2.3.1. GLUE Method for Correlation Identification

Before the implementation of the Sobol’ method for correlated parameters, correlation in parameters must be investigated in advance. In this paper, the Generalized Likelihood Uncertainty Estimation (GLUE) method [3,4] is employed to detect the correlated parameters [38]. The detailed procedures are listed as follows:
(1) randomly sample a huge number of parameter sets from the n-dimensional hypercube space, the same as Sobol’ method. Because no supposition about the a priori distribution of these parameters is essential, a uniform distribution is a common choice;
(2) calculate corresponding model outputs and likelihood measures with these sample sets;
(3) define a likelihood measure threshold to separate behavioral and non-behavioral parameters sets, termed as Θ b and Θ n b , respectively;
(4) estimate the mean μ ^ i , standard derivation σ ^ i , and pairwise correlation coefficient ρ i j of each parameter vector from Θ b . The pairwise correlation coefficients then constitute the correlation matrix M c .
The first three steps are abstracted from the GLUE method, which is applied to generate the behavioral parameters. The subsequent uncertainty analysis with these behavioral parameter sets is beyond the scope of this paper. Details about the GLUE method can be seen in Beven and Binley [3,4]. The fourth step shows the in-depth analysis of these behavioral parameters sets and relevant works referred to are Hornberger and Spear [17] and Ratto et al. [38].

2.3.2. Generation of Correlated Sample Sets

Based on the identified parameters correlations by the GLUE method, correlated parameter sets can be generated through a general procedure introduced by Kucherenko et al. [30]. The procedures are implemented as follows:
(1) randomly generate a parameters set from standard normal distribution, termed Φ by LHS. If the sample size is m, and parameters number is n, then the dimension of Φ is m   ×   n . Since Φ is produced aimlessly, the parameters vector in Φ is definitely independent and uncorrelated;
(2) decompose the correlation matrix M c by an appropriate decomposition method. Cholesky is preferred when the correlation matrix is positive definite, whereas this is not easy to promise. Therefore, orthogonal decomposition method, which is more universal, is employed:
M c = Q R
where Q is the orthogonal matrix and R is the upper triangular matrix. Both dimensions are n   ×   n ;
(3) transform the uncorrelated standard normal sample sets Φ to correlated standard normal sample sets Ψ :
Ψ = Φ R
where the dimension of Ψ is still m   ×   n ;
(4) derive the correlated parameters sets Θ c = ( θ c 1 , θ c 2 , , θ c i , , θ c n ) from correlated standard normal sample sets Ψ = ( ψ 1 , ψ 2 , , ψ i , , ψ n ) :
θ c i = ψ i σ ^ i + μ ^ i
where θ c i and ψ i are the ith parameter vector from Θ c and Ψ , respectively; and μ ^ i and σ ^ i are the mean and standard derivation of the ith parameter vector, respectively.

2.3.3. Generalization of Sobol’ Method

Concluded from Kucherenko’s method, the Sobol’ method to estimate the first and total order sensitivity indices for pairwise correlated parameters is slightly different from that for uncorrelated parameters, as Zhao et al. summarized in [39].
For uncorrelated parameters, the first order sensitivity indice can be obtained from Equations (6) and (9)–(11). Equation (11) estimates the variances of model outputs from the ith parameter with the explicit meaning that the ith parameter in sample 1 is replaced by the corresponding ith parameter in sample 2 or vice versa. Similarly, the first order sensitivity indice for correlated parameters can be derived from replacing several vectors which must contain the ith parameter in sample 1 from that in sample 2. If the pairwise correlation is considered, then the dimension of replaced vectors is equal to 2, with the following equation:
V ^ i = 1 m r = 1 m [ f ( Θ r 1 ) f ( Θ ~ i , k 2 )] E ^ 1 2 ( y )    o r    1 m r = 1 m [ f ( Θ ~ i , k 1 ) f ( Θ r 2 )] E ^ 1 2 ( y ) ,
where Θ ~ i , k 1 presents all the sampled parameters are selected from sample 1 except the ith and kth ones from sample 2. The others keep the same meaning as introduced above. Attention has to be paid to the fact that for correlated parameters, samples 1 and 2 are generated from procedures introduced in Section 2.3.2.
Analogously, Equation (18) for uncorrelated parameters expressed the connotation that the estimated variance from all except the ith parameter is the product of model output from sample 1 or 2 and the ith parameters in the samples replaced by those from sample 2 or 1. Therefore, the equation to estimate the variance for all except the ith parameter for correlated parameters can be presented as:
V ^ ~ i = 1 m r = 1 m [ f ( Θ r 1 ) f ( Θ ~ i , k 1 )] E ^ T 2 ( y )    o r    1 m r = 1 m [ f ( Θ ~ i , k 2 ) f ( Θ r 2 )] E ^ T 2 ( y ) .

2.4. Liuxihe Model

In this study, the Liuxihe model, which is a physically based distributed watershed model proposed by Chen et al. [40] that has been applied successfully to numerous catchments in China [41,42,43,44], is selected as the used distributed hydrological model. This model comprises five sub-models: basin digitization, evapotranspiration, runoff generation, runoff routing, and parameter derivation. In this model, the distributed model parameters are acquired directly from the DEM and channel cross-sectional properties from available free public data (also see the details in Chen et al. [40]).

2.4.1. Basin Digitization

The Liuxihe model divides the catchment horizontally into three topographic cell types, namely hillslope, channel, and reservoir. The catchment is also divided vertically into three layers, namely canopy, soil, and sub-surface. Runoff generation and flow routing is carried out independently for each grid cell.

2.4.2. Evapotranspiration

In the Liuxihe model, evapotranspiration is determined by the triplex pattern:
{ E s = v E ( θ > F C ) E s = ( 1 v ) E p θ W p F C W p ( W p < θ F C ) , E s = 0 ( θ W p )
where E is the actual evaporation (mm); v is the evaporation coefficient relevant to vegetation type; FC is the field water capacity (%); Wp is the soil water content at wilting point (%); θ is the temporary soil water content (%); and Ep is the potential evaporation. Cells that are entirely open water (reservoirs and channels) have an evapotranspiration coefficient of 1, while other cells have a value less than 1.

2.4.3. Runoff Generation

The model calculates runoff generation using the saturation-excess mechanism. Rainfall excess, or net rainfall, is derived from the difference between actual rainfall and evapotranspiration. Positive rainfall excess results in infiltration to the soil. Rainfall excess becomes surface runoff only after the soil becomes saturated.
Interflow in the soil layer is calculated according to water mass conservation:
{ Q l a t x + L h θ t = r Q p e r Q l a t = V l a t L z
where h is the soil thickness (mm); L is the cell width (m); Qlat is the interflow per unit width (m2/s); r is the rainfall excess vertically and adjacent interflow input per unit width (m3/s); vlat is the interflow velocity (m/s); and Qper is the percolation per unit width (m2/s).
According to Darcy’s law, the interflow velocity vlat is:
{ v l a t = K tan α = K S 0 θ > F C v l a t = 0 θ F C
where α is the hillslope cell slope in degree (°); S 0 is the hillslope cell slope in radians; K is hydraulic conductivity at water content (mm/h); and θ is derived according to Campbell [45]:
K K s = ( θ S C ) 2 b + 3
where SC is the saturation water content (%); Ks is the saturated hydraulic conductivity (mm/h); and b is the soil porosity characteristic coefficient.

2.4.4. Runoff Routing

The Liuxihe model assumes three types of flow routing according to the three water sources: surface flow, interflow, and groundwater. Also, there are three different flow routing methods corresponding to the three different grid cell types of hillslope, channel, and reservoir. Surface flow occurs in the hillslope cells; there are total five different routing categories.
For hillslope routing, the kinematical wave approximation (the 1-D Saint–Venant equations) is adopted with the assumption that the hydraulic cell slope denotes the hillslope:
Q x + L h t = q ,
where Q is the surface flow (m3/s); h is the rainfall excess (mm); and q is the lateral flow per unit width (m2/s). In addition, according to the Manning equation, Q can be derived by:
Q x + a b Q b 1 Q t q = 0 ,
where a = ( n L S 0 1 / 2 ) 3 / 5 and b = 3/5.
For river routing, the 1-D diffusive wave method is used. The model assumes a trapezoid cross-section shape of area A (m2), water depth h (m), river bottom width w (m), and side slope β. The continuity equation is:
{ A t + Q x = q S f = S 0 h x
According to the Manning equation and continuity equation, river routing can be calculated by:
Q x + c b Q b 1 Q t q = 0 ,
where c = ( 1 3600 n χ 2 / 3 S f 1 / 2 ) 3 / 5 and χ is the wetted perimeter of the cross-section (m).
For reservoir routing, inflow to the reservoir is calculated as the sum of all inflow cells to the reservoir based on Chen’s idea that discharge on such cells can be regarded as inflow without lapsing.
For interflow routing, the model determines the interflow by Equation (25) from adjacent soil cells since the runoff generation process already takes the interflow routing into consideration.
For groundwater routing, the linear reservoir method is calculated as:
Q g , t + Δ t = K g Q g , t + ( 1 K g ) Q p e r ,
where Qg,t and Qg,t + ∆t are groundwater flow at time t and t + ∆t, respectively (m3/s); and Kg is the groundwater recession coefficient.

2.4.5. Parameters Derivation

Based on the evapotranspiration, runoff generation, and routing of the Liuxihe model introduced above, there are 12 relevant parameters in this model (see Table 1). These 12 parameters can be categorized into five different types related to various aspects, including climate, land use, soil type, terrain, and others. A vital and challenging task before the sensitivity and correlation analysis is the determination of appropriate ranges and distribution for the parameters [46,47]. Tong and Graziani [48] and Zhan et al. [6] suggested that the ranges of parameters should be decided in advance considering their significant effect on the sensitivity results. Fortunately, the Liuxihe model, as one of the physically based distributed hydrological models, developed a reasonable method to derive parameters from terrain data. The parameters can be derived from the following two procedures, i.e., initial parameter determination and parameter adjustment. Soil related parameters (Table 1) were pre-fixed by Soil Water Characteristics Hydraulic Properties Calculator [49]. Other parameters were defined according to references. After the initial parameters values were pre-decided, the parameters were adjusted mildly in turn by multiplying a change factor f (f > 0). The best-performing parameters set was then selected judging from the model performance. It is noteworthy that the Liuxihe model is physically based, so the predetermined parameters can perform well to some degree. A slight adjustment is aimed at improving the performance. Details about Liuxihe model parameters derivation can be obtained from Chen et al. [40].
After the calibration through the above predesign and slight adjustment processes, the large sample sets for sensitivity analysis based on Sobol’ method are generated by multiplying a change factor matrix with the calibrated parameters. The change factor matrix is sampled from the normal distribution and ensures the corresponding sampled parameters are within each ranges listed in Table 1.
Table 1. Summary of the Liuxihe model parameters.
Table 1. Summary of the Liuxihe model parameters.
IDSymbolDescriptionClassificationProcessesRangesUnit
1KsSaturated hydraulic conductivitySoil type relatedRunoff generation1–100mm/h
2nhillslope roughnessLand use relatedRunoff routing0.1–0.6-
3MnChannel roughnessOther relatedRunoff routing0.02–0.1-
4hSoil thicknessSoil type relatedRunoff generation100–1000mm
5bSoil porosity characteristicSoil type relatedRunoff generation1–10-
6EpPotential evapotranspirationclimate relatedevapotranspiration0–1-
7KgGroundwater recession coefficientOther relatedRunoff routing0.8–0.99d−1
8SCSaturation moisture contentSoil type relatedRunoff generation/evapotranspiration30–80%
9FCField water capacitySoil type relatedRunoff generation/evapotranspiration10–50%
10vEvapotranspiration coefficientLand use relatedevapotranspiration0–1-
11WpWilting pointSoil type relatedRunoff generation/evapotranspiration0–30%
12SRiver slopeTerrain relatedRunoff routing0–0.2%

2.5. Objective Functions

The assessment of parameters sensitivity depended greatly on the choice of objective functions. In this study, four widely used metrics were used to test model performance: (1) Nash–Sutcliffe efficiency (ENS) [50]; (2) water balance coefficient (WB); (3) peak discharge efficiency (EP); and (4) time to peak efficiency (ETP). These four metrics were calculated according to the following equations:
E N S = 1 t = 1 n ( Q ( t ) Q ^ ( t )) 2 t = 1 n ( Q ( t ) Q ¯ ) 2
W B = t = 1 n Q ^ ( t ) t = 1 n Q ( t )
E P = Q ^ p Q p Q p
E T P = T ^ p T p ,
where Q(t) was observed or measured flow; Q ^ ( t ) was simulated or forecast flow; Q ¯ was the mean of observed flow; T ^ p and T p were the simulated and observed time to peak, respectively; and Q ^ p and Q p were simulated and observed peak discharge, respectively.
The ENS is widely used to assess overall hydrograph goodness-of-fit [6,7,23,24,51,52]. The WB shows the gap between the total simulated flood volume and total measured volume. This objective function is merely influenced by runoff generation process and is extensively applied to the water resources management [6,24,53]. The EP and ETP are the other two essential elements in flood forecasting, and these two elements are also crucial factors in flood prevention, which is adopted in the “Forecasting norm for hydrology intelligence (GB/T 22482-2008) of China”.

3. Study Area and Data

3.1. Study Area

The Nanhua Creek catchment (Figure 1) is a minor tributary of the upper Pearl River in China. It is located between 112°37′ E ~ 113°05′ E and 24°53′ N ~ 25°16′ N with a drainage area of 120 km2 and a length of 82 km. The Nanhua Creek catchment belongs to the monsoon climatic zone, with cold and dry winters and hot and humid summers. The long-term mean annual precipitation in the catchment is about 1300–1500 mm, the majority of which occurs in summer. The steep slope of the catchment, combined with concentrated rainstorms, makes this area prone to flash runoff response.
Figure 1. The location, elevation, river networks, and distribution of rain gauge and hydrologic stations of the Nanhua Creek catchment.
Figure 1. The location, elevation, river networks, and distribution of rain gauge and hydrologic stations of the Nanhua Creek catchment.
Water 07 02924 g001

3.2. Data

The Liuxihe model requires elevation (DEM), land use, and soil type data. In this study, DEM was obtained from NASA Shuttle Radar Topographic Mission (SRTM, http://srtm.csi.cgiar.org/) at a resolution of 90 m × 90 m. Land use data came from the USGS (http://landcover.usgs.gov/) and soil type data were from the FAO/UNESCO (http://www.isric.org/). Eighteen storm floods with hourly measured rainfall data of three rain gauge stations and discharge data of the catchment outlet hydrological station (Figure 1) were collected from 1980 to 1987 to calibrate the model parameters. Ten storm floods from 1990 to 1995 were applied to validate the model.
In order to investigate the response of Liuxihe model to various flood condition, three different types of floods with small, medium, and large magnitudes were selected from the available data set. These typical flood events with different magnitudes were selected with flood frequency of 75%, 50%, and 25%, respectively. The characteristics of these floods were listed in Table 2.
Table 2. Characteristics of the selected various flood conditions. W0 denoted the soil moisture content before the flood event began.
Table 2. Characteristics of the selected various flood conditions. W0 denoted the soil moisture content before the flood event began.
EventMagnitudePrecipitation (mm)Time Step (h)Duration (h)Peak Discharge (m3/s)W0 (mm)Flood Frequency (%)
198106Small80.511313844075%
198505Medium90.011405385050%
198005Large99.11809256025%

4. Results and Discussion

4.1. Results of Calibration and Validation

The average of the four objective functions of the 18 storms in the calibration period were 0.82, 1.08, 0.10, and 0.7 h respectively (see Table 3). According to the “Forecasting norm for hydrology intelligence (GB/T 22482-2008) of China”, these satisfactory results met the second grade of the norm, which requires 0.7 < ENS < 0.9, 0.8 < WB < 1.2 and |EP| < 0.2. The calibrated results demonstrated that the Liuxihe model was suitable for flood forecasting in the Nanhua Creek catchment and the parameters were reasonable. Similar results were also obtained from the model validation period. The average of the four objective functions of the 10 storms in the verification period were 0.74, 1.06, 0.12, and 1.7 h, respectively. Even though the performance of validation was slightly worse than the calibration period, the former still met the second grade of the norm, which strengthened the confidence of the Liuxihe model and calibrated parameters. A comparison of observed and calculated flow for the three representative storms is illustrated in Figure 2. It can be seen from these hydrographs that the Liuxihe model generated good results in the rising limbs, flood peaks, and recession limbs.
Table 3. Goodness of fit statistics between Liuxihe predicted and observed hydrographs in the Nanhua Creak catchment.
Table 3. Goodness of fit statistics between Liuxihe predicted and observed hydrographs in the Nanhua Creak catchment.
PeriodENSWBEPETP (h)
Calibration0.821.080.100.7
Validation0.741.060.121.7
Figure 2. Comparisons between the observed and the Liuxihe simulated hydrographs for the three representative storm floods. (a) 198106; (b) 198505; (c) 198005.
Figure 2. Comparisons between the observed and the Liuxihe simulated hydrographs for the three representative storm floods. (a) 198106; (b) 198505; (c) 198005.
Water 07 02924 g002

4.2. Results of Morris Screening

The Morris screening method, or Elementary Effect (EE) method, was employed to qualitatively identify the unimportant parameters. The identified relatively important parameters were then investigated for the following global sensitivity analysis. Two parameters should be predefined before the application of Morris screening, i.e., level number p and trajectories number R. As suggested by Campolongo et al. [6,35], p is preferentially even and R is in general between 10 and 50. In this paper, level number p was determined to be 10 and R was 40. Subsequently, 520 parameters sets were generated randomly by uniform distribution within the rational ranges.
The Morris screening results can be presented in the form of ranking table [33,34], Tornado diagrams [33], and scatter plots [6]. Scatter plots are confirmed to be the most efficient and clearest with modified means μ * as the x-axis and standard deviations σ as the y-axis, while the former two failed to illustrate the characteristics of standard deviations. The scatter plots of the 12 parameters in the Liuxihe model were compared in three different flood conditions as well as four different objective functions, as showed in Figure 3. On the basis of the fact that a larger μ * value reveals higher sensitivity, it is interesting to see that even under various conditions and objective functions of interest, the sensitive rankings could all be categorized into three rough classes: highly sensitive, sensitive, and insensitive; the classified parameters also gave consistent results. Among all the parameters, SC and FC were highly sensitive; b, Mn, h, n, and Ks were sensitive; and Ep, Kg, Wp, v, and S were insensitive. Explicit ranking among the 12 parameters differed within various situations. The top sensitive parameters were found among SC, FC and Mn from the most to the least frequent. The least sensitive parameters covered were S, v, and Wp, from the most to the least frequent. Our results were in accordance with those of Xu et al. [54], who stated that SC and FC were first-rate; h and b were second-rate; Wp, n, and Ks were third-rate; and S, Ep, Mn, Kg, and v were insensitive.
Figure 3. Scatter plots of the Morris screening results based on different objective functions measures under small, medium, and large flood magnitudes.
Figure 3. Scatter plots of the Morris screening results based on different objective functions measures under small, medium, and large flood magnitudes.
Water 07 02924 g003
Along with these sensitivity rankings reflected by μ * , the nonlinear and interactive effects between parameters reported by σ were also remarkable. All the subplots in Figure 3 indicated that SC and FC had a larger σ than the others, which inferred that the nonlinear relationship between parameter values and model outputs was more noticeable or that their interaction with other parameters was more evident. Similar results can also be obtained from b when taking WB as the objective function and Mn when ETP is the objective function of interest. It is plausible to suspect that parameters with higher modified mean values ( μ * ) had a higher possibility of presenting a correspondingly larger standard derivation value ( σ ). Zhan’s work [6] supported this suspicion with another distributed hydrological model (DTVGM). Therefore, attention must be paid to these high ranking parameters with large σ since the high nonlinearity and interaction with other parameters decreased the reliability of high sensitivity classification. This was also one of the dominating disadvantages of the Morris screening method: the nonlinearity and interaction between parameters cannot be differentiated. On the other hand, the insensitive parameters with lower modified mean value μ * were fortunately able to display a correspondingly lower standard derivation value σ (see Figure 3), which achieved the original goal of screening out insensitive or non-behavioral parameters with high reliability.
Therefore, the insensitive parameters of the Liuxihe model can doubtless be determined even under diverse flood conditions and objective functions. These were S, v, Wp, Ep, and Kg. These parameters were then kept constant in the subsequent GSA with negligible influences on model output. Parameters passing the validation screening, including SC, FC, Ks, b, h, n, and Mn, were chose for the further investigation of GSA based on Sobol’ method. The original 12 parameters in the Liuxihe model were reduced to seven for GSA, reducing the computation by almost half. That is the reason for the broad application of the Morris screening method as a first step to lower computation demand for GSA [6,32,33,34].

4.3. Results of Correlation Identification

Since the Sobol’ method is a variance-based decomposition method with the assumption that all parameters are independent, the validation of independence is definitely a prerequisite. However, this procedure was often ignored. In this paper, the GLUE method was employed to characterize the correlation relationship between the Liuxihe model parameters before the implementation of the Sobol’ global sensitivity analysis method. The GLUE method must define the likelihood measure and its threshold for dividing behavioral and non-behavioral. As Beven and Binley concluded [4], the Nash–Sutcliffe efficiency (ENS) (see Equation (33)) was the most extensively used likelihood measure. For the sake of simplicity without loss of generality, this paper also set ENS as the likelihood measure. In addition, according to the “Forecasting norm for hydrology intelligence (GB/T 22482-2008) of China”, the threshold was decided to be 0.7 in order to meet the requirement for the second grade. Based on the undemanding hypothesis about the prior distributions of the model parameters, a sample of 5000 sets was generated by the uniform distribution within the ranges of parameters.
The Pearson correlation analysis and significance level between the two parameters are listed in Table 4. The higher the correlation between the parameters, the darker gray in Table 4. The poorly correlated parameters are showed with no color. Highly correlating parameters were suggested to cut parameters’ hypercube space. Positive values designated that the relevant parameters had the same effect on model performance and worked as a quotient or difference, whereas negative values designated opposite effects from two relevant parameters on model performance and worked as a product or sum [38]. The same or opposite effects from parameters can be easily obtained from the principles of the model in Section 2.4. Interestingly, the SC and FC displayed the highest positive Pearson correlation, which was extremely close to 1 at the significance level of 0.01. Other parameters, like n and h, Mn and h, h and b, Mn and SC, and Ks and FC with Pearson correlation coefficients of 0.32, −0.34, 0.30, 0.26, and −0.28, respectively, were also relatively correlated at the significance level of 0.01. These highly or relatively correlated parameters were all classified as the highly or relatively sensitive parameters in the Morris screening design in Section 4.2. Individual parameters demonstrating a larger standard deviation in the Morris screening scatter plots in Figure 3 exhibited a higher correlation relationship with other parameters (e.g., SC and FC). These two methods strengthened the reality that the correlated parameters in the Liuxihe model were attentive and the straightforward application of the Sobol’ method was inappropriate. Different from the figurative illustration by the Morris screening method, the GLUE method highlighted the correlation analysis by directly tabling the correlation coefficients, which laid a fundamental basis for the subsequent sensitivity analysis accounting for correlated parameters.
Table 4. Correlation analysis between effective parameters. Dark grey shadow denoted highly correlated parameters. Light gray shadow denoted relatively correlated parameters. White cells denoted poorly correlated parameters. Values labeled “**” meant its significance level was 0.01. Values labeled “*” had a significance level of 0.05.
Table 4. Correlation analysis between effective parameters. Dark grey shadow denoted highly correlated parameters. Light gray shadow denoted relatively correlated parameters. White cells denoted poorly correlated parameters. Values labeled “**” meant its significance level was 0.01. Values labeled “*” had a significance level of 0.05.
nMnhbSCFC
Ks−0.180.17−0.22 *0.18−0.17−0.28 **
n −0.190.32 **0.160.24 *0.17
Mn −0.34 **0.160.26 **0.19
h 0.30 **−0.19−0.16
b 0.160.24 *
SC 0.96 **

4.4. Results of First and Total Order Sensitivity Analysis

In order to compare the difference of the sensitivity analysis with and without considering parameters’ correlation, the generalized and extended Sobol’ method [30,31] were implemented to estimate the first and total order sensitivity indices of the seven parameters (i.e., SC, FC, Ks, b, h, n, and Mn) in the Liuxihe model. In this study, as suggested by Gan et al. [15], 2000 parameter sets were generated from normal distribution by the Latin Hypercube Sampling (LHS) method [7,21,36,37] for the extended Sobol’ method, ignoring correlations. As for the generalized Sobol’ method accounting for parameter dependency, for the sake of consistency, 2000 parameter sets were generated from normal distribution by LHS as well. The difference with the latter was that the parameters were correlated with corresponding coefficients identified from Section 4.3 and were generated through the procedures in Section 2.3.2. Therefore, the comparison analysis required a total of 32,000 model runs, half for uncorrelated parameters and half for the correlated.
Independent and dependent first and total order sensitivity indices based on the four different objective functions under the three different flood magnitudes are shown comparatively in Figure 4. The red and blue bars presented the first order sensitivity indices for uncorrelated and correlated parameters, labeled with UCS1 and CS1, respectively. The green and yellow bars exhibited the total order sensitivity indices for uncorrelated and correlated parameters, labeled with UCST and CST, respectively. The difference between total order and first order indices can indicate the interactive indices between the parameters of interest and other parameters. The different height of the bars revealed the various levels of sensitivity. The higher the bar, the more sensitive the parameter.
Figure 4. Comparisons between the first and total order sensitivity indices of seven parameters in the Liuxihe model with and without accounting for correlations based on different objective functions measures under three different flood magnitudes (UCS1 and UCST designated the uncorrelated first and total order sensitivity indices; CS1 and CST designated the correlated first and total order sensitivity indices).
Figure 4. Comparisons between the first and total order sensitivity indices of seven parameters in the Liuxihe model with and without accounting for correlations based on different objective functions measures under three different flood magnitudes (UCS1 and UCST designated the uncorrelated first and total order sensitivity indices; CS1 and CST designated the correlated first and total order sensitivity indices).
Water 07 02924 g004
For the uncorrelated and correlated cases, the overall contribution of SC to the four objective functions under the three flood conditions varied from 0.23 to 0.5, which was the highest and indicative of the most sensitive. Then, the partial effect of FC was the second largest with the range of 0.23–0.48. The other parameters demonstrated sensitivity indices within the range of 0.01–0.1. As Tang et al. [14], Zhan et al. [6], and Zhang et al. [21] mentioned, parameters with sensitivity indices larger than 0.1 can be regarded as highly sensitive; sensitivity indices between 0.01 and 0.1 can be classified into sensitive. Those parameters with little contribution to model outputs’ variance (e.g., sensitivity indices less than 0.01) can be considered as insensitive. Therefore, both SC and FC were indicated to be highly sensitive with the largest first and total order indices for uncorrelated or correlated cases, whereas Ks, n, Mn, h, and b were regarded as sensitive parameters. The Sobol’ results did not classify any parameters into insensitive, which revealed that the parameters fixed as constant by the Morris screen in Section 4.2 were reasonable. The Sobol’ method strengthened the Morris screening results and quantified the sensitivity indices of each parameter with detailed values. In addition, our results were in accordance with Liao et al. [44], who investigated the Liuxihe model parameters based on the E-FAST algorithm using ENS as objective function by 1560 model runs.
Despite these similar outcomes obtained by uncorrelated and correlated parameters, it is useful to distinguish the differences between the two cases as well. For the uncorrelated case, the general order of average first order sensitivity indices of the seven parameters studied, from the highest to lowest, was 0.300, 0.292, 0.047, 0.039, 0.036, 0.035, and 0.029 for FC, SC, Mn, h, n, Ks, and b, respectively. For the correlated case, the order of first order sensitivity indices, from the highest to lowest, was 0.380, 0.377, 0.054, 0.045, 0.034, 0.032, 0.029 for FC, SC, Mn, h, n, b, and Ks, respectively. A large increase (larger than 0.08) of the sensitivity indice was observed from the uncorrelated case to the correlated, referred to FC and SC. A slight increment (about 0.006) was indicated from the uncorrelated to the correlated when it came to Mn and h. Finally, n, b, and Ks present a small decrease (about 0.003) for the correlated case. Similar results can also be obtained for the total sensitivity indices comparing the uncorrelated and correlated cases. These results were in agreement with the results given by Pan et al. [55], who revealed that the sensitivity of van Genuchten n to the percolation flux of interest improved when the van Genuchten n was correlated with van Genuchten α and the sensitivity indice of permeability declined when no interaction was considered. Furthermore, the improved sensitivity of van Genuchten n increased as the correlation coefficient went higher, which meant that parameters’ correlations played a significant effect on the sensitivity analysis and should not be overlooked. In this study, a nearly linear regression between SC and FC was exhibited in Section 4.3. Correspondingly, the variation of correlated analysis compared with the uncorrelated was very large (even larger than the sensitivity indices of the five sensitive parameters) in the case of highly correlated parameters. For the poorly correlated parameters (Mn and h), the increments were indistinct and when it came to non-significantly correlated parameters (n, b, and Ks), the sensitivity indices eventually decreased. This may be caused by the sample sets, considering the correlation, diminishes the parameters’ space [38]. In other words, the correlated samples are more concentrated. Therefore, the fractional contributions of correlated parameters to model output variance are highlighted compared with that of uncorrelated parameters. In addition, highly correlated parameters cast a more profound influence on the highlighting than the poorly correlated. From this point of view, as indicated by the results from Morris screening and the GLUE method, it is very likely that the highly sensitive parameters with large modified means reflected high correlation coefficients. Hence, the sensitivity analysis of these parameters may be underestimated if the interaction is not considered. On the contrary, the sensitivity analysis of sensitive parameters with non-significant correlation may be overestimated when other highly correlated parameters were not considered. Another interesting result should also be noted: the differences between the total and first order sensitivity indices, indicative of interaction effect, shrank compared with the uncorrelated case. Since only pairwise correlations were taken into account, the shrunken yet retained gaps between the total and first order sensitivity indices can be considered to be caused by the multivariable interactions and should be investigated in the future.
The sensitivity results also varied among different flood magnitudes. In both uncorrelated and correlated cases, the sensitivity indices decreased as the magnitudes increased. For example, the total order sensitivity indice of SC decreased from 0.468 for small floods to 0.353 for medium ones and 0.317 for large ones, based on the ENS measure for the uncorrelated. In addition, the gap between highly sensitive and sensitive parameters decreased as magnitudes increased. This can be explained by the fact that most small floods occurred after a long drought, during which the soil moisture was kept at a very low level [56] and a hydrograph was vulnerable to the disturbance and attenuation of the underlying surface. Therefore, a slight fluctuation of model parameters would play a noteworthy role in determining the hydrograph of a watershed outlet. In this way, the sensitivity indices and their variance for small floods were more significant. On the contrary, large floods mainly happened after long precipitation, during which the soil was moist enough and the disturbance of the underlying surface cast a less important effect on outlet hydrographs. Therefore, the sensitivity indices and their variation for large floods were less significant.
Additionally, attention should also be paid to the variations of sensitivity indices related to different objective functions. Taking ENS as the objective function, the most sensitive parameters, SC and FC, were significant and five sensitive parameters were at about the same level. While taking WB as the objective function, the indices of n and Mn declined compared with other sensitive parameters. However, this situation was reversed when taking EP and ETP as the objection functions. These differences came from the diverse emphasized hydrograph process, magnitude, and timing from the four objective functions. ENS focused on the variation between simulated hydrograph and the average measured one. WB concentrated only on the gap between the simulated and measured volume. Moreover, EP and ETP merely paid attention to the efficiency of the simulated peak discharge and the time to peak. Therefore, flow concentration parameters such as n and Mn that determine the flow velocity would have a more momentous impact on the peak discharge presented by EP and ETP, while little impact on the flood volume presented by WB. On the contrary, runoff generation parameters such as Ks, h, and b would play a more important role in influencing WB than EP and ETP compared with n and Mn. Both runoff generation and flow concentration parameters affect the shape of a hydrograph; their competing consequences result in the same sensitivity indices level for ENS. These results implied that the first and total order sensitivity indices of model parameters were greatly affected by the selection of objective functions as well as the choice of flood magnitudes. Previous studies [6,7,14,23,24,51,53] that took ENS or RMSE as the objective function would not classify the runoff generation and flow concentration parameters and obtain deep insight into the model behavior.

4.5. Results of Second Order and Interactive Sensitivity Analysis

In this study, further insight of the second order indices between the seven parameters without consideration of correlation relationship was also illustrated. Table 5 showed that the sum of the first order sensitivity indices of these seven parameters was less than 1. Zhan et al. [6] explained that the sum of first order indices less than 1 was caused by the interactive effects among parameters, which could not be considered in the single parameter sensitivity analysis. In addition, this outcome indicated that the interactive impacts among parameters cannot be neglected. This indication was reinforced by the gaps between the first and total order sensitivity indices (see Figure 4 and Table 5). On the other hand, the sum of total order sensitivity indices was slightly larger than 1. This phenomenon was explained by Tang et al. [14]: the truncation and Monte Carlo integration implemented in the Sobol’ method can result in slight numerical errors. However, Tang et al. [14] acknowledged that this unimportant error would not affect the sensitivity ranking of parameters. Baroni and Tarantola [57] also investigated the robustness of sensitivity indices under the change of sample size. The total order sensitivity indices proved to be more stable than the first order ones. Therefore, the total order sensitivity indices of the most and least sensitive parameters (i.e., FC and Ks) were iteratively explored by increasing the sample size by 200 at a time from 200 to 2000. Results in Figure 5 showed that both total order sensitivity indices of Ks (left) and FC (right) fluctuated within a wide range when the sample size was less than 1200. Furthermore, the variation of highly sensitive parameters was more significant than that of sensitive parameters. The total sensitivity indices of Ks with EP and ETP as objective functions were seen to converge at a very low computation cost. Even though the total sensitivity indices changed more significantly when taking ENS and WB as objective functions, the differences between the maximum and minimum were 0.006 and 0.015 for Ks, respectively. However, from the perspective of FC, only when taking EP as objective function did the total sensitivity indice vary slightly, and the differences between the maximum and minimum values were 0.047, 0.018, and 0.035 for ENS, WB, and ETP, respectively, which were much larger than those of Ks. These results were in consistent with the research of Baroni and Tarantola [57]. However, as long as the sample size increased to a large value, e.g., 1600, the total sensitivity analysis began to converge and generate steady results for both highly and generally sensitive parameters considering various objective functions. These results underlined that the sample size of 2000 set in the Sobol’ analysis in Section 4.4 was reasonable.
As a matter of fact, it is obvious that the sum of the first and second order sensitivity indices accounted for nearly all the total order (see Table 5), which indicated that the interactive effects among parameters can be mainly explained by pairwise interactions. In other words, the second order sensitivity indices of the Liuxihe Model can denote most parameter interactions. More precisely, as given in Table 5, the average second order sensitivity indices were 0.32, 0.25, 0.30, and 0.27, respectively, based on ENS, WB, EP, and ETP measures, with corresponding first order indices of 0.75, 0.77, 0.73, and 0.76, respectively.
Table 5. Summations of the first, second, and total order sensitivity indices based on four different objective functions.
Table 5. Summations of the first, second, and total order sensitivity indices based on four different objective functions.
Objective FunctionsFirst OrderSecond OrderTotal Order
SmallMediumLargeSmallMediumLargeSmallMediumLarge
ENS0.740.770.740.310.310.341.191.111.08
WB0.820.740.750.240.230.281.061.001.09
EP0.740.690.750.330.310.261.201.011.06
ETP0.750.840.70.340.240.221.041.141.05
Figure 5. Variations of the total order sensitivity indices along with increasing sample size. (a) For the least sensitive parameter Ks; (b) for the most sensitive parameter FC.
Figure 5. Variations of the total order sensitivity indices along with increasing sample size. (a) For the least sensitive parameter Ks; (b) for the most sensitive parameter FC.
Water 07 02924 g005
In order to better analyze the two-parameter interactions, the pairwise variation-induced second order sensitivity indices (see Figure 6) provided a more exhaustive understanding of the second order indices listed in Table 5. These figures displayed the lower triangle matrix of pairwise parameters interactions where dark gray shadow denoted higher interactions and light shadow denoted lower interactions. Remarkably, the pairwise interactions became more significant as the parameters grew more sensitive. For instance, the interactive sensitivity indice between SC and FC was as high as 0.09 based on the WB measure. Furthermore, the pairwise interactions between SC and the remaining sensitive parameters, and between FC and the remaining sensitive parameters, were also relatively higher for all objective functions under the three flood magnitudes. However, the two-way interactions between sensitive parameters were relatively lower (e.g., the second order sensitivity indice between Mn and n was less than 0.01 even though both parameters described the process for flow routing).
Our results were in good agreement with the results provided by Tang et al. [14], Zhan et al. [6], and Zhang et al. [21], even though the hydrological models were different. Tang et al. [14] pointed out that the lower zone storage parameter and percolation factor parameter in the model combined by SAC-SMA and SNOW-17 presented the highest number of interactions, while the two parameters both also showed the highest influence on model output in the first order sensitivity analysis. Zhan et al. [6] concluded that the second order indice between soil moisture parameter and thickness of upper soil layer in the model named DTVGM was 0.121 high for the objective function WB and their corresponding first order indices were 0.288 and 0.139, respectively, ranking inside the top three. Based on the Xinanjiang model, Zhang et al. [21] concluded that the pairwise interaction between areal mean free water storage capacity parameter (Sm) and recession constant for channel routing (Cs) had a profound effect on model performance, while Sm and Cs were the two most sensitive parameters in the total order sensitivity evaluation in the meantime. The application of Sobol’ method into various models’ sensitivity analysis indicated that the parameters that showed high sensitivity indices in the first and total order analysis were prone to presenting a correspondingly significant pairwise interaction with other parameters in the second order analysis. This was an expected result, as the changes in the sensitive parameter, coupled or not coupled with other parameters’ changes, would definitely contribute to the great variance of model outputs. Variation in the two highly sensitive parameters would have the most important effect on model performance.
Figure 6. Second order sensitivity indices between 12 parameters based on different objective functions measures under three different flood magnitudes. Dark grey shadow denoted high parameter interactions. Light gray shadow denoted low parameter interactions.
Figure 6. Second order sensitivity indices between 12 parameters based on different objective functions measures under three different flood magnitudes. Dark grey shadow denoted high parameter interactions. Light gray shadow denoted low parameter interactions.
Water 07 02924 g006
Additionally, pairwise interaction investigation among different flood magnitudes revealed that the second order parameter sensitivities were largely influenced by the selected flood magnitudes. Small floods presented the highest second order sensitivity while large floods presented the lowest sensitivity. The reason was similar to the differences caused in the first and total order sensitivity analysis. A dry and low soil moisture content condition before a small flood kept the watershed susceptible to the complex perturbations (e.g., changing pairwise parameters) of the underlying surface, while high soil moisture content before a large flood made the watershed more tolerable to the multipart disturbances. This result supported the above interaction analysis. Coupling parameters with higher sensitivity would generate higher interactive sensitivity indices (e.g., for small magnitude) than coupling lower sensitive parameters (e.g., for large magnitude).

5. Conclusions

In this study, the sensitivity and interaction of the distributed hydrological model parameters was investigated to help present and understand a model’s behavior. The Morris screening method was employed to detect the insensitive parameters and the GLUE method was applied to identify the parameters correlations. Subsequently, the extended and generalized Sobol’ methods were implemented to compare the sensitivity indices of the Liuxihe model parameters in the case of uncorrelated and correlated parameters based on four objective functions: (1) Nash–Sutcliffe efficiency (ENS); (2) water balance coefficient (WB); (3) peak discharge efficiency (EP); and (4) time to peak efficiency (ETP). Hourly rainfall-runoff data in the Nanhua Creek catchment, Pearl River, China, were used. Contrasting outcomes for the sensitivity and interaction analysis were illustrated with three different flood magnitudes, i.e., small, medium, and large. The main conclusions drawn are as follows:
  • The Morris screening method is helpful for computationally demanding distributed models with dozens of parameters. For the 12 parameters in the Liuxihe model, S, v, Wp, Ep, and Kg are detected as insensitive parameters no matter which objective functions or flood magnitudes are selected. The other seven parameters were retained for further global sensitivity analysis.
  • The GLUE method provides a convenient approach to identify the correlation relationship between parameters. SC and FC are observed to be nearly linear regressed, with a correlation coefficient of about 0.96 at the significance level of 0.01. Other parameters demonstrated a certain correlated relationship as well.
  • For both uncorrelated and correlated parameters, the classification of parameters as first or total order sensitivity indices was affected neither by the choice of objective functions nor by the selection of different flood magnitudes. FC and SC are highly sensitive parameters and the remaining five parameters are sensitive. Meanwhile, the ranking of parameters sensitivity indices was greatly determined by the various objective functions specified and particular flood magnitudes stated. Due to the distinctive antecedent watershed conditions inducing different tolerance to watershed disturbances, the sensitivity indices for small floods were relatively higher than those for large floods. With regard to specific objective functions, flow concentration parameters such as n and Mn had a greater effect on EP and ETP than WB, whereas runoff generation parameters such as Ks, b, and b display totally reversed consequences. Hydrological models behave and respond uniquely to various evaluation functions and application conditions.
  • Highly correlated parameters present relatively larger sensitivity indices in the correlated case compared with the uncorrelated, whereas poorly correlated parameters displayed inverse results. Considering that highly sensitive parameters are prone to correlate with each other and commonly sensitive parameters are inclined to be distinguished from other parameters, the sensitivity indices of highly sensitive parameters might be underestimated and that of sensitive parameters might be overestimated without accounting for correlations. Further in-depth second order sensitivity analysis indicated that pairwise interactions, which cannot be neglected, accounted for most interactions.
In order to understand how model parameters contribute to model uncertainty, the preliminary research on Sobol’ sensitivity analysis was carried out accordingly. Further investigation of the GLUE uncertainty analysis can be conducted instead of simple calibration/validation analysis since the GLUE method also provides an estimation of uncertainty in the performance of the hydrological models.

Acknowledgments

This study was supported by the National Grand Science and Technology Special Project of Water Pollution Control and Improvement (No. 2014ZX07204006) and the National Natural Science Foundation of China (51339004, 51279139, and 51279140). The authors would like to thank two reviewers for providing brilliant suggestions to improve this paper. Additional thanks should be given to Seiger Kucherenko and Shufang Song for providing the MATLAB code of the generalized Sobol’ method.

Author Contributions

All authors have contributed to the design and drafting of this manuscript. Hui Wan processed the data and carried out the statistical analysis of results; Jun Xia and Liping Zhang provided many significant suggestions on the methodology and structure of the manuscript; Dunxian She gave a lot of help to improve this manuscript. Yang Xiao and Lei Zou helped to prepare the data.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Beven, K.J. Rainfall-Runoff Modeling (The Primer), 2nd ed.; John Wiley & Sons, Ltd.: West Sussex, UK, 2012; pp. 1–3. [Google Scholar]
  2. Wang, G.S.; Xia, J.; Chen, J. Quantification of effects of climate variations and human activities on runoff by a monthly water balance model: A case study of the Chaobai River basin in northern China. Water Resour. Res. 2009, 45. [Google Scholar] [CrossRef] [Green Version]
  3. Beven, K.; Binley, A. The future of distributed models: Model calibration and uncertainty prediction. Hydrol. Process. 1992, 6, 279–298. [Google Scholar] [CrossRef]
  4. Beven, K.; Binley, A. GLUE: 20 years on. Hydrol. Process. 2014, 28, 5897–5918. [Google Scholar] [CrossRef]
  5. Morris, M.D. Factorial sampling plans for preliminary computational experiments. Technometrics 1991, 33, 161–174. [Google Scholar] [CrossRef]
  6. Zhan, C.; Song, X.; Xia, J.; Tong, C. An efficient approach for global sensitivity analysis of hydrological model parameters. Environ. Model. Softw. 2013, 41, 39–52. [Google Scholar] [CrossRef]
  7. Van Griensven, A.; Meixner, T.; Grunwald, S.; Bishop, T.; Diluzio, M.; Srinivasan, R.A. Global sensitivity analysis tool for the parameters of multi-variable catchment models. J. Hydrol. 2006, 324, 10–23. [Google Scholar] [CrossRef]
  8. Katz, R.W. Extreme value theory for precipitation: Sensitivity analysis for climate change. Adv. Water. Resour. 1999, 23, 133–139. [Google Scholar] [CrossRef]
  9. Plischke, E.; Borgonovo, E.; Smith, C.L. Global sensitivity measures from given data. Eur. J. Oper. Res. 2013, 226, 536–550. [Google Scholar] [CrossRef]
  10. Jung, Y.; Merwade, V.; Kim, S.; Kang, N.; Kim, Y.; Lee, K.; Kim, G.; Kim, H.S. Sensitivity of subjective decisions in the GLUE methodology for quantifying the uncertainty in the flood inundation map for Seymour reach in Indiana, USA. Water 2014, 6, 2014–2126. [Google Scholar] [CrossRef]
  11. Doherty, J.; Welter, D. A short exploration of structural noise. Water Resour. Res. 2010, 46. [Google Scholar] [CrossRef]
  12. Foglia, L.; Hill, M.C.; Mehl, S.W.; Burlando, P. Evaluating model structure adequacy: The case of the Maggia Valley groundwater system, Southern Switzerland. Water Resour. Res. 2013, 49. [Google Scholar] [CrossRef]
  13. Rakovec, O.; Hill, M.C.; Clark, M.P.; Weerts, A.H.; Teuling, A.J.; Uijlenhoet, R. Distributed evaluation of local sensitivity analysis (DELSA), with application to hydrologic models. Water Resour. Res. 2014, 50. [Google Scholar] [CrossRef]
  14. Tang, Y.; Reed, P.; Wagener, T.; van Werkhoven, K. Comparing sensitivity analysis methods to advance lumped watershed model identification and evaluation. Hydrol. Earth Syst. Sci. 2007, 11, 793–817. [Google Scholar] [CrossRef]
  15. Gan, Y.; Duan, Q.; Gong, W.; Tong, C.; Sun, Y.; Chu, W.; Ye, A.; Miao, C.; Di, Z. A comprehensive evaluation of various sensitivity analysis methods: A case study with a hydrological model. Environ. Model. Softw. 2014, 51, 269–285. [Google Scholar] [CrossRef]
  16. Cibin, R.; Sudheer, K.P.; Chaubey, I. Sensitivity and identifiability of stream flow generation parameters of the SWAT model. Hydrol. Process. 2010, 24, 1133–1148. [Google Scholar] [CrossRef]
  17. Hornberger, G.; Spear, R. An approach to the preliminary analysis of environmental systems. J. Environ. Manag. 1981, 12, 7–18. [Google Scholar]
  18. Doherty, J.; Johnston, V.K. Methodologies for calibration and predictive analysis of a watershed model. J. Am. Water. Resour. Assoc. 2003, 39, 251–265. [Google Scholar] [CrossRef]
  19. Mokhtari, A.; Frey, H.C. Sensitivity analysis of a two-dimensional probabilistic risk assessment model using analysis of variance. Risk Anal. 2005, 25, 1511–1529. [Google Scholar] [CrossRef] [PubMed]
  20. Reusser, D.E.; Buytaert, W.; Zehe, E. Temporal dynamics of model parameter sensitivity for computationally expensive models with the Fourier amplitude sensitivity test. Water Resour. Res. 2011, 47. [Google Scholar] [CrossRef]
  21. Zhang, X.L.; Peng, Y.; Xu, W.; Wang, B.D.; Wang, H.X. Sensitivity analysis of Xinanjiang model prameters using Sobol’ method. South-North Water Trans. Water Sci. Technol. 2014, 12, 27–32. (In Chinese) [Google Scholar]
  22. Sobol’, I.M. Sensitivity estimates for nonlinear mathematical models. Math. Model. Comput. Exp. 1993, 1, 407–417. [Google Scholar]
  23. Zhang, C.; Chu, J.; Fu, G. Sobol’s sensitivity analysis for a distributed hydrological model of Yichun River Basin, China. J. Hydrol. 2013, 480, 58–68. [Google Scholar] [CrossRef]
  24. Wagener, T.; van Werkhoven, K.; Reed, P.M.; Tang, Y. Multiobjective sensitivity analysis to understand the information content in streamflow observation for distributed watershed modeling. Water Resour. Res. 2009, 45. [Google Scholar] [CrossRef]
  25. Nossent, J.; Elsen, P.; Bauwens, W. Sobol’ sensitivity analysis of a complex environmental method. Environ. Model. Softw. 2011, 26, 1515–1525. [Google Scholar] [CrossRef]
  26. Zhan, Y.; Zhang, M. Application of a combined sensitivity analysis approach on a pesticide environmental risk indicator. Environ. Model. Softw. 2013, 49, 129–140. [Google Scholar] [CrossRef]
  27. Todri, E.; Amenaghawon, A.N.; del Val, I.J.; Leak, D.J.; Kontoravdi, C.; Sergei Kucherenko, S.; Nilay Shah, N. Global sensitivity analysis and meta-modeling of an ethanol production process. Chem. Eng. Sci. 2014, 114, 114–127. [Google Scholar] [CrossRef]
  28. Liu, K.; Zeng, X.; Qiao, L.; Li, X.; Yang, Y.; Dai, C.; Hou, A.; Xu, D. The sensitivity and significance analysis of parameters in the model of ph regulation on lactic acid production by Lactobacillus bulgaricus. BMC Bioinform. 2014, 15. [Google Scholar] [CrossRef] [PubMed]
  29. Lagerwalla, G.; Kiker, G.; Muñoz-Carpena, R.; Wang, N. Global uncertainty and sensitivity analysis of a spatially distributed ecological model. Ecol. Model. 2014, 275, 22–30. [Google Scholar] [CrossRef]
  30. Kucherenko, S.; Tarantola, S.; Annoni, P. Estimation of global sensitivity indices for models with dependent variables. Comput. Phys. Commun. 2012, 183, 937–946. [Google Scholar] [CrossRef]
  31. Saltelli, A. Making best use of model evaluations to compute sensitivity indices. Comput. Phys. Commun. 2002, 145, 280–297. [Google Scholar] [CrossRef]
  32. Campolongo, F.; Saltelli, A.; Cariboni, J. From screening to quantitative sensitivity analysis: A unified approach. Comput. Phys. Commun. 2011, 182, 978–988. [Google Scholar] [CrossRef]
  33. Sweetapple, C.; Fu, G.; Butler, D. Identifying key sources of uncertainty in the modelling of greenhouse gas emission from wastewater treatment. Water Res. 2013, 47, 4652–4665. [Google Scholar] [CrossRef] [PubMed]
  34. Sweetapple, C.; Fu, G.; Butler, D. Identifying sensitive sources and key operational parameters for the reduction of greenhouse gas emissions from wastewater treatment. Water Res. 2014, 62, 249–259. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  35. Campolongo, F.; Cariboni, J.; Saltelli, A. An effective screening design for sensitivity analysis of large models. Environ. Model. Softw. 2007, 22, 1509–1518. [Google Scholar] [CrossRef]
  36. McKay, M.D.; Beckman, R.J.; Conover, W.J. A comparison of three methods for selecting values of input variables in the analysis of output from a computer code. Technometrics 1979, 21, 239–245. [Google Scholar] [CrossRef]
  37. Andres, T.H. Sampling method and sensitivity analysis for large parameter sets. J. Stat. Comput. Simul. 1997, 57, 77–110. [Google Scholar] [CrossRef]
  38. Ratto, M.; Tarantola, S.; Saltelli, A. Sensitivity analysis in model calibration: GSA-GLUW approach. Comput. Phys. Commun. 2001, 136, 212–224. [Google Scholar] [CrossRef]
  39. Zhao, S.; Zhang, W.; Lin, X.; Li, J.; Song, D.; He, X. Effect of parameters correlation on uncertainty and sensitivity in dynamic thermal analysis of thermal protection blanket in service. Int. J. Therm. Sci. 2015, 87, 158–168. [Google Scholar] [CrossRef]
  40. Chen, Y.B.; Ren, Q.W.; Huang, F.H.; Xu, H.; Cluckie, L. Liuxihe model and its modeling to river basin flood. J. Hydrol. Eng. 2011, 16, 33–50. [Google Scholar] [CrossRef]
  41. Chen, Y.; Huang, F.; Xu, H. Research on flood forecasting of the Lianjiang River Basin based on physically based distributed hydrological model. In Proceedings of the Academic Annual Conference of Chinese Hydraulic Engineering Society, Haikou, China, 1–4 April 2008; pp. 955–960. (In Chinese)
  42. Huang, S.; Chen, Y.; Jiang, H. Flood forecasting of the Wu River Basin based on the Liuxihe model. In Proceedings of the 4th Youth Science and Technology Forum of the Annual Conference of Chinese Hydraulic Engineering Society, Beijing, China, 15 October 2008; pp. 255–260. (In Chinese)
  43. Fan, Z.; Hao, Z.; Chen, Y.; Wang, J.; Huang, F. The application and research of income flood simulation of the Baipengzhu Reservior with the Liuxihe Model. Acta Sci. Nat. Univ. SunYatSeni 2012, 51, 113–118. (In Chinese) [Google Scholar]
  44. Liao, H.; Chen, Y.; Xu, H.; He, J. Study of Liuxihe model for flood and rainfall forecast of Tiantoushui watershed. Yangtze River 2012, 43, 12–16. (In Chinese) [Google Scholar]
  45. Campbell, G.S. A simple method for determining unsaturated conductivity from moisture retention data. Soil Sci. 1974, 117, 311–314. [Google Scholar] [CrossRef]
  46. Freer, J.; Beven, K.J.; Ambroise, B. Bayesian estimation of uncertainty in runoff prediction and the value of data: An application of the GLUE approach. Water Resour. Res. 1996, 32, 2162–2173. [Google Scholar] [CrossRef]
  47. Manache, G.; Melching, C.S. Identification of reliable regression- and correlation-based sensitivity measures for importance ranking of water-quality model parameters. Environ. Model. Softw. 2008, 23, 549–562. [Google Scholar] [CrossRef]
  48. Tong, C.; Graziani, F. A practical global sensitivity analysis methodology for multi-physics applications. Lect. Notes Comput. Sci. Eng. 2008, 62, 277–299. [Google Scholar]
  49. Arya, L.M.; Paris, J.F. A physioempirical model to predict the soil moisture characteristic from particle-size distribution and bulk density data. Soil Sci. Soc. Am. J. 1981, 45, 1023–1030. [Google Scholar] [CrossRef]
  50. Nash, J.E.; Sutcliffe, J. River flow forecasting through conceptual models. Part 1. A discussion of principles. J. Hydrol. 1970, 10, 282–290. [Google Scholar] [CrossRef]
  51. Kannan, N.; White, S.M.; Worrall, F.; Whelan, M.J. Sensitivity analysis and identification of the best evapotranspiration and runoff options for hydrological modelling in SWAT-2000. J. Hydrol. 2007, 332, 456–466. [Google Scholar] [CrossRef]
  52. Van Werkhoven, K.; Wagener, T.; Reed, P.; Tang, Y. Sensitivity-guided reduction of parametric dimensionality for multi-objective calibration of watershed models. Adv. Water Resour. 2009, 32, 1154–1169. [Google Scholar] [CrossRef]
  53. Dobler, C.; Pappenberger, F. Global sensitivity analyses for a complex hydrological model applied in an Alpine watershed. Hydrol. Process. 2013, 27, 3922–3940. [Google Scholar] [CrossRef]
  54. Xu, H.; Chen, Y.; Li, Z.; He, J. Analysis on parameter sensitivity of distributed hydrological model based on LH-OAT Method. Yangtze River 2012, 43, 19–23. (In Chinese) [Google Scholar]
  55. Pan, F.; Zhu, J.; Ye, M.; Pachepsky, Y.A.; Wu, Y.S. Sensitivity analysis of unsaturated flow and contaminant transport with correlated parameters. J. Hydrol. 2011, 397, 238–249. [Google Scholar] [CrossRef]
  56. Massmann, C.; Holzmann, H. Analysis of the behavior of a rainfall-runoff model using three global sensitivity analysis methods evaluated at different temporal scales. J. Hydrol. 2012, 475, 97–110. [Google Scholar] [CrossRef]
  57. Baroni, G.; Tarantola, S. A General Probabilistic Framework for uncertainty and global sensitivity analysis of deterministic models: A hydrological case study. Environ. Model. Softw. 2014, 51, 26–34. [Google Scholar] [CrossRef]

Share and Cite

MDPI and ACS Style

Wan, H.; Xia, J.; Zhang, L.; She, D.; Xiao, Y.; Zou, L. Sensitivity and Interaction Analysis Based on Sobol’ Method and Its Application in a Distributed Flood Forecasting Model. Water 2015, 7, 2924-2951. https://doi.org/10.3390/w7062924

AMA Style

Wan H, Xia J, Zhang L, She D, Xiao Y, Zou L. Sensitivity and Interaction Analysis Based on Sobol’ Method and Its Application in a Distributed Flood Forecasting Model. Water. 2015; 7(6):2924-2951. https://doi.org/10.3390/w7062924

Chicago/Turabian Style

Wan, Hui, Jun Xia, Liping Zhang, Dunxian She, Yang Xiao, and Lei Zou. 2015. "Sensitivity and Interaction Analysis Based on Sobol’ Method and Its Application in a Distributed Flood Forecasting Model" Water 7, no. 6: 2924-2951. https://doi.org/10.3390/w7062924

Article Metrics

Back to TopTop