1. Introduction
The soil water retention (SWR) curve is the main function to evaluate and elucidate the soil’s capacity to retain water. It has an essential role in irrigation and drainage, soil conservation, and modeling water and contaminant transport in the vadose zone. The intricate nature of the soil media affects the SWR behavior so that the shape of the curve is influenced by soil physicochemical properties such as texture, structure, porosity, particle size distribution, permeability, and organic matter [
1,
2,
3].
SWR curves can be measured directly or indirectly (predictive models) [
4,
5]. Despite the relative advances in measurement methods, laboratory methods, besides the rigorous and technical expertise, are still time-consuming, tedious, labor-intensive, and costly [
6,
7]. In addition, they are often performed on small and disturbed soil samples or small-scale in situ pilots that are less generalizable to the farm scale [
8]. Therefore, to address the aforementioned limitations and disadvantages, alternative indirect methods have garnered significant interest. Many mathematical and conceptual models have been proposed in the literature to describe soil hydrological properties. These models can be classified into three main categories: physico–empirical models [
9,
10,
11,
12,
13,
14], pedotransfer functions (PTFs) [
15,
16,
17], and fractal methods [
18,
19,
20].
Physico–empirical models aim to integrate physical concepts with empirical assumptions. The physical concepts in use mostly encompass particle size distribution, pore size distribution, tortuosity, and pore connectivity [
21]. The reliance on semi-physical models on soil texture and the credibility of empirical assumptions has prompted numerous models suggested for SWR curve estimation. Naturally, the accuracy and performance of SWR models in predicting the relationship between soil water content and matric potential are closely tied to the adaptation of the model to the experimental data [
22].
PTF methods use the relationship established between expensive measured properties (SWR and hydraulic conductivity curves) and easily measured properties (texture, bulk density, and organic matter). This is achieved through either multivariate regression or artificial neural networks [
23,
24]. Nevertheless, the outcomes of the models are not applicable to all types of soil, and the PTFs are strongly influenced by the sample database. The inverse method (ICM) is a viable optimization technique for estimating unknown parameters by minimizing an objective function [
25]. Typically, the objective function is defined as the difference between predicted and costly measured parameters [
26]. Although defining adjustable variables improves the flexibility of the algorithm considered in the ICM to estimate the full range of the SWR curve, it also significantly increases non-uniqueness and the correlation between the estimated parameters [
25]. It is important to note that in the ICM, the estimated parameters should not be strongly correlated [
27]. On the other hand, selecting the appropriate initial guess for the parameters is crucial since inappropriate initial assumptions increase the possibility of divergence of the objective function [
28].
Applications of fractal geometry in soil science have shown that the complicated porous medium can be characterized by the fractal representation [
29]. Fractal models of soil hydraulic properties, such as the SWR, have been proposed [
19,
30,
31]. Based on the fractal scaling of soil structure, different SWRC models have been developed. Some are mass fractal-based models [
31]. Some are based on the fractal surface [
32]. There are also models based on the fractal scaling of PSD or the pore phase of soils [
30,
33]. To apply the models, one needs to determine the fractal dimensions. However, it is rather challenging to estimate the fractal dimensions because of the complexity of measurements of the pore size and pore volume distributions.
Among semi-physical models, closed-form expressions are widely used by modelers due to their approachability and practical application [
1]. Some of the proposed hydraulic functions have received more attention by researchers due to their performance and accuracy for estimating the SWR and hydraulic conductivity curves for the main twelve textural soil classes [
9,
10,
11,
12,
13,
34,
35,
36,
37,
38,
39,
40,
41,
42,
43,
44,
45,
46,
47,
48,
49,
50,
51,
52,
53,
54,
55]. In contrast, some other models had a lack of success due to failure in generalizing for different soil types [
56,
57,
58,
59,
60,
61,
62,
63,
64,
65,
66,
67,
68,
69,
70,
71,
72,
73,
74,
75].
Closed-form functions are principally high-order nonlinear models that integrate adjustable parameters with physical or mathematical meaning. They can explain the relationship between soil water content and matric suction, as well as soil water content and hydraulic conductivity, in the form of an analytical equation. The accuracy and performance of models are normally evaluated by comparing fitted values from models to experimental SWR measurements. Given the unique
S-shape of the SWR curve, many models in the form of sigmoid [
9,
10], multiple exponentials [
13,
46,
54], lognormal [
12], hyperbolic [
56,
62], and hybrid functions [
36,
53] have been suggested for SWR models.
This study assesses the performance and accuracy of 15 SWR models using a set of Iranian soils covering all textural classes. Furthermore, it evaluates the impact of soil properties such as texture, organic matter, and bulk density on SWR model performance.
2. SWR Models
Numerous SWR models have been proposed to describe the relationship between soil water content, θ [L3 L−3], and matric suction, h [L]. Typically, model performance is assessed using two θ ranges: one spanning from the saturated water content, θs [L3 L−3], to the residual water content, θr [L3 L−3], and the other extend across the entire range from the very dry part of the SWR curve. Estimating the SWR from the oven-dry region started in the 1990s; previously, SWR models were limited to the wet range due to limitations in technology and measuring equipment, especially at high suctions.
The Brooks and Corey model [
9] is one of the most conventional and widespread SWR models [
76]:
where
α [L
−1] is the inverse of the air-entry suction and
λ [-] > 0 is an index related to the pore size distribution (PSD) with a maximum value around 4. In fact,
λ is an index that evaluates the non-uniformity of the PSD, where soils with a more uniform PSD result in larger
λ.
Another practical and widespread SWR model was proposed by van Genuchten [
10]. The model is presented as follows:
where
m and
n are dimensionless shape parameters without physical meaning, and α is the inverse of the air-entry suction. It should be noted that considering
m = 1, this type of expression was previously used by other authors (e.g., [
77,
78,
79].
One essential criterion for a SWR model is the conformity of the model to the inflection point. Tani [
67] proposed an exponential model considering the matric suction at the inflection point,
hi [L], of the SWR curve. This model has fewer parameters, resulting in a low correlation between them.
Hutson and Cass [
36] developed their model based on Campbell (1974). The intrinsic flaw of the Campbell model was the inability of the model to estimate the SWR curve for suctions less than the air-entry value. For this reason, Hutson and Cass [
36] proposed a SWR curve as a piecewise-modified function that comprises exponential and parabolic functions as follows:
where
a [L] and
b [-] are empirical parameters, the latter related to the slope of the SWR curve. They used the parameter
a instead of the air entry suction, which is merely a correction factor.
Russo [
71] proposed an effective improvement for the Tani [
67] model, particularly at low suctions, and added another parameter as follows:
where
β refers to the empirical parameter attached to the Equation instead of the inflection point matric suction, and
k is a coefficient introduced by Mualem [
80].
Campbell and Shiozawa [
38] proposed a reformulated expression for the van Genuchten [
10] model incorporating water content adsorbed on soil surfaces at very low matric potentials as follows:
where the first term on the right-hand side notes to adsorptive water;
m [-],
A [-], and
θa [L
3 L
−3] are curve-fitting parameters, and
ho [L] (~10
5 m) is the matric suction at oven dryness.
Fredlund and Xing [
40] derived a five-parameter model to cover the entire range of the SWR curve:
where
a [-],
n [-], and
m [-] are empirical shape parameters, and
hr [L] corresponds to the matric suction associated with
θr.
Kosugi [
12,
42] proposed a lognormal SWR model considering the physical basis for the description of the soil pore size.
where erfc is the complementary error function, and
n [-],
[L] and
[L] are model parameters. In fact,
is the matric suction corresponding to the largest soil pore,
is the matric suction corresponding to the smallest soil pore, and
n is the standard deviation of the pore size distribution.
Fayer and Simmons [
43] adapted the van Genuchten and the Brook-Corey models from saturation to oven dryness using the approach of Campbell and Shiozawa [
38] as follows:
In the above two equations, instead of θr in the van Genuchten and Brooks–Corey models, the absorptive water content θa is used.
Assouline et al. [
44] postulated a conceptual SWR model based on the conversion from solid particle volume to pore volume.
where
θL [L
3 L
−3] is the water content whose hydraulic conductivity is a trifle and negligible, and
hL [L] is the matric suction corresponding to this very low water content.
M [-] and
n [-] are curve-fitting parameters influenced by particle shapes, packing configuration, and PSD.
Groenevelt and Grant [
46] presented a double exponential SWR model with high mathematical versatility.
k1, k0, and n are the curve-fitting parameters, while θa [L3 L−3] is the water content corresponding to a remarkably high matric suction ha corresponding to permanent wilting (15,000 cm H2O).
Khlosi et al. [
47] used the adsorptive approach from Campbell and Shiozawa [
38] to adapt the Kosugi [
81] model from saturation to oven dryness:
Θa and
ho are similar to those in the Fayer and Simmons [
43] model, and the other parameters are the same as in the Kosugi model [
12].
Dexter et al. [
13] proposed a double exponential model including five adjustable parameters with physical meaning. The parameters correspond to the soil matrix and the structural pore space, and the configuration of pores determines the shape of the SWR function.
where
w is the gravimetric soil water content [M·M
−1],
C points to the asymptote of the model when it approaches the residual water content,
A1 [M·M
−1] and
A2 [M·M
−1] are parameters related to the matrix pores and the structural pores, and
h1 [L] and
h2 [L] are the matric suction when water drains from the matrix (textural) and structural pores, respectively.
Similarly, Romano et al. [
51] divided the soil pore system into two: textural and structural. They used a two-term SWR based on the lognormal Kosugi model [
12] as follows:
where 0 ≤
w ≤ 1 notes the weighting factor that considers the share of the textural and structural pores in the soil and their influence on the SWR curve.
α1,
α2,
n1, and
n2 are shape parameters of the model for the textural and structural pore systems, respectively. This model was further improved by Pollacco et al. [
82] avoiding the use of the empirical weighting factor,
w.
The selected set of SWR models are summarized in
Table 1. It is worth noting that SWR models can be categorized as one-, two-, and three-segment models. Fragmenting the SWR curve using piecewise functions leads to an improved fit, but it also increases problems of continuity and smoothness [
41] with implications for the integrability and derivability at the junctions points [
83].
4. Results and Discussions
Figure 3 and
Figure 4 and
Table 3 and
Table 4 show results corresponding to
RMSE,
AIC, and
BIC of each SWR model using the 140 soils. According to
Table 3, except for the Tani and Russo models, the 10% percentile of the other 13 models is <0.01, while the 50% percentile (median) of the other 13 models is <0.015. Therefore, selected SWR models have good performance in fitting SWR data.
Although for the Tani and Russo models the number of parameters is less than for other models, the shape of suggested functions does not correspond to the behavior of the SWR curve.
Figure 5 clearly shows this demerit. Reducing the number of parameters in the model reduces the correlation between the parameters and increases the uniqueness of the parameters. The ideal SWR model will be able to account for the variations in the SWR data with the minimum number of parameters. Therefore, it is recommended to use models with fewer parameters in the fitting process as much as possible. The Tani and Russo models show the worst performance among the 15 utilized models; therefore, they are less recommended. According to
Table 4, among the remaining 13 models, the maximum RMSE varies between 0.03 and 0.23, indicating that there are significant differences in the accuracy and performance of the models.
Figure 3 shows that for the BC and VG models, despite the acceptable median and 90% percentile for both models, drastic and unusual changes are observed for the
RMSE (see
Table 4), ranging from 2.3 × 10
−5 to 0.175 and 9.5 × 10
−17 to 0.227, respectively, indicating that these models are significantly sensitive to soil type (or soil textures) compared to other models. Based on the information in
Table 3, the
AIC and
BIC analysis of the models also shows that the VG model has the lowest
AIC and
BIC among the selected models, so there are striking differences in the calculated
AIC and
BIC for the 10% percentile resulting from the minimum 10% percentile for the
RMSE. It seems that for some soils the VG model has an excellent fit compared to other models. The reason for this peculiarity is discussed in the manuscript. From Equation (17), by increasing the performance and accuracy of the desired model in fitting the experimental data (goodness-of-fit) and also the number of empirical SWR data (
N),
approaches zero (0
+ or zero limit). As a result,
takes more negative values. In
Table 3, models with more negative
AIC (in other words, lower values) have higher accuracy.
The
AIC and
BIC results for the 15 selected models are shown in
Figure 4. The results show that despite the relative proximity for the 50% percentile (median) in all models, the VG model results in wide alterations for
AIC and
BIC. It appears that the performance of the model is influenced by soil type and soil physical properties.
BC and VG reformulated and transformed Equations (19) and (20) to Equations (1) and (2), where the minimum and maximum soil water contents, θr and θs, are replaced by Amin and Amax, respectively. Although these models, like the traditional models, have acceptable performance for different soil types, they have some fundamental weaknesses and drawbacks, as discussed below.
- (1)
Given the nature and behavior of Equations (19) and (20), BC and VG models show excellent fit to the asymmetric retention data around the inflection point with
RMSE < 10
−10 but are inherently and significantly sensitive to pure errors and outliers in the dataset. Pure errors are errors caused by the presence of random variation in the data such that despite high R
2 values, the
RMSE shows abnormal values (see
Figure 6,
Figure 7 and
Figure 8).
The residual sum of squares can be decomposed into two components:
RSS = (sum of squares due to “pure error”) + (sum of squares due to “lack of fit”)
The sum of squares due to “pure error” is the sum of squares of the differences between each observed y-value (water content) and the average of all y-values (
) corresponding to the same x-value (matric suction). The sum of squares due to lack of fit is the weighted sum of squares of differences between each average of y-values (
) corresponding to the same x-value and the corresponding fitted y-value (
θEstimated), the weight in each case being simply the number of observed y-values for that x-value:
To minimize pure errors and obtain accurate empirical SWR for each soil sample, replication and duplication simultaneously are used during experiments so that no outliers were observed for almost 99% of the averaged SWR data. To avoid abnormal and illusive RMSEs, a few scattered outliers were ignored. In the presence of pure errors and the erratic scattering of data around the regression curve, suitable R
2 values are obtained (see
Figure 6,
Figure 7 and
Figure 8 as typical examples), which by itself is not an acceptable criterion for measuring the accuracy of the model (
Figure 5). The best criteria to assess models’ performance are
RSS,
RMSE, and
AIC;
- (2)
Comparison of Equations (1) and (19) and Equations (2) and (20) shows that for the BC and VG models, θr and θs are considered as the corresponding soil water content values when the logarithm of the matric suction approaches zero and infinity, respectively. Therefore, these parameters are fitting parameters without any specific physical meaning. From the data fitting, the range of variation for θr was higher compared to other models (0.00001 to 0.24 m3·m−3), indicating that these models, regardless of soil type and data range, only tried to fit the data with unphysical θr;
- (3)
For matric suction below air entry, the BC model treats the SWR curve as a vertical line (
Figure 6,
Figure 7 and
Figure 8). In other words, as depicted in
Figure 8 for suctions below air entry, decreasing suction does not increase soil water content and its value remains constant to saturated water content. For fine textured soils with large
ha, the inability of the model to simulate this part of the SWR becomes more pronounced, resulting in an increase in the
RMSE;
- (4)
The VG model does not consider the matric suction at the air entry suction. However, the strength of the model lies in having an inflection point which results in an exceptional fit to measured data, particularly at high water content;
- (5)
As previously discussed, the BC and VG models assume that soil suction approaches zero and infinity as the water content decreases to
θr and increases to
θs, respectively. This results in the constant maintenance of
θr with additional matric suction. While this may be valid only for the wet part of the SWR, however, a large number of observed SWR data indicates that for the dry part of the SWR (
θ <
θr), increasing matric suction results in a decrease in water content that follows a linear relationship on a semi-logarithmic scale [
38,
94]. Given the domain and range of 4- and 5PLCs, the major drawback of the BC and VG models is that they do not define the SWR beyond
θr;
- (6)
The correlation between parameters plays a significant role in the sensitivity of the SWR model to the experimental data. In the VG model, the relationship between the three parameters of
α,
n, and
m is such an extent that minor alteration in one parameter effectively changes the other two parameters. To better understand the sensitivity of the VG model, the correlation matrix between the VG model parameters (
θr,
θs,
α,
n, and
m) for the maximum and minimum
RMSE,
RSS, and
AIC (see
Figure 6, which shows the worst and the best fits corresponding to clay loam and sandy clay loam soils) are presented in
Table 5.
The binary correlation matrix clearly shows that, regardless of model performance and good-of-fitness, there is a high dependency between the three parameters (α, n, and m) of the VG model.
The impact of soil properties on the performance of the SWR models is evaluated for the 140 soils. The RMSE of each SWR model was correlated with the sand-silt-clay content, bulk density, and organic matter. Results are summarized in
Table 6.
In the current study, the null hypothesis (H0) is considered when there is no correlation between RMSE (model performance) and soil properties (sand-silt-clay content, bulk density, and organic matter) in the overall population, including 140 soil samples (). On the counterpoint, for the alternative hypothesis (H1), there is a correlation between RMSE and soil properties in the overall population (). To accept H1, calculated p-values must be less than 0.05 or 0.01 for 95% and 99% confidence intervals, respectively. To conclusively convict and reject H0, p-values must be less than 10−3.
Regarding the corresponding
p-values in
Table 6, it can be inferred that for the BC and VG models, there is a weak correlation between model performance (
RMSE) and soil properties (sand-silt-clay content, bulk density, and organic matter) in the overall population including 140 soil samples. As mentioned before, the VG model is a closed-form sigmoid expression including the adjustable parameters obtained from fitting. In the VG model,
m and
n are called shape (fitting) parameters and do not have a physical meaning.
From
Table 6, it can be concluded that for the BC and VG models, the model performance is strongly independent of soil texture and bulk density. On the other hand, almost in all SWR models, the model performance is impacted by organic matter, with 90% confidence. However, the calculated correlation between organic matter and
RMSE is not striking, and correlations are negligible.
Another important finding from
Table 6 is the Dexter (double exponential) model, where the model performance is noticeable under the influence of soil properties.
Evaluation of Equation (14) can shed more light on
Table 6 results. DEX developed model has five adjustable parameters with physical meaning. The parameters of this model correspond to the soil matrix and the structural pore space. In general, for this model, the pore configuration in the soil specifies the shape of the function. In Equation (14),
w is the gravimetric soil water content associated with the matric suction,
C points to the asymptote of the model when it approaches the residual soil water content,
A1 and
A2 are parameters of the model associated with the matrix and structural pores of the soil, respectively, and
h1 and
h2 are related to the matric suction when water drains from the textural (matrix) and the structural pore space, respectively.
It can be deduced that the Dexter model is bimodal. The first part, (A1 exp(h/h1)), represents the matrix pores, and the second part, (A2 exp(h/h2)), considers the effect of structural pores on the SWR curve.
Table 6 also shows that at the significance level of 0.05, the
PCC between
RMSE and soil texture shows higher values compared to
ρb and
OM. The findings of
Table 6 are consistent with Cornelis et al. [
95] and Du [
22]. When correlating the
RMSE of the SWR model with
OM and
ρb, Cornelis et al. [
95] found no significant results at the 0.05 level. They compared 48 soil samples from Belgium using unimodal analytical expressions, evaluating the accuracy, linearity,
AIC, and prediction potential (
PCC). Du [
22] studied the performance of closed-form expressions on 94 soil samples from saturation to oven-dry. Cornelis et al. [
95] observed that the VG model with
m as a free parameter showed the highest accuracy and the lowest
AIC. However, it had a low prediction potential, which means that the model is remarkably influenced by soil texture. Constraining
m to 1 − 1/
n did not resolve this issue. However, reducing the number of parameters from 5 to 4 mitigated the model’s dependence on soil texture. Both studies, Cornelis et al. [
95] and Du [
22] found a slight correlation between
RMSE and ρ
b, as well as
RMSE and
OM, for most SWR models, which is attributed to the specific mathematical form of these models. The parameters within the models are primarily defined through the integration of features such as PSD and capillarity laws.
The SWR models show weak dependence on
ρb and
OM due to the narrow range of variation in these parameters. However, when compared to previously published data [
38,
95,
96,
97] and UNSODA data [
98], some variations can be observed.
Evaluation of
PCC between
RMSE and the soil sand-silt-clay content reveals some interesting features. For all SWR models, the
RMSE values are negatively correlated with clay. On the other hand,
RMSE values are positively correlated with silt, indicating that by increasing clay content and decreasing silt content, the performance of the SWR model increases (this effect is weak). For sand,
RMSE and SWR models exhibit contrasting patterns. The performance of all SWR models in Cornelis et al. [
95] except one is also in agreement with these results.
5. Conclusions
In this study, the performance and predictive potential of 15 soil water retention (SWR) models have been assessed using 140 soils from Iran. The findings indicate important conclusions summarized below.
Based on the Akaike information criterion (AIC) and Bayesian information criterion (BIC), the van Genuchten model has the lowest values. There are significant differences in AIC and BIC for the 10% percentile, which are a result of the minimum RMSE at the 10% percentile. This can be attributed to the symmetric SWR data around the inflection point, where the minimum RMSE values for these soils were 10−17. The Brooks and Corey and van Genuchten models demonstrate inadequate predictive ability in estimating SWR curves. Results from the statistical test conducted at a significance level of 0.05 (95% confidence) demonstrate that soil texture has not notable impact on both models, as suggested by calculated p-values for PCC between RMSE and soil texture. Despite the relative proximity for the 50% percentile (median) in all models, the van Genuchten model results in wide alterations for AIC and BIC. It reveals that the performance of the model is influenced by soil type and soil physical properties.
The calculated Pearson correlation coefficient and associated p-values between RMSE and organic matter reveal that in almost all SWR models, the model performance is weakly influenced by organic matter, with an acceptable approximation (90% confidence). This independence is also evident for the bulk density, apart from in four models BC, VG, Tani, and Russo. Statistical investigation also reveals that the performance of the Dexter model is noticeably influenced by soil properties. The parameters of the Dexter model correspond to the soil matrix and the structural pore space, which is directly related to the shape of the SWR function.
The TA and RU models exhibit the poorest performance and accuracy compared to the studied SWR models. Among the SWR models, Dexter et al. double exponential, Groenevelt and Grant, and Khlosi et al. models show higher accuracy and performance in predicting the SWR curve in terms of RMSE, AIC, and BIC. The evaluation of prediction potential for SWR models reveals that the DEX, GG, Kosugi, Assouline, Tessier, and Beruand models show the least dependency on soil properties.
It should be noted that a small number of sandy soils were used in the data set, and therefore, the results may not be fully generalizable to sandy soils.