Next Article in Journal
Projection of Future Meteorological Droughts in Lake Urmia Basin, Iran
Previous Article in Journal
A Comparative Analysis of Multiple Machine Learning Methods for Flood Routing in the Yangtze River
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Evaluation of Various Generalized Pareto Probability Distributions for Flood Frequency Analysis

by
Cristian Gabriel Anghel
and
Cornel Ilinca
*
Faculty of Hydrotechnics, Technical University of Civil Engineering Bucharest, Lacul Tei, nr.122-124, 020396 Bucharest, Romania
*
Author to whom correspondence should be addressed.
Water 2023, 15(8), 1557; https://doi.org/10.3390/w15081557
Submission received: 28 March 2023 / Revised: 12 April 2023 / Accepted: 14 April 2023 / Published: 16 April 2023
(This article belongs to the Section Hydrology)

Abstract

:
This article analyzes six probability distributions from the Generalized Pareto family, with three, four and five parameters, with the main purpose of identifying other distributions from this family with applicability in flood frequency analysis compared to the distribution already used in the literature from this family such as Generalized Pareto Type II and Wakeby. This analysis is part of a larger and more complex research carried out in the Faculty of Hydrotechnics regarding the elaboration of a norm for flood frequency analysis using the linear moments method. In Romania, the standard method of parameter estimation is the method of ordinary moments, thus the transition from this method to the method of linear moments is desired. All the necessary elements for the distribution use are presented, such as the probability density functions, the complementary cumulative distribution functions, the quantile functions, and the exact and approximate relations for estimating parameters, for both methods of parameter estimation. All these elements are necessary for a proper transition between the two methods, especially since the use of the method of ordinary moments is done by choosing the skewness of the observed data depending on the origin of the maximum flows. A flood frequency analysis case study, using annual maximum and annual exceedance series, was carried out for the Prigor River to numerically present the analyzed distributions. The performance of this distribution is evaluated using a linear moments diagram.

1. Introduction

The frequency analysis of extreme events in hydrology is of particular importance tothe aim of determining the probability of occurrence of extreme events of a given magnitude.
Flood frequency analysis is important because it determines the maximum flow with certain exceeding probabilities; they have a defining role in the design of dams [1] and in water management [2], and can have significant impacts on human lives, infrastructure, and the environment.
Together with the distributions from the Gamma family and generalized extreme values, the Generalized Pareto Type 2 distribution (PGII) represents one of the most used distributions in flood frequency analysis [3,4,5], especially in the analysis of partial series using the Annual Exceedance Series (AES) or Peak Over Thresholds (POT).
Among the Generalized Pareto distributions analyzed in this article, the ones that received considerable attention in flood frequency analysis are the Generalized Pareto distribution Type II (PGII) using AES or POT [3,4,5], respectively, the Generalized Pareto distribution Type III (PGIII) and the Wakeby (WK5) distribution [4,6,7,8] in the analysis with the Annual Maximum Series (AMS). The PGIII is also known in the literature as the log-logistic distribution or generalized logistic distribution [4,6,7,9].
The PGII distribution has a broad use in the analysis of extreme events such as precipitation frequency analysis [10,11,12,13,14,15], low flow frequency analysis [16] and in flood frequency analysis [4,5,17,18,19,20].
Based on Rao and Hamed [4] and Hosking and Wallis [21], respectively, the Generalized Pareto distribution “is the logical choice for modeling flood magnitudes that exceed a fixed threshold when it is reasonable to assume that successive floods follow a Poisson process and have independent magnitudes”.
In the case of the Wakeby distribution, it is a quantile distribution (a distribution expressible in inverse form) that in certain situations [4,8,22] turns into the Generalized Pareto Type II distribution. Its application, using the close-form equations for the first four central moments, was realized for the first time by Anghel and Ilinca [22,23], both for the four and five parameters of Wakeby distributions.
Other distributions from the Generalized Pareto family have received little to no attention, such as the four parameters Generalized Pareto Type IV (PGIV4), the three parameters Generalized Pareto Type IV (PGIV3) and the Generalized Pareto Type I (PGI) also known as Pearson XI distribution.
Taking this into account, one of the objectives of the article consists of the analysis of the applicability of other distributions belonging to the same family of Generalized Pareto distribution in flood frequency analysis. For a comprehensive analysis, in this article, all the distributions belonging to this family are comparatively analyzed. Although some of these distributions have been used in the frequency analysis of extreme events in hydrology, this article brings new elements for these distributions that help to better understand and apply them in hydrology and beyond.
The research from this article is part of a larger and more complex research carried out in the Faculty of Hydrotechnics to identify the distributions from different families of distributions, which have applicability in frequency analysis of extreme values; partial results are presented in other materials [22,23,24].
In this article, the estimation methods of the parameters of these distributions are the method of ordinary moments (MOM) and the method of linear moments (L-moments); for some of them, it is necessary to solve nonlinear systems of equations, which leads to some difficulties in using these distributions. Thus, for the ease of applications of these distributions, parameter approximation relations are presented, using polynomial, exponential or rational functions.
Only these two methods of estimating parameters are analyzed in this article, because the MOM is the “parent” method in Romania, and the L-moments method is the method that is intended to be used in the new regulations regarding the analysis of extreme phenomena in hydrology, being much more stable and less biased to small lengths of data [3,4,8], as is the general case of hydrometry in Romania.
However, in recent years it has been demonstrated that this also requires certain corrections of both the statistical indicators and the parameters of the theoretical distributions obtained with this method [25,26].
All the mathematical elements necessary to use these distributions in the flood frequency analysis are presented.
New elements are presented, such as the first six raw and central moments for the PGII and PGI distributions; the relations for estimating the parameters with the L-moments for the PGIV4, PGIV3 and PGI distributions; new approximate parameter estimation relations for the PGII, PGI and PGIII distributions; the frequency factors for all analyzed distributions, both for the MOM and the L-moments and approximation relations of the frequency factors for the PGI, PGII and PGIII distributions.
Thus, all these novelty elements for these distributions presented in Table 1 will help hydrology researchers to better understand and easily apply these distributions.
The raw and central moments of the analyzed distributions were determined using the methodology presented in the Supplementary File, based on the probability density functions. It is for the first time that, for the Generalized Pareto distributions Type II and Type III, the raw and central moments up to order six are presented, important, along with the frequency factor, in establishing the confidence interval (for the MOM) using the Kite approximation [4]. In addition, for the first time, the frequency factor of these distributions is presented based on the L-moments method, an important aspect in determining the confidence interval using the Chow approximation [4], the latter was used in hydrology only based on the estimation of parameters with the MOM.
In order to verify the performances of the proposed distributions, a flood frequency analysis is carried out, using the Annual Maximum Series (AMS) and the Annual Exceedance Series (AES) for the Prigor River as a case study. All results are presented in comparison with the Pearson III distribution, which is the “parent” distribution in flood frequency analysis in Romania [22]. The purpose of this article was to identify other distributions from this family with applicability in flood frequency analysis compared to the distributions already used in the literature from these families such as the PGIII and PGII. This article does not exclude the applicability of other distributions from other families (Gamma, GEV, Beta) in flood frequency analysis, especially since these families were also analyzed within the research carried out in the Faculty of Hydrotechnics and are presented in other materials [22,23,24].
Comparing the results and choosing the best-fitted distribution is based on the L-skewness ( τ 3 ) and L-kurtosis ( τ 4 ) values and diagram. The values of the RME and RAE indicators are presented indicatively, knowing that they only properly evaluate the probability area of the observed values. Outside of this domain (left hand, upper part of the graph), they lose their relevance, because, in general, the data sets in Romania are not large enough (n > 80).
This article is organized as follows: The description of the statistical distributions by presenting the density function, the complementary cumulative function and the quantile function is in Section 2.1. The presentation of the relations for exact calculation and the approximate relations for determining the parameters of the distributions is in Section 2.2. Case studies by applying these distributions in flood frequency analysis for the Prigor River are in Section 3. Results, discussions and conclusions are in Section 3, Section 4 and Section 5.

2. Methods

The frequency analysis consists of determining the flows with certain exceedance probabilities using the AMS and the AES, respectively, with the Prigor River as a case study.
The series of maximum annual flows consists of choosing the maximum value corresponding to each year. In many cases, the lower maximum values of the annual data series do not always represent ”flood”. Thus, the use of frequency analysis using the AES is required, which allows secondary events, which exceed certain annual maximum values, which are to be considered as “flood”.
The AES was established by a descending sorting of all independent maximum values and choosing the first “n” values corresponding to the “n” number of years of analysis [14,15].
Using this criterion, it is important to verify that two or more maximum flow values do not come from the same flood. The independence of flows was verified and established based on the Cunnane criterion and the USWRC 1976 criterion, respectively [27].
The determination of the maximum flows was carried out in stages according to Figure 1. The verification of the character of outliers (Grubbs, Pilon, Quartile method), homogeneity and independence of flows was carried out in the data curation phase. No outliers were detected.
The estimation of the parameters of the analyzed distributions was done with MOM and L-moments. The MOM estimation has the disadvantage that for high-order moments and small data series, in many cases, it generates unrealistic values because the high-order moments require correction [4,5,28,29]. For skewness, the correction can be made using the Bobee relation [26] or, as is the practice in Romania [22,30,31,32,33], being established according to the origin of the maximum flows by multiplying the coefficient of variation ( C v ) by a coefficient reflecting this origin. In many cases, the choice is subjective and without rigor [22,34].
Considering that in many cases it is necessary to solve some systems of non-linear equations to estimate the parameters of the analyzed distributions, approximate relations for estimating the parameters were determined in the case of distributions where skewness and L-skewness depend on a single parameter. In addition, for a simplified and fast calculation that takes into account the fact that the inverse function can be expressed with the frequency factor for both MOM [4,23,34] and L-moments [23], the approximation relations of the factor of frequency (and the coefficients of these relations) are for the most frequent exceedance probabilities in the analysis of maximum flows. The estimation errors of both the parameters and the frequency factors are between 10−3 and 10−4.
The quantile results are compared with those of the Pearson III distribution, which is the “parent” distribution in Romania in the analysis of extreme events in hydrology, especially in the flood frequency analysis [22,30,31].

2.1. Probability Distributions

In Table 2 the probability density function, f ( x ) ; the complementary cumulative distribution function, F ( x ) , and the quantile function, x ( p ) , are presented for analyzed distributions [4,5,6,7,8,9,14,15,16,17]. All F ( x ) and x ( p ) of the analyzed distributions were determined using the methodology presented in the Supplementary file using only f ( x ) .

2.2. Parameter Estimation

The parameters estimation of the analyzed statistical distributions is presented for the MOM and the L-moments method, some of the most used methods in hydrology for parameter estimation [3,4,5,7,22].

2.2.1. Generalized Pareto Type IV (PGIV4)

The equations needed to estimate the parameters with MOM have the following expressions:
μ = γ + β α Γ ( α ) Γ ( λ α ) λ Γ ( λ )
σ 2 = β 2 λ Γ ( λ ) ( Γ ( 2 α + 1 ) Γ ( λ 2 α ) Γ ( α + 1 ) 2 Γ ( λ α ) 2 λ Γ ( λ ) )
Because they are too long, the relations for estimating the skewness and the kurtosis are presented in Appendix F.
The equations needed to estimate the parameters with L-moments have the following expressions:
L 1 = γ + β Γ ( λ α ) Γ ( 1 + α ) Γ ( λ )
L 2 = β ( Γ ( λ α ) Γ ( 1 + α ) Γ ( λ ) Γ ( 2 λ α ) Γ ( 1 + α ) Γ ( 2 λ ) )
L 3 = β ( Γ ( λ α ) Γ ( 1 + α ) Γ ( λ ) 3 Γ ( 2 λ α ) Γ ( 1 + α ) Γ ( 2 λ ) + 2 Γ ( 3 λ α ) Γ ( 1 + α ) Γ ( 3 λ ) )
L 4 = β ( Γ ( λ α ) Γ ( 1 + α ) Γ ( λ ) 6 Γ ( 2 λ α ) Γ ( 1 + α ) Γ ( 2 λ ) + 10 Γ ( 3 λ α ) Γ ( 1 + α ) Γ ( 3 λ ) 5 Γ ( 4 λ α ) Γ ( 1 + α ) Γ ( 3 λ ) )
where L 1 , L 2 , L 3 , L 4 represent the first four linear moments.

2.2.2. Generalized Pareto Type IV (PGIV3)

The distribution represents a particular case of the Pareto IV distribution when the position parameter γ = 0 . It is also known as the beta_p or Singh–Maddala distribution [9,35].
The equations needed to estimate the parameters with MOM have the following expressions:
μ = β Γ ( α + 1 ) Γ ( λ α ) Γ ( λ )
σ 2 = β 2 Γ ( λ ) ( Γ ( 2 α + 1 ) Γ ( λ 2 α ) Γ ( α + 1 ) 2 Γ ( λ α ) 2 Γ ( λ ) )
C s = Γ ( λ ) 2 Γ ( λ 3 α ) Γ ( 1 + 3 α ) + 2 Γ ( λ α ) 3 Γ ( 1 + α ) 3 3 Γ ( λ ) Γ ( λ 2 α ) Γ ( 1 + 2 α ) Γ ( λ α ) Γ ( 1 + α ) ( Γ ( λ ) Γ ( λ 2 α ) Γ ( 1 + 2 α ) Γ ( λ α ) 2 Γ ( 1 + α ) 2 ) 1.5
The equations needed to estimate the parameters with L-moments have the following expressions [35]:
L 1 = β Γ ( λ α ) Γ ( 1 + α ) Γ ( λ )
L 2 = β ( Γ ( λ α ) Γ ( 1 + α ) Γ ( λ ) Γ ( 2 λ α ) Γ ( 1 + α ) Γ ( 2 λ ) )
L 3 = β ( Γ ( λ α ) Γ ( 1 + α ) Γ ( λ ) 3 Γ ( 2 λ α ) Γ ( 1 + α ) Γ ( 2 λ ) + 2 Γ ( 3 λ α ) Γ ( 1 + α ) Γ ( 3 λ ) )

2.2.3. Generalized Pareto Type III (PGIII)

The equations needed to estimate the parameters with MOM have the following expressions [3,4]:
μ = γ + β Γ ( 1 + 1 α ) Γ ( 1 1 α )
σ 2 = β 2 ( Γ ( 1 + 2 α ) Γ ( 1 2 α ) Γ ( 1 + 1 α ) 2 Γ ( 1 1 α ) 2 )
C s = Γ ( 1 + 3 α ) Γ ( 1 3 α ) + 2 Γ ( 1 + 1 α ) 3 Γ ( 1 1 α ) 3 3 Γ ( 1 + 2 α ) Γ ( 1 2 α ) Γ ( 1 + 1 α ) Γ ( 1 1 α ) ( Γ ( 1 + 2 α ) Γ ( 1 2 α ) Γ ( 1 + 1 α ) 2 Γ ( 1 1 α ) 2 ) 1.5
The shape parameter can be obtained approximately depending on the skewness coefficient using the following function:
If 0.1 C s 7 :
α = exp ( 2 . 2463456 0 . 8505372 ln ( C s ) + 0 . 1230871 ln ( C s ) 2 + 0 . 0511417 ln ( C s ) 3 + 0 . 0024655 ln ( C s ) 4 0 . 006948 ln ( C s ) 5 0 . 0017015 ln ( C s ) 6 + 0 . 0004746 ln ( C s ) 7 + 0 . 0001676 ln ( C s ) 8 )
β = σ Γ ( 1 + 2 α ) Γ ( 1 2 α ) Γ ( 1 + 1 α ) 2 Γ ( 1 1 α ) 2
γ = μ β α Γ ( 1 α ) Γ ( 1 1 α )
The equations needed to estimate the parameters with L-moments have the following expressions:
α = 1 τ 3 = L 2 L 3
β = α 2 L 2 Γ ( 1 α ) Γ ( 1 1 α )
γ = L 1 β α Γ ( 1 α ) Γ ( 1 1 α )

2.2.4. Generalized Pareto Type II (PGII)

The equations needed to estimate the parameters with MOM have the following expressions [3,4,5,36]:
μ = γ + β α + 1
σ 2 = β 2 ( α + 1 ) 2 ( 2 α + 1 )
C s = α | α | 2 ( α 1 ) ( 2 α + 1 ) 0.5 3 α + 1
The shape parameter can be obtained approximately depending on the skewness coefficient using the following functions:
If C s < 2 :
α = 0 . 999946019 1 . 154837085 C s + 0 . 559584297 C s 2 0 . 152516332 C s 3 0 . 018234805 C s 4
If C s 2 :
α = 1 . 680766638 0 . 91819165 C s + 0 . 043608198 C s 2 0 . 002324901 C s 3 1 + 1 . 805341085 C s
β = σ ( α + 1 ) 2 α + 1
γ = μ β α + 1
The frequency factor (for MOM), presented in Appendix B, can be obtained approximately using a polynomial function of skewness and probability, whose coefficients are presented in Table A4 from Appendix D.
The equations needed to estimate the parameters with L-moments have the following expressions:
L 1 = γ + β α + 1
L 2 = β ( α + 1 ) ( α + 2 )
L 3 = β ( α 1 ) ( α + 1 ) ( α + 2 ) ( α + 3 )
Based on these equations, the parameters have the following expressions:
α = 1 3 τ 3 τ 3 + 1
β = L 2 ( α + 1 ) ( α + 2 )
γ = L 1 L 2 ( α + 2 )
The frequency factor (for L-moments) can be obtained approximately using a polynomial function depending on skewness and probability, whose coefficients are presented in Table A5 from Appendix D.

2.2.5. Generalized Pareto Type I (PGI)

The equations needed to estimate the parameters with MOM have the following expressions:
μ = γ + β α α 1
σ 2 = α β 2 ( α 1 ) 2 ( α 2 )
C s = α | α | 2 ( α + 1 ) ( 2 α 2 ) 0.5 ( α 3 ) α
The shape parameter can be obtained approximately depending on the skewness coefficient using the following functions:
If 0.1 C s 1.5 :
α = 1 . 00404609 2 . 127093299 C s 1 . 230376348 C s 2 0 . 100951553 C s 3 1 + 1 . 163661138 C s 0 . 532187406 C s 2
If 1.5 < C s < 2.8 :
α = 25953 . 828411268 70057 . 489315297 C s + 66202 . 507287974 C s 2 34489 . 86058943 C s 3 + 8475 . 284356191 C s 4 823 . 56972663 C s 5 1 + 2982 . 326081943 C s 1054 . 442301104 C s 2
β = σ ( α 1 ) α 2 α
γ = μ β α α 1
The equations needed to estimate the parameters with MOM have the following expressions:
L 1 = γ + β α α 1
L 2 = α β ( α 1 ) ( 2 α 1 )
L 3 = α β ( α + 1 ) ( α 1 ) ( 2 α 1 ) ( 3 α 1 )
Based on these equations, the parameters have the following expressions:
α = τ 3 + 1 3 τ 3 1
β = 2 L 2 3 4 L 2 ( 7 τ 3 5 ) 3 ( τ 3 + 1 ) ( 3 τ 3 1 )
γ = L 2 + 3 L 1 3 8 L 2 9 τ 3 3

2.2.6. The Five-Parameter Wakeby Distribution (WK5)

The equations needed to estimate the parameters with MOM have the following expressions:
μ = ξ + α β + 1 γ δ 1
σ 2 = α 2 ( β + 1 ) 2 ( 2 β + 1 ) 2 α γ ( β + 1 ) ( β + 1 δ ) ( δ 1 ) γ 2 ( δ 1 ) 2 ( 2 δ 1 )
The equations for skewness and kurtosis are presented in Appendix E.
The equations needed to estimate the parameters with L-moments have the following expressions [7,8]:
L 1 = ξ + α β + 1 + γ 1 δ
L 2 = α ( β + 1 ) ( β + 2 ) + γ ( 2 δ ) ( 1 δ )
L 3 = γ ( δ + 1 ) ( 1 δ ) ( 2 δ ) ( 3 δ ) + α ( 1 β ) ( β + 1 ) ( β + 2 ) ( β + 3 )
L 4 = γ ( δ + 1 ) ( δ + 2 ) ( 4 δ ) ( 3 δ ) ( 2 δ ) ( 1 δ ) + α ( 2 β ) ( 1 β ) ( β + 1 ) ( β + 2 ) ( β + 3 ) ( β + 4 )

3. Case Study

The presented case study consists in the determination of the maximum annual flows, on the Prigor River, Romania, using the proposed probability distributions.
The Prigor River is the left tributary of the Nera River, and it is located in the south-western part of Romania, as shown in Figure 2. The geographical coordinates of the location are 44°55′25.5″ N 22°07′21.7″ E.
The main morphometric characteristics of the river are presented in Table 3 [37].
In the section of the hydrometric station, the watershed area is 141 km2 and the average altitude is 729 m. The river has a length of 33 km, with an average slope of 22‰ and a sinuosity coefficient of 1.83.
There are 31 annual maximum flows; the values are presented in Table 4.
For the analysis with the AES, the maximum flows resulting from the selection are presented in Table 5.
The main statistical indicators of the data series are presented in Table 6.

4. Results

The proposed distributions from the Generalized Pareto family, were applied to perform a flood frequency analysis using the annual maximum series (AMS) and the annual exceedance series (AES) analysis, on the Prigor River.
The MOM and the L-moments method were used to estimate the parameters of the distributions. For the MOM, the skewness coefficient was chosen depending on the origin of the flows according to Romanian regulations [22,30] based on some multiplication coefficients for the coefficient of variation ( C v ). For the Prigor River, the multiplication coefficient 3, applied to the coefficient of variation, resulted in a skewness of 2.29 for AMS and 1.786 for AES. In Table 7 and Table 8 the resulted values of quantile distributions for some of the most common exceedance probabilities in flood frequency analysis are presented.
Figure 3 and Figure 4 show the fitting distributions for AMS and AES for the Prigor River. For plotting positions, the Alexeev formula was used [2].
In Figure S1, from the Supplementary Material, the confidence interval for each analyzed distribution is presented, both for the MOM and L-moments, using Chow’s relation [4,36] for a 95% confidence level.
Table 9 shows the values of the distributions parameters for the two methods of estimating and for both the AMS and AES.
Considering that it is desired to apply the distributions using the L-moments method, choosing the best distribution is based on the L-skewness ( τ 3 ) and L-kurtosis ( τ 4 ) values and diagram. The values of the RME and RAE indicators [38,39,40] are presented informatively, knowing that they are relevant only in the area of the observed data.
The distribution performance values are presented in Table 10 and Table 11. The values for the best-fitted model are highlighted in bold.

5. Discussions

In this article, the applicability of the distributions from the Generalized Pareto family in flood frequency analysis was analyzed using the Prigor River as a case study.
The analysis was performed using the AMS and AES. As can be seen both graphically (Figure 4) and tabularly (Table 7), the analysis with the AMS is more conservative than the analysis with the AES.
The main advantage of the AMS analysis is the ease of data selection, these being chosen as the maximum flow corresponding to each year. The disadvantage of the analysis is the use of maximum flows characteristic of each year, which do not always represent floods. These values located in the right-hand (high probabilities) lead to a steeper graph with higher values of quantiles in the field of low probabilities (left-hand).
The advantage of the analysis with AES is that the flows of the data series always represent floods. The disadvantage is the greater effort in data selection, through additional analyzes regarding data independence, respecting the condition that these maximum flows do not come from the same flood.
The estimation methods of the analyzed distribution parameters were performed for the MOM (standard method in Romania) and L-moments, two of the most used estimation methods in hydrology.
For the MOM analysis, the skewness was chosen depending on the origin of the flows, as is the hydrological practice in Romania. The use of multiplication coefficients for the calculation of the corrected skewness is an outdated method based on principles from the abrogated norms and a legacy from the USSR normative standards.
All analyzed distributions represent particular cases of the Generalized Pareto distribution, which are distributions of three and four parameters. The Wakeby distribution, which is a five-parameter distribution, was analyzed because it has as its particular case the PGII distribution, which is a quantile function whose structure is made up of two quantile functions of the PGII distribution.
All the results obtained in the case study are presented in comparison to the Pearson III distribution, which is considered the “parent” distribution in Romania, for the most used exceedance probabilities in hydrology.
Considering that it is desired to implement the L-moments method in Romania, according to Tabel 10 and Tabel 11, the best-fitted distributions are the PGIV4 and WK5 distributions, which best approximate the statistical indicators of the data set, τ 3 and τ 4 . The PGII and PGI distributions give satisfactory results because the natural values τ 3 and τ 4 of the distributions are close to those of the data sets.
For the PE3, PGIV3, PGIII, PGII and PGI, which are three parameter distributions, the resulting values are characterized by a high degree of uncertainty, especially in the area of small exceedance probabilities (left-hand), due to the fact that a proper calibration of the higher moments cannot be done. The confidence intervals for the analyzed distributions are presented in the Supplementary Material.
As observed in other materials [24], the apparent stability of the Pearson III distribution is due to the fact that the variation of the shape parameter for the two estimation methods does not differ much, except in the upper area of C s and τ 3 . The same cannot be said, for example, about the PGI, PGII and PGIII distributions, in which the variation is extremely large. Figure 5 shows the variation graph of the shape parameter for the PE3, PGIII, PGII and PGI distributions. As it could be observed in Section 2.1, both skewness and L-skewness depend only on the shape coefficient α .
The results of the quantiles obtained with the L-moments method, for the PGII and PGI distributions, both for the AES and AMS, presented in Table 4 and Table 5, are the same, the two distributions being mutual special cases. The WK5 and PGIV4 distributions best approximate, as expected, the values of the indicators obtained with L-moments, which are distributions of five and four parameters, respectively.
Figure 6 shows the τ 3 τ 4 variation diagram of the distributions as well as their relation to the values of the two indicators of the data sets.
Regarding the results obtained with MOM, for AES it can also be observed that the PGII and WK5 distributions have extremely close quantile values, which is due to the fact that for the same C s they correspond to the same value of C k , the WK5 distribution becoming the PGII distribution, as it was highlighted in other materials [4,8,9,22]. This is due to the choice of skewness based on the origin of the maximum flows. This is another disadvantage of using MOM in Romania.
Concerning the WK5 distribution, although it is a distribution that was introduced in flood frequency analysis to achieve the so-called “separation effect” described by Matalas [9,22], it can be seen that it is extremely sensitive depending on the analysis used (AMS or AES), and this is due to the particular cases in which it can take.
Figure 7 shows the skewness ( C s )-kurtozis ( C k ) variation diagram of the distributions.

6. Conclusions

The Generalized Pareto distribution represents a usual distribution used in the analysis of extreme events in hydrology. In flood frequency analysis this is especially applied using the partial series of maximum flows.
This article presents five distributions of three, four and five parameters that represent different forms of the Generalized Pareto distribution, some of them received limited attention in flood frequency analysis.
These distributions were analyzed (besides other families of distributions) in the research carried out in the Faculty of Hydrotechnics regarding the elaboration of a norm in Romania for the frequency analysis using the L-moments method.
The main purpose of the article was to identify other distributions from the same family that have applicability in flood frequency analysis using both AMS and AES.
For the transparency and ease of use of these distributions, all the necessary elements for their use are presented, such as the exact and approximate relations of the parameters’ estimation, and of the frequency factors, which eliminates the need for iterative numerical calculation, thus facilitating their applicability.
The performances of these distributions were verified in flood frequency analysis of the Prigor River, using the Annual Maximum Series and the Annual Exceedance Series.
The results were evaluated using the values of τ 3 and τ 4 , based on the τ 3 τ 4 diagram, compared to that of the data sets, which is the easiest and most accessible selection criterion.
Based on this study’s results, and also from the research carried out in the Faculty of Hydrotechnics for other sites, for flood frequency analysis and the L-moments estimation method, good candidates, from the Generalized Pareto family, are the PGIV4 and WK5 distributions, which are distributions of four and five parameters, which have the advantage that they can calibrate all linear moments.
Regarding the Wakeby distribution, this requires an additional analysis because in some cases it turns into the PGII distribution, which is a three-parameter distribution that does not achieve a satisfactory calibration of the linear moments.
In general, the three parameters distributions can be used in the analysis with L-moments, but the selection of their use must be made based on the τ 3 - τ 4 diagram so that the natural values τ 3 and τ 4 of the distribution to be very close to those of the observed data. Based on the work of Anghel si Ilinca [23], in Appendix A the τ 3 τ 4 diagram for a wide range of distributions used in hydrology is presented.
Mathematical support in statistical analysis is useful because the use of software (EasyFit, HEC-SSP, etc) without knowledge of mathematical foundations often leads to superficial analyzes with negative consequences. Another important aspect of the presentation of all the mathematical elements necessary for the application of these distributions is the fact that the software dedicated to statistical analysis is limited and does not offer the possibility of choosing the skewness coefficient depending on the origin of the maximum flows, as is the practice in Romania.
The research in this article is part of a more complex research carried out within the Faculty of Hydrotechnics, with the main aim of establishing the necessary guidelines for a robust, clear and concise norm regarding the determination of the maximum flow using the L-moment estimation method.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/w15081557/s1, Figure S1: The probability distributions curves with Confidence Intervals.

Author Contributions

Conceptualization, C.I. and C.G.A.; methodology, C.I. and C.G.A.; software, C.I. and C.G.A.; validation, C.I. and C.G.A.; formal analysis, C.I. and C.G.A.; investigation, C.I. and C.G.A.; resources, C.I. and C.G.A.; data curation, C.I. and C.G.A.; writing—original draft preparation, C.I. and C.G.A.; writing—review and editing, C.I. and C.G.A.; visualization, C.I. and C.G.A.; supervision, C.I. and C.G.A.; project administration, C.I. and C.G.A.; funding acquisition, C.I. and C.G.A. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

Abbreviations

MOMthe method of ordinary moments
L-momentsthe method of linear moments
μ expected value; arithmetic mean
σ standard deviation
C v coefficient of variation
C s coefficient of skewness; skewness
C k coefficient of kurtosis; kurtosis
L 1 , L 2 , L 3 linear moments
τ 2 , L C v coefficient of variation based on the L-moments method
τ 3 , L C s coefficient of skewness based on the L-moments method
τ 4 , L C k coefficient of kurtosis based on the L-moments method
Distr.Distributions
RMErelative mean error
RAErelative absolute error
xiobserved values

Appendix A. The Variation of the L-Kurtosis—L-Skewness

In the next section the variation of the L-kurtosis depending on the positive L-skewness, obtained with the L-moments method, for certain theoretical distributions often used in hydrology is presented.
Figure A1. The variation diagram of L C s L C k .
Figure A1. The variation diagram of L C s L C k .
Water 15 01557 g0a1

Appendix B. The Frequency Factors for the Analyzed Distributions

Table A1 shows the expressions of the frequency factors for MOM and L-moments.
Table A1. Frequency factors.
Table A1. Frequency factors.
Distribution Frequency   Factor ,   K p ( p )
Quantile Function (Inverse Function)
Method of Ordinary Moments (MOM)L-Moments
x ( p ) = μ + σ K p ( p ) x ( p ) = L 1 + L 2 K p ( p )
PGIV4 Γ ( λ ) ( p 1 λ 1 ) Γ ( α + 1 ) Γ ( λ α ) Γ ( 2 α + 1 ) Γ ( λ 2 α ) Γ ( λ ) Γ ( α + 1 ) 2 Γ ( λ α ) 2 ( p 1 λ 1 ) α Γ ( α + 1 ) Γ ( λ α ) Γ ( λ ) Γ ( α + 1 ) ( Γ ( λ α ) Γ ( λ ) Γ ( 2 λ α ) Γ ( 2 λ ) )
PGIV3 Γ ( λ ) Γ ( p 1 λ 1 ) α Γ ( α + 1 ) Γ ( λ α ) Γ ( 2 α + 1 ) Γ ( λ 2 α ) Γ ( λ ) Γ ( α + 1 ) 2 Γ ( λ α ) 2 Γ ( p 1 λ 1 ) α Γ ( α + 1 ) Γ ( λ α ) Γ ( λ ) Γ ( α + 1 ) ( Γ ( λ α ) Γ ( λ ) Γ ( 2 λ α ) Γ ( 2 λ ) )
PGIII ( 1 p 1 ) 1 α π α sin ( π α ) 2 π α sin ( 2 π α ) π 2 α 2 sin ( π α ) 2 ( 1 p p ) 1 α α 2 sin ( π α ) π α
PGII 1 2 α α ( 1 ( 1 + α ) p α ) 1 + 2 α ( 1 p α ) ( α + 3 ) p α
PGI α | α | ( α 1 ) α 2 ( ( 1 p ) 1 α α α 1 ) α α | α | ( α 1 ) ( 2 α 1 ) ( ( 1 p ) 1 α α α 1 ) α
WK5 α β ( 1 p β ) γ δ ( 1 p δ ) α β + 1 + γ δ 1 α 2 ( β + 1 ) 2 ( 2 β + 1 ) 2 α γ ( β + 1 ) ( β + 1 δ ) ( δ 1 ) γ 2 ( δ 1 ) 2 ( 2 δ 1 ) α β ( 1 p β ) γ δ ( 1 p δ ) α β + 1 γ 1 δ α ( β + 1 ) ( β + 2 ) + γ ( 2 δ ) ( 1 δ )

Appendix C. Estimation of the Frequency Factor for the PGIII Distribution

The frequency factor, for MOM, can be estimated using the following polynomial function:
K p ( p ) = a + b C s + c C s 2 + d C s 3 + e C s 4 + f C s 5 + g C s 6 + h C s 7
Table A2. The frequency factor for estimation with MOM.
Table A2. The frequency factor for estimation with MOM.
P [%]abcdefgh
0.015.1117372.3134091.34999−0.7760280.182704−0.02320911.5563 × 10−3−4.330 × 10−5
0.13.8089411.377403345331−0.3187520.0903167−0.01308439.7470 × 10−4−2.960 × 10−5
0.52.9126200.8120830.0170251−0.1146350.0394326−6.29710 × 10−34.9970 × 10−4−1.590 × 10−5
12.5272590.600436−0.0535487−0.05656750.02330153.98010 × 10−33.2800 × 10−41.070 × 10−5
22.1400310.411098−0.0911407−0.01472560.0107835−2.10400 × 10−31.8480 × 10−4−6.200 × 10−6
31.9114470.311395−0.1003722.88960 × 10−35.08580 × 10−3−1.21600 × 10−31.1520 × 10−4−4.100 × 10−6
51.6193240.197967−0.1007970.0185338−4.64200 × 10−4−3.15600 × 10−44.3000 × 10−5−1.700 × 10−6
101.2089500.066749−0.08478510.0291844−5.24780 × 10−35.25900 × 10−4−2.7400 × 10−56.000 × 10−7
200.763538−0.0362722−0.05311110.0286703−6.99220 × 10−39.32800 × 10−4−6.5700 × 10−51.900 × 10−6
400.224364−0.103222−6.80020 × 10−30.0162353−5.34190 × 10−38.40400 × 10−4−6.63000 × 10−52.100 × 10−6
500.001255−0.1118380.01191168.67100 × 10−3−3.76100 × 10−36.53200 × 10−4−5.44000 × 10−51.8000 × 10−6
80−0.762791−0.05475790.0593627−0.02073343.89270 × 10−3−4.16000 × 10−42.37000 × 10−5−6.000 × 10−7
90−1.2106840.04154550.0668030−0.0360128.85020 × 10−3−1.19170 × 10−38.48000 × 10−5−2.500 × 10−6
The frequency factor, for L-moments, can be estimated using the following rational function:
K p ( p ) = a + b τ 3 1 + c τ 3 + d τ 3 2 + e τ 3 3 + f τ 3 4 + g τ 3 5 + h τ 3 6
Table A3. The frequency factor for estimation with L-moments.
Table A3. The frequency factor for estimation with L-moments.
P
[%]
abcdefgh
0.019.0865−9.0870−5.682414.136−19.53215.682−6.88721.2851
0.16.8300−6.8367−4.37098.5137−9.36926.1000−2.21780.35120
0.55.3161−5.3440−3.28174.4099−2.2009−1.07661.79280.61653
14.6040−4.6529−2.91683.6177−1.7699−0.698991.2546−0.43830
23.8961−3.9826−2.52182.8156−1.2339−0.623200.99972−0.35026
33.4784−3.5994−2.28402.4127−1.0762−0.399960.73125−0.26304
52.9457−3.1313−1.96551.9096−0.81060−0.299060.55319−0.20219
102.1976−2.5343−1.49751.3114−0.53308−0.154030.33379−0.12411
201.3864−2.0186−0.959520.81735−0.27803−0.0748780.18663−0.059390
400.40582−0.40582−0.258790.385340.16327−0.042751−0.0620780.088377
501.934 × 10−4−1.65110.0331440.373840.198570.011770−0.0310910.064863
80−1.3869−0.52845−0.0957051.2009−0.428380.0475470.40683−0.21635
90−2.19771.0582−0.122650.70118−0.41310−0.118620.17678−0.084297

Appendix D. Estimation of the Frequency Factor for the PGII Distribution

The frequency factor, for MOM, can be estimated using the following polynomial function:
K p ( p ) = a + b C s + c C s 2 + d C s 3 + e C s 4 + f C s 5 + g C s 6 + h C s 7
Table A4. The frequency factor for estimation with MOM.
Table A4. The frequency factor for estimation with MOM.
P [%]abcdefgh
0.011.9320140.0619042.87112−0.7955860.04954181.03747 × 10−2−1.7871 × 10−37.910 × 10−5
0.11.8096090.6704112.12699−1.157320.284537−3.77979 × 10−22.6299 × 10−3−7.530 × 10−5
0.51.7044171.2132200.762347−0.6519340.198072−3.05909 × 10−22.3988 × 10−3−7.580 × 10−5
11.6644191.3147800.178438−0.3555640.125632−2.09008 × 10−21.7176 × 10−3−5.610 × 10−5
21.6221831.265729−0.290832−0.07642580.0500117−9.92700 × 10−38.9160 × 10−4−3.080 × 10−5
31.5902861.151049−0.4793050.05834120.0103575−3.85500 × 10−34.1630 × 10−4−1.570 × 10−5
51.5308880.909001−0.5988490.179061−0.02885912.47620 × 10−3−9.6700 × 10−59.000 × 10−7
101.3775620.424384−0.5224940.228357−0.05363287.14020 × 10−3−5.0690 × 10−41.490 × 10−5
201.047703−0.1346753−0.1979160.136073−0.03936595.98720 × 10−3−4.6780 × 10−41.480 × 10−5
400.355375−0.46608630.182363−0.03470852.20900 × 10−32.60000 × 10−4−4.8700 × 10−52.100 × 10−6
500.0051002−0.4281290.242055−0.07585340.0142094−1.57970 × 10−39.61000 × 10−5−2.5000 × 10−6
80−1.0429460.10950220.0744175−0.05461870.0156946−2.36020 × 10−31.82600 × 10−4−5.700 × 10−6
90−1.3897580.3823067−0.0662687−9.67300 × 10−36.70430 × 10−3−1.27020 × 10−31.09700 × 10−4−3.700 × 10−6
The frequency factor, for L-moments, can be estimated using the following polynomial function:
K p ( p ) = a + b τ 3 + c τ 3 2 + d τ 3 3 + e τ 3 4 + f τ 3 5 + g τ 3 6 + h τ 3 7 + i τ 3 8 + j τ 3 9
Table A5. The frequency factor for estimation with L-moments.
Table A5. The frequency factor for estimation with L-moments.
P [%]abcdefghij
0.012.897616.049−166.332107.2−1171141260−843641.0671 × 105−6846514604
0.12.99217.912221.16046.340207.03−394.801467.1−2026.5667.72
0.52.97517.073321.3205.345297.924−99.393−95.564−59.326
12.93896.874114.22310.89533.199−116.2447.113
22.86676.55983.879028.740−66.61423.564
32.82145.33686.02521.9543−31.30514.174
52.70603.99514.1471−13.220−2.69564.0738
102.40172.0082−1.6983−9.92527.8868−1.6725
201.7989−0.48676−4.26161.00721.7490−0.80770
400.60025−2.4049−0.312672.3727−1.69660.44147
503.8891 × 10−4−2.33231.39050.48092−0.813900.27475
80−1.80030.525691.0120−1.37410.86642−0.22985
90−2.40012.1286−0.969240.232520.041754−0.033573

Appendix E. The Skewness and Kurtosis for the WK5 Distribution

C s = 6 α γ 2 ( δ 2 δ β + δ 1 ) ( β + 1 ) ( β + 1 2 δ ) ( β + 1 δ ) ( δ 1 ) 2 ( 2 δ 1 ) + 6 α 2 γ ( β 2 1 β ( 1 + δ ) ) ( β + 1 ) 2 ( 2 β + 1 ) ( β + 1 δ ) ( δ 1 ) ( 2 β + 1 δ ) 2 γ 3 ( 1 + δ ) ( δ 1 ) 3 ( 1 + δ ( 6 δ 5 ) ) 2 α 3 ( β + 1 ) ( β + 1 ) 3 ( 1 + β ( 5 + 6 β ) ) ( α 4 ( β + 1 ) 2 ( 2 β + 1 ) γ 2 ( δ 1 ) 2 ( 2 δ 1 ) + 2 α λ ( β + 1 ) ( δ 1 ) ( δ β 1 ) ) 1.5
C k = 3 ( α 4 ( 2 β 2 β + 3 ) ( β + 1 ) 4 ( 24 β 3 + 26 β 2 + 9 β + 1 ) γ 4 ( 3 + δ + 2 δ 2 ) ( δ 1 ) 4 ( 4 δ 1 ) ( 1 + δ ( 6 δ 5 ) ) + 2 α 2 γ 2 ( 9 β ( 30 + β ( 29 + 4 β ( 2 + β ) ) ) + β δ ( 71 + β ( 43 + 2 β ( 7 + 4 β ) ) ) δ 2 ( 29 + β ( 43 + 4 ( 6 + 7 β ) ) ) + 2 δ 3 ( 4 + 7 β ( 1 + 2 β ) ) 4 δ 4 ( 1 + 2 β ) + 30 δ ) ( β + 1 ) 2 ( 2 β + 1 ) ( 1 + β 2 δ ) ( 1 + 2 β 2 δ ) ( 1 + β δ ) ( 1 + 2 β δ ) ( δ 1 ) 2 ( 2 δ 1 ) + 4 α γ 4 ( 3 + 2 β + β 2 δ ( 12 + β ( 3 + β ) ) δ 2 ( 11 + β + 2 β 2 ) 2 δ 3 ( 1 + 3 β ) 4 δ 4 ) ( 1 + β ) ( 1 + β 3 δ ) ( 1 + β 2 δ ) ( 1 + β δ ) ( δ 1 ) 3 ( 1 + δ ( 6 δ 5 ) ) + 4 α 3 γ ( 3 + 4 β 4 + β 3 ( 2 6 δ ) + δ ( δ 2 ) + β ( 12 + δ ( δ 3 ) ) + β 2 ( 11 + δ ( 2 δ 1 ) ) ) ( 1 + β ) 3 ( 1 + β ( 6 β + 5 ) ) ( 1 + β δ ) ( 1 + 2 β δ ) ( 1 + 3 β δ ) ( δ 1 ) ) ( γ 2 ( δ 1 ) 2 ( 2 δ 1 ) α 4 ( β + 1 ) 2 ( 2 β + 1 ) + 2 α λ ( β + 1 ) ( δ 1 ) ( δ β 1 ) ) 2

Appendix F. The Skewness and Kurtosis for the PGIV Distribution

C s = Γ ( 1 + 3 α ) Γ ( λ 3 α ) λ Γ ( λ ) + 2 Γ ( 1 + α ) 3 Γ ( λ α ) 3 Γ ( λ + 1 ) 3 3 α Γ ( 1 + 2 α ) Γ ( λ 2 α ) Γ ( α ) Γ ( λ α ) Γ ( λ + 1 ) 2 ( Γ ( 1 + 2 α ) Γ ( λ 2 α ) λ Γ ( λ ) Γ ( 1 + α ) 2 Γ ( λ α ) 2 Γ ( λ + 1 ) 2 ) 3 / 2
C k = Γ ( 1 + 4 α ) Γ ( λ 4 α ) Γ ( λ + 1 ) 4 Γ ( 1 + 3 α ) Γ ( λ 3 α ) Γ ( 1 + α ) Γ ( λ α ) Γ ( λ + 1 ) 2 + 6 Γ ( 1 + 2 α ) Γ ( λ 2 α ) Γ ( 1 + α ) 2 Γ ( λ α ) 2 Γ ( λ + 1 ) 3 3 Γ ( 1 + α ) 4 Γ ( λ α ) 4 Γ ( λ + 1 ) 4 ( Γ ( 1 + 2 α ) Γ ( λ 2 α ) Γ ( λ + 1 ) Γ ( 1 + α ) 2 Γ ( λ α ) 2 Γ ( λ + 1 ) 2 ) 2

References

  1. Popovici, A. Dams for Water Accumulations; Technical Publishing House: Bucharest, Romania, 2002; Volume 2. [Google Scholar]
  2. Teodorescu, I.; Filotti, A.; Chiriac, V.; Ceausescu, V.; Florescu, A. Water Management; Ceres Publishing House: Bucharest, Romania, 1973. [Google Scholar]
  3. Hosking, J.R.M.; Wallis, J.R. Regional Frequency Analysis: An Approach Based on L-moments; Cambridge University Press: Cambridge, UK, 1997. [Google Scholar] [CrossRef]
  4. Rao, A.R.; Hamed, K.H. Flood Frequency Analysis; CRC Press LLC: Boca Raton, FL, USA, 2000. [Google Scholar]
  5. Singh, V.P. Entropy-Based Parameter Estimation in Hydrology; Springer Science + Business Media: Dordrecht, The Netherlands, 1998; ISBN 978-90-481-5089-2/978-94-017-1431-0. [Google Scholar] [CrossRef]
  6. Gubareva, T.S.; Gartsman, B.I. Estimating Distribution Parameters of Extreme Hydrometeorological Characteristics by L-Moment Method. Water Resour. 2010, 37, 437–445. [Google Scholar] [CrossRef]
  7. Greenwood, J.A.; Landwehr, J.M.; Matalas, N.C.; Wallis, J.R. Probability Weighted Moments: Definition and Relation to Parameters of Several Distributions Expressable in Inverse Form. Water Resour. Res. 1979, 15, 1049–1054. [Google Scholar] [CrossRef]
  8. Houghton, J.C. Birth of a parent: The Wakeby distribution for modeling flood flows. Water Resour. Res. 1978, 14, 1105–1109. [Google Scholar] [CrossRef]
  9. Crooks, G.E. Field Guide to Continuous Probability Distributions; Berkeley Institute for Theoretical Science: Berkeley, CA, USA, 2019. [Google Scholar]
  10. Yang, T.; Shao, Q.X.; Hao, Z.C.; Chen, X.; Zhang, Z.X.; Xu, C.Y.; Sun, L.M. Regional frequency analysis and spatio-temporal pattern characterization of rainfall extremes in the Pearl River Basin, China. J. Hydrol. 2010, 380, 386–405. [Google Scholar] [CrossRef]
  11. Zakaria, Z.A.; Shabri, A. Regional frequency analysis of extreme rainfalls using partial L-moments method. Theor. Appl. Climatol. 2013, 113, 83–94. [Google Scholar] [CrossRef]
  12. Zhou, C.R.; Chen, Y.F.; Huang, Q.; Gu, S.H. Higher moments method for generalized Pareto distribution in flood frequency analysis. IOP Conf. Ser. Earth Environ. Sci. 2017, 82, 012031. [Google Scholar] [CrossRef]
  13. Martins, A.L.A.; Liska, G.R.; Beijo, L.A.; Menezes, F.S.D.; Cirillo, M.Â. Generalized Pareto distribution applied to the analysis of maximum rainfall events in Uruguaiana, RS, Brazil. SN Appl. Sci. 2020, 2, 1479. [Google Scholar] [CrossRef]
  14. Ciupak, M.; Ozga-Zielinski, B.; Tokarczyk, T.; Adamowski, J. A Probabilistic Model for Maximum Rainfall Frequency Analysis. Water 2021, 13, 2688. [Google Scholar] [CrossRef]
  15. Shao, Y.; Zhao, J.; Xu, J.; Fu, A.; Wu, J. Revision of Frequency Estimates of Extreme Precipitation Based on the Annual Maximum Series in the Jiangsu Province in China. Water 2021, 13, 1832. [Google Scholar] [CrossRef]
  16. Ashkar, F.; Ouarda, T.B. On some methods of fitting the generalized Pareto distribution. J. Hydrol. 1996, 177, 117–141. [Google Scholar] [CrossRef]
  17. Mohsen, S.; Zulkifli, Y.; Fadhilah, Y. Comparison of Distribution Models for Peak flow, Flood Volume and Flood Duration. Res. J. Appl. Sci. Eng. Technol. 2013, 6, 733–738. [Google Scholar]
  18. Swetapadma, S.; Ojha, C.S.P. Technical Note: Flood frequency study using partial duration series coupled with entropy principle. Hydrol. Earth Syst. Sci. Discuss. 2021. preprint. [Google Scholar] [CrossRef]
  19. Rahman, A.S.; Rahman, A.; Zaman, M.A.; Haddad, K.; Ahsan, A.; Imteaz, M. A study on selection of probability distributions for at-site flood frequency analysis in Australia. Nat. Hazards 2013, 69, 1803–1813. [Google Scholar] [CrossRef]
  20. Drissia, T.K.; Jothiprakash, V.; Anitha, A.B. Flood Frequency Analysis Using L Moments: A Comparison between At-Site and Regional Approach. Water Resour. Manag. 2019, 33, 1013–1037. [Google Scholar] [CrossRef]
  21. Hosking, J.R.M.; Wallis, J.R. Parameter and Quantile Estimation for the Generalized Pareto Distribution. Technometrics 1987, 29, 339–349. [Google Scholar] [CrossRef]
  22. Ilinca, C.; Anghel, C.G. Flood-Frequency Analysis for Dams in Romania. Water 2022, 14, 2884. [Google Scholar] [CrossRef]
  23. Anghel, C.G.; Ilinca, C. Hydrological Drought Frequency Analysis in Water Management Using Univariate Distributions. Appl. Sci. 2023, 13, 3055. [Google Scholar] [CrossRef]
  24. Anghel, C.G.; Ilinca, C. Parameter Estimation for Some Probability Distributions Used in Hydrology. Appl. Sci. 2022, 12, 12588. [Google Scholar] [CrossRef]
  25. Viglione, A.; Merz, R.; Salinas, J.L.; Blöschl, G. Flood frequency hydrology: 3. A Bayesian analysis. Water Resour. Res. 2013, 49, 675–692. [Google Scholar] [CrossRef]
  26. Gaume, E. Flood frequency analysis: The Bayesian choice. WIREs Water. 2018, 5, e1290. [Google Scholar] [CrossRef]
  27. Lang, M.; Ouarda, T.B.; Bobée, B. Towards operational guidelines for over-threshold modeling. J. Hydrol. 1999, 225, 103–117. [Google Scholar] [CrossRef]
  28. Bulletin 17B Guidelines for Determining Flood Flow Frequency; U.S. Department of the Interior, U.S. Geological Survey: Reston, VA, USA, 1981.
  29. Bulletin 17C Guidelines for Determining Flood Flow Frequency; U.S. Department of the Interior, U.S. Geological Survey: Reston, VA, USA, 2017.
  30. STAS 4068/1-82; Maximum Water Discharges and Volumes, Determination of Maximum Water Discharges and Volumes of Watercourses. The Romanian Standardization Institute: Bucharest, Romania, 1982.
  31. Diacon, C.P. Serban Hydrological Syntheses and Regionalizations; Technical Publishing House: Bucharest, Romania, 1994. [Google Scholar]
  32. Mandru, R.; Ioanitoaia, H. Ameliorative Hydrology; Agro-Silvica Publishing House: Bucharest, Romania, 1962. [Google Scholar]
  33. Constantinescu, M.; Golstein, M.; Haram, V.; Solomon, S. Hydrology; Technical Publishing House: Bucharest, Romania, 1956. [Google Scholar]
  34. Ministry of Regional Development and Tourism. The Regulations Regarding the Establishment of Maximum Flows and Volumes for the Calculation of Hydrotechnical Retention Constructions; Indicative NP 129-2011; Ministry of Regional Development and Tourism: Bucharest, Romania, 2012.
  35. Murshed, M.S.; Park, B.J.; Jeong, B.Y.; Park, J.S. LH-Moments of Some Distributions Useful in Hydrology. In Communications for Statistical Applications and Methods; The Korean Statistical Society: Seoul, Republic of Korea, 2009. [Google Scholar] [CrossRef]
  36. Chow, V.T.; Maidment, D.R.; Mays, L.W. Applied Hydrology; McGraw-Hill, Inc.: New York, NY, USA, 1988; ISBN 007-010810-2. [Google Scholar]
  37. Ministry of the Environment. The Romanian Water Classification Atlas, Part I—Morpho-Hydrographic Data on the Surface Hydrographic Network; Ministry of the Environment: Bucharest, Romania, 1992.
  38. Singh, K.; Singh, V.P. Parameter Estimation for Log-Pearson Type III Distribution by Pome. J. Hydraul. Eng. 1988, 1, 112–122. [Google Scholar] [CrossRef]
  39. Shaikh, M.P.; Yadav, S.M.; Manekar, V.L. Assessment of the empirical methods for the development of the synthetic unit hydrograph: A case study of a semi-arid river basin. Water Pract. Technol. 2021, 17, 139–156. [Google Scholar] [CrossRef]
  40. Gu, J.; Liu, S.; Zhou, Z.; Chalov, S.R.; Zhuang, Q. A Stacking Ensemble Learning Model for Monthly Rainfall Prediction in the Taihu Basin, China. Water 2022, 14, 492. [Google Scholar] [CrossRef]
Figure 1. Methodological approach.
Figure 1. Methodological approach.
Water 15 01557 g001
Figure 2. The location of Prigor River and Prigor hydrometric station.
Figure 2. The location of Prigor River and Prigor hydrometric station.
Water 15 01557 g002
Figure 3. The fitting distributions for AMS.
Figure 3. The fitting distributions for AMS.
Water 15 01557 g003
Figure 4. The fitting distributions for AES.
Figure 4. The fitting distributions for AES.
Water 15 01557 g004
Figure 5. The variation of parameter α .
Figure 5. The variation of parameter α .
Water 15 01557 g005aWater 15 01557 g005b
Figure 6. The τ 3 τ 4 diagram for analyzed distributions.
Figure 6. The τ 3 τ 4 diagram for analyzed distributions.
Water 15 01557 g006
Figure 7. The C s C k diagram for the analyzed distributions.
Figure 7. The C s C k diagram for the analyzed distributions.
Water 15 01557 g007
Table 1. Novelty elements.
Table 1. Novelty elements.
New ElementsDistribution
Exact parameter estimationPGIV4, PGI
Approximate estimation of parametersPGIII, PGII, PGI
The frequency factor for MOMPGIV4, PGIV3, PGIII, PGII, PGI, WK5
The frequency factor for L-momentsPGIV4, PGIV3, PGIII, PGII, PGI, WK5
Approximate estimation of the frequency factorPGIII, PGII
Raw and central moments *PGIV, PGIII, PGII
* are presented in the Supplementary File.
Table 2. The analyzed probability distributions.
Table 2. The analyzed probability distributions.
Distr. f ( x ) F ( x ) x ( p )
PGIV4 λ ( x γ β ) 1 α 1 ( ( x γ β ) 1 α + 1 ) λ 1 α β ( ( x γ β ) 1 α + 1 ) λ γ + β ( 1 p 1 λ 1 ) α
PGIV3 λ ( x β ) 1 α 1 ( ( x β ) 1 α + 1 ) λ 1 α β 1 ( ( x β ) 1 α + 1 ) λ β ( 1 p 1 λ 1 ) α
PGIII α ( x γ β ) α 1 ( ( x γ β ) α + 1 ) 2 β ( 1 + ( x γ β ) α ) 1 γ + β ( 1 p 1 ) 1 α
PGII 1 β ( 1 α β ( x γ ) ) 1 α 1 ( 1 α β ( x γ ) ) 1 α γ + β α ( 1 p α )
PGI α β ( x γ β ) α 1 ( x γ β ) α γ + β p 1 α
WK5No closed formNo closed form ξ + α β ( 1 p β ) γ δ ( 1 p δ )
Table 3. The morphometric characteristics.
Table 3. The morphometric characteristics.
Length
[km]
Average
Stream Slope [‰]
Sinuosity
Coefficient [-]
Average
Altitude [m]
Watershed
Area [km2]
33221.83713153
Table 4. The AMS from the Prigor hydrometric station.
Table 4. The AMS from the Prigor hydrometric station.
AMS
19901991199219931994199519961997199819992000
Flow[m3/s]9.961510.114.87.3021.218.221.413.114.535
20012002200320042005200620072008200920102011
Flow[m3/s]19.922.111.880.38851.672.216.242.628.512.8
201220132014201520162017201820192020
Flow[m3/s]31.224.152.221.118.96.4024.915.136.6
Table 5. The AES from the Prigor hydrometric station.
Table 5. The AES from the Prigor hydrometric station.
AES
19951996199720002001200220042004200420052005
Flow[m3/s]21.218.221.43519.922.180.322.219.28838.9
20052005200620072007200720082009200920102012
Flow[m3/s]2417.551.672.233.815.916.242.623.128.531.2
201220122013201420152016201620182020
Flow[m3/s]27.318.724.152.221.118.916.824.936.6
Table 6. The statistical indicators of the data series.
Table 6. The statistical indicators of the data series.
Prigor RiverStatistical Indicators
μ σ C v C s C k L 1 L 2 L 3 L 4 τ 2 τ 3 τ 4
[m3/s][m3/s][-][-][-][m3/s][m3/s][m3/s][m3/s][-][-][-]
AMS27.621.10.7621.665.1627.610.74.262.430.3860.3990.228
AES31.718.90.5951.835.7731.79.304.222.140.2930.4540.230
μ , σ , C v , C s , C k , L 1 , L 2 , L 3 , L 4 , τ 2 , τ 3 , τ 4 represent the mean, the standard deviation, the coefficient of variation, the skewness, the kurtosis, the four L-moments, the L-coefficient of variation, the L-skewness and the L-kurtosis, respectively. For parameter estimation with the L-moments, the data series must be in ascending order for the calculation of natural estimators and L-moments, respectively.
Table 7. Quantile results of the analyzed distributions for AMS.
Table 7. Quantile results of the analyzed distributions for AMS.
DistributionAnnual Maximum Series (AMS)
Exceedance Probabilities [%]
MOML-Moments
0.010.10.51235800.010.10.5123580
PE321416012310790.781.569.512.023117213011395.485.372.711.4
PGIV426516611810084.776.266.111.336421714511996.984.970.911.7
PGIV326016611810185.576.966.611.381332316912896.882.166.512.6
PGIII27916911798.782.774.264.312.080032016812895.782.166.612.5
PGII22516212210689.680.469.111.832920714211896.885.171.411.7
PGI17113811210087.68070.210.532920714211886.885.171.411.7
WK522716312210689.480.369.0011.835821614512097.085.071.011.7
Table 8. Quantile results of the analyzed distributions for AES.
Table 8. Quantile results of the analyzed distributions for AES.
DistributionAnnual Exceedance Series (AES)
Exceedance Probabilities [%]
MOML-Moments
0.010.10.51235800.010.10.5123580
PE317813811097.785.478.269.116.623317213011395.385.372.918.0
PGIV423315010893.580.373.264.917.227719013711696.085.172.018.1
PGIV321914610894.381.674.666.116.683632717012896.481.766.419.2
PGIII23215010993.780.473.264.717.494033816812694.480.165.318.8
PGII17013611098.086.078.969.716.645224015012196.483.969.918.3
PGI13411710192.683.577.770.015.645224015012196.483.969.918.3
WK515012810898.187.380.571.417.020216413011496.786.673.718.2
Table 9. Parameter values of each distribution for MOM and L-moments.
Table 9. Parameter values of each distribution for MOM and L-moments.
Distri.AMSAES
MOML-MomentsMOML-Moments
α β γ λ δ ξ α β γ λ δ ξ α β γ λ δ ξ α β γ λ δ ξ
PE30.76624.19.2---0.69426.99.0---1.25416.910.6---0.52628.216.9---
PGIV40.50547.4−2.512.66--0.94080.07.475.19--0.22249.3−171.26--1.29164916.242.9--
PGIV30.5852.7-3.27--0.3720.3-0.92--0.43146.7-2.55--0.1519.6-0.36--
PGIII5.2352.9−28.5---2.5120.30.90---6.07557.1−28---2.214.211.2---
PGII−0.04219.37.5---−0.1417.17.78---0.03920.412.1---−0.2512.215.4---
PGI−15.4−367373---7.14122−114---−6.735−166176---4.02349.2−33.8---
WK53.3174.01419-0.0477.0572.822.0916-0.177.56−20.51.8227.5-−0.13314.7−5080.28515-−0.2516.33
Table 10. Distribution performance values for AMS.
Table 10. Distribution performance values for AMS.
Distr.Statistical Measures
Methods of Parameters EstimationAES Values
MOML-Moments
RMERAERMERAE τ 3 τ 4 τ 3 τ 4
PE30.02380.08790.02240.09020.3990.1920.3990.228
PGIV40.03520.15740.01840.07720.228
PGIV30.03000.13940.01830.07500.303
PGIII0.05330.20810.01810.07360.299
PGII0.02280.10470.01900.07870.221
PGI0.04700.23270.06460.26710.221
WK50.01840.08070.01850.07750.228
Table 11. Distribution performance values for AES.
Table 11. Distribution performance values for AES.
Distr.Statistical Measures
Methods of Parameters EstimationAES Values
MOML-Moments
RMERAERMERAE τ 3 τ 4 τ 3 τ 4
PE30.02190.10630.00930.04050.4540.2200.4540.230
PGIV40.03950.17360.00940.03760.230
PGIV30.03470.15560.01560.06800.348
PGIII0.03990.17470.01530.06560.338
PGII0.01950.09670.01080.03940.272
PGI0.02690.13130.01080.03940.272
WK50.01370.06890.00860.03590.230
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Anghel, C.G.; Ilinca, C. Evaluation of Various Generalized Pareto Probability Distributions for Flood Frequency Analysis. Water 2023, 15, 1557. https://doi.org/10.3390/w15081557

AMA Style

Anghel CG, Ilinca C. Evaluation of Various Generalized Pareto Probability Distributions for Flood Frequency Analysis. Water. 2023; 15(8):1557. https://doi.org/10.3390/w15081557

Chicago/Turabian Style

Anghel, Cristian Gabriel, and Cornel Ilinca. 2023. "Evaluation of Various Generalized Pareto Probability Distributions for Flood Frequency Analysis" Water 15, no. 8: 1557. https://doi.org/10.3390/w15081557

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop