Next Article in Journal
Nonlinear Complex Wave Excitations in (2+1)-Dimensional Klein–Gordon Equation Investigated by New Wave Transformation
Previous Article in Journal
A Domain Generation Diagnosis Framework for Unseen Conditions Based on Adaptive Feature Fusion and Augmentation
Previous Article in Special Issue
Extreme Treatment Effect: Extrapolating Dose-Response Function into Extreme Treatment Domain
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

A New Class of Reduced-Bias Generalized Hill Estimators

by
Lígia Henriques-Rodrigues
1,*,†,
Frederico Caeiro
2,† and
M. Ivette Gomes
3,†
1
School of Science and Technology and Research Center in Mathematics and Applications (CIMA), University of Évora, 7000-671 Évora, Portugal
2
NOVA School of Science and Technology (NOVA FCT) and Center for Mathematics and Applications (CMA), NOVA University Lisbon, 2829-516 Caparica, Portugal
3
Department of Statistics and Operational Research (DEIO), Faculty of Sciences of the University of Lisbon (FCUL) and Centre of Statistics and its Applications (CEAUL), University of Lisbon (ULisboa), 1749-016 Lisbon, Portugal
*
Author to whom correspondence should be addressed.
These authors contributed equally to this work.
Mathematics 2024, 12(18), 2866; https://doi.org/10.3390/math12182866 (registering DOI)
Submission received: 31 July 2024 / Revised: 4 September 2024 / Accepted: 10 September 2024 / Published: 14 September 2024
(This article belongs to the Special Issue Computational Statistical Methods and Extreme Value Theory)

Abstract

:
The estimation of the extreme value index (EVI) is a crucial task in the field of statistics of extremes, as it provides valuable insights into the tail behavior of a distribution. For models with a Pareto-type tail, the Hill estimator is a popular choice. However, this estimator is susceptible to bias, which can lead to inaccurate estimations of the EVI, impacting the reliability of risk assessments and decision-making processes. This paper introduces a novel reduced-bias generalized Hill estimator, which aims to enhance the accuracy of EVI estimation by mitigating the bias.

1. Introduction

Statistics of extremes is a branch of statistics dealing with inference for extreme or rare events. Due to their unpredictability, such events are a matter of enormous concern since they can produce catastrophic consequences. The areas of application are very diversified. We mention the fields of environment (extreme pollution), hydrology (river flows), meteorology (heat waves, windstorms), geology (diamond sizes), insurance (large insurance claims), telecommunications (internet traffic), public health (rates of Pneumonia), and finance (stock market crash), among others.
Extreme value theory provides the theoretical framework for building the statistical model describing extreme events. The first fundamental limit results appeared in Fréchet [1], Fisher and Tippett [2], von Mises [3], and Gnedenko [4]. More specifically, let X 1 , X 2 denote a sequence of independent and identically distributed random variables with a common distribution function (d.f.) F ( x ) = P ( X x ) . Assume that there exist sequences of normalizing constants a n > 0 , and b n R , and a non-degenerate d.f. G such that, for all x,
lim n P max ( X 1 , X 2 , , X n ) b n a n x = G ( x ) .
Then, we say that the d.f. F belongs to the max-domain of attraction of the extreme value (EV) distribution G, and we write F D M ( G ) . The sequences of constants a n > 0 and b n R can be redefined in such a way that for some ξ R ,
G ( x ) G ξ ( x ) : = exp ( 1 + ξ x ) 1 / ξ , 1 + ξ x > 0 if ξ 0 exp ( exp ( x ) ) , x R if ξ = 0 .
The shape parameter ξ is the extreme value index (EVI), the most important parameter of extreme events, which is related to the heaviness of the right tail. It is also a cornerstone in the estimation of other parameters of extreme events (see [5,6,7,8,9,10], among others). For more details on the max domain of attraction, see Beirlant et al. [11], de Haan and Ferreira [12], and Dey and Yan [13], among others.
We consider in the present paper Pareto-type tails, i.e., ξ > 0 in the EV d.f. in (2). In mathematical terms, this is equivalent to assuming that the right tail or survival function 1 F satisfies
1 F ( x ) = x 1 / ξ L ( x ) , x ,
where L ( x ) is an unspecified slowly varying function at infinity, i.e., lim L ( t x ) / L ( t ) = 1 , as t , for all x > 0 (see references [12,14,15] for details on regular variation). This condition is usually called the first-order condition. If the slowly varying function L is constant, then F is the exact Pareto model. Equivalently, F is a model with a Pareto-type tail whenever the reciprocal tail quantile function U, defined by
U ( t ) = F ( 1 1 / t ) , t 1 , with F ( x ) = inf { y : F ( y ) x } ,
is of regular variation with index ξ (denoted as U R V ξ ), i.e., for every x > 0 ,
lim t U ( t x ) U ( t ) = x ξ .
For ξ > 0 , the most well-known EVI-estimator is the Hill estimator [16], which is the average of the k log-excesses,
ξ ^ H ( k ) : = 1 k i = 1 k V i k , 1 k < n ,
with
V i k : = ln X n i + 1 : n ln X n k : n ,
where X 1 : n X 2 : n X n : n denote the ascending-order statistics from a random sample of size n. The consistency of the Hill estimator is achieved if the threshold X n k : n is an intermediate order statistic [17], i.e., if
k = k ( n ) a n d k / n 0 , a s n .
The asymptotic normality of Hill’s estimator has been studied, under different assumptions, by several authors (see [18,19,20], among others). Typically, F is a Pareto-type model and the Hill estimator has a high variance for high thresholds X n k : n , i.e., for small values of k and a high bias for low thresholds, i.e., for large values of k. Consequently, the mean squared error (MSE) has a very peaked pattern, making it difficult to determine the optimal k, defined as the value k 0 where the MSE reaches its minimal value. For a detailed review on estimation procedures for the EVI, see [9,21]. To overcome this trade-off problem between bias and variance, several alternative estimators are proposed in the literature. Some of these alternative estimators are based on the α -moment of the log-excesses
M n ( α ) ( k ) : = 1 k i = 1 k V i k α .
We mention the class of kernel estimators introduced in Csörgo et al. [22], the moment estimator proposed by Dekkers et al. [23],
ξ ^ M ( k ) = M n ( 1 ) ( k ) + 1 1 2 1 ( M n ( 1 ) ( k ) ) 2 M n ( 2 ) ( k ) 1 ,
and the Jackknife estimators proposed in Gomes et al. [24]. Generalizations of the Hill estimator, dependent on an additional tuning parameter, can be found in Gomes and Martins [25], Groeneboom et al. [26], Beran et al. [27], Penalva et al. [28], Caeiro et al. [29], and Paulauskas and Vaičiulis [30], among others. For a comparison between several classic EVI-estimators, we refer to [31,32].
In this paper, we consider one of the aforementioned classes of generalized Hill (GH) estimators. More precisely, we consider the class of GH estimators in [25,33] based on the power mean (PM) of the normalized log-excesses,
ξ ^ PM ( α ) ( k ) = M n ( α ) ( k ) Γ ( α + 1 ) 1 / α , α > 0 , 1 k < n .
The Hill estimator is obtained by taking α = 1 in (10) ( ξ ^ H ( k ) = ξ ^ PM ( 1 ) ( k ) ). To reduce the bias of many classic EVI-estimators, a slightly more restrictive condition than (4) is needed. More specifically, we assume that the quantile function U belongs to a subset of Hall’s class [34,35],
U ( t ) = C t ξ 1 + ξ β t ρ / ρ + β t 2 ρ + o t 2 ρ ,
as t , with β , β 0 , and ρ < 0 . This subset of Hall’s class includes several well-known Pareto-type models, such as the Burr, Fréchet, Generalized Pareto, the Loglogistic, and the Student’s t. Here, we introduce a new class of reduced-bias (RB) EVI-estimators, still dependent on a tuning parameter,
ξ ^ RBPM ( α ) ( k ) = ξ ^ PM ( α ) ( k ) 1 1 ( 1 ρ ^ ) α α ρ ^ ( 1 ρ ^ ) α β ^ n k ρ ^ , α > 0 , 1 k < n ,
with ( β ^ , ρ ^ ) adequate estimators of the second-order parameters ( β , ρ ) defined in (11). This class of RB estimators generalizes the well-known minimum-variance reduced-bias (MVRB) corrected Hill (CH) estimators introduced in Caeiro et al. [36]. More precisely, if α = 1 in (12),
ξ ^ CH ( k ) ξ ^ RBPM ( 1 ) ( k ) : = ξ ^ H ( k ) 1 β ^ 1 ρ ^ n k ρ ^ , 1 k < n ,
with ξ ^ H ( k ) the Hill estimator in (5). Other types of RB RVI-estimators based on the Hill estimator can be found in [37,38], among others.
The remainder of this paper is organized as follows. Section 2 is devoted to the asymptotic behavior of the EVI-estimators under study. Under a third-order framework, the asymptotic properties of the PM EVI-estimator, in (10), are presented. Next, the asymptotic non-degenerate behavior of the RBPM EVI-estimator, in (12), is derived, assuming first that the second-order parameters ( β , ρ ) are known and then that they are adequately estimated in the high level k 1 of a larger order than k, the number of order statistics used in the EVI-estimation. In Section 3, an asymptotic comparison at optimal levels of the new reduced-bias estimators is performed. The finite sample properties of the new estimators are assessed through a Monte Carlo simulation study presented in Section 4. Section 5 is devoted to illustrating the behavior of the estimators under study to a hydrological data set, and finally, in Section 6, the main conclusions of this study are put forward.

2. Asymptotic Behavior of the Estimators

To study the asymptotic bias of the EVI-estimators under consideration, and due to the slowly varying parts of the CDF F, we need a second-order condition. We, thus, assume the existence of an ultimately positive regular varying function A with index ρ , i.e., | A ( t ) | R V ρ [39], such that
lim t ln U ( t x ) ln U ( t ) ξ ln x A ( t ) = x ρ 1 ρ ,
for every x > 0 . The second-order parameter ρ 0 rules the rate of convergence of U ( t x ) / U ( t ) to x ξ . The speed increases as | ρ | increases. Algorithms for an adequate estimation of the second-order parameters ( β , ρ ) can be found for instance in [40,41], among others. To obtain additional information about the bias of the estimators under study, we also impose the validity of the following third-order condition: there exists a B ( t ) function, which measures the rate of convergence in the second-order condition in (14), such that,
lim t ln U ( t x ) ln U ( t ) ξ ln x A ( t ) x ρ 1 ρ B ( t ) = x ρ + ρ 1 ρ + ρ ,
for all x > 0 , with ρ 0 , a third-order parameter, and | B ( t ) | R V ρ . This condition holds for several Pareto-type models.
A considerable number of models that fulfill Equation (15) belong the subset of Hall’s class in (11). For those models, we have
A ( t ) = ξ β t ρ , B ( t ) = β t ρ , with β , β 0 , δ = β / β and ρ = ρ < 0 .
Based on Lemma 2.1 of Gomes et al. [42], we begin by introducing some notation that underlies the distributional representation of M n ( α ) in Equation (8), given the validity of the third-order condition in (15), and assuming (16). With E i , i = 1 , 2 , , a sequence of independent and identically distributed standard exponential random variables, let us consider the notations:
μ α ( 1 ) : = E [ E i α ] = Γ ( α + 1 ) ,
μ α ( 2 ) ( ρ ) : = E E i α 1 e ρ E i 1 ρ = Γ ( α ) ρ 1 ( 1 ρ ) α 1 ,
μ α ( 3 ) ( ρ ) : = E E i α 2 e ρ E i 1 ρ 2 ,
= 1 ρ 2 ln ( 1 ρ ) 2 1 2 ρ , α = 1 Γ ( α ) ρ 2 ( α 1 ) 1 ( 1 2 ρ ) α 1 2 ( 1 ρ ) α 1 + 1 , α 1 ,
σ α : = V a r [ E i α ] = Γ ( 2 α + 1 ) Γ 2 ( α + 1 ) ,
and
μ ¯ α ( j ) ( ρ ) : = μ α ( j ) ( ρ ) μ α ( 1 ) , σ ¯ α : = σ α μ α ( 1 ) .
Theorem 1. 
Assuming the validity of the third-order condition, (15) and also of (16), and considering intermediate sequences k = k n satisfying (7), the following distributional representation holds:
M n ( α ) ( k ) = d ξ α μ α ( 1 ) 1 + σ ¯ α k Z k ( α ) + α ξ μ ¯ α ( 2 ) ( ρ ) A ( n / k ) + O p A ( n / k ) k + 1 2 ξ 2 2 α δ μ ¯ α ( 2 ) ( 2 ρ ) + α ( α 1 ) μ ¯ α ( 3 ) ( ρ ) A 2 ( n / k ) ( 1 + o p ( 1 ) ) ,
where Z k ( α ) are asymptotically standard normal random variables, defined by
Z k ( α ) : = P k ( α ) σ α = k σ α 1 k i = 1 k E i α Γ ( α + 1 ) .
The proof of this theorem follows the same structure as the proof of Proposition 3.2 in [42].
Remark 1. 
From the results presented in Gomes and Martins ([24]), the vector ( P k ( 1 ) , P k ( α ) ) , with P k ( α ) defined in (23), is asymptotically a bivariate random vector with zero mean value and covariance matrix,
Σ α = 1 α Γ ( α + 1 ) α Γ ( α + 1 ) Γ ( 2 α + 1 ) Γ 2 ( α + 1 ) ,
being the C o v P k ( α ) , P k ( β ) given by Γ ( α + β + 1 ) Γ ( α + 1 ) Γ ( β + 1 ) , for any pair ( α , β ) .
In the next Theorem, we state without proof the asymptotic distributional representation of the GH estimator, ξ ^ P M ( α ) , in (10), under a third-order framework, extending the result presented in Gomes and Martins ([24]).
Theorem 2. 
Assuming the validity of the third-order condition, (15) and also of (16), and considering intermediate sequences k = k n satisfying (7), the following distributional representation holds:
ξ ^ PM ( α ) ( k ) = d ξ + ξ σ ¯ α α k Z k ( α ) + μ ¯ α ( 2 ) ( ρ ) A ( n / k ) + O p A ( n / k ) α k + 1 2 ξ 2 δ μ ¯ α ( 2 ) ( 2 ρ ) + ( α 1 ) μ ¯ α ( 3 ) ( ρ ) ( μ ¯ α ( 2 ) ( ρ ) ) 2 A 2 ( n / k ) ( 1 + o p ( 1 ) ) ,
with
1 2 ξ 2 δ μ ¯ α ( 2 ) ( 2 ρ ) + ( α 1 ) μ ¯ α ( 3 ) ( ρ ) ( μ ¯ α ( 2 ) ( ρ ) ) 2 = 1 2 ξ α 2 ρ 2 α 1 ( 2 δ ) ρ ( 1 2 ρ ) α 2 ( 1 α ρ ) ( 1 ρ ) α α 1 ( 1 ρ ) 2 α δ α ρ + 1 .
If k A ( n / k ) λ finite and k A 2 ( n / k ) 0 , when k , then
k ( ξ ^ PM ( α ) ( k ) ξ ) d N λ b PM : = λ α 1 ( 1 ρ ) α ρ ( 1 ρ ) α , v α : = ξ 2 α 2 Γ ( 2 α + 1 ) Γ 2 ( α + 1 ) 1 .
Remark 2. 
The variance term, v α , reaches its minimum value ξ 2 when α is equal to 1. On the other hand, the term b P M b P M ( α ) , with ρ fixed, is positive and decreases as α increases (see [25], Figures 3 and 4).

Asymptotic Properties of the New Reduced Bias EVI-Estimator

The properties of the RB EVI-estimator, ξ ^ RBPM ( α ) , in (12), are presented in two stages. First, we consider that the second-order parameters ( β , ρ ) are known. Next, we assume that we estimate externally the vector of second-order parameters in an adequate level k 1 of order statistics, with k 1 = [ n 1 ϵ ] , ϵ 0 and where [ . ] denotes the integer part and n the size of the available sample.
Theorem 3. 
Assuming the validity of the third-order condition, (15) and also of (16), and considering intermediate sequences k = k n satisfying (7),
ξ ^ ( β , ρ ) RBPM ( α ) ( k ) = ξ ^ PM ( α ) ( k ) 1 1 ( 1 ρ ) α α ρ ( 1 ρ ) α β n k ρ , α > 0 , 1 k < n ,
has a distributional representation of the type
ξ ^ ( β , ρ ) RBPM ( α ) ( k ) = d ξ + ξ σ ¯ α α k Z k ( α ) + O p A ( n / k ) α k + b RBPM A 2 ( n / k ) ( 1 + o p ( 1 ) ) ,
with
b RBPM = 1 2 ξ 2 δ μ ¯ α ( 2 ) ( 2 ρ ) + ( α 1 ) μ ¯ α ( 3 ) ( ρ ) ( α + 1 ) ( μ ¯ α ( 2 ) ( ρ ) ) 2 .
Then, with v α defined in Theorem 2,
  • If k A ( n / k ) λ is finite or infinite and k A 2 ( n / k ) 0 , when k , then
    k ( ξ ^ ( β , ρ ) RBPM ( α ) ( k ) ξ ) d N 0 , v α .
  • If k A ( n / k ) and k A 2 ( n / k ) λ A , when k , then
    k ( ξ ^ ( β , ρ ) RBPM ( α ) ( k ) ξ ) d N λ A b RBMP , v α .
Proof. 
The result follows from the distributional representation of ξ ^ PM ( α ) presented in Theorem 2. Since β n k ρ = A ( n / k ) / ξ ,
ξ ^ ( β , ρ ) RBPM ( α ) ( k ) = ξ ^ PM ( α ) ( k ) 1 μ ¯ α ( 2 ) ( ρ ) ξ A ( n / k ) = d ξ + ξ σ ¯ α α k Z k ( α ) + μ ¯ α ( 2 ) ( ρ ) A ( n / k ) + O p A ( n / k ) α k + δ ξ μ ¯ α ( 2 ) ( 2 ρ ) + ( α 1 ) 2 ξ μ ¯ α ( 3 ) ( ρ ) ( μ ¯ α ( 2 ) ( ρ ) ) 2 A 2 ( n / k ) ( 1 + o p ( 1 ) ) × 1 μ ¯ α ( 2 ) ( ρ ) ξ A ( n / k ) = d ξ + ξ σ ¯ α α k Z k ( α ) + O p A ( n / k ) α k + 1 2 ξ 2 δ μ ¯ α ( 2 ) ( 2 ρ ) + ( α 1 ) μ ¯ α ( 3 ) ( ρ ) ( α + 1 ) ( μ ¯ α ( 2 ) ( ρ ) ) 2 A 2 ( n / k ) ( 1 + o p ( 1 ) ) .
Despite the great variety of classes of estimators for the second-order parameters ρ and β now available in the literature, we suggest the estimators introduced in Fraga Alves et al. [43] and Gomes and Martins [44]. Alternative estimators for the second-order parameters can be found in papers such as [45,46,47,48,49,50]. The class of estimators of ρ in [43] was initially parameterized with a tuning parameter τ 0 , but it can more generally be considered as a real number (Caeiro and Gomes [51]). It is defined as
ρ ^ ( k ; τ ) ρ ^ τ ( k ) : = 3 T n ( k ; τ ) 1 T n ( k ; τ ) 3 ,
where, with M n ( α ) ( k ) given in (8) and the notation a b τ = b ln a , whenever τ = 0 ,
T n ( k ; τ ) : = M n ( 1 ) ( k ) τ M n ( 2 ) ( k ) / 2 τ / 2 M n ( 2 ) ( k ) / 2 τ / 2 M n ( 3 ) ( k ) / 6 τ / 3 , τ R .
The theoretical and simulated results presented in Fraga Alves et al. [43] and Caeiro and Gomes [50], and their application in RB estimation, lead us to use τ = 0 when 1 ρ < 0 and τ = 1 when ρ < 1 in practical scenarios.
We consider the β -estimator first obtained in [44] and based on the scaled log-spacings
U i : = i ( ln X n i + 1 : n ln X n i : n ) , 1 k < n .
On the basis of any consistent estimator ρ ^ of the second-order parameter ρ , we consider the β -estimator, β ^ ( k ; ρ ^ ) , where, with ρ < 0 ,
β ^ ( k ; ρ ) : = k n ρ 1 k i = 1 k i k ρ 1 k i = 1 k U i 1 k i = 1 k i k ρ U i 1 k i = 1 k i k ρ 1 k i = 1 k i k ρ U i 1 k i = 1 k i k 2 ρ U i .
With ( ρ ^ , β ^ ) generally denoting the ( ρ , β ) -estimators in (28) and (29), computed in the high level k 1 , i.e., ( ρ ^ , β ^ ) ( ρ ^ ( k 1 ) , β ^ ( k 1 ) ) , we can now state the asymptotic behavior of ξ ^ RBPM ( α ) in (12).
Theorem 4. 
Assuming the validity of the third-order condition (15) and also (16), considering intermediate sequences k = k n satisfying (7), with ( ρ ^ , β ^ ) ( ρ ^ ( k 1 ) , β ^ ( k 1 ) ) , where k 1 is a high level such that k 1 A ( n / k 1 ) , k = o ( k 1 ) and with ( ρ ^ ρ ) ln n = o p ( 1 ) , then ξ ^ RBPM ( α ) is consistent for the estimation of ξ. If k A ( n / k ) λ is finite (possibly zero), then
k ξ ^ RBPM ( α ) ( k ) ξ d N 0 , v α ,
where v α is given in Theorem 2. This result still holds true for levels k such that k A ( n / k ) and k A 2 ( n / k ) λ A , if ( ρ ^ ρ ) ln n = o p ( 1 / k A ( n / k ) ) .
Proof. 
Defining
d α ( ρ , β ) : = μ ¯ α ( 2 ) ( ρ ) A ( n / k ) ξ = μ ¯ α ( 2 ) ( ρ ) β n k ρ ,
with μ ¯ α ( 2 ) ( ρ ) in (22), the delta method enables us to write
d α ( ρ ^ , β ^ ) = d α ( ρ , β ) + ( β ^ β ) d α ( ρ , β ) β ( 1 + o p ( 1 ) ) + ( ρ ^ ρ ) d α ( ρ , β ) ρ ( 1 + o p ( 1 ) ) ,
where
d α ( ρ , β ) β = μ ¯ α ( 2 ) ( ρ ) n k ρ
and
d α ( ρ , β ) ρ = μ ¯ α ( 2 ) ( ρ ) ρ β n k ρ + μ ¯ α ( 2 ) ( ρ ) β n k ρ ln n k = μ ¯ α ( 2 ) ( ρ ) β n k ρ c α ( ρ ) + ln n k
with
c α ( ρ ) = ρ μ ¯ α ( 2 ) ( ρ ) μ ¯ α ( 2 ) ( ρ ) = α ρ ( 1 ρ ) + 1 ρ α + 1 ρ 1 ρ 1 1 ρ α = 1 ρ α + 1 ( 1 ( α + 1 ) ρ ) ρ 1 ρ 1 1 ρ α .
Then, it follows that
ξ ^ RBPM ( α ) ( k ) = ξ ^ PM ( α ) ( k ) 1 1 ( 1 ρ ^ ) α α ρ ^ ( 1 ρ ^ ) α β ^ n k ρ ^ = ξ ^ PM ( α ) ( k ) 1 d α ( ρ ^ , β ^ ) = ξ ^ PM ( α ) ( k ) 1 d α ( ρ , β ) ( β ^ β ) d α ( ρ , β ) β ( 1 + o p ( 1 ) ) ( ρ ^ ρ ) d α ( ρ , β ) ρ ( 1 + o p ( 1 ) ) = d ξ ^ ( β , ρ ) RBPM ( α ) ( k ) μ ¯ α ( 2 ) ( ρ ) A ( n / k ) β ^ β β + ( ρ ^ ρ ) ( c α ( ρ ) + ln ( n / k ) ) .
Since ξ ^ ( β , ρ ) RBPM ( α ) is consistent for the estimation of ξ , β ^ and ρ ^ are consistent for the estimation of β and ρ , respectively, and ( ρ ^ ρ ) ln ( n / k ) = o p ( 1 ) ,
μ ¯ α ( 2 ) ( ρ ) A ( n / k ) β ^ β β + ( ρ ^ ρ ) ( c α ( ρ ) + ln ( n / k ) ) = o p ( 1 ) ,
and the consistency of ξ ^ RBPM ( α ) ( k ) follows.
Next, given that
k ξ ^ RBPM ( α ) ( k ) ξ = d k ξ ^ β , ρ RBPM ( α ) ( k ) ξ k A ( n / k ) μ ¯ α ( 2 ) ( ρ ) β ^ β β + ( ρ ^ ρ ) ( c α ( ρ ) + ln ( n / k ) ) ,
with β ^ β p β ( ρ ^ ρ ) ln ( n / k 1 ) ([52], Theorem 2.1); then,
k ξ ^ RBPM ( α ) ( k ) ξ = d k ξ ^ β , ρ RBPM ( α ) ( k ) ξ k A ( n / k ) μ ¯ α ( 2 ) ( ρ ) ( ρ ^ ρ ) ln ( k / k 1 ) .
Thus, ξ ^ RBPM ( α ) ( k ) is consistent for the estimation of ξ if ( ρ ^ ρ ) ln ( k / k 1 ) = o p ( 1 / A ( n / k ) ) and asymptotic normality holds for levels k such that k A ( n / k ) λ (finite) but also for levels k, where k A ( n / k ) and k A 2 ( n / k ) λ A , if ( ρ ^ ρ ) ln ( k / k 1 ) = o p 1 k A ( n / k ) .
Remark 3. 
Note that when α = 1 , we obtain for ξ ^ PM ( α ) ( k ) and ξ ^ RBPM ( α ) ( k ) , in (10) and (12), respectively, the same asymptotic variance as ξ ^ CH ( k ) in (13), which is ξ 2 . The ratio between the asymptotic standard deviations of both estimators,
r α : = v α ξ 2 = 1 α Γ ( 2 α + 1 ) Γ 2 ( α + 1 ) 1 ,
is always greater than or equal to 1 for α > 0 .
Let Bias ( ξ ^ RBPM ( α ) ( k ) ) represent the asymptotic bias and Var ( ξ ^ RBPM ( α ) ( k ) ) the asymptotic variance of ξ ^ RBPM ( α ) ( k ) , in (12). Then, based on the results presented in Theorem 3, we can derive the Asymptotic Mean Squared Error (AMSE):
AMSE ξ ^ RBPM ( α ) ( k ) = Var ( ξ ^ RBPM ( α ) ( k ) ) + Bias ( ξ ^ RBPM ( α ) ( k ) ) 2 =
= ξ 2 k α 2 Γ ( 2 α + 1 ) Γ 2 ( α + 1 ) 1 + b RBPM 2 A 4 ( n / k ) , if b RBPM 0 ,
with b RBPM given in (27). The AMSE of the CH estimator is obtained by taking α = 1 :
AMSE ξ ^ CH ( k ) = ξ 2 k + 1 ξ 2 δ 1 2 ρ 1 ( 1 ρ ) 2 2 A 4 ( n / k ) .
Proposition 1. 
If b RBPM 0 , the threshold that minimizes the AMSE, denoted as k 0 RBPM , can be expressed as follows:
k 0 RBPM = 1 α 2 Γ ( 2 α + 1 ) Γ 2 ( α + 1 ) 1 ( ρ ) β 4 2 δ μ ¯ α ( 2 ) ( 2 ρ ) + ( α 1 ) μ ¯ α ( 3 ) ( ρ ) ( α + 1 ) ( μ ¯ α ( 2 ) ( ρ ) ) 2 2 1 1 4 ρ n 4 ρ 1 4 ρ .
Remark 4. 
Note that
k 0 RBPM A 2 ( n / k 0 RBPM ) n v α 4 ρ | b RBPM | .
Moreover, by taking α = 1 in (33), the value of k that minimizes the AMSE of the CH estimator is given by
k 0 CH = 4 ρ δ 1 2 ρ 1 ( 1 ρ ) 2 2 β 4 1 1 4 ρ n 4 ρ 1 4 ρ .

3. Asymptotic Comparison at Optimal Levels

Let us consider ξ ^ ( k ) , a generic RB EVI-estimator with a distributional representation
ξ ^ ( k ) = d ξ + σ k Z k ( α ) + b A 2 ( n / k ) ( 1 + o p ( 1 ) ) .
Let ξ ^ 0 = ξ ^ n ( k 0 ) be the RB EVI-estimator computed at its optimal level and LMSE the limiting mean square error. Using regular variation theory ([14,15]), we can ensure that whenever b 0 , there exists a function φ ( n , ξ , ρ ) , which solely depends on the underlying model and not on the estimator ([31]), such that
lim n φ ( n , ξ , ρ ) AMSE ξ ^ 0 = σ 2 2 ρ 1 4 ρ b 2 1 1 4 ρ = : LMSE ξ ^ 0 .
By evaluating the ratio of LMSE for two different RB EVI-estimators, we derive a measure independent of the φ ( · ) function, thus leading us to consider the following efficiency measure:
AREFF ( 1 ) | ( 2 ) : = LMSE ( ξ ^ 0 ( 2 ) ) LMSE ( ξ ^ 0 ( 1 ) ) = σ ( 2 ) σ ( 1 ) 4 ρ 1 4 ρ b ( 2 ) b ( 1 ) 1 1 4 ρ .
If AREFF > 1 , then ξ ^ ( 2 ) ( k ) is more efficient than ξ ^ ( 1 ) ( k ) ; if AREFF = 1 , the two estimators are equally efficient; and if AREFF < 1 , then ξ ^ ( 1 ) ( k ) is more efficient than ξ ^ ( 2 ) ( k ) . With ξ ^ ( 1 ) ( k ) ξ ^ RBPM ( α ) ( k ) and ξ ^ ( 2 ) ( k ) ξ ^ CH ( k ) ( b CH : = 1 ξ δ 1 2 ρ 1 ( 1 ρ ) 2 , σ CH : = ξ ),
AREFF RBPM ( α ) | CH = 1 α Γ ( 2 α + 1 ) Γ 2 ( α + 1 ) 1 4 ρ 1 4 ρ × 2 ( α ρ ) 2 δ 1 2 ρ 1 1 ρ 2 α 1 ( 2 δ ) ρ ( 1 2 ρ ) α + 2 ( 1 + α ρ ) ( 1 ρ ) α α + 1 ( 1 ρ ) 2 α ( 1 + δ α ρ ) 1 1 4 ρ .
Note that (37) is derived from the asymptotic formulas for the dominant components of asymptotic bias and variance.
For simplicity, we conduct the comparison only for a few Pareto-type models. For the Burr or the Generalized Pareto (GP) distributions, δ = 1 and a contour plot of AREFF RBPM ( α ) | CH is presented in Figure 1. The blue lines represent the ( α , ρ ) -region where the two estimators are equally efficient and the grey-shaded region represents the values of the ( α , ρ ) -plane where ξ ^ RBPM ( α ) ( k ) is more efficient than ξ ^ CH ( k ) . It is apparent that the ( α , ρ ) -plane region where ξ ^ RBPM ( α ) ( k ) outperforms ξ ^ CH ( k ) is notably large and diverse.
By fixing the values of ( δ , ρ ) it is possible to find the value of α that maximizes AREFF RBPM ( α ) | CH , i.e., α max = arg max α AREFF RBPM ( α ) | CH . Considering the Pareto-type models described in Section 4 and the associated ( δ , ρ ) values we obtained for the Fréchet model, α max = 1.232 ; for the Burr model with ρ = 0.75 , α max = 0.464 , and α max = 0.073 , when ρ = 0.5 ; for the Half- t 1 model, α max = 0.468 ; for the Half- t 2 model, α max = 0.844 ; for the Half- t 3 model, α max = 1.148 , and for the Half- t 4 model, α max = 1.412 . Note that, as shown in Section 4, the use of this α max does not always lead to the best results because their finite sample behavior may differ.

4. Monte Carlo Simulation Study

In this section, we perform a Monte Carlo simulation study to assess the finite sample behavior of the new class of estimators ξ ^ RBPM ( α ) ( k ) , with selected values of α . For comparison purposes, the value α = 1 (CH≡RBPM(1)) and the Hill estimator in (5) were included in this study. The other α values used in the simulation study were chosen based on the findings from a preliminary simulation study. The results were obtained based on 2000 simulated samples of sizes n = 50 , 100 , 200 , 500 , 1000 , 2000 and 5000 from the following Pareto-type models:
  • The Fréchet model with a d.f. given by F ( x ) = exp ( x 1 / ξ ) , x > 0 , with ξ = 0.5 ( ρ = ρ = 1 , δ = 5 / 3 ).
  • The Burr model, with a d.f. F ( x ) = 1 ( 1 + x ρ / ξ ) 1 / ρ , x > 0 , ξ > 0 , ρ < 0 with ( ξ , ρ ) = ( 0.5 , 0.5 ) and ( 0.5 , 0.75 ) ( ρ = ρ = 1 , δ = 1 ).
  • The Half- t ν model, i.e., the absolute value of a Student’s- t ν distribution with ν = 2 , 3 , 4 , ( ξ , ρ ) = ( 1 / ν , 2 / ν ) ( ρ = ρ , δ = ν ( ν 2 + 4 ν + 2 ) / ( ( ν + 4 ) ( ν + 1 ) ) ).
  • The Log-gamma model with ξ = 0.5 ( ρ = 0 ). It is important to note that this model does not fulfill condition (11).

4.1. Results for All Values of k

For each model, the performance of the estimators is evaluated through the simulated mean value (E) and root mean squared error (RMSE) as a function of k. The results of the simulation study are provided in Figure 2, Figure 3, Figure 4 , Figure 5, Figure 6 , Figure 7 and Figure 8 for samples of size n = 500 . The dotted horizontal black line on the left plots represents the value of the EVI.
From these figures, we can conclude the following:
  • As expected, the finite sample behavior of the new class of RB EVI-estimators depends on the underlying model, but there is always a positive value of α that enables a clear reduction in both bias and RMSE;
  • For the Fréchet model, with ξ = 0.5 , the new RB EVI-estimators provide a stable trajectory around the true value of ξ for a wide range of k values;
  • For the Burr models under study, there is also a significant reduction in bias and RMSE;
  • The same holds for the Half-t models, except for the RMSE for the Half-t model with ν = 1 , where the Hill estimator presents the lowest RMSE;
  • Even for the Log-gamma model, it is possible to find values of α that stabilize the simulated mean sample paths, also reducing the RMSE.

4.2. Results at the Simulated Optimal k

The simulated mean values at optimal levels, aiming for minimal MSE, are summarized in Table 1 for the Pareto-type models described above. The corresponding simulated RMSE at optimal levels is presented in Table 2. The values with the smallest absolute bias and RMSE are highlighted in bold.
An analysis of the provided tables leads us to the following conclusions:
  • For the Fréchet model with ξ = 0.5 , Burr model with ( ξ , ρ ) = ( 0.5 , 0.75 ) , and Half-t model with ν = 2 , the best value of α depends on the sample size;
  • For the Burr model with ( ξ , ρ ) = ( 0.5 , 0.5 ) and for the Half-t models with ν = 3 , 4 , the value of α = 5 seems to provide an overall good performance;
  • For the Half-t model with ν = 1 , the best estimator in terms of minimum RMSE is the Hill estimator, but in terms of bias, the value of α = 5 , for large sample sizes, is the best choice;
  • For the Log-gamma model with ξ = 0.5 , the value of α = 0.5 provides the smallest absolute bias and the smallest RMSE for sample sizes n 500 . For small sample sizes, the smallest RMSEs are the ones associated with the CH estimator.
As stated at the end of Section 3, the values of α that maximize the AREFF, in (37), are based on a combination of the asymptotic bias and variance. Apart from the Fréchet model where the value α max = 1.2 provides good results in terms of simulated bias and RMSE, for the other models under study the findings of the simulations study differ from the theoretical ones. This difference may be attributed to the estimation of the second-order parameters ( β , ρ ) and to the fact that finite properties of the estimators may differ from the theoretical asymptotic ones.

5. Case Study: Application of New Estimators to Hydrological Data

In this section, we perform an illustration of the behavior of the new estimators in a real data set. Consider the daily flow (in m3/s) of the Whiteadder River in Hutton, UK, within 11 years, ranging from 1 January 1995 to 31 December 2005, with a total of n = 4018 observations. The data were collected from https://nrfa.ceh.ac.uk/data/station/meanflow/21022 (accessed on 7 December 2015). The time series plot and the boxplot of the Whiteadder River are presented in Figure 9. From the boxplot, it is clear that the data are positively skewed (skew = 8.244), also having an excess of kurtosis of 118.598. Based on these features, one can infer that our real data set has a Pareto-type tail and these EVI-estimators can be applied to infer the heaviness of the right tail. To perform an adequate choice of ξ , we search for stability regions around a value of ξ . The larger the stability region, the better. The use of RB EVI-estimators allows us to search for stability regions for large values of k. The sample paths of the estimators under study for the Whiteadder River flow are presented in Figure 10. The Hill estimates (black line) present some stability around the value of 0.5 for a small region of k values. The CH estimates (green line) also show a small stability region. The new class of RB EVI-estimators with an appropriate choice of the tuning parameter leads to more stable sample paths when compared to the classic RB EVI-estimator in (13) (CH≡RBPM(1)), enabling the use of stability Algorithms like the ones in [41] to obtain adaptive estimates of ξ .

6. Conclusions

In this work, we introduce a novel class of RB EVI-estimators developed using the power mean of the normalized log-excesses and that generalizes other classes of RB EVI-estimators in the literature. We derive the asymptotic non-degenerate behavior within a third-order framework and perform an asymptotic comparison at optimal levels. Large-scale simulation experiments reveal that the parameter α generally facilitates bias reduction and enhances stability in estimates, depending on the number k of order statistics considered. Moreover, with a suitably chosen control parameter, these estimators exhibit a relatively stable RMSE pattern with k. Our findings also show significant improvements over the CH estimator. A comparison with other classes of RB EVI-estimators is currently underway and falls outside the scope of this work.

Author Contributions

Conceptualization, L.H.-R. and F.C.; methodology, L.H.-R., F.C. and M.I.G.; validation, F.C.; investigation, L.H.-R., F.C. and M.I.G.; writing—original draft preparation, L.H.-R. and F.C.; writing—review and editing, L.H.-R., F.C. and M.I.G. All authors have read and agreed to the published version of the manuscript.

Funding

Research partially supported by National Funds through FCT—Fundação para a Ciência e a Tecnologia—projects UIDB/MAT/04674/2020 (CIMA) https://doi.org/10.54499/UIDB/04674/2020, UIDB/00297/2020 https://doi.org/10.54499/UIDB/00297/2020, UIDP/00297/2020 https://doi.org/10.54499/UIDP/00297/2020 (Centro de Matemática e AplicaçÕes), and UIDB/00006/2020 (CEAUL) https://doi.org/10.54499/UIDB/00006/2020. The first author is also supported by project 10.54499/CEECINST/00054/2018/CP1522/CT0003 https://doi.org/10.54499/CEECINST/00054/2018/CP1522/CT0003.

Data Availability Statement

The data used in this study are entirely based on simulated and open datasets. The simulation code and any relevant parameters used to generate these datasets are available from the corresponding author upon reasonable request.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Fréchet, M. Sur la loi de probabilité de l’écart maximum. Ann. Soc. Math. Polon. 1927, 6, 93–116. [Google Scholar]
  2. Fisher, R.A.; Tippett, L.H.C. Limiting forms of the frequency distribution of the largest or smallest member of a sample. Math. Proc. Camb. Philos. Soc. 1928, 24, 180–190. [Google Scholar] [CrossRef]
  3. Von Mises, R. La distribution de la plus grande de n valuers. Rev. Math. Union Interbalcanique 1936, 1, 141–160. [Google Scholar]
  4. Gnedenko, B. Sur La Distribution Limite Du Terme Maximum D’Une Série Aléatoire. Ann. Math. 1943, 44, 423. [Google Scholar] [CrossRef]
  5. Weissman, I. Estimation of parameters and large quantiles based on the k largest observations. J. Am. Stat. Assoc. 1978, 73, 812–815. [Google Scholar] [CrossRef]
  6. Matthys, G.; Beirlant, J. Estimating the extreme value index and high quantiles with exponential regression models. Stat. Sin. 2003, 13, 853–880. [Google Scholar]
  7. Jorion, P. Value at Risk: The New Benchmark for Managing Market Risk, 2nd ed.; McGraw-Hill: New York, NY, USA, 2000. [Google Scholar]
  8. Girard, S. A Hill Type Estimator of the Weibull Tail-Coefficient. Commun. Stat.—Theory Methods 2004, 33, 205–234. [Google Scholar] [CrossRef]
  9. Beirlant, J.; Herrmann, K.; Teugels, J. Estimation of the Extreme Value Index. In Extreme Events in Finance; John Wiley & Sons, Inc.: Hoboken, NJ, USA, 2016; pp. 97–115. [Google Scholar] [CrossRef]
  10. Allouche, M.; El Methni, J.; Girard, S. A refined Weissman estimator for extreme quantiles. Extremes 2023, 26, 545–572. [Google Scholar] [CrossRef]
  11. Beirlant, J.; Goegebeur, Y.; Segers, J.; Teugels, J.L. Statistics of Extremes: Theory and Applications; John Wiley & Sons: Hoboken, NJ, USA, 2004. [Google Scholar] [CrossRef]
  12. de Haan, L.; Ferreira, A. Extreme Value Theory; Springer: New York, NJ, USA, 2006. [Google Scholar] [CrossRef]
  13. Dey, D.; Yan, J. Extreme Value Modeling and Risk Analysis: Methods and Applications; Chapman and Hall/CRC: Boca Raton, FL, USA, 2016. [Google Scholar] [CrossRef]
  14. Bingham, N.H.; Goldie, C.M.; Teugels, J.L. Regular Variation; Encyclopedia of Mathematics and its Applications; Cambridge University Press: Cambridge, MA, USA, 1987; Volume 27. [Google Scholar] [CrossRef]
  15. Bingham, N. Regular variation and probability: The early years. J. Comput. Appl. Math. 2007, 200, 357–363. [Google Scholar] [CrossRef]
  16. Hill, B.M. A simple general approach to inference about the tail of a distribution. Ann. Stat. 1975, 3, 1163–1174. [Google Scholar] [CrossRef]
  17. Mason, D.M. Laws of large numbers for sums of extreme values. Ann. Probab. 1982, 10, 754–764. [Google Scholar] [CrossRef]
  18. Haeusler, E.; Teugels, J.L. On asymptotic normality of Hill’s estimator for the exponent of regular variation. Ann. Stat. 1985, 13, 743–756. [Google Scholar] [CrossRef]
  19. Hsing, T. On tail index estimation using dependent data. Ann. Stat. 1991, 19, 1547–1569. [Google Scholar] [CrossRef]
  20. de Haan, L.; Resnick, S. On asymptotic normality of the Hill estimator. Stoch. Models 1998, 14, 849–866. [Google Scholar] [CrossRef]
  21. Gomes, M.I.; Guillou, A. Extreme value theory and statistics of univariate extremes: A review. Int. Stat. Rev. 2015, 83, 263–292. [Google Scholar] [CrossRef]
  22. Csörgő, S.; Deheuvels, P.; Mason, D. Kernel estimates of the tail index of a distribution. Ann. Stat. 1985, 13, 1050–1077. [Google Scholar] [CrossRef]
  23. Dekkers, A.L.M.; Einmahl, J.H.J.; de Haan, L. A Moment Estimator for the Index of an Extreme-Value Distribution. Ann. Stat. 1989, 17, 1833–1855. [Google Scholar] [CrossRef]
  24. Gomes, M.I.; Martins, M.J.; Neves, M. Alternatives to a Semi-Parametric Estimator of Parameters of Rare Events–The Jackknife Methodology. Extremes 2000, 3, 207–229. [Google Scholar] [CrossRef]
  25. Gomes, M.I.; Martins, M.J. Generalizations of the Hill estimator—Asymptotic versus finite sample behaviour. J. Stat. Plan. Inference 2001, 93, 161–180. [Google Scholar] [CrossRef]
  26. Groeneboom, P.; Lopuhaä, H.; De Wolf, P. Kernel-type estimators for the extreme value index. Ann. Stat. 2003, 31, 1956–1995. [Google Scholar] [CrossRef]
  27. Beran, J.; Schell, D.; Stehlík, M. The harmonic moment tail index estimator: Asymptotic distribution and robustness. Ann. Inst. Stat. Math. 2014, 66, 193–220. [Google Scholar] [CrossRef]
  28. Penalva, H.; Gomes, M.I.; Caeiro, F.; Neves, M.M. A Couple of Non Reduced Bias Generalized Means In Extreme Value Theory: An Asymptotic Comparison. Rev.-Stat. J. 2020, 18, 281–298. [Google Scholar] [CrossRef]
  29. Caeiro, F.; Mateus, A.; Soltane, L. A class of weighted Hill estimators. Comput. Math. Methods 2021. [Google Scholar] [CrossRef]
  30. Paulauskas, V.; Vaičiulis, M. On an improvement of Hill and some other estimators. Lith. Math. J. 2013, 53, 336–355. [Google Scholar] [CrossRef]
  31. de Haan, L.; Peng, L. Comparison of tail index estimators. Stat. Neerl. 1998, 52, 60–70. [Google Scholar] [CrossRef]
  32. Cabral, I.; Caeiro, F.; Gomes, M.I. On the comparison of several classical estimators of the extreme value index. Commun. Stat.-Theory Methods 2022, 51, 179–196. [Google Scholar] [CrossRef]
  33. Segers, J. Residual estimators. J. Stat. Plan. Inference 2001, 98, 15–27. [Google Scholar] [CrossRef]
  34. Hall, P. On Some Simple Estimates of an Exponent of Regular Variation. J. R. Stat. Soc. Ser. B (Methodol.) 1982, 44, 37–42. [Google Scholar] [CrossRef]
  35. Hall, P.; Welsh, A. Adaptive estimates of parameters of regular variation. Ann. Stat. 1985, 13, 331–341. [Google Scholar] [CrossRef]
  36. Caeiro, F.; Gomes, M.I.; Pestana, D. Direct reduction of bias of the classical Hill estimator. Rev.-Stat. J. 2005, 3, 113–136. [Google Scholar] [CrossRef]
  37. Knight, K. A Simple Modification of the Hill Estimator with Applications to Robustness and Bias Reduction. Statistics Department, University of Toronto: Toronto, ON, Canada, 2012; preprint. [Google Scholar]
  38. Buitendag, S.; Beirlant, J.; de Wet, T. Ridge regression estimators for the extreme value index. Extremes 2019, 22, 271–292. [Google Scholar] [CrossRef]
  39. Geluk, J.; de Haan, L. Regular Variation, Extensions and Tauberian Theorems; CWI Tract 40, Centre for Mathematics and Computer Science: Amsterdam, The Netherlands, 1987. [Google Scholar]
  40. Gomes, M.I.; Pestana, D. A sturdy reduced-bias extreme quantile (VaR) estimator. J. Am. Stat. Assoc. 2007, 102, 280–292. [Google Scholar] [CrossRef]
  41. Caeiro, F.; Gomes, M.I. Threshold Selection in Extreme Value Analysis. In Extreme Value Modeling and Risk Analysis; Dey, D.K., Yan, J., Eds.; Chapman and Hall/CRC: Boca Raton, FL, USA, 2016; pp. 69–86. [Google Scholar] [CrossRef]
  42. Gomes, M.I.; de Haan, L.; Peng, L. Semi-parametric Estimation of the Second Order Parameter in Statistics of Extremes. Extremes 2002, 5, 387–414. [Google Scholar] [CrossRef]
  43. Fraga Alves, M.I.; Gomes, M.I.; de Haan, L. A new class of semi-parametric estimators of the second order parameter. Port. Math. 2003, 60, 193–213. [Google Scholar]
  44. Gomes, M.I.; Martins, M.J. Asymptotically unbiased estimators of the tail index based on external estimation of the second order parameter. Extremes 2002, 5, 5–31. [Google Scholar] [CrossRef]
  45. Ciuperca, G.; Mercadier, C. Semi-parametric estimation for heavy tailed distributions. Extremes 2010, 13, 55–87. [Google Scholar] [CrossRef]
  46. Gomes, M.I.; Henriques-Rodrigues, L.; Pereira, H.; Pestana, D. Tail index and second-order parameters’ semi-parametric estimation based on the log-excesses. J. Stat. Comput. Simul. 2010, 80, 653–666. [Google Scholar] [CrossRef]
  47. Worms, J.; Worms, R. Estimation of second order parameters using probability weighted moments. ESAIM Probab. Stat. 2012, 16, 97–113. [Google Scholar] [CrossRef]
  48. de Wet, T.; Goegebeur, Y.; Munch, M.R. Asymptotically unbiased estimation of a second order tail parameter. Stat. Probab. Lett. 2012, 82, 565–573. [Google Scholar] [CrossRef]
  49. Deme, E.H.; Gardes, L.; Girard, S. On the Estimation of the Second Order Parameter for Heavy-Tailed Distributions. Rev.-Stat. J. 2013, 11, 277–299. [Google Scholar] [CrossRef]
  50. Caeiro, F.; Gomes, M.I. Bias Reduction in the Estimation of a Shape Second-order Parameter of a Heavy Tail Model. J. Stat. Comput. Simul. 2015, 85, 3405–3419. [Google Scholar] [CrossRef]
  51. Caeiro, F.; Gomes, M.I. A new class of estimators of a “scale” second order parameter. Extremes 2006, 9, 193–211. [Google Scholar] [CrossRef]
  52. Caeiro, F.; Gomes, M.I.; Henriques-Rodrigues, L. Reduced-bias tail index estimators under a third order framework. Commun. Stat.-Theory Methods 2009, 38, 1019–1040. [Google Scholar] [CrossRef]
Figure 1. Contour plot of AREFF RBPM ( α ) | CH , with δ = 1 in the ( α , ρ ) -plane.
Figure 1. Contour plot of AREFF RBPM ( α ) | CH , with δ = 1 in the ( α , ρ ) -plane.
Mathematics 12 02866 g001
Figure 2. Simulated mean values (left) and RMSE (right) of the EVI-estimators under study for samples of size 1000 from the Fréchet model, with ξ = 0.5 .
Figure 2. Simulated mean values (left) and RMSE (right) of the EVI-estimators under study for samples of size 1000 from the Fréchet model, with ξ = 0.5 .
Mathematics 12 02866 g002
Figure 3. Simulated mean values (left) and RMSE (right) of the EVI-estimators under study for samples of size 1000 from the Burr model, with ξ = 0.5 and ρ = 0.75 .
Figure 3. Simulated mean values (left) and RMSE (right) of the EVI-estimators under study for samples of size 1000 from the Burr model, with ξ = 0.5 and ρ = 0.75 .
Mathematics 12 02866 g003
Figure 4. Simulated mean values (left) and RMSE (right) of the EVI-estimators under study for samples of size 1000 from the Burr model, with ξ = 0.5 and ρ = 0.5 .
Figure 4. Simulated mean values (left) and RMSE (right) of the EVI-estimators under study for samples of size 1000 from the Burr model, with ξ = 0.5 and ρ = 0.5 .
Mathematics 12 02866 g004
Figure 5. Simulated mean values (left) and RMSE (right) of the EVI-estimators under study for samples of size 1000 from the Half-t model, with v = 2 ( ξ = 0.5 ) .
Figure 5. Simulated mean values (left) and RMSE (right) of the EVI-estimators under study for samples of size 1000 from the Half-t model, with v = 2 ( ξ = 0.5 ) .
Mathematics 12 02866 g005
Figure 6. Simulated mean values (left) and RMSE (right) of the EVI-estimators under study for samples of size 1000 from the Half-t model, with v = 3 (ξ = 1/3).
Figure 6. Simulated mean values (left) and RMSE (right) of the EVI-estimators under study for samples of size 1000 from the Half-t model, with v = 3 (ξ = 1/3).
Mathematics 12 02866 g006
Figure 7. Simulated mean values (left) and RMSE (right) of the EVI-estimators under study for samples of size 1000 from the Half-t model, with v = 4 (ξ = 0.25).
Figure 7. Simulated mean values (left) and RMSE (right) of the EVI-estimators under study for samples of size 1000 from the Half-t model, with v = 4 (ξ = 0.25).
Mathematics 12 02866 g007
Figure 8. Simulated mean values (left) and RMSE (right) of the EVI-estimators under study for samples of size 1000 from the Log-gamma model, with ξ = 0.5.
Figure 8. Simulated mean values (left) and RMSE (right) of the EVI-estimators under study for samples of size 1000 from the Log-gamma model, with ξ = 0.5.
Mathematics 12 02866 g008
Figure 9. Daily time series (left) and boxplot (right) of the Whiteadder River flow between 1995 and 2005 (n = 4018).
Figure 9. Daily time series (left) and boxplot (right) of the Whiteadder River flow between 1995 and 2005 (n = 4018).
Mathematics 12 02866 g009
Figure 10. Sample paths of estimators under study for the Whiteadder River flow between 1995 and 2005 (n = 4018).
Figure 10. Sample paths of estimators under study for the Whiteadder River flow between 1995 and 2005 (n = 4018).
Mathematics 12 02866 g010
Table 1. Simulated mean value for the EVI-estimators under study, computed at the simulated optimal level.
Table 1. Simulated mean value for the EVI-estimators under study, computed at the simulated optimal level.
50100200500100020005000
Fréchet model with ξ = 0.5 ( ρ = 1 )
H0.57280.55630.54140.53580.52630.52130.5132
RBPM(0.5)0.47550.47450.48390.49590.49930.50100.5010
RBPM(1)0.48700.48930.48790.49590.50020.50190.5015
RBPM(1.2)0.48700.48930.48790.49590.50020.50190.5015
RBPM(2)0.49200.50380.50280.49980.50100.50280.5022
RBPM(3)0.49750.50400.50710.50820.50550.50470.5034
Burr model with ( ξ , ρ ) = ( 0.5 , 0.75 )
H0.61890.59870.57080.55260.54460.53640.5299
RBPM(0.5)0.56560.54630.53460.52960.51780.51710.5146
RBPM(1)0.55560.54330.53360.52320.51750.51710.5134
RBPM(2)0.54490.53830.52950.52200.51950.51610.5129
RBPM(3)0.54000.53530.53040.52470.52060.51540.5130
RBPM(5)0.52600.51800.52280.52120.51970.51780.5152
Burr model with ( ξ , ρ ) = ( 0.5 , 0.5 )
H0.68780.63490.60880.59280.57760.56330.5520
RBPM(0.25)0.69680.63190.61900.59240.58090.56510.5483
RBPM(1)0.63440.61000.58980.56920.56340.55460.5438
RBPM(2)0.60450.58610.57190.56140.55820.54660.5405
RBPM(3)0.56850.57130.55780.55690.55150.54350.5378
RBPM(5)0.53380.53920.53650.53800.53740.53430.5326
Half-t model with ν = 2 ( ξ = 0.5 )
H0.60010.56410.55680.54760.53620.52430.5232
RBPM(0.85)0.53390.52170.51640.50480.50340.50180.5007
RBPM(1)0.53190.52100.51640.50500.50370.50220.5011
RBPM(2)0.53000.52040.51480.50830.50430.50320.5011
RBPM(3)0.52420.52220.51770.51230.50720.50560.5031
RBPM(5)0.52230.51760.51540.51560.51160.50930.5065
Half-t model with ν = 3 ( ξ = 1 / 3 )
H0.43040.43250.39570.38740.37690.36370.3588
RBPM(0.25)0.43960.40850.39190.37440.36880.35670.3547
RBPM(1)0.39910.39490.37850.36810.36360.35360.3516
RBPM(2)0.38060.38110.37330.36300.35970.35440.3495
RBPM(3)0.37470.37130.36820.35990.35810.35400.3503
RBPM(5)0.35040.35280.35330.35520.35410.35250.3497
Half-t model with ν = 4 ( ξ = 0.25 )
H0.37320.34490.32660.30170.29580.29040.2800
RBPM(0.25)0.37790.34340.32660.30530.29770.28450.2828
RBPM(1)0.33450.32630.31090.29590.28990.27830.2754
RBPM(2)0.30630.30870.30030.28910.28130.27940.2730
RBPM(3)0.29750.29080.28990.28310.27960.27610.2723
RBPM(5)0.26150.27280.26880.27020.26840.26990.2678
Log-gamma with ξ = 0.5
H0.65690.63240.61720.59610.59630.58400.5759
RBPM(0.25)0.51100.53010.55610.57430.57800.58110.5818
RBPM(1)0.51890.53400.56130.58020.58620.58300.5732
RBPM(2)0.52600.54050.56630.58250.58030.57620.5707
RBPM(3)0.52810.54360.56590.57760.57870.57410.5702
RBPM(5)0.51100.52910.54900.55670.55780.55820.5582
Table 2. Simulated RMSE for the EVI-estimators under study, computed at the simulated optimal level.
Table 2. Simulated RMSE for the EVI-estimators under study, computed at the simulated optimal level.
50100200500100020005000
Fréchet model with ξ = 0.5 ( ρ = 1 )
H0.13830.10510.08180.05970.04570.03650.0262
RBPM(0.5)0.12050.09910.07820.05590.04180.03130.0206
RBPM(1)0.11370.09570.07450.05410.04060.03030.0200
RBPM(1.2)0.11240.09420.07390.05400.04060.03030.0200
RBPM(2)0.11250.09260.07320.05440.04210.03150.0206
RBPM(3)0.11630.09490.07580.05630.04460.03400.0224
Burr model with ( ξ , ρ ) = ( 0.5 , 0.75 )
H0.21700.16750.12910.09310.07390.06000.0453
RBPM(0.5)0.16930.11550.08370.05550.04040.03100.0234
RBPM(1)0.15470.10840.07910.05300.03880.03010.0227
RBPM(2)0.14250.10410.07680.05270.03950.03060.0229
RBPM(3)0.13590.10380.07810.05540.04240.03300.0247
RBPM(5)0.12190.10030.08060.06240.05080.04060.0317
Burr model with ( ξ , ρ ) = ( 0.5 , 0.5 )
H0.30390.23640.18330.13990.11480.09550.0760
RBPM(0.25)0.28910.20670.16150.12460.10280.08710.0693
RBPM(1)0.23370.17360.13730.10550.08790.07500.0601
RBPM(2)0.20200.15550.12380.09710.08140.06900.0563
RBPM(3)0.18130.14440.11680.09390.07970.06780.0556
RBPM(5)0.14930.12390.10410.08840.07780.06720.0574
Half-t model with ν = 2 ( ξ = 0.5 )
H0.20270.15110.11720.08190.06460.05170.0379
RBPM(0.85)0.14260.08990.06510.04040.02680.01960.0126
RBPM(1)0.13970.08850.06420.04000.02650.01950.0125
RBPM(2)0.13030.08570.06300.03990.02740.02040.0133
RBPM(3)0.12600.08740.06630.04350.03100.02350.0156
RBPM(5)0.11820.09060.07330.05290.04160.03340.0242
Half-t model with ν = 3 ( ξ = 1 / 3 )
H0.18790.14930.11570.08560.06790.05430.0406
RBPM(0.25)0.17320.12260.09150.06550.05240.04180.0315
RBPM(1)0.13780.10250.07720.05600.04480.03560.0273
RBPM(2)0.11970.09260.07150.05270.04220.03370.0260
RBPM(3)0.10840.08730.06970.05280.04310.03480.0271
RBPM(5)0.09210.07750.06650.05310.04640.03880.0310
Half-t model with ν = 4 ( ξ = 0.25 )
H0.18460.14680.11590.08540.06940.05600.0439
RBPM(0.25)0.18260.13570.10780.08040.06690.05430.0428
RBPM(1)0.14510.11160.08900.06740.05630.04600.0367
RBPM(2)0.12110.09710.07880.06090.05110.04260.0342
RBPM(3)0.10570.08730.07260.05710.04890.04150.0339
RBPM(5)0.08410.07200.06220.05080.04490.03980.0341
Log-gamma with ξ = 0.5
H0.21680.18200.15560.12940.11410.10350.0897
RBPM(0.25)0.18050.14170.11960.10590.09890.09350.0869
RBPM(1)0.17580.13730.11690.10590.10140.09760.0877
RBPM(2)0.18100.14090.12000.10750.10150.09400.0848
RBPM(3)0.18530.14520.12400.10840.10020.09280.0837
RBPM(5)0.17800.14440.12430.10590.09670.09030.0816
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Henriques-Rodrigues, L.; Caeiro, F.; Gomes, M.I. A New Class of Reduced-Bias Generalized Hill Estimators. Mathematics 2024, 12, 2866. https://doi.org/10.3390/math12182866

AMA Style

Henriques-Rodrigues L, Caeiro F, Gomes MI. A New Class of Reduced-Bias Generalized Hill Estimators. Mathematics. 2024; 12(18):2866. https://doi.org/10.3390/math12182866

Chicago/Turabian Style

Henriques-Rodrigues, Lígia, Frederico Caeiro, and M. Ivette Gomes. 2024. "A New Class of Reduced-Bias Generalized Hill Estimators" Mathematics 12, no. 18: 2866. https://doi.org/10.3390/math12182866

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop