Next Article in Journal
A Study of Generalized QL-Implications
Previous Article in Journal
A Computational Framework for Exploring SARS-CoV-2 Pharmacodynamic Dose and Timing Regimes
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

The Extended Half-Skew Normal Distribution

by
Karol I. Santoro
1,
Héctor J. Gómez
2,*,
Diego I. Gallardo
3,
Inmaculada Barranco-Chamorro
4,* and
Héctor W. Gómez
5
1
Department of Mathematics, Faculty of Sciences, North Catholic University, Antofagasta 1240000, Chile
2
Department of Physics and Mathematics Sciences, Faculty of Engineering, Catholic University of Temuco, Temuco 4780000, Chile
3
Department of Mathematics, Faculty of Engineering, University of Atacama, Copiapó 1530000, Chile
4
Department of Statistics and Operations Research, Faculty of Mathematics, University of Seville, 41012 Seville, Spain
5
Department of Mathematics, Faculty of Basic Sciences, University of Antofagasta, Antofagasta 1240000, Chile
*
Authors to whom correspondence should be addressed.
Mathematics 2022, 10(20), 3740; https://doi.org/10.3390/math10203740
Submission received: 1 September 2022 / Revised: 29 September 2022 / Accepted: 30 September 2022 / Published: 12 October 2022
(This article belongs to the Section Probability and Statistics)

Abstract

:
A new class of densities for modelling non-negative data, which is based on the skew-symmetric family of distributions proposed by Azzalini is introduced.We focus on the model generated by the skew-normal distribution, called Extended Half Skew-Normal distribution. Its relevant properties are studied. These are pdf, cdf, moments, mgf, and stochastic representation. The parameters are estimated by moment and maximum likelihood methods. A simulation study to assess the performance of the maximum likelihood estimators in finite samples was carried out. Two real applications are included, in which the EHSN provides a better fit than other proposals in the literature.

1. Introduction

In the last few years, a considerable amount of research activity has been carried out in the field of skew-symmetric distributions theory and applications. Skew-symmetric distributions are of interest since they can be applied to datasets with an asymmetric structure without the need of applying arbitrary transformations to original data in order to reach symmetry. Quite often these transformations cause a loss of interpretability with respect to the original data. Nowadays, the focus of a number of statisticians is the development of non-symmetric parametric extensions of symmetric distributions. That is, to consider distributions with support in R , to which parameters are incorporated to deal with skewness and kurtosis. In this sense, we can cite the pioneering paper by Azzalini [1] in which the skew-normal model was introduced. Other univariate parametric extensions of this model can be seen in the works by Azzalini [2], Azzalini and Capitano [3], Gupta et al. [4], Arellano-Valle et al. [5], DiCiccio and Monti [6], Gómez et al. [7], Adcock and Azzalini [8], and Gómez-Déniz et al. [9], among others. In all these papers, results and applications of great interest can be found.
The model studied in this paper is based on the univariate skew-normal distribution introduced by Azzalini [1]. There, a general method to build skew distributions on the basis of symmetric distributions was proposed. This key result is pointed out in next lemma, and will be one of the starting points to develop our proposal.
Lemma 1.
Let f be a symmetric at zero probability density function (pdf) and G an absolutely continuous cumulative distribution function (cdf), such that G ( x ) = G ( x ) . Then
h ( x ; λ ) = 2 f ( x ) G ( λ x ) , x R , λ R ,
is the pdf of a random variable (rv) X. In (1), λ is a shape parameter related to the skewness of the distribution.
In literature, it is said that h λ = h ( x ; λ ) introduced in (1) is the skew version of f with skewing function G.
Gómez et al. [7] showed that cases of great interest are obtained if the N ( 0 , 1 ) cdf, Φ , is considered as skewing function. Specifically they proposed the following family of skew densities
g ( x ; λ ) = 2 f 0 ( x ) Φ ( λ x ) , λ , x R ,
where f 0 is any symmetric around zero pdf and λ is a skewness parameter. Recall that, if f 0 = ϕ and G = Φ then g λ reduces to the pdf of the skew normal S N ( λ ) introduced by Azzalini [1]. As merit of (2), it can be cited that the models obtained are more flexible as for its skewness parameter than the distributions proposed by other authors, such as those proposed in Nadarajah and Kotz [10].
Flexible models can be of interest in real applications where is quite common to find non-negative data, which need for their modelling distributions with positive support. In this sense, we consider the model introduced in Elal-Olivero et al. [11], whose pdf is
f Y ( y ; α ) = 2 α + y 2 α + k f 0 y , y 0 , α 0 ,
where f 0 is the pdf of a symmetric around zero rv Y with E [ Y 2 ] = k < . If f 0 ( · ) = ϕ , the pdf of the N ( 0 , 1 ) distribution, then k = 1 and (3) is called the Extended Half-Normal (EHN) density, which was studied in [11]. Other choices of f 0 are possible, for instance, taken as f 0 ( · ) the Power Exponential pdf, [12], the Extended Half-Power Exponential model was recently proposed in [13].
The aim of this paper is to obtain a new model of distributions, based on (2), which will be flexible enough to modelling non-negative datasets.
The outline of this paper is the following one. In Section 2, the new general family of distributions is introduced along with basic properties. In Section 3, the particular case based on the skew-normal pdf is considered. This model is called the Extended Half Skew-Normal (EHSN) distribution. Its cdf, moments and stochastic representation are studied in detail. Section 4 is devoted to inference in the EHSN model. The method of moments and maximum likelihood (ML) are discussed. In Section 5, a simulation study is carried out to assess the consistence of ML estimators in the EHSN model. Finally, two real applications are given in Section 6, which illustrate the usefulness of our proposal.

2. A New General Family of Distributions

In this section, based on (1), a new class of distributions for modelling non-negative data is introduced. Some basic properties of this family are also given.
Lemma 2.
Let g λ = g ( x ; λ ) introduced in (2), and h a non-negative scalar function such that
0 h ( x ) g λ ( x ) d x < .
Then,
f ( x ) = c g λ ( x ) h ( x ) , x > 0 ,
is a pdf in R + with c 1 = 0 h ( x ) g λ ( x ) d x < .
Note that, varying f 0 or h in Lemma 2, a diversity of distributions with non-negative support can be obtained. In Proposition 1, we apply the idea given in (3) with k = 1 to the skew density g λ defined in (2).
Proposition 1.
Let g λ ( · ) defined in (2) and c α , λ 1 = 0 α + x 2 α + 1 g λ ( x ) d x . Provided that c α , λ 1 < , and by applying Lemma 2, the following family of densities is obtained
f X x ; α , λ = c α , λ α + x 2 α + 1 g λ x , x 0 ,
where α > 0 and λ R .
Proof. 
Straightforward by using Lemma 2.    □
Remark 1.
Particular cases of interest of (4) are:
P1. 
Taking limit when λ , then g λ tends to the half (or folded at zero) density of f 0 , 2 f 0 ( x ) with x 0 , and therefore f X ( · ) tends to the pdf introduced in (3).
P2. 
If α , then f X tends to the density g λ truncated to ( 0 , + ) , which can be denoted as f X ( x ; α , λ ) c g λ x , x 0 .
In a general setting, it is of interest to introduce a scale parameter δ > 0 . Then, the pdf of our family will be δ 1 f X ( x / δ ) , that is
f X ( x ; α , λ , δ ) = c α , λ δ α + ( x / δ ) 2 α + 1 g λ x δ ,
with c α , λ given in Proposition 1.
The notation X f X ( x ; α , λ , δ ) will be used to refer to the density defined in (5).

3. The Extended Half Skew-Normal

In this section, we focus on the particular case in which g λ ( · ) given in (2), is the pdf of the skew normal model, that is, f 0 = ϕ . The model, which results of applying (4), will be called the Extended Half Skew-Normal (EHSN). Results in this new model will be obtained by applying next lemmas, whose proofs can be seen in Nadarajah and Kotz [10] and Huang et al. [14], respectively.
Lemma 3
(Nadarajah and Kotz [10]). Let F t r be the cdf of a t-Student’s distribution with r > 0 degrees of freedom. Then, for every positive integer r, F t r is given by
F t r ( t ) = 1 2 + 1 π arctan t r + 1 2 π i = 1 ( r 1 ) / 2 Γ ( i ) r i 1 / 2 Γ ( i + 1 / 2 ) t ( r + t 2 ) i , if   r   is   odd 1 2 + 1 2 π i = 1 r / 2 Γ ( i 1 / 2 ) r i 1 Γ ( i ) t ( r + t 2 ) i 1 / 2 , if   r   is   even
where t R and i = 1 0 is defined as 0.
Lemma 4
(Huang et al. [14]). For s 1 and λ R , the following result holds
0 v s ϕ ( v ) Φ ( λ v ) d v = 2 s / 2 1 Γ ( ( s + 1 ) / 2 ) π F t s + 1 λ s + 1 .
Lemma 5.
Let X have pdf f X ( x ; α , λ , δ ) as introduced in (5). Then, the normalizing constant, c α , λ , in (5) is
c α , λ = 2 π π + 2 arctan ( λ ) + 2 λ ( 1 + λ 2 ) ( α + 1 ) .
Proof. 
Note that, making the change of variable v = x δ ,
c α , λ 1 = 0 2 δ 3 α δ 2 + x 2 α + 1 ϕ x δ Φ λ x δ d = 0 2 α + v 2 α + 1 ϕ v Φ λ v d v .
By applying the results given in Lemma 3 and Lemma 4 [10]
0 ϕ v Φ λ v d v = Γ ( 1 / 2 ) 2 π F t 1 ( λ ) = 1 4 + 1 2 π arctan ( λ ) ,
0 v 2 ϕ v Φ λ v d v = Γ ( 3 / 2 ) π F t 3 λ 3 = 1 4 + 1 2 π λ 1 + λ 2 + arctan ( λ ) .
From (8)–(10) is obtained.    □
Proposition 2.
Let X E H S N ( α , δ , λ ) . Then, the pdf of X is given by
f X ( x ; α , δ , λ ) = c α , λ δ 3 α δ 2 + x 2 α + 1 2 ϕ x δ Φ λ x δ , x 0 ,
with α > 0 shape parameter, δ > 0 scale parameter, λ 0 skewness parameter and c α , λ given in (8).
Proof. 
It follows from Lemma 5.    □
Remark 2.
In the model introduced in Proposition 2, we restrict the skewness parameter to λ 0 since negative values of λ skew the distribution to negative values of x, and the resulting pdf’s are not of interest for the purpose of modelling non-negative data.
Corollary 1.
The following models are particular cases of the EHSN distribution:
1. 
If λ 0 or λ + then EHSN ( α , δ , λ ) reduces to the Extended Half-Normal distribution, EHN ( α , δ ) , introduced in [11].
2. 
If λ = 0 , α = 0 , and δ = 1 , then EHSN ( α = 0 , δ = 1 , λ = 0 ) reduces to the Right Half Bimodal Normal model proposed in [15], RHBN ( 2 ) .
3. 
If λ = 0 and α then EHPE ( α , δ , λ = 0 ) reduces to the Half-Normal distribution, HN(δ).
Figure 1 summarizes the relationships among the EHSN and the particular cases previously cited.
In Figure 2, plots for the pdf of EHSN model are given. Without loss of generality, the scale parameter δ = 1 is taken. Four values of λ are fixed ( λ = 0.1 , 1 , 2 , 5 ), and several values of α are considered.
Next, it is proven that the cdf of the EHSN model can be expressed in terms of the cdf of a skew-normal, cdf and pdf of a Generalized Gamma, and pdf of a N ( 0 , 1 ) distribution. Details about the Generalized Gamma introduced by Stacy [16] are given in Appendix A.
Proposition 3.
Let X E H S N ( α , δ , λ ) , λ > 0 . Then, the cdf X, F X , can be obtained as:
F X ( x ) = c α , λ ( α + 1 ) α Φ x δ 2 O x δ , λ 1 2 + 1 π arctan ( λ ) + 1 2 F G G x δ + 0 x / δ f G G ( t ) 0 λ t ϕ ( u ) d u d t ,
where ϕ ( · ) and Φ ( · ) denote the pdf and cdf of a N ( 0 , 1 ) , F G G and f G G denotes the cdf and pdf of a Generalized Gamma distribution, G G ( 2 , 3 , 2 ) , and O ( · , λ ) is the Owen [17] function.
Proof. 
Given X E H S N ( α , δ , λ ) , let us consider Z = X δ E H S N ( α , 1 , λ ) . We have
F X ( x ) = F Z x δ .
Next, we are going to obtain the cdf of Z for z > 0
F Z z = 0 z f Z ( t ) d t = c α , λ α ( α + 1 ) 0 z 2 ϕ ( t ) Φ ( λ t ) d t + c α , λ 1 ( α + 1 ) 0 z 2 t 2 ϕ ( t ) Φ ( λ t ) d t .
Note that 2 ϕ ( t ) Φ ( λ t ) for t R is the pdf of a skew-normal distribution, S N ( λ ) , as it can be seen in [1]. Therefore, we can write
0 z 2 ϕ ( t ) Φ ( λ t ) d t = Φ x δ 2 O x δ , λ 1 2 + 1 π arctan ( λ ) ,
On the other hand, let us next consider
0 z 2 t 2 ϕ ( t ) Φ ( λ t ) d t .
Note that 2 t 2 ϕ ( t ) for t > 0 is the pdf of the Generalized Gamma distribution introduced by Stacy [16], G G ( a = 2 , d = 3 , p = 2 ) . By proceeding similarly to [18], we have that
0 z 2 t 2 ϕ ( t ) Φ ( λ t ) d t = 0 z 2 t 2 ϕ ( t ) λ t ϕ ( u ) d u d t ,
can be obtained in terms of the joint distribution of a random vector ( T , U ) with T G G ( 2 , 3 , 2 ) , f G G ( t ) = 2 t 2 ϕ ( t ) with t > 0 , and U N ( 0 , 1 ) independent.
Specifically, for λ > 0
0 z 2 t 2 ϕ ( t ) λ t ϕ ( u ) d u d t = F G G ( z ) Φ ( 0 ) + 0 z 2 t 2 ϕ ( t ) 0 λ t ϕ ( u ) d u d t .
Taking into account that Φ ( 0 ) = 1 / 2 , f G G ( t ) = 2 t 2 ϕ ( t ) with t > 0 , and (12) and (13) follows.    □
Since the EHSN distribution can be used to model lifetime data is of interest to study its survival and hazard rate function, see [19,20]. These functions are next obtained. Some plots are given in Figure 3.
Corollary 2.
Let X E H S N ( α , δ , λ ) with λ > 0 . Then
1. 
The survival function, S X , is S X ( t ) = 1 F X ( t ) , t > 0 , and F X was given in (12).
2. 
The hazard rate function, h X , is h X ( t ) = f X ( t ) S X ( t ) , t > 0 , and f X given in (11).
Plots of the survival and hazard rate function of X E H S N ( α , δ , λ ) are given in Figure 3 for fixed λ = 0.1 , δ = 1 , and α { 0.5 , 1 , 1.5 , 2 , 5 } .
Remark 3.
Similar plots were obtained for the values of parameters considered in Figure 2. The plot obtained in Figure 3 for the hazard rate function suggests that for λ and δ fixed, the E H S N model is stochastically ordered with respect to α.

3.1. Moments

From now on, the notation Y f Y ( y ) will be used to denote that Y has pdf f Y . In this section, first the moments of E H S N model are obtained. Second it is proven that the moment generating function (mgf) of E H S N model can be obtained from the mgf of Y c 1 ϕ ( y ) Φ ( λ y ) , y 0 .
Proposition 4.
Let X E H S N ( α , δ , λ ) . Then, the moment of order n can be obtained as
E [ X n ] = 2 c α , λ α + 1 α 2 n / 2 1 Γ ( ( n + 1 ) / 2 ) π F t n + 1 ( λ n + 1 ) + 2 n / 2 Γ ( ( n + 3 ) / 2 ) π F t n + 3 ( λ n + 3 ) ,
where F t r · was given in (6).
Proof. 
From (6) and (7), the proposed result follows.    □
Next, the moment generating function (mgf) is obtained.
Proposition 5.
Let X E H S N ( α , δ , λ ) . Then the mgf of X, M X , can be obtained as
M X ( t ) = M Z ( δ t ) w i t h Z E H S N ( α , 1 , λ )
and where the mgf of Z is given by
M Z ( t ) = c α , λ ( α + 1 ) ( 1 + α + t 2 ) M Y ( t ) + c α , λ ( α + 1 ) t 2 π + t 2 π λ 1 + λ 2 M H N ( t ) + λ π ( 1 + λ 2 ) M G G ( t ) ,
where M Y ( t ) is the mgf of Y c 1 2 ϕ ( y ) Φ ( λ y ) ,   y > 0 ; M H N ( t ) is the mgf of a half normal distribution, H N 1 1 + λ 2 ; and M G G is the mgf of a Generalized Gamma, G G 2 1 + λ 2 , 2 , 2 .
Proof. 
M Z ( t ) = E exp ( t Z ) = 0 e t z f z ( z ) d z ,
with f z the pdf of Z E H S N ( α , 1 , λ ) .
Note that
0 e t z 2 ϕ ( z ) Φ ( λ z ) d z = M Y ( t )
with Y c 1 2 ϕ ( y ) Φ ( λ y ) ,   y > 0 . That is, the mgf of a skew normal distribution, S N ( λ ) , truncated at zero.
Let us consider now
0 e t z 2 z 2 ϕ ( z ) Φ ( λ z ) d z = I .
This integral is solved by applying integration by parts twice, taking in both cases d v = z ϕ ( z ) , ( v = ϕ ( z ) ). First, we have
I = M y ( t ) + 2 t 0 e t z z ϕ ( z ) Φ ( λ z ) d z + 2 λ 0 e t z z ϕ ( z ) ϕ ( λ z ) d z .
Second, we obtain
2 t 0 e t z z ϕ ( z ) Φ ( λ z ) d z = t 2 π + t 2 M y ( t ) + λ t 0 e t z 2 ϕ ( z ) ϕ ( λ z ) d z = t 2 π + t 2 M y ( t ) + λ t 2 π 1 1 + λ 2 M H N ( t ) ,
where M H N ( t ) is the mgf of a half normal distribution, H N 1 1 + λ 2 .
Finally, we consider
2 λ 0 e t z z ϕ ( z ) ϕ ( λ z ) d z = λ π 0 e t z z e x p z 2 ( 1 + λ 2 ) 2 d z = λ π ( 1 + λ 2 ) M G G ( t )
where M G G is the mgf of a Generalized Gamma, G G a = 2 1 + λ 2 , d = 2 , p = 2 .    □
As a direct consequence of Proposition 4, the first four moments of E H S N ( α , δ , λ ) model are given, μ k = E [ X k ] , with k = 1 , , 4 .
Corollary 3.
Let X E H S N ( α , λ , δ ) . Then,
1. 
μ 1 = 2 c α , λ δ 2 π ( α + 1 ) α F t 2 λ 2 + 2 F t 4 λ 4 .
2. 
μ 2 = c α , λ δ 2 α + 1 α F t 3 λ 3 + 3 F t 5 λ 5 .
3. 
μ 3 = 2 2 c α , λ δ 3 π ( α + 1 ) α F t 4 λ 4 + 4 F t 6 λ 6 .
4. 
μ 4 = c α , λ δ 4 α + 1 3 α F t 5 λ 5 + 15 F t 7 λ 7 .
Expressions of variance, skewness and kurtosis can be obtained from Corollary 3.
Corollary 4.
Let X E H S N ( α , λ , δ ) . Then,
1. 
The variance of X, V a r [ X ] = E [ X 2 ] ( E [ X ] ) 2 , is
V a r [ X ] = c α , λ δ 2 α + 1 α F t 3 ( λ 3 ) + 3 F t 5 ( λ 5 ) 4 c α , λ 2 π ( α + 1 ) α F t 2 ( λ 2 + 2 F t 4 ( λ 4 ) ) 2 .
2. 
The skewness, β 1 , and kurtosis, β 2 , coefficients can be obtained by using
β 1 = μ 3 3 μ 1 μ 2 + 2 μ 1 3 ( μ 2 μ 1 2 ) 3 2 , a n d β 2 = μ 4 4 μ 1 μ 3 + 6 μ 2 μ 1 2 3 μ 1 4 ( μ 2 μ 1 2 ) 2 .
Without loss of generality, the scale parameter can be taken as one, δ = 1 . Plots for β 1 and β 2 , as functions of α and λ are given in Figure 4.

3.2. Stochastic Representation

In this section, the stochastic representation of the EHSN model is given. This result will be used to generate random values of this model.
Next propositions allow us to obtain the stochastic representation of the EHSN model.
Proposition 6.
Let Y and U be independent rv’s, with Y 2 χ 3 2 and P ( U = 1 ) = P ( U = 1 ) = 1 2 . Then, the rv W = U | Y | is distributed as W w 2 ϕ ( w ) with w R .
Proof. 
It can be seen in Elal-Olivero et al. [11].    □
Proposition 7.
Let Z and W be independent rv’s, with Z N ( 0 , 1 ) and W w 2 ϕ ( w ) , w R . Let us define H = α α + 1 Z + 1 α + 1 W . Then H α + h 2 α + 1 ϕ ( h ) , with h R and α > 0 .
Proof. 
Let us denote a = α α + 1 and b = 1 α + 1 . We have that the cdf of H, F H , is
F H ( h ) = P ( a Z + b W h ) = P Z h b W a = F Z h b w a f W ( w ) d w .
Taking derivative with respect to h, the pdf of H, f H , is obtained
f H ( h ) = 1 a f Z h b w a f w ( w ) d w = 1 a ϕ ( h ) w 2 ϕ w h b a d w = α + h 2 α + 1 ϕ ( h ) , h R .
   □
Proposition 8.
Let L = | H | , where H was defined in Proposition 7. Then, L 2 α + l 2 α + 1 ϕ l with l > 0 , that is, L E H N ( α ) .
Proof. 
For l > 0 , we have that
F L ( l ) = F | H | l = P | H | < l = P H < l P H < l = F H l F H l .
Taking derivative with respect to l, and sice f H is symmetrical about zero, we have that
f L ( l ) = f H l + f H l = 2 f H l = 2 α + l 2 α + 1 ϕ l , l > 0 .
   □
Proposition 9.
Let L and Z be independent rv’s, such that L 2 α + l 2 α + 1 ϕ l , with l > 0 and Z N ( 0 , 1 ) . Then
T L | Z < λ L E H S N ( α , λ ) , λ R .
Proof. 
The cdf of T, F T , is
F T ( t ) = P ( L < t | Z < λ L ) = P ( L t , Z < λ L ) P ( Z < λ L ) , t > 0 .
The numerator is
P ( L t , Z < λ L ) = 0 t P ( Z < λ L | L = l ) f L ( l ) d l = 2 0 t α + l 2 α + 1 ϕ l Φ ( λ l ) d l .
The denominator is
P ( Z < λ L ) = 0 P ( Z < λ L | L = l ) f L ( l ) d l = 2 0 α + l 2 α + 1 ϕ ( l ) Φ ( λ l ) d l .
From (8), P ( Z < λ L ) = c α , λ 1 , and therefore
F T ( t ) = 2 c α , λ 0 t α + l 2 α + 1 ϕ ( l ) Φ ( λ l ) d l .
Taking derivative with respect to t, the pdf of T is
f T ( t ) = 2 c α , λ α + t 2 α + 1 ϕ ( t ) Φ ( λ t ) , t > 0 .
   □
Proposition 10.
Let X = δ T with δ > 0 and T defined in Proposition 9. Then X E H S N ( α , δ , λ ) .
Proof. 
Immediate since the pdf of X is f X ( x ) = 1 δ f T x δ .    □

4. Inference

In this section, inference for the parameters in the E H S N distribution is carried out from a classical point of view. Let X E H S N ( α , δ , λ ) and consider independent copies of X, that is X 1 , X 2 , , X n a random sample from X. The method of moments and maximum likelihood estimators are next discussed.

4.1. Method of Moment Estimators

The moments estimators result from the solution of the equations E ( X j ) = X j ¯ , for j = 1 , 2 , 3 , where X j ¯ = n 1 i = 1 n x i j denotes the j-th sample moment. Solving E ( X ) = X ¯ , we have that
δ = X ¯ 2 π ( α + 1 ) 2 c α , λ α F t 2 λ 2 + 2 F t 4 λ 4 .
Taking (17), and by substituting it into E ( X j ) , j = 2 , 3 given in Corollary 1, we get
X 2 ¯ = X ¯ 2 π ( α + 1 ) α F t 3 λ 3 + 3 F t 5 λ 5 2 c α , λ α F t 2 λ 2 + 2 F t 4 λ 4 2 ,
X 3 ¯ = X ¯ 3 π ( α + 1 ) 2 α F t 4 λ 4 + 4 F t 6 λ 6 c α , λ 2 α F t 2 λ 2 + 2 F t 4 λ 4 3 .
These equations must be solved by using mathematical software, such as the function nleqslv available in R software [21], to obtain the moment estimators α ^ M M and λ ^ M M . Finally, δ ^ M M is obtained from (17).

4.2. Maximum Likelihood

Given X 1 , X 2 , , X n a random sample of size n from E H S N ( α , δ , λ ) , then from (11), the log-likelihood function is given by
( θ ) n log ( c α , λ ) 3 n log ( δ ) n log ( α + 1 ) + i = 1 n log ( α δ 2 + x i 2 ) 1 2 δ 2 i = 1 n x i 2 + i = 1 n log Φ λ x i δ ,
where ∝ means proportional to, and θ = ( α , δ , λ ) . Taking partial derivatives with respect to α , δ , and λ , the elements of the score vector are obtained, S ( θ ) = α , δ , λ , that is
α = n α + 1 + n λ c α , λ π ( α + 1 ) 2 ( 1 + λ 2 ) + i = 1 n δ 2 α δ 2 + x i 2 , δ = 3 n δ + 1 δ 3 i = 1 n x i 2 + i = 1 n 2 α δ α δ 2 + x i 2 λ i = 1 n x i δ 2 ξ λ x i δ , λ = c α , λ π ( 1 + λ 2 ) 1 + 1 λ 2 ( 1 + α ) ( 1 + λ 2 ) + i = 1 n x i δ ξ λ x i δ ,
where ξ ( · ) = ϕ ( · ) / Φ ( · ) . The MLEs of θ ^ are obtained as solution of S ( θ ) = 0 . Numerical methods must be used to solve this system. For instance, we use the Broyden–Fletcher–Goldfarb–Shanno (BFGS) algorithm to obtain the estimators of ( log   α , log   δ , λ ) and by invariance, we obtained the estimators of α and δ (See details in [22]).

4.3. Observed Fisher Information Matrix

The asymptotic variance of MLEs, θ ^ = ( α ^ , δ ^ , λ ^ ) , can be estimated from the Fisher information matrix, given by I ( θ ) = E 2 ( θ ) / θ θ with ( θ ) given in (20). Recall that, under regularity conditions,
I ( θ ) 1 / 2 θ ^ θ D N 3 ( 0 3 , I 3 ) , as   n + ,
where D stands convergence in distribution and N 3 ( 0 3 , I 3 ) denotes the standard trivariate normal distribution. Moreover, I ( θ ) can be obtained from the matrix 2 ( θ ) / θ θ , whose elements are given by I α α = 2 ( θ ) / α 2 , I α δ = 2 ( θ ) / α δ , and so on. Explicitly, we have
I α α = i = 1 n δ 4 ( α δ 2 + x i 2 ) 2 n ( α + 1 ) 2 1 + λ c α , λ π ( α + 1 ) ( 1 + λ 2 ) λ c α , λ π ( α + 1 ) ( 1 + λ 2 ) 2 , I α δ = i = 1 n 2 δ α δ 2 + x i 2 α δ 2 ( α δ 2 + x i 2 ) 1 , I α λ = n c α , λ π ( α + 1 ) 2 ( 1 + λ 2 ) λ c α , λ π ( 1 + λ 2 ) k α , λ + 2 λ 2 1 + λ 2 1 , I δ δ = 3 δ 2 i = 1 n x i 2 δ 2 n + i = 1 n 2 α α δ 2 + x i 2 2 α δ 2 α δ 2 + x i 2 1 + λ i = 1 n x i δ 3 ξ λ x i δ λ x i δ λ x i δ + ξ λ x i δ 2 , I δ λ = i = 1 n x i δ 2 ξ λ x i δ 1 λ x i δ λ x i δ + ξ λ x i δ , I λ λ = c α , λ π ( 1 + λ 2 ) 2 k α , λ c α , λ π k α , λ + 2 λ + 4 λ ( 1 + α ) ( 1 + λ 2 ) + i = 1 n x i 2 δ 2 ξ λ x i δ λ x i δ + ξ λ x i δ .
where k α , λ = 1 + 1 λ 2 ( 1 + α ) ( 1 + λ 2 ) .
In practice, it is not possible to obtain a closed form to the expected value of previous expressions. So, the covariance matrix of MLEs, I ( θ ) 1 , can be estimated by I ( θ ^ ) 1 , where I ( θ ^ ) denotes the observed information matrix, which is obtained by evaluating the previous derivatives at the MLE θ ^ , i.e.
I ( θ ^ ) = 2 ( θ ) / θ θ | θ = θ ^ .
The asymptotic variances of α ^ , δ ^ , and λ ^ are estimated by the diagonal elements of I ( θ ^ ) 1 , and their standard errors by the square root of asymptotic variances. Details about the theoretical results used in this subsection can be seen in [23].

5. Simulation Study

In this section, a simulation study is carried out to asses the performance of ML estimators. First an algorithm to generate samples from E H S N ( α , λ , δ ) is given. The simulation algorithm is based on the stochastic representation introduced in Section 3.2.
Algorithm
(i)
Simulate independently: R U ( 0 , 1 ) , Y χ 3 2 and Z , J N ( 0 , 1 ) .
(ii)
If R < 1 2 , then U = 1 . Otherwise U = 1 .
(iii)
Compute W = U Y .
(iv)
Compute L = α α + 1 Z + 1 α + 1 W .
(v)
Compute L = | H | .
(vi)
If J < λ L , then T = L . Otherwise, repeat steps (i) to (v) until you get a new random value of T.
(vii)
Take X = δ T .
As values of parameters in our simulation, we consider for α 0.25 , 0.5 , 1 ; λ 0.5 , 1 , 2 and δ 1 , 10 . As for the sample size, we consider n 100 , 300 , 500 , 1000 . For each sample size, and every combination of α , λ , δ , we carry out 1000 replicates and the corresponding ML estimates are computed.
Results are given in Table 1. As summaries we provide the estimated bias (bias), the mean of the estimated standard errors (SE), and the root of the estimated mean squared error (RMSE).
For the ML estimators of λ , α and δ , note that when the sample size increases then the bias, SE and RMSE decrease. Additionally, note that, when the sample size increases then, the SE and RMSE are closer, which suggests that the standard errors of the MLE estimators are well estimated. However, we highlight that such convergence is slower for the ML of λ , suggesting that a big sample size is necessary in order to guarantee good statistical properties of this estimator.

6. Applications

In this section, two real applications are given. The aim is to compare the EHSN model to other models of interest. Specifically, the EHSN model is compared to its precedent, the Extended Half-Normal distribution (EHN) proposed in [11] and the log-skew-normal (LSN) introduced in [24] with location parameter α , scale parameter δ , and shape parameter λ , which includes as particular case the traditional log-normal (LN) distribution for λ = 0 .

6.1. Application 1

We consider the dataset that corresponds to daily average wind speeds for 1961–1978 at 12 synoptic meteorological stations in the Republic of Ireland at the station number 7 (DUB) (see http://lib.stat.cmu.edu/datasets/wind.data, last accessed on 12 July 2022).
In Table 2, the descriptive summaries are provided: sample mean, sample variance, sample skewness ( b 1 ), and sample kurtosis coefficient ( b 2 ). We highlight that we obtained a low value for the sample kurtosis coefficient, b 2 = 4.0531 , which suggests that a distribution with flexible values for this coefficient, such as the EHSN can be used to model this dataset.
For this application, there is one observation as “zero”. The EHN and EHSN do not have problem to accommodate this observation. However, the LSN (and LN) distribution cannot accommodate it. For this reason, we only consider as competitors the EHN and EHSN in this problem. The moments estimators for the EHSN model in this dataset are α ^ M M = 1.093 , δ ^ M M = 5.059 and λ ^ M M = 2.521 , which were used to initialize the maximization procedure to obtain the corresponding MLE’s. In Table 3, EHN and EHSN models are compared. Akaike Information Criterion (AIC) [25] and Bayesian Information Criterion (BIC) [26] are used. Additionally, in Figure 5, the empirical cdf is plotted along with the cdf estimated for the EHSN model, whereas qqplots comparing both models are given in Figure 6. In Figure 7, the histogram and models fitted by maximum likelihood are given.
We highlight that following AIC and BIC criteria, the fitted EHSN model provides a better fit to this dataset. QQ-plots in Figure 6 and histogram with fitted densities in Figure 7 also support this statement.

6.2. Application 2

This dataset corresponds to heights (100 × feet) of 219 volcanoes studied in [27]. In Table 4, the descriptive summaries are provided. These are: the sample mean ( x ¯ ), the sample variance ( s 2 ), the sample skewness coefficient ( b 1 ), and the sample kurtosis coefficient ( b 2 ).
In this application, the EHSN model is compared to LSN and EHN. Moments estimates in the EHSN model are obtained by applying results in Section 4.1, these are α ^ M M = 1.579 , δ ^ M M = 59.668 , and λ ^ M M = 2.788 . These estimates are used as initial values to obtain MLEs by using numerical methods. ML estimates for the LSN, EHN, and EHSN models, along with their standard errors are provided in Table 5. For purposes of comparison, log-likelihood, AIC and BIC are also included in this table. These summaries support the fact that the EHSN model provides a better fit to this dataset. As plots, the histogram, along with the estimated pdfs, are given in Figure 8. The QQ-plots comparing EHN and EHSN models are given in Figure 9. In Figure 10, the empirical cdf is plotted along with the cdf estimated for the EHSN model. All these plots support our conclusions.

7. Final Discussion and Conclusions

This study presents a new model with positive support based on the skew-normal distribution, which has been called the extended half skew-normal distribution. This distribution is useful as a more general model compared to the EHN model proposed by Elal-Olivero [11], pursuing to increase kurtosis and improve the modeling of positive datasets with high kurtosis. Relevant properties of the model are given. Closed expressions are given for the pdf, cdf, mgf, and moments. Additionally, a stochastic representation is proposed, which will be the basis to generate random values in this model. Estimation of parameters is carried out via maximum likelihood methods by using numerical techniques. The simulation study illustrates the good performance of estimators. Two applications with real datasets were also carried out, verifying that the new model performs better than the competing models.
One of the referees points out that the pdf’s introduced throughout this paper, see, for instance, the general family defined in (4) or the pdf for the E H N introduced in (3), can be considered as weighted distributions, [28]. For the pdf introduced in (4), this would be to write
f X ( x ) = w ( x ; α ) E g λ I [ x 0 ] w ( X ; α ) g λ ( x ) I [ x 0 ] ,
with weight function w ( x ; α ) = α + x 2 . This idea may be studied in future works along with its implications.

Author Contributions

Conceptualization, K.I.S., H.J.G., D.I.G. and I.B.-C.; Formal analysis, I.B.-C. and D.I.G.; Investigation, K.I.S., H.J.G. and I.B.-C.; Software, D.I.G.; Supervision, H.W.G. All of the authors contributed significantly to this research article. All authors have read and agreed to the published version of the manuscript.

Funding

The research of I. Barranco-Chamorro was supported by IOAP of University of Seville, Spain. The research of H.W. Gómez was supported by Semillero UA-2022 project, Chile.

Data Availability Statement

Dataset studied in Section 6.1 can be found at http://lib.stat.cmu.edu/datasets/wind.data, (last accessed on 12 July 2022). The dataset in Section 6.2 has been taken from [27].

Acknowledgments

The authors want to express their gratitude to referees, whose comments helped to improve significantly this paper.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A

In this Appendix details are given about the models under consideration in Section 3. These models are:
  • Skew-Normal, X S N ( λ ) , introduced in [1], whose pdf is
    f ( x ; λ ) = 2 ϕ ( x ) Φ ( λ x ) , x R , λ R .
    The cdf is
    F ( x ; λ ) = Φ ( x ) 2 O ( x , λ ) , x R , λ R ,
    where O ( · , λ ) denotes the Owen function [17].
  • Generalized Gamma, G G ( a , d , p ) , introduced in [16], whose pdf is
    f ( x ; a , d , p ) = 1 Γ d p p a d x d 1 e ( x / a ) p , x > 0 , a , d , p > 0 .
    The cdf is
    F ( x ; a , d , p ) = Γ z d p Γ d p
    where z = ( x / a ) p and Γ z d p = 0 z v ( d / p ) 1 e v d v , see [29].

References

  1. Azzalini, A. A class of distributions which includes the normal ones. Scand. J. Stat. 1985, 12, 161–178. [Google Scholar]
  2. Azzalini, A. The skew-normal distribution and related multivariate familie. Scand. J Stat. 2005, 32, 159–188. [Google Scholar] [CrossRef]
  3. Azzalini, A.; Capitanio, A. Distributions generated by perturbation of symmetry with emphasis on a multivariate skew t-distribution. J. R. Stat. Soc. Ser. Stat. Methodol. 2003, 65, 367–389. [Google Scholar] [CrossRef]
  4. Gupta, A.; Chang, F.; Huang, W. Some skew-symmetric models. Random Oper. Stoch. Equ. 2002, 10, 133–140. [Google Scholar] [CrossRef]
  5. Arellano-Valle, R.B.; Gómez, H.W.; Quintana, F.A. A New Class of Skew-Normal Distributions. Commun. Stat. Theory Methods 2004, 33, 1465–1480. [Google Scholar] [CrossRef]
  6. DiCiccio, T.J.; Monti, A.C. Inferential aspects of the skew exponential power distribution. J. Am. Stat. Assoc. 2004, 99, 439–450. [Google Scholar] [CrossRef]
  7. Gómez, H.W.; Venegas, O.; Bolfarine, H. Skew-symmetric distributions generated by the distribution function of the normal distribution. Environmetric 2007, 18, 395–407. [Google Scholar] [CrossRef] [Green Version]
  8. Adcock, C.; Azzalini, A. A selective overview of skew-elliptical and related distributions and of their applications. Symmetry 2020, 12, 118. [Google Scholar] [CrossRef] [Green Version]
  9. Gómez–Déniz, E.; Dávila-Cárdenes, N.; Boza-Chirino, J. Modelling expenditure in tourism using the log-skew normal distribution. Curr. Issues Tour. 2022, 25, 2357–2376. [Google Scholar] [CrossRef]
  10. Nadarajah, S.; Kotz, S. Skew ditribution generated by the normal kernel. Stat. Probab. Lett. 2003, 65, 269–277. [Google Scholar] [CrossRef]
  11. Elal-Olivero, D.; Olivares-Pacheco, J.F.; Gómez, H.W.; Bolfarine, H. A New Class of Non Negative Distributions Generated by Symmetric Distributions. Commun. Stat. Methods 2009, 38, 993–1008. [Google Scholar] [CrossRef]
  12. Subbotin, M. On the law of frecuency of errors. Math. Sb. Hall. 1923, 31, 296–301. [Google Scholar]
  13. Santoro, K.I.; Gómez, H.J.; Barranco-Chamorro, I.; Gómez, H.W. Extended Half-Power Exponential Distribution with Applications to COVID-19 Data. Mathematics 2022, 10, 942. [Google Scholar] [CrossRef]
  14. Huang, W.J.; Su, N.C.; Teng, H.Y. On some study of skew-t distribution. Commun. Stat. Theory Methods 2003, 48, 4712–4729. [Google Scholar] [CrossRef]
  15. Alavi, S.M.R. On a new bimodal normal family. J. Stat. Res. Iran 2011, 8, 163–175. [Google Scholar] [CrossRef] [Green Version]
  16. Stacy, E.W. A Generalization of the Gamma Distribution. Ann. Math. Stat. 1962, 33, 1187–1192. [Google Scholar] [CrossRef]
  17. Owen, D.B. Tables for computing bivariate normal probabilities. Ann. Math. Stat. 1956, 27, 1075–1090. [Google Scholar] [CrossRef]
  18. Martínez-Flórez, G.; Barranco-Chamorro, I.; Gómez, H.W. Flexible Log-Linear Birnbaum–Saunders Model. Mathematics 2021, 9, 1188. [Google Scholar] [CrossRef]
  19. Lai, C.D.; Xie, M. Stochastic Ageing and Dependence for Reliability; Springer Science & Business Media: Berlin/Heidelberg, Germany, 2006. [Google Scholar]
  20. Marshall, A.W.; Olkin, I. Life Distributions; Springer: New York, NY, USA, 2007; Volume 13. [Google Scholar]
  21. R Core Team. R: A Language and Environment for Statistical Computing; R Foundation for Statistical Computing: Vienna, Austria, 2021; Available online: https://www.R-project.org/ (accessed on 31 August 2022).
  22. Fletcher, R. Practical Methods of Optimization, 2nd ed.; John Wiley & Sons: New York, NY, USA, 1987. [Google Scholar]
  23. Rohatgi, V.K.; Saleh, A.K.M.E. An Introduction to Probability Theory and Mathematical Statistics, 3rd ed.; John Wiley: New York, NY, USA, 2001. [Google Scholar]
  24. Azzalini, A.; Cappello, D.; Kotz, S. Log-skew-normal and log-skew-t distributions as model for family income data. J. Income Distrib. 2003, 11, 12–20. [Google Scholar] [CrossRef]
  25. Akaike, H. A new look at the statistical model identification. IEEE Trans. Autom. Control 1974, 19, 716–723. [Google Scholar] [CrossRef]
  26. Schwarz, G. Estimating the dimension of a model. Ann. Stat. 1978, 6, 461–464. [Google Scholar] [CrossRef]
  27. Tukey, J.W. Exploratory Data Analysis; Addison-Wesley: Boston, MA, USA, 1977. [Google Scholar]
  28. Patil, G.P.; Rao, C.R. Weighted distributions and size-biased sampling with applications to wildlife populations and human families. Biometrics 1978, 34, 179–189. [Google Scholar] [CrossRef]
  29. Abramowitz, M.; Stegun, I.A. Handbook of Mathematical Functions with Formulas, Graphs, and Mathematical Tables; Dover: New York, NY, USA, 1964. [Google Scholar]
Figure 1. Particular cases for the EHSN distribution.
Figure 1. Particular cases for the EHSN distribution.
Mathematics 10 03740 g001
Figure 2. (a) Pdf of EHSN ( α , δ = 1 , λ = 0.1 ) for different values of α . (b) Pdf of EHSN ( α , δ = 1 , λ = 1 ) for different values of α . (c) Pdf of EHSN ( α , δ = 1 , λ = 2 ) model for different values of α . (d) Pdf of EHSN ( α , δ = 1 , λ = 5 ) model for different values of α .
Figure 2. (a) Pdf of EHSN ( α , δ = 1 , λ = 0.1 ) for different values of α . (b) Pdf of EHSN ( α , δ = 1 , λ = 1 ) for different values of α . (c) Pdf of EHSN ( α , δ = 1 , λ = 2 ) model for different values of α . (d) Pdf of EHSN ( α , δ = 1 , λ = 5 ) model for different values of α .
Mathematics 10 03740 g002
Figure 3. (a) Plots of the survival function for the EHSN ( α , 1 , 0.1 ) . (b) Plots of the hazard rate function for the EHSN ( α , 1 , 0.1 ) .
Figure 3. (a) Plots of the survival function for the EHSN ( α , 1 , 0.1 ) . (b) Plots of the hazard rate function for the EHSN ( α , 1 , 0.1 ) .
Mathematics 10 03740 g003
Figure 4. (a) β 1 as function of λ   ( α = δ = 1 ) ; (b) β 2 as function of λ   ( α = δ = 1 ) ; (c) β 1 as function of α and λ   ( δ = 1 ) ; (d) β 2 as function of α and λ   ( δ = 1 ) .
Figure 4. (a) β 1 as function of λ   ( α = δ = 1 ) ; (b) β 2 as function of λ   ( α = δ = 1 ) ; (c) β 1 as function of α and λ   ( δ = 1 ) ; (d) β 2 as function of α and λ   ( δ = 1 ) .
Mathematics 10 03740 g004
Figure 5. Empirical and fitted EHSN cdf for Speed of Wind in dataset 1.
Figure 5. Empirical and fitted EHSN cdf for Speed of Wind in dataset 1.
Mathematics 10 03740 g005
Figure 6. qqplots for Speed of Wind dataset: (a) EHN model, (b) EHSN model.
Figure 6. qqplots for Speed of Wind dataset: (a) EHN model, (b) EHSN model.
Mathematics 10 03740 g006
Figure 7. Fitted EHSN and EHN model by maximum likelihood for speed of wind dataset.
Figure 7. Fitted EHSN and EHN model by maximum likelihood for speed of wind dataset.
Mathematics 10 03740 g007
Figure 8. Histogram for volcano heights dataset and estimated models. EHSN (solid line), EHN (dashed line), LSN (dotted line).
Figure 8. Histogram for volcano heights dataset and estimated models. EHSN (solid line), EHN (dashed line), LSN (dotted line).
Mathematics 10 03740 g008
Figure 9. qqplots for volcano heights dataset: (a) EHN model, (b) EHSN model.
Figure 9. qqplots for volcano heights dataset: (a) EHN model, (b) EHSN model.
Mathematics 10 03740 g009
Figure 10. Empirical and fitted EHSN cdf for volcano heights in dataset 2.
Figure 10. Empirical and fitted EHSN cdf for volcano heights in dataset 2.
Mathematics 10 03740 g010
Table 1. Estimated bias, SE and RMSE for ML estimators in finite samples from the EHSN model.
Table 1. Estimated bias, SE and RMSE for ML estimators in finite samples from the EHSN model.
True Value n = 100 n = 300 n = 500 n = 1000
λ α δ EstimatorBiasSERMSEBiasSERMSEBiasSERMSEBiasSERMSE
0.50.251 λ ^ 0.08874.10772.28760.07911.54351.58040.07630.86770.92950.06020.54910.6101
α ^ −0.08710.17660.1558−0.05800.11170.1078−0.05290.09060.0972−0.04290.07020.0788
δ ^ −0.00012.99921.6374−0.00610.60850.7113−0.00730.25880.4069−0.00910.05510.0886
10 λ ^ 0.09193.67742.03290.08811.39011.45810.08600.80250.83190.07530.52410.5291
α ^ −0.08750.17780.1570−0.05940.11550.1060−0.05240.09150.0965−0.04190.06880.0793
δ ^ −0.080124.337712.8305−0.06704.33035.0911−0.06601.88212.6233−0.05980.49060.6173
0.51 λ ^ 0.07743.94512.39540.06201.78961.94730.05571.22441.55750.05220.73750.9020
α ^ −0.16440.31920.2752−0.13190.20710.2080−0.11140.16890.1746−0.10980.14040.2068
δ ^ −0.02382.30731.2299−0.02300.69220.6892−0.02290.32360.4405−0.02360.20540.2725
10 λ ^ 0.08893.48272.26260.07031.65001.70880.06091.15961.44190.05760.69410.7828
α ^ −0.16570.31670.2721−0.13140.21040.2031−0.11220.17230.1786−0.10700.14500.2083
δ ^ −0.227718.03189.9169−0.21995.03754.5636−0.20282.90083.5120−0.19161.63281.9676
11 λ ^ 0.13574.16702.34740.10822.47561.92550.09421.34011.51830.05021.19931.2587
α ^ −0.38082.077510.3702−0.35140.43940.5198−0.26280.33210.3857−0.31560.26420.4956
δ ^ −0.05042.30900.9321−0.03621.32060.8033−0.04460.39320.4115−0.03290.58530.4356
10 λ ^ 0.14663.38982.14490.12802.31901.79190.11151.28321.41550.10891.12401.1980
α ^ −0.37882.09487.7521−0.35590.46380.5231−0.25580.33260.3834−0.30780.26470.4881
δ ^ −0.508915.61187.6009−0.377611.42826.3186−0.44773.51773.3111−0.32095.00683.5292
10.251 λ ^ 0.22212.82312.14650.16791.75091.97200.14901.35351.76980.11190.94271.4323
α ^ −0.07740.15940.1417−0.06880.09030.0983−0.06110.07180.0849−0.05480.05230.0696
δ ^ −0.01050.07580.1084−0.00890.03610.0324−0.00840.02880.0264−0.00630.02160.0211
10 λ ^ 0.20562.83242.08570.15731.68511.82360.11861.31091.69660.08830.87541.3112
α ^ −0.08100.16050.1445−0.06630.09110.0973−0.06020.07180.0829−0.05510.05270.0692
δ ^ −0.11470.98081.2949−0.09400.38460.5770−0.07000.28780.2646−0.07080.21770.2161
0.51 λ ^ 0.15482.50722.05120.13371.62161.81400.12891.30931.58640.11320.89361.2625
α ^ −0.17690.48311.9959−0.14820.16350.1823−0.13620.12940.1634−0.12070.09330.1414
δ ^ −0.02170.13880.1371−0.01580.04180.0393−0.01460.03250.0323−0.01230.02370.0255
10 λ ^ 0.15302.52032.04690.12231.61411.79410.11191.30241.59710.10510.89221.2264
α ^ −0.16730.41373.3884−0.14590.16370.1808−0.13640.12930.1631−0.12000.09320.1406
δ ^ −0.20531.26411.0703−0.16230.41370.3855−0.14840.32040.3188−0.11570.23410.2521
11 λ ^ 0.39952.87172.05610.34701.62431.75230.32511.30541.65520.27630.85421.1454
α ^ −0.40391.89001.7652−0.33820.33900.4111−0.30090.25790.3659−0.25630.17290.3070
δ ^ −0.03190.70010.4396−0.02220.14320.1427−0.02010.08060.1150−0.01690.03480.0432
10 λ ^ 0.39302.77742.10700.36251.64371.75450.26411.25701.50880.25250.85221.0931
α ^ −0.39561.53021.4103−0.33150.33470.4063−0.29390.25720.3579−0.25820.17580.3062
δ ^ −0.27635.61783.6497−0.21951.18981.2777−0.19720.75231.0015−0.16190.35450.4032
20.251 λ ^ 0.14123.84902.39800.12262.55332.19630.11122.16242.14800.09681.59111.8478
α ^ −0.06980.16490.1486−0.06390.09460.0984−0.06010.07470.0855−0.05460.05500.0724
δ ^ −0.02010.06600.0649−0.01480.03250.0349−0.01220.02570.0283−0.00850.01800.0206
10 λ ^ 0.20163.72582.33540.16032.60312.23580.13752.06712.04190.11811.53621.7291
α ^ −0.06940.294910.3606−0.06270.09410.0998−0.05690.07520.0852−0.05500.05530.0720
δ ^ −0.18570.56830.5554−0.14560.33000.3430−0.11450.25700.2810−0.09000.18040.2075
0.51 λ ^ 0.14083.37382.46860.12442.27362.08440.11691.77681.72680.10781.29281.3723
α ^ −0.14661.19165.8004−0.12480.17350.1786−0.11300.13350.1509−0.10440.09570.1301
δ ^ −0.02300.16890.1737−0.01670.04030.0480−0.01520.02950.0313−0.01190.02020.0237
10 λ ^ 0.15953.34672.27790.14722.25292.16340.13531.83221.85940.12221.31901.3916
α ^ −0.14560.65746.9030−0.11910.17030.1741−0.11500.13220.1544−0.10600.09670.1305
δ ^ −0.20851.22381.4289−0.16720.38700.3939−0.14900.30940.3834−0.11660.20010.2341
11 λ ^ 0.27823.53652.56550.27772.00521.94710.25531.62721.65020.24831.16611.1607
α ^ −0.31872.12199.7211−0.24700.35620.3716−0.21720.27360.3068−0.19870.19070.2553
δ ^ −0.02480.64830.4276−0.02090.09570.1128−0.01660.06340.1016−0.01470.02580.0292
10 λ ^ 0.25333.36562.57560.22241.95691.84860.20871.57681.58120.18441.14531.1161
α ^ −0.30611.95474.2552−0.24030.36230.3601−0.21670.27230.3012−0.20220.18970.2534
δ ^ −0.25744.20333.1281−0.19980.85850.9176−0.17040.41870.4742−0.14520.24870.2929
Table 2. Descriptive summaries for wind speeds dataset.
Table 2. Descriptive summaries for wind speeds dataset.
n x ¯ s 2 b 1 b 2
6574 6.3063 12.999 0.90314.0531
Table 3. Estimated parameters in the EHN and EHSN models.
Table 3. Estimated parameters in the EHN and EHSN models.
Estimates of ParametersEHNEHSN
α 0.413   ( 0.029 ) 0.754   ( 0.065 )
δ 4.674   ( 0.037 ) 4.851   ( 0.046 )
λ 3.058   ( 0.293 )
Log-likelihood−17,404.03−17,321.46
AIC 34,812.60 34,648.93
BIC 34,826.18 34,669.30
Table 4. Descriptive summaries for volcano heights (in 100 × feet).
Table 4. Descriptive summaries for volcano heights (in 100 × feet).
n x ¯ s 2 b 1 b 2
219 70.247 1850.548 0.83443.4439
Table 5. Estimates of parameters in EHN and EHSN models for volcanoes dataset.
Table 5. Estimates of parameters in EHN and EHSN models for volcanoes dataset.
Estimates of ParametersLSNEHNEHSN
α 4.015   ( 1.051 ) 0.783   ( 0.286 ) 1.579   ( 0.923 )
δ 0.770   ( 0.037 ) 56.514   ( 2.894 ) 59.668   ( 4.187 )
λ 0.007   ( 1.709 ) 2.788   ( 1.616 )
Log-likelihood−1133.636−1117.914−1115.100
AIC 2273.272 2241.828 2236.200
BIC 2283.440 2251.996 2246.367
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Santoro, K.I.; Gómez, H.J.; Gallardo, D.I.; Barranco-Chamorro, I.; Gómez, H.W. The Extended Half-Skew Normal Distribution. Mathematics 2022, 10, 3740. https://doi.org/10.3390/math10203740

AMA Style

Santoro KI, Gómez HJ, Gallardo DI, Barranco-Chamorro I, Gómez HW. The Extended Half-Skew Normal Distribution. Mathematics. 2022; 10(20):3740. https://doi.org/10.3390/math10203740

Chicago/Turabian Style

Santoro, Karol I., Héctor J. Gómez, Diego I. Gallardo, Inmaculada Barranco-Chamorro, and Héctor W. Gómez. 2022. "The Extended Half-Skew Normal Distribution" Mathematics 10, no. 20: 3740. https://doi.org/10.3390/math10203740

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop