Next Article in Journal
It’s All in the Embedding! Fake News Detection Using Document Embeddings
Previous Article in Journal
Three Mathematical Models for COVID-19 Prediction
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

A Bimodal Extension of the Epsilon-Skew-Normal Model

by
Juan Duarte
1,
Guillermo Martínez-Flórez
2,
Diego Ignacio Gallardo
3,
Osvaldo Venegas
4,* and
Héctor W. Gómez
1
1
Departamento de Matemáticas, Facultad de Ciencias Básicas, Universidad de Antofagasta, Antofagasta 1240000, Chile
2
Departamento de Matemática y Estadística, Facultad de Ciencias Básicas, Universidad de Córdoba, Montería 230002, Colombia
3
Departamento de Matemáticas, Facultad de Ingeniería, Universidad de Atacama, Copiapó 7820436, Chile
4
Departamento de Ciencias Matemáticas y Físicas, Facultad de Ingeniería, Universidad Católica de Temuco, Temuco 4780000, Chile
*
Author to whom correspondence should be addressed.
Mathematics 2023, 11(3), 507; https://doi.org/10.3390/math11030507
Submission received: 6 December 2022 / Revised: 28 December 2022 / Accepted: 3 January 2023 / Published: 18 January 2023
(This article belongs to the Section Probability and Statistics)

Abstract

:
This article introduces a bimodal model based on the epsilon-skew-normal distribution. This extension generates bimodal distributions similar to those produced by the mixture of normal distributions. We study the basic properties of this new family. We apply maximum likelihood estimators, calculate the information matrix and present a simulation study to assess parameter recovery. Finally, we illustrate the results to three real data sets, suggesting this new distribution as a plausible alternative for modelling bimodal data.

1. Introduction

In the last two decades of the twentieth century, inferential processes assumed the normality of the data. This assumption is not realistic in many cases, and the inferential processes were, therefore, inappropriate. In such situations, many authors decided to transform the variables in order to achieve symmetry or normality of the data, but these transformations led to unsatisfactory results because their interpretation became very complicated. Azzalini [1] introduced the skew-normal (SN) distribution, which allows asymmetric data to be modelled without the need for transformation. The probability density function (pdf) of the SN distribution is given by
ϕ S N ( z ; λ ) = 2 ϕ ( z ) Φ ( λ z ) , z R ,
where λ R , ϕ ( · ) and Φ ( · ) represent the pdf and cumulative distribution function (cdf) of the standard normal distribution, respectively. This is usually denoted by SN( λ ), where λ is a shape parameter. Many works have since been published on the SN distribution. To name a few, Azzalini [2], Henze [3], Chiogna [4], Pewsey [5], Arellano-Valle et al. [6], DiCiccio and Monti [7], Salinas et al. [8], Rosco et al. [9], Shafiei and Doostparast [10], Adcock and Azzalini [11], etc.
Mudholkar and Hutson [12] studied the epsilon-skew-normal (ESN) distribution with an asymmetry parameter ε , such that the standard normal distribution is recovered when ε = 0 . Specifically, X has an ESN distribution if its pdf can be written as
g ( x ; ε ) = ϕ x 1 sgn ( x ) ε = ϕ x 1 + ε , if x < 0 , ϕ x 1 ε , if x 0 ,
where sgn ( · ) is the sign function and 1 < ε < 1 . We denote this by X E S N ( ε ) . The properties of this distribution were studied extensively by Mudholkar and Hutson [12]. Arellano-Valle et al. [13] introduced a general family of epsilon-skew-symmetric (ESS) distributions, of which the ESN distribution is a particular case. Some works using distributions of this family are as follows: Hansen [14] applied the epsilon-skew-t distribution to economic data; Gómez et al. [15] applied it to mining data; recently, Celis et al. [16] introduced an epsilon-positive family of distributions based on the ESS family and applied it to data with or without censoring and Bevilacqua et al. [17] used the ESS family for modelling atypical data in special statistics.
The SN and ESN distributions are unimodal, meaning that they are not appropriate in fields such as economics, health, engineering and many others where the data are often bimodal. One of the classic distributions used for modelling bimodal data is the mixture of the normal (MN) distribution, but it was criticised for identifiability problems. See, for example, McLachlan and Peel [18] and Marin et al. [19]. Despite the many innovations introduced in this area, the problem still persists in many cases, which is a disadvantage when working with models of this type. Hence researchers continue to develop new symmetric and asymmetric bimodal distributions.
Bimodal distributions have been obtained from this skew-symmetric model. For example, Azzalini and Capitanio [20], Ma and Genton [21], Arellano-Valle et al. [13], Kim [22], Elal-Olivero et al. [23], Arnold [24], Gómez et al. [25], Hassan and El-Bassiouni [26], da Silva et al. [27], Cordeiro et al. [28], da Braga et al. [29], Altun et al. [30] and Alizadeh et al. [31], among others. For further information on the results of SN distribution and related families, see Azzalini’s book [32].
Models of this type were studied by Kim [22], who introduced a bimodal extension of the SN model, named “two-pieces skew-normal model (TN)”, denoted by T N ( λ ) , whose pdf can be written as
g ( z ; λ ) = c λ ϕ ( z ) Φ ( λ | z | ) , z R ,
where λ R and c λ = 2 π / ( π + 2 arctan ( λ ) ) . For λ > 0 , Kim [22] discusses that (2) defines a bimodal and symmetric around zero pdf.
An asymmetric extension of Kim’s model was presented by Arnold [24], who developed an asymmetric bimodal model named “the extended two-pieces skew-normal model (ETN)”, with pdf given by
h ( z ; λ , β ) = 2 c λ ϕ ( z ) Φ ( λ | z | ) Φ ( β z ) , z R ,
where λ , β R and c λ is a normalizing constant. The distribution is denoted by ETN( λ , β ).
Another model widely applied in these situations is the MN distribution, which is given by:
f ( x ; μ 1 , σ 1 , μ 2 , σ 2 , p ) = p ϕ ( x ; μ 1 , σ 1 ) + ( 1 p ) ϕ ( x ; μ 2 , σ 2 ) , x R ,
where ϕ ( · ; μ , σ ) denotes the pdf of the normal distribution with parameters mean μ R and standard deviation σ R + and 0 < p < 1 .
This article is organised as follows: in Section 2, we give the pdf of the new distribution, its basic properties and moments; in Section 3, we make an inference by the maximum likelihood (ML) method, we calculate the information matrix and we carry out a simulation study to assess the properties of the ML estimators in finite samples; in Section 4, we show three fits to real data sets and compare them with other distributions, and in Section 5, we discuss some conclusions.

2. The New Density

In this section, we introduce the bimodal ESN (BESN) distribution, an extension of the ESN distribution and study its basic properties.
Definition 1. 
A random variable X has a BESN distribution if its pdf is given by
h ( x ; ε , α ) = 1 + α x 2 1 + ( 1 + 3 ε 2 ) α ϕ x 1 s g n ( x ) ε , x R ,
where | ε | < 1 and α 0 . We denote this by X B E S N ( ε , α ) .
Remark 1. 
Note that the BESN model contains the standard normal model when ε = α = 0 , while for α = 0 , we obtain the ESN model. When ε = 0 , we obtain a symmetric bimodal model considered in Elal-Olivero et al. [23]. Thus this model is a new alternative for modelling both symmetric and asymmetric bimodal data.
Proposition 1. 
If X B E S N ( ε , α ) , | ε | < 1 and α 0 , then its cdf is given by:
H ( x ; ε , α ) = ( 1 + ε ) + α ( 1 + ε ) 3 1 + ( 1 + 3 ε 2 ) α Φ x 1 + ε α x ( 1 + ε ) 2 1 + ( 1 + 3 ε 2 ) α ϕ x 1 + ε , x < 0 , ( 1 + ε ) + α ( 1 + ε ) 3 2 1 + ( 1 + 3 ε 2 ) α + 1 ε + α ( 1 ε ) 3 1 + ( 1 + 3 ε 2 ) α Φ ( x 1 ε 1 2 ] α x ( 1 ε ) 2 1 + ( 1 + 3 ε 2 ) α ϕ x 1 ε , x 0 .
Remark 2. 
As expected, if X B E S N ( ε , α ) , | ε | < 1 , α 0 , then
lim x H ( x ; ε , α ) = ( 1 + ε ) + α ( 1 + ε ) 3 2 1 + ( 1 + 3 ε 2 ) α + ( 1 ε ) + α ( 1 ε ) 3 2 1 + ( 1 + 3 ε 2 ) α = 1 .
As can be seen from Figure 1, the shape of the pdf in relation to the unimodality and bimodality depends on parameters ε and α .

2.1. Properties

Proposition 2. 
Let X B E S N ( ε , α ) . Then
(i) 
geometrically, h ( x ; ε , α ) is a reflection of the ordinates related to the change in the sign of parameter ε. This is, h ( x ; ε , α ) = h ( x ; ε , α ) ;
(ii) 
if ε = 0 then h ( x ; α ) = 1 + α x 2 1 + α ϕ x , which is symmetric bimodal when α > 1 / 2 and symmetric unimodal if α 1 / 2 .
Proof. 
Using the density given in (4), the results are obtained immediately.    □
Proposition 3. 
Let X B E S N ( ε , α ) . Then, the pdf for X is
1. 
bimodal when α > 1 2 ( 1 | ε | ) 2 and its modes are located in
x 1 = 2 α ( 1 + ε ) 2 1 α a n d x 2 = 2 α ( 1 ε ) 2 1 α .
2. 
unimodal when ε ( 0 , 1 ) and
(a) 
1 2 ( 1 + ε ) 2 < α < 1 2 ( 1 ε ) 2 , and its mode is located in x = 2 α ( 1 + ε ) 2 1 α .
(b) 
0 α 1 2 ( 1 + ε ) 2 , and its mode is located in x = 0 .
3. 
unimodal when ε ( 1 , 0 ) and
(a) 
1 2 ( 1 ε ) 2 < α < 1 2 ( 1 + ε ) 2 , and its mode is located in x = 2 α ( 1 ε ) 2 1 α .
(b) 
0 α 1 2 ( 1 ε ) 2 , and its mode is located in x = 0 .
Proof. 
We calculate the first derivative of the pdf given in (4) and equal it to zero to find its critical values. That is,
h ( x ; ε , α ) = x 2 α 1 + α x 2 1 ( 1 sgn ( x ) ε ) 2 h ( x ; ε , α ) = 0 .
Consequently, x = 0 or 2 α 4 α ε sgn ( x ) + 2 α ε 2 1 α x 2 = 0 . Without loss of generality, let us assume in the latter equation that x > 0 , hence we get the equation
2 α ( 1 ε ) 2 1 α x 2 = 0 ,
then on solving it, we obtain that x 2 = 2 α ( 1 ε ) 2 1 α whenever α > 1 2 ( 1 ε ) 2 . Now, assuming that x < 0 , we obtain the equation
2 α ( 1 + ε ) 2 1 α x 2 = 0 ,
then, as before, it follows that when α > 1 2 ( 1 + ε ) 2 , then x 1 = 2 α ( 1 + ε ) 2 1 α , and so considering that 1 < ε < 1 the proof of part 1 is concluded.
Considering the range of values of ε and α in 2(a), we can deduce that Equation (5) has no solution and that Equation (6) has a solution equal to ( 2 α ( 1 + ε ) 2 1 ) / α . Then, using the criterion of the second derivative for h, we have
h 2 α ( 1 + ε ) 2 1 α = 2 α 1 ( 2 α ( 1 + ε ) 2 1 ) ( 1 + 2 α ( 1 + ε ) 2 1 ) 2 1 2 ( 1 + ε ) 2 h 2 α ( 1 + ε ) 2 1 α = 1 ( 1 + ε ) 2 1 α ( 1 + ε ) 2 2 h 2 α ( 1 + ε ) 2 1 α < 0 .
Therefore, at 2 α ( 1 + ε ) 2 1 α is attained a maximum of h. In consequence, it corresponds to their mode, and thus, proof 2(a) follows. On the other hand, in case 2(b), their hypotheses imply that Equations (5) and (6) have no solutions. Hence, the unique critical point of h is zero. Consequently, its mode is located at x = 0 .
Proceeding analogously to the proof of 2, the proof of part 3 follows, and thus the proof is completed.    □

2.2. Moments

This subsection is devoted to presenting the r-th moment of the B E S N ( ε , α ) model and its moment-generating function (mgf).
Proposition 4. 
If X B E S N ( ε , α ) . Then for r = 1 , 2 , , the r-th distributional moment is given by
E ( X r ) = 2 r / 2 Γ r + 1 2 π ( 1 + α + 3 ε 2 α ) C ε ( r + 1 ) + α ( r + 1 ) C ε ( r + 3 ) ,
where | ε | < 1 , α 0 and C ε ( k ) = ( 1 ε ) k + ( 1 ) k 1 ( 1 + ε ) k 2 .
Proof. 
Let Y E S N ( ε ) , then
μ r = E ( X r ) = E ( Y r ) + α E ( Y r + 2 ) 1 + α + 3 ε 2 α .
Hence, the result is obtained using the moments of the ESN model (see [12]).   □
Corollary 1. 
Let X B E S N ( ε , α ) . Then
E ( X ) = 2 π 2 ε ( 4 α ε 2 + 4 α + 1 ) 1 + ( 1 + 3 ε 2 ) α , E ( X 2 ) = 2 π ( 3.76 α + 3.76 ε 2 + 37.6 α ε 2 + 1.88 α ε 4 + 1.2533 ) 1 + ( 1 + 3 ε 2 ) α , E ( X 3 ) = 2 π 4 ( 3 α + ( 10 α + 2 ) ε 2 + 3 α ε 4 + 2 ) 1 + ( 1 + 3 ε 2 ) α , E ( X 4 ) = 3 ( 5 α + 5 ( 21 α + 2 ) ε 2 + 5 ( 35 α + 1 ) ε 4 + 35 α ε 6 + 1 ) 1 + ( 1 + 3 ε 2 ) α .
The skewness ( β 1 ) and kurtosis ( β 2 ) coefficients can be computed as
β 1 = μ 3 3 μ 1 μ 2 + 2 μ 1 3 μ 2 μ 1 2 3 / 2 a n d β 2 = μ 4 4 μ 1 μ 3 + 6 μ 1 2 μ 2 3 μ 1 4 μ 2 μ 1 2 2 ,
Figure 2 shows these coefficients for the BESN ( ε , α ) model in terms of ε and α .
Finally, in the following proposition, we present the mgf for the BESN distribution.
Proposition 5. 
The mgf for the X BESN ( ε , α ) distribution is given by
M X ( t ) = ( 1 + ε ) exp ( t 2 2 ( 1 + ε ) 2 ) 1 + ( 1 + 3 ε 2 ) α Φ ( t ( 1 + ε ) ) + ( 1 + ε ) 2 1 + t 2 ( 1 + ε ) 2 Φ ( t ( 1 + ε ) ) + t ( 1 + ε ) ϕ ( t ( 1 + ε ) ) + ( 1 ε ) exp ( t 2 2 ( 1 ε ) 2 ) 1 + ( 1 + 3 ε 2 ) α Φ ( t ( 1 ε ) ) + ( 1 ε ) 2 1 + t 2 ( 1 ε ) 2 Φ ( t ( 1 ε ) ) t ( 1 ε ) ϕ ( t ( 1 ε ) ) .
Proof. 
Using the definition M X ( t ) = E ( e t X ) and with a simple algebraic development, it is possible to obtain the result.    □

3. Inference

In this section, we apply the ML method to estimate the parameters. We calculate Fisher’s information matrix, and we carry out a simulation study.

3.1. ML Estimation

For a random sample x = ( x 1 , x 2 , , x n ) of size n from the BESN distribution, the log-likelihood function for θ = ( ε , α ) is given by
( θ ; x ) = i = 1 n log ( 1 + α x 2 ) i = 1 n log ( 1 + ( 1 + 3 ε 2 ) α ) 1 2 i = i n x i 2 ( 1 sgn ( x i ) ε ) 2 .
The score function is given by U ( θ ) = ( U ( ϵ ) , U ( α ) ) , where
U ( ε ) = 6 α ε n 1 + ( 1 + 3 ε 2 ) α i = 1 n x i 2 sgn ( x i ) ( 1 + sgn ( x i ) ε ) ( 1 sgn ( x i ) ε ) 3 , U ( α ) = i = 1 n x i 1 + α x i 2 n 1 + 3 ε 2 1 + ( 1 + 3 ε 2 ) α .
The ML estimators of α and ε are the solution of the system of equations U ( θ ) = 0 2 , where 0 p defines a vector of zeros with dimension p. The elements of the Hessian matrix, defined as the derivate of the score function with respect to each parameter, can be expressed as
U ε α = 6 n ε ( 1 + ( 1 + 3 ε 2 ) α ) 2 , U ε ε = 6 ε 2 ( 3 ε 2 1 ) 6 α ( 1 + ( 1 + 3 ε 2 ) α ) 2 i = 1 n 3 x i 2 ( 1 sgn ( x i ) ε ) 4 , U α α = i = 1 n x i 4 ( 1 + α x i 2 ) 2 n ( 1 + 3 ε 2 ) 2 ( 1 + ( 1 + 3 ε 2 ) α ) 2 .

3.2. Fisher’s Information Matrix

The elements of the Fisher’s information matrix are given by
I α α = 1 α 2 ( 1 + ( 1 + 3 ε 2 ) α ) 1 + 1.2533 α ( 3 + α 2 ) 2 π + I 1 + I 2 ( 1 + 3 ε 2 ) 2 [ ( 1 + ( 1 + 3 ε 2 ) α ) ] 2 I α ε = 6 ε ( 1 + ( 1 + 3 ε 2 ) α ) 2 , I ε ε = 6 α [ 1 α ( 3 ε 2 1 ) ] [ 1 + ( 1 + 3 ε 2 ) α ] 2 + 3 1 + ( 1 + 3 ε 2 ) α 3 α + 0.62666 2 π 2 1 ε 2 ,
where I 1 = 0 1 1 + α x 2 ϕ x 1 + ε d x = 1 2 π 2 α 1 Φ 1 ( 1 + ε ) 2 α exp 1 2 α ( 1 + ε ) 2 and I 2 = 0 1 1 + α x 2 ϕ x 1 ε d x = 1 2 2 α exp 1 2 α ( 1 ε ) 2 Γ 1 2 , 1 2 α ( 1 ε ) 2 . For more details of these integral formulas, we refer the reader to Gradshteyn and Ryzhik [33].

3.3. Location-Scale Case

For Z B E S N ( ε , α ) , the extension to the location-scale case of the model follows from the transformation X = μ + σ Z , where μ R and σ R + . The pdf for X is given by
f ( x ; μ , σ , ε , α ) = 1 + α z 2 σ ( 1 + ( 1 + 3 ε 2 ) α ) ϕ z 1 sgn ( x μ ) ε , x R ,
where z = x μ σ . We denote this by X B E S N ( μ , σ , ε , α ) . Therefore, for a random sample x 1 , x 2 , , x n of the random variable X B E S N ( μ , σ , ε , α ) , the log-likelihood function for θ = ( μ , σ , ε , α ) is given by
( θ ; x ) = i = 1 n log ( 1 + α z i 2 ) n log σ [ 1 + ( 1 + 3 ε 2 ) α ] 1 2 i = 1 n z i 2 ( 1 sgn ( x i μ ) ε ) 2 ,
where z i = x i μ σ and the score function can be expressed as
U ( μ ) = 2 α σ i = 1 n z i 1 + α z i 2 + 1 σ i = 1 n z i [ 1 sgn ( x i ) ε ] 2 , U ( σ ) = n σ 2 α σ i = 1 n z i 2 1 + α z i 2 + 1 σ i = 1 n z i 2 [ 1 sgn ( x i ) ε ] 2 , U ( ε ) = 6 n α ε 1 + ( 1 + 3 ε 2 ) α i = 1 n z i 2 sgn ( x i ) ( 1 sgn ( x i ) ε ) 3 , U ( α ) = i = 1 n z i 2 1 + α z i 2 n 1 + 3 ε 2 1 + ( 1 + 3 ε 2 ) α .
The maximum likelihood estimator for θ , say θ ^ , satisfies the asymptotic distribution
n ( θ ^ θ ) L N 4 ( 0 4 , I F ( θ ) 1 ) , a s   n + ,
where I F ( θ ) denotes the Fisher’s information matrix. See Appendix A for details about this matrix. In other words, θ ^ is asymptotically normally distributed and consistent.

3.4. Simulation Study

In this subsection, we present a simulation study in order to assess the performance of the ML estimator in finite samples. To do this, Table 1, Table 2 and Table 3 consider three values for ε : −0.90, −0.25 and 0.75; three values for α : 0.5, 1.5 and 3; two values for μ : −3 and 0; two values for σ : 1 and 5. Previous studies (not presented here) suggest that with smaller sample sizes (for example, n = 20 and n = 50 ), the estimators do not give good results in terms of bias; we, therefore, considered the following sample sizes: 100, 200 and 300. Values for the BESN distribution can be drawn using the Metropolis–Hastings algorithm. We use a burn-in period of 1000 and a thin of 20 in order to avoid a possible correlation between successive values. For each combination of ε , α , μ , σ and n (totaling 108 combinations), we draw 1000 samples of size n from the BESN ( ε , α , μ , σ ) model, and for each, we compute the corresponding ML estimators. We report the bias of the 1000 samples, the mean of the estimated standard errors (SE) and the root of the estimated mean squared error (RMSE). In general terms, we note that the bias and the RMSE of the estimators decrease to zero when the sample size is increased, suggesting that the estimators are consistent even in finite samples. In addition, the SE and RMSE terms are closer when the sample size is increased, suggesting that the variance of the estimators is well estimated. However, as mentioned above, we suggest a sample size of at least approximately 100 data to ensure good properties of the maximum likelihood estimators.

4. Applications

In this section, we fit the BESN distribution to three real data sets that are widely used in the literature, namely the roller, birthweight and nickel data sets. The first application is to a unimodal data set and is compared with the fit of the normal (N) distribution; the second application is to a symmetric bimodal data set and is compared with the fits of the N and TN distributions; the third application is to an asymmetric bimodal data set and is compared with the fits of the SN, ETN and MN distributions. To compare the models, we use the Akaike information criterion AIC (see Akaike [34]) and the Bayesian information criterion BIC (see Schwarz [35]). Traditionally the preferred model is the one with the smallest AIC and/or BIC.

4.1. First Application: Roller Data

In this first application, we use the data set related to 1150 heights measured at 1-micron intervals along the drum of a roller (i.e., parallel to the axis of the roller). This was part of an extensive study of the surface roughness of the roller. It is available for downloading at http://lib.stat.cmu.edu/jasadata/laslett (accessed on 5 November 2022). Summary statistics for the data set are presented in Table 4.
Given the values of sample asymmetry, b 1 , and sample kurtosis, b 2 , there is strong evidence that an asymmetric model could provide a better fit to the data under study. Therefore, the N and BESN distributions are fitted to the data set.
The ML estimates for each model (N and BEST) and standard errors (SE) in parentheses are: μ ^ = 3.535 ( 0.019 ) and σ ^ = 0.650 ( 0.014 ) with AIC = 2275.732 and BIC = 2285.827 for the N distribution and μ ^ = 4.944 ( 0.034 ) , σ ^ = 0.495 ( 0.011 ) ,   ε ^ = 0.833 ( 0.0348 ) and α ^ = 7.871 ( 2.258 ) with AIC = 2181.560 , and BIC = 2201.75 for the BESN distribution.
According to AIC and BIC, the BESN distribution provides a better fit for the roller data set than the N distribution. In other words, the BESN distribution achieves a satisfactory fit for skewness and kurtosis, which is not adequately fitted by the N distribution. Therefore, the BESN distribution presents the best fit for the roller data set. A qq-plot for the variable roller, using normal and BESN distributions, is shown in Figure 3a,b.
Figure 3c shows the empirical cdf for the variable roller (solid line), while the dotted line corresponds to the cdf for the BESN model. The results suggest a better fit for the BESN model.
In addition, we also compare the N and BESN distributions with a hypothesis test. Specifically, we propose the hypothesis
H 01 : ( ε , α ) = ( 0 , 0 ) versus H 11 : ( ε , α ) ( 0 , 0 ) ,
which can be tested using the statistic
Λ 1 = N ( μ ^ , σ ^ ) B E S N ( μ ^ , σ ^ , ε ^ , α ^ ) .
After numerical evaluations, we obtain 2 log ( Λ 1 ) = 96.172 , which is greater than the critical 5% chi-squared value with two degrees of freedom, namely χ 2.95 % 2 = 5.99 .

4.2. Second Application: Birthweight Data

In the second application, we study the fit of the BESN model to 500 units observed for the variable Z=b.weight, which is the ultrasound weight (birthweight in grams). These data are available as supplementary material. The summary statistics for the data set are presented in Table 5.
Given the symmetry of these data, we propose to fit a BESN model taking ε = 0 and then compare this with the fit of the N and TN models.
We used AIC and BIC to compare the fits of the N, TN and BESN models. According to these criteria (see Table 6 and Figure 4a), the BESN model is seen to present a better fit than the N and TN models.
Figure 4b,c show the qq-plot for the BESN model and the empirical cdf. The plots suggest a better fit for the BESN model than its competitors.

4.3. Third Application: Nickel Data

In this third application, we use a data set related to the logarithm of the nickel content in soil samples analysed at the Mines Department of Universidad de Atacama, Chile, which are available as supplementary material.
In this section, we compare the fits of the SN, BESN, ETN and MN models to the above data set.
The pdf for the MN model can be written as
f ( x ; ξ 1 , σ 1 , ξ 2 , σ 2 , p ) = p σ 1 ϕ ( x , ξ 1 , σ 1 ) + 1 p σ 2 ϕ ( x ; ξ 2 , σ 2 ) , x R ,
with parameters ( ξ j , σ j ) R × R + , j = 1 , 2 and 0 < p < 1 , we denote as MN ( ξ 1 , σ 1 , ξ 2 , σ 2 , p ) .
The comparisons are made using the AIC and BIC for the variable Y, the logarithm of the nickel concentration. In all cases, the parameters are estimated by ML using bbmle in the R software package [36]. The SE of the ML estimates is calculated using the observed information matrix corresponding to each model.
Table 7 gives the estimated parameters and AIC and BIC for the SN, ETN, BESN and MN models. The respective SE are in parentheses. The graph in Figure 5 shows that the BESN model presents quite a good fit.
In all cases, the models are augmented by the inclusion of location ( ξ ) and scale ( σ ) parameters. Since the models are not nested, the AIC and BIC have been used to compare the distributions. According to these criteria, the BESN model provides the best fit to the data of the example. Hence, the BESN model seems to be a useful alternative for modelling the data for the logarithm of the nickel concentration.
The conclusion of the study is that the BESN model appears to be more appropriate for the particular data sets analysed here. These points are illustrated in more detail in Figure 5, where the histograms and the fitted curves for the data sets are displayed.

5. Final Comments

In this paper, we propose the BESN distribution, which is shown to have flexible modes. We study its properties and implement ML estimation. Some other characteristics of the BESN distribution are:
  • The BESN distribution contains, as special cases, the N and ESN distributions.
  • The BESN model is very flexible in its modes, as we discuss in Figure 1 and Figure 2; it can be used to fit both unimodal data (application 1), and bimodal data (applications 2 and 3).
  • The BESN distribution has a closed expression for its cdf.
  • The moments of the BESN distribution have a closed expression.
  • The three applications show that the BESN distribution provides a better fit than the other models tested.
Given the promising results, further research for the model can be addressed by applying the model in other areas. For instance, given the simplicity of the mean and the cdf for the BESN distribution, a reparametrization of the model in terms of the mean or the quantile should be considered in order to propose a new mean or quantile regression model based on this distribution.

Author Contributions

Conceptualization, J.D. and H.W.G.; methodology, G.M.-F., D.I.G. and H.W.G.; software, D.I.G.; validation, D.I.G. and H.W.G.; formal analysis, G.M.-F., D.I.G., O.V. and H.W.G.; investigation, J.D. and O.V.; writing—original draft preparation, J.D., D.I.G. and H.W.G.; writing—review and editing, D.I.G., O.V. and H.W.G.; funding acquisition, O.V. and H.W.G. All authors have read and agreed to the published version of the manuscript.

Funding

This work was partially performed by Héctor W. Gómez during a visit to the Universidad Católica de Temuco, supported by MINEDUC-UA project, code ANT 1999.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data set of the first application is available at http://lib.stat.cmu.edu/jasadata/laslett (accessed on 5 November 2022), and those of the second and third applications are available as supplementary material.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A

In this Appendix, we present the elements of the Hessian matrix and the Fisher’s Information matrix for the location-scale case of the BESN distribution.

Appendix A.1. Elements of the Hessian Matrix

U μ μ = 4 α σ 2 i = 1 n 1 [ 1 + α z i 2 ] 2 + 2 α σ 2 i = 1 n 1 1 + α z i 2 1 σ 2 i = 1 n 1 [ 1 sgn ( x i μ ) ε ] 2 , U μ σ = 4 α σ 2 i = 1 n z i [ 1 + α z i 2 ] 2 2 σ 2 i = 1 n z i [ 1 sgn ( x i μ ) ε ] 2 , U μ ε = 2 σ i = 1 n sgn ( x i μ ) z i [ 1 sgn ( x i μ ) ε ] 3 , U μ α = 2 σ i = 1 n z i [ 1 + α z i 2 ] 2 , U σ σ = n σ 2 + 2 α σ 2 i = 1 n 3 z i 2 + α z i 4 [ 1 + α z i 2 ] 2 3 σ 2 i = 1 n z i 2 [ 1 sgn ( x i μ ) ε ] 2 , U σ ε = 2 σ i = 1 n sgn ( x i μ ) z i 2 [ 1 sgn ( x i μ ) ε ] 3 , U σ α = 2 σ i = 1 n z i 2 [ 1 + α z i 2 ] 2 , U ε ε = 6 n α [ α ( 3 ε 2 1 ) ] [ 1 + ( 1 + 3 ε 2 ) α ] 2 3 i = 1 n z i 2 ( 1 sgn ( x i μ ) ε ) 4 , U ε α = 6 n ε [ 1 + ( 1 + 3 ε 2 ) α ] 2 , U α α = i = 1 n z i 4 [ 1 + α z i 2 ] 2 + n ( 1 + 3 ε 2 ) 2 [ 1 + ( 1 + 3 ε 2 ) α ] 2 .

Appendix A.2. Elements of the Fisher’s Information Matrix

After some intense calculation, we have that the elements of the Fisher’s information matrix, say E ( H θ ) , are given by
E ( U μ μ ) = 2 n α A σ 2 1 2 ( 1 + ε ) 1 Φ μ σ ( 1 + ε ) + 1 2 ( 1 ε ) Φ μ σ ( 1 ε ) n μ α A σ 3 ϕ μ σ ( 1 + ε ) ϕ μ σ ( 1 ε ) + 4 n α A σ 2 ( 1 + ε ) I 3 + ( 1 ε ) I 4 n A σ 2 1 1 + ε 1 Φ μ σ ( 1 + ε ) + 1 1 ε Φ μ σ ( 1 ε ) ,
E ( U μ σ ) = 2 n ( 1 + ε ) A σ 2 ϕ μ σ ( 1 + ε ) σ 2 + α μ 2 σ 2 ( 1 + ε ) + 2 α ( 1 + ε ) 2 n ( 1 ε ) A σ 2 ϕ μ σ ( 1 ε ) σ 2 + α μ 2 σ 2 ( 1 ε ) + 2 α ( 1 ε ) + 2 n A 2 π σ 2 e 1 2 α ( 1 + ε ) 2 E i σ 2 + α μ 2 2 α σ 2 ( 1 + ε ) + e 1 2 α ( 1 ε ) 2 E i σ 2 + α μ 2 2 α σ 2 ( 1 ε ) ,
E ( U μ ε ) = 2 n A σ 2 ϕ μ σ ( 1 + ε ) σ 2 + α μ 2 σ 2 ( 1 + ε ) + 2 α ( 1 + ε ) + 2 n A σ 2 ϕ μ σ ( 1 ε ) σ 2 + α μ 2 σ 2 ( 1 ε ) + 2 α ( 1 ε ) ,
E ( U μ α ) = n A 2 π α σ e 1 2 α ( 1 + ε ) 2 E i σ 2 + α μ 2 2 α σ 2 ( 1 + ε ) + e 1 2 α ( 1 ε ) 2 E i σ 2 + α μ 2 2 α σ 2 ( 1 ε ) ,
E ( U σ σ ) = 4 n A σ 2 ( 1 + ε ) 1 Φ μ σ ( 1 + ε ) + ( 1 ε ) Φ μ σ ( 1 ε ) 4 n A σ 2 ( 1 + ε ) I 3 + ( 1 ε ) I 4 n A σ 2 ( 1 + ε ) ( 7 α ( 1 + ε ) 2 + 3 ) 1 Φ μ σ ( 1 + ε ) + μ σ ( 1 + ε ) ϕ μ σ ( 1 + ε ) n A σ 2 ( 1 ε ) ( 7 α ( 1 ε ) 2 + 3 ) Φ μ σ ( 1 ε ) μ σ ( 1 ε ) ϕ μ σ ( 1 ε ) + n σ 2 3 n α μ 3 A σ 5 ϕ μ σ ( 1 + ε ) ϕ μ σ ( 1 ε ) ,
E ( U σ ε ) = 2 n μ A σ 2 ϕ μ σ ( 1 + ε ) σ 2 + α μ 2 σ 2 ( 1 + ε ) + 3 α ( 1 + ε ) 2 n μ A σ 2 ϕ μ σ ( 1 ε ) σ 2 + α μ 2 σ 2 ( 1 ε ) + 3 α ( 1 ε ) 2 n A σ 1 + 3 α ( 1 + ε ) 2 1 Φ μ σ ( 1 + ε ) + 2 n A σ 1 + 3 α ( 1 ε ) 2 Φ μ σ ( 1 ε ) ,
E ( U σ α ) = 2 n A α σ ( 1 + ε ) 1 Φ μ σ ( 1 + ε ) I 3 + ( 1 ε ) Φ μ σ ( 1 + ε ) I 4 ,
E ( U ε ε ) = 6 n α ( 1 + α ( 1 3 ε 2 ) ) [ 1 + ( 1 + 3 ε 2 ) α ] 2 3 n A 1 + ε μ σ ϕ μ σ ( 1 + ε ) σ 2 + α μ 2 σ 2 ( 1 + ε ) + 3 α ( 1 + ε ) + 1 + 3 α ( 1 + ε ) 2 Φ μ σ ( 1 + ε ) 3 n A 1 ε μ σ ϕ μ σ ( 1 ε ) σ 2 + α μ 2 σ 2 ( 1 ε ) + 3 α ( 1 ε ) + 1 + 3 α ( 1 ε ) 2 Φ μ σ ( 1 ε ) ,
E ( U ε α ) = 6 n ε [ 1 + ( 1 + 3 ε 2 ) α ] 2 , and
E ( U α α ) = n A α [ ( 1 + ε ) 3 [ 1 Φ μ σ ( 1 + ε ) + μ σ ( 1 + ε ) ϕ μ σ ( 1 + ε ) ] ( 1 ε ) 3 [ Φ μ σ ( 1 ε ) μ σ ( 1 ε ) ϕ μ σ ( 1 ε ) ] ] + n A α 2 ( 1 + ε ) 1 Φ μ σ ( 1 + ε ) I 3 + ( 1 ε ) Φ μ σ ( 1 + ε ) I 4 + n ( 1 + 3 ε 2 ) 2 [ 1 + ( 1 + 3 ε 2 ) α ] 2 ,
where A = 1 1 + α ( 1 + 3 ε 2 ) , I 3 = μ σ ( 1 + ε ) ϕ ( u ) 1 + α ( 1 + ε ) 2 u 2 d u , I 4 = μ σ ( 1 ε ) ϕ ( u ) 1 + α ( 1 ε ) 2 u 2 d u , and E i ( · ) is the exponential integral defined by E i ( x ) : = x e t t d t ; these integrals can be computed numerically by someone’s mathematical software.

References

  1. Azzalini, A. A class of distributions which includes the normal ones. Scand. J. Stat. 1985, 12, 171–178. [Google Scholar]
  2. Azzalini, A. Further results on a class of distributions which includes the normal ones. Statistica 1986, 46, 199–208. [Google Scholar]
  3. Henze, N. A probabilistic representation of the Skew-Normal distribution. Scand. J. Stat. 1986, 4, 271–275. [Google Scholar]
  4. Chiogna, M. Some results on the scalar skew-normal distribution. J. Ital. Statist. Soc. 1998, 1, 1–13. [Google Scholar] [CrossRef]
  5. Pewsey, A. Problems of inference for Azzalini’s skew-normal distribution. J. Appl. Stat. 2000, 27, 859–870. [Google Scholar] [CrossRef]
  6. Arellano-Valle, R.B.; Gómez, H.W.; Quintana, F.A. A New Class of Skew-Normal Distributions. Commun. Stat. Theory Methods 2004, 33, 1465–1480. [Google Scholar] [CrossRef]
  7. DiCiccio, T.J.; Monti, A.C. Inferential aspects of the skew exponential power distribution. J. Am. Stat. Assoc. 2004, 99, 439–450. [Google Scholar] [CrossRef]
  8. Salinas, H.S.; Arellano-Valle, B.R.; Gómez, H.W. The extended skew-exponential power distribution and its derivation. Commun. Stat. Theory Methods 2007, 36, 1673–1689. [Google Scholar] [CrossRef]
  9. Rosco, J.F.; Jones, M.C.; Pewsey, A. Skew t distributions via the sinh-arcsinh transformation. Test 2011, 20, 630–652. [Google Scholar] [CrossRef]
  10. Shafiei, S.; Doostparast, M. Balakrishnan skew-t distribution and associated statistical characteristics. Commun. Stat. Theory Methods 2014, 43, 4109–4122. [Google Scholar] [CrossRef]
  11. Adcock, C.; Azzalini, A. A selective overview of skew-elliptical and related distributions and of their applications. Symmetry 2020, 12, 118. [Google Scholar] [CrossRef] [Green Version]
  12. Mudholkar, G.S.; Hutson, A.D. The epsilon-skew-normal distribution for analyzing near-normal data. J. Statist. Plann. Inference 2000, 83, 291–309. [Google Scholar] [CrossRef]
  13. Arellano-Valle, R.B.; Gómez, H.W.; Quintana, F.A. Statistical inference for a general class of asymmetric distributions. J. Statist. Plann. Inference 2005, 128, 427–443. [Google Scholar] [CrossRef]
  14. Hansen, B.E. Autoregressive conditional density estimation. Int. Econ. Rev. 1994, 35, 705–730. [Google Scholar] [CrossRef]
  15. Gómez, H.W.; Torres, F.J.; Bolfarine, H. Large-Sample Inference for the Epsilon-Skew-t Distribution. Commun. Stat. Theory Methods 2007, 36, 73–81. [Google Scholar] [CrossRef]
  16. Celis, P.; de la Cruz, R.; Fuentes, C.; Gómez, H.W. Survival and Reliability Analysis with an Epsilon-Positive Family of Distributions with Applications. Symmetry 2021, 13, 908. [Google Scholar] [CrossRef]
  17. Bevilacqua, M.; Caamaño-Carrillo, C.; Arellano-Valle, R.B.; Gómez, C. A class of random fields with two-piece marginal distributions for modeling point-referenced data with spatial outliers. Test 2022, 31, 644–674. [Google Scholar] [CrossRef]
  18. McLachlan, G.; Peel, D. Mixture Models: Inference and Applications to Clustering; Marcel Dekker: New York, NY, USA, 2000. [Google Scholar]
  19. Marin, J.M.; Mengersen, K.; Robert, C. Bayesian modeling and inference on mixtures of distributions. Handb. Stat. 2005, 25, 459–503. [Google Scholar]
  20. Azzalini, A.; Capitanio, A. Distributions generate by perturbation of symmetry with emphasis on a multivariate skew-t distribution. J. R. Stat. Soc. Ser. B 2003, 65, 367–389. [Google Scholar] [CrossRef]
  21. Ma, Y.; Genton, M.G. Flexible class of skew-symmetric distributions. Scand. J Stat. 2004, 31, 459–468. [Google Scholar] [CrossRef] [Green Version]
  22. Kim, H.J. On a class of two-piece skew-normal distributions. Statistics 2005, 39, 537–553. [Google Scholar] [CrossRef]
  23. Elal-Olivero, D.; Gómez, H.W.; Quintana, F.A. Bayesian Modeling using a class of Bimodal skew-Elliptical distributions. J. Statist. Plan. Inference 2009, 139, 1484–1492. [Google Scholar] [CrossRef]
  24. Arnold, B.C.; Gómez, H.W.; Salinas, H.S. On multiple constraint skewed models. Statistics 2009, 43, 279–293. [Google Scholar] [CrossRef]
  25. Gómez, H.W.; Elal-Olivero, D.; Salinas, H.S.; Bolfarine, H. Bimodal extension based on the skew-normal distribution with application to pollen data. Environmetrics 2011, 22, 50–62. [Google Scholar] [CrossRef]
  26. Hassan, M.Y.; El-Bassiouni, M.Y. Bimodal Skew-Symmetric Normal Distribution. Commun. Stat. Theory Methods 2016, 45, 1527–1541. [Google Scholar] [CrossRef]
  27. Da Silva Braga, A.; Cordeiro, G.M.; Ortega, E.M.M. A new skew-bimodal distribution with applications. Commun. Stat. Theory Methods 2018, 47, 2950–2968. [Google Scholar] [CrossRef]
  28. Cordeiro, G.M.; Alizadeh, M.; Ozel, G.; Hosseini, B.; Ortega, E.M.M.; Altun, E. The generalized odd log-logistic family of distributions: Properties, regression models and applications. J. Stat. Comput. Simul. 2017, 87, 908–932. [Google Scholar] [CrossRef]
  29. Da Braga, A.S.; Cordeiro, G.M.; Ortega, E.M.; da Cruz, J.N. The odd log-logistic normal distribution: Theory and applications in analysis of experiments. J. Stat. Theory Pract. 2016, 10, 311–335. [Google Scholar] [CrossRef]
  30. Altun, E.; Alizadeh, M.; Gamze, O.Z.E.L.; Tatlidil, H.; Maksayi, N. Forecasting Value-At-Risk with Two-Step Method: Garch-Exponentiated Odd Log-Logistic Normal Model. Rom. J. Econ. Forecast. 2017, 20, 97–115. [Google Scholar]
  31. Alizadeh, M.; Afshari, M.; Hosseini, B.; Ramires, T.G. Extended exp-G family of distributions: Properties, applications and simulation. Commun. Stat. Simul. Comput. 2020, 49, 1730–1745. [Google Scholar] [CrossRef]
  32. Azzalini, A. The Skew Normal and Related Families; Cambridge University Press: Cambridge, UK, 2014. [Google Scholar]
  33. Gradshteyn, I.S.; Ryzhik, I.M. Tables of Integrals, Series, and Products, 7th ed.; Academic Press: San Diego, CA, USA, 2007. [Google Scholar]
  34. Akaike, H. A new look at the statistical model identification. IEEE Trans. Automat. Contr. 1974, 19, 716–723. [Google Scholar] [CrossRef]
  35. Schwarz, G. Estimating the dimension of a model. Ann. Statist. 1978, 6, 461–464. [Google Scholar] [CrossRef]
  36. R Core Team. R: A Language and Environment for Statistical Computing; R Foundation for Statistical Computing: Vienna, Austria, 2022; Available online: https://www.R-project.org/ (accessed on 25 December 2022).
Figure 1. Distributions (a) BESN(0.8,0) (solid line), BESN(0.2,0) (dashed line); (b) BESN(0,10) (solid line), BESN(0,2) (dashed line); (c) BESN(0.05,10) (solid line), BESN(0.05,2) (dashed line); and (d) BESN(0.2,4) (solid line), BESN(−0.2,4) (dashed line).
Figure 1. Distributions (a) BESN(0.8,0) (solid line), BESN(0.2,0) (dashed line); (b) BESN(0,10) (solid line), BESN(0,2) (dashed line); (c) BESN(0.05,10) (solid line), BESN(0.05,2) (dashed line); and (d) BESN(0.2,4) (solid line), BESN(−0.2,4) (dashed line).
Mathematics 11 00507 g001aMathematics 11 00507 g001b
Figure 2. Plots for (a) skewness and (b) kurtosis coefficients for the BESN ( ε , α ) model.
Figure 2. Plots for (a) skewness and (b) kurtosis coefficients for the BESN ( ε , α ) model.
Mathematics 11 00507 g002
Figure 3. QQ-plots fitted using (a) the N model, (b) the BESN model, (c) the cdf of the BESN model (dotted line) and empirical cdf (solid line).
Figure 3. QQ-plots fitted using (a) the N model, (b) the BESN model, (c) the cdf of the BESN model (dotted line) and empirical cdf (solid line).
Mathematics 11 00507 g003
Figure 4. (a) Histogram for the variable birthweight. Estimated pdf: N (dotted red line), TN (dashed blue line) and BESN (solid black line). (b) QQ-plot for the BESN distribution. (c) Empirical cdf for the variable birthweight (solid black line) and cdf of the BESN distribution (dotted red line).
Figure 4. (a) Histogram for the variable birthweight. Estimated pdf: N (dotted red line), TN (dashed blue line) and BESN (solid black line). (b) QQ-plot for the BESN distribution. (c) Empirical cdf for the variable birthweight (solid black line) and cdf of the BESN distribution (dotted red line).
Mathematics 11 00507 g004
Figure 5. Fitted densities: (a) SN (dotted red line), MN (dashed blue line) and BESN (solid black line), (b) Empirical cdf (solid black line) and BESN distribution (dashed red line).
Figure 5. Fitted densities: (a) SN (dotted red line), MN (dashed blue line) and BESN (solid black line), (b) Empirical cdf (solid black line) and BESN distribution (dashed red line).
Mathematics 11 00507 g005
Table 1. Recovery parameters for the ML estimators in the BESN model: case ε = 0.90 .
Table 1. Recovery parameters for the ML estimators in the BESN model: case ε = 0.90 .
Real Value n = 100 n = 200 n = 300
ε α μ σ EstimatorBiasSERMSEBiasSERMSEBiasSERMSE
−0.900.5−31 ε 0.05410.09050.10140.02160.06220.07120.01140.05130.0643
α −0.04640.21090.2328−0.02570.15790.1628−0.01590.13120.1345
μ 0.15900.15980.27700.07120.11320.17320.04690.09470.1545
σ 0.02110.07310.07820.00980.04790.04850.00610.03850.0407
5 ε 0.05470.09080.10100.02130.06190.07060.01020.05100.0625
α −0.04480.21600.2556−0.01610.16020.1610−0.01200.13180.1369
μ 0.70440.79751.26970.34870.55930.85950.22030.46360.7437
σ 0.11820.36880.39280.02940.23670.24860.01220.19250.1977
01 ε 0.05350.08980.10030.02650.06360.07350.01230.05210.0614
α −0.04820.21160.2398−0.01340.16170.1678−0.01040.13180.1324
μ 0.16660.15770.27140.08130.11570.18060.04520.09510.1424
σ 0.01510.07400.07940.00770.04790.05240.00470.03830.0418
5 ε 0.05750.09150.10550.02180.06210.07330.01500.05300.0632
α −0.02380.22930.3624−0.02010.15950.1784−0.00450.13440.1420
μ 0.71400.81851.27960.36230.56430.89620.24870.48730.7200
σ 0.09990.36510.38610.04590.24050.26390.00600.19080.1948
1.5−31 ε 0.15280.13150.21610.06730.09010.11820.04570.07210.0983
α 0.64211.80582.3750−0.02880.62970.9154−0.10460.46730.5272
μ 0.37000.24520.52090.18410.16380.30070.13750.13440.2548
σ −0.00590.06110.05860.00030.04190.0402−0.00090.03350.0333
5 ε 0.17240.13380.23030.07490.09520.13050.04570.07390.0968
α 0.89482.00172.48380.00030.66130.9471−0.06910.48880.5838
μ 1.79621.29632.63720.92160.88711.61840.64620.68711.2344
σ 0.01350.30730.29030.01470.21090.19580.00550.16890.1679
01 ε 0.15340.13310.21250.07020.09330.12870.04550.07420.0954
α 0.68721.72922.2382−0.06610.60440.8916−0.11200.46610.5036
μ 0.36690.24540.51000.20360.17050.34540.13430.13900.2413
σ −0.00360.06100.0583−0.00060.04210.0408−0.00070.03360.0324
5 ε 0.17380.13530.23140.07630.09150.13530.04290.07320.0957
α 0.76781.82452.2004−0.04270.59580.7176−0.05670.49960.6068
μ 1.81731.30492.66720.97350.85371.70090.61930.67831.2289
σ 0.01360.30720.27970.00360.20870.19700.00130.16850.1589
3.0−31 ε 0.19610.14350.23140.10530.10710.14570.06930.08940.1061
α 1.49645.33444.22440.39202.35912.88720.11611.63472.1301
μ 0.44630.27950.56290.25930.19950.36970.17860.16480.2741
σ −0.01070.06290.0594−0.00350.04410.0417−0.00130.03580.0337
5 ε 0.19770.14030.23330.10920.11100.14660.07190.08950.1137
α 1.49415.21894.01580.56922.60323.07610.12301.70022.2917
μ 2.10361.38692.74591.23541.03431.79330.89100.82501.4584
σ −0.01960.31700.2962−0.00740.22350.20150.00060.17950.1634
01 ε 0.18940.13840.22440.11050.10720.15190.07090.09030.1096
α 1.59075.58984.40330.33832.41523.14100.06621.63562.1862
μ 0.43100.27120.54370.27210.20130.38560.18000.16570.2833
σ −0.01000.06280.0604−0.00370.04420.0417−0.00150.03590.0331
5 ε 0.19830.14170.23400.10240.11140.14070.07400.08900.1143
α 1.69005.66184.32360.51782.44532.83070.08871.69082.4463
μ 2.09041.39932.74721.14721.02061.71840.91370.83261.4344
σ −0.01610.31760.27690.00740.22490.2038−0.00400.17880.1642
Table 2. Recovery parameters for the ML estimators in the BESN: case ε = 0.25 .
Table 2. Recovery parameters for the ML estimators in the BESN: case ε = 0.25 .
Real Value n = 100 n = 200 n = 300
ε α μ σ EstimatorBiasSERMSEBiasSERMSEBiasSERMSE
−0.250.5−31 ε −0.01570.15290.24790.00010.12970.20250.01000.11980.1708
α 0.15640.37440.51070.08320.24220.32040.05850.19480.2567
μ 0.00850.42440.72070.03520.38730.61770.04290.36600.5381
σ −0.01140.10680.1043−0.00170.07000.07830.00120.05720.0638
5 ε 0.00040.15410.24540.00650.13130.2090−0.00050.11870.1746
α 0.22170.40800.59950.10740.24940.34630.05140.18960.2447
μ 0.22162.15743.63850.28271.96463.20130.09411.79442.6873
σ −0.05620.53630.5480−0.02890.34700.3834−0.00700.27870.3263
01 ε −0.00740.15110.25020.00690.13170.20340.00400.11950.1751
α 0.18510.38250.56630.08630.24320.31620.04500.19060.2427
μ 0.02940.42300.72860.05340.39320.61360.03660.36190.5402
σ −0.01280.10100.1047−0.00420.06940.07270.00110.05850.0690
5 ε −0.00270.15220.2416−0.00310.13200.19540.00630.12130.1788
α 0.17540.38990.57290.07800.24570.33670.04750.19310.2444
μ 0.23652.13443.53990.10971.95403.00060.21161.84322.7743
σ −0.03990.55510.5357−0.00380.37680.4089−0.01100.28130.2984
1.5−31 ε −0.00290.09090.1336−0.00170.06860.0834−0.00440.05540.0602
α 0.58731.07651.40330.20780.55390.62390.14300.42780.4549
μ 0.00270.30000.4507−0.00150.23530.2782−0.00950.19170.2066
σ −0.01500.06270.0677−0.00660.04510.0476−0.00440.03710.0373
5 ε −0.00320.09590.1343−0.00110.06850.0786−0.00220.05670.0606
α 0.49221.00051.23410.22180.56000.65180.11110.41810.4243
μ 0.00011.59042.25160.02821.17771.3209−0.01370.98211.0770
σ −0.07140.32320.3412−0.02800.22620.2271−0.01900.18610.1954
01 ε −0.00360.09540.1166−0.00330.06740.08470.00250.05530.0580
α 0.43900.98871.20580.21650.55570.61480.13430.42570.4539
μ 0.00460.31840.3861−0.00440.23170.28690.01340.19160.2030
σ −0.01140.06370.0657−0.00570.04510.0449−0.00620.03710.0381
5 ε 0.00910.09200.1342−0.00160.06960.08930.00270.05490.0594
α 0.56151.08131.45520.18500.54940.61070.14160.42720.4530
μ 0.24881.51972.22350.01221.19191.51790.03890.94911.0313
σ −0.07830.32030.3451−0.01990.22750.2236−0.02360.18550.1873
3.0−31 ε 0.00030.05950.06380.00040.04130.0423−0.00060.03330.0342
α 1.31452.93773.43020.52151.36531.62280.33491.00201.1052
μ 0.01300.19360.21410.00260.13600.1350−0.00100.10980.1058
σ −0.01120.05600.0581−0.00470.03990.0390−0.00320.03250.0310
5 ε 0.00280.06030.0741−0.00160.04130.0437−0.00040.03350.0347
α 1.15842.89073.25380.55561.39651.79040.34351.01521.2337
μ 0.07740.98421.2218−0.01500.68050.72740.00500.55280.5680
σ −0.04520.28170.3062−0.01230.19960.1983−0.01030.16310.1658
01 ε −0.00150.06050.0639−0.00310.04190.04530.00030.03320.0354
α 1.19922.81893.17280.52111.37981.76970.36521.01941.1536
μ 0.00800.19810.2161−0.00400.13840.14000.00370.10930.1152
σ −0.00640.05630.0612−0.00420.04000.0411−0.00370.03250.0323
5 ε 0.00110.05980.0742−0.00140.04170.04340.00050.03330.0337
α 1.19852.82083.07170.56481.43671.97080.34381.00431.1392
μ 0.02910.97311.26110.03060.68800.69160.02240.54960.5347
σ −0.04570.28120.2978−0.02220.19970.2049−0.01300.16290.1613
Table 3. Recovery parameters for the ML estimators in the BESN: case ε = 0.75 .
Table 3. Recovery parameters for the ML estimators in the BESN: case ε = 0.75 .
Real Value n = 100 n = 200 n = 300
ε α μ σ EstimatorBiasSERMSEBiasSERMSEBiasSERMSE
0.750.5−31 ε −0.02610.11490.14280.00670.08750.10950.01410.07420.0999
α 0.01400.25160.30320.00600.17390.17940.00850.14120.1548
μ −0.12690.24050.3650−0.02730.18360.2554−0.00200.15640.2182
σ 0.00300.07440.0802−0.00580.04930.0511−0.00460.03990.0433
5 ε −0.02450.11570.14240.00850.08630.11260.01450.07360.0970
α 0.01450.26030.39960.00040.17140.19900.00420.13950.1476
μ −0.61721.19961.7918−0.14040.90371.3038−0.01980.77271.0555
σ 0.02220.38430.4149−0.00700.25000.2842−0.02430.19880.2222
01 ε −0.02460.11310.14000.00170.08680.11090.00650.07440.1019
α 0.01970.26820.45940.00970.17500.20130.00850.14220.1598
μ −0.12570.23290.3677−0.04000.18280.2608−0.01720.15780.2368
σ 0.00550.07630.0823−0.00190.04930.0543−0.00220.04020.0460
5 ε −0.03100.12020.14250.00370.08580.11530.01390.07490.1003
α 0.01320.25430.35800.00660.17160.18070.00030.13930.1444
μ −0.65141.25861.8396−0.18000.89301.3635−0.02030.79251.1059
σ 0.02710.38420.4040−0.01790.24560.2661−0.01510.20040.2093
1.5−31 ε −0.06990.13010.1703−0.02570.10040.1269−0.00970.08610.1094
α 0.56751.71732.39930.22430.84001.34740.19540.66931.1860
μ −0.24560.28190.4944−0.10790.22750.3477−0.06250.19500.2956
σ −0.01800.06010.0645−0.00740.04220.0439−0.00810.03420.0373
5 ε −0.08000.13070.1688−0.03140.10410.1317−0.00180.08570.1077
α 0.48051.51972.12540.17010.77801.18730.19370.71201.5127
μ −1.20881.43752.2866−0.64291.18961.8823−0.25010.96251.3974
σ −0.06270.30140.3091−0.04140.21190.2179−0.04440.17090.1871
01 ε −0.08140.13330.1736−0.02770.10020.1272−0.01140.08690.1102
α 0.35361.21981.51170.15690.78811.30500.15280.60860.9161
μ −0.26580.29560.5000−0.12500.22640.3568−0.06560.19950.2901
σ −0.01830.06010.0648−0.00930.04220.0443−0.00880.03410.0363
5 ε −0.07950.13250.1740−0.03350.10290.1326−0.01270.08710.1119
α 0.44611.46632.03140.14600.76061.25830.21740.72421.4759
μ −1.26191.47682.4543−0.66181.17161.8498−0.32741.00961.4889
σ −0.06820.30140.3115−0.03580.21210.2302−0.02770.17250.1795
3−31 ε −0.06870.13020.1310−0.02620.09960.1019−0.00730.08260.0853
α 1.38214.90263.98070.87222.85433.31110.74482.22952.8217
μ −0.22080.27640.3953−0.10340.21950.3009−0.05270.18750.2324
σ −0.02170.06110.0673−0.01040.04430.0466−0.00930.03650.0403
5 ε −0.07630.13240.1448−0.02630.09780.1049−0.00610.08390.0895
α 1.11074.68724.02880.95113.06693.65940.70112.14872.7362
μ −1.14171.42132.1190−0.56471.07661.5742−0.25840.94801.2493
σ −0.06950.30750.3168−0.05870.22120.2465−0.04510.18250.1965
01 ε −0.06980.13250.1382−0.02590.09870.1049−0.00610.08330.0888
α 0.94974.37103.77660.73722.73923.26770.59771.98592.3943
μ −0.24050.28060.4226−0.11320.21800.3008−0.05560.18780.2497
σ −0.02260.06100.0686−0.01130.04420.0470−0.00950.03640.0406
5 ε −0.08480.13250.1587−0.02770.10120.1036−0.01070.08460.0891
α 0.89464.21923.65390.73492.70613.09200.68832.29723.1316
μ −1.34571.44612.4238−0.60041.12741.5485−0.32520.96381.2807
σ −0.09280.30710.3216−0.06700.22080.2401−0.04740.18270.1982
Table 4. Summary statistics for the variable roller.
Table 4. Summary statistics for the variable roller.
n Mean S b 1 b 2
11503.5350.650−0.9864.855
Table 5. Summary statistics for the variable birthweight.
Table 5. Summary statistics for the variable birthweight.
nMeanS b 1 b 2
5003210834.0920.0712.068
Table 6. Parameter estimates and SE for the N, TN and BESN models.
Table 6. Parameter estimates and SE for the N, TN and BESN models.
ParameterNTNBESN
μ 3210.356 (37.301)3207.422 (26.083)3220.873 (26.947)
σ 834.092 (26.415)772.688 (25.760)555.235 (16.408)
α 1.676 (0.368)
λ 1.771 (0.539)
AIC8148.2848109.6728093.565
BIC8156.7138122.3158106.208
Table 7. Estimated parameters and AIC and BIC for the SN, ETN, BESN and MN distributions. The respective SE are in parentheses.
Table 7. Estimated parameters and AIC and BIC for the SN, ETN, BESN and MN distributions. The respective SE are in parentheses.
ParametersSNETNBESNParametersMN
ξ 3.4851 (0.1545)2.1977( 0.0973)1.6740 (0.1323) ξ 1 1.1320 (0.1094)
σ 0.9797 (0.1281)0.9094 (0.0892)0.5954 (0.0364) σ 1 0.2280 (0.0771)
ε 4.0647(1.9118) ξ 2 2.9167(0.0640)
α −1.5950 (0.5872)2.3479 (1.4204)−0.4232 (0.0692) σ 2 0.5494 (0.0434)
β 1.1509 (0.2944) p0.0853 (0.0316)
AIC190.198190.768186.3271 188.751
BIC197.525200.538196.097 200.964
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Duarte, J.; Martínez-Flórez, G.; Gallardo, D.I.; Venegas, O.; Gómez, H.W. A Bimodal Extension of the Epsilon-Skew-Normal Model. Mathematics 2023, 11, 507. https://doi.org/10.3390/math11030507

AMA Style

Duarte J, Martínez-Flórez G, Gallardo DI, Venegas O, Gómez HW. A Bimodal Extension of the Epsilon-Skew-Normal Model. Mathematics. 2023; 11(3):507. https://doi.org/10.3390/math11030507

Chicago/Turabian Style

Duarte, Juan, Guillermo Martínez-Flórez, Diego Ignacio Gallardo, Osvaldo Venegas, and Héctor W. Gómez. 2023. "A Bimodal Extension of the Epsilon-Skew-Normal Model" Mathematics 11, no. 3: 507. https://doi.org/10.3390/math11030507

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop