1. Introduction
Nonnegative data sets are quite usual in different scientific fields. To carry out inference for these kinds of data, models of continuous distributions with a nonnegative support are required. In this sense, we can cite the following models: Half-Normal (HN), Right Half Bimodal Normal (RHBN) (Alavi [
1]), the Generalized Gamma (Meeker and Escobar [
2]), Truncated Positive Normal (Gómez et al. [
3]), Slash Truncated Positive Normal (Gómez et al. [
4] ), Truncated Skewed Bimodal Normal (Sharifipanah et al. [
5]), Extended generalized half-normal (Duarte Sánchez et al. [
6]), Extended generalized half-normal with progressive type-I interval censoring (Ahmadi and Yousefzadeh [
7]), Gamma-Exponential (Kudryavtsev and Shestakov [
8]). In a more general setting, families of distributions with nonnegative support can be found, in which the model is built on the basis of given probability density functions symmetric around zero,
. In this sense, we highlight the seminar paper by Elal-Olivero et al. [
9]. Specifically, let
Y be a continuous random variable (rv) with support in
and probability density function (pdf)
symmetric about zero, satisfying that
. Then, Elal-Olivero et al. [
9] proposed to obtain a family of nonnegative continuous distributions,
X, whose pdf is given by:
where
is a shape parameter, and
is a scale parameter.
If
, where
denotes the pdf of the
distribution, then the Extended Half-Normal (EHN) distribution, proposed in [
9], is obtained, whose pdf is:
The EHN model was studied in depth in Elal-Olivero et al. [
9]. The aim of this paper is to introduce a new model of distributions based on (
1) and taking as
, the pdf of the Power Exponential (PE) distribution,
, which was introduced in Subbotin [
10]. Recall that the pdf of a
distribution is:
where the normalising constant
is:
Moreover, for the
model,
is:
The
model has received a great deal of interest in papers dealing with robust inference, such as Box [
11] and Box et al. [
12]. Its statistical properties can be seen, for instance, in Nadarajah [
13]. Details about its use as a prior robust in Bayesian Statistics are given in Choy and Walker [
14]. We highlight that, in the
model, the kurtosis coefficient depends on
. The
model includes the normal and the Laplace distribution, along with other symmetric distributions around zero with lighter or heavier tails than the normal distribution. So, our proposal will be to obtain a new model, called the Extended Half-Power Exponential (EHPE), by using (
1) and (
3), which will be more flexible for its kurtosis than the EHN distribution, and moreover, it contains the EHN as a particular case. As for the outline of this paper, in
Section 2, the EHPE distribution is defined, and its properties are studied: pdf and cumulative distribution function (cdf), reliability and hazard functions, moments, and skewness and kurtosis coefficients. In
Section 3, the estimation of the parameters is discussed by using the maximum likelihood (ML) method. In
Section 4, a Montecarlo simulation study is carried out, which shows the good asymptotic behaviour of ML estimates. In
Section 5, two applications to COVID-19 data are given. It will be shown there that this model can be used to describe nonnegative asymmetric data with light or heavy tails.
Section 6 is devoted to the final conclusions about our study.
2. EHPE Distribution
In this section, the EHPE distribution is introduced. Its pdf and cumulative distribution function (cdf) are given, along with some properties of interest in reliability and survival analysis, such as the reliability and hazard function. The section is completed with the study of moments, which allow us to study the skewness and kurtosis.
2.1. Probability Density Function
Proposition 1. Let . Then, the pdf of X is given by:where , and are parameters of this model. On the other hand, c and k are features of the model, which were introduced in (4) and (5), respectively. Proof. By applying (
1) and taking into account the expression for the pdf of the Power Exponential distribution,
, given in (
3), the result proposed in (
6) is obtained. □
Remark 1 (Interpretation of parameters in (
6))
. is a shape parameter, is a scale parameter, and it will be seen in Corollary 2 that is a parameter mainly related to the skewness and kurtosis of EHPE model. By construction, the following models are particular cases for the EHPE distribution:
EHPE EHN ;
EHPE HN ;
EHPE RHBN (2).
Figure 1 summarizes the relationships among the EHPE and the particular cases previously cited.
In the next proposition, we highlight that (
6) can be expressed as a mixture of two half densities. This result is useful to interpret the parameters in
Figure 2.
Proposition 2. Let .
- 1.
The pdf of X can be written as a mixture of two half densities:where The pdf’s given in (8) and (9) are the half pdf’s built from and , respectively. - 2.
- 3.
Proof. The expression given in (
7) is immediate from (
6). Note that
given in (
9) is a half-density, since
is a pdf symmetrical to about zero.
Moreover, 2. and 3. are immediate from (
7).
□
Remark 2. In Figure 2, plots for the pdf in the EHPE model for different values of and are given (). It can be seen in [14] that for , the tails of the distribution are more platykurtic than the normal ones. To asses the effect of this fact on the EHPE model, the values and are considered in Figure 2, panels (a) and (b), respectively. As for α, its values vary from (black), (red), 2 (green), and ∞ (blue). In this way, (a) and (b) panels show the effect of considering , an increasing value of β, and for fixed β, the effect of increasing the value of α. Panel (c) is devoted to . Recall that, in this case, the distribution reduces to the and the to the [9]. On the other hand, for , the tails in distribution are more leptokurtic than the normal ones, see [14]. The positive values of β are considered in panels (d), (e), and (f). The effect of increasing the β value (and for a fixed β, to increase the value of α), can be appreciated there. It is also worth highlighting that the case corresponds to the EHPE distributions built from the Laplace or double exponential model, . In all panels, it can be appreciated, that increasing the value of α, a higher coordinate in the origin is obtained for the pdf.
Remark 3. (a) Note that the results given in Proposition 2, along with plots in Figure 2, show that by applying (1), it is possible to obtain a plethora of pdf’s whose shapes varying from to depending on the value of . (b) Also note that if , then reduces to .
2.2. Some Properties
Next, the cdf of is obtained. Recall that this function is defined as .
Proposition 3. Let . Then, the cdf of X is given by:where is the cdf of a nonnegative rv W with pdf , , and is the lower incomplete gamma function, . Proof. Making the change of variable,
, we have that:
Note that the first integral is the cdf of the nonnegative rv W with pdf , for . As for the second integral, changing the variable, , the lower incomplete gamma function is obtained, with . □
Proposition 4. Let . Then:
- (i)
The reliability function, , of X is given by: - (ii)
The hazard function, , of X is given by:
Remark 4. Plots for the reliability and hazard function are given in Figure 3 for several values of α and β, (). Although the hazard functions in Figure 3 are increasing functions of t, we point out that other shapes are also possible. For instance, for , or , is first decreasing and later increasing. In next proposition, we prove that the cdf in a EHPE model can be expressed as a mixture of two gamma cdf’s.
Proposition 5. Let . Then, the cdf of X can be written as:where is the cdf of a Gamma distribution, and is the cdf of a Gamma distribution. Proof. Recall (
10). Changing variable
in
, after some algebra, we have that:
It can be seen in Abramowitz and Stegun [
15] that the cdf of a Gamma distribution,
, denoted as
can be obtained as:
Therefore, by applying (
14), (
13) can be obtained as:
where
is the cdf of a Gamma
distribution. As for the second summand in (
10), taking into account that
and the expression of
, given in (
5), (
12) is obtained. □
Remark 5.
- 1.
The result given in Proposition 5 is similar to the one given in [9] for the EHN distribution. - 2.
The fact that the EHPE cdf can be expressed as a mixture of gamma cdf’s may motivate the use of this model as a competitor of Rayleigh type models such as those introduced in [16,17].
2.3. Moments
The moments of the EHPE distribution are given in the next proposition.
Proposition 6. Let . Then, the nth moment of X, where n is a positive integer, is given by:with . Proof. Let
W with
for
. Then, the
nth moment of
W is:
By noting that
and using (
16), (
15) is obtained. □
Corollary 1. If , then:
- 1.
- 2.
- 3.
- 4.
- 5.
The variance of X, , is:
Corollary 2. Let . Then, the skewness coefficient, , and the kurtosis coefficient, , are given by:where . Remark 6. The expressions for the skewness and kurtosis coefficients given in Corollary 2 are obtained by using: Remark 7. Proposition 6 shows that the moments of distribution basically depend on moments of the model. Plots for the skewness and kurtosis coefficients of distribution are given in Figure 4 for different values of α and β parameters. In this figure, the effect of the β parameter can be seen. A greater value of β produces a higher value of the skewness and kurtosis coefficients. This fact can also be appreciated in Table 1 and Table 2. 4. Simulation Study
In this section, a Monte Carlo simulation study is carried out to illustrate the behaviour of ML estimators. For the sample size n, the values , 200 and 500 are considered; without loss of generality ; for 0.1 and 0.2; and for −0.3, 0 and 0.1. For every combination of , and , 1000 samples of size n are generated from the EHPE model. To generate random numbers of an acceptance–rejection technique is proposed, which proceeds as follows:
- Step 1.
Simulate Y from pdf g;
- Step 2.
Simulate ;
- Step 3.
If define . Otherwise, repeat Step 1.
For
,
g is taken as the pdf of an
distribution introduced in [
9], which corresponds to the EHPE model with
. On the other hand, for
,
g is the pdf of an
model. In both settings,
c is taken as the maximum value of
. For every sample, the ML estimates are obtained by applying Newton–Raphson algorithm. As initial values to start the MLE recursion algorithms, the estimate of parameter
in the EHN distribution,
and
are taken. In
Table 3, the empirical bias, the mean of the standard errors (SEs) and the root of the empirical mean squared error (RMSE) are given for the estimators of the parameters. Note that, although the ML estimators are biased, the bias decreases when the sample size increases. The SEs and RMSEs also decrease with the sample size. These facts suggest that the ML estimators are consistent. Moreover, approximate confidence intervals to 95% level were obtained, by using the asymptotic distribution of ML estimators. Their empirical coverage probabilities (CP) have also been included in
Table 3. In this table, it can be appreciated that, in general, the performance is good.