Next Article in Journal
Kalman Filter for Linear Discrete-Time Rectangular Singular Systems Considering Causality
Previous Article in Journal
Adaptive Neural Consensus of Unknown Non-Linear Multi-Agent Systems with Communication Noises under Markov Switching Topologies
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Modified Unit-Half-Normal Distribution with Applications

1
Departamento de Estadística y Ciencias de Datos, Facultad de Ciencias Básicas, Universidad de Antofagasta, Antofagasta 1240000, Chile
2
Instituto de Ciências Matemáticas e de Computação, Universidade de São Paulo, São Carlos 13560-095, Brazil
3
Departamento de Ciencias Matemáticas y Físicas, Facultad de Ingeniería, Universidad Católica de Temuco, Temuco 4780000, Chile
*
Author to whom correspondence should be addressed.
Mathematics 2024, 12(1), 136; https://doi.org/10.3390/math12010136
Submission received: 4 November 2023 / Revised: 23 December 2023 / Accepted: 27 December 2023 / Published: 31 December 2023
(This article belongs to the Section Probability and Statistics)

Abstract

:
In this article, we introduce a new continuous distribution based on the unit interval. This distribution is generated from a transformation of a random variable with half-normal distribution. We study its basic properties, percentiles, moments and order statistics. Maximum likelihood estimation is applied, and we present a simulation study to observe the behavior of the maximum likelihood estimators. We examine two applications to real proportions datasets, where the new distribution is shown to provide a better fit than other distributions defined in the unit interval.

1. Introduction

In real life, it is quite common to find continuous data sets in the interval ( 0 , 1 ) . These data are the product of measurements that interpret different indices and rates. An example is insurance data, where a probability distribution can be used as a distortion function to define a premium principle (see Denuit et al. [1]). There are many studies involving measurements between ( 0 , 1 ) , see for example Cook et al. [2] and Gupta and Nadarajah [3], etc. Continuous distributions with support in ( 0 , 1 ) are fundamental for modeling these data; for example, the two-parameter Beta is the model most frequently used to model data of this kind due to its great flexibility (see Johnson et al. [4]). A random variable (r.v.) X is called a Beta distribution with parameters α and β if its probability density function (pdf) is given by
f Z ( z ; α , β ) = 1 B ( α , β ) z α 1 ( 1 z ) β 1 , 0 < z < 1 ,
where α > 0 , β > 0 and B( · , · ) is the Beta function. Another distribution with support in ( 0 , 1 ) is the Kumaraswamy distribution (see Kumaraswamy [5]). A r.v. Z has a Kumaraswamy (KM) distribution with parameters α and β if its pdf is given by
f Z ( z ; α , β ) = α β z α 1 ( 1 z α ) β 1 , 0 < z < 1 ,
where α > 0 and β > 0 .
In recent years, several distributions with positive support have been transformed into distributions with unit support, for example Grassia [6], based on the Gamma distribution; Jones [7], based on the Kumaraswamy distribution; Mazucheli et al. [8], based on the Birnbaum-Saunders distribution; Ghitany et al. [9], based on the inverse Gaussian distribution; Modi et al. [10], based on the Burr III distribution; Korkmaz and Chesneau [11], based on the Burr XII distribution; Haq et al. [12], based on the modified Burr-III distribution; Gómez-Déniz et al. [13], Mazucheli et al. [14], Mazucheli et al. [15], based on the Lindley distribution, and more recently Bakouch et al. [16] based on the half-normal (HN) distribution. For example a distribution with support in ( 0 , 1 ) and only one parameter is the unit-Lindley distribution (see Mazucheli et al. [14]). A r.v. Z is called a unit-Lindley (UL) distribution with parameter σ if its pdf is given by
f Z ( z ; σ ) = σ 2 ( 1 z ) 3 1 + σ exp σ z 1 z , 0 < z < 1 ,
where σ > 0 .
In this article, we introduce a new probability distribution with a restricted domain. Its distribution is derived by modifying the representation of the unit-half-normal (UHN) distribution introduced by Bakouch et al. [16]. One of the motivations of distribution theory is to provide new alternatives to known distributions in order to improve the statistical modeling of certain datasets. Our work is based on the HN distribution. Thus, we say that an r.v. X is called an HN distribution with scale parameter σ if its pdf is given by
f X ( x ; σ ) = 2 σ ϕ x σ , x > 0 ,
with σ > 0 , and ϕ ( · ) is the expression of the standard normal distribution. We denote this by X H N ( σ ) and some of its properties are:
  • The cumulative distribution function (cdf) of X is F X ( x ; σ ) = 2 Φ x σ 1 .
  • The r th moments are expressed by E ( X r ) = 2 r / 2 Γ r + 1 2 σ r π , r = 1 , 2 ,
where Φ ( · ) is the cdf of the standard normal distribution, and Γ ( x ) = 0 t x 1 e t d t is the gamma function. Hogg and Tanis [17] discuss some properties of the HN distribution.
Bakouch et al. [16] introduce the UHN distribution, which is the product of a transformation of the random variables X H N ( σ ) . Using the following transformation Y = X 1 + X they obtain the UHN distribution, the pdf of which is given by
f Y ( y ; σ ) = 2 σ ( 1 y ) 2 ϕ y σ ( 1 y ) , 0 < y < 1 ,
where σ > 0 and we denote it by Y U H N ( σ ) .
In Figure 1, we show the pdf of the UHN distribution for several values of σ . In Figure 2, we show a histogram of a proportions dataset. The shape that can be adopted by the UHN distribution close to zero does not represent this dataset; we therefore sought a different transformation with this characteristic. The main object of the present article is to study a new distribution that is a modification of the UHN distribution and offers an alternative to the UHN distribution for modeling proportion data with positive asymmetry, as shown by the data in Figure 2.
The rest of the paper is organized as follows. In Section 2, we give the representation of this distribution and generate the new density, its properties, moments and order statistics. In Section 3, we derive an inference by maximum likelihood (ML) and carry out a simulation study. Section 4 shows two applications to real datasets. In Section 5 we provide some final conclusions.

2. Density Function and Properties

In this section, we introduce the representation, density and properties of the new distribution.

2.1. Stochastic Representation

The representation of this new distribution is
Z = 1 1 + X ,
where X H N ( σ ) , σ > 0 , and we call the distribution of Z the modified unit-half-normal (MUHN). This is denoted by Z M U H N ( σ ) . Mazucheli et al. [15] use this representation in the Lindley distribution, obtaining a distribution called New Unit-Lindley (NUL). Applications of the NUL distribution are given in Ferreira and Mazucheli [18] and Alrumayh et al. [19], among others.

2.2. Density Function

The following result shows the pdf of the MUHN distribution, which is generated using the representation given in (2).
Proposition 1.
Let Z M U H N ( σ ) . Then, the pdf of Z is given by
f Z ( z ; σ ) = 2 σ z 2 ϕ 1 z σ z , 0 < z < 1 ,
where σ > 0 .
Proof. 
Let X H N ( σ ) , using the representation given in (2), and the random variables transformation method the result is obtained.    □
Proposition 2.
Let Z M U H N ( σ ) . Then, the MUHN distribution has unimodality at z 0 = 1 + 8 σ 2 1 4 σ 2 .
Proof. 
Differentiating the density given in (3) with respect to z set equal to zero gives the result.    □
In Figure 3, we show the pdf of the MUHN distribution for several values of σ .

2.3. Cumulative Distribution Function

The following proposition shows the cdf of the MUHN distribution.
Proposition 3.
Let Z M U H N ( σ ) . Then, the cdf of Z is given by
F Z ( z ; σ ) = 2 Φ z 1 σ z ,
where σ > 0 .
Proof. 
Calculating the cdf of Z directly, we have
F Z ( z ; σ ) = 0 z 2 σ t 2 ϕ 1 t σ t d t .
Making the following change of the variable u = 1 t σ t , the result is obtained.    □

2.4. Reliability Analysis

The reliability function r ( t ) and the hazard function h ( t ) of the MUHN distribution are given in the following corollary.
Corollary 1.
Let T M U H N ( σ ) . Then, the reliability and hazard of T is given by
r ( t ) = 1 2 Φ t 1 σ t , a n d h ( t ) = 2 ϕ 1 t σ t σ t 2 1 2 Φ t 1 σ t ,
where σ > 0 is shape parameter.
In Figure 4, we show the Hazard function of MUHN distribution for several values of  σ .
Proposition 4.
Let Z M U H N ( σ ) . Then, the quantile function (Q) of the MUHN distribution is given by
Q ( p ) = 1 σ Φ 1 p 2 1 , 0 < p < 1 ,
where Φ 1 ( · ) is the inverse cdf of a standard normal distribution.
Proof. 
Using the cdf given in (4), we have
p = F Z ( z ; σ ) = 2 Φ z 1 σ z .
Applying the inverse function of the cdf of a standard normal distribution and clearing for z, the result is obtained.    □

2.5. Order Statistics

Let Z 1 , . . . , Z n be a random sample of the r.v. Z M U H N ( σ ) . We denote by Z ( j ) the j t h order statistics, j { 1 , , n } .
Proposition 5.
The pdf of Z ( j ) is
f Z ( j ) ( z ; σ ) = 2 j n ! σ ( j 1 ) ! ( n j ) ! z 2 ϕ 1 z σ z Φ j 1 z 1 σ z 1 2 Φ z 1 σ z n j , 0 < z < 1 .
In particular, the pdf of the minimum, Z ( 1 ) , is
f Z ( 1 ) ( z ; σ ) = 2 n σ z 2 ϕ 1 z σ z 1 2 Φ z 1 σ z n 1 , 0 < z < 1
and the pdf of the maximum, Z ( n ) , is
f Z ( n ) ( z ; σ ) = n 2 n σ z 2 ϕ 1 z σ z Φ n 1 z 1 σ z , 0 < z < 1
Proof. 
Since the model is absolutely continuous, the pdf of the j t h order statistics is obtained by applying
f Z ( j ) ( z ; σ ) = n ! ( j 1 ) ! ( n j ) ! f Z ( z ; σ ) [ F Z ( z ; σ ) ] j 1 [ 1 F Z ( z ; σ ) ] n j , j { 1 , , n }
where F and f denote the cdf and pdf of the parent distribution, Z M U H N ( σ ) in this case.    □

2.6. Moments

An important numerical function for calculating the r-th moments of the random variable Z M U H N ( σ ) is defined as
a r ( σ ) = 0 exp ( u 2 / 2 ) ( 1 + σ u ) r d u , r = 1 , 2 , 3 , . . .
More details of this function can be found in Appendix A.
Proposition 6.
Let Z M U H N ( σ ) . Then, for  r = 1 , 2 , 3 , , the r-th moment of Z is given by
E ( Z r ) = 2 2 π a r ( σ ) .
Proof. 
Using the representation given in (2) and calculating the r-th moments directly, we have
E ( Z r ) = E 1 ( 1 + X ) r = 0 2 σ ( 1 + x ) r ϕ x σ d x .
Making the following change in the variable u = x σ , the result is obtained.    □
Corollary 2.
Let Z M U H N ( σ ) . Then, the mean and variance of the r.v. Z are given respectively by
E ( Z ) = 2 a 1 ( σ ) and V a r ( Z ) = 2 a 2 ( σ ) 2 a 1 2 ( σ ) ,
and the asymmetry and kurtosis coefficients are given respectively by
β 1 = a 3 ( σ ) 6 a 1 ( σ ) a 2 ( σ ) + 8 a 1 3 ( σ ) 2 a 2 ( σ ) 2 a 1 2 ( σ ) 3 / 2 ,
β 2 = a 4 ( σ ) 8 a 1 ( σ ) a 3 ( σ ) + 24 a 1 2 ( σ ) a 2 ( σ ) 24 a 1 4 ( σ ) 2 a 2 ( σ ) 2 a 1 2 ( σ ) 2 .
Figure 5 depicts plots for the asymmetry and kurtosis coefficients in the MUHN distribution.
Proposition 7.
Let Z M U H N ( σ ) . Then, the moment-generating function ( M Z ) of the r.v. Z is given by
M Z ( t ) = 2 j = 0 t j j ! a j ( σ ) ,
where a j are given in (7).
Proof. 
Using the representation given in (2), we have
M Z ( t ) = E exp ( t Z ) = E exp t 1 + X = 0 exp t 1 + x 2 σ ϕ x σ d x ,
making the change of the variable u = x σ , expanding the exp ( · ) function in series and using (7) the result is obtained.    □
The following proposition shows a closed expression for negative moments.
Proposition 8.
Let Z M U H N ( σ ) . Then, for  r = 1 , 2 , 3 , , the negative r-th moment of Z is given by
E ( Z r ) = σ k π k = 0 r r k 2 k 2 Γ k + 1 2 .
Proof. 
Calculating the negative moments directly using binomial theorem, the result is obtained.    □
From this we have that: E ( Z 1 ) = 1 + σ 2 π .

3. Inference

In this section, we estimate the parameter σ of the MUHN model using a modified moments (MM) method and the ML method, we present a simulation study, and we discuss the asymptotic estimation of the ML estimator.

3.1. Mm Estimation

For a random sample z 1 , , z n derived from the MUHN( σ ) distribution, Z 1 ¯ = 1 n i = 1 n 1 z i , then MM estimator of σ is:
σ ^ M M = π 2 Z 1 ¯ 1 .

3.2. Ml Estimation

For a random sample z 1 , , z n derived from the MUHN( σ ) distribution, the log-likelihood function can be written as
l ( σ ) = n log ( 2 ) n log ( 2 π ) n log ( σ ) i = 1 n log ( z i 2 ) 1 2 σ 2 i = 1 n 1 z i z i 2 .
The score equation is given by
d l ( σ ) d σ = n σ + 1 σ 3 i = 1 n 1 z i z i 2 = 0 ,
the ML estimator for σ ( σ ^ ) is obtained by resolving the following Equation (13) and its
σ ^ = 1 n i = 1 n 1 z i z i 2 1 / 2 ,
Hence, for large samples, the ML estimator, σ ^ , is asymptotically normal, that is,
n σ ^ σ L N ( 0 , I F 1 ( σ ) ) .
It results from this that the asymptotic variance of the ML estimator σ ^ is the inverse of Fisher’s information I F ( σ ) = 2 σ 2 , i.e.,  V a r ( σ ^ ) σ 2 2 n .
Proposition 9.
For a random sample z 1 , , z n derived from the MUHN(σ) distribution, we have that
n σ σ ^ χ ( n )
where χ ( n ) denotes the chi-distribution with n degrees of freedom.
Proof. 
As Z M U H N ( σ ) , then 1 σ 2 1 Z Z 2 χ ( 1 ) 2 , where χ ( 1 ) 2 chi-squared distribution with 1 degree of freedom. From the properties of the chi-squared distribution we have 1 σ 2 i = 1 n 1 z i z i 2 χ ( n ) 2 , luego n σ 1 n i = 1 n 1 z i z i 2 1 / 2 χ ( n ) .    □
Corollary 3.
Some direct consequences of the result given in (15) are
  • E ( σ ^ ) = σ 2 Γ n + 1 2 n Γ n 2
  • V a r ( σ ^ ) = σ 2 1 2 Γ 2 n + 1 2 n Γ 2 n 2
The σ ^ estimator is asymptotically unbiased for σ . With these results the bias and the mean squared error can be calculated.

3.3. Simulation Study

To examine the behavior of the ML estimation approach, we carried out a simulation study to assess the performance of the estimation, using the parameter σ of the MUHN distribution. Two algorithms, Algorithms 1 and 2, are proposed to generate random numbers from the MUHN distribution. The simulation analysis was carried out by generating 1000 samples of size n = 30, 38, 40, 50, 80 and 100 from the MUHN distribution. The objective of this simulation is to study the behavior of the ML of the parameter σ of the MUHN model.
Algorithm 1 to simulate values from the Z M U H N ( σ ) distribution.
1:
Generate Y U n i f o r m ( 0 , 1 ) .
2:
Compute Z = 1 σ Φ 1 Y 2 1 .
Algorithm 2 to simulate values from the Z M U H N ( σ ) distribution.
1:
Generate Y N ( 0 , 1 ) .
2:
Compute X = σ Y .
3:
Compute Z = 1 1 + X .
The code for both algorithms can be found in the following repository https://github.com/isaaccortes1989/MUHN-Codes (accessed on 2 November 2023). Since the results of the two algorithms are similar, we only present those of Algorithm 1. Table 1 displays the empirical bias (B), standard deviation (SD), mean of the standard errors (SEs), root of the empirical mean squared error (RMSE), and the coverage probability (CP). The CP terms converge reasonably to the nominal value used for their construction (95%), suggesting that the normality is reasonable for the asymptotic distribution of the ML estimators in the MUHN model. As shown in Table 1, the performance of the estimations improves as n increases.

4. Applications

This section shows two applications of the MUHN model, highlighting its superior performance compared to other models known in the statistical literature.

4.1. Application 1

In this first application, we fit the MUHN distribution and compare it with the uniparametric UL and UHN distributions and the two-parameter Beta and KM distributions defined in the Introduction. The dataset consists of 48 samples of rocks from an oil reservoir, as reported by Cordeiro and Brito [20]. We conducted an analysis of the shape perimeter using a squared variable (area). The data are in Table 2:
Table 3 displays basic descriptive statistics for the dataset. We employ the notation b 1 and b 2 to denote sample asymmetry and kurtosis coefficients, respectively.
Using Section 3.1, the MM estimator of σ is σ ^ M M = 5.258 .
Table 4 shows the parameters estimated by ML for the UL, UHN, MUHN, Beta and KM models. Standard errors of the ML estimates are calculated using Fisher’s information corresponding to each model. For each model, we report the values of the Akaike information criterion (AIC), introduced by Akaike [21], and the Bayesian information criterion (BIC), proposed by Schwarz [22]. It is observed that both AIC and BIC criteria indicate a better fit for the Beta model.
Figure 6 illustrates the ML fit of the five models with the probability histogram. Additionally, we calculate the quantile residuals (QRs). If the model is suitable for the data, the QRs should be a sample from the standard normal model (see Dunn and Smyth [23]). This assumption can be validated using traditional normality tests, such as the Anderson–Darling (AD), Cramér-von Mises (CVM) and Shapiro–Wilkes (SW) tests.
In Figure 7, the QRs for the fitted models and the p-values for the AD, CVM and SW normality tests are provided to assess whether the QRs follow the standard normal distribution. It is observed that the QRs follow the standard normal distribution only for the MUHN model; in other word, all three test show that the data did not come from the UL, UHN, Beta and KM distributions. Figure 7 suggest that the MUHN model gives a better fit for this dataset.
The codes for this application are available on the following website: https://github.com/isaaccortes1989/MUHN-Codes/tree/main/First%20Application (accessed on 2 November 2023).

4.2. Application 2

In this second application, we fit the MUHN distribution and compare it with the two-parameter Beta and KM distributions that are defined in the Introduction. The data set consists of a sample of 38 proportions formed by COVID information taken from the Chilean database, from 4 March to 10 April 2020. These data were formed using new cases (NC), daily accumulated cases (AC) and daily cumulative deaths (CD). We analyze the proportion of daily NC to the accumulated number of survivors with the equation:
z i = N C i A C i C D i 1 .
The data are in Table 5.
Table 6 shows the descriptive summary of the data, highlighting their positive asymmetry. Using Section 3.1, the MM estimator of σ is σ ^ M M = 7.656 .
The estimates, SE, AIC, and BIC values for the UL, UHN, KM, Beta and MUHN models are displayed in Table 7. From the table, it is evident that the MUHN model demonstrates a better fit, as it exhibits the smallest criteria values with only one parameter. Furthermore, the fit of the five models with the histogram of the data can be observed in Figure 8, confirming the better fit of the MUHN model. Finally, as shown in Figure 9, all QRs indicate that a standard normal distribution is followed only for the MUHN model.
The codes for this application are available on the following website: https://github.com/isaaccortes1989/MUHN-Codes/tree/main/Second%20Application (accessed on 2 November 2023).

5. Discussion

This paper presents a study of the MUHN distribution. We show some properties and compare them with the UHN distribution in a fit using ML estimation. The MUHN distribution appears to be a viable alternative for fitting data between zero and one, and with positive asymmetry. Some other characteristics of the MUHN distribution are:
  • The representation of the MUHN distribution is simple.
  • The MUHN distribution has an explicit mode.
  • The cdf, hazard function and quantile function are explicit and represented by known functions.
  • The ML estimator shows very good behavior with small samples.
  • The applications show that the MUHN distribution is a very good alternative when the data present positive asymmetry; this was confirmed by both the AIC and BIC model selection criteria and by the Anderson–Darling, Cramér-von Mises and Shapiro–Wilkes statistical tests.

Author Contributions

Conceptualization, I.E.C. and H.W.G.; methodology, P.I.A. and H.V.; software, P.I.A. and I.E.C.; validation, I.E.C., O.V. and H.W.G.; formal analysis, P.I.A., I.E.C. and O.V.; investigation, H.V. and O.V.; writing—original draft preparation, P.I.A. and H.V.; writing—review and editing, I.E.C. and O.V.; funding acquisition, O.V. and H.W.G. All authors have read and agreed to the published version of the manuscript.

Funding

The research of P.I.A., H.V. and H.W.G. was supported by Semillero UA-2023.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data sets are available in the text.

Conflicts of Interest

The authors declare no conflicts of interest.

Appendix A

The function a r ( σ ) defined in (7) when r = 2 , 3 , is given by:
1 σ 2 π 1 2 r + 1 2 σ r σ Γ ( 1 r 2 ) · 1 F 1 ( r 2 , 1 2 ; 1 2 σ 2 ) + 2 Γ ( 1 r 2 ) · 1 F 1 ( r + 1 2 , 3 2 ; 1 2 σ 2 ) + 1 r 1 · p F q ( 1 2 , 1 ; 1 r 2 , 3 r 2 ; 1 2 σ 2 )
and when r = 1 is given by
1 σ 2 π exp ( 1 2 σ 2 ) π Erfi ( 1 2 σ ) ExpIntegralEi ( 1 2 σ 2 ) ,
where p F q ( ) is the generalized hypergeometric function (for more details see Abramowitz and Stegun [24]), and for a definition and properties of the integral functions Erfi ( · ) and ExpIntegralEi ( · ) we refer the reader to Weisstein [25,26].

References

  1. Denuit, M.; Dhaene, J.; Goovaerts, M.J.; Kaas, R. Actuarial Theory for Dependent Risks; John Wiley & Sons Ltd.: Chichester, UK, 2005. [Google Scholar]
  2. Cook, D.O.; Kieschnick, R.; McCullough, B. Regression analysis of proportions in finance with self selection. J. Empir. Financ. 2008, 15, 860–867. [Google Scholar] [CrossRef]
  3. Gupta, A.K.; Nadarajah, S. Handbook of Beta Distribution and Its Applications; CRC Press: New York, NY, USA, 2004. [Google Scholar]
  4. Johnson, N.L.; Kotz, S.; Balakrishnan, N. Continuous Univariate Distributions, 2nd ed.; John Wiley & Sons Inc.: New York, NY, USA, 1995; Volume 2. [Google Scholar]
  5. Kumaraswamy, P. A generalized probability density function for double-bounded random processes. J. Hydrol. 1980, 46, 79–88. [Google Scholar] [CrossRef]
  6. Grassia, A. On a Family of Distributions with Argument between 0 and 1 Obtained by Transformation of the Gamma Distribution and Derived Compound Distributions. Aust. J. Stat. 1977, 19, 108–114. [Google Scholar] [CrossRef]
  7. Jones, M.C. Kumaraswamy’s distribution: A beta-type distribution with some tractability advantages. Stat. Methodol. 2009, 6, 70–81. [Google Scholar] [CrossRef]
  8. Mazucheli, J.; Menezes, A.F.B.; Dey, S. The unit-Birnbaum-Saunders distribution with applications. Chil. J. Stat. 2018, 1, 47–57. [Google Scholar]
  9. Ghitany, M.; Mazucheli, J.; Menezes, A.; Alqallaf, F. The unit-inverse Gaussian distribution: A new alternative to two-parameter distributions on the unit interval. Commun. Stat. Theory Methods 2018, 48, 3423–3438. [Google Scholar] [CrossRef]
  10. Modi, K.; Gill, V. Unit Burr III distribution with application. J. Stat. Manag. Syst. 2019, 23, 579–592. [Google Scholar] [CrossRef]
  11. Korkmaz, M.; Chesneau, C. On the unit Burr-XII distribution with the quantile regression modeling and applications. Comput. Appl. Math. 2021, 40, 29. [Google Scholar] [CrossRef]
  12. Haq, M.; Hashmi, S.; Aidi, K.; Ramos, P.F.L. Unit Modified Burr-III Distribution: Estimation, Characterizations and Validation Test. Ann. Data Sci. 2023, 10, 415–449. [Google Scholar] [CrossRef]
  13. Gómez-Déniz, E.; Sordo, M.A.; Calderín-Ojeda, E. The Log-Lindley distribution as an alternative to the beta regression model with applications in insurance. Insur. Math. Econ. 2014, 54, 49–57. [Google Scholar] [CrossRef]
  14. Mazucheli, J.; Menezes, A.F.B.; Chakraborty, S. On the one parameter unit-Lindley distribution and its associated regression model for proportion data. J. Appl. Stat. 2020, 46, 700–714. [Google Scholar] [CrossRef]
  15. Mazucheli, J.; Bapat, S.R.; Menezes, A.F.B. A new one-parameter unit-Lindley distribution. Chil. J. Stat. 2020, 11, 53–67. [Google Scholar]
  16. Bakouch, H.S.; Nikb, A.S.; Asgharzadehb, A.; Salinas, H.S. A flexible probability model for proportion data: Unit-half-normal distribution. Commun. Stat. Case Stud. Data Anal. 2021, 7, 271–288. [Google Scholar] [CrossRef]
  17. Hogg, R.V.; Tanis, E.A. Probability and Statistical Inference, 4th ed.; MacMillan Publishing: New York, NY, USA, 1993. [Google Scholar]
  18. Ferreira, A.B.; Mazucheli, J. The zero, one and zero-and-one-inflated new unit-Lindley distributions. Braz. J. Biom. 2022, 40, 291–326. [Google Scholar] [CrossRef]
  19. Alrumayh, A.; Weera, W.; Khogeer, H.A.; Almetwally, E.M. Optimal analysis of adaptive type-II progressive censored for new unit-Lindley model. J. King Saud Univ. Sci. 2023, 35, 102462. [Google Scholar] [CrossRef]
  20. Cordeiro, G.M.; Brito, R.D.S. The beta power distribution. Braz. J. Probab. Stat. 2012, 26, 88–112. [Google Scholar]
  21. Akaike, H. A new look at the statistical model identification. IEEE Trans. Automat. Contr. 1974, 19, 716–723. [Google Scholar] [CrossRef]
  22. Schwarz, G. Estimating the dimension of a model. Ann. Stat. 1978, 6, 461–464. [Google Scholar] [CrossRef]
  23. Dunn, P.K.; Smyth, G.K. Randomized Quantile Residuals. J. Comput. Graph. Stat. 1996, 5, 236–244. [Google Scholar]
  24. Abramowitz, M.; Stegun, I.A. Handbook of Mathematical Functions with Formulas, Graphs, and Mathematical Tables, 9th ed.; National Bureau of Standards: Gaithersburg, MD, USA, 1968.
  25. Weisstein, E.W. “Erfi”. From MathWorld—A Wolfram Web Resource. Available online: https://mathworld.wolfram.com/Erfi.html (accessed on 2 January 2023).
  26. Weisstein, E.W. “Exponential Integral”. From MathWorld—A Wolfram Web Resource. Available online: https://mathworld.wolfram.com/ExponentialIntegral.html (accessed on 2 January 2023).
Figure 1. Densities UHN (0.5) (black), UHN (0.9) (red), UHN (1.5) (blue) and UHN (2) (green).
Figure 1. Densities UHN (0.5) (black), UHN (0.9) (red), UHN (1.5) (blue) and UHN (2) (green).
Mathematics 12 00136 g001
Figure 2. Histogram of a data set of proportions.
Figure 2. Histogram of a data set of proportions.
Mathematics 12 00136 g002
Figure 3. Densities MUHN (0.5) (black), MUHN (0.9) (red), MUHN (1.5) (blue) and MUHN (2) (green).
Figure 3. Densities MUHN (0.5) (black), MUHN (0.9) (red), MUHN (1.5) (blue) and MUHN (2) (green).
Mathematics 12 00136 g003
Figure 4. Hazard function of MUHN distribution for selected values of σ : σ = 0.4 (black), σ = 2 (red) and σ = 4 (blue).
Figure 4. Hazard function of MUHN distribution for selected values of σ : σ = 0.4 (black), σ = 2 (red) and σ = 4 (blue).
Mathematics 12 00136 g004
Figure 5. Plots of the asymmetry and kurtosis coefficients for the MUHN model.
Figure 5. Plots of the asymmetry and kurtosis coefficients for the MUHN model.
Mathematics 12 00136 g005
Figure 6. Histogram for rock samples from a petroleum reservoir; lines represent distributions fitted using ML estimates: UL ( σ ^ ) (red), UHN ( σ ^ ) (blue), MUHN ( σ ^ ) (green), Beta ( α ^ , β ^ ) (black) and KM ( α ^ , β ^ ) (brown).
Figure 6. Histogram for rock samples from a petroleum reservoir; lines represent distributions fitted using ML estimates: UL ( σ ^ ) (red), UHN ( σ ^ ) (blue), MUHN ( σ ^ ) (green), Beta ( α ^ , β ^ ) (black) and KM ( α ^ , β ^ ) (brown).
Mathematics 12 00136 g006
Figure 7. QQ-plots of the QRs: UL distribution (a); UHN distribution (b); MUHN distribution (c); Beta distribution (d) and KM distribution (e).
Figure 7. QQ-plots of the QRs: UL distribution (a); UHN distribution (b); MUHN distribution (c); Beta distribution (d) and KM distribution (e).
Mathematics 12 00136 g007
Figure 8. Histogram for COVID dataset; lines represent distributions fitted using ML estimates.
Figure 8. Histogram for COVID dataset; lines represent distributions fitted using ML estimates.
Mathematics 12 00136 g008
Figure 9. QQ-plots of the QRs: KM distribution (a); Beta distribution (b); MUHN distribution (c); UL distribution (d) and UHN distribution (e).
Figure 9. QQ-plots of the QRs: KM distribution (a); Beta distribution (b); MUHN distribution (c); UL distribution (d) and UHN distribution (e).
Mathematics 12 00136 g009
Table 1. ML estimates, B, SD, SE, RMSE, and CP for the MUHN model with sample size 30, 38, 40, 50, 80 and 100, respectively.
Table 1. ML estimates, B, SD, SE, RMSE, and CP for the MUHN model with sample size 30, 38, 40, 50, 80 and 100, respectively.
σ n σ ^ BSDSERMSECP
0.5300.496−0.0080.0650.0640.0650.936
380.498−0.0030.0580.0570.0580.941
400.499−0.0030.0560.0560.0560.954
500.499−0.0020.0500.0500.0500.937
800.5000.0000.0380.0400.0380.959
1000.5010.0010.0350.0350.0350.949
1300.987−0.0130.1270.1270.1280.926
380.989−0.0110.1110.1130.1120.934
400.991−0.0090.1080.1110.1080.944
500.995−0.0050.0970.0990.0970.937
800.996−0.0040.0790.0790.0790.938
1000.997−0.0030.0680.0700.0680.950
1.5301.487−0.0090.1890.1920.1900.932
381.489−0.0070.1690.1710.1690.943
401.489−0.0070.1670.1660.1680.936
501.492−0.0050.1490.1490.1490.943
801.495−0.0040.1140.1180.1140.950
1001.496−0.0030.1040.1060.1040.949
2301.967−0.0160.2500.2540.2520.927
381.974−0.0130.2240.2260.2250.938
401.973−0.0140.2210.2210.2220.928
501.981−0.0100.1950.1980.1960.928
801.990−0.0050.1580.1570.1580.944
1001.991−0.0050.1380.1410.1380.950
2.5302.495−0.0020.3270.3220.3270.934
382.489−0.0040.2920.2860.2920.933
402.489−0.0040.2830.2780.2830.941
502.494−0.0020.2530.2490.2530.937
802.498−0.0010.1910.1970.1910.949
1002.498−0.0010.1710.1770.1710.951
3302.972−0.0090.3760.3840.3770.943
382.975−0.0080.3380.3410.3390.934
402.977−0.0080.3300.3330.3310.943
502.988−0.0040.3000.2990.3000.940
802.988−0.0040.2310.2360.2310.944
1002.990−0.0030.2130.2110.2130.946
4303.960−0.0100.5290.5110.5300.918
383.963−0.0090.4590.4550.4600.933
403.969−0.0080.4480.4440.4490.936
503.980−0.0050.4030.3980.4030.937
803.989−0.0030.3200.3150.3200.940
1003.993−0.0020.2820.2820.2820.947
5304.949−0.0100.6470.6390.6480.927
384.958−0.0080.5800.5690.5810.929
404.968−0.0060.5600.5550.5600.926
504.973−0.0050.5090.4970.5100.938
804.987−0.0030.3980.3940.3980.938
1004.988−0.0020.3480.3530.3480.951
7.2307.149−0.0070.9300.9230.9310.939
387.164−0.0050.8360.8220.8360.939
407.166−0.0050.8130.8010.8130.939
507.171−0.0040.7430.7170.7430.922
807.175−0.0030.5840.5670.5850.94
1007.173−0.0040.5260.5070.5260.936
Table 2. The data of 48 samples of rocks from an oil reservoir.
Table 2. The data of 48 samples of rocks from an oil reservoir.
0.09032960.20365400.20431400.28088700.19765300.3286410
0.14862200.16239400.26272700.17945500.32663500.2300810
0.18331200.15094400.20007100.19180200.15419200.4641250
0.11706300.14814100.14481000.13308300.27601600.4204770
0.12241700.22859500.11385200.22521400.17696900.2007440
0.16704500.23162300.29102900.34127300.43871200.2626510
0.18965100.17256700.24007700.31164600.16358600.1824530
0.16412700.15348100.16186500.27601600.25383200.2004470
Table 3. Descriptive statistics for the first dataset.
Table 3. Descriptive statistics for the first dataset.
Data Setn Z ¯ S b 1 b 2
Rock samples from a petroleum reservoir48 0.218 0.083 1.169 4.110
Table 4. Parameter estimates with SEs (in parentheses), AIC and BIC values for UL, UHN, MUHN, Beta and KM models.
Table 4. Parameter estimates with SEs (in parentheses), AIC and BIC values for UL, UHN, MUHN, Beta and KM models.
Parameter EstimatesULUHNMUHNBetaKM
α ---5.942 (1.181)2.719 (0.294)
β ---21.206 (4.347)44.661 (17.576)
σ 4.049 ( 0.502 ) 0.337 ( 0.035 ) 4.565 ( 0.466 ) --
AIC 68.700 81.053 87.316 107.200 100.983
BIC 66.829 79.182 85.445 103.458 97.241
Table 5. The data of a sample of 38 proportions formed by COVID information.
Table 5. The data of a sample of 38 proportions formed by COVID information.
0.6666666670.250.20.2857142860.3
0.3333333330.1176470590.2608695650.3030303030.23255814
0.2950819670.1866666670.5192307690.2238805970.155462185
0.3040935670.2119815670.1918063310.1503164560.152815013
0.1908893710.1926444830.1255742730.1888198760.156626506
0.1075268820.1265822780.1055514970.0966677660.109576968
0.0891089110.1018985820.0693357190.0714434060.058835027
0.0775333570.0713328870.081372097
Table 6. Descriptive statistics for the second dataset.
Table 6. Descriptive statistics for the second dataset.
Data Setn Z ¯ S b 1 b 2
COVID380.1940.1251.8637.347
Table 7. Parameter estimates with their respective SE (in parentheses), AIC and BIC values for the indicated model.
Table 7. Parameter estimates with their respective SE (in parentheses), AIC and BIC values for the indicated model.
Parameter EstimatesULUHNKMBetaMUHN
α --1.610 (0.211)2.325 (0.501)-
β --10.512 (3.368)9.438 (2.205)-
σ 4.141 (0.578)0.443 (0.051)--7.231 (0.829)
AIC−51.442−39.916−58.371−61.191−67.334
BIC−49.805−38.278−55.096−57.916−65.696
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Alvarez, P.I.; Varela, H.; Cortés, I.E.; Venegas, O.; Gómez, H.W. Modified Unit-Half-Normal Distribution with Applications. Mathematics 2024, 12, 136. https://doi.org/10.3390/math12010136

AMA Style

Alvarez PI, Varela H, Cortés IE, Venegas O, Gómez HW. Modified Unit-Half-Normal Distribution with Applications. Mathematics. 2024; 12(1):136. https://doi.org/10.3390/math12010136

Chicago/Turabian Style

Alvarez, Paulina I., Héctor Varela, Isaac E. Cortés, Osvaldo Venegas, and Héctor W. Gómez. 2024. "Modified Unit-Half-Normal Distribution with Applications" Mathematics 12, no. 1: 136. https://doi.org/10.3390/math12010136

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop