Next Article in Journal
Unsteady Stagnation Point Flow of Hybrid Nanofluid Past a Convectively Heated Stretching/Shrinking Sheet with Velocity Slip
Previous Article in Journal
Certain Properties of Vague Graphs with a Novel Application
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

A New Family of Discrete Distributions with Mathematical Properties, Characterizations, Bayesian and Non-Bayesian Estimation Methods

1
Department of Applied Statistics and Insurance, Faculty of Commerce, Damietta University, Damietta 34519, Egypt
2
Department of Statistics, Mathematics and Insurance, Benha University, Banha 13513, Egypt
3
Department of Mathematical and Statistical Sciences, Marquette University, Milwaukee, WI 53201-1881, USA
*
Author to whom correspondence should be addressed.
Mathematics 2020, 8(10), 1648; https://doi.org/10.3390/math8101648
Submission received: 21 August 2020 / Revised: 7 September 2020 / Accepted: 22 September 2020 / Published: 24 September 2020
(This article belongs to the Section Probability and Statistics)

Abstract

:
In this work, we propose and study a new family of discrete distributions. Many useful mathematical properties, such as ordinary moments, moment generating function, cumulant generating function, probability generating function, central moment, and dispersion index are derived. Some special discrete versions are presented. A certain special case is discussed graphically and numerically. The hazard rate function of the new class can be “decreasing”, “upside down”, “increasing”, and “decreasing-constant-increasing (U-shape)”. Some useful characterization results based on the conditional expectation of certain function of the random variable and in terms of the hazard function are derived and presented. Bayesian and non-Bayesian methods of estimation are considered. The Bayesian estimation procedure under the squared error loss function is discussed. Markov chain Monte Carlo simulation studies for comparing non-Bayesian and Bayesian estimations are performed using the Gibbs sampler and Metropolis–Hastings algorithm. Four applications to real data sets are employed for comparing the Bayesian and non-Bayesian methods. The importance and flexibility of the new discrete class is illustrated by means of four real data applications.

1. Introduction and Genesis

Discretization of existing continuous probability distributions have received noticeable attention in recent years. In this paper, we present and study a new discrete analogue based on the continuous Rayleigh distribution called the discrete Rayleigh G family of distributions. The relevant mathematical properties, such as moments, cumulant generating function, moment generating function, probability generating function, central moment, and dispersion index (DisIx) are derived and analyzed. A special version of the new family related to well-known Weibull model is discussed. Some classical (non-Bayesian) estimation methods such as Cramér–von Mises estimation (CVME), the ordinary least squared estimation (OLSE), the maximum likelihood estimation (MLE), and the weighted least squared estimation (WLSE) are described and considered. Since the conditional posteriors of the parameters cannot be obtained in any standard forms, using a hybrid Markov chain Monte Carlo for drawing sample from the joint posterior of the parameters is suggested. The Bayesian estimation procedure under the squared error loss function (SELF) is also presented. Markov chain Monte Carlo (MCMC) simulations for comparing non-Bayesian and Bayesian estimations are performed. Gibbs sampler and Metropolis–Hastings (M–H) algorithm are employed and applied. The flexibility and importance of the new family is illustrated by four real data sets. The new family provides a better fit than sixteen competitive distributions. Many special member distributions could be considered and studied in the future. As a future work, we may consider the bivariate and the multivariate extensions of this new family.
In probability theory, the Rayleigh distribution is a continuous probability model for nonnegative-valued random variables (NNRVs). A Rayleigh distribution can often be observed when the overall magnitude of a vector is related to its directional components. A RV X is said to have the continuous Rayleigh distribution if its cumulative distribution function (CDF) is given by F α ( x ) = 1 e x p [ ( α x ) 2 ] | ( x 0   and   α > 0 ) . Recently, [1] defined the continuous CDF of the Rayleigh G (RG) family as F α , Φ _ ( x ) = 1 e x p [ α Φ _ 2 ( x ) ] | ( x R   and   Φ _ ( ) = G Φ _ ( ) / [ 1 G Φ _ ( ) ] ) where G Φ _ ( x ) refers to the continuous CDF of any base line model. Let α = log ( π ) then, CDF of the discrete RG family (DRG) can be expressed as
F π , Φ _ ( y ) = 1 π Φ _ 2 ( y + 1 ) | ( π ( 0 , 1 )   and   y N = N { 0 } ) ,
The corresponding reliability function (RF) can be written as
S π , Φ _ ( y ) = π Φ _ 2 ( y + 1 ) | ( π ( 0 , 1 )   and   y N ) ,
The probability mass function (PMF) of the discrete analogue of DRG family corresponding to (2) can be expressed as π , Φ _ ( y ) = S π , Φ _ ( y ) S π , Φ _ ( y + 1 ) . Therefore, the PMF can be written as
π , Φ _ ( y ) = π Φ _ 2 ( y 1 ) π Φ _ 2 ( y ) | ( π ( 0 , 1 )   and   y N ) .
where Φ _ 2 ( y ) refers to the squared odd ratio function of the NNRV Y . The hazard rate function (HRF) of the DRG family can be written as
h F ( y ) = π , Φ _ ( y ) / S π , Φ _ ( y 1 ) .
In the statistical literature, many discrete versions of continuous distributions have been proposed and studied such as a generalization of the Poisson distribution ([2]), the discrete Weibull (DW) ([3]), the discrete Rayleigh (DR) ([4]), discrete half-normal distribution ([5]), discrete Pareto (DPa) ([6]), a new discrete geometric (DGc) ([7]), the discrete inverse Weibull (DIW) ([8]), the discrete Lindley (DLi) ([9]), the discrete generalized exponentiated type II (DGE-II) ([10]), the discrete inverse Rayleigh (DIR) ([11]), exponentiated discrete Weibull (EDW) ([12]), the discrete Lindley type II (DLi-II) ([13]), discrete Lomax (DLx) ([14]),the discrete log-logistic (DLL) ([15]), the discrete Burr type XII (DBXII) ([14]), and the exponentiated discrete Lindley (EDLi) ([16]), among others.
Table 1 presents some new sub-models based on the DRG family. In Section 3, however, a comprehensive graphical and numerical analysis is given for the discrete Rayleigh Weibull model. Many useful discretization processes can be applied to many existing continuous families (see [17,18,19,20,21,22,23,24]) and continuous probability distributions (see [25,26,27,28,29,30]).
This paper is organized as follows. Some mathematical properties of the DRG family are derived and analyzed in Section 2. Some useful characterization results are derived and presented in Section 3. Estimation and inference procedures are presented in Section 4. Markov chain Monte Carlo simulations for comparing Bayesian and non-Bayesian estimation methods are performed in Section 5. Four real data applications for comparing Bayesian and non-Bayesian estimation methods are presented in Section 6. Four applications for comparing the competitive discrete models are considered in Section 7. Finally, Section 8 offers some concluding remarks.

2. Properties

2.1. Ordinary Moments

Theorem 1. 
Let Y be an NNRV, where Y ∼DRG( π , Φ _ ) family, then the r t h moment of Y can be expressed as
μ r = E ( Y r ) = y = 1 [ y r ( y 1 ) r ] π Φ _ 2 ( y ) | ( y N , π ( 0 , 1 )   and   r = 1 , 2 , 3 , ) .
Proof. 
See Appendix A.1. □
Then, using Equation (5), the mean ( μ 1 ), and μ 2 can be respectively written as
μ 1 = μ 1 = E ( Y ) = y = 1 π Φ _ 2 ( y ) | ( y N , π ( 0 , 1 )   and   r = 1 ) ,
and
μ 2 = E ( Y 2 ) = y = 1 ( 2 y 1 ) π Φ _ 2 ( y ) | ( y N , π ( 0 , 1 )   and   r = 2 ) .

2.2. Central Moments

The r th central moment of Y , say μ r , is
μ r = E ( Y μ 1 ) r = w = 0 r ( μ 1 ) w ( r w ) μ r w | ( | ( y N , π ( 0 , 1 )   and   r = 1 , 2 , 3 , ) ) .
So, the variance ( V ( X ) ) from
E ( Y μ 1 ) 2 = μ w = 0 r ( μ 1 ) w ( r w ) μ 2 w | ( | ( y N , π ( 0 , 1 )   and   r = 2 ,
can be derived using the ordinary moments
μ 2 = V ( Y ) = μ 2 ( μ 1 ) 2 = y = 1 ( 2 y 1 ) π Φ _ 2 ( y ) ( μ 1 ) 2 | ( y N , π ( 0 , 1 )   and   r = 2 ) .
The DisIx or the variance to mean ratio (VMR) of the DRG family can be derived as
DisIx ( Y ) = y = 1 ( 2 y 1 ) π Φ _ 2 ( y ) y = 1 π Φ _ 2 ( y )   y = 1 π Φ _ 2 ( y ) | ( y N   and   π ( 0 , 1 ) ) .
The DisIx is used to quantify whether a specific set of observations are clustered” or “dispersed” compared to a specific statistical model, so, the DisIx is used to determine whether the observed real data set can be modeled via a Poisson process. For any real data set, when the DisIx is <1, the data set is said to be “under-dispersed” data. A numerical analysis with its related calculations for the DisIx(Y) is presented in Table 2.

2.3. The Moment and Cumulant Generating Function (MGF & CGF)

Theorem 2. 
Let Y be an NNRV, where Y ∼DRG( π , Φ _ ) family, then the MGF of Y can be obtained as
M Y ( t ) = 1 + y = 1 { e x p ( t Y ) e x p [ t ( Y 1 ) ] } π Φ _ 2 ( y ) | ( y N , π ( 0 , 1 )   and   r = 1 , 2 , 3 , ) .
Proof. 
See Appendix A.2. □
The first r derivatives of Equation (6), with respect to t | t = 0 , yield the first r raw moments, i.e.,
μ r = E ( Y r ) = d r d t r M Y ( t ) | ( t = 0   and   r = 1 , 2 , 3 , ) ,
where
μ = E ( Y ) = d d t M Y ( t ) | t = 0 = y = 1 π Φ _ 2 ( y ) | ( y N , π ( 0 , 1 )   and   r = 1 ) ,
μ 2 = E ( Y 2 ) = d 2 d t 2 M Y ( t ) | ( t = 0 ) = y = 1 ( 2 y 1 ) π Φ _ 2 ( y ) | ( y N , π ( 0 , 1 )   and   r = 2 ) ,
μ 3 = E ( Y 3 ) = d 3 d t 3 M Y ( t ) | ( t = 0 ) = y = 1 [ 3 y ( y 1 ) + 1 ] π Φ _ 2 ( y ) | ( y N , π ( 0 , 1 )   and   r = 3 ) ,
and
μ 4 = E ( Y 4 ) = d 4 d t 4 M Y ( t ) | ( t = 0 ) = y = 1 [ y 4 ( y 1 ) 4 ] π Φ _ 2 ( y ) | ( y N , π ( 0 , 1 )   and   r = 4 ) .
The CGF is the logarithm of the MGF. Thus, r t h cumulant, say κ r , can be obtained from κ r = d r d t r l o g [ M Y ( t ) ] | ( t = 0 ,   and   r = 1 , 2 , 3 , ) . The first cumulant is the mean, the second cumulant is the variance, and the third cumulant is the same as the third central moment κ 3 = μ 3 that being said κ 1 = μ 1 = E ( Y ) ,   κ 2 = μ 2 = μ 2 μ 1 2 and κ 3 = μ 3 = μ 3 3 μ 2 μ 1 + 2 μ 1 3 . However, the fourth and higher order cumulants are not equal to central moments. In some cases, theoretical treatments of problems in terms of cumulants are simpler than those using moments. In particular, when two or more RVs are statistically independent, the r th order cumulant of their sum is equal to the sum of their r th order cumulants. Moreover, the cumulants can be also obtained from κ r | r 1 = μ r w = 0 r 1 ( r 1 w 1 ) μ r w κ w . The skewness ( S ( Y ) ) and kurtosis measures ( K ( Y ) ) of Y follow from the well-known relationships. A numerical analysis for the S ( Y ) and K ( Y ) for a special member of the new family is presented in Table 2 with useful comments. The probability generating function can be written as P Y ( s ) = 1 + y = 1 ( 1 1 s ) s y π Φ _ 2 ( y ) | ( y N , π ( 0 , 1 )   and   r = 1 , 2 , 3 , ) . For the Weibull base line model, we have Φ _ ( y ) = e x p ( y θ ) 1 | y N , π ( 0 , 1 ) , θ > 0 . Then based on (3), the PMF of the DRW model (see Table 1) can be expressed as π , θ ( y ) = π [ e x p ( y θ ) 1 ] 2 π { e x p [ ( y + 1 ) θ ] 1 } 2   | ( y N , π ( 0 , 1 )   and   θ > 0 ) . Clearly, when θ = 1 , the DRW model reduced to the DRE model. Figure 1 gives some plots of the PMF of the DRW model for various values of the parameters. Figure 2 shows some plots of the HRF of the DRW model for various values of the parameters. Based on Figure 1, we conclude that the PMF of the DRW can be “heavy tail right skewed PMF” and “left skewed PMF”. Based on Figure 2, we see that the HRF of the DRW can be “decreasing” ( π = 0.5 , θ = 5 ), “upside down” ( π = 0.9 , θ = 0.75 ), “increasing” ( π = 0.992 , θ = 0.263 ) and “decreasing-constant-increasing” ( π = 0.99 , θ = 0.25 ). Based on Table 2, it is noted that E ( Y ) of the DRW model increases as π increases and decreases as θ increases. The S ( y ) of the DRW distribution can be positive and negative. The spread for K ( y ) is ranging from 1.9 to . The DisIx ( Y ) of the DRW distribution can range in the interval (0.1) or more than 1 or equal 1 (for π = 0.001 , θ = 5 ) like Poisson distribution. So, the new DRW distribution could be useful in modeling “under-dispersed” or “over-dispersed” count data.

3. Characterizations of DRG Distribution

The problem of characterizing a distribution is an important problem in applied sciences, where an investigator is vitally interested to know if their model follows the right distribution. To this end, the investigator relies on conditions under which their model would follow specifically the chosen distribution. In this section, we present certain characterizations of the DRG distribution. These characterizations are based on the conditional expectation of certain function of the random variable and in terms of the hazard function.
Proposition 1. 
Let Y : Ω N * be a random variable. The PMF of Y is Equation (3) if and only if
E { [ π Φ _ 2 ( Y ) + π Φ _ ( Y + 1 ) ] | X   > k } = π Φ _ 2 ( k + 1 ) | k N * .
Proof. 
If Y has PMF Equation (3), then the left-hand side of Equation (7), will be
[ 1 F π , Φ _ ( k ) ] 1 y = k + 1 { π 2 Φ _ 2 ( y ) π 2 Φ _ 2 ( y + 1 ) } = π Φ _ 2 ( k + 1 ) [ π 2 Φ _ 2 ( k + 1 ) ] = π Φ _ 2 ( k + 1 ) | k N * .
Conversely, if Equation (7) holds, then
y = k + 1 { [ π Φ _ 2 ( y ) + π Φ _ 2 ( y + 1 ) ]   f ( y ) } = [ 1 F ( k ) ] π Φ _ 2 ( k + 1 ) = { [ 1 F π , Φ _ ( k + 1 ) ] + π , Φ _ ( k + 1 ) } π Φ _ 2 ( k + 1 ) .
From Equation (8), we also have
y = k + 2 { [ π Φ _ 2 ( y ) + π Φ _ 2 ( y + 1 ) ] π , Φ _ ( y ) } = [ 1 F π , Φ _ ( k + 1 ) ] π Φ _ 2 ( k + 2 ) .
Now, subtracting Equation (9) from Equation (8), we arrive at
[ π Φ _ 2 ( k + 1 ) + π Φ _ 2 ( k + 2 ) ] π , Φ _ ( k + 1 ) = [ π Φ _ 2 ( k + 1 ) π Φ _ 2 ( k + 2 ) ] [ 1 F π , Φ _ ( k + 1 ) ] + π Φ _ 2 ( k + 1 ) π , Φ _ ( k + 1 ) .
or
π Φ _ 2 ( k + 2 )   π , Φ _ ( k + 1 ) = [ π Φ _ 2 ( k + 1 ) π Φ _ 2 ( k + 2 ) ] [ 1 F π , Φ _ ( k + 1 ) ] ,
or
π , Φ _ ( k + 1 ) 1 F π , Φ _ ( k + 1 ) = π Φ _ 2 ( k + 1 ) π Φ _ 2 ( k + 2 ) π Φ _ 2 ( k + 2 ) = π Φ _ 2 ( k + 1 ) π Φ _ 2 ( k + 2 ) 1 .
From the last equality, we have
h F ( k + 1 ) = π , Φ _ ( k + 1 ) 1 F π , Φ _ ( k + 1 ) = π Φ _ 2 ( k + 1 ) π Φ _ 2 ( k + 2 ) 1 ,
which implies that Y has PMF Equation (3). □
Proposition 2. 
Let Y : Ω N * be a random variable. The PMF of Y is Equation (3) if and only if its hazard function, h F , satisfies the following difference equation
h F ( k + 1 ) h F ( k ) = π Φ _ 2 ( k + 1 ) π Φ _ 2 ( k + 2 ) π Φ _ 2 ( k ) π Φ _ 2 ( k + 1 ) | k N * ,
with the initial condition h F ( 0 ) = 1 π Φ _ 2 ( 1 ) 1 .
Proof. 
Clearly, if Y has PMF Equation (3), then Equation ( 10 ) holds. Now, if Equation (10) holds, then
y = 0 k [ h F ( y + 1 ) h F ( y ) ] = y = 0 k [ π Φ _ 2 ( y + 1 ) π Φ _ 2 ( y + 2 ) π Φ _ 2 ( y ) π Φ _ 2 ( y + 1 ) ] ,
or
h F ( k + 1 ) h F ( 0 ) = 1 π Φ _ 2 ( 1 ) + π Φ _ 2 ( k + 1 ) π Φ _ 2 ( k + 2 ) ,
or, in view of the initial condition h F ( 0 ) = 1 π Φ _ 2 ( 1 ) 1 we have
h F ( k + 1 ) = π Φ _ 2 ( k + 1 ) π Φ _ 2 ( k + 2 ) 1 | k N * ,
which is the hazard function corresponding to the PMF Equation (3). □

4. Estimation and Inference

In this subsection, non-Bayesian and Bayesian estimation methods are considered. In the first subsection, we will consider the MLE method, CVME method, OLSE method, and WLSE method. In second subsection, the Bayesian estimation method under the SELF. All non-Bayesian estimation methods are discussed in the statistical literature with more details.

4.1. Non-Bayesian Estimation Methods

4.1.1. The MLE Method

Let Y 1 , Y 2 , , Y n be a random sample (RS) from the DRG distribution. The log-likelihood function for V _ where V _ = ( π , Φ _ j ) T | | ( j = 1 , 2 , , p ) , is given by
= ( V _ ) =   i = 0 n l n [ π Φ _ j 2 ( y i : n ) π Φ _ j 2 ( y i : n + 1 ) ] | ( π ( 0 , 1 )   and   y i : n N )
which can be maximized either using the statistical programs or by solving the nonlinear system obtained from ( V _ ) via differentiation. The score vector U ( V _ ) = ( ( V _ ) / π , ( V _ ) / Φ _ j ) T , are
( V _ ) / π =   i = 0 n Φ _ j 2 ( y i : n ) π [ Φ _ j 2 ( y i : n ) ] 1 Φ _ j 2 ( y i : n + 1 ) π [ Φ _ j 2 ( y i : n + 1 ) ] 1 π Φ _ j 2 ( y i : n ) π Φ _ j 2 ( y i : n + 1 ) ,
and
( V _ ) / Φ _ j =   i = 0 n Φ _ j 2 ( y i : n ) Φ _ j π Φ _ j 2 ( y i : n ) l n π Φ _ j 2 ( y i : n + 1 ) Φ _ j π Φ _ j 2 ( y i : n + 1 ) l n π π Φ _ j 2 ( y i : n ) π Φ _ j 2 ( y i : n + 1 ) | ( j = 1 , 2 , , p ) ,
where
Φ _ j 2 ( y i : n ) / Φ _ j = 2 Φ _ j ( y i : n ) Φ _ j Φ _ j ( y i : n ) ,
and
Φ _ j 2 ( y i : n + 1 ) / Φ _ j = 2 Φ _ j ( y i : n + 1 ) 2 1 Φ _ j Φ _ j ( y i : n + 1 ) .
Setting 0 = ( V _ ) / π = ( V _ ) / Φ _ j and solving them simultaneously yields the MLEs for the parameters of the DRG family. The Newton–Raphson algorithm is employed for obtaining the numerical solutions in such cases.

4.1.2. The CVME Method

The CVME of the parameter vector V _ = ( π , Φ _ j ) T are obtained via minimizing the following expression with respect to (WRT) to π and Φ _ j respectively, where
C V M ( V _ ) = 1 12 n 1 + i = 1 n [ F π , Φ _ j ( y i : n ) ξ ( i , n ) [ 1 ] ] 2 | ( π ( 0 , 1 )   and   y i : n N ) ,
where ξ ( i , n ) [ 1 ] = 2 i 1 2 n and
C V M ( V _ ) = i = 1 n [ 1 π Φ _ j 2 ( y i : n + 1 ) ξ ( i , n ) [ 1 ] ] 2 .
The, CVMEs are obtained by solving the following two non-linear equations
i = 1 n ( 1 π Φ _ j 2 ( y i : n + 1 ) ξ ( i , n ) [ 1 ] ) D ( π ) ( y i : n + 1 , V _ ) = 0 ,
and
i = 1 n ( 1 π Φ _ j 2 ( y i : n + 1 ) ξ ( i , n ) [ 1 ] ) D ( Φ _ j ) ( y i : n + 1 , V _ ) = 0 ,
where D ( π ) ( y i : n + 1 , V _ ) = F π , Φ _ j ( y i : n ) / π and D ( Φ _ j ) ( y i : n + 1 , V _ ) = F π , Φ _ j ( y i : n ) / Φ _ j are the first partial derivatives of the CDF of DRG distribution WRT to π and Φ _ j respectively.

4.1.3. OLSE Method

Let F π , Φ _ j ( y i : n ) denote the CDF of DRG model and let Y 1 < Y 2 < < Y n be the n ordered RS. The OLSEs are obtained upon minimizing
O L S E ( V _ ) = i = 1 n [ F π , Φ _ j ( y i : n ) ξ ( i , n ) [ 2 ] ] 2 ,
then, we have
O L S E ( V _ ) = i = 1 n [ 1 π Φ _ j 2 ( y i : n + 1 ) ξ ( i , n ) [ 2 ] ] 2 ,
where ξ ( i , n ) [ 2 ] = i n + 1 . The LSEs are obtained via solving the following non-linear equations
0 = i = 1 n [ 1 π Φ _ j 2 ( y i : n + 1 ) ξ ( i , n ) [ 2 ] ] D ( π ) ( y i : n + 1 , V _ ) ,
and
0 = i = 1 n [ 1 π Φ _ j 2 ( y i : n + 1 ) ξ ( i , n ) [ 2 ] ] D ( Φ _ j ) ( y i : n + 1 , V _ ) ,
where D ( π ) ( y i : n + 1 , V _ ) and D ( Φ _ j ) ( y i : n + 1 , V _ ) defined above.

4.1.4. WLSE Method

The WLSE are obtained by minimizing the function W L S E ( V _ ) WRT π and Φ _ j
W L S E ( V _ ) = i = 1 n ξ ( i , n ) [ 3 ] [ F π , Φ _ j ( y i : n ) ξ ( i , n ) [ 2 ] ] 2 ,
where
ξ ( i , n ) [ 3 ] = [ ( 1 + n ) 2 ( 2 + n ) ] / [ i ( 1 + n i ) ] .
The WLSEs are obtained by solving
0 = i = 1 n ξ ( i , n ) [ 3 ] [ 1 π Φ _ j 2 ( y i : n + 1 ) ξ ( i , n ) [ 2 ] ] D ( π ) ( y i : n + 1 , V _ ) ,
and
0 = i = 1 n ξ ( i , n ) [ 3 ] [ 1 π Φ _ j 2 ( y i : n + 1 ) ξ ( i , n ) [ 2 ] ] D ( Φ _ j ) ( y i : n + 1 , V _ ) ,
where D ( π ) ( y i : n + 1 , V _ ) and D ( Φ _ j ) ( y i : n + 1 , V _ ) are defined above.

4.2. Bayesian Estimation

Assume the beta and uniform priors for the parameters π and Φ _ j respectively. Then,
p 1 , ( ϕ 1 , ψ 1 ) ( π ) beta ( ϕ 1 , ψ 1 ) ,
and
p 2 , ( ϕ 2 , ψ 2 ) ( Φ _ j ) Uniform ( ϕ 2 , ψ 2 ) .
Assume that the parameters are independently distributed. The joint prior distribution p ( ϕ i , ψ i ) ( π , Φ _ j ) is given by
p ( ϕ i , ψ i ) ( π , Φ _ j ) = 1 ( ψ 2 ϕ 2 ) B ( ϕ 1 , ψ 1 )   π ϕ 1 ( 1 π ) ψ 1 ,
where B ( , ) is the beta function. The posterior distribution p ( π , Φ _ j | z _ ) of the parameters is defined as
p ( π , Φ _ j | z _ ) likelihood   function × p ( ϕ i , ψ i ) ( π , Φ _ j ) .
Under SELF, the Bayesian estimators of π and Φ _ j are the means of their marginal. It is not possible to obtain the Bayesian estimates through the above formulae. So, the numerical approximations are needed. We propose the use of MCMC techniques, namely the Gibbs sampler and M–H algorithm (see [31,32,33,34] for more details). Since the conditional posteriors of the parameters π and Φ _ j cannot be obtained in any standard forms, using a hybrid MCMC for drawing sample from the marginal posterior of the parameters is suggested. Then, the full conditional posteriors of π and Φ _ j can be easily derived. The simulation algorithm is given by:
  • Provide the initial values, say π and Φ _ j then at i th stage;
  • Using M–H algorithm, generate [ π ] ( i ) p 1 ( [ π ] ( i ) | [ π ] ( i 1 ) , [ Φ _ j ] ( i 1 ) , z _ ) ;
  • Using M–H algorithm, generate [ Φ _ j ] ( i ) p 2 ( [ Φ _ j ] ( i ) | [ π ] ( i ) , [ Φ _ j ] ( i 1 ) , z _ ) ;
  • Repeat steps 1 3 , M = 100,000 times to obtain the sample of size M from the corresponding posteriors of interest. Obtain the Bayesian estimates of π and Φ _ j using the following formulae π ^ Bayes = 1 M M 0 h = 1 + M 0 M π [ h ] and Φ _ j ^ Bayes = 1 M M 0 h = 1 + M 0 M Φ _ j [ h ] respectively, where M 0 ( 50,000 ) is the burn-in period of the generated MCMC.

5. Simulations for Comparing Non-Bayesian and Bayesian Estimation Methods

For the DRW case, MCMC simulation studies are performed for assessing and comparing the performance of non-Bayesian and Bayesian estimations. The numerical assessment is performed based on the mean squared errors (MSEs). First, we generated 1000 samples of the DRW distribution, where n = 50, 150, 300, 500. The MSEs are obtained and listed in Table 3, Table 4 and Table 5. Based on Table 3, Table 4 and Table 5 we note that all methods perform well. Though the Bayesian method is better in some cases. The performance of all estimation methods improves and tends to 0 when n .

6. Real Data Modeling for Comparing Bayesian and Non-Bayesian Estimation Methods

In this Section, four examples of real data sets are introduced for comparing the Bayesian and non-Bayesian estimation methods. We consider the Akaike information criterion (AIC) and Correct Akaike IC (CAIC) statistic for comparing the Bayesian and non-Bayesian estimation methods.

6.1. Failure Times Data of 50 Devices

This data represents the failure times of 50 devices (in weeks) put on a certain life test (see [35]). The data are: 0.1, 0.2, 1, 1, 1, 1, 1, 2, 3, 6, 7, 11, 12, 18, 18, 18, 18, 18, 21, 32, 36, 40, 45, 46, 47, 50, 55, 60, 63, 63, 67, 67, 67, 67, 72, 75, 79, 82, 82, 83, 84, 84, 84, 85, 85, 85, 85, 85, 86, 86. Table 6 gives the estimators under Bayesian and non-Bayesian estimation methods, AIC, and CAIC statistics for data set I. Figure 3 presents the frequency distribution using Bayesian estimation for data set I.
Based on Table 6, the MLE method is the best method with AIC = 478.9 and CAIC = 479.20. However, all other methods perform very well.

6.2. Failure Times of 15 Electronic Components

This lifetime data gives the failure times for 15 electronic components in an acceleration lifetime test (see [36]). The data are: 1.4, 5.1, 6.3, 10.8, 12.1, 18.5, 19.7, 22.2, 23, 30.6, 37.3, 46.3, 53.9, 59.8, 66.2. Table 7 gives the estimators under Bayesian and non-Bayesian estimation methods, AIC, and CAIC statistics for data set II. Figure 4 shows the frequency distribution using Bayesian estimation for data set II.
Based on Table 7, the MLE method is the best method with AIC = 132.1 and CAIC = 133.1. However, all other methods perform very well.

6.3. Counts of Cysts of Kidneys

This data represents the counts of cysts of corticosteroid-induced kidneys dysmorphogenetic which associated with deregulated expression of known cytogenic molecules as well as Indian hedgehog (see [37]). The data are: 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 2, 2, 2, 2, 2, 2, 2, 2, 2, 2, 3, 3, 3, 3, 3, 3, 4, 4, 4, 4, 5, 5, 6, 6, 7, 7, 8, 9, 10, 11, 11. Table 8 gives the estimators under Bayesian and non-Bayesian estimation methods, AIC, and CAIC statistics for data set III. Figure 5 gives the frequency distribution using Bayesian estimation for data set III.
Based on Table 7, the MLE method is the best method with AIC = 338.9 and CAIC = 338.9. However, all other methods perform very well.

6.4. Number of European Corn-Borer Larvae Parasites

This data represents the number of European corn-borer larvae parasites in the field ([38]). In their stochastic biological experiment, [38] randomly selected 8 hills in 15 replications and counted the number of borers per hill of corn. The data are: 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 2, 2, 2, 2, 2, 2, 2, 2, 2, 2, 2, 2, 2, 2, 2, 2, 2, 3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 4, 4, 4, 4, 4, 5, 5, 5, 5, 6, 7, 7, 8, 8. Table 9 gives the estimators under Bayesian and non-Bayesian estimation methods, AIC, and CAIC statistics for data set IV. In Figure 6, we sketched the frequency distribution using Bayesian estimation for data set IV.
Based on Table 7, the MLE method is the best method with AIC = 406.1 and CAIC = 406.2. However, all other methods perform very well.

7. Real Data Modeling for Comparing the Competitive Discrete Models Using the MLE Method

We illustrate the flexibility and the importance of the DRW distributions using four real data applications. The fitted distributions (see Table 10) are analyzed and compared using the log-likelihood function ( ), AIC, CAIC, Chi-square ( χ V 2 ) with degree of freedom (d.f), and its p-value ( P [ V ] ), Kolmogorov–Smirnov ( K S ) and its P [ V ] . Table 6 below gives the competitive models.

7.1. Failure Times Data of 50 Devices

We compare the fits of the DRW model with some competitive models, such as DW, EDW, DIW, EDLi, DPa, DLi-II, and DLL. The MLEs along with their corresponding standard errors (SEs), and the goodness of fit (GOF) test statistics are listed in Table 11 and Table 12, respectively. Statisticians have developed a remarkably powerful set of tools for analyzing normally distributed data. The most popular one is the “normal quantile-quantile (Q-Q) plot”. If the distribution of the data matched the normal distribution perfectly, all the quantile points would lie between the two blue lines. Figure 7 (left plot) gives the Q-Q plot for dataset I. Figure 7 (right) gives the box for the failure time data (data set I). In the applications, the shape of the HRF can help in selecting a particular model. For this aim, the total time on test (TTT) plot (see [39]) can be used. The TTT plot is obtained by plotting T ( r / n ) against r / n , where T ( r / n ) = [ i = 1 n y ( i ) + ( n r ) y ( r ) ] / i = 1 n y ( i ) , r = 1 , , n and the y ( i ) ’s are the order statistics of the sample. It is “convex shape” for the “monotonically decreasing HRF” and is “concave shape” for the “monotonically increasing HRF”. When solid line matches dashed line, the HRF of the data is “constant”. Figure 8 shows the TTT plot (left panel) and estimated HRF (EHRF) (right panel) for the DRW model for data set I. Based on Table 8, the DRW provides the best fits against all competitive models.

7.2. Failure Times of 15 Electronic Components

For this application, we compare the fits of the DRW model with some competitive models such as DGE-II, DLx, DEx, DIR, DR, DIW, DPa, and DBXII. The MLEs with their SEs, and the GOF statistics are reported in Table 13 and Table 14, respectively. Figure 9 gives the Q-Q plot and box for the failure times data (data set II). Figure 10 shows the total time test plots (TTT) plot and estimated HRF (EHRF) for the DRW model for data set II. Based on Table 14, the DRW provides the best fits against all competitive models.

7.3. Counts of Cysts of Kidneys

For this real data set, we compare the fits of the DRW distribution with DW, DR, DIW, DE, DLx, DLi-II, DLi, and Poisson. The MLEs with their SEs are listed in Table 15. Table 16 gives the GOF statistics. Figure 11 gives the TTT plot, Q-Q plot and Box plot versus the EHRFs for data set III. Figure 12 presents the fitted PMFs and EHRF for data set III. Based on Table 16, the DRW provides the best fits against all competitive models.

7.4. Number of European Corn-Borer Larvae Parasites

We shall compare the fits of the DRW distributions with DIW, DGIW, DIR, DPa, DR, DBXII, negative binomial (NB), and Poisson distributions. The MLEs with their corresponding SEs are listed in Table 17. Table 18 gives the GOF statistics. Figure 13 gives the TTT plot, Q-Q plot and box plot versus the EHRFs for data set IV. Figure 14 presents the fitted PMFs and EHRF for data set IV. Based on Table 18, the DRW provides the best fits against all competitive models

8. Concluding Remarks

In this work, we presented and studied a new discrete analogue based on the continuous Rayleigh distribution called the discrete Rayleigh G (DRG) family of distributions. Some of its statistical properties, such as moments, cumulant generating function, moment generating function, probability generating function, central moment, and DisIx are derived. A special discrete version of the DRG family related to a Weibull distribution is discussed. The HRF of the new family can be “decreasing”, “upside down”, “increasing” and “decreasing-constant-increasing (U-HRF)”. The PMF of the DRG can be “heavy tail right skewed PMF” and “left skewed PMF”. Some classical (non-Bayesian) estimation methods, such as Cramér–von Mises estimation, the maximum likelihood estimation, the weighted least squared estimation, and the ordinary least squared estimation are described and considered. Some useful characterization results based on the conditional expectation of certain function of the random variable and in terms of the hazard function are derived and presented.
The Bayesian procedure under the SELF is also presented. Since the conditional posteriors of the parameters cannot be obtained in any standard forms, using a hybrid MCMC for drawing sample from the joint posterior of the parameters is suggested. MCMC simulations for comparing non-Bayesian and Bayesian estimation are performed. Gibbs sampler and M–H algorithm are employed and applied. The Bayesian method is the best for all sample sizes with the minimum mean squared errors. The non-Bayesian estimation methods perform well but not better than the Bayesian method. The performance of all estimation methods improves when n .
Four applications to real data sets are employed for comparing the Bayesian and non-Bayesian methods. The importance and flexibility of the new discrete class is illustrated by means of four real data applications. Many special member distributions could be considered and studied in the future in separate papers. As a future work, we may consider the bivariate and the multivariate extensions of the DRG family. We hope that the DRG family will attract wider applications in reliability, engineering, and other areas of research.

Author Contributions

M.A. (writing—review and editing), H.M.Y. (writing the original draft preparation, validation, review and editing, and conceptualization), G.G.H. (characterization, writing—review and editing), M.I. (writing the original draft preparation, validation, review and editing, and conceptualization). All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Acknowledgments

The authors gratefully acknowledge, with thanks, the very thoughtful and constructive comments and suggestions of the four reviewers, which resulted in a much improved paper.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A

Appendix A.1

Proof of Theorem 1. 
Since
μ r = E ( Y r ) = y = 0 y r S π , Φ _ ( y ) .
Then,
μ r = E ( Y r ) = y = 0 y r [ π Φ _ 2 ( y ) π Φ _ 2 ( y + 1 ) ] = y = 1 [ y r ( y 1 ) r ] S π , Φ _ ( y 1 ) = y = 1 [ y r ( y 1 ) r ] π Φ _ 2 ( y ) | ( y N , π ( 0 , 1 )   and   r = 1 , 2 , 3 , ) .
The MGF of our NNRV Y can be derived from
M Y ( t ) = y = 0 e x p ( t Y ) ? π , Φ _ ( y ) .

Appendix A.2

Proof of Theorem 2. 
Using Equation (3) we have
M Y ( t ) = y = 0 e x p ( t Y ) [ π Φ _ 2 ( y ) π Φ _ 2 ( y + 1 ) ] ,
then,
M Y ( t ) = π Φ _ 2 ( 0 ) + [ π Φ _ 2 ( 1 ) π Φ _ 2 ( 1 ) ] + [ π Φ _ 2 ( 2 ) π Φ _ 2 ( 2 ) ] + [ π Φ _ 2 ( 3 ) π Φ _ 2 ( 3 ) ] + ,
finally
M Y ( t ) = 1 + y = 1 { e x p ( t Y ) e x p [ t ( Y 1 ) ] } π Φ _ 2 ( y ) | ( y N , π ( 0 , 1 )   and   r = 1 , 2 , 3 , ) .

References

  1. Bourguignon, M.; Silva, R.B.; Cordeiro, G.M. The Weibull-G family of probability distributions. J. Data Sci. 2014, 12, 53–68. [Google Scholar]
  2. Consul, P.C.; Jain, G.C. A generalization of the Poisson distribution. Technometrics 1973, 15, 791–799. [Google Scholar]
  3. Nakagawa, T.; Osaki, S. The discrete Weibull distribution. IEEE Trans. Reliab. 1975, 24, 300–301. [Google Scholar]
  4. Roy, D. Discrete Rayleigh distribution. IEEE Trans. Reliab. 2004, 53, 255–260. [Google Scholar]
  5. Kemp, A.W. The discrete half-normal distribution. In Advances in Mathematical and Statistical Modeling; Birkhauser: Basel, Switzerland, 2008; pp. 353–365. [Google Scholar]
  6. Krishna, H.; Pundir, P.S. Discrete Burr and discrete Pareto distributions. Stat. Methodol. 2009, 6, 177–188. [Google Scholar]
  7. Gomez-Déniz, E. Another generalization of the geometric distribution. Test 2010, 19, 399–415. [Google Scholar]
  8. Jazi, A.M.; Lai, D.C.; Alamatsaz, H.M. Inverse Weibull distribution and estimation of its parameters. Stat. Methodol. 2010, 7, 121–132. [Google Scholar]
  9. Gomez-Déniz, E.; Caldern-Ojeda, E. The discrete Lindley distribution: Properties and applications. J. Stat. Comput. Simul. 2011, 81, 1405–1416. [Google Scholar]
  10. Nekoukhou, V.; Alamatsaz, M.H.; Bidram, H. Discrete generalized exponential distribution of a second type. Statistics 2013, 47, 876–887. [Google Scholar]
  11. Hussain, T.; Ahmad, M. Discrete inverse Rayleigh distribution. Pak. J. Stat. 2014, 30, 203–222. [Google Scholar]
  12. Nekoukhou, V.; Bidram, H. The exponentiated discrete Weibull distribution. Stat. Oper. Res. Trans. 2015, 39, 127–146. [Google Scholar]
  13. Hussain, T.; Aslam, M.; Ahmad, M.A. Two parameter discrete Lindley distribution. Rev. Colomb. Estad. 2016, 39, 45–61. [Google Scholar]
  14. Para, B.A.; Jan, T.R. On discrete three-parameter Burr type XII and discrete Lomax distributions and their applications to model count data from medical science. Biom. Biostat. Int. J. 2016, 4, 1–15. [Google Scholar]
  15. Para, B.A.; Jan, T.R. Discrete version of log-logistic distribution and its applications in genetics. Int. J. Mod. Math. Sci. 2016, 14, 407–422. [Google Scholar]
  16. El-Morshedy, M.; Eliwa, M.S.; Nagy, H. A new two-parameter exponentiated discrete Lindley distribution: Properties, estimation and applications. J. Appl. Stat. 2020, 47, 354–375. [Google Scholar] [CrossRef]
  17. Aryal, G.R.; Yousof, H.M. The exponentiated generalized-G Poisson family of distributions. Econ. Qual. Control. 2017, 32, 1–17. [Google Scholar]
  18. Hamedani, G.G.; Yousof, H.M.; Rasekhi, M.; Alizadeh, M.; Najibi, S.M. Type I general exponential class of distributions. Pak. J. Stat. Oper. Res. 2017, XIV, 39–55. [Google Scholar]
  19. Hamedani, G.G.; Altun, E.; Korkmaz, M.C.; Yousof, H.M.; Butt, N.S. A new extended G family of continuous distributions with mathematical properties, characterizations and regression modeling. Pak. J. Stat. Oper. Res. 2018, 14, 737–758. [Google Scholar]
  20. Hamedani, G.G.; Rasekhi, M.; Najibi, S.M.; Yousof, H.M.; Alizadeh, M. Type II general exponential class of distributions. Pak. J. Stat. Oper. Res. 2019, XV, 503–523. [Google Scholar]
  21. Korkmaz, M.C.; Altun, E.; Yousof, H.M.; Hamedani, G.G. The odd power Lindley generator of probability distributions: Properties, characterizations and regression modeling. Int. J. Stat. Probab. 2019, 8, 70–89. [Google Scholar]
  22. Nascimento, A.D.C.; Silva, K.F.; Cordeiro, G.M.; Alizadeh, M.; Yousof, H.M. The odd Nadarajah-Haghighi family of distributions: Properties and applications. Stud. Sci. Math. Hung. 2019, 56, 1–26. [Google Scholar] [CrossRef]
  23. Cordeiro, G.M.; Yousof, H.M.; Ramires, T.G.; Ortega, E.M.M. The Burr XII system of densities: Properties, regression model and applications. J. Stat. Comput. Simul. 2018, 88, 432–456. [Google Scholar] [CrossRef]
  24. Korkmaz, M.C.; Yousof, H.M.; Hamedani, G.G.; Ali, M.M. The Marshall–Olkin generalized G Poisson family of distributions. Pak. J. Stat. 2018, 34, 251–267. [Google Scholar]
  25. Khalil, M.G.; Hamedani, G.G.; Yousof, H.M. The Burr X exponentiated Weibull model: Characterizations, mathematical properties and applications to failure and survival times data. Pak. J. Stat. Oper. Res. 2019, XV, 141–160. [Google Scholar] [CrossRef] [Green Version]
  26. Ibrahim, M.; Yousof, H.M. Transmuted Topp-Leone Weibull Lifetime Distribution: Statistical Properties and Different Method of Estimation. Pak. J. Stat. Oper. Res. 2020, 16, 501–515. [Google Scholar] [CrossRef]
  27. Ibrahim, M.; Mohammed, W.S.; Yousof, H.M. Bayesian and Classical Estimation for the One Parameter Double Lindley Model. Pak. J. Stat. Oper. Res. 2020, 16, 409–420. [Google Scholar]
  28. Korkmaz, M.Ç.; Yousof, H.M. The one-parameter odd Lindley exponential model: Mathematical properties and applications. Stoch. Qual. Control 2017, 32, 25–35. [Google Scholar] [CrossRef]
  29. Yousof, H.M.; Korkmaz, M.Ç.; Hamedani, G.G. The odd Lindley Nadarajah-Haghighi distribution. J. Math. Comput. Sci. 2017, 7, 864–882. [Google Scholar]
  30. Gad, A.M.; Hamedani, G.G.; Salehabadi, S.M.; Yousof, H.M. The Burr XII-Burr XII distribution: Mathematical properties and characterizations. Pak. J. Stat. 2019, 35, 229–248. [Google Scholar]
  31. Cai, L. Metropolis-Hastings Robbins-Monro algorithm for confirmatory item factor analysis. J. Educ. Behav. Stat. 2010, 35, 307–335. [Google Scholar] [CrossRef]
  32. Chib, S.; Greenberg, E. Understanding the metropolis-hastings algorithm. Am. Stat. 1995, 49, 327–335. [Google Scholar]
  33. Korkmaz, M.C.; Yousof, H.M.; Rasekhi, M.; Hamedani, G.G. The Odd Lindley Burr XII Model: Bayesian Analysis, Classical Inference and Characterizations. J. Data Sci. 2018, 16, 327–353. [Google Scholar]
  34. Rasekhi, M.; Saber, M.M.; Yousof, H.M. Bayesian and classical inference of reliability in multicomponent stress-strength under the generalized logistic model. Commun. Stat. Theory Methods 2020, 1–12. [Google Scholar] [CrossRef]
  35. Bodhisuwan, W.; Sangpoom, S. The discrete weighted Lindley distribution. In Proceedings of the International Conference on Mathematics, Statistics, and Their Applications, Banda Aceh, Indonesia, 4–6 October 2016. [Google Scholar]
  36. Lawless, J.F. Statistical Models and Methods for Lifetime Data; Wiley: New York, NY, USA, 2003. [Google Scholar]
  37. Chan, S.; Riley, P.R.; Price, K.L.; McElduff, F.; Winyard, P.J. Corticosteroid-induced kidney dysmorphogenesis is associated with deregulated expression of known cystogenic molecules, as well as Indian hedgehog. Am. J. Physiol. Ren. Physiol. 2009, 298, 346–356. [Google Scholar] [CrossRef] [Green Version]
  38. Bebbington, M.; Lai, C.D.; Wellington, M.; Zitikis, R. The discrete additive Weibull distribution: A bathtub-shaped hazard for discontinuous failure data. Reliab. Eng. Syst. Saf. 2012, 106, 37–44. [Google Scholar] [CrossRef]
  39. Aarset, M.V. How to identify a bathtub hazard rate. IEEE Trans. Reliab. 1987, 36, 106–108. [Google Scholar] [CrossRef]
Figure 1. The probability mass function (PMF) of the DRW for different values of the parameters.
Figure 1. The probability mass function (PMF) of the DRW for different values of the parameters.
Mathematics 08 01648 g001
Figure 2. The hazard rate function (HRF) of the DRW for different values of the parameters.
Figure 2. The hazard rate function (HRF) of the DRW for different values of the parameters.
Mathematics 08 01648 g002
Figure 3. Frequency distribution using Bayesian estimation for data set I.
Figure 3. Frequency distribution using Bayesian estimation for data set I.
Mathematics 08 01648 g003
Figure 4. Frequency distribution using Bayesian estimation for data set II.
Figure 4. Frequency distribution using Bayesian estimation for data set II.
Mathematics 08 01648 g004
Figure 5. Frequency distribution using Bayesian estimation for data set III.
Figure 5. Frequency distribution using Bayesian estimation for data set III.
Mathematics 08 01648 g005
Figure 6. Frequency distribution using Bayesian estimation for data set IV.
Figure 6. Frequency distribution using Bayesian estimation for data set IV.
Mathematics 08 01648 g006
Figure 7. The quantile-quantile (Q-Q) plot and box for the failure time data (data set I).
Figure 7. The quantile-quantile (Q-Q) plot and box for the failure time data (data set I).
Mathematics 08 01648 g007
Figure 8. The total time on test (TTT) plot and estimated hazard rate function (EHRF) for the DRW model for dataset I.
Figure 8. The total time on test (TTT) plot and estimated hazard rate function (EHRF) for the DRW model for dataset I.
Mathematics 08 01648 g008
Figure 9. Q-Q plot and box for the failure time data (dataset II).
Figure 9. Q-Q plot and box for the failure time data (dataset II).
Mathematics 08 01648 g009
Figure 10. TTT plot and EHRF for the DRW model for dataset II.
Figure 10. TTT plot and EHRF for the DRW model for dataset II.
Mathematics 08 01648 g010
Figure 11. TTT plot, Q-Q plot, and Box plot versus the EHRFs for data set III.
Figure 11. TTT plot, Q-Q plot, and Box plot versus the EHRFs for data set III.
Mathematics 08 01648 g011
Figure 12. The fitted PMFs and EHRF for data set III.
Figure 12. The fitted PMFs and EHRF for data set III.
Mathematics 08 01648 g012
Figure 13. TTT plot, Q-Q plot, and box plot versus the EHRFs for data set IV.
Figure 13. TTT plot, Q-Q plot, and box plot versus the EHRFs for data set IV.
Mathematics 08 01648 g013
Figure 14. The fitted PMFs and EHRF for data set IV.
Figure 14. The fitted PMFs and EHRF for data set IV.
Mathematics 08 01648 g014
Table 1. Some new sub-models based on the discrete Rayleigh G family (DRG) family.
Table 1. Some new sub-models based on the discrete Rayleigh G family (DRG) family.
No.Baseline Model Δ Φ ( y ) | y N 0 , π ( 0 , 1 ) Sub Model
1Log-logistic (LL) y θ | θ > 0 DRLL
2Log-logistic (LL) ( y / θ ) β | θ , β > 0 DRLL
3Weibull (W) e x p ( y θ ) 1 | θ > 0 DRW
4Exponential (E) e x p ( β y ) 1 | β > 0 DRE
5Lomax (Lx) ( 1 + y ) β 1 | β > 0 DRLx
6Rayleigh (R) e x p ( β y ) 2 1 | β > 0 DRR
7Weibull (W) e x p ( β y ) θ 1 | θ , β > 0 DRW
8Lomax (Lx) ( 1 + y / β ) θ 1 | θ , β > 0 DRLx
9Burr XII (BXII) ( 1 + y θ ) β 1 | θ , β > 0 DRBXII
10Fréchet (F) [ e x p ( y θ ) 1 ] 1 | θ > 0 DRF
11Inverse Rayleigh (IR) [ e x p ( β y 2 ) 1 ] 1 | β > 0 DRIR
12Inverse Exponential (IE) [ e x p ( β y 1 ) 1 ] 1 | β > 0 DRIE
13Inverse Lomax (ILx) [ ( 1 + β y 1 ) θ 1 ] 1 | θ , β > 0 DRILx
14Inverse Lomax (ILx) [ ( 1 + y 1 ) θ 1 ] 1 | θ > 0 DRILx
15Burr XII (BXII) [ ( 1 + ( y / λ ) θ ) β 1 ] 1 | θ , β , λ > 0 DRBXII
16Fréchet (F) [ e x p ( β θ y θ ) 1 ] 1 | β , θ > 0 DRF
17Half-logistic (HL) { [ 1 e x p ( y ) 1 + e x p ( y ) ] 1 1 } 1 DRHL
18Standard Gumbel (Gu) ( e x p { e x p [ ( y ) ] } 1 ) 1 DRSGu
19Lindley (Li) e x p ( β y ) [ ( 1 + β + β y ) / ( 1 + β ) ] 1 1 | β > 0 DRLi
20Nadarajah-Haghighi (NH) e x p [ ( 1 + β x ) θ 1 ] 1 | β , θ > 0 DRNH
21Half-logistic (HL) { [ 1 e x p ( β y ) 1 + e x p ( β y ) ] 1 1 } 1 | β > 0 DRHL
22Gompertz e x p { θ [ e x p ( β y ) 1 ] } 1 | β , θ > 0 DRGz
23Dagum (D) [ ( 1 + ( y / λ ) θ ) β 1 ] 1 | θ , β , λ > 0 DRD
24Inverse Flexible W (IFW) ( e x p { e x p [ β / y θ y ] } 1 ) 1 | θ , β > 0 DRIFW
25Gumbel (Gu) ( e x p { e x p [ ( y θ ) / β ] } 1 ) 1 | β > 0 , θ R DRGu
26Burr X (BX) ( { 1 e x p [ ( β y ) 2 ] } θ 1 ) 1 | β , θ > 0 DRBX
27Inverse Gompertz ( e x p { θ [ e x p ( β y ) 1 ] / β } 1 ) 1 | β , θ > 0 DRIGz
Table 2. E(Y), V(Y), S(Y), and K(Y) of the DRW distribution.
Table 2. E(Y), V(Y), S(Y), and K(Y) of the DRW distribution.
πθE(Y)V(Y)S(Y)K(Y)DisIx(Y)
0.00151.3884 × 10⁹1.3884 × 10⁻⁹26,837.29720,240,21
0.100 0.00111560.001114429.88949894.38170.9988844
0.250 0.01668870.01641027.54571257.937770.9833113
0.500 0.12918470.11249602.2111545.8892020.8708153
0.750 0.42768040.24476990.29235271.0854700.5723196
0.900 0.73265810.1958702−1.0513902.1054220.2673419
0.990 0.97076240.0283828−5.58861832.232650.0292376
0.999 0.99705040.0029409−18.33114337.03070.0029496
0.50.001804.573140100432033.207041242.543498406.30
0.10.5260527.04951814.38802426.516213.400800
0.50.13039370.11580982.2869666.5778180.8881549
50.12918470.1124962.2111545.8892020.8708153
Table 3. Mean squared errors (MSEs) for π = 0.607 and θ = 0.1.
Table 3. Mean squared errors (MSEs) for π = 0.607 and θ = 0.1.
n MLEOLSWLSBayesian
50π0.004490.002870.002890.00255
θ0.000730.002610.002350.00045
150π0.001760.000900.000920.00069
θ0.000170.000730.000460.00036
300π0.000730.000450.000470.00041
θ0.000060.000240.000120.00011
500π0.000430.000270.000280.00032
θ0.000040.000150.000070.00003
MLE: maximum likelihood estimation; OLS: ordinary least squared; WLS: weighted least squared.
Table 4. MSEs for π = 0.9 and θ = 0.8.
Table 4. MSEs for π = 0.9 and θ = 0.8.
n MLEOLSWLSBayesian
50π0.000390.000390.000540.00022
θ0.010860.009560.015660.00376
150π0.000170.000100.000200.00008
θ0.002990.001430.001430.00196
300π0.000100.000040.000110.00004
θ0.001810.000620.000620.00069
500π0.000070.000020.000060.00002
θ0.001320.000380.000390.00036
Table 5. MSEs for π = 0.5 and θ = 0.3.
Table 5. MSEs for π = 0.5 and θ = 0.3.
N MLEOLSWLSBayesian
50π0.004480.004350.004230.00359
θ0.339390.384390.380220.20457
150π0.001200.001290.001260.00116
θ0.069610.071940.066760.00282
300π0.000560.000670.000610.00068
θ0.007930.006790.006560.00168
500π0.000320.000380.000390.00057
θ0.001820.002180.001180.00096
Table 6. Estimators under Bayesian and non-Bayesian estimation methods, Akaike information criterion (AIC) and Correct Akaike IC (CAIC) statistics for data set I.
Table 6. Estimators under Bayesian and non-Bayesian estimation methods, Akaike information criterion (AIC) and Correct Akaike IC (CAIC) statistics for data set I.
Method π ^ θ ^ AICCAIC
MLE0.9920.234478.9479.20
OLS0.9870.206481.8482.02
WLS0.9930.251483.3483.53
Bayesian0.9910.224479.9480.11
Table 7. Estimators under Bayesian and non-Bayesian estimation methods, AIC and CAIC statistics for data set II.
Table 7. Estimators under Bayesian and non-Bayesian estimation methods, AIC and CAIC statistics for data set II.
Method π ^ θ ^ AICCAIC
MLE0.9920.264132.1133.1
OLS0.9880.246132.4133.4
WLS0.9890.243132.6133.6
Bayesian0.9910.251132.5133.5
Table 8. Estimators under Bayesian and non-Bayesian estimation methods, AIC and CAIC statistics for data set III.
Table 8. Estimators under Bayesian and non-Bayesian estimation methods, AIC and CAIC statistics for data set III.
Method π ^ θ ^ AICCAIC
MLE0.7450.183338.9338.9
OLS0.7400.171339.4339.5
WLS0.7390.188339.2339.3
Bayesian0.7370.183339.0339.2
Table 9. Estimators under Bayesian and non-Bayesian estimation methods, AIC and CAIC statistics for data set IV.
Table 9. Estimators under Bayesian and non-Bayesian estimation methods, AIC and CAIC statistics for data set IV.
Method π ^ θ ^ AICCAIC
MLE0.8490.298406.1406.2
OLS0.8550.330409.6409.7
WLS0.8600.320407.2407.3
Bayesian0.8410.292406.3406.4
Table 10. The competitive models.
Table 10. The competitive models.
ModelAbbreviation
1discrete ParetoDPa
2discrete LomaxDLx
3discrete LindleyDLi
4discrete WeibullDW
5discrete RayleighDR
6discrete log-logisticDLL
7discrete exponentialDE
8discrete Burr type XIIDBXII
9discrete Lindley type IIDLi-II
10discrete inverse RayleighDIR
11PoissonPoisson
12discrete inverse WeibullDIW
13Exponentiated discrete WeibullEDW
14Exponentiated discrete LindleyEDLi
15discrete generalized E type IIDGE-II
16Negative binomialNB
Table 11. MLEs (SEs) for data set I.
Table 11. MLEs (SEs) for data set I.
πθλ
ModelMLE (SE)MLE (SE)MLE (SE)
DRW0.9920.234
(0.004)(0.019)
EDW0.9891.1390.784
(0.164)(3.227)(3.053)
DW0.9811.023
(0.011)(0.131)
DIW0.0180.582
(0.013)(0.061)
DLi-II0.9690.058
(0.005)(0.027)
EDLi0.9720.480
(0.005)(0.087)
DLL1.0000.439
(0.321)(0.062)
DPa0.739
(0.032)
Table 12. The goodness of fit (GOF) statistics for data set I.
Table 12. The goodness of fit (GOF) statistics for data set I.
Statistic
Model↓−ℓAICCAICK-S P [ V ]
DRW237.5478.9479.20.1790.081
EDW240.2486.7487.20.1950.045
DW241.6487.2487.50.1870.061
DIW261.9527.8528.10.2580.003
DLi-II240.6485.2485.40.1860.064
EDLi240.3484.6484.80.1950.045
DLL294.9593.8594.00.535<0.001
DPa275.9553.7553.80.335<0.001
Table 13. MLEs (SEs) for dataset II.
Table 13. MLEs (SEs) for dataset II.
πθ
ModelMLE (SE)MLE (SE)
DRW0.9920.264
(0.006)(0.033)
DGE-II0.9561.491
(0.013)(0.535)
DIW2.2 × 10⁻⁴0.875
(7.8 × 10⁻⁴)(0.164)
DLx0.012104.506
(0.039)(84.409)
DBXII0.97513.367
(0.051)(27.785)
DR0.999
(2.58 × 10⁻⁴)
DIR1.8 × 10⁻⁷
(0.055)
DPa0.720
(0.061)
DE0.965
(0.009)
Table 14. The GOF statistics for dataset II.
Table 14. The GOF statistics for dataset II.
Statistic
Model−ℓAICCAICK-S P [ V ]
DRW64.0132.1133.10.1160.988
DE65.0134.0136.30.1770.673
DGE-II64.4134.8135.80.1290.937
DR66.4134.8136.10.2160.433
DIR89.1180.2180.50.698<0.001
DIW68.7141.4142.40.2090.482
DLx65.9135.7136.70.2050.491
DB-XII75.7155.5156.50.3880.015
DPa77.4156.8157.10.4050.009
Table 15. MLEs (SEs) for data set III.
Table 15. MLEs (SEs) for data set III.
Modelπθ
MLE (SE)MLE (SE)
DRW0.7450.183
(0.028)(0.019)
DW0.7500.431
(0.084)(0.340)
DIW0.5811.049
(0.048)(0.146)
DLi-II0.5810.001
(0.045)(0.058)
DLx0.1501.830
(0.098)(0.951)
DR0.901
(0.009)
DE0.581
(0.030)
DLi0.436
(0.026)
Poisson1.390
(0.112)
Table 16. The GOF statistics for dataset III.
Table 16. The GOF statistics for dataset III.
ZOFDRWDWDIWDRDExDLiDLi-IIDLxPoisson
06563.8359.0163.9111.0046.0940.2546.0361.8927.42
11416.5319.8420.7026.8326.7829.8326.7721.0138.08
2109.30610.788.0529.5515.5618.3615.579.6526.47
365.9006.264.2322.239.0410.359.055.2412.26
443.9704.192.6012.495.255.535.273.174.26
522.762.011.755.423.052.863.062.061.18
621.9671.991.261.851.771.441.781.420.27
721.4271.320.950.521.030.711.041.020.05
811.0490.990.740.110.600.350.600.760.01
910.7790.860.590.020.350.170.350.580.00
1010.5840.760.480.000.200.080.200.460.00
1120.4411.994.740.000.280.070.282.740.00
−ℓ 167.441170.14172.93277.78178.77189.1178.8170.48246.21
AIC 338.883344.28349.87557.56359.53380.2361.5344.96494.42
CAIC 338.995344.39349.98557.59359.57380.3361.6345.07494.46
χ 2 0.779513.1256.463321.0722.8843.4822.893.316294.10
d.f 333444334
P [ V ] 0.854360.3730.091<0.00010.0001<0.0001<0.00010.345<0.0001
Table 17. MLEs (SEs) for dataset IV.
Table 17. MLEs (SEs) for dataset IV.
πθλβ
ModelMLEMLEMLEMLE
DRW0.8490.298
(0.019)(0.022)
DGW0.04502.5392.1590.479
(0.429)(4.703)(2.698)(0.466)
DIW0.3451.541
(0.043)(0.156)
DBXII0.5192.358
(0.051)(0.366)
NB0.8709.956
(0.036)(0.096)
DIR0.319
(0.042)
DR0.867
(0.012)
DPa0.329
(0.034)
Poisson1.483
(0.025)
Table 18. The GOF statistics for dataset IV.
Table 18. The GOF statistics for dataset IV.
ZOFDRWDIWDBXIIDIRDRNBDPaPoisson
04346.10341.3743.8438.2815.9230.1264.4527.23
13528.00741.8539.6151.9036.1738.8720.1540.38
21718.63715.4215.6215.5134.5827.619.6929.95
31111.827.177.206.0421.0314.265.6514.81
457.1073.943.912.918.895.993.68
544.0482.422.371.612.702.172.581.63
612.1861.611.560.980.600.701.900.40
721.1191.131.090.640.090.211.460.09
820.5435.094.802.140.020.0610.440.02
−ℓ 201.058204.810204.293208.440235.23211.52220.63219.19
AIC 406.115413.621412.587418.881472.45427.05443.24440.38
CAIC 406.218413.723412.689418.915472.49427.14443.27440.41
χ 2 2.2235.5114.66414.27470.68820.36732.46238.478
d.f 33344344
P [ V ] 0.5270.1380.198<0.0001<0.00010.0001<0.0001<0.0001

Share and Cite

MDPI and ACS Style

Aboraya, M.; M. Yousof, H.; Hamedani, G.G.; Ibrahim, M. A New Family of Discrete Distributions with Mathematical Properties, Characterizations, Bayesian and Non-Bayesian Estimation Methods. Mathematics 2020, 8, 1648. https://doi.org/10.3390/math8101648

AMA Style

Aboraya M, M. Yousof H, Hamedani GG, Ibrahim M. A New Family of Discrete Distributions with Mathematical Properties, Characterizations, Bayesian and Non-Bayesian Estimation Methods. Mathematics. 2020; 8(10):1648. https://doi.org/10.3390/math8101648

Chicago/Turabian Style

Aboraya, Mohamed, Haitham M. Yousof, G.G. Hamedani, and Mohamed Ibrahim. 2020. "A New Family of Discrete Distributions with Mathematical Properties, Characterizations, Bayesian and Non-Bayesian Estimation Methods" Mathematics 8, no. 10: 1648. https://doi.org/10.3390/math8101648

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop