Next Article in Journal
Product Channeling in an O2O Supply Chain Management as Power Transmission in Electric Power Distribution Systems
Next Article in Special Issue
Optimal Repeated Measurements for Two Treatment Designs with Dependent Observations: The Case of Compound Symmetry
Previous Article in Journal
The Bounds of the Edge Number in Generalized Hypertrees
Previous Article in Special Issue
Computation of Probability Associated with Anderson–Darling Statistic
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

The Modified Beta Gompertz Distribution: Theory and Applications

1
Institute of Statistical Studies and Research (ISSR), Department of Mathematical Statistics, Cairo University, Giza 12613, Egypt
2
Department of Statistics, Govt. S.A Postgraduate College Dera Nawab Sahib, Bahawalpur, Punjab 63360, Pakistan
3
Department of Mathematics, LMNO, University of Caen, 14032 Caen, France
4
Department of Statistics, University of Jeddah, Jeddah 21589, Saudi Arabia
5
Department of Statistics, Faculty of Sciences, King Abdulaziz University, Jeddah 21589, Saudi Arabia
*
Author to whom correspondence should be addressed.
Mathematics 2019, 7(1), 3; https://doi.org/10.3390/math7010003
Submission received: 7 November 2018 / Revised: 11 December 2018 / Accepted: 17 December 2018 / Published: 20 December 2018
(This article belongs to the Special Issue Applied and Computational Statistics)

Abstract

:
In this paper, we introduce a new continuous probability distribution with five parameters called the modified beta Gompertz distribution. It is derived from the modified beta generator proposed by Nadarajah, Teimouri and Shih (2014) and the Gompertz distribution. By investigating its mathematical and practical aspects, we prove that it is quite flexible and can be used effectively in modeling a wide variety of real phenomena. Among others, we provide useful expansions of crucial functions, quantile function, moments, incomplete moments, moment generating function, entropies and order statistics. We explore the estimation of the model parameters by the obtained maximum likelihood method. We also present a simulation study testing the validity of maximum likelihood estimators. Finally, we illustrate the flexibility of the distribution by the consideration of two real datasets.

1. Introduction

The Gompertz distribution is a continuous probability distribution introduced by Gompertz [1]. The literature about the use of the Gompertz distribution in applied areas is enormous. A nice review can be found in [2], and the references therein. From a mathematical point of view, the cumulative probability density function (cdf) of the Gompertz distribution with parameters λ > 0 and α > 0 is given by
G ( x ) = 1 e λ α ( e α x 1 ) , x > 0 .
The related probability density function (pdf) is given by
g ( x ) = λ e α x e λ α ( e α x 1 ) , x > 0 .
It can be viewed as a generalization of the exponential distribution (obtained with α 0 ) and thus an alternative to the gamma or Weibull distribution. A feature of the Gompertz distribution is that g ( x ) is unimodal and has positive skewness, whereas the related hazard rate function (hrf) given by h ( x ) = g ( x ) / ( 1 G ( x ) ) is increasing. To increase the flexibility of the Gompertz distribution, further extensions have been proposed. A natural one is the generalized Gompertz distribution introduced by El-Gohary et al. [3]. By introducing an exponent parameter a > 0 , the related cdf is given by
F ( x ) = 1 e λ α ( e α x 1 ) a , x > 0 .
The related applications show that a plays an important role in term of model flexibility. This idea was then extended by Jafari et al. [4] who used the so-called beta generator introduced by Eugene et al. [5]. The related cdf is given by
F ( x ) = 1 B ( a , b ) 0 1 e λ α ( e α x 1 ) t a 1 ( 1 t ) b 1 d t = I 1 e λ α ( e α x 1 ) ( a , b ) , x > 0 .
where a , b > 0 , B ( a , b ) = 0 1 t a 1 ( 1 t ) b 1 d t and I x ( a , b ) = ( 1 / B ( a , b ) ) 0 x t a 1 ( 1 t ) b 1 d t , x [ 0 , 1 ] . This distribution has been recently extended by Benkhelifa [6] with a five-parameter distribution. It is based on the beta generator and the generalized Gompertz distribution.
Motivated by the emergence of complex data from many applied areas, other extended Gompertz distributions have been proposed in the literature. See for instance, El-Damcese et al. [7] who considered the Odd Generalized Exponential generator introduced by Tahir et al. [8]; Roozegar et al. [9] who used the McDonald generator introduced by Alexander et al. [10]; Refs. [11,12] who applied the transmuted generator introduced by Shaw and Buckley [13]; Chukwu and Ogunde [14] and Lima et al. [15] who used the Kumaraswamy generator; and Benkhelifa [16] and Yaghoobzadeh [17] who considered the Marshall–Olkin generator introduced by Marshall and Olkin [18]; and Shadrokh and Yaghoobzadeh [19] who considered the Beta-G and Geometric generators.
In this paper, we present and study a distribution with five parameters extending the Gompertz distribution. It is based on the modified beta generator developed by Nadarajah et al. [20] (which can also be viewed as a modification of the beta Marshall–Olkin generator developed by Alizadeh et al. [21]). The advantage of this generator is to nicely combine the advantages of the beta generator of Eugene et al. [5] and the Marshall–Olkin generator of Marshall and Olkin [18]. To the best of our knowledge, its application to the Gompertz distribution has never been considered before. We provide a comprehensive description of its general mathematical properties (expansions of the cdf and pdf, quantile function, various kinds of moments, moment generating function, entropies and order statistics). The estimation of the model parameters by maximum likelihood is then discussed. Finally, we explore applications to real datasets that illustrate the usefulness of the proposed model.
The structure of the paper is as follows. Section 2 describes the considered modified beta Gompertz distribution. Some mathematical properties are investigated in Section 3. Section 4 provides the necessary to the estimation of the unknown parameters with the maximum likelihood method. A simulation study is presented, which tests validity of the obtained maximum likelihood estimators. Applications to two real datasets are also given.

2. The Modified Beta Gompertz Distribution

Let c > 0 , G ( x ) be a cdf and g ( x ) be a related pdf. The modified beta generator introduced by Nadarajah et al. [20] is characterized by the cdf given by
F ( x ) = I c G ( x ) 1 ( 1 c ) G ( x ) ( a , b ) ,
By differentiation of F ( x ) , a pdf is given by
f ( x ) = c a g ( x ) G ( x ) a 1 1 G ( x ) b 1 B ( a , b ) 1 ( 1 c ) G ( x ) a + b , x R .
The hrf is given by
h ( x ) = c a g ( x ) G ( x ) a 1 1 G ( x ) b 1 B ( a , b ) 1 ( 1 c ) G ( x ) a + b 1 I c G ( x ) 1 ( 1 c ) G ( x ) ( a , b ) , x R .
Let us now present our main distribution of interest. Using the cdf G ( x ) of the Gompertz distribution with parameters λ > 0 and α > 0 as baseline, the cdf given by Equation (2) becomes
F ( x ) = I c 1 e λ α ( e α x 1 ) 1 ( 1 c ) 1 e λ α ( e α x 1 ) ( a , b ) , x > 0 .
The related distribution is called the modified beta Gompertz distribution (MBGz distribution), also denoted by MBGz( λ , α , a , b , c ). The related pdf in Equation (3) is given by
f ( x ) = c a λ e α x e λ b α ( e α x 1 ) 1 e λ α ( e α x 1 ) a 1 B ( a , b ) 1 ( 1 c ) 1 e λ α ( e α x 1 ) a + b , x > 0 .
The hrf is given by
h ( x ) = c a λ e α x e λ b α ( e α x 1 ) 1 e λ α ( e α x 1 ) a 1 B ( a , b ) 1 ( 1 c ) 1 e λ α ( e α x 1 ) a + b 1 I c 1 e λ α ( e α x 1 ) 1 ( 1 c ) 1 e λ α ( e α x 1 ) ( a , b ) , x > 0 .
Figure 1 shows the plots for f ( x ) and h ( x ) for selected parameter values λ , α , a , b , c . We observe that these functions can take various curvature forms depending on the parameter values, showing the increasing of the flexibility of the former Gompertz distribution.
A strong point of the MBGz distribution is to contain different useful distributions in the literature. The most popular of them are listed below.
  • When c = 1 / ( 1 θ ) with θ ( 0 , 1 ) ( θ is a proportion parameter), we obtain the beta Gompertz geometric distribution introduced by Shadrokh and Yaghoobzadeh [19], i.e., with cdf
    F ( x ) = I 1 e λ α ( e α x 1 ) 1 θ e λ α ( e α x 1 ) ( a , b ) , x > 0 .
    However, this distribution excludes the case c ( 0 , 1 ) , which is of importance since it contains well-known flexible distributions, as developed below. Moreover, the importance of small values for c can also be determinant in the applications (see Section 4).
  • When c = 1 , we get the beta Gompertz distribution with four parameters introduced by Jafari et al. [4], i.e., with cdf
    F ( x ) = I 1 e λ α ( e α x 1 ) ( a , b ) , x > 0 .
  • When c = b = 1 , we get the generalized Gompertz distribution studied by El-Gohary et al. [3], i.e., with cdf
    F ( x ) = 1 e λ α ( e α x 1 ) a , x > 0 .
  • When a = b = 1 and c = 1 θ with θ > 1 , we get the a particular case of the Marshall–Olkin extended generalized Gompertz distribution introduced by Benkhelifa [16], i.e., with cdf
    F ( x ) = 1 e λ α ( e α x 1 ) θ + ( 1 θ ) 1 e λ α ( e α x 1 ) , x > 0 .
  • When a = b = c = 1 , we get the Gompertz distribution introduced by Gompertz [1], i.e., with cdf
    F ( x ) = 1 e λ α ( e α x 1 ) , x > 0 .
  • When c = 1 and α 0 , we get beta exponential distribution studied by Nadarajah and Kotz [22], i.e., with cdf
    F ( x ) = I 1 e λ x ( a , b ) , x > 0 .
  • When b = c = 1 and α 0 , we get the generalized exponential distribution studied by Gupta and Kundu [23], i.e., with cdf
    F ( x ) = 1 e λ x a , x > 0 .
  • When a = b = c = 1 and α 0 we get the exponential distribution, i.e., with cdf
    F ( x ) = 1 e λ x , x > 0 .

3. Some Mathematical Properties

3.1. On the Shapes of the pdf

The shapes of f ( x ) given by Equation (5) can be described analytically. As usual, the critical points x of the pdf f ( x ) satisfies x ln ( f ( x ) ) = 0 , with
x ln ( f ( x ) ) = α b λ e α x + ( a 1 ) λ e α x e λ α ( e α x 1 ) 1 e λ α ( e α x 1 ) + ( a + b ) ( 1 c ) λ e α x e λ α ( e α x 1 ) c + ( 1 c ) e λ α ( e α x 1 ) .
The point x is a local maximum if 2 x 2 ln ( f ( x ) ) < 0 , a local minimum if 2 x 2 ln ( f ( x ) ) > 0 and a point of inflection if 2 x 2 ln ( f ( x ) ) = 0 .
Let us now study the asymptotic properties of f ( x ) . We have
f ( x ) c a B ( a , b ) λ a x a 1 , x 0 .
Thus, for a ( 0 , 1 ) , we have lim x 0 f ( x ) = + ; for a = 1 , we have lim x 0 f ( x ) = b c λ ; and, for a > 1 , we have lim x 0 f ( x ) = 0 . We have
f ( x ) 1 c b B ( a , b ) λ e α x e λ b α e λ b α e α x , x + .
Thus, lim x + f ( x ) = 0 in all cases. Figure 1a illustrates these points for selected parameters.

3.2. On the Shapes of the hrf

Similar to the pdf, the critical points x of the hrf h ( x ) given by Equation (6) satisfy x ln ( h ( x ) ) = 0 , with
x ln ( h ( x ) ) = α b λ e α x + ( a 1 ) λ e α x e λ α ( e α x 1 ) 1 e λ α ( e α x 1 ) + ( a + b ) ( 1 c ) λ e α x e λ α ( e α x 1 ) c + ( 1 c ) e λ α ( e α x 1 ) + c a λ e α x e λ b α ( e α x 1 ) 1 e λ α ( e α x 1 ) a 1 B ( a , b ) 1 ( 1 c ) 1 e λ α ( e α x 1 ) a + b 1 I c 1 e λ α ( e α x 1 ) 1 ( 1 c ) 1 e λ α ( e α x 1 ) ( a , b ) .
Again, the point x is a local maximum if 2 x 2 ln ( h ( x ) ) < 0 , a local minimum if 2 x 2 ln ( h ( x ) ) > 0 and a point of inflection if 2 x 2 ln ( h ( x ) ) = 0 .
We also have
h ( x ) c a B ( a , b ) λ a x a 1 , x 0 .
Thus, for a ( 0 , 1 ) , we have lim x 0 h ( x ) = + ; for a = 1 , we have lim x 0 h ( x ) = b c λ ; and, for a > 1 , we have lim x 0 h ( x ) = 0 . We have
h ( x ) b λ e α x , x + .
Thus, lim x + h ( x ) = + in all cases. Figure 1b illustrates these points for selected parameters.

3.3. Linear Representation

Let us determine useful linear representations for F ( x ) given by Equation (4) and f ( x ) given by Equation (5). First, let us suppose that c ( 0 , 1 ) . It follows from the generalized binomial formula, i.e., ( 1 + z ) γ = k = 0 + γ k z k for | z | < 1 and γ R , with γ k = γ ( γ 1 ) ( γ k + 1 ) k ! , that
F ( x ) = 1 B ( a , b ) 0 c G ( x ) 1 ( 1 c ) G ( x ) t a 1 ( 1 t ) b 1 d t = 1 B ( a , b ) k = 0 + b 1 k ( 1 ) k 0 c G ( x ) 1 ( 1 c ) G ( x ) t a + k 1 d t = 1 B ( a , b ) k = 0 + b 1 k ( 1 ) k a + k c G ( x ) 1 ( 1 c ) G ( x ) a + k .
On the other hand, using again the generalized binomial formula, we obtain
c G ( x ) 1 ( 1 c ) G ( x ) a + k = c a + k = 0 + ( a + k ) ( 1 ) ( 1 c ) [ G ( x ) ] + a + k .
In a similar manner, we have
[ G ( x ) ] + a + k = 1 e λ α ( e α x 1 ) + a + k = m = 0 + + a + k m ( 1 ) m ( 1 H m ( x ) ) ,
where H m ( x ) = 1 e m λ α ( e α x 1 ) is the cdf of a Gompertz distribution with parameters m λ and α . Combining these equalities, we obtain the following series expansion:
F ( x ) = m = 0 + v m ( 1 H m ( x ) ) ,
where
v m = ( 1 ) m B ( a , b ) k = 0 + = 0 + + a + k m ( a + k ) b 1 k c a + k ( 1 ) k + ( 1 c ) 1 a + k .
By derivation of F ( x ) , f ( x ) can be expressed as
f ( x ) = m = 0 + w m h m ( x ) ,
where w m = v m and h m ( x ) is the pdf of a Gompertz distribution with parameters m λ and α .
For the case c > 1 , we must do some transformation for Equation (7) to apply the generalized binomial formula. We can write
c G ( x ) 1 ( 1 c ) G ( x ) a + k = G ( x ) 1 ( 1 1 c ) ( 1 G ( x ) ) a + k = [ G ( x ) ] a + k = 0 + ( a + k ) ( 1 ) ( c 1 ) c [ 1 G ( x ) ] = = 0 + q = 0 ( a + k ) q ( 1 ) + q ( c 1 ) c [ G ( x ) ] q + a + k .
On the other hand, we have
[ G ( x ) ] q + a + k = m = 0 + q + a + k m ( 1 ) m ( 1 H m ( x ) ) .
Therefore, we can write F ( x ) as Equation (8) with
v m = ( 1 ) m B ( a , b ) k = 0 + = 0 + q = 0 ( a + k ) q q + a + k m b 1 k ( 1 ) + q + k ( c 1 ) c 1 a + k ,
and f ( x ) as Equation (9) with w m = v m (and still h m ( x ) is the pdf of a Gompertz distribution with parameters m λ and α ). For the sake of simplicity, we refer to the form in Equation (9) far all series representation of f ( x ) , whether c ( 0 , 1 ) or c > 1 .
Hereafter, we denote by X a random variable having the cdf F ( x ) given by Equation (4) (and the pdf f ( x ) given by Equation (5)) and by Y m a random variable following the Gompertz distribution with parameters m λ and α , i.e., having the cdf H m ( x ) (and the pdf h m ( x ) ).

3.4. Quantile Function

The quantile function of X is given by
Q ( u ) = 1 α ln 1 α λ ln 1 I u 1 ( a , b ) c + ( 1 c ) I u 1 ( a , b ) , u ( 0 , 1 ) ,
where I u 1 ( a , b ) denotes the inverse of I u ( a , b ) . It satisfies F ( Q ( u ) ) = Q ( F ( u ) ) = u . Using [20], one can show that
Q ( u ) 1 λ c a 1 a B ( a , b ) 1 a u 1 a , u 0 .
From Q ( u ) , we can simulate the MBGz distribution. Indeed, let U be a random variable following the uniform distribution over ( 0 , 1 ) . Then, the random variable X = Q ( U ) follows the MBGz distribution.
The median of X is given by M = Q ( 1 / 2 ) . We can also use Q ( u ) to define skewness measures. Let us just introduce the Bowley skewness based on quartiles and the Moors kurtosis respectively defined by
B = Q ( 3 / 4 ) + Q ( 1 / 4 ) 2 Q ( 1 / 2 ) Q ( 3 / 4 ) Q ( 1 / 4 ) , M o = Q ( 7 / 8 ) Q ( 5 / 8 ) + Q ( 3 / 8 ) Q ( 1 / 8 ) Q ( 6 / 8 ) Q ( 2 / 8 ) .
Contrary to γ 1 and γ 2 , these quantities have the advantage to be always defined. We refer to [24,25].

3.5. Moments

Let r be a positive integer. The rth ordinary moment of X is defined by μ r = E X r = + x r f ( x ) d x . Using the linear representation given by Equation (9), we can express μ r as
μ r = m = 0 + w m + x r h m ( x ) d x = m = 0 + w m E ( Y m r ) .
By doing the change of variables u = e α x , we obtain
E ( Y m r ) = m λ α r + 1 e m λ α 1 + ( ln u ) r e m λ α u d u .
This integral has connections with the so-called generalized integro-exponential function. Further developments can be found in [26,27]. Therefore, we have
μ r = m = 0 + w m m λ α r + 1 e m λ α 1 + ( ln u ) r e m λ α u d u .
Obviously, the mean of X is given by E ( X ) = μ 1 and the variance of X is given by V ( X ) = μ 2 ( μ 1 ) 2 .

3.6. Skewness

The rth central moment of X is given by μ r = E ( X μ 1 ) r . It follows from the binomial formula that
μ r = k = 0 r r k ( 1 ) k ( μ 1 ) k μ r k .
On the other side, the rth cumulants of X can be obtained via the equation:
κ r = μ r k = 1 r 1 r 1 k 1 κ k μ r k ,
with κ 1 = μ 1 . The skewness of X is given by γ 1 = κ 3 / κ 2 3 / 2 and the kurtosis of X is given by γ 2 = κ 4 / κ 2 2 . One can also introduce the MacGillivray skewness given by
ρ ( u ) = Q ( 1 u ) + Q ( u ) 2 Q ( 1 / 2 ) Q ( 1 u ) Q ( u ) , u ( 0 , 1 ) .
It illustrates the effects of the parameters a, b, α and λ on the skewness. Further details can be found in [28].

3.7. Moment Generating Function

The moment generating function of X is given by M X ( t ) = E e t X = + e t x f ( x ) d x . Using Equation (9), we have
M X ( t ) = m = 0 + w m + e t x h m ( x ) d x = m = 0 + w m M Y m ( t ) ,
where M Y m ( t ) = E ( e t Y m ) , the moment generating function of Y m . Doing successively the change of variables u = e α x and the change of variable v = m λ α u , we obtain
M Y m ( t ) = m λ α e m λ α 1 + u t α e m λ α u d u = e m λ α α m λ t α m λ α + v t α e v d v = e m λ α α m λ t α Γ t α + 1 , m λ α ,
where Γ ( d , x ) denotes the complementary incomplete gamma function defined by Γ ( d , x ) = x + t d 1 e t d t . Therefore, we can write
M X ( t ) = m = 0 + w m e m λ α α m λ t α Γ t α + 1 , m λ α .
Alternatively, using the moments of X, one can write
M X ( t ) = r = 0 + t r r ! μ r = r = 0 + m = 0 + t r r ! w m m λ α r + 1 e m λ α 1 + ( ln u ) r e m λ α u d u .

3.8. Incomplete Moments and Mean Deviations

The rth incomplete moment of X is defined by m r ( t ) = E X r 1 { X t } = t x r f ( x ) d x . Using Equation (9), we can express m r ( t ) as
m r ( t ) = m = 0 + w m t x r h m ( x ) d x .
Doing successively the change of variables u = e α x , we obtain
t x r h m ( x ) d x = m λ α r + 1 e m λ α 1 e α t ( ln u ) r e m λ α u d u .
The mean deviation of X about the mean is given by
δ 1 = E ( | X μ 1 | ) = 2 μ 1 F ( μ 1 ) 2 m 1 ( μ 1 ) ,
where m 1 ( t ) denote the first incomplete moment. The mean deviation of X about the median M = Q ( 1 / 2 ) is given by
δ 2 = E ( | X M | ) = μ 1 2 m 1 ( M ) .

3.9. Entropies

Let us now investigate different kinds of entropies. For the sake of simplicity in exposition, we suppose that c ( 0 , 1 ) (the case c > 1 can be considered in a similar way). The Rényi entropy of X is defined by
I γ ( X ) = 1 1 γ ln + f ( x ) γ d x ,
with γ > 0 and γ 1 . It follows from (3) that
f ( x ) γ = c a γ [ g ( x ) ] γ G ( x ) γ ( a 1 ) 1 G ( x ) γ ( b 1 ) B ( a , b ) γ 1 ( 1 c ) G ( x ) γ ( a + b ) .
The generalized binomial formula implies that
G ( x ) γ ( a 1 ) 1 ( 1 c ) G ( x ) γ ( a + b ) = k = 0 + γ ( a + b ) k ( 1 ) k ( 1 c ) k [ G ( x ) ] k + γ ( a 1 ) .
Similarly, we have
[ G ( x ) ] k + γ ( a 1 ) = = 0 + k + γ ( a 1 ) ( 1 ) [ 1 G ( x ) ] .
Therefore,
f ( x ) γ = c a γ B ( a , b ) γ k = 0 + = 0 + γ ( a + b ) k k + γ ( a 1 ) ( 1 ) k + ( 1 c ) k [ 1 G ( x ) ] + γ ( b 1 ) [ g ( x ) ] γ .
By doing the change of variable u = e α x and the change of variable v = ( + γ b ) λ α u , we get
+ [ 1 G ( x ) ] + γ ( b 1 ) [ g ( x ) ] γ d x = 0 + e ( + γ b ) λ α ( e α x 1 ) λ γ e α γ x d x = λ γ 1 α e ( + γ b ) λ α 1 + u γ 1 e ( + γ b ) λ α u d u = α γ 1 ( + γ b ) γ e ( + γ b ) λ α ( + γ b ) λ α + v γ 1 e v d v = α γ 1 ( + γ b ) γ e ( + γ b ) λ α Γ γ , ( + γ b ) λ α .
By putting the above equalities together, we have
I γ ( X ) = 1 1 γ [ α γ ln ( c ) γ ln ( B ( a , b ) ) + ( γ 1 ) ln ( α ) + γ b λ α + ln k = 0 + = 0 + γ ( a + b ) k k + γ ( a 1 ) ( 1 ) k + ( 1 c ) k e λ α ( + γ b ) γ Γ γ , ( + γ b ) λ α ] .
The Shannon entropy of X is defined by S ( X ) = E ( ln [ f ( X ) ] ) . It can be obtained by the formula S ( X ) = lim γ 1 + I γ ( X ) .
The γ -entropy is defined by
H γ ( X ) = 1 γ 1 ln 1 + f ( x ) γ d x .
Using the expansion above, we obtain
H γ ( X ) = 1 γ 1 ln [ 1 c a γ α γ 1 e γ b λ α B ( a , b ) γ × k = 0 + = 0 + γ ( a + b ) k k + γ ( a 1 ) ( 1 ) k + ( 1 c ) k e λ α ( + γ b ) γ Γ γ , ( + γ b ) λ α ] .

3.10. Order Statistics

Let X 1 , , X n be the random sample from X and X i : n be the ith order statistic. Then, the pdf of X i : n is given by
f i : n ( x ) = n ! ( i 1 ) ! ( n i ) ! f ( x ) [ F ( x ) ] i 1 1 F ( x ) n i = n ! ( i 1 ) ! ( n i ) ! j = 0 n i n i j ( 1 ) j f ( x ) [ F ( x ) ] j + i 1 .
It follows from Equations (8) and (9) that
f i : n ( x ) = n ! ( i 1 ) ! ( n i ) ! j = 0 n i n i j ( 1 ) j m = 0 + w m h m ( x ) k = 0 + v k ( 1 H k ( x ) ) j + i 1 .
Using a result from [29], power series raised to a positive power as follows
k = 0 + a k x k n = k = 0 + d n , k x k ,
where the coefficients ( d n , k ) k N are determined from the recurrence equation: d n , 0 = a 0 n and, for any m 1 , d n , m = ( 1 / ( m a 0 ) ) k = 1 m ( k ( n + 1 ) m ) a k d n , m k . Therefore, noticing that 1 H k ( x ) = e λ α ( e α x 1 ) k , we have
k = 0 + v k ( 1 H k ( x ) ) j + i 1 = k = 0 + d j + i 1 , k ( 1 H k ( x ) ) ,
where d j + i 1 , 0 = v 0 j + i 1 and, for any m 1 , d j + i 1 , m = 1 m v 0 k = 1 m ( k ( j + i ) m ) v k d j + i 1 , m k . By combining the equalities above, we obtain
f i : n ( x ) = n ! ( i 1 ) ! ( n i ) ! j = 0 n i n i j ( 1 ) j m = 0 + k = 0 + w m d j + i 1 , k h m ( x ) ( 1 H k ( x ) ) .
Finally, one can observe that h m ( x ) ( 1 H k ( x ) ) = m λ e α x e ( m + k ) λ α ( e α x 1 ) = m m + k u m + k ( x ) , where u m + k ( x ) denotes the pdf of the Gompertz distribution with parameters ( m + k ) λ and α . Thus, the pdf of ith order statistic of the MBGz distribution can be expressed as a linear combination of Gompertz pdfs, i.e.,
f i : n ( x ) = n ! ( i 1 ) ! ( n i ) ! j = 0 n i n i j ( 1 ) j m = 0 + k = 0 + w m d j + i 1 , k m m + k u m + k ( x ) .
Let r be a positive integer. Then, the rth ordinary moment of X i : n can be expressed as
E ( X i : n r ) = + x r f i : n ( x ) d x = n ! ( i 1 ) ! ( n i ) ! j = 0 n i n i j ( 1 ) j m = 0 + k = 0 + w m d j + i 1 , k m λ α r + 1 e ( m + k ) λ α 1 + ( ln u ) r e ( m + k ) λ α u d u .

4. Statistical Inference

4.1. Maximum Likelihood Estimation

We now investigate the estimation of the parameters of the MBGz distribution. Let x 1 , , x n be n observed values from the MBGz distribution and ξ = ( λ , α , a , b , c ) be the vector of unknown parameters. The log likelihood function is given by
( ξ ) = a n ln ( c ) + n ln ( λ ) + α i = 1 n x i λ b α i = 1 n ( e α x i 1 ) + ( a 1 ) i = 1 n ln 1 e λ α ( e α x i 1 ) n ln ( B ( a , b ) ) ( a + b ) i = 1 n ln 1 ( 1 c ) 1 e λ α ( e α x i 1 ) .
The maximum likelihood estimators of the parameters are obtained by maximizing the log likelihood function. They can be determined by solving the non-linear equations: λ ( ξ ) = 0 , α ( ξ ) = 0 , a ( ξ ) = 0 , b ( ξ ) = 0 , c ( ξ ) = 0 with
( ξ ) λ = n λ b α i = 1 n ( e α x i 1 ) + ( a 1 ) i = 1 n 1 α ( e α x i 1 ) e λ α ( e α x i 1 ) 1 e λ α ( e α x i 1 ) + ( a + b ) i = 1 n ( 1 c ) 1 α ( e α x i 1 ) e λ α ( e α x i 1 ) 1 ( 1 c ) 1 e λ α ( e α x i 1 ) ,
( ξ ) α = i = 1 n x i λ b α i = 1 n x i e α x i 1 α ( e α x i 1 ) + ( a 1 ) i = 1 n λ α e λ α ( e α x i 1 ) x i e α x i 1 α ( e α x i 1 ) 1 e λ α ( e α x i 1 ) + ( a + b ) i = 1 n ( 1 c ) e λ α ( e α x i 1 ) x i e α x i 1 α ( e α x i 1 ) 1 ( 1 c ) 1 e λ α ( e α x i 1 ) ,
by setting B ( 1 , 0 ) ( a , b ) = a B ( a , b ) and B ( 0 , 1 ) ( a , b ) = b B ( a , b ) (one can remark that B ( 1 , 0 ) ( a , b ) = ψ ( a ) ψ ( a + b ) and B ( 0 , 1 ) ( a , b ) = ψ ( b ) ψ ( a + b ) , where ψ ( x ) denotes the so called digamma function),
( ξ ) a = n ln c + i = 1 n ln 1 e λ α ( e α x i 1 ) n B ( 1 , 0 ) ( a , b ) B ( a , b ) i = 1 n ln 1 ( 1 c ) 1 e λ α ( e α x i 1 ) ,
( ξ ) b = λ α i = 1 n ( e α x i 1 ) n B ( 0 , 1 ) ( a , b ) B ( a , b ) i = 1 n ln 1 ( 1 c ) 1 e λ α ( e α x i 1 )
and
( ξ ) c = a n c ( a + b ) i = 1 n 1 e λ α ( e α x i 1 ) 1 + ( 1 c ) 1 e λ α ( e α x i 1 ) .
We can solve the above non-linear equations simultaneously. A mathematical package can be used to get the maximum likelihood estimators of the unknown parameters. In addition, all the second-order derivatives exist. As usual, the asymptotic normality of the maximum likelihood estimators can be used to construct informative objects (approximate confidence intervals, confidence regions, and testing hypotheses of λ , α , a , b , c , etc.).

4.2. Simulation

From a theoretical point of view, the performances of the different estimates (MLEs) for the MBGz distribution are difficult to compare. We therefore propose a simulation study that uses their mean square errors (MSEs) for different sample sizes as benchmarks. The software package Mathematica (version 9) was used. Different sample sizes were considered through the experiments at size n = 50 , 100 and 150. The experiment was repeated 3000 times. In each experiment, the estimates of the parameters were obtained by maximum likelihood methods of estimation. The means and MSEs for the different estimators can be found in Table 1. We observed that MSEs are decreasing with increasing n.

4.3. Applications

This section provides an application to show how the MBGz distribution can be applied in practice. We compared MBGz to Exponentaited Generalized Weibull–Gompertz distribution (EGWGz) by El-Bassiouny et al. [30] and other well known distributions in literature, Kumaraswamy–Gompertz (Kw-Gz), beta Gompertz (BGz) and Gompertz (Gz) models. The MLEs are computed using Quasi-Newton Code for Bound Constrained Optimization and the log-likelihood function evaluated. The goodness-of-fit measures, Anderson–Darling (A*), Cramer–von Mises (W*), Akaike Information Criterion (AIC), Bayesian Information Criterion (BIC), and log-likelihood ( ^ ) values are computed. As usual, the lower are the values of these criteria, the better is the fit. In addition, the value for the Kolmogorov–Smirnov (KS) statistic and its p-value are reported. The required computations are carried out in the R software (version 3).

4.3.1. Dataset 1

The first dataset was given in [31]. It represents the time to failure of turbocharger of a certain type of engine. The dataset is as follows: 0.0312, 0.314, 0.479, 0.552, 0.700, 0.803, 0.861, 0.865, 0.944, 0.958, 0.966, 0.977, 1.006, 1.021, 1.027, 1.055, 1.063, 1.098, 1.140, 1.179, 1.224, 1.240, 1.253, 1.270, 1.272, 1.274, 1.301, 1.301, 1.359, 1.382, 1.382, 1.426, 1.434, 1.435, 1.478, 1.490, 1.511, 1.514, 1.535, 1.554, 1.566, 1.570, 1.586, 1.629, 1.633, 1.642, 1.648, 1.684, 1.697, 1.726, 1.770, 1.773, 1.800, 1.809, 1.818, 1.821, 1.848, 1.880, 1.954, 2.012, 2.067, 2.084, 2.090, 2.096, 2.128, 2.233, 2.433, 2.585, 2.585.

4.3.2. Dataset 2

The second dataset was considered in [32]. It corresponds to a single fiber with 20 and 101 mm of gauge length, respectively. The dataset is as follows: 1.6, 2.0, 2.6, 3.0, 3.5, 3.9, 4.5, 4.6, 4.8, 5.0, 5.1, 5.3, 5.4, 5.6, 5.8, 6.0, 6.0, 6.1, 6.3, 6.5, 6.5, 6.7, 7.0, 7.1, 7.3, 7.3, 7.3, 7.7, 7.7, 7.8, 7.9, 8.0, 8.1, 8.3, 8.4, 8.4, 8.5, 8.7, 8.8, 9.0.
Table 2 and Table 3 present the maximum likelihood estimates, with the corresponding standard errors in parentheses, of the unknown parameters ( λ , α , a , b , c ) of the MBGz distribution for Datasets 1 and 2, respectively. Table 4 and Table 5 show the statistics AIC, BIC, W*, A*, KS, and p-Value values for all the considered models. We then see that the proposed MBGz model fits the considered data better than the other models. Thus, the proposed MBGz model provides an interesting alternative to other existing models for modeling positive real data. To complete this fact, PP, QQ, epdf and ecdf plots of the MBGz distribution are given in Figure 2 and Figure 3 for Datasets 1 and 2, respectively.

Author Contributions

I.E., F.J., C.C., M.E. and S.A. have contributed equally to this work.

Funding

This research received no external funding.

Acknowledgments

The authors would like to thank the referees for their valuable comments which helped to improve the manuscript.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Gompertz, B. On the nature of the function expressive of the law of human mortality and on the new mode of determining the value of life contingencies. Philos. Trans. R. Soc. A 1825, 115, 513–580. [Google Scholar] [CrossRef]
  2. Tjørve, K.M.C.; Tjørve, E. The use of Gompertz models in growth analyses, and new Gompertz-model approach: An addition to the Unified-Richards family. PLoS ONE 2017, 12. [Google Scholar] [CrossRef]
  3. El-Gohary, A.; Alshamrani, A.; Al-Otaibi, A. The generalized Gompertz distribution. Appl. Math. Model. 2013, 37, 13–24. [Google Scholar] [CrossRef]
  4. Jafari, A.A.; Tahmasebi, S.; Alizadeh, M. The beta-Gompertz distribution. Revista Colombiana de Estadistica 2014, 37, 141–158. [Google Scholar] [CrossRef]
  5. Eugene, N.; Lee, C.; Famoye, F. Beta-normal distribution and its applications. Comm. Statist. Theory Methods 2002, 31, 497–512. [Google Scholar] [CrossRef]
  6. Benkhelifa, L. The beta generalized Gompertz distribution. Appl. Math. Model. 2017, 52, 341–357. [Google Scholar] [CrossRef]
  7. El-Damcese, M.A.; Mustafa, A.; El-Desouky, B.S.; Mustafa, M.E. Generalized Exponential Gompertz Distribution. Appl. Math. 2015, 6, 2340–2353. [Google Scholar] [CrossRef]
  8. Tahir, M.H.; Cordeiro, G.M.; Alizadeh, M.; Mansoor, M.; Zubair, M.; Hamedani, G.G. The Odd Generalized Exponential Family of Distributions with Applications. J. Stat. Distrib. Appl. 2015, 2, 1–28. [Google Scholar] [CrossRef]
  9. Roozegar, R.; Tahmasebi, S.; Jafari, A.A. The McDonald Gompertz distribution: Properties and applications. Commun. Stat. Simul. Comput. 2017, 46, 3341–3355. [Google Scholar] [CrossRef]
  10. Alexander, C.; Cordeiro, G.M.; Ortega, E.M.M.; Sarabia, J.M. Generalized beta-generated distributions. Comput. Stat. Data Anal. 2012, 56, 1880–1897. [Google Scholar] [CrossRef]
  11. Khan, M.S.; King, R.; Hudson, I.L. Transmuted Generalized Gompertz distribution with application. J. Stat. Theory Appl. 2017, 16, 65–80. [Google Scholar] [CrossRef]
  12. Moniem, A.I.B.; Seham, M. Transmuted Gompertz Distribution. Comput. Appl. Math. 2015, 1, 88–96. [Google Scholar]
  13. Shaw, W.; Buckley, I. The Alchemy of Probability Distributions: Beyond Gram-Charlier Expansions and a Skewkurtotic-Normal Distribution from a Rank Transmutation Map; Research Report; King’s College: London, UK, 2007. [Google Scholar]
  14. Chukwu, A.U.; Ogunde, A.A. On kumaraswamy gompertz makeham distribution. Am. J. Math. Stat. 2016, 6, 122–127. [Google Scholar]
  15. Lima, F.P.; Sanchez, J.D.; da Silva, R.C.; Cordeiro, G.M. The Kumaraswamy Gompertz distribution. J. Data Sci. 2015, 13, 241–260. [Google Scholar]
  16. Benkhelifa, L. Marshall-Olkin extended generalized Gompertz distribution. J. Data Sci. 2017, 15, 239–266. [Google Scholar]
  17. Yaghoobzadeh, S. A new generalization of the Marshall-Olkin Gompertz distribution. Int. J. Syst. Assur. Eng. Manag. 2017, 8, 1580–1587. [Google Scholar] [CrossRef]
  18. Marshall, A.W.; Olkin, I. A new method for adding a parameter to a family of distributions with application to the exponential and Weibull families. Biometrika 1997, 84, 641–652. [Google Scholar] [CrossRef]
  19. Shadrokh, A.; Yaghoobzadeh, S.S. The Beta Gompertz Geometric Distribution: Mathematical Properties and Applications. Andishe_ye Amari 2018, 22, 81–91. [Google Scholar]
  20. Nadarajah, S.; Teimouri, M.; Shih, S.H. Modified Beta Distributions. Sankhya Ser. B 2014, 76, 19–48. [Google Scholar] [CrossRef]
  21. Alizadeh, M.; Cordeiro, G.M.; Brito, E. The beta Marshall-Olkin family of distributions. J. Stat. Distrib. Appl. 2015, 2, 4. [Google Scholar] [CrossRef]
  22. Nadarajah, S.; Kotz, S. The beta exponential distribution. Reliab. Eng. Syst. Saf. 2006, 91, 689–697. [Google Scholar] [CrossRef]
  23. Gupta, R.D.; Kundu, D. Generalized Exponential Distributions. Aust. N. Z. J. Stat. 1999, 41, 173–188. [Google Scholar] [CrossRef]
  24. Kenney, J.F.; Keeping, E.S. Mathematics of Statistics, 3rd ed.; Chapman and Hall Ltd.: Rahway, NJ, USA, 1962. [Google Scholar]
  25. Moors, J.J. A quantile alternative for kurtosis. J. R. Stat. Soc. Ser. D 1988, 37, 25–32. [Google Scholar] [CrossRef]
  26. Milgram, M. The generalized integro-exponential function. Math. Comput. 1985, 44, 443–458. [Google Scholar] [CrossRef]
  27. Lenart, A. The moments of the Gompertz distribution and maximum likelihood estimation of its parameters. Scand. Actuar. J. 2014, 3, 255–277. [Google Scholar] [CrossRef]
  28. MacGillivray, H.L. Skewness and Asymmetry: Measures and Orderings. Ann. Stat. 1986, 14, 994–1011. [Google Scholar] [CrossRef]
  29. Gradshteyn, I.S.; Ryzhik, I.M. Table of Integrals, Series and Products; Academic Press: New York, NY, USA, 2000. [Google Scholar]
  30. El-Bassiouny, H.; EL-Damcese, M.; Mustafa, A.; Eliwa, M.S. Exponentiated Generalized Weibull-Gompertz Distribution with Application in Survival Analysis. J. Stat. Appl. Probab. 2017, 6, 7–16. [Google Scholar] [CrossRef]
  31. Xu, K.; Xie, M.; Tang, L.C.; Ho, S.L. Application of Neural Networks in forecasting Engine Systems Reliability. Appl. Soft Comput. 2003, 2, 255–268. [Google Scholar] [CrossRef]
  32. Badar, M.G.; Priest, A.M. Statistical aspects of fiber and bundle strength in hybrid composites. In Progress in Science and Engineering Composites, Proceedings of the ICCM-IV, Tokyo, Japan, 25–28 October 1982; Hayashi, T., Kawata, K., Umekawa, S., Eds.; Japan Society for Composite Materials: Tokyo, Japan, 1982; pp. 1129–1136. [Google Scholar]
Figure 1. Some plots of the pdf f ( x ) (a); and some plots for the hrf h ( x ) (b).
Figure 1. Some plots of the pdf f ( x ) (a); and some plots for the hrf h ( x ) (b).
Mathematics 07 00003 g001
Figure 2. PP, QQ, epdf and ecdf plots of the MBGz distribution for Dataset 1.
Figure 2. PP, QQ, epdf and ecdf plots of the MBGz distribution for Dataset 1.
Mathematics 07 00003 g002
Figure 3. PP, QQ, epdf and ecdf plots of the MBGz distribution for Dataset 2.
Figure 3. PP, QQ, epdf and ecdf plots of the MBGz distribution for Dataset 2.
Mathematics 07 00003 g003
Table 1. The MLEs and MSEs of MBGz distribution.
Table 1. The MLEs and MSEs of MBGz distribution.
nParametersInitialMLEMSEInitialMLEMSE
50a3.03.00240.50572.52.64240.1737
b1.51.64090.14991.51.52190.0400
c0.50.49410.00080.50.50500.0004
α 0.50.54220.01980.50.52910.0116
λ 0.50.52410.03870.50.52350.0122
100a3.03.07780.24582.52.50600.0754
b1.51.60830.07791.51.53730.0291
c0.50.49860.00030.50.49910.0003
α 0.50.55720.01470.50.51230.0029
λ 0.50.49260.01260.50.50350.0070
150a3.02.90410.10152.52.51250.0284
b1.51.61590.04851.51.52320.0088
c0.50.49400.00020.50.50020.0001
α 0.50.54770.00940.50.51370.0015
λ 0.50.46940.00720.50.49680.0015
50a1.51.47060.03251.51.54350.0641
b1.81.77640.06391.81.78380.0955
c0.50.50540.00131.51.52850.0203
α 0.50.48330.00290.50.48950.0008
λ 0.50.54880.01600.50.53640.0118
100a1.51.51380.02011.51.51940.0224
b1.81.81770.03801.81.83090.0451
c0.50.50040.00071.51.50100.0047
α 0.50.50070.00230.50.50110.0005
λ 0.50.51060.00590.50.50360.0028
150a1.51.53130.01021.51.46900.0094
b1.81.81520.01941.81.83960.0258
c0.50.50550.00031.51.48640.0017
α 0.50.51730.00220.50.50070.0004
λ 0.50.50440.00340.50.49430.0009
Table 2. MLEs (standard errors in parentheses) for Dataset 1.
Table 2. MLEs (standard errors in parentheses) for Dataset 1.
Distribution Estimates
MBGz ( λ , α , a , b , c )0.00852.55371.07371.31535.0687
(0.0067)(0.5727)(0.3197)(0.8933)(3.3003)
EGWGz ( λ , a , b , c , β )3.20782.45980.02031.89740.5460
(1.2099)(0.6498)(0.0531)(1.8193)(0.2430)
KwGz ( a , b , c , d , θ )0.18611.49481.49090.9811
(0.3130)(0.5076)(0.4735)(2.4368)
BGz ( a , b , θ , v )0.31441.55911.47980.4966
(0.4283)(0.3658)(0.4543)(0.8692)
Gz ( λ , α )0.08411.8811
(0.0268)(0.2043)
Table 3. MLEs (standard errors in parentheses) for Dataset 2.
Table 3. MLEs (standard errors in parentheses) for Dataset 2.
Distribution Estimates
MBGz ( λ , α , a , b , c )0.00980.52700.87684.56350.1561
(0.0116)(0.1599)(0.3893)(0.8862)(0.2442)
EGWGz ( λ , a , b , c , β )0.01010.60770.10781.69290.6613
(0.0141)(0.1506)(0.3427)(1.2539)(0.3379)
KwGz ( a , b , θ , v )0.01330.29232.016413.7085
(0.0120)(0.1641)(0.7880)(7.0208)
BGz ( a , b , c , d , θ )0.01250.18563.76222.0116
(0.0100)(0.1601)(2.5635)(3.3802)
Gz ( λ , α )0.0074)(0.6243)
(0.0035)(0.0748)
Table 4. The AIC, BIC, W*, A*, KS, and p-Value values for Dataset 1.
Table 4. The AIC, BIC, W*, A*, KS, and p-Value values for Dataset 1.
Dist ^ AICBICW*A*KSp-Value
MBGz50.0387110.0776118.24810.03280.27450.05390.9889
EGWGz52.6888115.3776126.54820.07060.53410.07850.7885
KwGz51.2042110.4084119.34480.05290.41250.06400.9396
BGz51.1518110.3026119.23990.05180.40570.06270.9484
Gz53.9686111.9374122.40560.08190.59210.08100.7547
Table 5. The AIC, BIC, W*, A*, KS, and p-Value values for Dataset 2.
Table 5. The AIC, BIC, W*, A*, KS, and p-Value values for Dataset 2.
Dist ^ AICBICW*A*KSp-Value
MBGz78.2184168.1770176.02140.02220.18400.07070.9888
EGWGz79.3744168.5489178.59330.04790.29220.08210.9623
KwGz80.7197169.4395176.19500.04300.33260.09660.8489
BGz82.9924173.9849180.74040.09220.67360.10800.7389
Gz80.9566168.9234177.29110.03590.23350.09030.8299

Share and Cite

MDPI and ACS Style

Elbatal, I.; Jamal, F.; Chesneau, C.; Elgarhy, M.; Alrajhi, S. The Modified Beta Gompertz Distribution: Theory and Applications. Mathematics 2019, 7, 3. https://doi.org/10.3390/math7010003

AMA Style

Elbatal I, Jamal F, Chesneau C, Elgarhy M, Alrajhi S. The Modified Beta Gompertz Distribution: Theory and Applications. Mathematics. 2019; 7(1):3. https://doi.org/10.3390/math7010003

Chicago/Turabian Style

Elbatal, Ibrahim, Farrukh Jamal, Christophe Chesneau, Mohammed Elgarhy, and Sharifah Alrajhi. 2019. "The Modified Beta Gompertz Distribution: Theory and Applications" Mathematics 7, no. 1: 3. https://doi.org/10.3390/math7010003

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop