Next Article in Journal
FruiTemp: Design, Implementation and Analysis for an Open-Source Temperature Logger Applied to Fruit Fly Host Experimentation
Previous Article in Journal
Semiconductor Laser Linewidth Theory Revisited
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

The Combined-Unified Hybrid Censored Samples from Pareto Distribution: Estimation and Properties

Department of Statistics and Operations Research, College of Science, King Saud University, P.O. Box 2455, Riyadh 11451, Saudi Arabia
*
Author to whom correspondence should be addressed.
The authors contributed equally in this work.
Appl. Sci. 2021, 11(13), 6000; https://doi.org/10.3390/app11136000
Submission received: 30 April 2021 / Revised: 17 June 2021 / Accepted: 24 June 2021 / Published: 28 June 2021
(This article belongs to the Topic Industrial Engineering and Management)

Abstract

:
In this paper, we use the combined-unified hybrid censoring samples to obtain the maximum likelihood estimates of the unknown parameters, survival, and hazard functions of Pareto distribution. Next, we discuss some efficiency criteria of the maximum likelihood estimators, including; the unbiasedness, consistency, and sufficiency. Additionally, we use MCMC to obtain the Bayesian estimates of the unknown parameters. In addition, we calculate the intervals estimation of the unknown parameters. Finally, we analyze a set of real data in view of the theoretical findings of the paper.

1. Introduction

Pareto distribution was introduced by Pareto [1] for the distribution of income. The importance of Pareto distribution lies in its applications in economics and reliability studies. Arnold [2] has given a wide historical aacount of Pareto distribution and its applications. Estimation and characteristics of Pareto distribution were investigated by many authors, among researchers, see for examples, Malik [3], Arnold and Press [4], Tiwari, Yang and Zalkikar [5], Abdel-Ghaly, Attia and Aly [6], Hossain and Zimmer [7], and Soliman [8]. Saldaña-Zepeda et al. [9] have proposed a goodness-of-fit test for Pareto distribution when the observations are Type-II right censoring. Wu [10] has constructed an interval estimation for Pareto distribution using a doubly Type-II censored sample. Recently, Han [11] has investigated the expected Bayesian estimation and its expected mean square error of Pareto distribution parameter under different loss functions and Poudyal [12] has investigated the truncated, censored, and actuarial payment-type moments of the robust fitting of a single parameter Pareto distribution.
A random variable X follows Pareto distribution P ( k , α ) if its probability density function (pdf) is given by
f ( x ) = α k α x α 1 , α > 0 , k > 0 , x k ,
with the corresponding cumulative distribution function (cdf) is given by
F ( x ) = 1 k x α , x k .
The reliability function R ( t ) and hazard function H ( t ) are given, respectively, as
R ( t ) = k t α and H ( t ) = α t .
Quite a few techniques exist to estimate the shape parameter of Pareto distribution. Both of Type-I and Type-II censored are extensively used in practice. Type-I and Type-II censoring schemes can be merged to get the hybrid censoring scheme which was first introduced by Epstein [13]. The hybrid censoring scheme becomes quite important in the reliability and life testing problems, see Fairbanks et al. [14], Draper and Guttman [15], Chen and Bhattacharya [16], Jeong et al. [17], Childs et al. [18], and Gupta and Kundu [19]. Balakrishnan and Kundu [20] have discussed theinferences based on Type-I and Type-II hybrid censoring schemes. Next, they have discussed some details on the generalized hybrid censoring and unified hybrid censoring schemes. Additionally, they have shown the adaption of the hybrid censoring schemes in competing risks set-up and in step-stress modeling. Jeon and Kang [21] have discussed the parameter estimation from half-logistic distribution by using multiply Type-II hybrid censoring. Nassar and Dobbah [22] have investigated the reliability characteristics of bathtub-shaped distribution under adaptive Type-I progressive hybrid censoring. Algarni, Almarashi, and Abd-Elmougoud [23] have discussion the joint Type-I generalized hybrid censoring for estimating the two Weibull distributions. Mohie El-Din, et al. [24] have distressed the estimation and prediction for Pareto distribution under Type-II progressive hybrid censoring scheme, while Çetinkaya [25] has drawn inference based on Type-II hybrid censored data from a Pareto distribution. Huang and Yang [26] have suggested a combined hybrid censoring sampling scheme (CHCS) as follows: Assume n experimental units are placed under a certain experiment and let X m : n and X l : n denote the failure time of the mth and lth units, respectively, such that ( m , l )   1 , 2 , , n , ( t 1 , t 2 )   ( 0 , ) , m < l , t 1 < t 2 and let t denote the termination time of the experiment. If the mth failure occurs before time t 1 , the experiment terminates at min { X l : n , t 1 } , when the mth failure occurs in the interval ( t 1 , t 2 ) , then the experiment is stopped at X m : n and finally when the mth failure occurs after time t 2 , the experiment is stooped at t 2 . For our later convenience, we abbreviate this scheme as combined CHCS ( m , l ; t 1 , t 2 ) . The system as devolved by Huang and Yang [26] includes six different cases, such that each case the data are unobservable as explained below:
t = X m : n , 0 < t 1 < X m : n < ( t 2 < X l : n ) , X m : n , 0 < t 1 < X m : n < ( X l : n < t 2 ) , t 2 , 0 < t 1 < t 2 < ( X m : n < X l : n ) , X l : n , 0 < X m : n < X l : n < ( t 1 < t 2 ) , t 1 , 0 < X m : n < t 1 < ( X l : n < t 2 ) , t 1 , 0 < X m : n < t 1 < ( t 2 < X l : n ) ,
where the unobservable data are marked by the parentheses.
Balakrishnan et al. [27] have proposed the unified hybrid censoring scheme (UHCS) that is for a certain m, l   1 , 2 , , n , ( t 1 , t 2 )   ( 0 , ) , m < l , t 1 < t 2 and t denote the experiment termination time. when the mth failure occurs before t 1 , then the experiment is terminated at m i n { m a x { X l : n , t 1 } , t 2 } , when mth failure occurs in the interval ( t 1 , t 2 ), then the experiment is terminated at m i n { X l : n , t 2 } and when the mth failure occurs after time t 2 , then the experiment is terminated at X m : n . The symbol UHCS ( m , l ; t 1 , t 2 ) is used for such this scheme. Similarly, each type of these hybrid censored samples includes different six cases such that in each case some part of sample are unobservable as given below
t = t 2 , 0 < t 1 < X m : n < t 2 < ( X l : n ) , X l : n , 0 < t 1 < X m : n < X l : n < ( t 2 ) , X m : n , 0 < t 1 < t 2 , < X m : n < ( X l : n ) , t 1 , 0 < X m : n < X l : n < t 1 < ( t 2 ) , X l : n , 0 < X m : n < t 1 < X l : n < ( t 2 ) , t 2 , 0 < X m : n < t 1 < t 2 < ( X l : n ) ,
where unobservable data are marked in the parentheses.
Emam and Sultan [28] have suggested a unified approach from CHCS ( m , l ; t 1 , t 2 ) , and UHCS ( m , l ; t 1 , t 2 ) known as the combined-unified hybrid censored scheme (C-UHCS ( m , l ; t 1 , t 2 ) ). They have applied the proposed censoring sampling to derive the Bayesian and non-Bayesian estimates from Dagum distribution. We belief no attempt has been made for estimating of the parameters of the Pareto distribution by using CHCS ( m , l ; t 1 , t 2 ) or UHCS ( m , l ; t 1 , t 2 ) , so, we apply C-UHCS ( m , l ; t 1 , t 2 ) to Pareto distribution. In this paper, we apply the combined-unified hybrid censored scheme to derive the estimates from Pareto distribution. We consider the maximum likelihood estimator of the parameters of Pareto distribution based on three cases: (i) the location parameter k when the shape parameter α is known; (ii) the shape parameters α when the location parameter k is unknown; and (iii) when the location and shape parameters are unknown. In addition, we state and prove four theorems discuss the efficiency of these estimators based on unbiasedness, consistency, and sufficiency. The remainder of this paper is structured as follows: in Section 2, we present the likelihood function of C-UHCS, in Section 3, we derive the maximum likelihood estimates of the unknown parameters in three different three cases and use them to construct the asymptotic confidence intervals (CI) for both of k and α . Next, in Section 4, we obtain the Bayes estimates of k and α under the squared error loss function using MCMC. In Section 5, we analyze a real dataset in using the theoretical findings of the paper. Finally, in Section 6, we draw a brief conclusion.

2. Likelihood Function of C-UHCS

Consider X 1 : n , X 2 : n , X r : n are the lifetimes of units that placed on a life-test, and let cumulative distribution distribution (cdf) F ( x ) and probability density distribution (pdf) f ( x ) and assume that, for any case, the experiment is terminated at t that may refer to time t 1 , t 2 , observation x m : n or observation x l : n , and let r denote the maximum number of failures until t equal, respectively, D 1 , D 2 , m and l. Emam and Sultan [28] have proposed the likelihood function under the censoring samples C-UHCS ( m , l ; t 1 , t 2 ) under different choices of r, t and x r : n = ( x 1 : n , x 2 : n , . . . , x r : n ) as
L ( Ω | x r : n ) = n ! ( n r ) ! [ 1 F ( t ) ] n r i = 1 r f ( x i : n ) ,
where r and t can be chosen in the different cases of censoring as:
L ( C ) Ω | x L ( U ) Ω | x
Casesrtrt
1 : 0 < t 1 < X m : n < t 2 < X l : n m X m : n D 2 t 2
2 : 0 < t 1 < X m : n < X l : n < t 2 m X m : n l X l : n
3 : 0 < t 1 < t 2 < X m : n < X l : n D 2 t 2 m X m : n
4 : 0 < X m : n < X l : n < t 1 < t 2 l X l : n D 1 t 1
5 : 0 < X m : n < t 1 < X l : n < t 2 D 1 t 1 l X l : n
6 : 0 < X m : n < t 1 < t 2 < X l : n D 1 t 1 D 2 t 2
L ( C ) Ω | x is the likelihood corresponding to CHCS ( m , l ; t 1 , t 2 ) given by Huang and Yang [26] as
L ( C ) Ω | x = n ! ( n m ) ! 1 F ( x m : n ) n m i = 1 m f ( x i : n ) ; D 1 = 0 , , m 1 , D 2 = m , n ! ( n D 2 ) ! 1 F ( t 2 ) n D 2 i = 1 D 2 f ( x i : n ) ; D 1 , D 2 = 0 , , m 1 , n ! ( n l ) ! 1 F ( x l : n ) n l i = 1 l f ( x i : n ) ; D 1 = D 2 = l , n ! ( n D 1 ) ! 1 F ( t 1 ) n D 1 i = 1 D 1 f ( x i : n ) ; D 1 = D 2 = m , , l 1 ,
and L ( U ) Ω | x is the likelihood corresponding to UHCS ( m , l ; t 1 , t 2 ) given by Balakrishnan et al. [27] as
L ( U ) Ω | x = n ! ( n D ) ! 1 F ( t 1 ) n D i = 1 m f ( x i : n ) ; D 1 = D 2 = D = l , , n , n ! ( n l ) ! 1 F ( x l : n ) n r i = 1 l f ( x i : n ) ; D 1 = m , , l 1 , D 2 = l , n ! ( n D 2 ) ! 1 F ( t 2 ) n D 2 i = 1 D 2 f ( x i ) ; D 1 , D 2 = m , , l 1 , n ! ( n l ) ! 1 F ( x l : n ) n l i = 1 l f ( x i : n ) ; D 1 = 0 , , m 1 , D 2 = l , n ! ( n D 2 ) ! 1 F ( t 2 ) n D 2 i = 1 D 2 f ( x i : n ) ; D 1 = 0 . . m 1 , D 2 = m . . . l 1 , n ! ( n m ) ! 1 F ( x m : n ) n m i = 1 m f ( x i : n ) ; D 1 , D 2 = 0 , , m 1 .

3. The Maximum Likelihood Estimates

Let X 1 : n , X 2 : n , X r : n be the C-UHCS ( m , l ; t 1 , t 2 ) from the Pareto distribution given in (1). The likelihood function given in (8) in this case may written as
L = n ! ( n r ) ! k t α n r i = 1 r α x i : n k x i : n α , k x 1 : n x 2 : n t ,
and hence
log L = log n ! ( n r ) ! i = 1 r x i : n + r log α + n α log k α log t n r i = 1 r x i : n .
From (9), we consider the following cases:

3.1. Case 1: α Is Known

The maximum likelihood estimate (MLE) of the parameter k is given by
k ˜ = x 1 : n .
From (11), it is easy to show that the X 1 : n P ( k , n α ) , hence
E ( k ˜ ) = n α n α 1 k , V a r ( k ˜ ) = n α k 2 ( n α 2 ) ( n α 1 ) 2 , and M S E ( k ˜ ) = 2 k 2 ( n α 2 ) ( n α 1 ) ,
this shows that, the estimator in (11) is consistent sufficient estimator of k, while the estimator
k ^ = n α 1 n α x 1 : n ,
is consistent sufficient unbiased estimate of k and more efficient than k ˜ , since
V a r ( k ^ ) = k 2 n α ( n α 2 ) < V a r ( k ˜ ) , n α > 2 .
The corresponding MLEs of the reliability and hazard functions are given, respectively, by
R ^ ( t ) = n α 1 n α t x 1 : n α and H ^ ( t ) = α t .
In order to construct a confidence estimation for k in this case, we consider the pivotal quantity
W 1 = k ^ μ k ^ σ k ^ = n α ( n α 2 ) n α 1 n α k x 1 : n 1 , n α > 2 .
Again, it is easy to show that the distribution of W 1 given in (16) follows Pareto distribution of the second kind with shape parameter α * = n α , location parameter θ * = n α ( n α 2 ) + ( n α 1 ) n α 2 n α n α , and scale parameter k * = ( n α 1 ) n α 2 n α n α . Thus F w 1 ( w ) can be written as
F W 1 ( w ) = 1 ( n α 1 ) n α 2 n α w + n α ( n α 2 ) n α , w > n α ( n α 2 ) + ( n α 1 ) n α 2 n α n α ,
and then the ( 1 γ ) 100 % confidence interval for k is constructed by
( γ 2 ) 1 n α X 1 : n , ( 1 γ 2 ) 1 n α X 1 : n .
The mean length of the confidence interval given in (17) is n α k n α 1 ( 1 γ / 2 ) 1 n α ( γ / 2 ) 1 n α which approaches to zero as n .

3.2. Case 2: k Is Known

The MLE of α can be obtained from (10) as
α ˜ = r log y 1 ,
where y 1 = t k n r i = 1 r x i : n k . From (18), we can see that α ˜ is sufficient statistics of α . The mean and variance of the MLE of α and 1 α can be derived in Theorem 1 below.
Theorem 1.
If X 1 : n , X 2 : n , X r : n be the C-UHCS ( m , l ; t 1 , t 2 ) from Pareto distribution given in (1), then the consistent sufficient MVUE of 1 α is 1 α ˜ , where α ˜ is a biased consistent sufficient statistic of α and for r > 2 , we have
E ( α ˜ ) = r r 1 α , V a r ( α ˜ ) = r 2 α 2 ( r 2 ) ( r 1 ) 2 a n d   M S E ( α ˜ ) = ( r + 2 ) α 2 ( r 1 ) ( r 2 ) ,
and
E ( 1 α ˜ ) = 1 α , V a r ( 1 α ˜ ) = 1 r α 2 .
Proof. 
See Appendix A. □
The unbiased estimate of α is given by
α ^ = r 1 r α ˜ = r 1 log t k n r i = 1 r x i : n k ,
and hence
V a r ( α ^ ) = α 2 r 2 < V a r ( α ˜ ) , r > 2 ,
this shows that α ^ is an unbiased consistent sufficient estimate for α .

3.3. Case 3: k and α Are Unknown

The MLEs of k , α , R ( t ) and H ( t ) can be derived, respectively, by
k ˜ = x 1 : n , α ˜ = r log ( y 2 ) , R ˜ ( t ) = e x p r log x 1 : n t log y 2 and H ˜ ( t ) = r t log y 2 ,
where
y 2 = t x 1 : n n r i = 2 r x i : n .
From (21), we see that ( k ˜ , α ˜ ) are jointly sufficient statistics for ( k , α ) . The following theorem states the mean and variance of the MLE of 1 α ˜ in this case.
Theorem 2.
If X 1 : n , X 2 : n , X r : n be the C-UHCS ( m , l ; t 1 , t 2 ) from Pareto distribution given in (1). For the biased estimate of 1 α , we have
E 1 α ˜ = r 1 r α , V a r ( 1 α ˜ ) = r 1 r 2 α 2 , a n d   M S E ( 1 α ˜ ) = 1 r α 2 ,
while log [ y 2 ] / ( r 1 ) is an unbiased estimate of 1 / α with variance 1 / ( ( r 1 ) α 2 ) and relative efficiency 1 1 / r .
Proof. 
See Appendix B. □
Theorem 3.
If X 1 : n , X 2 : n , X r : n be the C-UHCS ( m , l ; t 1 , t 2 ) from Pareto distribution given in (1). For the biased estimate of α, we have
E ( α ˜ ) = r r 2 α , r > 2 ,
V a r ( α ˜ ) = r 2 α 2 ( r 3 ) ( r 2 ) 2 , r > 3 ,
M S E ( α ˜ ) = r 2 + 2 r 6 ( r 3 ) ( r 2 ) 2 α 2 , r > 3 , .
and for the unbiased estimate of α, we have
α ^ = r 2 r α ˜ = r 2 log y 2 , V a r ( α ^ ) = α 2 r 3 , r > 3   a n d   R E ( α ^ ) = r 3 r .
Proof. 
See Appendix C. □
Theorem 4.
If X 1 : n , X 2 : n , X r : n be the C-UHCS ( m , l ; t 1 , t 2 ) from Pareto distribution given in (1). Then the unbiased MLE of k of Pareto distribution is
1 log y 2 n ( r 1 ) X 1 : n .
Proof. 
See Appendix D. □
Remark 1.
The MLEs of Pareto parameters and their properties based on complete sample given in Baxter [29] can be easily derived from our results in the cases 1, 2, and 3 by setting r = n .
Now, we apply the normality appromimation of the MLEs to obtain the appromimate confidence intervals for k and α . The variance-covariance matrix of the parameters V ^ = [ σ i , j ] , i , j = 1 , 2 can be witten as
V ( k ^ , α ^ ) = 2 log [ L ] k 2 2 log [ L ] k α 2 log [ L ] α k 2 log [ L ] α 2 ( k = k ^ , α = α ^ ) 1 ,
where the elements of the observed variance-covariance matrix can be derived from (10) as
2 log [ L ] k 2 = n α k 2 , 2 log [ L ] α 2 = r α 2 , 2 log [ L ] α k = n k ,
and hence the minimum variance bound of the MLEs of α and 1 α are given, respectively, by α 2 r and 1 r α 2 . V ( k , α ) takes the form
V ( k , α ) = k 2 α n ( r n α ) r α 2 k n k n n α k 2 , k ^ = 1 log y 2 n ( r 1 ) x 1 : n and α ^ = r 2 log y 2 ,
then the 100 ( 1 τ ) % confidence intervals for the parameters k and α are given by
k ^ z τ / 2 V ( k ^ ) , k ^ + z τ / 2 V ( k ^ )   a n d   α ^ z τ / 2 V α ^ , α ^ + z τ / 2 V α ^ ,
where V ( k ^ ) and V ( α ^ ) are the estimated variances of k ^ and α ^ , which are given by the diagonal elements of V ( k ^ , α ^ ) , and z τ / 2 is the upper τ / 2 percentile of standard normal distribution, where τ / 2 = z τ / 2 1 2 π e z 2 / 2 d z . The delta method was used for derive approximate confidence intervals for R ( t ) and H ( t ) as
Ψ 1 = R ( t ) k R ( t ) α ( k = k ^ , α = α ^ ) , and Ψ 2 = H ( t ) k H ( t ) α ( k = k ^ , α = α ^ ) ,
where
R ( t ) k = k t α 1 α t , R ( t ) α = k t α log k t , H ( t ) k = 1 t , and H ( t ) α = 0 .
Then, the approximate estimates of V ( R ^ ( t ) ) and V ( H ^ ( t ) ) are given, respectively, by
V ( R ^ ( t ) ) Ψ 1 V k ^ , α ^ Ψ 1 t ,   a n d   V H ^ t Ψ 2 V k ^ , α ^ Ψ 2 t ,
where Ψ t is the transpose of Ψ and
R ^ ( t ) ± z τ / 2 V ( R ^ ( t ) )   a n d   H ^ ( t ) ± z τ / 2 V ( H ^ ( t ) .

4. Bayesian Estimation: MCMC Method

In the Bayesian approach, the risk functions are chosen depending on how one measures the distance between the estimate and the unknown parameter. To perform the Bayesian analysis, usually we use loss the squared error (SE) loss function as
L S E ( g ( φ ) , g ^ ( φ ) ) = ( g ( φ ) g ^ ( φ ) ) 2 ,
where g ^ ( φ ) is an estimate of g ( φ ) and the Bayes estimate of g ( φ ) using the SE loss function is given by
g ^ S E ( φ ) = E φ g ( φ ) | x r : n .
In this section, we use the Metropolis Hastings algorithm within Gibbs sampling approach for generating random samples from the conditional densities of the parameters and use them to get the Bayian estimates and interval (HPD credible intervals) estimates of the unknown parameters. The unknown parameters k and α are assigned independent gamma distributions. Then, the joint prior distribution for k and α , is given by
π ( k , α ) = b a Γ ( a ) k a 1 exp b k * d c Γ ( c ) α c 1 exp d α , a , b , c , d > 0 .
Then, the posterior distributrion of k and α , is given by
π * ( k , α ) = L π ( k , α ) α k L π ( k , α ) k α = α r k n α T n r i = 1 r x i : n α k a 1 exp b k α c 1 exp d α α k α r k n α T n r i = 1 r x i : n α k a 1 exp b k α c 1 exp d α k α .
In the following algorithm, we apply Metropolis Hastings (M-H) technique with normal proposal distribution for generainge samples from these distributions.
  • Start with initial values of the parameters ( k ( 0 ) , α ( 0 ) ). Then, set i = 1 ;
  • Generate k ( * ) , α ( * ) using the proposal distributions N ( k ( i 1 ) , V a r ( k ^ ) ) and N ( α ( i 1 ) , V a r ( α ^ ) ) , respectively, such that V a r ( k ^ ) and V a r ( α ^ ) are given in (3.21);
  • Compute the acceptance probability r = m i n 1 , π * ( k ( * ) , α ( * ) ) π * ( k ( i 1 ) , α ( i 1 ) ) ;
  • Generate U from uniform (0, 1);
  • Accept the proposal distribution and set k ( i ) , α ( i ) = k ( * ) , α ( * ) if U < r . Otherwise, reject the proposal distribution and set k ( i ) , α ( i ) = k ( i 1 ) , α ( i 1 ) ;
  • Set i = i + 1 ;
  • Repeat Steps 2–6, M times, and obtain k ( i ) and α ( i ) for i = 1 , . . . , M .
By using the generated random samples from the Gibbs sampling procedure with N unburn units, then Bayes estimate of the parameters using the squared error loss functions are
k ^ = 1 M N i = N + 1 M k ( i ) , α ^ = 1 M N i = N + 1 M α ( i ) ,
R ^ ( t ) = 1 M N i = N + 1 M k ( i ) t α ( i ) , and H ^ ( t ) = 1 M N i = N + 1 M α ( i ) t .
MCMC HPD credible interval Algorithm:
  • Arrange the values of k ( * ) , α ( * ) , R ( * ) and H ( * ) in increasing magnitude;
  • Find the positions of the lower bounds which is ( M N ) * q / 2 , where q is the significance, then determine the lower bounds of k , α , R and H;
  • Find the positions of the upper bounds which is ( M N ) * ( 1 q / 2 ) , then determine the upper bounds of k , α , R and H;
  • Repeat the above steps M times. Find the average value of the lower and upper bounds MCMC HPD credible interval of k , α , R and H.

5. Data Analysis

In this section, we apply the proposed MLEs and the Bayesian estimates to analyze a set of real data distributed as Pareto distribution given by Nigm and Hamdy [30] and Wong [31]. The data represents the first 10 observations of sample size n = 15 businesses as: 1.01, 1.05, 1.08, 1.14, 1.28, 1.30, 1.33, 1.43, 1.59, 1.62.
The calculations are carried out through the steps below:
  • Assume different censoring schemes from our model C-UHCS ( m , l ; t 1 , t 2 ) .
  • Calculate the MLEs estimations of k , α , R ( t )   a n d   H ( t ) at the termination time T.
  • Calculate the Bayesian estimations of k , α , R ( t )   a n d   H ( t ) at the termination time T by MCMC (with 100,000 repetitions and 20,000 burns).
  • For the Bayesian analysis, we select the values of the hyper-parameter a, b, c, and d as:
    (a)
    Calculate the MLEs of k and α using (3.23) when n = 15 , r = 10 and t = x 10 : 15 = 1.62 as k ^ = 0.983433 and α ^ = 3.80173 .
    (b)
    Assume k g a m m a ( a , b ) and α g a m m a ( c , d ) , hence solve the two equations (the mean and variance of gamma distribution) a b = k ^ and a b 2 = 0.01 to get a = 96 and b = 0.01
    (c)
    Repeat step (b) for c and d to get c = 1445 and d = 0.0026 .
  • The corresponding variances of the point estimates are calculated.
  • The 95% and 90% interval estimation using of the unknown parameters, as well as the reliability and hazard function, are calculated.
  • The numerical results are displayed in Table 1.
    From Table 1, we see that
    (i)
    The estimates of Type-I and Type-II censoring are very close as both of T and X r : n become very closed to each other;
    (ii)
    In the most cases, the standard deviation of the Bayesian estimate is smaller than the MLE;
    (iii)
    In the most cases, the interval width of the Bayesian estimate is shorter than the MLE at the some confidence level;
    (iv)
    In general the model C-UHCS enables us to have flexible way for selecting the censoring schemes.
The second rows represent the standard deviation of the point estimates and width of the interval estimates.

6. Conclusions

In this paper, MLEs and the Bayesian estimates of the unknown parameters from Pareto distribution under C-UHCS ( m , l ; t 1 , t 2 ) model are obtained. Some efficiency properties of the MLEs estimate are discussed, including, the unbiasedness, minimum variance unbiased estimate and sufficiency. MCMC technique is used to carry out the Bayesian estimates. The interval estimates are developed using the observed Fisher information matrix and MCMC. The theoretical findings of the paper are applied to analyze a real dataset under different choices of the censoring. The numerical results show the efficiency performance of the proposed model. Finally, the proposed censoring model has flexible futures to switch between Type-I and Type-II censoring according the experiments need.

Author Contributions

Conceptualization, K.S.S., W.E.; methodology, K.S.S., W.E.; software, W.E.; validation and formal analysis, K.S.S.; resources, K.S.S., W.E.; writing—original draft preparation, W.E.; writing—review and editing, K.S.S., W.E. Both authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by Deanship of Scientific Research at King Saud University grant number (RG -1435-056).

Acknowledgments

The authors would like to thank the referees for their helpful comments which improved the presentation of the paper. The authors would also like to extend their sincere appreciation to the Deanship of Scientific Research at King Saud University for funding this Research group (RG-1435-056).

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A. Proof of Theorem 1

Proof. By using the well known relation
1 = k x 1 x 1 x 2 x r 1 L d x = A α r y 1 α d x ,
where
A = n ! ( n r ) ! 1 i = 1 r x i : n .
From (A1), we may write
A y 1 α d x = α r ,
differentiating both sides of (A2) with respect to α , we have
A y 1 α log y 1 d x = r α r 1 ,
A y 1 α log y 1 2 d x = r ( r + 1 ) α r 2 .
Integrating both sides of (A2) with respect to α , we have
A y 1 α 1 log y 1 d x = α r + 1 r 1 + c 1 , r > 1 ,
A y 1 α 1 ( log y 1 ) 2 d x = α r + 2 ( r 1 ) ( r 2 ) + c 1 α + c 2 , r > 2 .
It is easy to show form (A5) and (A6) that
lim α y 1 α = lim α α r + 1 = lim α α r + 2 = 0 , t h e n c 1 = c 2 = 0 .
Now, from (A3), we have
A α r y 1 α log y 1 d x = E log y 1 = r α .
Similarly, from (A4)–(A6), we have
E log y 1 2 = r ( r + 1 ) α 2 , E 1 log y 1 = α ( r 1 ) a n d E 1 log y 1 2 = α ( r 1 ) ( r 2 ) ,
and hence Theorem 1 is proved. □

Appendix B. Proof of Theorem 2

Proof. Let Y = log X 1 : n k n E x p o n e n t i a l ( 1 α ) , then
E ( Y m ) = m ! α m ,
E log ( y 2 ) = E [ log ( y 1 ) Y ] = r 1 α , E [ log ( y 2 ) ] 2 = E [ log ( y 1 ) ] 2 + E [ Y 2 ] 2 E [ Y log ( y 1 ) ]
= r ( r + 1 ) α 2 + 2 α 2 2 E [ log Y log y 1 ] ,
and let
E [ Y log y 1 ] = g ( α ) t h e n α r g ( α ) = A y 1 α y log y 1 d x ,
by integrating over α , we get
α r α r g ( α ) d α = E [ Y ] = 1 α ,
differentiating with respect to α , we have
g ( α ) = r + 1 α 2 ,
hence
E [ log ( y 2 ) ] 2 = r ( r 1 ) α 2 , a n d V a r ( log y 2 ) ) = r 1 α 2 .
Then
E 1 α ˜ = r 1 r α , V a r 1 α ˜ = r 1 r 2 α 2 a n d M S E 1 α ˜ = 1 r α 2 .
Hence Theorem 2 is proved. □

Appendix C. Proof of Theorem 3

Proof. Consider the relation
A α r y 1 α y j d x = E ( Y j ) = j ! α j ,
then
A y 1 α y j d x = j ! α ( r + j ) .
Integrating both sides of (A14) ( j + 1 ) times over α , we get
A y 1 α y j ( log y 1 ) j d x = j ! α ( r 1 ) ( r 1 ) r ( r + 1 ) ( r + j 1 ) ) ,
then
A α r y 1 α y j ( log y 1 ) j + 1 d x = E Y j log y 1 j + 1 = j ! ( r 2 ) ! α ( r + j 1 ) ! = α B ( j + 1 , r 1 ) ,
where B ( j + 1 , r 1 ) is the beta function defined as B ( j + 1 , r 1 ) = 0 1 t j ( 1 t ) r 2 d t .
Similarly, integrating both sides of (A14) ( j + 2 ) times over α , we have
E y j ( log y 1 ) j + 2 = B ( j + 2 , r 2 ) α 2 j + 1 .
Now to find the mean and variance of α ˜ , we get
E 1 log y 2 = E 1 log y 1 Y = E 1 log y 1 1 Y log y 1 1 = α i = 0 B ( j + 1 , r 1 ) ,
using the the following relation k = 0 B ( x , y + k ) = B ( x 1 , y ) , see Gradshtegn and Ryzhik [32], we have
E 1 log y 2 = α B ( r 1 , 1 ) = α r 2 ,
and
E 1 log y 2 2 = j = 0 ( j + 1 ) E Y j ( log y 1 ) j + 1 = α 2 j = 0 B ( j + 2 , r + 2 ) = α 2 ( r 2 ) ( r 3 ) .
Then
E ( α ˜ ) = E r log y 2 = r α r 2 , V a r ( α ˜ ) = r 2 α 2 ( r 3 ) ( r 2 ) 2 , M S E ( α ˜ ) = ( r 2 + 2 r 6 ) α 2 ( r 3 ) ( r 2 ) 2 .
For the unbiased estimator of α , we consider
α ^ = r 2 r α ˜ = r 2 log y 2 ,
then
V a r ( α ^ ) = α 2 r 3 < V ( α ˜ ) a n d R E ( α ˜ ) = r 3 r

Appendix D. Proof of Theorem 4

Proof. To find E ( X 1 : n log y 2 ) , we use the relation
A α r y 1 α x 1 : n d x 1 : n = E ( X 1 : n ) = n α n α 1 k .
Then
A y 1 α x 1 : n d x = n α α r n α 1 k .
By differentiating both sides of (A20) with respect to α , we have
A y 1 α ( log y 1 ) x 1 : n d x = d d α α r n α k n α 1 = n k α r ( n α r r + 1 ) ( n α 1 ) 2 ,
and
A α r y 1 α ( log y 2 + y ) x 1 : n d x = n k ( n α r r + 1 ) ( n α 1 ) 2 = E log y 2 x 1 : n + E ( Y x 1 : n ) = E ( log y 2 x 1 : n ) + n n α 2 ( n α 1 ) 2 .
Then
E ( log y 2 x 1 : n ) = n ( r 1 ) n α 1 k ,
hence
E x 1 : n 1 log y 2 n ( r 1 ) = k ,
his shows that x 1 : n 1 log y 2 n ( r 1 ) is the unbiased estimate of k. □

References

  1. Pareto, V. Cours d’ Economie Politique; Rougem: Paris, France, 1897. [Google Scholar]
  2. Arnold, B.C.; Press, S.J. Bayesian inference for Pareto distribution. J. Econ. 1983, 21, 287–306. [Google Scholar] [CrossRef]
  3. Malik, H.J. Estimation of parameter of Pareto distribution. Metrika 1970, 15, 126–132. [Google Scholar] [CrossRef]
  4. Arnold, B.C.; Press, S.J. Bayesian estimation and prediction for Pareto data. J. Am. Stat. Assoc. 1989, 84, 1079–1084. [Google Scholar] [CrossRef]
  5. Tiwari, R.C.; Yang, Y.; Zalkikar, J.N. Bayes estimation of the Pareto failure-model using Gibbs sampling. IEEE. Trans. Reliab. 1996, R-45, 471–476. [Google Scholar] [CrossRef]
  6. Abdel-Ghaly, A.A.; Attia, A.F.; Aly, H.M. Estimation of the parameters of Paretro distribution and reliability function using accelerated life testing with censoring. Commun. Statist. Theor. Meth. 1998, 27, 469–484. [Google Scholar]
  7. Hossain, A.M.; Zimmer, W.J. Comparisons of methods of estimation for Pareto distribution of the first kind. Commun. Statist. Theor. Meth. 2000, 29, 859–878. [Google Scholar] [CrossRef]
  8. Soliman, A. Bayes prediction in a Pareto lifetime model with random sample size. Statistician 2000, 49, 51–62. [Google Scholar] [CrossRef]
  9. Saldaña-Zepeda, D.P.; Vaquera-Huerta, H.; Arnold, B.C. A goodness of fit test for the Pareto distribution in the presence of Type-II censoring, based on the cumulative hazard function. Comput. Stat. Data Anal. 2010, 4, 833–842. [Google Scholar] [CrossRef]
  10. Wu, S. Interval estimation for a Pareto distribution based on a doubly Type-II censored sample. Comput. Stat. Data Anal. 2008, 52, 3779–3788. [Google Scholar] [CrossRef]
  11. Han, M. The E-Bayesian estimation and its E-MSE of Pareto distribution parameter under different loss functions. J. Stat. Comput. Simul. 2020, 90, 1834–1848. [Google Scholar] [CrossRef]
  12. Poudyal, C. Truncated, censored, and actuarial payment-type moments for robust fitting of a single-parameter Pareto distribution. J. Comput. Appl. Math. 2021, 388, 113310. [Google Scholar] [CrossRef]
  13. Epstein, B. Truncated life tests in the exponential case. Ann. Math. Stat. 1954, 25, 555–564. [Google Scholar] [CrossRef]
  14. Fairbanks, K.; Madson, R.; Dykstra, R. A confidence interval for an exponential parameter from a hybrid life test. J. Am. Stat. Assoc. 1982, 77, 137–140. [Google Scholar] [CrossRef]
  15. Draper, N.; Guttman, I. Bayesian analysis of hybrid life tests with exponential failure times. Ann. Inst. Stat. Math. 1987, 39, 219–225. [Google Scholar] [CrossRef]
  16. Chen, S.; Bhattacharya, G.K. Exact confidence bounds for an exponential parameter under hybrid censoring. Commun. Stat. Theory Methods 1988, 17, 1857–1870. [Google Scholar] [CrossRef]
  17. Jeong, H.S.; Park, J.I.; Yum, B.J. Development of (r,T) hybrid sampling plans for exponential lifetime distributions. J. Appl. Stat. 1996, 23, 601–607. [Google Scholar] [CrossRef]
  18. Childs, A.; Chandrasekhar, B.; Balakrishnan, N.; Kundu, D. Exact likelihood inference based on type-I and type-II hybrid censored samples from the exponential distribution. Ann. Inst. Stat. Math. 2003, 55, 319–330. [Google Scholar] [CrossRef]
  19. Gupta, R.D.; Kundu, D. On the comparison of Fisher information matrices of the Weibull and generalized exponential distributions. J. Stat. Plann. Infer. 2006, 136, 3130–3144. [Google Scholar] [CrossRef]
  20. Balakrishnan, N.; Kundu, D. Hybrid censoring: Models, inferential results and applications. In Computational Statistics and Data Analysis; Elsevier: Amsterdam, The Netherlands, 2013; Volume 57, pp. 166–209. [Google Scholar]
  21. Kang, S.B.; Park, Y.K. Estimation for the half-logistic distribution based on multiply Type-II hybrid censoring. Phys. Stat. Mech. Appl. 2020, 550, 124501. [Google Scholar]
  22. Nassar, M.; Dobbah, S.A. Analysis of reliability characteristics of bathtub-shaped distribution under adaptive Type-I progressive hybrid censoring. IEEE Access 2020, 8, 181796–181806. [Google Scholar] [CrossRef]
  23. Algarni, A.; Almarashi, A.; Abd-Elmougoud, G.A. Joint Type-I generalized hybrid censoring for estimation two Weibull distributions. J. Inf. Sci. Eng. 2020, 36, 1243–1260. [Google Scholar]
  24. Mohie El-Din, M.M.; Sadek, A.; Mohie El-Din, M.M.; Nagy, M. Estimation and prediction for Pareto distribution under Type-II progressive hybrid censoring scheme. Int. J. Comput. Appl. 2016, 155, 9–15. [Google Scholar]
  25. Çetinkaya, Ç. Inference based on Type-II hybrid censored data from a Pareto distribution. J. Reliab. Stat. Stud. 2020, 13, 253–264. [Google Scholar]
  26. Huang, W.T.; Yang, K.C. A new hybrid censoring scheme and some of its properties. Tamsui Oxf. J. Math. Sci. 2010, 26, 355–367. [Google Scholar]
  27. Balakrishnan, N.; Rasouli, A.; Farsipour, N.S. Exact likelihood inference based on an unified hybrid censored sample from the exponential distribution. J. Stat. Comput. Simul. 2008, 78, 475–488. [Google Scholar] [CrossRef]
  28. Emam, W.; Sultan, K.S. Bayesian and maximum likelihood estimations of the Dagum parameters under combined-unified hybrid censoring. Math. Biosci. Eng. 2021, 18, 2930–2951. [Google Scholar] [CrossRef]
  29. Baxter, M. Minimum variance unbiased estimation of the Pareto distribution. Metrika 1980, 27, 133–138. [Google Scholar] [CrossRef]
  30. Nigm, A.M.; Hamdy, H.I. Bayesian prediction bounds for the Pareto lifetime model. Commun. Stat. Theory Methods 1987, 16, 1761–1772. [Google Scholar] [CrossRef]
  31. Wong, A. Approximate studentization for Pareto distribution with application to censored data. Stat. Pap. 1998, 39, 189–201. [Google Scholar] [CrossRef]
  32. Gradshtegn, I.S.; Ryzhik, I.M. Tables of Integrals, Series and Products; Academic Press: London, UK, 1980. [Google Scholar]
Table 1. Estimation of the unknown parameters, reliability and hazard functions under different censoring choices.
Table 1. Estimation of the unknown parameters, reliability and hazard functions under different censoring choices.
T Point EstimationInterval Estimation
MLEBayesMLEBayes
95%90%95%90%
X m = 5 , 15 = 1.28 k0.9761.0420.818 1.1340.843 1.1090.963 1.0750.980 1.074
0.0810.0660.3170.2670.1120.094
α 2.9622.9631.792 4.1321.977 3.9472.867 3.0572.872 3.052
0.5970.8412.3401.9700.1900.180
R ( . ) 0.2230.2710.000 0.4850.002 0.4440.213 0.3030.225 0.301
0.1340.8410.4850.3370.0900.076
H ( . ) 1.8281.8290.845 2.8121.001 2.6561.770 1.8871.773 1.884
0.5020.8411.4441.2160.1170.111
X l = 7 , 15 = 1.33 k0.9831.0560.831 1.1350.855 1.1110.991 1.0821.007 1.082
0.0770.0770.3030.2550.0910.075
α 3.7303.7322.402 5.0582.612 4.8483.635 3.8253.640 3.820
0.6780.0912.6562.2360.1900.180
R ( . ) 0.1550.2030.025 0.2850.046 0.2650.159 0.2270.169 0.225
0.0660.0910.3090.2600.0680.056
H ( . ) 2.3022.3031.175 3.4301.353 3.2512.244 2.3612.247 2.358
0.5750.0911.6401.3800.1170.111
X l = 9 , 15 = 1.59 k0.9841.0590.813 1.1550.840 1.1280.998 1.0831.012 1.083
0.0870.0790.3410.2870.0850.071
α 3.8693.8722.477 5.2602.697 5.0403.774 3.9643.780 3.959
0.7100.0912.7832.3430.1900.180
R ( . ) 0.1450.1930.042 0.2490.058 0.2330.153 0.2150.161 0.213
0.0530.0910.2910.2450.0620.052
H ( . ) 2.3882.3901.175 3.6011.367 3.4102.330 2.4472.333 2.444
0.6190.0911.7181.4460.1170.111
t 1 = 1.3 , D = 6 k0.9801.0500.827 1.1340.851 1.1090.979 1.0790.996 1.078
0.0780.0720.3070.2590.1000.083
α 3.3793.3802.123 4.6352.321 4.4373.285 3.4743.290 3.469
0.6410.4262.5132.1150.1890.179
R ( . ) 0.1830.2320.007 0.3590.035 0.3310.181 0.2590.192 0.257
0.0890.4260.3490.2940.0780.064
H ( . ) 2.0862.0861.025 3.1461.193 2.9792.028 2.1442.031 2.141
0.5410.4261.5511.3060.1170.111
t 2 = 1.4 , D = 8 k0.9831.0580.822 1.1450.848 1.1190.995 1.0831.010 1.082
0.0820.0780.3230.2720.0870.072
α 3.8023.8052.442 5.1622.657 4.9473.707 3.8973.713 3.892
0.6940.0572.7202.2900.1900.180
R ( . ) 0.1500.1980.035 0.2650.053 0.2470.156 0.2210.165 0.219
0.0590.0570.2990.2520.0650.054
H ( . ) 2.3472.3491.177 3.5161.362 3.3312.288 2.4062.292 2.403
0.5970.0571.6791.4140.1170.111
t 2 = 1.7 , D = 10 k0.9861.0640.822 1.1510.848 1.1251.007 1.0861.022 1.085
0.0840.0830.3290.2770.0780.063
α 4.2464.2512.786 5.7073.017 5.4764.152 4.3424.158 4.337
0.7450.4522.9222.4590.1890.179
R ( . ) 0.1220.1680.041 0.2020.054 0.1890.132 0.1870.140 0.185
0.0410.4520.2540.2140.0540.044
H ( . ) 2.6212.6241.345 3.8971.547 3.6952.563 2.6802.567 2.677
0.6510.4521.8031.5180.1170.110
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Sultan, K.S.; Emam, W. The Combined-Unified Hybrid Censored Samples from Pareto Distribution: Estimation and Properties. Appl. Sci. 2021, 11, 6000. https://doi.org/10.3390/app11136000

AMA Style

Sultan KS, Emam W. The Combined-Unified Hybrid Censored Samples from Pareto Distribution: Estimation and Properties. Applied Sciences. 2021; 11(13):6000. https://doi.org/10.3390/app11136000

Chicago/Turabian Style

Sultan, Khalaf S., and Walid Emam. 2021. "The Combined-Unified Hybrid Censored Samples from Pareto Distribution: Estimation and Properties" Applied Sciences 11, no. 13: 6000. https://doi.org/10.3390/app11136000

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop