Next Article in Journal
3D-ShuffleViT: An Efficient Video Action Recognition Network with Deep Integration of Self-Attention and Convolution
Next Article in Special Issue
Ratio Test for Mean Changes in Time Series with Heavy-Tailed AR(p) Noise Based on Multiple Sampling Methods
Previous Article in Journal
Haircut Capital Allocation as the Solution of a Quadratic Optimisation Problem
Previous Article in Special Issue
Statistical Inference on the Entropy Measures of Gamma Distribution under Progressive Censoring: EM and MCMC Algorithms
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Discriminating among Generalized Exponential, Weighted Exponential and Weibull Distributions

1
School of Science, Xi’an University of Technology, Xi’an 710048, China
2
College of Big Data and Internet, Shenzhen Technology University, Shenzhen 518118, China
3
College of New Material and New Energy, Shenzhen Technology University, Shenzhen 518118, China
*
Author to whom correspondence should be addressed.
Mathematics 2023, 11(18), 3847; https://doi.org/10.3390/math11183847
Submission received: 13 August 2023 / Revised: 3 September 2023 / Accepted: 5 September 2023 / Published: 8 September 2023
(This article belongs to the Special Issue Current Developments in Theoretical and Applied Statistics)

Abstract

:
In this paper, we consider the problem of discriminating among three different positively skewed lifetime distributions, namely the generalized exponential distribution, the weighted exponential distribution, and the Weibull distribution. All of these distributions have been used quite effectively to analyze positively skewed lifetime data. We use the methods of the ratio of maximized likelihood, the minimum Kolmogorov distance, and the sequential probability ratio test to discriminate among these three distributions. The probability of correct selection is considered for each hypothesis based on several scenarios with Monte Carlo simulation. Real data applications are studied to illustrate the effectiveness of these proposed methods.

1. Introduction

The problem of testing whether some given data come from one of two or more distributions is quite common in the statistical literature, see Atkinson [1,2], Chen [3], Chambers and Cox [4], Cox [5,6], and Dyer [7]. Several methods have been used on the problem of discriminating among different distributions, such as the ratio of maximized likelihood (RML), the minimum Kolmogorov distance (MKD), and the sequential probability ratio test (SPRT). In this paper, we discuss the methods of discriminating among the Weibull (WB) distribution, the generalized exponential (GE) distribution, and the weighted exponential (WE) distribution.
The WB distribution was introduced by Weibull [8] to analyze the skewed lifetime data, and has been widely applied in reliability engineering, failure analysis, and radar systems to model the dispersion of the received signal’s level produced by some types of clutter. If x follows the WB distribution with the shape parameter β and scale parameter ξ , respectively, then it has the cumulative distribution function (cdf) and the probability density function (pdf) as follows
F W B ( x ; β , ξ ) = 1 e ( ξ x ) β , x > 0 , and β , ξ > 0 ,
and
f W B ( x ; β , ξ ) = β ξ β x β 1 e ( ξ x ) β , x > 0 , and β , ξ > 0 ,
respectively, and is denoted by x W B ( β , ξ ) .
The GE distribution was proposed and studied quite extensively by Gupta and Kundu [9], and has the cdf
F G E ( x ; γ , θ ) = ( 1 e θ x ) γ , x > 0 ,
and the pdf
f G E ( x ; γ , θ ) = γ θ ( 1 e θ x ) γ 1 e θ x , x > 0 ,
where γ > 0 , θ > 0 are shape and scale parameters, respectively, and it is denoted by x G E ( γ , θ ) . Moreover, the pdf of the distribution G E ( γ , θ ) strictly decreases when γ 1 , and has a unimodal shape for γ > 1 . It is clear that the pdf of the G E ( γ , θ ) distribution is always right-skewed and can be used quite effectively to analyze skewed data sets.
Recently, Gupta and Kundu [10] introduced a new class of the WE distribution as a generalization of the standard exponential distribution, which has the pdf
f W E ( x ; η , λ ) = η + 1 η λ e λ x ( 1 e η λ x ) , x > 0 ,
and the cdf
F W E ( x ; η , λ ) = 1 + 1 η e λ ( η + 1 ) x η + 1 η e λ x , x > 0 ,
where η > 0 , λ > 0 are the shape and scale parameters, respectively, and it is denoted as x W E ( η , λ ) . Furthermore, they showed that the WE distribution possesses some good properties and can be used as a good fit for survival time data compared to other popular distributions such as the gamma, the WB, or the GE distribution. For example, Makhdoom [11] investigated the statistical inference for reliability and stress strength for the WE distribution. Dey et al. [12] investigated various properties and methods of the estimation of the WE distribution. Tian and Yang [13] studied a change-point problem of the WE distribution based on the likelihood ratio test, modified information criterion, and Schwarz information criterion.
As we know, the WE distribution can be used as an alternative to the WB or GE distributions, and these distributions have several interesting properties. For certain values of shape and scale parameters, it is interesting to note that the shapes of these three distributions of cdfs and pdfs are very similar, but the hazard functions are completely different, which can be seen in Figure 1 and Figure 2. These suggest that even if the distributions are very close in a sense of a certain distributional characteristic, they may be quite different concerning other characteristics.
In fact, many scholars have studied the discrimination related to the WB distribution or the GE distribution. Gupta and Kundu [14] discussed the closeness of the gamma and the GE distributions using the RML and the MKD methods. Gupta and Kundu [15] considered the RML and the MKD methods to study the discriminating problem of the WB and the GE distributions. Pakyari [16] presented diagnostic tools based on the RML and the MKD methods to discriminate between the GE, the geometric extreme exponential, and the WB models. Raqab [17] considered the RML procedure to discriminate between the generalized Rayleigh and the WB distributions. Elsherpieny et al. [18] studied the RML procedure to discriminate between the gamma and the log-logistic distributions in the case of progressive type-II censoring. Ahmad et al. [19] used the MKD method to discriminate between the generalized Rayleigh and the WB distributions. Raqab et al. [20] studied the discriminating problem of the WB, the log-normal, and the log-logistic distributions based on the RML and the MKD methods. Vaidyanathan and Varghese [21] applied the RML and the MKD methods to discriminate between the exponential and the Lindley distributions based on a given random sample of observations, and this has been widely used in the fields of biology and engineering. Recently, Paul et al. [22] developed the SPRT methodology to discriminate between any two of the log-normal, the WB, and the log-logistic distributions as well as to discriminate among these distributions, which allowed the use of a less than average sample size without sacrificing the probability of correct selection (PCS).
As far as we know, statisticians have not yet studied the problem of discriminating among these distributions. Thus, it is significant to study this. The paper is organized as follows. Three different methods to discriminate among the GE, WE, and WB distributions are investigated in Section 2. Simulation studies are considered in Section 3. The real data set is analyzed in Section 4. Some concluding remarks are presented in Section 5.

2. Methodology

In this section, we study three different methods, namely the RML, the MKD, and the SPRT, to discriminate among the GE, the WE, and the WB distributions.
As a random sample, x 1 , x 2 , ⋯, x n is supposed to belong to one of the WE, GE, or WB distributions. Our interest lies in determining the distribution from which the random sample is observed. This problem can be formulated as the hypothesis-testing problem in the following
H 1 : x 1 , x 2 , W E ( η , λ ) vs . H 2 : x 1 , x 2 , G E ( γ , θ ) vs . H 3 : x 1 , x 2 , W B ( β , ξ ) .
Under H 1 , the log-likelihood function is given by
log L W E = n log ( η + 1 ) λ η λ X ¯ + i = 1 n log ( 1 e η λ x i ) ,
where X ¯ = 1 n i = 1 n x i .
The MLEs of η ^ and λ ^ are the solutions of the following equations after taking the first derivatives of (1) for η and λ ,
log L W E η = n η ( η + 1 ) i = 1 n λ x i e η λ x i 1 e η λ x i = 0 , log L W E λ = n λ n X ¯ + i = 1 n η x i e η λ x i 1 e η λ x i = 0 .
Under H 2 , the log-likelihood function is given by
log L G E = n [ log ( γ θ ) θ X ¯ ] + ( γ 1 ) i = 1 n log ( 1 e θ x i ) .
Similarly, the MLEs of γ ^ and θ ^ can be obtained by solving the following equations,
log L G E γ = n γ i = 1 n log ( 1 e θ x i ) = 0 , log L G E θ = n θ n X ¯ + ( γ 1 ) i = 1 n x i e θ x i 1 e θ x i = 0 .
Under H 3 , the log-likelihood function is given by
log L W B = n log β + n β log ξ + ( β 1 ) i = 1 n log x i i = 1 n ( ξ x i ) β .
Then, the MLEs of ξ ^ and β ^ can be obtained by solving the following equations,
log L W B ξ = n β ξ β ξ β 1 i = 1 n x i β = 0 , log L W B β = n β + n log ξ + i = 1 n log x i i = 1 n ( ξ x i ) β log ( ξ x i ) = 0 .

2.1. The Ratio of Maximized Likelihood Method

Cox [5,6] first proposed the RML test when discriminating between two separate models and derived the asymptotic distribution of the RML statistic. In the following, we introduce the procedure of the method of the RML.
The RMLs between two separate models can be defined as follows:
L 1 = L W E ( η ^ , λ ^ ) L G E ( γ ^ , θ ^ ) , L 2 = L W B ( β ^ , ξ ^ ) L W E ( η ^ , λ ^ ) , L 3 = L W B ( β ^ , ξ ^ ) L G E ( γ ^ , θ ^ ) ,
where η ^ ,   λ ^ , γ ^ ,   θ ^ , β ^ , and ξ ^ are calculated by (2), (4), and (6), respectively.
The logarithm of the above RMLs can be written as
T n 1 = log L 1 = log L W E ( η ^ , λ ^ ) log L G E ( γ ^ , θ ^ ) = n log ( η ^ + 1 ) λ ^ η ^ γ ^ θ ^ ( λ ^ θ ^ ) X ¯ + i = 1 n log 1 e η ^ λ ^ x i ( 1 e θ ^ x i ) γ ^ 1 , T n 2 = log L 2 = log L W B ( β ^ , ξ ^ ) log L W E ( η ^ , λ ^ ) = n log β ^ η ^ ξ ^ β ^ ( η ^ + 1 ) λ ^ + λ ^ X ¯ + i = 1 n log x i β ^ 1 1 e η ^ λ ^ x i i = 1 n ( ξ ^ x i ) β ^ , T n 3 = log L 3 = log L W B ( β ^ , ξ ^ ) log L G E ( γ ^ , θ ^ ) = n log β ^ ξ ^ β ^ γ ^ θ ^ + θ ^ X ¯ + i = 1 n log x i β ^ 1 ( 1 e θ ^ x i ) γ ^ 1 i = 1 n ( ξ ^ x i ) β ^ .
Now we choose the WE distribution if T n 1 > 0 , T n 2 < 0 , the GE distribution if T n 1 < 0 , T n 3 < 0 , and the WB distribution if T n 2 > 0 , T n 3 > 0 .

2.2. The Minimum Kolmogorov Distance Method

The MKD test was originally employed to test the hypothesis that a completely random sample has come from a fully specified continuous distribution. The procedure of the MKD method is given as follows. Denote F ^ W E ( η ^ , λ ^ ) , F ^ G E ( γ ^ , θ ^ ) , and F ^ W B ( β ^ , ξ ^ ) as the cdfs calculated at the MLEs of the parameters of the WE, GE, and WB distributions, respectively, and let F ˜ ( x ) be the empirical distribution function calculated from the data. The KDs associated with the three models are given as, respectively,
K D W E = s u p < x < + | F ^ W E ( x , η ^ , λ ^ ) F ˜ ( x ) | , K D G E = s u p < x < + | F ^ G E ( x , γ ^ , θ ^ ) F ˜ ( x ) | , K D W B = s u p < x < + | F ^ W B ( x , β ^ , ξ ^ ) F ˜ ( x ) | .
The model with the minimum distance is then chosen as the winning model.

2.3. The SPRT Method

The SPRT analysis, first introduced by Wald [23], was originally developed as a more effective method of quality control during the Second World War. In this section, an SPRT procedure for discriminating the WE, the GE, and the WB is described.
As the data are observed sequentially, the goal is to identify the correct model by testing multiple hypotheses. The proposed discriminating method is based on the logarithm of the RMLs defined as:
Λ n 1 = log L G E ( x i ; γ ^ n , θ ^ n ) L W E ( x i ; η ^ n , λ ^ n ) = log L G E ( x i ; γ ^ n , θ ^ n ) log L W E ( x i ; η ^ n , λ ^ n ) = n log η ^ n γ ^ n θ ^ n ( η ^ n + 1 ) λ ^ n ( θ ^ n λ ^ n ) X ¯ + i = 1 n log ( 1 e θ ^ n x i ) γ ^ n 1 1 e η ^ n λ ^ n x i , Λ n 2 = log L W E ( x i ; η ^ n , λ ^ n ) L W B ( x i ; β ^ n , ξ ^ n ) = log L W E ( x i ; η ^ n , λ ^ n ) log L W B ( x i ; β ^ n , ξ ^ n ) = n log ( η ^ n + 1 ) λ ^ n β ^ n η ^ n ξ ^ n β ^ n λ ^ n X ¯ + i = 1 n log 1 e η ^ n λ ^ n x i x i β ^ n 1 + i = 1 n ( ξ ^ n x i ) β ^ n , Λ n 3 = log L G E ( x i ; γ ^ n , θ ^ n ) L W B ( x i ; β ^ n , ξ ^ n ) = log L G E ( x i ; γ ^ n , θ ^ n ) log L W B ( x i ; β ^ n , ξ ^ n ) = n log γ ^ n θ ^ n β ^ n ξ ^ n β ^ n θ ^ n X ¯ + i = 1 n log ( 1 e θ ^ n x i ) γ ^ n 1 x i β ^ n 1 + i = 1 n ( ξ ^ n x i ) β ^ n ,
where η ^ n , λ ^ n , γ ^ n , θ ^ n , β ^ n , and ξ ^ n are MLEs of η , λ , γ , θ , β , and ξ , respectively.
A natural idea for multi-hypothesis testing is to select the hypothesis having the maximum likelihood. The SPRT procedure proposed here is based on Λ n 1 , Λ n 2 , and Λ n 3 as defined in (10), respectively. The appropriate selection of stopping boundaries in (10) is crucial for ensuring a high PCS and savings in the sample size. Here, we choose P C S 1 α , where α ( 0 , 1 ) is the fixed level, and the SPRT stopping boundaries are b j = ln ( α 2 ) = a j , for j = 1 , 2 , 3 . Let k ( 1 < k n ) be a prefixed pilot sample size. The purely sequential procedure is discussed as follows:
Stage 1: Draw samples x 1 , x 2 , ⋯, x k and compute Λ k 1 , Λ k 2 , and Λ k 3 . Stop sampling if { Λ k 1 a 1 and Λ k 3 a 3 } or { Λ k 1 b 1 and Λ k 2 a 2 } or { Λ k 2 b 2 and Λ k 3 b 3 } . Otherwise, proceed to the next stage.
Stage 2: Draw a new observation x k + 1 independent of x 1 , x 2 , ⋯, x k and compute Λ k + 1 1 , Λ k + 2 2 , and Λ k + 3 3 . Stop sampling if { Λ k + 1 1 a 1 and Λ k + 1 3 a 3 } or { Λ k + 1 1 b 1 and Λ k + 1 2 a 2 } or { Λ k + 1 2 b 2 and Λ k + 1 3 b 3 } . Otherwise, proceed to draw one new observation. We continue sampling one observation at a time until
t = i n f n k { Λ n 1 a 1 Λ n 3 a 3 } { Λ n 1 b 1 Λ n 2 a 2 } { Λ n 2 b 2 Λ n 3 b 3 } .
We select the GE distribution if { Λ t 1 a 1 a n d Λ t 3 a 3 } , the WE distribution if { Λ t 1 b 1 a n d Λ t 2 a 2 } , and the WB distribution if { Λ t 2 b 2 a n d Λ t 3 b 3 } . In other words, the stopping time T = m i n { τ 1 , τ 2 , τ 3 } where
τ 1 = i n f n k { Λ n 1 a 1 , Λ n 3 a 3 } , τ 2 = i n f n k { Λ n 1 b 1 , Λ n 2 a 2 } , τ 3 = i n f n k { Λ n 2 b 2 , Λ n 3 b 3 } .

2.4. Algorithm

According to the above procedure, the algorithm for each method can be summarized as follows.
(i)
Generating a sample with sample size n follows the null hypothesis.
(ii)
Using the sample to compute the corresponding MLE of the distribution follows H 1 , H 2 , H 3 .
(iii)
Compute the statistic of each method by using Equations (8), (9), and (10), respectively.
(iv)
Select the correct distribution according to the criteria from each procedure.

3. Numerical Results

In this section, we performed some simulation experiments to compare different methods to discriminate among these three distributions. We used the RML and MKD methods first, then considered the SPRT procedure. All the computations were performed using R version 4.1.1 program [24]. To compare the performances of the different methods for different sample sizes and for different parameter values, we generated samples from the three different distributions and computed the PCS for each method based on 1000 replications.
We used the Monte Carlo simulation to generate data sets from each of the null distributions. In this case, we considered different sample sizes, namely n = 20, 40, 60, 80, 100, 200, and 500. For each of these combinations of the sample size, the shape parameter, and the scale parameter, we estimated the parameters of the three models by the MLEs procedure as described in (2), (4) and (6). To compare the performances of the different methods for different sample sizes and for different parameter values, we generated samples from the three different distributions and computed the PCS for each method based on 1000 replications. The details are explained below.
In the RML procedure, the RML statistic was calculated and the winning model was determined by the decision rule in Section 2.1. We repeated this procedure 1000 times and calculated the proportion of times the null distribution was chosen as the corresponding PCS. Moreover, we chose the values of the shape parameters that were greater than 1, and which had unimodal shapes of pdfs and had the significance of discrimination. For simplicity, we selected the same values of the scale parameters. Meanwhile, the corresponding PCSs for each hypothesis were calculated based on the procedure in Section 2.1.
I:
The data come from the WE distribution. In this case, we set η = 5.0, 5.2, 5.4, 5.8 and λ = 0.5, 1.0, 1.5. We compute T n 1 and T n 2 as defined in (8) and the P C S W E R M L can be written as follows,
P C S W E R M L = P ( T n 1 > 0 , T n 2 < 0 | data from the WE distribution ) .
II:
The data come from the GE distribution. We set γ = 2.5, 2.7, 2.9, 3.3 and θ = 0.5, 1, 1.5, and the P C S G E R M L can be obtained as follows,
P C S G E R M L = P ( T n 1 < 0 , T n 3 < 0 | data from the GE distribution ) .
III:
The data come from the WB distribution. We set the parameters β = 2.5, 2.7, 2.9, 3.3 and ξ = 0.5, 1, 1.5, and the P C S W B R M L can be calculated as follows,
P C S W B R M L = P ( T n 2 > 0 , T n 3 > 0 | data from the WB distribution ) .
Similarly, as in Pakyari [16] and Raqab et al. [20], we found the MLEs of parameters for the GE, WE, and WB distributions. The empirical distribution function was also calculated for each of the generated data sets. The model with the minimum distance was then chosen as the winning model. This procedure was repeated 1000 times and the corresponding PCS based on the MKD procedure, as described in Section 2.2, can be written as follows:
P C S W E M K D = P ( K D W E < K D G E , K D W B > K D W E | data from the WE distribution ) , P C S G E M K D = P ( K D W E > K D G E , K D W B > K D G E | data from the GE distribution ) , P C S W B M K D = P ( K D W B < K D W E , K D W B < K D W E | data from the WB distribution ) .
Table 1, Table 2 and Table 3 give the PCS under the RML and the MKD procedures when the data come from different distributions. In Table 1, the P C S R M L is higher than P C S M K D , but only slightly higher in some cases, which shows that the difference in the PCS choice between the two methods is not very large. At the same time, when the sample size is bigger than 100, the PCS can only reach 70%, which means it may take a certain sample size to distinguish among the three distributions when the data are generated from the WE distribution. In Table 2, the values of PCS for these two methods are improved as the shape parameter and scale parameter increase. However, the value of P C S R M L is significantly higher than that of the P C S M K D . In Table 3, the value of PCS is close to 1 when the sample sizes exceed 200. In all three tables, we find that the PCS increases as the sample size and parameters improve, and the RML method is superior to the MKD procedure.
Next, in the SPRT procedure, we computed Λ t 1 , Λ t 2 , and Λ t 3 as defined in (10) and then found the best-fitting distribution based on the criterion given in Section 2.3. We considered the SPRT procedure for some prefixed values of α = 0.1, 0.2. Meanwhile, we set k = 10 fixed sample sizes here and repeated the purely sequential procedure 1000 times. Using the same procedures as Paul et al. [22], the P C S s are obtained in the following,
P C S W E S P R T = P ( Λ t 1 b 1 , Λ t 2 a 2 | data from the WE distribution ) , P C S G E S P R T = P ( Λ t 1 a 1 , Λ t 3 a 3 | data from the GE distribution ) , P C S W B S P R T = P ( Λ t 2 b 2 , Λ t 3 b 3 | data from the WB distribution ) .
Table 4, Table 5 and Table 6 compare the performances of the fixed-sample procedure; here we pick the RML procedure to discriminate among these three distributions with the SPRT procedure. E ( t ) and n R M L denote the estimated expected sample size for the SPRT procedure and the sample size of the RML procedure. n R M L is chosen such that P C S R M L is approximately equal to P C S S P R T without sacrificing the PCS. The last columns of these tables report the savings in the average (expected) sample sizes as n R M L E ( t ) n R M L × 100 % .
From Table 4, Table 5 and Table 6, it is clear that as α increases, the value of n R M L and E ( t ) decrease. With approximately equal PCSs, the SPRT methods require a much lower expected sample size than that of the corresponding RML procedure. Table 4, Table 5 and Table 6 illustrate that the SPRT procedure requires fewer observations to draw the same conclusion than the the fixed-sample procedure.

4. Data Analysis

In this section, we analyze a real data set and use the above methods to discriminate the WE, the GE and the WB distributions.

4.1. Malignant Melanoma Data

This data set is available in Andersen et al. [25] and R package “ t i m e r e g ” (see, Scheike and Zhang [26]), and contains the survival times (in years) of 205 patients after surgery for malignant melanoma (skin cancer) between the years 1962 and 1977 collected at the University Hospital of Odense, Denmark, by K.T. Drzewiecki. The data set can also be found in Appendix A.
We analyzed the data set based on the RML, the MKD, and the SPRT methods, and the results are summarized in Table 7. For the SPRT procedure, we set a j = 2.3026 , b j = 2.3026 , and j = 1, 2, 3. From the table, we can find that T n 2 = 23.4740 > 0 and T n 3 = 16.8645 > 0 , which means the WB distribution is selected. Similarly, the MKDs for these three distributions are 0.1700, 0.1813, and 0.1127, respectively. Therefore, the MKD criterion also suggests choosing the WB model for this data set. Next, Λ t 2 = 2.3319 < 2.3026 and Λ t 3 = 2.3102 < 2.3026 , which illustrates that the SPRT procedure also supports this fact in this case. However, it is remarkable that the average sample size E ( t ) is only 106, which is much less than n F = 205 , giving 48.29% savings in the average sample size.
Figure 3 depicts the histogram of the malignant melanoma data, and pdfs of the WE, the GE, and the WB fitted models. From the fitted density functions, it appears that the WB distribution provides a better fit than the WE and the GE distributions in this case. Andersen et al. [25] suggested using a WB model for this data set and Paul et al. [22] used SPRT criteria to choose the WB distribution, which all confirm the result we obtained above.

4.2. Daily Ozone Data

The second data set records the daily measurements of ozone concentration in the atmosphere of New York City from May to September 1973; for details, see Nadarajah [27]. The data are given in Appendix A.
In Table 8, we obtain the results by applying the RML, MKD, and SPRT methods. We first consider the RML criterion to analyze these real data. Since T n 1 = 0.4501 > 0 and T n 2 = 1.6644 < 0 , we can say that the WE model fits best. Next, the MKDs between the empirical distribution function and the fitted GE, WE, and WB distribution functions are 0.0846, 0.0564 and 0.0900, respectively. Therefore, the MKD criterion also suggest choosing the WE model for this data set. The RML and the MKD criteria all lead to the same conclusion. Finally, we use the SPRT procedure with boundaries a j = 2.3026 , b j = 2.3026 , and j = 1, 2, 3. Then, Λ t 1 = 2.3088 < 2.3026 and Λ t 2 = 2.3112 > 2.3026 , which mean that we still prefer the WE distribution rather than the GE and WB distributions. Moreover, the average sample size E ( t ) is about 93, which is less than n F = 116, giving 19.83% savings in the average sample size.
The histogram in Figure 4 indicates that a positively skewed distribution may fit the data well. We provide the histogram of data set 2 and the pdfs of the GE, WE, and WB fitted distributions in Figure 4. From Figure 4, it appears that the WE distribution provides a better fit than the GE and the WB distributions in this case. Finally, we also conduct the goodness of fit test on the daily ozone data set, and the results are shown in Table 8. It is found that the WE distribution fits best.

5. Conclusions

In this paper, we consider the problem of discriminating among the WE, the GE, and the WB distributions using three different methods. Simulation studies are conducted for the various shapes and scales of parameters, and the performance of the RML, MKD, and SPRT methods are analyzed and compared. We compute their corresponding values of PCS based on the Monte Carlo simulation for different sceneries. Through simulation studies, we find that the RML method is superior to the MKD method in distinguishing these three distributions under the fixed sample size. At the same time, without losing the values of PCS, we compare the RML method with the SPRT procedure and find that the SPRT method provides savings on the sample size. The results for the application show that the SPRT requires fewer observations to obtain the same PCS than the existing fitting test with the RML and the MKD.

Author Contributions

W.T.: conceptualization, methodology, validation, investigation, resources, supervision, project administration, visualization, and writing—review and editing; R.N.: software, formal analysis, data curation, writing—original draft preparation, and visualization; Y.Z.: software, writing—review and editing, and visualization. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by Natural Science Foundation of Top Talent of SZTU under grant GDRC202214.

Data Availability Statement

The data presented in this study are openly available in [25,26,27].

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A

Appendix A.1. Malignant Melanoma Data

0.0274, 0.0822, 0.0959, 0.2712, 0.5068, 0.5589, 0.5753, 0.6356, 0.6356, 0.7644, 0.8082, 0.9726, 1.0575, 1.1671, 1.2849, 1.3507, 1.4493, 1.7014, 1.7233, 1.8055, 1.8274, 1.9671, 2.0603, 2.1342, 2.1726, 2.2384, 2.2630, 2.2822, 2.3507, 2.3808, 2.3890, 2.6493, 2.6767, 2.6904, 2.8521, 2.8904, 2.9096, 2.9452, 3.1671, 3.3644, 3.4301, 3.4822, 3.5945, 3.9096, 3.9315, 4.1068, 4.1260, 4.1315, 4.1370, 4.1425, 4.1534, 4.1781, 4.2247, 4.2411, 4.2658, 4.2740, 4.2822, 4.3397, 4.3973, 4.4411, 4.4575, 4.4767, 4.4959, 4.4959, 4.5151, 4.5260, 4.5315, 4.5315, 4.5671, 4.5973, 4.6164, 4.6301, 4.6849, 4.6849, 4.7288, 4.7808, 4.8274, 4.8740, 4.8959, 4.8959, 4.9123, 4.9425, 4.9644, 5.0301, 5.0384, 5.0384, 5.0795, 5.0849, 5.0959, 5.1068, 5.2027, 5.2438, 5.2575, 5.2603, 5.2795, 5.2959, 5.3205, 5.3562, 5.3589, 5.3644, 5.3781, 5.3973, 5.4932, 5.4986, 5.5096, 5.5452, 5.5562, 5.5836, 5.6329, 5.6411, 5.6466, 5.6493, 5.6849, 5.7123, 5.7589, 5.7616, 5.7644, 5.7753, 5.7863, 5.8904, 5.9068, 5.9315, 6.0521, 6.1014, 6.1014, 6.1808, 6.2027, 6.4082, 6.4685, 6.5397, 6.5425, 6.5836, 6.6466, 6.6466, 6.6603, 6.7397, 6.7589, 6.8274, 6.8301, 6.9068, 6.9644, 7.0110, 7.0274, 7.0411, 7.2877, 7.3041, 7.3315, 7.5014, 7.6219, 7.6356, 8.1753, 8.3068, 8.3288, 8.3342, 8.4027, 8.4356, 8.4959, 8.6137, 8.6356, 8.6411, 8.7123, 8.7178, 8.7260, 8.7644, 8.8438, 8.8466, 8.9808, 9.0329, 9.1178, 9.1233, 9.1452, 9.2685, 9.2712, 9.2740, 9.2822, 9.3205, 9.4274, 9.4740, 9.4767, 9.4767, 9.5233, 9.6521, 10.0466, 10.1233, 10.1233, 10.3452, 10.3452, 10.4932, 10.5644, 10.6082, 10.7096, 10.8712, 10.9616, 11.2411, 11.2849, 11.2986, 11.5260, 11.8082, 12.0274, 12.2712, 12.3068, 12.7890, 12.7890, 13.4959, 15.2466.

Appendix A.2. Daily Ozone Data

41, 36, 12, 18, 28, 23, 19, 8, 7, 16, 11, 14, 18, 14, 34, 6, 30, 11, 1, 11, 4, 32, 23, 45, 115, 37, 29, 71, 39, 23, 21, 37, 20, 12, 13, 135, 49, 32, 64, 40, 77, 97, 97, 85, 10, 27, 7, 48, 35, 61, 79, 63, 16, 80, 108, 20, 52, 82, 50, 64, 59, 39, 9, 16, 78, 35, 66, 122, 89, 110, 44, 28, 65, 22, 59, 23, 31, 44, 21, 9, 45, 168, 73, 76, 118, 84, 85, 96, 78, 73, 91, 47, 32, 20, 23, 21, 24, 44, 21, 28, 9, 13, 46, 18, 13, 24, 16, 13, 23, 36, 7, 14, 30, 14, 18, 20.

Appendix B

# the RML procedure
for (i in 1:p)
{   logWE<-function(theta){
         beta=theta[1]
         lambda=theta[2]
   h=n*log(beta+1)-n*log(beta)+n*log(lambda)-lambda*sum(x)+sum(log(1-exp(-beta*lambda*x)))
   return(-h)   }
  result1=optim(par=c(1,1),fn=logWE,control=list(maxit=1000),hessian = FALSE)
  logGE <- function(theta){
        gamma=theta[1]
        alpha=theta[2]
  h=n*log(gamma)+n*log(alpha)+(gamma-1)*sum(log(1-exp(-alpha*x)))-alpha*sum(x)
  return(h)  }
  result2=maxLik(logLik=logGE, star=c(1,1),method=“BFGS”)
  logWB<-function(theta){
       eta=theta[1]
       delta=theta[2]
  h=n*log(eta)+n*eta*log(delta)+(eta-1)*sum(log(x))-sum((delta*x)$^$eta)
  return(h)  }
  result3=maxLik(logLik = logWB, start =c(1,1),method=“BFGS”)
  LWE=n*log(result1$par[1]+1)-n*log(result1$par[1])+n*log(result1$par[2])-result1$par[2]*sum(x)+
      sum(log(1-exp(-result1$par[1]*result1$par[2]*x)))
  LGE=n*log(result2$estimate[1])+n*log(result2$estimate[2])+(result2$estimate[1]-1)*sum(log(1-exp
      (-result2$estimate[2]*x)))-result2$estimate[2]*sum(x)
  LWB=n*log(result3$estimate[1])+n*result3$estimate[1]*log(result3$estimate[2])+(result3$estimate
     [1]-1)*sum(log(x))-sum((result3$estimate[2]*x)^(result3$estimate[1])
  if((LWE-LGE)>0 & (LWE-LWB)>0){
       li=1  }
  else{       li=0  }
  qff=rbind(qff,li)}
            # the MKD procedure
for(i in 1:p)
{  logWE<-function(theta){
        beta=theta[1]
        lambda=theta[2]
   h=n*log(beta+1)-n*log(beta)+n*log(lambda)-lambda*sum(x)+sum(log(1-exp(-beta*lambda*x)))
   return(h)   }
  result1=maxLik(logLik = logWE, start =c(1,1),method=“SANN”)
  x1=sort(x)
  FWE=1+(1/result1$estimate[1])*exp(-result1$estimate[2]*(result1$estimate[1]+1)*x1)
      -((result1$estimate[1]+1)/result1$estimate[1])*exp(-result1$estimate[2]*x1)
  fn=ecdf(x1)
  WEKD=max(abs(FWE-fn(x1)))
  logGE<-function(theta){
       gamma=theta[1]
       alpha=theta[2]
  h=n*log(gamma)+n*log(alpha)+(gamma-1)*sum(log(1-exp(-alpha*x)))-alpha*sum(x)
  return(h) }
  result2=maxLik(logLik = logGE, start =c(1,1),method=“BFGS”)
  FGE=(1-exp(-result2$estimate[2]*x1))^(result2$estimate[1])
  GEKD=max(abs(FGE-fn(x1)))
  logWB<-function(theta){
      eta=theta[1]
      delta=theta[2]
  h3=n*log(eta)+n*eta*log(delta)+(eta-1)*sum(log(x))-sum((delta*x)^eta)
  return(h3)  }
  result3=maxLik(logLik = logWB, start =c(1,1),method=“BFGS”)
  FWB=1-exp(-(result3$estimate[2]*x1)^(result3$estimate[1]))
  WBKD=max(abs(FWB-fn(x1)))
  if((GEKD-WEKD)>0 & (WBKD-WEKD)>0){  li=1}
  else{  li=0  }
  qff=rbind(qff,li)}
            # the SPRT procedure
for (i in 1:p)
{ while (b1<=S1 & S1<=a1 & b1<=S2 & S2<=a1 & b1<=S3 & S3<=a1 & k<=n){
      x <- x1[1:k]
      logWE<-function(theta){
           beta=theta[1]
           lambda=theta[2]
      h=k*log(beta+1)-k*log(beta)+k*log(lambda)-lambda*sum(x)+sum(log(1-exp(-beta*lambda*x)))
      return(-h)}
result1=optim(par=c(1,1),fn=logWE,control = list(maxit=1000),hessian = FALSE)
logGE <- function(theta){
            gamma=theta[1]
            alpha=theta[2]
      h=k*log(gamma)+k*log(alpha)+(gamma-1)*sum(log(1-exp(-alpha*x)))-alpha*sum(x)
      return(h)      }
      result2=maxLik(logLik = logGE, start =c(1,1),method=“SANN”)
      logWB<-function(theta){
           eta=theta[1]
         delta=theta[2]
      h=k*log(eta)+k*eta*log(delta)+(eta-1)*sum(log(x))-sum((delta*x)^eta)
      return(h)   }
      result3=maxLik(logLik = logWB, start =c(1,1),method=“SANN”)
         s1<-function(x)
      {  P<-log(((result1$par[1]+1)/result1$par[1])*result1$par[2]*exp(-result1$par[2]*x)*
           (1-exp(-result1$par[1]*result1$par[2]*x)))
        return(P) }
      s2<-function(x)
      {    K<-log(result2$estimate[1]*result2$estimate[2]*((1-exp(-result2$estimate[2]*x))^
           (result2$estimate[1]-1))*exp(-result2$estimate[2]*x))
       return(K)  }
     s3<-function(x)
      { Q<-log(result3$estimate[1]*(result3$estimate[2]^(result3$estimate[1]))*(x^
          (result3$estimate[1]-1))*exp(-(result3$estimate[2]*x)^result3$estimate[1]))
      return(Q) }
     P=s1(x)
     K=s2(x)
     Q=s3(x)
     S1=sum(K-P)
     S2=sum(P-Q)
     S3=sum(K-Q)
     k<-k+1 }
   if(S1>=a1 & S3>=a1){       li=1   }
   else{       li=0 }
   qff=rbind(qff,li)
   N=rbind(N,k)}

References

  1. Atkinson, A.C. A test for discriminating between models. Biometrika 1969, 56, 337–347. [Google Scholar] [CrossRef]
  2. Atkinson, A.C. A method for discriminating between models. J. R. Stat. Soc. Ser. B Methodol. 1970, 32, 323–345. [Google Scholar] [CrossRef]
  3. Chen, W. On the tests of separate families of hypotheses with small sample size. J. Stat. Comput. Simul. 1980, 11, 183–187. [Google Scholar] [CrossRef]
  4. Chambers, E.A.; Cox, D.R. Discrimination between alternative binary response models. Biometrika 1967, 54, 573–578. [Google Scholar] [CrossRef] [PubMed]
  5. Cox, D.R. Tests of separate families of hypotheses. In Proceedings of the Fourth Berkeley Symposium on Mathematical Statistics and Probability, Berkeley, CA, USA, 20 June–30 July 1960; Statistical Laboratory of the University of California: Berkeley, CA, USA, 1961; Volume 1, pp. 105–123. [Google Scholar]
  6. Cox, D.R. Further results on tests of separate families of hypotheses. J. R. Stat. Soc. Ser. B Methodol. 1962, 24, 406–424. [Google Scholar] [CrossRef]
  7. Dyer, A.R. Discrimination procedures for separate families of hypotheses. J. Am. Stat. Assoc. 1973, 68, 970–974. [Google Scholar] [CrossRef]
  8. Weibull, W. A statistical theory of the strength of materials. R. Swed. Inst. Eng. Res. 1939, 151, 7. [Google Scholar]
  9. Gupta, R.D.; Kundu, D. Theory & methods: Generalized exponential distributions. Aust. N. Z. J. Stat. 1999, 41, 173–188. [Google Scholar]
  10. Gupta, R.D.; Kundu, D. A new class of weighted exponential distributions. Statistics 2009, 43, 621–634. [Google Scholar] [CrossRef]
  11. Makhdoom, I. Estimation of r = p[y<x] for weighted exponential distribution. J. Appl. Sci. 2012, 12, 1384–1389. [Google Scholar]
  12. Dey, S.; Ali, S.; Park, C. Weighted exponential distribution: Properties and different methods of estimation. J. Stat. Comput. Simul. 2015, 85, 3641–3661. [Google Scholar] [CrossRef]
  13. Tian, W.; Yang, Y. Change point analysis for weighted exponential distribution. Commun. Stat.-Simul. Comput. 2021, 1–13. [Google Scholar] [CrossRef]
  14. Gupta, R.D.; Kundu, D. Closeness of gamma and generalized exponential distribution. Commun. Stat.-Theory Methods 2003, 32, 705–721. [Google Scholar] [CrossRef]
  15. Gupta, R.D.; Kundu, D. Discriminating between Weibull and generalized exponential distributions. Comput. Stat. Data Anal. 2003, 43, 179–196. [Google Scholar] [CrossRef]
  16. Pakyari, R. Discriminating between generalized exponential, geometric extreme exponential and Weibull distributions. J. Stat. Comput. Simul. 2010, 80, 1403–1412. [Google Scholar] [CrossRef]
  17. Raqab, M.Z. Discriminating between the generalized Rayleigh and Weibull distributions. J. Appl. Stat. 2013, 40, 1480–1493. [Google Scholar] [CrossRef]
  18. Elsherpieny, E.A.; Muhammed, H.Z.; Radwan, N.U.M.M. On discriminating between gamma and log-logistic distributions in case of progressive type II censoring. Pak. J. Stat. Oper. Res. 2017, 13, 157–183. [Google Scholar] [CrossRef]
  19. Ahmad, M.A.; Raqab, M.Z.; Kundu, D. Discriminating between the generalized Rayleigh and Weibull distributions: Some comparative studies. Commun. Stat.-Simul. Comput. 2017, 46, 4880–4895. [Google Scholar] [CrossRef]
  20. Raqab, M.Z.; Al-Awadhi, S.A.; Kundu, D. Discriminating among Weibull, log-normal, and log-logistic distributions. Commun. Stat.-Simul. Comput. 2018, 47, 1397–1419. [Google Scholar] [CrossRef]
  21. Vaidyanathan, V.S.; Varghese, A.S. Discriminating Between Exponential and Lindley Distributions. J. Stat. Theory Appl. 2019, 18, 295–302. [Google Scholar] [CrossRef]
  22. Paul, B.; De, S.K.; Kundu, D. A sequential sampling approach for discriminating log-normal, Weibull, and log-logistic distributions. Commun. Stat.-Simul. Comput. 2021, 1–23. [Google Scholar] [CrossRef]
  23. Wald, A. Sequential tests of statistical hypotheses. Annu. Math. Stat. 1945, 16, 117–186. [Google Scholar] [CrossRef]
  24. R Core Team. R: A Language and Environment for Statistical Computing; R Foundation for Statistical Computing: Vienna, Austria, 2019. [Google Scholar]
  25. Andersen, P.K.; Borgan, O.; Gill, R.D.; Keiding, N. Statistical Models Based on Counting Processes; Springer Science & Business Media: Berlin/Heidelberg, Germany, 2012. [Google Scholar]
  26. Scheike, T.H.; Zhang, M.J. Analyzing competing risk data using the R timereg package. J. Stat. Softw. 2011, 38, i02. [Google Scholar] [CrossRef] [PubMed]
  27. Nadarajah, S. A truncated inverted beta distribution with application to air pollution data. Stoch. Environ. Res. Risk Assess. 2008, 22, 285–289. [Google Scholar] [CrossRef]
Figure 1. The pdfs, cdfs, and hazard functions of GE(3.92, 1.85), WE(0.58, 1.68), and WB(1.80, 0.96).
Figure 1. The pdfs, cdfs, and hazard functions of GE(3.92, 1.85), WE(0.58, 1.68), and WB(1.80, 0.96).
Mathematics 11 03847 g001
Figure 2. The pdfs, cdfs, and hazard functions of GE(2.02, 0.95), WE(1.63, 0.88), and WB(1.43, 0.62).
Figure 2. The pdfs, cdfs, and hazard functions of GE(2.02, 0.95), WE(1.63, 0.88), and WB(1.43, 0.62).
Mathematics 11 03847 g002
Figure 3. Histogram of the malignant melanoma data sets and the density functions of the fitted GE, WE, and WB models on the left, and estimated cdfs for the data set of the malignant melanoma on the right.
Figure 3. Histogram of the malignant melanoma data sets and the density functions of the fitted GE, WE, and WB models on the left, and estimated cdfs for the data set of the malignant melanoma on the right.
Mathematics 11 03847 g003
Figure 4. Histogram of the daily ozone data set and the density functions of the fitted GE, WE, and WB models on the left, and estimated cdfs for the data set of daily ozone on the right.
Figure 4. Histogram of the daily ozone data set and the density functions of the fitted GE, WE, and WB models on the left, and estimated cdfs for the data set of daily ozone on the right.
Mathematics 11 03847 g004
Table 1. The PCSs based on the Monte Carlo simulation when using data from the WE distribution.
Table 1. The PCSs based on the Monte Carlo simulation when using data from the WE distribution.
( η , λ )PCS20406080100200500
(5, 0.5)RML0.50720.59560.64460.67000.69760.75270.8570
MKD0.39920.45620.50120.55630.61120.70020.7882
(5.2, 0.5)RML0.52160.61020.65220.68620.69900.76420.8552
MKD0.40020.45770.50320.55750.61550.71560.7889
(5.4, 0.5)RML0.52880.61360.65440.68440.71140.76540.8628
MKD0.41320.45850.50450.55840.62250.72260.7995
(5.8, 0.5)RML0.53640.62640.66300.69500.71520.77700.8702
MKD0.42350.45980.51120.55980.63100.72500.8002
(5, 1)RML0.51520.60660.64860.67600.69660.75820.8588
MKD0.40050.45870.50220.55890.61360.71250.7902
(5.2, 1)RML0.52360.61920.65400.68720.69960.76660.8564
MKD0.40500.45920.50440.55910.61620.71630.7905
(5.4, 1)RML0.53400.62160.65940.68960.71320.76920.8666
MKD0.41380.45980.51990.55890.62550.73200.8002
(5.8, 1)RML0.53860.63480.67400.69720.71580.78020.8744
MKD0.42440.46080.52200.56230.63320.73650.8115
(5, 1.5)RML0.51760.60720.65840.68560.70300.76020.8596
MKD0.40100.45990.50560.56230.61880.72230.7956
(5.2, 1.5)RML0.53120.62240.65620.69180.70940.76900.8654
MKD0.41120.46020.50890.56330.62200.72280.7998
(5.4, 1.5)RML0.54220.62400.66220.69140.71640.77280.8710
MKD0.41890.46250.52320.56620.63050.73350.8156
(5.8, 1.5)RML0.54540.63580.66940.70060.71880.78300.8774
MKD0.42560.46680.52380.56710.63540.73990.8226
Table 2. The PCSs based on the Monte Carlo simulation when using data from the GE distribution.
Table 2. The PCSs based on the Monte Carlo simulation when using data from the GE distribution.
( γ , θ )PCS20406080100200500
(2.5, 0.5)RML0.46900.54600.61240.66120.70700.81680.9482
MKD0.37460.41560.46200.50420.52180.64100.8226
(2.7, 0.5)RML0.50560.61760.66980.72760.77060.88740.9808
MKD0.40980.47580.50400.55540.59340.73820.9170
(2.9, 0.5)RML0.55600.65780.72520.77740.80940.92000.9866
MKD0.41160.48890.50510.55610.59550.73920.9188
(3.3, 0.5)RML0.59300.71020.77280.81800.85120.93700.9932
MKD0.42250.49020.50660.55750.59650.73990.9192
(2.5, 1)RML0.48620.55960.61820.66480.70820.81920.9504
MKD0.37560.41740.46480.50520.52400.64680.8304
(2.7, 1)RML0.51380.61840.67060.73340.77880.88780.9812
MKD0.41050.47680.50630.55590.59960.73880.9220
(2.9, 1)RML0.55760.66200.72760.78120.81120.92120.9896
MKD0.42200.49920.50650.56020.59980.74010.9232
(3.3, 1)RML0.60060.71360.77660.81860.85320.93760.9936
MKD0.42360.49330.50740.56780.60020.74110.9235
(2.5, 1.5)RML0.48940.56040.62240.66960.71880.82540.9510
MKD0.38040.42380.46900.50700.52600.65140.8322
(2.7, 1.5)RML0.51540.62780.67900.73540.78300.88920.9842
MKD0.41880.47770.50770.56130.60050.74160.9235
(2.9, 1.5)RML0.55920.67160.73260.78480.81280.92380.9910
MKD0.42220.50090.51120.56220.60150.75540.9245
(3.3, 1.5)RML0.60980.72080.78200.82460.85360.93980.9954
MKD0.42880.49550.51420.56880.61160.76230.9255
Table 3. The PCSs based on the Monte Carlo simulation when using data from the WB distribution.
Table 3. The PCSs based on the Monte Carlo simulation when using data from the WB distribution.
( γ , θ )PCS20406080100200500
(2.5, 0.5)RML0.73560.82640.88940.92640.94580.98881
MKD0.43260.61580.72060.78820.83240.94180.9956
(2.7, 0.5)RML0.76600.86000.90580.93680.95320.99441
MKD0.46280.64060.75640.81060.85900.95760.9978
(2.9, 0.5)RML0.76980.86420.92140.94520.96820.99541
MKD0.48160.65660.76520.82500.86880.96300.9990
(3.3, 0.5)RML0.79460.88500.93480.96340.97820.99841
MKD0.52600.68800.80440.86040.90740.97820.9996
(2.5, 1)RML0.74060.82840.89680.92940.94800.99141
MKD0.44500.62140.72320.79120.84160.94420.9958
(2.7, 1)RML0.76280.86100.90840.93960.95880.99481
MKD0.46330.64230.76120.82260.85990.96620.9981
(2.9, 1)RML0.77460.86800.92280.94620.96960.99661
MKD0.48220.65890.76680.82880.86960.96760.9995
(3.3, 1)RML0.79960.88680.93500.96640.97840.99861
MKD0.53110.68850.81230.86980.91220.98810.9998
(2.5, 1.5)RML0.74680.84560.89900.93020.95220.99221
MKD0.44660.62340.72520.79300.84260.94640.9970
(2.7, 1.5)RML0.76720.86940.90880.94420.96040.99541
MKD0.46650.64660.76550.82390.86230.97020.9985
(2.9, 1.5)RML0.77780.87680.92660.95200.97140.99741
MKD0.48360.66120.76700.82950.87020.97020.9998
(3.3, 1.5)RML0.80020.89120.93840.96780.98040.99921
MKD0.53260.68920.81330.87060.92220.99051
Table 4. Comparing the method of RML discrimination and the SPRT procedure when using data from the WE distribution.
Table 4. Comparing the method of RML discrimination and the SPRT procedure when using data from the WE distribution.
True DistributionBoundaries | b j | = a j = | ln ( α 2 ) | PCS SPRT P C S R M L E ( t ) n RML Savings %
WE (5, 0.5)2.99570.96620.9605521.3380034.83
2.30260.90090.9025483.6670030.91
WE (5.2, 0.5)2.99570.94470.9436506.6675032.45
2.30260.91180.9156452.2266034.63
WE (5.4, 0.5)2.99570.95020.9500485.9969029.57
2.30260.91000.9164421.2061030.95
WE (5.8, 0.5)2.99570.94090.9438458.6664028.33
2.30260.90090.9066396.8856029.13
Table 5. Comparing the method of RML discrimination and the SPRT procedure when using data from the GE distribution.
Table 5. Comparing the method of RML discrimination and the SPRT procedure when using data from the GE distribution.
True DistributionBoundaries | b j | = a j = | ln ( α 2 ) | PCS SPRT PCS RML E ( t ) n RML Savings %
GE (2.5, 0.5)2.99570.94200.9446452.2278042.02
2.30260.90020.9056411.3266037.68
GE (2.7, 0.5)2.99570.95000.9588422.6071040.48
2.30260.91180.9156395.6662036.18
GE (2.9, 0.5)2.99570.94440.9494400.2265038.43
2.30260.91030.9111378.6058034.72
GE (3.3, 0.5)2.99570.95990.9556385.8960035.69
2.30260.91010.9119346.2255037.05
Table 6. Comparing the method of RML discrimination and the SPRT procedure when using data from the WB distribution.
Table 6. Comparing the method of RML discrimination and the SPRT procedure when using data from the WB distribution.
True DistributionBoundaries | b j | = a j = | ln ( α 2 ) | PCS SPRT PCS RML E ( t ) n RML Savings %
WB (2.5, 0.5)2.99570.94330.9450243.2250051.36
2.30260.90060.9063205.1238046.02
WB (2.7, 0.5)2.99570.93360.9380228.6245049.20
2.30260.91120.9156186.2334045.23
WB (2.9, 0.5)2.99570.95030.9551206.8841049.54
2.30260.90500.9066206.8832035.35
WB (3.3, 0.5)2.99570.93330.9368206.8838045.56
2.30260.90340.9067126.3329056.44
Table 7. Summary of the RML, the MKD, and the SPRT procedures and the goodness of fit test of the malignant melanoma data.
Table 7. Summary of the RML, the MKD, and the SPRT procedures and the goodness of fit test of the malignant melanoma data.
ModelGEWEWB
shape parameters’ estimation γ ^ = 2.3371 η ^ = 2.1321 β ^ = 1.8895
scale parameters’ estimation θ ^ = 0.2712 λ ^ = 0.2152 ξ ^ = 0.1521
T n 1 −6.6095
T n 2 23.4740
T n 3 16.8645
Λ t 1 2.1162
Λ t 2 −2.3319
Λ t 3 −2.3102
AIC1081.27001094.48701047.5400
BIC1087.91601101.13301054.1860
MKD0.17000.18130.1127
Table 8. Summary of the RML, the MKD, the SPRT procedures, and the goodness of fit test of the daily ozone data.
Table 8. Summary of the RML, the MKD, the SPRT procedures, and the goodness of fit test of the daily ozone data.
ModelGEWEWB
shape parameters’ estimation γ ^ = 1.7960 η ^ = 9.4201 β ^ = 1.3402
scale parameters’ estimation θ ^ = 0.0337 λ = 0.0205 ξ ^ = 0.0217
T n 1 0.4501
T n 2 −1.6644
T n 3 −1.2143
Λ t 1 −2.3088
Λ t 2 2.3112
Λ t 3 2.0036
AIC1086.79201085.89201089.2210
BIC1092.29901091.39901094.7280
MKD0.08460.05640.0900
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Niu, R.; Tian, W.; Zhang, Y. Discriminating among Generalized Exponential, Weighted Exponential and Weibull Distributions. Mathematics 2023, 11, 3847. https://doi.org/10.3390/math11183847

AMA Style

Niu R, Tian W, Zhang Y. Discriminating among Generalized Exponential, Weighted Exponential and Weibull Distributions. Mathematics. 2023; 11(18):3847. https://doi.org/10.3390/math11183847

Chicago/Turabian Style

Niu, Ruizheng, Weizhong Tian, and Yunchu Zhang. 2023. "Discriminating among Generalized Exponential, Weighted Exponential and Weibull Distributions" Mathematics 11, no. 18: 3847. https://doi.org/10.3390/math11183847

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop