1. Introduction
Burr devised a dynamic family of probability distributions based on the Pearson differential equations. The Burr XII (BXII) and Burr III (BIII) distributions are widely used models from the system of Burr distributions. On the contrary, according to [
1], the Burr X (BX) model has also gained much attention from applied statisticians along with the BXII and BIII models. The prime reason is that these densities exists in simpler forms and can yield a range of shapes to model a variety of scenarios in diverse scientific fields. The authors in [
2] are of the view that the most adaptable of these three is BIII, especially in environmental, reliability, and survival sciences. The BIII distribution is also called the Dagum distribution in studies of income, wage, and wealth distribution [
3]. In the actuarial literature, it is known as the inverse Burr distribution [
4] and the kappa distribution in the meteorological literature [
5]. As per [
4], it is a prime case of the four-parameter generalised Beta-II distribution. In order to follow the ambit regarding the scope of this provision, we now shift our attention to the BIII distribution. For a random variable
X defined on a positive real line, the cumulative distribution function (cdf) and probability density function (pdf) of two-parameter BIII distribution, respectively, are given below:
and
where
are the shape parameters.
The shape parameter plays a significant role in yielding the hazard rate of BIII distribution, which can be decreasing or unimodal. Thus, it cannot be used to model lifetime data with a bathtub-shaped hazard function, such as human mortality and deterioration modelling. For the last few decades, statisticians have been developing various extensions and modifications in Weibull distribution due to its simple functional form. The two-parameter flexible Weibull extension of [
6] has a hazard function that can be increasing, decreasing, or bathtub shaped. Zhang and Xie [
7] studied the characteristics and application of the truncated Weibull distribution, which has a bathtub-shaped hazard function. A three-parameter model, called exponentiated Weibull distribution, was introduced by [
8]. Another three-parameter model is referred to as the extended Weibull distribution by [
9]. Xie et al. [
10] proposed a three-parameter modified Weibull extension with a bathtub-shaped hazard function. A new modified Weibull distribution by the authors in [
11] has been presented with increasing and a bathtub-shaped hazard function.
Various extensions of BIII distribution have been studied in the literature. In reference [
12], the authors studied low-flow frequency analysis in hydrology with three-parameter-modified BIII distribution with supreme interest in the lower tail of a distribution. Çankaya et al. [
13] extended the BIII model by adding a skew parameter with an epsilon skew extension approach. Modi and Gill [
14] introduced the unit BIII model. Haq et al. [
15] introduced the unit-modified BIII model. Ali et al. [
16] re-parameterized BIII distribution and proposed the modified BIII (MBIII) distribution with the following cdf:
where
, and
are the shape parameters. The authors claimed that the newly structured model is a limiting case of generalized inverse Weibull, BIII, and log-logistic distribution. Still, the density of the improved model can only model positively skewed data, which greatly dented the proposition of the model in the first place. Other extensions are mostly based on the generalized families of distributions that sare complex in nature. Some of them are mentioned as: Beta Dagum by [
17], Modified BIII by [
18], Marshall Olkin BIII by [
19], Gamma BIII by [
20], and Gamma BIII by [
21]. However, we feel that a flexible model with computationally simpler functional forms is still presently needed. Motivated by a lack of availability of literature related to the modified BIII distribution, we present a much more flexible new modification of BIII distribution. The cdf of the new, modified BIII (NMBIII) distribution is defined as
where the
is the additional factor, with
as the rate parameter and
are power parameters of the baseline model.
It is worth mentioning that when we use the additional term to add flexibility in the model, we specifically refer to the ability of the proposed model to fit a diverse range of real life phenomena. Additionally, flexibility may also be associated with the instantaneous failure rate or hazard rate, and is more commonly known as risk function. By selecting precise values for the shape parameters, the hazard rate function of the NMBIII distribution can take on a variety of appealing shapes. Generally speaking, the classical models deal with normal extreme observations. A new modification of BIII distribution will also enable us to observe the tail behaviour of the distribution, which is skewed in nature. Further, the BIII distribution has a monotonic decreasing and unimodal hazard rate function, but due to its modification, NMBIII has monotonic, decreasing, increasing, unimodal, bathtub, and approximately constant hazard-rate shapes. Moreover, many standard distributions are nested models or limiting cases of the Burr system of distributions, which include the Weibull, exponential, logistic, generalised logistic, Gompertz, normal, extreme value, and uniform distributions. The NMBIII distribution outperforms most of these competitive existing models. When
NMBIII distribution reduces to BIII distribution. When
and
, then NMBIII distribution gives us log-logistic distribution. When
, then NMBIII distribution gives us modified log-logistic distribution (new). When
and
, the NMBIII distribution reduces to logistic distribution. When
, it reduces to modified skew logistic distribution (new). When
and
it reduces to generalized logistic distribution type I or Burr type II, or this type has also been called the “skew-logistic” distribution (see [
22]). In a nutshell, with the proposed NMBIII, we seek and hope to attract applied researchers from all scientific community to utilize it in the significant modelling of real-life scenarios.
The article is structured as follows: In
Section 2, we focus our attention on the idea behind the new modification. {In
Section 3, we acquaint the readers with some of the structural properties including the linear expansion, moments, mode, moment-generating functions, order statistics, and stochastic ordering of NMBIII distribution. In
Section 4, model parameters are estimated by maximum likelihood method, and the Fisher information matrix is derived.
Section 5 gives the simulation method based on complete and incomplete samples (middle censored). In
Section 6, three data sets on complete and middle-censored data sets have been employed to established the authenticity of the proposed model to the readers.
Section 7 consists of the concluding remarks and discussions.
2. The New Modified BIII Model
The modified Weibull (MW) distribution (see [
23] has the cumulative survival function that is the product of the Weibull cumulative hazard function
and
. Hence, the distribution function was found to be
which was later generalized to exponentiated form by [
24] using Lehmann alternative-I.
In the same vein, Equation (
4) has been modified. The pdf corresponding to (
4) is given as:
The corresponding survival and hazard functions of NMBIII are, respectively, given by:
and
If a new random variable
y is defined as
in Equation (
4), then we obtain the following model, referred to as modified Burr XII distribution, with cdf and pdf, respectively, as under
and
As far as we can tell, Equations (
4) and (
8) are first modifications of BIII distribution and BXII distributions, respectively. Thus, the proposed distribution in (
4) is more flexible and has tractable tail properties than its parent BIII distribution as well as MBIII distributions. The shapes of pdf and hrf are presented in
Figure 1 and
Figure 2, respectively.
Figure 1 represents the different shapes of the proposed model, i.e., bimodal, reversed-J, right skewed, approximate left-skewed, and symmetrical shapes for different parameter values.
Figure 2 reflects the different shapes of hazard function, which are increasing, decreasing, bathtub, upside-down bathtub, and nearly constant for different parameter values. The proposed distribution is more flexible and tractable than its parent BIII distribution, as well as MBIII distributions (see in
Table 1).
5. Middle-Censoring
The middle-censoring scheme is a non-parametric general censoring mechanism proposed by [
27], where other censoring schemes can be obtained as special cases of this middle-censoring scheme (see [
28]).
For n identical lifetimes with a random censoring interval at the item with some unknown bivariate distribution. Then, the exact value of is observable only if ; otherwise, the interval is observed.
Middle-censoring had previously been applied to exponential and Burr XII lifetime distributions (see [
28,
29]). Furthermore, it was extended to parametric models with covariates [
30], and its robustness was investigated by [
31].
In this section, we analyse the NMBIII lifetime data when they are middle-censored. Assume that are i.i.d. NMBIII random variable and let be another random variable that defines the length of the censoring interval with exponential distribution with mean , where the left-censoring point for each individual is assumed to also be an exponential random variable with mean . Moreover, the , and are all independent of each other and the observed data, and are given by
5.1. Estimation
For
n randomly selected units from the NMBIII
population, where
c,
,and
k are unknown, were tested under middle-censoring scheme. In this setting, there are
uncensored observations and
censored observations. Then, by re-ordering the observed data into the uncensored and censored observations, we therefore have the following data
where
The likelihood function of the observed data is given by:
where
is a normalizing constant depending on
and
, and the estimation of them is not of interest and this is left as a constant. The log-likelihood function is given by
The maximum-likelihood estimation (MLE) of
c,
,and
k, denoted by
, and
, can be derived by solving the following equations:
and
It is obvious that the MLE of c, ,and k cannot be solved explicitly. Therefore, the solutions can be obtained using Newton–Raphson method or numerically using the solve systems of nonlinear equations “nleqslv” package in R.
Since the MLE is asymptotically normal, the approximate confidence intervals for the parameters c, and k can be computed as follows: and , where are the variances of the respective parameters , and , and is the value of the standard normal curve and is the level of significance.
5.2. Simulation Results
We conducted Monte Carlo simulation studies to assess the finite sample behaviour of the MLEs of the parameters based on two settings; the first is the random variable generated from the NMBIII distribution, while the other considers the case where the NMBIII lifetime data were middle-censored.
The random samples for both settings were generated from distribution NMBIII based on accept-reject approach. Without loss of generality, random samples were used with five different sizes viz n = 10, 30, 50, 70, and 100 from NMBIII distribution with parameters .
The middle censoring settings considered three combinations of the censoring schemes , , and .
The results were obtained from 1000 Monte Carlo replications from simulations carried out using the software R, and the average estimates and the mean squared error (MSE) are obtained and reported in
Table 3.
Results in
Table 3 show that the ML estimates for both settings behave similarly. In general, there is a decreasing function between the sample size and the mean squared error, which verifies the consistency property of the derived estimators. The average estimates are insignificantly effected by the censoring status.
6. Applications
This section provides three applications for complete data sets to show how the NMBIII distribution can be applied in practice. We compare NMBIII distribution to MBIII, BIII, Weibull (W), Gamma (Ga), Lognormal (LN), Generalized Weibull (EW), and Generalised Extreme value type-II (GEV-II) distributions. In these applications, the model parameters are estimated by the method of maximum likelihood. The Akaike information criterion (AIC), Bayesian information criterion (BIC), A*(Anderson Darling), and W*(Cramer–von Mises) are computed to compare the fitted models. In general, the smaller the values of these statistics, the better the fit to the data. Additionally, the asymptotic variance-covariance matrices of the NMBIII parameters are also provided. The plots of the fitted PDFs, CDFs, Probability–Probabibility (PP), and Quantile–Quantile (QQ) of NMBIII are displayed for visual comparison. The required computations are carried out in the R software.
The first data set consists of 119 observations on fracture toughness of Alumina (Al
O
) (in the units of MPa m
. These data were studied by [
32]. The second data set refers to the material thickness of hole (12 mm) and sheet (3.15 mm), comprising 50 observations, as reported by authors in [
33]. The third data set was first analysed by [
34] and represents the survival times, in weeks, of 33 patients suffering from Acute Myelogenous Leukaemia.
The variance–covariance matrix of the MLEs of the NMBIII distribution for data set 1 is
The variance–covariance matrix of the MLEs of the NMBIII distribution for data set 2 is
The variance–covariance matrix of the MLEs of the NMBIII distribution for data set 3 is
7. Conclusions
A good theory should seek out the most concise explanation for the facts. With this in mind, a new modified form of BIII distribution has been introduced that can model well-specified forms of hazard rate shapes, including increasing, decreasing, bathtub, upside-down bathtub, and nearly constant. Some of its statistical properties, such as, rth moment, sth incomplete moment, moment generating function, skewness, kurtosis, mode, ith order statistics, and stochastic ordering have been derived. The maximum likelihood estimation is employed to estimate the model parameters. The usefulness of this model is demonstrated by applications on complete and censored samples. Simulation study is also performed. A future effort would include the contributions of new regression models, Bayesian parameter estimations, and research into diversified fields of data sets.