Next Article in Journal
Multiple Phase Stepping Generation in Alkali Metal Atoms: A Comparative Theoretical Study
Previous Article in Journal
Transpilers: A Systematic Mapping Review of Their Usage in Research and Industry
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

A New Generalized Logarithmic–X Family of Distributions with Biomedical Data Analysis

1
Department of Statistics, Abdul Wali Khan University, Mardan 23200, Pakistan
2
Department of Analytics in the Digital Era, United Arab Emirates University, Al Ain 15551, United Arab Emirates
3
Department of Statistics, Shaheed Benazir Bhutto Women University, Peshawar 2500, Pakistan
4
Department of Technology, The University of Lahore, Lahore 54590, Pakistan
5
Innovation Education and Research Center for On-Device AI Software (Bk21), Department of Computer Science and Engineering, Chungnam National University, Daejeon 34134, Republic of Korea
6
Department of Computer Science and Engineering, Chungnam National University, Daejeon 34134, Republic of Korea
*
Author to whom correspondence should be addressed.
Appl. Sci. 2023, 13(6), 3668; https://doi.org/10.3390/app13063668
Submission received: 6 February 2023 / Revised: 2 March 2023 / Accepted: 9 March 2023 / Published: 13 March 2023

Abstract

:
In this article, an attempt is made to propose a novel method of lifetime distributions with maximum flexibility using a popular T–X approach together with an exponential distribution, which is known as the New Generalized Logarithmic-X Family (NGLog–X for short) of distributions. Additionally, the generalized form of the Weibull distribution was derived by using the NGLog–X family, known as the New Generalized Logarithmic Weibull (NGLog–Weib) distribution. For the proposed method, some statistical properties, including the moments, moment generating function (MGF), residual and reverse residual life, identifiability, order statistics, and quantile functions, were derived. The estimation of the model parameters was derived by using the well-known method of maximum likelihood estimation (MLE). A comprehensive Monte Carlo simulation study (MCSS) was carried out to evaluate the performance of these estimators by computing the biases and mean square errors. Finally, the NGLog–Weib distribution was implemented on four real biomedical datasets and compared with some other distributions, such as the Alpha Power Transformed Weibull distribution, Marshal Olkin Weibull distribution, New Exponent Power Weibull distribution, Flexible Reduced Logarithmic Weibull distribution, and Kumaraswamy Weibull distribution. The analysis results demonstrate that the new proposed model performs as a better fit than the other competitive distributions.

1. Introduction

To model and analyse lifetime data, probability distributions are essential tools, particularly in the area of biomedical science; see Liu et al. [1]. Various probability distributions have been suggested for modelling lifetime phenomena; for example, the Exponential (Expo), Rayleigh (Ray), and Weibull (Weib) distributions. However, the performance of these aforementioned classical distributions is affected when the phenomena are complex. For illustration, the commonly used Expo distribution performs well for modelling lifetime phenomena based on a constant hazard rate function (HRF), whereas the Ray distribution has the ability to model lifetime phenomena having only an increasing HRF. The Weib distribution is known for its performance, as it has the characteristics of both the Expo and Ray distributions. The Weib distribution can model the phenomena having a monotone (increasing, decreasing, and constant) HRF (see Almalki and Yuan [2], and Liao et al. [3]). However, when the phenomena follow non-monotonic (unimodal, modified unimodal, and bathtub-shaped) HRFs, the Weib distribution may not be a good option. In medical cases, e.g., breast cancer, neck cancer, and bladder cancer, the HRF of the phenomena is shown to have a uni-modal or modified uni-modal shape. The HRF for breast cancer, bladder cancer, and neck cancer has been identified as a uni-modal shape after surgical removal. The cancer recurrence HRF begins to increase in the early stage and gradually decreases after a definite period of time when removed by surgery (Demicheli et al. [4]). Another example of a uni-modal failure rate is infection with some new viruses, where the initial phase increases and then decreases; see, for example, (Liu et al. [1]).
In the recent past, researchers have contributed toward developing new families of continuous probability distributions that are flexible in nature. These flexible distributions are derived by incorporating some additional parameters into the baseline distributions. Here, we refer to some of the newly proposed families of lifetime distributions, including: the Marshall–Olkin Weibull–Generated (MO–Weib–G) family of probability distributions proposed by Klakattawi et al. [5], the Shifted Exponential–G (SExpo–G) family of distributions proposed by Eghwerido et al. [6], the Generalization of Gull Alpha Power (GGAP) family of probability distributions proposed by Kilai et al. [7], the New Flexible Logarithmic–X (NFLog–X) family of probability distributions proposed by Alkhairy et al. [8], the Arcsine–X family of probability distributions proposed by Tung et al. [9], the Truncated family of distributions proposed by Alzaatreh et al. [10], the Transmuted Alpha Power–G (TAP–G) family of Eghwerido et al. [11], the Fréchet Topp Leone–G (FTL–G) family of Reyad et al. [12], the New Extended-Family (NEx-F) of distributions proposed by Zichuan et al. [13], a New Lifetime Exponential–X (NLE–X) family proposed by Huo et al. [14], a New Modified Exponent Power Alpha (NMEPA) family of distributions proposed by Shah et al. [15], and a generalized Alpha Exponent Power (GAEP) Family of distributions proposed by Hussain et al. [16]. For more reading about probability distributions, see Chen et al. [17], Xu et al. [18], Zhang et al. [19], Luo et al. [20], and Zhuang et al. [21].
Zhao et al. [22] have introduced a new method of lifetime distributions by incorporating an additional parameter using the T–X family method (Alzaatreh et al. [23]). The CDF (cumulative distribution function) is given by
G ( x ; δ , Θ ) = 1 ( 1 F ( x ; Θ ) 1 ( 1 δ ) F ( x ; Θ ) ) δ x
where δ + is the shape parameter, and F ( x ; Θ ) is the CDF of any baseline distribution.
Wang et al. [24] have suggested a new logarithmic family of lifetime distributions, and its CDF is
G ( x ; δ , θ , Θ ) = log [ e δ ( 1 F ( x ; Θ ) θ ) ( e δ 1 ) ] δ x
where δ + and θ + are the additional parameters, and F ( x ; Θ ) is the CDF of any baseline or classical distribution.
Likewise, another approach has been recommended for the lifetime phenomena modelling named the Log–U (Logarithmic–U) of distributions by Zhao et al. [25]. The CDF of the Log–U family is
G ( x ; θ , α , Θ ) = 1 ( 1 α F ( x ; Θ ) α [ log F ( x ; Θ ) ] ) θ x
where α + and θ + are the additional parameters, and F ( x ; Θ ) is the CDF of any baseline or classical distribution.
In this paper, we carry this approach of distribution theory and propose a new family of probability distributions by implementing the T–X family approach (Alzaatreh et al. [23]). The proposed family of distributions gives an appropriate fit for biomedical datasets. Additionally, this study empirically shows that the new extension of the Weib distribution offers an efficient fit to the considered datasets in comparison with well-known distributions (see Section 6).
The work carried out in this paper is outlined and organized in the following sections: In Section 2, the new method of lifetime distributions is derived. In Section 3, a three-parameter special subcase of the proposed class named a New Generalized Logarithmic Weibull (NGLog–Weib) distribution is derived, and the shape of the CDF, PDF (probability density function), and HF (hazard function) are also discussed in the same section. In Section 4, some mathematical and statistical properties of the proposed approach are derived. In Section 5, the estimation methods for estimating the modal parameters are used, as well as in the same section, a brief MCSS is also conducted to assess the behaviour of these estimates. The implementations of the novel-introduced methods on four biomedical datasets are discussed in Section 6.

2. The NGLog–X Family

Let r ( t ) be the PDF of a RV (random variable), say T , where T [ π 1 , π 2 ] for π 1 < π 2 < , and let suppose that K [ F ( x ; Θ ) ] be a function of F ( x ; Θ ) CDF of a RV, say X , satisfying the conditions given bellow:
i.
K [ F ( x ; Θ ) ] ( π 1 , π 2 ) ;
ii.
K [ F ( x ; Θ ) ] is monotonically increasing and differentiable; and
iii.
K [ F ( x ; Θ ) ] π 1 as x and K [ F ( x ; Θ ) ] π 2 as x .
According to (Alzaatreh et al. [23]), the CDF G ( x ) of the T–X family is defined by
G ( x ) = π 1 K [ F ( x ; Θ ) ] r ( t ) d t , x .
The g ( x ) PDF corresponding to (4) is given by
g ( x ) = { d d x K [ F ( x ; Θ ) ] } r { K [ F ( x ; Θ ) ] } , x .
Using the T–X idea, several novel classes of distributions have been proposed in the literature. Table 1 lists different K [ F ( x ; Θ ) ] functions for special members of the T–X family.
Table 1. Different K [ F ( x ; Θ ) ] functions for some members of the T–X family.
Table 1. Different K [ F ( x ; Θ ) ] functions for some members of the T–X family.
K [ F ( x ; Θ ) ] Support of XMember of T–X Family
F ( x ; Θ ) [ 0 , 1 ] Beta–G (Eugen et al. [26])
log [ F ( x ; Θ ) ] ( 0 , ) Gamma–G type 1 (Zografos and Balakrishnan [27])
log [ 1 F ( x ; Θ ) ] ( 0 , ) Gamma–G type 2 (Ristic and Balakrishnan [28])
F ( x ; Θ ) 1 F ( x ; Θ ) ( 0 , ) Gamma–G type 3 (Torabi and Montazeri [29])
log [ log { 1 F ( x ; Θ ) } ] ( , ) Logistic–X family (Tahir et al. [30])
log ( 1 F ( x ; Θ ) e F ( x ; Θ ) ) ( 0 , ) Weighted T–X family (Ahmad et al. [31])
log [ e ( 1 K ( x , Θ ) ) 1 e ( 1 K ( x , Θ ) ) ] ( 0 , ) New Expo–X family (Shah et al. [32])
log ( e δ   F ( x ; Θ ) ( e log F ( x ; Θ ) ) δ ) ( 0 , ) NGLog–X family (proposed)
Now, taking inspiration from Equation (4), we propose the novel-introduced family of distributions. Let T ~ exp ( 1 ) , and then its CDF is given by
R ( t ) = 1 e t ,   t 0
The PDF corresponding to Equation (5) is
r ( t ) = e t ,   t > 0
Setting K [ F ( x ; Θ ) ] = log ( e δ   F ( x ; Θ ) ( e log F ( x ; Θ ) ) δ ) and r ( t ) as Equation (6) in Equation (4), we obtain the G ( x ; δ , Θ ) CDF of the New Generalized Logarithmic–X (NGLog–X) family of lifetime distributions, which is given by
G ( x ; δ , Θ ) = e δ   F ( x ; Θ ) ( e log F ( x ; Θ ) ) δ ,   δ + , x
where δ + is an extra shape parameter, x and F ( x ; Θ ) is the CDF of any baseline (or existing) probability distribution which may depend on a parameter vector Θ .
The PDF g ( x ; δ , Θ ) = d d x G ( x ; δ , Θ ) corresponding to Equation (7) is given by
g ( x ; δ , Θ ) = f ( x ; Θ ) e δ { 1 + δ ( e log F ( x ; Θ ) ) 1 } ( e log F ( x ; Θ ) ) δ , δ + , x
where, d d x F ( x ; Θ ) = f ( x ; Θ )
Furthermore, for δ + and x , the SF (survival function) S ( x ; δ , Θ ) = 1 G ( x ; δ , Θ ) , HF (hazard function) h ( x ; δ , Θ ) = g ( x ; δ , Θ ) 1 G ( x ; δ , Θ ) , RHF (reverse hazard function) τ ( x ; δ , Θ ) = g ( x ; δ , Θ ) G ( x ; δ , Θ ) , and CHF (cumulative hazard function) H ( x ; δ , Θ ) = log [ 1 G ( x ; δ , Θ ) ] of the NGLog–X family are shown in Equations (9)–(12):
S ( x ; δ , Θ ) = 1 e δ   F ( x ; Θ ) ( e log F ( x ; Θ ) ) δ ,
h ( x ; δ , Θ ) = f ( x ; Θ ) e δ { 1 + δ ( e log F ( x ; Θ ) ) 1 } ( e log F ( x ; Θ ) ) δ e δ   F ( x ; Θ ) ,
τ ( x ; δ , Θ ) = f ( x ; Θ ) { 1 + δ ( e log F ( x ; Θ ) ) 1 } F ( x ; Θ ) ,
and
H ( x ; δ , Θ ) = log ( 1 e δ   F ( x ; Θ ) ( e log F ( x ; Θ ) ) δ )

3. NGLog–Weib Distribution

For a three-parameter special case of the NGLog–X family, let us consider the CDF, F ( x ; Θ ) , and PDF, f ( x ; Θ ) , of the traditional Weib distribution are F ( x ; Θ ) = 1 e β x α and f ( x ; Θ ) = β α x α 1 e β x α , respectively ( β + , α + , and x + ), where Θ = ( β , α ) .
Using F ( x ; Θ ) = 1 e β x α in Equation (7), we define the CDF G ( x ; δ , Θ ) of the NGLog–Weib model. For x + , the scale parameter, β + , and shape parameters, δ + and α + , the G ( x ; δ , Θ ) CDF of the NGLog–Weib model is as follows,
G ( x ; δ , Θ ) = e δ   ( 1 e β x α ) ( e log ( 1 e β x α ) ) δ ,
S ( x ; δ , Θ ) = 1 e δ   ( 1 e β x α ) ( e log ( 1 e β x α ) ) δ , ( Survival   Function )
Plots of the CDF, i.e., G ( x ; δ , Θ ) and SF (survival function), i.e., S ( x ; δ , Θ ) of the NGLog–Wei probability model, are given in Figure 1 for (i) δ = 2.0, β = 6.0, α = 1.5 (in red), (ii) δ = 3.0, β = 5.0, α = 2.3 (in green), (iii) δ = 4.0, β = 3.0, α = 2.9 (in black), and (iv) α = 4.0, β = 1.5, θ = 3.5 (in blue).
Corresponding to G ( x ; δ , Θ ) in Equation (13), the PDF, g ( x ; δ , Θ ) , is
g ( x ; δ , Θ ) = α β x α 1 e β x α + δ { 1 + δ ( e log ( 1 e β x α ) ) 1 } ( e log ( 1 e β x α ) ) δ
For a graphical illustration, some possible behaviours of the PDF g ( x ; δ , Θ ) of the NGLog–Weib distribution at different parameter values are sketched in Figure 2. The plots are sketched for (i) δ = 3.0, β = 3.5, α = 0.3 (in red), (ii) δ = 3.0, β = 5.0, α = 1.9 (in green), (iii) δ = 4.5, β = 3.0, α = 2.9 (in black), and (iv) δ = 6.5, β = 1.5, α = 3.3 (in blue). Figure 2 shows four different shapes of the PDF g ( x ; δ , Θ ) , including (i) decreasing or a reverse-J pattern (in red), (ii) right-skewed (in green), (iii) symmetrical (in black), and (iv) left-skewed (in blue).
Furthermore, the hazard function h ( x ; δ , Θ ) , RHF (reverse hazard function) τ ( x ; δ , Θ ) , and CHF (cumulative hazard function) H ( x ; δ , Θ ) of the NGLog–Weib are given in Equations (16) to (18), respectively, by
h ( x ; δ , Θ ) = α β x α 1 e β x α + δ { 1 + δ ( e log ( 1 e β x α ) ) 1 } ( e log ( 1 e β x α ) ) δ e δ ( 1 e β x α ) ,
τ ( x ; δ , Θ ) = β α x α 1 e β x α { 1 + δ ( e log ( 1 e β x α ) ) 1 } ( 1 e β x α ) ,
H ( x ; δ , Θ ) = log ( 1 e δ   ( 1 e β x α ) ( e log ( 1 e β x α ) ) δ )
Furthermore, for the various values of the parameters, some possible behaviours of the HF h ( x ; δ , Θ ) of the NGLog–Weib model are shown in Figure 3 for (i) δ = 10.7, β = 2.6, α = 0.5 (in red), (ii) δ = 10.4, β = 3.5, α = 0.5 (in green), (iii) δ = 0.5, β = 0.2, α = 2.9 (in black), (iv) δ = 6.0, β = 2.4, α = 0.3 (in blue), and (iv) δ = 0.5, β = 0.8, α = 0.5 (in violet). From Figure 3, we can observe (or examine) that the NGLog–Wieb offers data modelling with increasing, decreasing, uni-model, and bathtub-shaped HFs, h ( x ; δ , Θ ) .

4. Statistical Properties

This section presents some mathematical and statistical properties of the NGLog–X family of probability distributions.

4.1. Moments

We derived a computational representation of the general order moments corresponding to the proposed NGLog–X of distributions. If x is an RV from the NGLog–X family, then about the origin, the r th moments are given by
E ( x r ) = x r g ( x ; δ , Θ ) d x
Putting the density of the proposed family in Equation (19), we obtain
μ = x r f ( x ; Θ ) e δ { 1 + δ ( e log F ( x ; Θ ) ) 1 } ( e log F ( x ; Θ ) ) δ d x
μ = x r f ( x ; Θ ) ( 1 log F ( x ; Θ ) e ) δ { 1 + δ e 1 ( 1 log F ( x ; Θ ) e ) 1 } d x
Using the expansion ( 1 x ) n = i = 0 ( i + n 1 n 1 )   x i , where x = log F ( x ; Θ ) e , n = δ , and n = 1 in Equation (20), respectively, then we obtain
μ = i = 0 ( i + δ 1 δ 1 ) e i Ψ i + δ i = 0 j = 0 ( i + δ 1 δ 1 ) e ( i + j + 1 ) Ψ i , j , 1
where
Ψ i = x r f ( x ; Θ ) ( log F ( x ; Θ ) ) i d x
and
Ψ i , j , 1 = x r f ( x ; Θ ) ( log F ( x ; Θ ) ) i + j d x
Similarly, using the PDF g ( x ; δ , Θ ) , a general explicit expression for the MGF (moments generating function) Μ X ( t ) of the NGLog–X is derived as
Μ X ( t ) = e t x g ( x ; δ , Θ ) d x = r = 0 t r r ! μ
By simplifying, we obtain the MGF
Μ X ( t ) = i , r = 0 t r e i r ! Ψ i + δ i , j , r = 0 t r e ( i + j + 1 ) r ! Ψ i , j , 1

4.2. Residual and Reverse Residual Life of NGLog–X

In life distributions, the residual lifetime RV is referred to as the “time since failure” and has wide applications in survival analysis. The notation used for the residual lifetime of the NGLog–X RV x is denoted by R ( t ) and is given as
R ( t ) ( x ) = S ( x + t ; δ , Θ ) S ( x ; δ , Θ ) ,
R ( t ) ( x ) = ( ( e log F ( x + t ; Θ ) ) δ e δ   F ( x + t ; Θ ) ) ( e log F ( x ; Θ ) ) δ ( ( e log F ( x ; Θ ) ) δ e δ   F ( x ; Θ ) ) ( e log F ( x + t ; Θ ) ) δ
For the NGLog–X RVs, the reverse residual life is R ¯ ( t ) :
R ¯ ( t ) ( x ) = S ( x t ; δ , Θ ) S ( x ; δ , Θ ) ,
R ( t ) ( x ) = ( ( e log F ( x t ; Θ ) ) δ e δ   F ( x t ; Θ ) ) ( e log F ( x ; Θ ) ) δ ( ( e log F ( x ; Θ ) ) δ e δ   F ( x ; Θ ) ) ( e log F ( x t ; Θ ) ) δ

4.3. The Identifiability Property

The IP is a very worthwhile statistical property that ensures precise inference. In this subsection, we calculated the IP of the NGLog–X family of the distributions. Let δ 1 and δ 2 be the two additional parameters having the CDFs G ( x ; δ 1 , Θ ) and G ( x ; δ 2 , Θ ) . The parameter δ is identifiable if δ 1 = δ 2 . Mathematically, the IP is derived as:
G ( x ; δ 1 , Θ ) = G ( x ; δ 2 , Θ )
Inserting Equation (7) in Equation (25), we obtain
e δ 1   F ( x ; Θ ) ( e log F ( x ; Θ ) ) δ 1 = e δ 2   F ( x ; Θ ) ( e log F ( x ; Θ ) ) δ 2
After simplifying Equation (26), we obtain
e δ 1   F ( x ; Θ ) ( e log F ( x ; Θ ) ) δ 2 = e δ 2   F ( x ; Θ ) ( e log F ( x ; Θ ) ) δ 1
e δ 1   ( e log F ( x ; Θ ) ) δ 2 = e δ 2   ( e log F ( x ; Θ ) ) δ 1
By taking logarithm of Equation (27), we obtain
δ 1 + δ 2 log ( e log F ( x ; Θ ) ) = δ 2 + δ 1 log ( e log F ( x ; Θ ) ) δ 1 ( 1 log ( e log F ( x ; Θ ) ) ) = δ 2 ( 1 log ( e log F ( x ; Θ ) ) ) δ 1 = δ 2
From Equation (28), we observed that δ 1 = δ 2 . Hence, the parameter, δ , is identifiable.

4.4. Order Statistics

Let x 1 , x 2 , , x n be the set of the i.i.d RVs of the ‘n’ size taken from the NGLog–X family of distributions with the parameters δ and Θ . Let suppose that the associated OS (order statistics) is x 1 : n , x 2 : n , , x n : n . Then, from David [33], the density function (DF) of x j : n , say q j : n ( x ) , where j = 1 , 2 , , n , is defined by
q j : n ( x ) = g ( x ; δ , Θ ) B ( j , n j + 1 ) i = 0 n j ( 1 ) i [ G ( x ; Θ , δ ) ] i + j 1
Using Equations (7) and (8) in Equation (29), we obtain the j t h order statistics for the proposed family of probability distributions.

4.5. Quantile Function

A QF (quartile function) is used to calculate any of the positional descriptive statistics of the distributions. The QF of the proposed NGLog–X method is the function Q ( u ; δ , Θ ) that satisfies the following nonlinear equations:
Q ( G ( u ; δ , Θ ) ) = u , u ( 0 , 1 )
By using Equation (7) in Equation (30), we obtain
Q ( G ( u ; δ , Θ ) ) = G 1 ( u ) = F 1 ( u )
Q ( u ) = δ + log F ( x ; Θ ) log u + δ log ( e log F ( x ; Θ ) ) ,
where u is the solution of δ + log F ( x ; Θ ) log u + δ log ( e log F ( x ; Θ ) ) .

5. Estimation and Simulations

To obtain the MLEs ( δ M L E , Θ M L E ) of the proposed method, we used the method of the maximum likelihood function. In addition, an MCSS was also conducted to assess the performance of the corresponding δ ^ M L E and Θ ^ M L E .

5.1. Estimation

Various methods of estimation have been developed for estimating the model parameters; however, a well-known method, MLE, is used commonly. Here, from a complete sample only, we define the MLE method for the estimation of the unknown parameters of the NGLog–X family distributions. Suppose that x 1 , x 2 , , x k are the observed values of the size “n” selected from the proposed NGLog–X family of distributions with the parameters ( δ M L E , Θ M L E ) randomly. Then, the expression of the LF (likelihood function) φ ( x ; δ , Θ ) is acquired as
φ ( x ; δ , Θ ) = i = 1 n ( f ( x ; Θ ) e δ { 1 + δ ( e log F ( x ; Θ ) ) 1 } ( e log F ( x ; Θ ) ) δ )
The corresponding log-LF, ( Φ ) , is given by
( Φ ) = n δ + 2 i = 1 n log f ( x i ; Θ ) + i = 1 n log ( 1 + δ ( e log F ( x i ; Θ ) ) 1 ) n δ i = 1 n log ( e log F ( x ; Θ ) )
Using ( Φ ) , the partial derivatives are the following equations:
d d δ ( Φ ) = n + i = 1 n ( e log F ( x i ; Θ ) ) 1 ( 1 + δ ( e log F ( x i ; Θ ) ) 1 ) n i = 1 n log ( e log F ( x ; Θ ) ) ,
d d Θ ( Φ ) = + 2 i = 1 n d / d Θ f ( x i ; Θ ) f ( x i ; Θ ) + n δ i = 1 n [ F ( x ; Θ ) ] 1 d / d Θ F ( x ; Θ ) ( e log F ( x ; Θ ) ) i = 1 n ( e log F ( x i ; Θ ) ) 2 [ F ( x i ; Θ ) ] 1 d / d Θ F ( x i ; Θ ) ( 1 + δ ( e log F ( x i ; Θ ) ) 1 ) ,
where Φ = ( δ , Θ ) .
Setting d d δ ( Φ ) = 0 and d d Θ ( Φ ) = 0 and solving these equations simultaneously yielded the MLEs of δ ^ M L E , Θ ^ M L E . Clearly, the resulting equations cannot be solved analytically. Therefore, statistical software can be applied to solve these equations numerically through iterative Newton–Raphson-type algorithms.

5.2. Simulation

Here, in this sub-section, a comprehensive MCSS is conducted to assess the performance of the estimators of the NGLog–Weib distribution by using the R package. The simulation results are calculated for six sets of parameter values: (i) Set I ( δ = 0.5 ,   β = 2.4 ,   α = 3.6 ) , (ii) Set II ( δ = 3.2 ,   β = 2.7 ,   α = 3.8 ) , (iii) Set III ( δ = 1.2 ,   β = 3.3 ,   α = 4.5 ) , (iv) Set IV ( δ = 1.5 ,   β = 3.9 ,   α = 4.6 ) , (v) Set V ( δ = 1.4 ,   β = 3.5 ,   α = 1.0 ) , and (vi) Set VI ( δ = 1.2 ,   β = 0.7 ,   α = 1.0 ) . For each set of the above parameters values, the MCSS replicates are made 1000 times under different sample sizes, n = 25 , 50 , , 1000 . We obtained the average MLEs, MSEs (mean square errors), and biases for each sample size. The numerical values of the biases and the MSEs of the model are calculated as:
1 n i = 1 n ( δ ^ i δ )
and
1 n i = 1 n ( δ ^ i δ ) 2
For the numerical values, the same process is repeated for Θ . The SRs (simulation results) corresponding to Set I and Set II are shown in Table 2. Similarly, the results of Set III and Set IV are provided in Table 3, beside the results of Set V and Set VI, which are provided in Table 4. It is observed from Table 2, Table 3 and Table 4 that the estimated values of ( δ ^ , Θ ^ ) M L E are stable, and the MSE of ( δ ^ , Θ ^ ) M L E decreases as the sample size increases. Similarly, the values of the biases, ( δ ^ , Θ ^ ) M L E , are also decay to zero as the sample size increases.

6. Real Data Applications

In this section, we show the practical performance of the suggested method by analysing four real datasets from the biomedical field. The first biomedical dataset is guinea pigs infected, consisting of 72 observations and presenting the survival time of the guinea pigs infected with different amounts of virulent tubercle bacilli. The second dataset is head and neck cancer, having 44 observations and presenting the length of time the head and neck cancer patients stay alive. The third dataset, COVID−19 patients, contains 36 observations and denotes the death rate of COVID−19 pandemic patients belong to Canada of 36 days, from 10 April to 15 May 2020, see the link [https://covid19.who.int/]. Similarly, the fourth dataset represents the lifetimes of 20 patients receiving analgesics. A detailed description of all the datasets is presented in Table 5.
Table 5. Medical datasets.
Table 5. Medical datasets.
No.ObservationsSources
Dataset 1.10, 33, 44, 56, 59, 72, 74, 77, 92, 93, 96, 100, 100, 102, 105, 107, 107, 108, 108, 108, 109, 112, 113, 115, 116, 120, 121, 122, 122, 124, 130, 134, 136, 139, 144, 146, 153, 159, 160, 163, 163, 168, 171, 172, 176, 183, 195, 196, 197, 202, 213, 215, 216, 222, 230, 231, 240, 245, 251, 253, 254, 255, 278, 293, 327, 342, 347, 361, 402, 432, 458, 555Bjerkedal [34]
Dataset 2.12.20, 23.56, 23.74, 25.87, 31.98, 37, 41.35, 47.38, 55.46, 58.36, 63.47, 68.46, 78.26, 74.47, 81.43, 84, 92, 94, 110, 112, 119, 127, 130, 133, 140, 146, 155, 159, 173, 179, 194, 195, 209, 249, 281, 319, 339, 432, 469, 519, 633, 725, 817, 1776Ceren et al. [35]
Dataset 3.3.1091, 3.3825, 3.1444, 3.2135, 2.4946, 3.5146, 4.9274, 3.3769, 6.8686, 3.0914, 4.9378, 3.1091, 3.2823, 3.8594, 4.0480, 4.1685, 3.6426, 3.2110, 2.8636, 3.2218, 2.9078, 3.6346, 2.7957, 4.2781, 4.2202, 1.5157, 2.6029, 3.3592, 2.8349, 3.1348, 2.5261, 1.5806, 2.7704, 2.1901, 2.4141, 1.9048Almetwally et al. [36]
Dataset 4.1.1, 1.4, 1.3, 1.7, 1.9, 1.8, 1.6, 2.2, 1.7, 2.7, 4.1, 1.8, 1.5, 1.2, 1.4, 3.0, 1.7, 2.3, 1.6, 2.0.Jamal et al. [37]
The NGLog–Weib distribution was applied to all four datasets. The goodness of the fit of the newly introduced method was compared with the APTra–Weib (Alpha Power Transformed Weibull) model (Dey et al. [38]), the NExpo–Weib (New Exponential Weibull) model (Shah et al. [32]), the MO–Weib (Marshall–Olkin Weibull) model (Marshall and Olkin [39]), the FRLog–Weib (Flexible Reduced Logarithmic Weibull) model (Liu et al. [1]), and the Kumar–Weib (Kumaraswamy Weibull) distribution (Cordeiro et al. [40]). The CDF of the competitive probability models (or distributions) are given below:
  • The APTra–Weib distribution:
G ( x ; α , Θ ) = α ( 1 e β x δ ) 1 α 1 , α 1 , α + and   x +
  • The MO–Weib distribution:
G ( x ; Θ , δ ) = ( 1 e β x α ) δ + ( 1 δ ) ( 1 e β x α ) , δ , α , β + and x +
  • The NExpo–Weib distribution:
G ( x ; Θ ) = 1 ( e e β x α 1 e e β x α ) , α , β + and   x +
  • The Kumar–Weib distribution:
G ( x ; a , b , Θ ) = 1 [ 1 ( 1 e β x α ) a ] b , a , b , α , β + and   x +
  • The FRlog–Weib distribution:
G ( x ; Θ , δ ) = 1 log ( δ + 1 δ ( 1 e β x α ) ) log ( 1 + δ ) , δ , α , β + and   x +
Additionally, we considered various AMs (analytical measures) to verify which distribution outperformed and provided the closest fit to the considered datasets among the fitted distributions. These AMs included GFMs (goodness of fit measures) and DMs (discrimination measures) with corresponding p-values. The GFMs were computed as
  • The CM (Cramer-von Misses) test statistic:
C M = i = 1 n [ F ( x i ; Θ ) 2 i 1 2 n ] 2 + 1 12 n .
  • The AD (Anderson Darling) test statistics:
A D = n 1 n i = 1 n ( 2 i 1 ) × [ log ( 1 F ( x i n + 1 ; Θ ) ) + log F ( x i ; Θ ) ] .
  • The KS (Kolmogorov Smirnov) test statistic:
sup x [ F n ( x ; Θ ) F ( x ; Θ ) ] .
The DMs were computed as
  • The Akaike IC (information criterion):
A I C = 2 w 2 ( Φ ) .
  • The Bayesian IC:
B I C = w log ( n ) 2 ( Φ ) .
  • The Consistent Akaike IC:
C A I C = 2 n w n w 1 2 ( Φ ) .
  • The Hannan–Quinn IC:
H Q I C = 2 w log ( log ( n ) ) 2 ( Φ ) .
where ( Φ ) is the maximized LF evaluated on the MLEs, “n” is the sample size, and “w” is the number of probability model parameters. The proposed distribution was implemented in R software using an (Adequacy Model) package with the “BFGS algorithm” method for analysing the datasets.
In general, a statistical model that has lower values of these AMs and a higher p-value will be considered a good probability (or competitor) model for the underlined (or considered) datasets. By implementing these AMs, it is observed that the NGLog–Weib model gives a superior fit to the considered datasets as compared to the other distributions.

6.1. Injected Guinea Pigs, Dataset 1

The NGLog–Weib distribution was applied to the injected guinea pigs dataset, and its goodness of fit was compared with the other distributions. The summary measures were: min = 10.00, 1st quartile = 108.00, mean = 176.80, median = 149.50, 3rd quartile = 224.00, max = 555.00, skewness = 1.341, kurtosis = 4.989, variance = 10,705.10, and range = 545.0. Corresponding to Data 1, the histogram plot, kernel density, TTT plot, violin plot, and box plot are plotted in Figure 4. Similarly, the corresponding model’s parameter values, i.e., β ^ M L E , α ^ M L E , δ ^ M L E , a ^ M L E , b ^ M L E , and θ ^ M L E of the NGLog–Weib distribution and another extension of the Weib distributions, are recorded in Table 6, whereas the GFMs with p-values and DMs of the competing probability distributions are shown in Table 7 and Table 8. Furthermore, in favour of the AM results presented in Table 7 and Table 8, the estimated PDF, CDF, SF, PP (probability–probability), and QQ (quintile–quintile) plots of the NGLog–Weib distribution are plotted in Figure 5.
Given the results presented in Table 7 and Table 8, we can conclude that the NGLog–Weib distribution has the lowest GFM and DM values and the highest p-value, as compared to the other distributions. For the proposed NGLog–Weib distribution, the AMs are: CM = 0.079, AD = 0.492, KS = 0.086, AIC = 856.654, BIC = 863.484, CAIC = 857.007, HQIC = 859.373, and p-value = 0.665. In terms of the AD, KS, AIC, BIC, CAIC, HQIC, and p-value, the second cohesive distribution is the four-parameter Kumar–Weib model. For the Kumar–Weib model, the AM values of the KS, AIC, BIC, CAIC, HQIC, and p-value are given by 0.092, 859.492, 868.599, 860.089, 863.117, and 0.580, respectively, while in comparison to the AD, the second-best distribution is the three-parameter APTra–Weib model because for the APTra–Waib model, the value of the statistics, AD, is 0.138.
Hence, from the numerical depiction in Table 7 and Table 8, the visual graphical illustration (the close-fitting of the NGLog–Weib model), and the above comprehensive discussion, we can say that the newly derived NGLog–Weib model is a good competitor among the other compared probability models for the injected guinea pigs dataset.

6.2. Head and Neck Cancer, Dataset 2

The NGLog–Weib distribution was analysed on the head and neck cancer data and compared with the other distributions to evaluate its optimum fitting power. The summary measures were: min = 12.2, 1st quartile = 67.210, mean = 223.50, median = 128.50, 3rd quartile = 219.0, max = 1776.0, skewness = 3.384, kurtosis = 16.559, var = 93,286.41, and range = 1763.80. Some basic plots, such as the histogram, kernel density, TTT plot, violin plot, and box plot, for dataset 2 are presented in Figure 6. Similarly, the corresponding parameter values, i.e., β ^ M L E , α ^ M L E , δ ^ M L E , a ^ M L E , b ^ M L E , and θ ^ M L E of the NGLog–Weib distribution in comparison with the other competing probability distributions are given in Table 9, whereas the GFMs with p-values and DMs for selecting the best distribution of the competing probability distributions are presented in Table 10 and Table 11. Furthermore, the graphical representation, the estimated PDF, CDF, SF, PP, and QQ plots of the NGLog–Weib distribution, are shown in Figure 7.
The results presented in Table 10 and Table 11 show that the NGLog–Weib distribution has the lowest GFM and DM values and the highest p-value, as compared to the GFM and DM values and p-values of the other distributions. For the proposed NGLog–Weib distribution, the AMs and p-value are: CM = 0.018, AD = 0.119, KS = 0.057, AIC = 560.732, BIC = 566.085, CAIC = 561.332, HQIC = 562.717, and p-value = 0.9973, respectively. In terms of the CM, AD, KS, AIC, CAIC, HQIC, and p-value, the second optimum distribution is the Kumar–Weib model. For the Kumar–Weib model, the values of the CM, AD, KS, AIC, CAIC, HQIC, and p-value are given by 0.022, 0.136, 0.065, 562.711, 563.737, 565.358 and 0.985, respectively. Similarly, in terms of the BIC, the second optimum distribution is the NExpo–Weib distribution because its value is 568.725.
In light of the numerical justification in Table 10 and Table 11, the visual graphical illustration (the close-fitting of the proposed NGLog–Weib distribution) in Figure 7, and the above discussion, we can observe that the proposed NGLog–Weib distribution is a moral competitor among the other compared probability distributions for the head and neck cancer data (Data 2).

6.3. COVID−19 Patients, Dataset 3

The NGLog–Weib distribution was evaluated on the COVID-19 patients’ dataset. The key measures were: minimum = 1.516, 1st quartile = 2.788, mean = 3.282, median = 3.178, 3rd quartile = 3.637, maximum = 6.868, skewness = 3.384, kurtosis = 16.559, var = 93,286.41, and range = 1763.8. Figure 8 depicts the histogram, kernel density, TTT plot, violin plot, and box plot for the COVID-19 patients’ dataset 3. Similarly, the values of the parameters, i.e., β ^ M L E , α ^ M L E , δ ^ M L E , a ^ M L E , b ^ M L E , and θ ^ M L E of the NGLog–Weib model with the other competing probability distributions are displayed in Table 12, whereas the GFMs with p-values and DMs for selecting the best distribution of the competing distributions are recorded in Table 13 and Table 14. Furthermore, the graphical representation of the estimated PDF, CDF, SF, PP, and QQ plots of the NGLog–Weib distribution is shown in Figure 9.
The results presented in Table 13 and Table 14 display that the NGLog–Weib distribution has the least GFM and DM values and highest p-values, as compared to the GFM and DM values and p-values of the other distributions. For the proposed NGLog–Weib distribution, the AMs and p-values are: CM = 0.092, AD = 0.536, KS = 0.104, AIC = 102.028, BIC = 106.778, CAIC = 102.778, HQIC = 103.686, and p-value = 0.813, respectively. In terms of the CM, AD, KS, and p-value, the second most optimum distribution is the Kumar–Weib model. For the Kumar–Weib model, the values of the CM, AD, KS, and p-value are 0.093, 0.539, 0.106, and 0.795, respectively. Similarly, in terms of the AIC, BIC, CAIC, and HQIC, the second-best distribution is the APTra–Weib distribution because it’s AIC, BIC, CAIC, and HQIC values are 103.616, 108.366, 104.366, and 105.274, respectively.
Based on the justification of the numerical description in Table 13 and Table 14, the visual graphical illustration in Figure 9, and the above comprehensive discussion, we can observe that the NGLog–Weib model is a good competitor among the other compared distributions for the COVID-19 dataset (Data 3).

6.4. Patients Receiving Analgesics, Dataset 4

Finally, the NGLog–Weib distribution was evaluated on the patients receiving analgesics’ datasets. The key measures were: minimum = 1.100, 1st quartile = 1.475, mean = 1.900, median = 1.700, 3rd quartile = 2.050, maximum = 4.100, skewness= 1.720, kurtosis= 5.924, var= 0.496, and range = 3.0. Figure 10 depicts the histogram, kernel density, TTT plot, violin plot, and box plot for patients receiving the analgesics, dataset 4. Similarly, the values of the parameters, i.e., β ^ M L E , α ^ M L E , δ ^ M L E , a ^ M L E , b ^ M L E , and θ ^ M L E of the NGLog–Weib, with other well-known competing distributions, are displayed in Table 15, whereas the GFMs with p-values and DMs for selecting the best distribution of the competing distributions are provided in Table 16 and Table 17. Furthermore, the graphical representation of the estimated PDF, CDF, SF, PP, and QQ plots of the NGLog–Weib distribution is shown in Figure 11.
The results presented in Table 16 and Table 17 display that the NGLog–Weib distribution has the least AM values and the highest p-value, as compared to the AMs and p-values of the other distributions. For the proposed NGLog–Weib distribution, the AMs and p-value are: CM = 0.058, AD = 0.344, KS = 0.146, AIC = 38.699, BIC = 41.687, CAIC = 40.199, HQIC = 39.282, and p-value = 0.850, respectively. In terms of the CM, AD, KS, AIC, HQIC, and p-value, the second most optimum distribution is the Kumar–Weib model. For the Kumar–Weib model, the values of the CM, AD, KS, AIC, HQIC, and p-value are 0.060, 0.359, 0.156, 39.924, 40.702, and 0.833, respectively. Similarly, in terms of the BIC, CAIC, and HQIC, the second-best distribution is the MO–Weib model because its BIC, CAIC, and HQIC values are 43.768, 42.281, and 41.364, respectively.
Based on the justification of the numerical description in Table 16 and Table 17, the visual graphical illustration, and the above comprehensive discussion, we can observe that the proposed NGLog–Weib model is a good (or optimum) competitor among the other competing distributions for the patients receiving analgesics (dataset 4).

7. Conclusions

This study proposes a New Generalized Logarithmic-X (NGLog–X) family of distributions, which is a new family of continuous probability distributions. A three-parameter special subcase of the NGLog–X family of distributions was introduced by implementing the Weibull distribution as a baseline distribution. Various statistical and mathematical properties, such as the moments, moments generating function, residual life, reverse residual life, identifiability property, order statistics, and quantile function were investigated. To this end, the MLE method was utilized to estimate the unknown parameters of the NGLog–X family of distribution. A brief Monte Carlo simulation study was considered to check the efficiency of the δ ^ M L E , α ^ M L E , and β ^ M L E of the NGLog–X family of distributions. The newly proposed method was applied to biomedical datasets in comparison to some other well-known distributions, including the APTra–Weib, MO–Weib, FRLog–Weib, NExpo–Weib, and Kumar–Weib distributions. Based on the GFMs and DMs, it is shown that the NGLog–Weib distribution provides a good fit in comparison with other competitive distributions by modelling the medical datasets. The results suggest that the proposed method and its generated models in distribution theory could provide useful applications in biomedical and other related fields.

Author Contributions

Conceptualization, Z.S., D.M.K. and Z.K.; methodology, D.M.K.; software, Z.S., D.M.K. and N.F.; validation, D.M.K., S.H., T.A. and K.-I.K.; formal analysis, Z.S., D.M.K. and N.F.; investigation, D.M.K., T.A. and A.A.; resources, S.H. and K.-I.K.; data curation, D.M.K., S.H., T.A. and N.F.; writing—original draft preparation, Z.S. and N.F.; writing—review and editing, D.M.K., S.H., K.-I.K., T.A. and Z.K.; visualization, Z.S. and N.F. supervision, D.M.K.; project administration, D.M.K., T.A. and K.-I.K.; funding acquisition K.-I.K. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the Institute for Information and Communications Technology Planning and Evaluation (IITP) grant funded by the Korean Government (MSIT) (No. 2022-0-01200, Training Key Talents in Industrial Convergence Security).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Liu, Y.; Ilyas, M.; Khosa, S.K.; Muhmoudi, E.; Ahmad, Z.; Khan, D.M.; Hamedani, G.G. A flexible reduced logarithmic-X family of distributions with biomedical analysis. Comput. Math. Methods Med. 2020, 2020, 4373595. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  2. Almalki, S.J.; Yuan, J. A new modified Weibull distribution. Reliab. Eng. Syst. Saf. 2013, 111, 164–170. [Google Scholar] [CrossRef]
  3. Liao, Q.; Ahmad, Z.; Mahmoudi, E.; Hamedani, G.G. A new flexible bathtub-shaped modification of the Weibull model Properties and applications. Math. Probl. Eng. 2020, 2020, 3206257. [Google Scholar] [CrossRef]
  4. De Micheli, D.; Formigoni, M.L.O. Drug use by Brazilian students: Associations with family, psychosocial, health, demographic and behavioral characteristics. Addiction 2004, 99, 570–578. [Google Scholar] [CrossRef]
  5. Klakattawi, H.; Alsulami, D.; Elaal, M.A.; Dey, S.; Baharith, L. A new generalized family of distributions based on combining Marshal-Olkin transformation with TX family. PLoS ONE 2022, 17, e0263673. [Google Scholar] [CrossRef]
  6. Eghwerido, J.T.; Agu, F.I.; Ibidoja, O.J. The shifted exponential-G family of distributions: Properties and applications. J. Stat. Manag. Syst. 2022, 25, 43–75. [Google Scholar] [CrossRef]
  7. Kilai, M.; Waititu, G.A.; Kibira, W.A.; Alshanbari, H.M.; El-Morshedy, M. A new generalization of Gull Alpha Power Family of distributions with application to modeling COVID-19 mortality rates. Results Phys. 2022, 36, 105339. [Google Scholar] [CrossRef]
  8. Alkhairy, I.; Faqiri, H.; Shah, Z.; Alsuhabi, H.; Yusuf, M.; Aldallal, R.; Makumi, N.; Riad, F.H. A New Flexible Logarithmic-X Family of Distributions with Applications to Biological Systems. Complexity 2022, 2022, 7845765. [Google Scholar] [CrossRef]
  9. Tung, Y.L.; Ahmad, Z.; Mahmoudi, E. The Arcsine-X Family of Distributions with Applications to Financial Sciences. Comput. Syst. Sci. Eng. 2021, 39, 351–363. [Google Scholar] [CrossRef]
  10. Alzaatreh, A.; Aljarrah, M.A.; Smithson, M.; Shahbaz, S.H.; Shahbaz, M.Q.; Famoye, F.; Lee, C. Truncated family of distributions with applications to time and cost to start a business. Methodol. Comput. Appl. Probab. 2021, 23, 5–27. [Google Scholar] [CrossRef]
  11. Eghwerido, J.T.; Efe-Eyefia, E.; Zelibe, S.C. The transmuted alpha power-G family of distributions. J. Stat. Manag. Syst. 2021, 24, 965–1002. [Google Scholar] [CrossRef]
  12. Reyad, H.; Korkmaz, M.Ç.; Afify, A.Z.; Hamedani, G.G.; Othman, S. The Fréchet Topp Leone-G family of distributions: Properties, characterizations and applications. Ann. Data Sci. 2021, 8, 345–366. [Google Scholar] [CrossRef]
  13. Zichuan, M.; Hussain, S.; Iftikhar, A.; Ilyas, M.; Ahmad, Z.; Khan, D.M.; Manzoor, S. A new extended-family of distributions: Properties and applications. Comput. Math. Methods Med. 2020, 2020, 4650520. [Google Scholar] [CrossRef] [PubMed]
  14. Huo, X.; Khosa, S.K.; Ahmad, Z.; Almaspoor, Z.; Ilyas, M.; Aamir, M. A new lifetime exponential-X family of distributions with applications to reliability data. Math. Probl. Eng. 2020, 2020, 1316345. [Google Scholar] [CrossRef]
  15. Shah, Z.; Khan, D.M.; Khan, Z.; Shafiq, M.; Choi, J.G. A New Modified Exponent Power Alpha Family of Distributions with Applications in Reliability Engineering. Processes 2022, 10, 2250. [Google Scholar] [CrossRef]
  16. Hussain, S.; Rashid, M.S.; Ul Hassan, M.; Ahmed, R. The Generalized Alpha Exponent Power Family of Distributions: Properties and Applications. Mathematics 2022, 10, 1421. [Google Scholar] [CrossRef]
  17. Chen, P.; Ye, Z.S. Estimation of field reliability based on aggregate lifetime data. Technometrics 2017, 59, 115–125. [Google Scholar] [CrossRef]
  18. Xu, A.; Zhou, S.; Tang, Y. A unified model for system reliability evaluation under dynamic operating conditions. IEEE Trans. Reliab. 2019, 70, 65–72. [Google Scholar] [CrossRef]
  19. Zhang, L.; Xu, A.; An, L.; Li, M. Bayesian inference of system reliability for multicomponent stress-strength model under Marshall-Olkin Weibull distribution. Systems 2022, 10, 196. [Google Scholar] [CrossRef]
  20. Luo, C.; Shen, L.; Xu, A. Modelling and estimation of system reliability under dynamic operating environments and lifetime ordering constraints. Reliab. Eng. Syst. Saf. 2022, 218, 108136. [Google Scholar] [CrossRef]
  21. Zhuang, L.; Xu, A.; Wang, X.L. A prognostic driven predictive maintenance framework based on Bayesian deep learning. Reliab. Eng. Syst. Saf. 2023, 234, 109181. [Google Scholar] [CrossRef]
  22. Zhao, W.; Khosa, S.K.; Ahmad, Z.; Aslam, M.; Afify, A.Z. Type-I heavy tailed family with applications in medicine, engineering and insurance. PLoS ONE 2020, 15, e0237462. [Google Scholar] [CrossRef] [PubMed]
  23. Alzaatreh, A.; Famoye, F.; Lee, C. A new method for generating families of continuous distributions. Metron 2013, 71, 63–79. [Google Scholar] [CrossRef] [Green Version]
  24. Wang, Y.; Feng, Z.; Zahra, A. A new logarithmic family of distributions: Properties and applications. CMC-Comput. Mater. Contin. 2021, 66, 919–929. [Google Scholar]
  25. Zhao, Y.; Ahmad, Z.; Alrumayh, A.; Yusuf, M.; Aldallal, R.; Elshenawy, A.; Riad, F.H. A novel logarithmic approach to generate new probability distributions for data modeling in the engineering sector. Alex. Eng. J. 2023, 62, 313–325. [Google Scholar] [CrossRef]
  26. Eugene, N.; Lee, C.; Famoye, F. Beta-normal distribution and its applications. Commun. Stat.-Theory Methods 2002, 31, 497–512. [Google Scholar] [CrossRef]
  27. Zografos, K.; Balakrishnan, N. On families of beta-and generalized gamma-generated distributions and associated inference. Stat. Methodol. 2009, 6, 344–362. [Google Scholar] [CrossRef]
  28. Ristić, M.M.; Balakrishnan, N. The gamma-exponentiated exponential distribution. J. Stat. Comput. Simul. 2012, 82, 1191–1206. [Google Scholar] [CrossRef]
  29. Torabi, H.; Hedesh, N.M. The gamma-uniform distribution and its applications. Kybernetika 2012, 48, 16–30. [Google Scholar]
  30. Tahir, M.H.; Cordeiro, G.M.; Alzaatreh, A.; Mansoor, M.; Zubair, M. The logistic-X family of distributions and its applications. Commun. Stat.-Theory Methods 2016, 45, 7326–7349. [Google Scholar] [CrossRef] [Green Version]
  31. Ahmad, Z.; Mahmoudi, E.; Dey, S.; Khosa, S.K. Modeling vehicle insurance loss data using a new member of TX family of distributions. J. Stat. Theory Appl. 2020, 19, 133–147. [Google Scholar] [CrossRef]
  32. Shah, Z.; Ali, A.; Hamraz, M.; Khan, D.M.; Khan, Z.; EL-Morshedy, M.; Al-Bossly, A.; Almaspoor, Z. A New Member of TX Family with Applications in Different Sectors. J. Math. 2022, 2022, 1453451. [Google Scholar] [CrossRef]
  33. David, H.A. Order Statistics, 2nd ed.; Wiley: New York, NY, USA, 1981. [Google Scholar]
  34. Bjerkedal, T. Acquisition of Resistance in Guinea Pies infected with Different Doses of Virulent Tubercle Bacilli. Am. J. Hyg. 1960, 72, 130–148. [Google Scholar] [PubMed]
  35. Ceren, Ü.N.A.L.; Cakmakyapan, S.; Gamze, Ö.Z.E.L. Alpha power inverted exponential distribution: Properties and application. Gazi Univ. J. Sci. 2018, 31, 954–965. [Google Scholar]
  36. Almetwally, E.M.; Alharbi, R.; Alnagar, D.; Hafez, E.H. A new inverted topp-leone distribution: Applications to the COVID-19 mortality rate in two different countries. Axioms 2021, 10, 25. [Google Scholar] [CrossRef]
  37. Jamal, F.; Reyad, H.; Chesneau, C.; Nasir, M.A.; Othman, S. The Marshall-Olkin odd Lindley-G family of distributions: Theory and applications. Punjab Univ. J. Math. 2020, 51, 111–125. [Google Scholar]
  38. Dey, S.; Sharma, V.K.; Mesfioui, M. A new extension of Weibull distribution with application to lifetime data. Ann. Data Sci. 2017, 4, 31–61. [Google Scholar]
  39. Marshall, A.W.; Olkin, I. A new method for adding a parameter to a family of distributions with application to the exponential and Weibull families. Biometrika 1997, 84, 641–652. [Google Scholar] [CrossRef]
  40. Cordeiro, G.M.; Ortega, E.M.; Nadarajah, S. The Kumaraswamy Weibull distribution with application to failure data. J. Frankl. Inst. 2010, 347, 1399–1429. [Google Scholar] [CrossRef]
Figure 1. Plots of the G ( x ; δ , Θ ) CDF (left) and the S ( x ; δ , Θ ) SF (right) of the NGLog–Weib model.
Figure 1. Plots of the G ( x ; δ , Θ ) CDF (left) and the S ( x ; δ , Θ ) SF (right) of the NGLog–Weib model.
Applsci 13 03668 g001
Figure 2. Plots of the g ( x ; δ , Θ ) PDF of the NGLog–Weib model.
Figure 2. Plots of the g ( x ; δ , Θ ) PDF of the NGLog–Weib model.
Applsci 13 03668 g002
Figure 3. Plots of the h ( x ; δ , Θ ) HF of the NGLog–Weib model.
Figure 3. Plots of the h ( x ; δ , Θ ) HF of the NGLog–Weib model.
Applsci 13 03668 g003
Figure 4. For dataset 1, the (a) histogram, (b) kernel density plot, (c) TTT plot, (d) violin plot, and (e) box plot.
Figure 4. For dataset 1, the (a) histogram, (b) kernel density plot, (c) TTT plot, (d) violin plot, and (e) box plot.
Applsci 13 03668 g004
Figure 5. Estimated PDF, CDF, HF, CHF, SF, PP plot, and QQ plot of the NGLog–Weib distribution for dataset 1.
Figure 5. Estimated PDF, CDF, HF, CHF, SF, PP plot, and QQ plot of the NGLog–Weib distribution for dataset 1.
Applsci 13 03668 g005
Figure 6. For dataset 2, the (a) histogram, (b) kernel density plot, (c) TTT plot, (d) violin plot, and (e) box plot.
Figure 6. For dataset 2, the (a) histogram, (b) kernel density plot, (c) TTT plot, (d) violin plot, and (e) box plot.
Applsci 13 03668 g006
Figure 7. Estimated PDF, CDF, HF, CHF, SF, PP plot and QQ plot of the NGLog–Weib distribution for dataset 2.
Figure 7. Estimated PDF, CDF, HF, CHF, SF, PP plot and QQ plot of the NGLog–Weib distribution for dataset 2.
Applsci 13 03668 g007
Figure 8. For dataset 3, the (a) histogram, (b) kernel density plot, (c) TTT plot, (d) violin plot, and (e) box plot.
Figure 8. For dataset 3, the (a) histogram, (b) kernel density plot, (c) TTT plot, (d) violin plot, and (e) box plot.
Applsci 13 03668 g008
Figure 9. Estimated PDF, CDF, HF, CHF, SF, PP plot, and QQ plot of the NGLog–Weib distribution for dataset 3.
Figure 9. Estimated PDF, CDF, HF, CHF, SF, PP plot, and QQ plot of the NGLog–Weib distribution for dataset 3.
Applsci 13 03668 g009
Figure 10. For dataset 4, the (a) histogram, (b) kernel density plot, (c) TTT plot, (d) violin plot, and (e) box plot.
Figure 10. For dataset 4, the (a) histogram, (b) kernel density plot, (c) TTT plot, (d) violin plot, and (e) box plot.
Applsci 13 03668 g010
Figure 11. Estimated PDF, CDF, HF, CHF, SF, PP plot, and QQ plot of the NGLog–Weib distribution for dataset 4.
Figure 11. Estimated PDF, CDF, HF, CHF, SF, PP plot, and QQ plot of the NGLog–Weib distribution for dataset 4.
Applsci 13 03668 g011
Table 2. SRs of the NGLog–Weib distribution for Set I and Set II.
Table 2. SRs of the NGLog–Weib distribution for Set I and Set II.
nEst. Set   I :   δ = 0.5 ,   β = 2.4 ,   α = 3.6 Set   II :   δ = 3.2 ,   β = 2.7 ,   α = 3.8
MLEMSEBiasMLEMSEBias
25 δ ^ 1.50675.9221.0072.9324.041−0.268
β ^ 2.7540.7070.3542.7150.3810.015
α ^ 3.6260.7800.0264.2370.8110.437
50 δ ^ 1.4315.0880.9312.9043.893−0.295
β ^ 2.6450.3840.2452.6600.275−0.039
α ^ 3.4950.598−0.1054.1780.6840.378
75 δ ^ 1.4584.6800.9282.9783.791−0.222
β ^ 2.6080.2980.2382.6430.201−0.057
α ^ 3.4430.481−0.1574.0940.6120.293
100 δ ^ 1.4614.5130.9102.9193.396−0.281
β ^ 2.6140.2730.2142.6390.182−0.061
α ^ 3.4290.430−0.1704.0930.5510.283
200 δ ^ 1.1122.5310.6113.0452.730−0.156
β ^ 2.5330.1570.1432.6560.119−0.044
α ^ 3.5110.284−0.1064.0190.4240.219
300 δ ^ 1.1121.7670.4423.0692.412−0.131
β ^ 2.5430.1140.0982.6540.099−0.046
α ^ 3.4940.223−0.0793.9620.3360.162
400 δ ^ 0.8861.4570.3873.1231.921−0.078
β ^ 2.4810.0980.0812.6650.078−0.034
α ^ 3.5190.196−0.0803.9230.2630.123
500 δ ^ 0.7881.1120.2883.1561.780−0.044
β ^ 2.4630.0780.0632.6690.069−0.031
α ^ 3.5480.167−0.0523.9040.2330.104
600 δ ^ 0.7330.8830.2333.1851.644−0.015
β ^ 2.4440.0640.0442.6740.059−0.026
α ^ 3.5590.144−0.0423.8810.2010.081
700 δ ^ 0.7190.8040.2193.2211.5290.021
β ^ 2.4430.0610.0432.6860.054−0.014
α ^ 3.5530.132−0.0483.8660.1910.065
800 δ ^ 0.6790.6780.1793.1241.415−0.076
β ^ 2.4390.0540.0392.6710.051−0.029
α ^ 3.5690.118−0.0303.8920.1720.052
900 δ ^ 0.7120.6690.1623.2221.2110.022
β ^ 2.4480.0510.0372.6900.043−0.009
α ^ 3.5550.116−0.0453.8500.1370.040
1000 δ ^ 0.6750.6000.1453.2081.0520.008
β ^ 2.4370.0480.0272.6910.036−0.009
α ^ 3.5680.106−0.0323.8490.1180.039
Table 3. SRs of the NGLog–Weib distribution for Set III and Set IV.
Table 3. SRs of the NGLog–Weib distribution for Set III and Set IV.
nEst. Set   III :   δ = 1.2 ,   β = 3.3 ,   α = 4.5 Set   IV :   δ = 1.5 ,   β = 3.9 ,   α = 4.6
MLEMSEBiasMLEMSEBias
25 δ ^ 1.9414.9110.9942.3974.3830.969
β ^ 3.4670.7570.3564.1360.5790.236
α ^ 4.3860.522−0.1054.5080.453−0.092
50 δ ^ 1.8804.3540.8432.4044.1130.903
β ^ 3.4160.4390.2364.0790.3980.179
α ^ 4.3900.490−0.0754.4260.493−0.174
75 δ ^ 1.8503.8340.7412.2063.4760.736
β ^ 3.4140.3040.1674.0470.3190.147
α ^ 4.3940.486−0.1144.4770.424−0.123
100 δ ^ 1.8463.5390.6802.2243.4080.723
β ^ 3.4170.2340.1164.0280.2690.128
α ^ 4.3860.465−0.1094.4530.417−0.147
200 δ ^ 1.5772.3380.3771.9712.3530.471
β ^ 3.3610.1360.0613.9660.1490.066
α ^ 4.4770.358−0.0234.5220.313−0.078
300 δ ^ 1.6132.0590.3291.9081.8490.374
β ^ 3.3590.1120.0443.9710.1140.045
α ^ 4.4380.329−0.0414.5270.267−0.071
400 δ ^ 1.4871.6950.2871.8371.5590.355
β ^ 3.3390.1030.0393.9490.0820.044
α ^ 4.4680.289−0.0324.5390.234−0.074
500 δ ^ 1.3631.2200.1631.7941.2230.232
β ^ 3.3180.0720.0183.9420.0670.034
α ^ 4.5060.2420.0064.5450.203−0.034
600 δ ^ 1.4041.1140.1441.7781.1400.208
β ^ 3.3250.0640.0253.9430.0590.034
α ^ 4.4860.221−0.0144.5420.191−0.058
700 δ ^ 1.3660.9790.1161.6590.9110.159
β ^ 3.3190.0590.0193.9190.0500.021
α ^ 4.4950.201−0.0054.5890.166−0.011
800 δ ^ 1.2980.8610.0781.7050.9090.135
β ^ 3.3070.0490.0043.9210.0460.021
α ^ 4.5240.1940.0244.5660.155−0.034
900 δ ^ 1.2930.7890.0751.5810.7630.081
β ^ 3.3040.0370.0033.9040.0390.019
α ^ 4.5140.1800.0134.6080.1490.008
1000 δ ^ 1.3180.6810.0611.6490.6790.049
β ^ 3.3060.033−0.0073.9180.0330.018
α ^ 4.5090.1570.0094.5790.126−0.021
Table 4. SRs of the NGLog–Weib distribution for Set V and Set VI.
Table 4. SRs of the NGLog–Weib distribution for Set V and Set VI.
nEst. Set   V :   δ = 1.4 ,   β = 3.5 ,   α = 1.0 Set   VI :   δ = 1.2 ,   β = 0.7 ,   α = 1.0
MLEMSEBiasMLEMSEBias
25 δ ^ 1.9125.4390.5121.8565.5230.656
β ^ 3.7390.6620.2390.7650.1480.073
α ^ 1.0710.0790.0711.0650.0820.065
50 δ ^ 1.8394.7690.4391.8134.9270.613
β ^ 3.6420.4290.1420.7680.1220.068
α ^ 1.0480.0590.0481.0330.0560.033
75 δ ^ 1.8524.4110.3921.7274.3800.527
β ^ 3.5940.3240.0940.7580.1090.058
α ^ 1.0320.0480.0311.0180.0440.018
100 δ ^ 1.7644.0340.3641.6493.7210.449
β ^ 3.5620.2590.0620.7530.0910.053
α ^ 1.0330.0430.0281.0160.0380.016
200 δ ^ 1.6962.9810.2971.5232.9860.323
β ^ 3.5270.1600.0270.7310.0740.031
α ^ 1.0190.0310.0191.0150.0310.015
300 δ ^ 1.5392.2210.2091.4432.1770.243
β ^ 3.4970.122−0.0030.7250.0570.025
α ^ 1.0240.0260.0151.0120.0240.012
400 δ ^ 1.5301.9520.1351.3191.7260.119
β ^ 3.5030.1020.0090.7070.0460.007
α ^ 1.0210.0220.0151.0190.0200.012
500 δ ^ 1.5801.6570.1201.3111.5260.094
β ^ 3.5150.0870.0020.7030.0410.004
α ^ 1.0110.0180.0111.0190.0190.010
600 δ ^ 1.5061.3810.0721.2231.2170.088
β ^ 3.4990.071−0.0050.6950.034−0.005
α ^ 1.0140.0170.0091.0220.0160.009
700 δ ^ 1.4721.1480.0101.2731.1060.073
β ^ 3.4950.059−0.0140.7030.0310.003
α ^ 1.0150.0140.0021.0150.0150.005
800 δ ^ 1.4101.055−0.0091.2090.9540.053
β ^ 3.4860.056−0.0230.6940.0280.002
α ^ 1.0220.0130.0091.0180.0130.001
900 δ ^ 1.3680.9270.0071.2840.7980.084
β ^ 3.4760.052−0.0010.7070.0240.002
α ^ 1.0230.0110.0051.0090.0120.001
1000 δ ^ 1.4310.8780.0021.2530.6880.026
β ^ 3.4960.049−0.0040.7020.019−0.003
α ^ 1.0140.0090.0041.0120.0110.011
Table 6. The β ^ M L E , α ^ M L E , δ ^ M L E , a ^ M L E , and b ^ M L E values for infected guinea pigs data (dataset 1).
Table 6. The β ^ M L E , α ^ M L E , δ ^ M L E , a ^ M L E , and b ^ M L E values for infected guinea pigs data (dataset 1).
Models β ^ M L E α ^ M L E δ ^ M L E a ^ M L E b ^ M L E θ ^ M L E
NGLog–Weib0.014 0.9789.322---
APTra–Weib0.0036.8181.190---
NExpo–Weib0.0041.040----
MO–Weib0.0021.243---2.372
FRL–Weib0.0031.223---8.160
Kumar–Weib0.0140.938-3.7601.334-
Table 7. The GFMs and p-values of the competing models for infected guinea pigs data (dataset 1).
Table 7. The GFMs and p-values of the competing models for infected guinea pigs data (dataset 1).
ModelsCMADKSp-Values
NGLog–Weib0.0790.0920.0860.665
APTra–Weib0.8040.1380.1710.029
NExpo–Weib0.0890.5220.2950.067
MO–Weib0.1610.9410.1730.027
FRLog–Weib0.1921.1240.1820.017
Kumar–Weib0.0820.5230.0920.580
Table 8. The DMs of the competing models for infected guinea pigs data (dataset 1).
Table 8. The DMs of the competing models for infected guinea pigs data (dataset 1).
ModelsAICBICCAICHQIC
NGLog–Weib856.654863.485857.007859.373
APTra–Weib864.127870.957864.479866.846
NExpo–Weib902.709907.262902.883904.521
MO–Weib865.711872.541866.064868.430
FRLog–Weib868.085874.915868.438870.804
Kumar–Weib 859.492868.599860.089863.117
Table 9. The δ ^ M L E , β ^ M L E , α ^ M L E , a ^ M L E , b ^ M L E and θ ^ M L E for head and neck cancer data (dataset 2).
Table 9. The δ ^ M L E , β ^ M L E , α ^ M L E , a ^ M L E , b ^ M L E and θ ^ M L E for head and neck cancer data (dataset 2).
Models β ^ M L E α ^ M L E δ ^ M L E a ^ M L E b ^ M L E θ ^ M L E
NGLog–Weib0.7070.32553.292---
APTra–Weib0.0030.2450.993---
NExpo–Weib0.0031.023----
MO–Weib0.0031.001---0.508
FRL–Weib0.0290.762---5.722
Kumar–Weib0.4160.459-12.6650.490-
Table 10. The GFMs and p-values of the competing models for head and neck cancer data (dataset 2).
Table 10. The GFMs and p-values of the competing models for head and neck cancer data (dataset 2).
ModelsCMADKSp-Values
NGLog–Weib0.0200.1280.0620.992
APTra–Weib0.0930.5540.1060.672
NExpo–Weib0.0870.5150.1010.727
MO–Weib0.0950.5620.1130.593
FRLog–Weib0.1911.0960.1340.379
Kumar–Weib 0.0220.1360.0650.985
Table 11. The DMs of the competing models for head and neck cancer, dataset 2.
Table 11. The DMs of the competing models for head and neck cancer, dataset 2.
ModelsAICBICCAICHQIC
NGLog–Weib560.793566.146561.393562.778
APTra–Weib567.772573.124568.371569.757
NExpo–Weib565.157568.725565.450566.480
MO–Weib568.208573.562568.808570.194
FRLog–Weib572.884578.236573.484574.869
Kumar–Weib 562.711569.848563.737565.358
Table 12. The β ^ M L E , α ^ M L E , δ ^ M L E , a ^ M L E , b ^ M L E and θ ^ M L E values for COVID-19 data (dataset 3).
Table 12. The β ^ M L E , α ^ M L E , δ ^ M L E , a ^ M L E , b ^ M L E and θ ^ M L E values for COVID-19 data (dataset 3).
Models β ^ M L E α ^ M L E δ ^ M L E a ^ M L E b ^ M L E θ ^ M L E
NGLog–Weib0.4771.48122.931---
APTra–Weib0.206171.5421.954---
NExpo–Weib0.0043.909----
MO–Weib0.0392.833---1.928
FRLog–Weib0.0083.569---−0.544
Kumar–Weib 0.6951.064-13.0242.112-
Table 13. The GFMs and p-values of the competing models for COVID-19 data (dataset 3).
Table 13. The GFMs and p-values of the competing models for COVID-19 data (dataset 3).
ModelsCMADKSp-Values
NGLog–Weib0.0920.5360.1060.813
APTra–Weib0.1030.5820.1230.649
NExpo–Weib0.1370.7940.1440.446
MO–Weib0.1821.0350.1410.474
FRL–Weib0.1630.9370.1480.411
Kumar–Weib 0.0930.5390.1060.795
Table 14. The DMs of the competing models for COVID-19 data (dataset 3).
Table 14. The DMs of the competing models for COVID-19 data (dataset 3).
ModelsAICBICCAICHQIC
NGLog–Weib102.028106.778102.778103.686
APTra–Weib103.616108.366104.366105.274
NExpo–Weib104.294107.461104.657105.399
MO–Weib109.443114.195110.193111.101
FRLog–Weib108.532113.283109.282110.190
Kumar–Weib 104.002110.336105.292106.213
Table 15. The β ^ M L E , α ^ M L E , δ ^ M L E , a ^ M L E , b ^ M L E and θ ^ M L E values for the patients receiving analgesics’ data (dataset 4).
Table 15. The β ^ M L E , α ^ M L E , δ ^ M L E , a ^ M L E , b ^ M L E and θ ^ M L E values for the patients receiving analgesics’ data (dataset 4).
Models β ^ M L E α ^ M L E δ ^ M L E a ^ M L E b ^ M L E θ ^ M L E
NGLog–Weib2.0911.00978.683---
APTra–Weib0.0210.0163.620---
NExpo–Weib0.0463.323----
MO–Weib0.0034.408---0.044
FRLog–Weib0.005−0.986---4.828
Kumar–Weib 2.3491.469-33.2850.339-
Table 16. The GFMs and p-values of the competing models for the patients receiving analgesics’ data (dataset 4).
Table 16. The GFMs and p-values of the competing models for the patients receiving analgesics’ data (dataset 4).
ModelsCMADKSp-Values
NGLog–Weib0.0580.3440.1460.850
APTra–Weib0.1320.7790.1580.701
NExpo–Weib0.1530.9040.1810.527
MO–Weib0.0760.4570.1650.802
FRLog–Weib0.1410.8520.2230.273
Kumar–Weib 0.0600.3590.1560.833
Table 17. The DMs of the competing models for the patients receiving analgesics’ data (dataset 4).
Table 17. The DMs of the competing models for the patients receiving analgesics’ data (dataset 4).
ModelsAICBICCAICHQIC
NGLog–Weib38.69941.68740.19939.282
APTra–Weib43.41846.40444.91744.000
NExpo–Weib42.93644.92743.64243.325
MO–Weib40.78143.76942.28341.363
FRLog–Weib44.70447.69146.20345.286
Kumar–Weib 39.92443.90742.59140.702
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Shah, Z.; Khan, D.M.; Khan, Z.; Faiz, N.; Hussain, S.; Anwar, A.; Ahmad, T.; Kim, K.-I. A New Generalized Logarithmic–X Family of Distributions with Biomedical Data Analysis. Appl. Sci. 2023, 13, 3668. https://doi.org/10.3390/app13063668

AMA Style

Shah Z, Khan DM, Khan Z, Faiz N, Hussain S, Anwar A, Ahmad T, Kim K-I. A New Generalized Logarithmic–X Family of Distributions with Biomedical Data Analysis. Applied Sciences. 2023; 13(6):3668. https://doi.org/10.3390/app13063668

Chicago/Turabian Style

Shah, Zubir, Dost Muhammad Khan, Zardad Khan, Nosheen Faiz, Sundus Hussain, Asim Anwar, Tanveer Ahmad, and Ki-Il Kim. 2023. "A New Generalized Logarithmic–X Family of Distributions with Biomedical Data Analysis" Applied Sciences 13, no. 6: 3668. https://doi.org/10.3390/app13063668

APA Style

Shah, Z., Khan, D. M., Khan, Z., Faiz, N., Hussain, S., Anwar, A., Ahmad, T., & Kim, K. -I. (2023). A New Generalized Logarithmic–X Family of Distributions with Biomedical Data Analysis. Applied Sciences, 13(6), 3668. https://doi.org/10.3390/app13063668

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop