A New Truncated Lindley-Generated Family of Distributions: Properties, Regression Analysis, and Applications

Mohamed Hussein; Gabriela M. Rodrigues; Edwin M. M. Ortega; Roberto Vila; Howaida Elsayed

doi:10.3390/e25091359

Abstract

We present the truncated Lindley-G (TLG) model, a novel class of probability distributions with an additional shape parameter, by composing a unit distribution called the truncated Lindley distribution with a parent distribution function

G (x)

. The proposed model’s characteristics including critical points, moments, generating function, quantile function, mean deviations, and entropy are discussed. Also, we introduce a regression model based on the truncated Lindley–Weibull distribution considering two systematic components. The model parameters are estimated using the maximum likelihood method. In order to investigate the behavior of the estimators, some simulations are run for various parameter settings, censoring percentages, and sample sizes. Four real datasets are used to demonstrate the new model’s potential.

Keywords:

censored data; survival function; maximum likelihood; regression model; COVID-19 data

MSC:

62E10; 62F10; 60E05; 62P10; 62J02

1. Introduction

Let G be a cumulative distribution function (cdf) defined on the real line. Several papers have proposed composing a unit distribution with G (a parent cdf) to produce a new cdf. Eugene et al. (2002) [1] combined the cdf of the beta distribution with G to create the Beta-G model with cdf

F (x) = I_{G (x)} (a, b),

where

I_{x} (a, b) = \int_{0}^{x} t^{a - 1} {(1 - t)}^{b - 1} d t / B (a, b)

is the regularized incomplete beta function. Alexander et al. (2012) [2] and Nadarajah et al. (2014b) [3] generalized the Beta-G to the generalized-Beta-G and the modified-Beta-G. Cordeiro and Castro (2011) [4] developed the Kumaraswamy-G model by combining the Kumaraswamy cdf

F (x) = 1 - {(1 - x^{a})}^{b}, x \in [0, 1]

with the parent cdf G.

Based on a valid cdf,

F (x)

for

x \in R

, for any continuous distribution, we can construct a unit distribution as a truncated version of

F (x)

with a cdf (monotonically increasing with

{lim}_{x \to 0} F (x) = 0

and

{lim}_{x \to 1} F (x) = 1

) given by

F_{U T} (x) = \frac{F (x)}{F (1)}, x \in [0, 1],

(1)

The truncated-G (TG) model is constructed by composing this truncated version of the cdf (or its associated survival function

\bar{F} (x)

) with a parent cdf G (or its associated survival function

\bar{G} (x)

) to give the parent distribution additional modeling ability and produce a new family of univariate distributions with cdfs (monotonically increasing with

{lim}_{x \to - \infty} F (x) = 0

and

{lim}_{x \to \infty} F (x) = 1

) given by

F_{1} (x) = F_{U T} (G (x)), x \in R,

(2)

F_{2} (x) = 1 - F_{U T} (1 - G (x)), x \in R .

(3)
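The construction in Equations (1)–(3) amounts to two function compositions. The following is a minimal Python sketch of that idea; the exponential baseline F, the logistic parent G, and all function names are illustrative assumptions, not choices made in this paper. Note that Equation (1) yields a cdf on [0, 1] only when F(0) = 0, which holds for the baseline used here.

```python
import math

def truncate_to_unit(F):
    """Return F_UT(x) = F(x) / F(1) on [0, 1], as in Equation (1)."""
    F_at_1 = F(1.0)
    return lambda x: F(x) / F_at_1

def compose_tg(F_ut, G, scheme=1):
    """Build a TG cdf from a unit cdf F_ut and a parent cdf G.

    scheme=1 gives F_1(x) = F_ut(G(x)), Equation (2);
    scheme=2 gives F_2(x) = 1 - F_ut(1 - G(x)), Equation (3).
    """
    if scheme == 1:
        return lambda x: F_ut(G(x))
    return lambda x: 1.0 - F_ut(1.0 - G(x))

# Illustrative (hypothetical) choices: unit-rate exponential F, logistic parent G.
F_exp = lambda x: 1.0 - math.exp(-x)
G_logistic = lambda x: 1.0 / (1.0 + math.exp(-x))

F_ut = truncate_to_unit(F_exp)
F1 = compose_tg(F_ut, G_logistic, scheme=1)
F2 = compose_tg(F_ut, G_logistic, scheme=2)
```

Both F1 and F2 inherit the support of the parent G, while the truncated baseline contributes the extra shape.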

A list of TG models is given in Table 1.

Table 1. Previous work on TG models.

In this paper, we generate a new family of continuous distributions using a truncated version of the Lindley distribution.

The new family is useful because it provides an additional option for failure time analysis. Although many distributions are already available for this purpose, they differ in their assumptions and properties, and no single distribution fits every dataset well. A new family is therefore valuable when none of the existing options fits the data adequately, and it may also offer advantages in interpretability, flexibility, or computational efficiency, allowing researchers to choose the model best suited to their data and research objectives.

In several research areas (medicine, engineering, biology, agronomy, etc.), failure times are affected by explanatory variables. In this paper, we propose a regression model with censored observations, based on the truncated Lindley–Weibull distribution, which is a feasible alternative for modeling failure time data. Simulation studies are also presented to examine the behavior of maximum likelihood estimation (MLE), as well as the residual analysis of the proposed regression model.

The paper is structured as follows. Section 2 describes the unit truncated Lindley distribution, which is the main component of the proposed new model. We discuss its properties, including moments, mode, quantile function (qf), mean deviations, and generating function. Section 3 discusses the proposed TLG model (linear representation, properties, shapes of the TLG, stochastic representation, the truncated Lindley–Weibull (TLW) submodel, and estimation of the parameters using the maximum likelihood method). In Section 4, we propose a regression model based on the TLW distribution and estimate its parameters using maximum likelihood. We also perform simulation studies for the TLW regression model under different sample sizes and censoring proportions. The application of the TLW regression model is illustrated by examining four real datasets in Section 5. Finally, Section 6 summarizes the results and presents the conclusions.

2. The Unit Truncated Lindley Model

Lindley (1958) [16] first described the Lindley distribution as a lifetime distribution with one parameter. The probability density function (pdf) and the cdf are provided by

\begin{matrix} f_{L} (x; θ) & = & \frac{θ^{2}}{θ + 1} (1 + x) e^{- θ x}, x > 0, θ > 0, and \\ F_{L} (x; θ) & = & 1 - (1 + \frac{θ x}{θ + 1}) e^{- θ x}, x > 0, θ > 0, \end{matrix}

respectively. We suggest a new unit distribution, the unit truncated Lindley (UTL) distribution, based on the cdf of the Lindley distribution, which is a truncated form of

F_{L} (x)

with the cdf and pdf provided by

\begin{matrix} F_{U T L} (x) = C_{θ} [1 + θ - (1 + θ + θ x) e^{- θ x}], x \in [0, 1], θ \neq 0, \end{matrix}

(4)

\begin{matrix} f_{U T L} (x) & = & θ^{2} C_{θ} (1 + x) e^{- θ x}, x \in [0, 1], θ \neq 0, \end{matrix}

(5)

where

C_{θ} = 1 / (1 + θ - e^{- θ} - 2 θ e^{- θ}) > 0

.

The properties of the UTL model are given in Appendix A.
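As a quick numerical sanity check of Equations (4) and (5), the following minimal Python sketch evaluates the UTL cdf and pdf and verifies that the pdf integrates to one on [0, 1]. The function names and the midpoint-rule integrator are hypothetical helpers introduced only for illustration.

```python
import math

def utl_const(theta):
    """Normalizing constant C_theta = 1 / (1 + theta - e^{-theta} - 2 theta e^{-theta})."""
    return 1.0 / (1.0 + theta - math.exp(-theta) - 2.0 * theta * math.exp(-theta))

def utl_cdf(x, theta):
    """UTL cdf, Equation (4), for x in [0, 1] and theta != 0."""
    c = utl_const(theta)
    return c * (1.0 + theta - (1.0 + theta + theta * x) * math.exp(-theta * x))

def utl_pdf(x, theta):
    """UTL pdf, Equation (5), for x in [0, 1] and theta != 0."""
    return theta ** 2 * utl_const(theta) * (1.0 + x) * math.exp(-theta * x)

def midpoint_integral(f, a, b, n=2000):
    """Composite midpoint rule, used here only as a simple check."""
    h = (b - a) / n
    return h * sum(f(a + (i + 0.5) * h) for i in range(n))

# Total probability mass for a positive and a negative theta.
mass_pos = midpoint_integral(lambda x: utl_pdf(x, 2.0), 0.0, 1.0)
mass_neg = midpoint_integral(lambda x: utl_pdf(x, -1.5), 0.0, 1.0)
```

Both positive and negative values of θ give a valid density, in line with the constraint θ ≠ 0.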

3. The Truncated Lindley-G Model

The Truncated Lindley-G (TLG) model is constructed by applying the TG composition scheme (2) on the cdf of the UTL model given in Equation (4), i.e.,

F_{T L G} (x) = F_{U T L} (G (x)) .

That is, the cdf and pdf of the TLG model are given by

\begin{matrix} F_{T L G} (x) & = & C_{θ} [1 + θ - (1 + θ + θ G (x)) e^{- θ G (x)}], x \in R, θ \neq 0, \end{matrix}

(6)

and

\begin{matrix} f_{T L G} (x) & = & θ^{2} C_{θ} g (x) [1 + G (x)] e^{- θ G (x)}, x \in R, θ \neq 0, \end{matrix}

(7)

where

C_{θ} = 1 / (1 + θ - e^{- θ} - 2 θ e^{- θ})

.

The main reason for choosing the unit truncated form of the Lindley distribution is to add a new parameter to the parent distribution to generate a new distribution. The properties of the generated distribution will need further investigation, as they are, generally, different from those of the parent distribution.

Following the expansion

e^{- θ G (x)} = \sum_{i = 0}^{\infty} {(- 1)}^{i} {[θ G (x)]}^{i} / i!

, the TLG cdf (6) has a linear representation in terms of exponentiated-G (EG) cdfs as

\begin{matrix} F_{T L G} (x) = C_{θ} \{1 + θ + \sum_{i = 0}^{\infty} ν_{i} [(θ + 1) H_{i} (x) + θ H_{i + 1} (x)]\} . \end{matrix}

(8)

where

H_{j} (x) = G^{j} (x)

(for

j = i, i + 1

) is the EG cdf with power parameter j.

Differentiating (8) with respect to x, we obtain the linear representation of the TLG pdf as follows:

\begin{matrix} f_{T L G} (x) & = & C_{θ} \{\sum_{i = 0}^{\infty} ν_{i} [(θ + 1) h_{i} (x) + θ h_{i + 1} (x)]\} \end{matrix}

(9)

where

ν_{i} = {(- 1)}^{i + 1} θ^{i} / i!

,

h_{i} (x) = i g (x) G {(x)}^{i - 1}

and

h_{i + 1} (x) = (i + 1) g (x) G {(x)}^{i}

are the EG densities with power parameters i and

i + 1

, respectively. On the basis of the linear representation (9), some TLG models’ properties are similar to the EG properties reported in several references, such as AL-Hussaini and Ehsanullah (2015) [17]. Henceforth,

Y_{i}

denotes that an rv has an EG distribution, with power parameter i and density

h_{i} (x)

.
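The linear representation (8) can be checked numerically against the closed form (6). The sketch below is a minimal Python illustration that works directly with a value G = G(x) in [0, 1]; the function names and the truncation at 40 terms are assumptions made for this example.

```python
import math

def tlg_cdf_closed(G, theta):
    """Closed-form TLG cdf, Equation (6), evaluated at G = G(x)."""
    c = 1.0 / (1.0 + theta - math.exp(-theta) - 2.0 * theta * math.exp(-theta))
    return c * (1.0 + theta - (1.0 + theta + theta * G) * math.exp(-theta * G))

def tlg_cdf_series(G, theta, terms=40):
    """Truncated version of the series in Equation (8), with H_j = G^j."""
    c = 1.0 / (1.0 + theta - math.exp(-theta) - 2.0 * theta * math.exp(-theta))
    total = 1.0 + theta
    for i in range(terms):
        nu_i = (-1.0) ** (i + 1) * theta ** i / math.factorial(i)
        total += nu_i * ((theta + 1.0) * G ** i + theta * G ** (i + 1))
    return c * total

# Largest discrepancy over a small grid of G values and two theta values.
max_gap = max(
    abs(tlg_cdf_closed(G, th) - tlg_cdf_series(G, th))
    for G in (0.0, 0.3, 0.7, 1.0)
    for th in (0.7, -2.0)
)
```

For moderate |θ| the factorial in ν_i makes the series converge quickly, so a few dozen terms are typically enough in double precision.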

3.1. Some Properties of the TLG Model

3.1.1. Critical Points

As

F_{T L G} (x) = F_{U T L} (G (x))

, we have

f_{T L G} (x) = g (x) f_{U T L} (G (x))

. Hence, the derivative of

f_{T L G} (x)

is

\begin{matrix} f_{T L G}^{'} (x) = g^{'} (x) f_{U T L} (G (x)) + g^{2} (x) f_{U T L}^{'} (G (x)) . \end{matrix}

Using the identities

f_{U T L} (y) = θ^{2} C_{θ} (1 + y) e^{- θ y}

and

f_{U T L}^{'} (y) = θ^{2} C_{θ} [1 - θ (1 + y)] e^{- θ y}

, the above identity is written as

\begin{matrix} f_{T L G}^{'} (x) = θ^{2} C_{θ} e^{- θ G (x)} \{g^{'} (x) (1 + G (x)) + g^{2} (x) [1 - θ (1 + G (x))]\} . \end{matrix}

Then, all critical points

x_{0}

of

f_{T L G}

satisfy

f_{T L G}^{'} (x_{0}) = 0

, or equivalently,

\begin{matrix} [g^{'} (x_{0}) - θ g^{2} (x_{0})] (1 + G (x_{0})) + g^{2} (x_{0}) = 0 . \end{matrix}

(10)

Depending on the choice of the cdf G, the above equation can be reduced and its maximum (modes) and minimum points characterized. For an example where the function G is chosen to be the Weibull distribution, see Section 3.2.

3.1.2. Moments

Moments allow the examination of some of the distribution’s most significant features and characteristics. The kth raw moment (for

k = 1, 2, \dots

) of the TLG model is

\begin{matrix} μ_{k}^{'} & = & \int_{- \infty}^{\infty} x^{k} f_{T L G} (x) d x = θ^{2} C_{θ} \int_{- \infty}^{\infty} x^{k} g (x) [1 + G (x)] e^{- θ G (x)} d x \\ = & θ^{2} C_{θ} \int_{0}^{1} {[Q_{G} (y)]}^{k} [1 + y] e^{- θ y} d y, \end{matrix}

where

Q_{G}

is the qf associated with the parent cdf G.

Furthermore, the kth raw moment can be expressed from (9) using the moments of the EG distribution as

μ_{k}^{'} = C_{θ} \{\sum_{i = 0}^{\infty} ν_{i} [(θ + 1) E (Y_{i}^{k}) + θ E (Y_{i + 1}^{k})]\} .

3.1.3. Quantile Function

The qf is a highly desirable property in statistical distributions and is especially helpful in the computation of several values in statistical modeling and inferences. By inverting the cdf of the TLG distribution in (6), the qf for the TLG distribution can be expressed using the qf associated with the parent cdf G as

\begin{matrix} Q_{T L G} (u) & = & Q_{G} \{- 1 - \frac{1}{θ} - \frac{1}{θ} W [(u C_{θ}^{- 1} - θ - 1) e^{- θ - 1}]\}, u \in (0, 1) \end{matrix}

(11)

where W denotes the Lambert W function. Therefore,

X = Q_{T L G} (U)

follows the TLG distribution with pdf (7) if U is a uniform variate on the unit interval.
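Inversion sampling, X = Q_TLG(U), can be illustrated without a Lambert W routine by inverting the UTL cdf numerically and then applying the parent qf. The following minimal Python sketch uses bisection in place of the closed form (11); the function names and the exponential parent are hypothetical choices for this example only.

```python
import math

def utl_cdf(y, theta):
    """UTL cdf on [0, 1], Equation (4)."""
    c = 1.0 / (1.0 + theta - math.exp(-theta) - 2.0 * theta * math.exp(-theta))
    return c * (1.0 + theta - (1.0 + theta + theta * y) * math.exp(-theta * y))

def utl_quantile(u, theta, tol=1e-12):
    """Invert the UTL cdf by bisection (a stand-in for the Lambert W formula)."""
    lo, hi = 0.0, 1.0
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if utl_cdf(mid, theta) < u:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

def tlg_quantile(u, theta, parent_quantile):
    """Q_TLG(u) = Q_G(Q_UTL(u)), since F_TLG(x) = F_UTL(G(x))."""
    return parent_quantile(utl_quantile(u, theta))

# Hypothetical parent: unit-rate exponential distribution.
exp_quantile = lambda p: -math.log(1.0 - p)
exp_cdf = lambda x: 1.0 - math.exp(-x)

x_median = tlg_quantile(0.5, 1.5, exp_quantile)
```

Because the UTL cdf is strictly increasing on [0, 1] for any θ ≠ 0, the bisection works for both signs of θ.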

3.1.4. Mean Deviations

The following relationships can be used to describe, respectively, the mean deviations of X about the mean

μ = E (X)

and the median M.

\begin{matrix} δ_{1} & = & \int_{- \infty}^{\infty} | x - μ | f_{T L G} (x) d x = 2 μ F (μ) - 2 C_{θ} \sum_{i = 0}^{\infty} ν_{i} [(θ + 1) I_{i} (μ, 1) + θ I_{i + 1} (μ, 1)], and \\ δ_{2} & = & \int_{- \infty}^{\infty} | x - M | f_{T L G} (x) d x = μ - 2 C_{θ} \sum_{i = 0}^{\infty} ν_{i} [(θ + 1) I_{i} (M, 1) + θ I_{i + 1} (M, 1)], \end{matrix}

where

I_{j} (t, k)

is the kth incomplete moment of the rv

Y_{j}

that has an EG distribution with power parameter j (i.e.,

Y_{j} \sim h_{j} (x)

).

3.1.5. Moment Generating Function

The mgf of

X \sim T L G

can be expressed in an integral form as

\begin{matrix} M_{X} (t) & = & E (e^{t X}) = \int_{- \infty}^{\infty} e^{t x} f_{T L G} (x) d x \\ = & θ^{2} C_{θ} \int_{- \infty}^{\infty} g (x) [1 + G (x)] e^{- [θ G (x) - t x]} d x \\ = & θ^{2} C_{θ} \int_{0}^{1} [1 + y] e^{- [θ y - t Q_{G} (y)]} d y . \end{matrix}

Furthermore, it can be expressed using the mgf of the EG distribution as

\begin{matrix} M_{X} (t) & = & C_{θ} \{\sum_{i = 0}^{\infty} ν_{i} [(θ + 1) M_{i} (t) + θ M_{i + 1} (t)]\}, \end{matrix}

where

M_{j} (t)

is the mgf of an rv

Y_{j}

that has an EG distribution with power parameter j (

Y_{j} \sim h_{j} (x)

).

3.1.6. Entropy

Entropy measures the change in the uncertainty in physical systems. The Shannon and Rényi entropies are two well-known entropy measurements. Entropy values range from very small to very large, with larger values indicating greater data uncertainty. In this section, we derive the continuous Rényi and Shannon entropies of the TLG distribution. The Rényi entropy,

R (τ)

where

τ > 0

,

τ \neq 1

of the TLG distribution is given by

R (τ) = \frac{1}{1 - τ} log \int_{- \infty}^{\infty} f_{T L G}^{τ} (x) d x = \frac{1}{1 - τ} log [θ^{2 τ} C_{θ}^{τ} \int_{- \infty}^{\infty} g {(x)}^{τ} {[1 + G (x)]}^{τ} e^{- τ θ G (x)} d x] .

It follows from the expansions

{[1 + G (x)]}^{τ} = \sum_{j = 0}^{\infty} \binom{τ}{j} G {(x)}^{j}

and

e^{- τ θ G (x)} = \sum_{i = 0}^{\infty} \frac{{(- 1)}^{i}}{i!} {[τ θ G (x)]}^{i}

that

\begin{matrix} R (τ) & = & \frac{1}{1 - τ} log [θ^{2 τ} C_{θ}^{τ} \sum_{j = 0}^{\infty} \sum_{i = 0}^{\infty} \binom{τ}{j} \frac{{(- 1)}^{i} {(τ θ)}^{i}}{i!} \int_{- \infty}^{\infty} g {(x)}^{τ} G {(x)}^{i + j} d x] . \end{matrix}

The Shannon entropy of the TLG distribution is given by

\begin{matrix} η = - E [log f_{T L G} (X)] = - log (θ^{2} C_{θ}) - E [log g (X)] - E [log (1 + G (X))] + θ E [G (X)], \end{matrix}

using the expansion

\begin{matrix} log [1 + G (x)] = \sum_{i = 1}^{\infty} \frac{{(- 1)}^{i + 1}}{i} G^{i} (x), \end{matrix}

we have

\begin{matrix} η = - log (θ^{2} C_{θ}) + η_{G} - \sum_{i = 1}^{\infty} \frac{{(- 1)}^{i + 1}}{i} E [G^{i} (X)] + θ E [G (X)], \end{matrix}

where

η_{G}

is the Shannon entropy for the parent distribution. Since

G (X) \sim U (0, 1)

, then

\begin{matrix} η & = & - log (θ^{2} C_{θ}) + η_{G} - \sum_{i = 1}^{\infty} \frac{{(- 1)}^{i + 1}}{i (i + 1)} + \frac{θ}{2}, \\ = & - log (θ^{2} C_{θ}) + η_{G} + 1 - 2 log 2 + \frac{θ}{2} . \end{matrix}
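The constant 1 − 2 log 2 in the last line is the negative of the alternating series Σ (−1)^{i+1} / [i (i + 1)]. Its value, 2 log 2 − 1, follows from the partial fractions 1 / [i (i + 1)] = 1/i − 1/(i + 1) together with Σ (−1)^{i+1} / i = log 2. A minimal Python check of that constant (the helper name is hypothetical):

```python
import math

def alternating_partial_sum(n_terms):
    """Partial sum of sum_{i>=1} (-1)^{i+1} / (i (i + 1))."""
    return sum((-1.0) ** (i + 1) / (i * (i + 1)) for i in range(1, n_terms + 1))

# Closed-form limit of the series.
series_limit = 2.0 * math.log(2.0) - 1.0

# The truncation error of an alternating series is bounded by the next term.
gap = abs(alternating_partial_sum(100000) - series_limit)
```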

3.2. Truncated Lindley–Weibull (TLW) Model

Suppose the parent distribution is the Weibull distribution with shape parameter

k > 0

and scale parameter

λ > 0

, whose cdf and pdf are given by

\begin{matrix} \begin{matrix} G (x) & = & G (x; k, λ) = 1 - e^{- {(x / λ)}^{k}}, and \\ g (x) & = & g (x; k, λ) = \frac{k}{λ} {(\frac{x}{λ})}^{k - 1} e^{- {(x / λ)}^{k}}, x > 0 . \end{matrix} \end{matrix}

(12)

The cdf and pdf of the truncated Lindley–Weibull (TLW) model are given by

F_{T L W} (x) = C_{θ} \{1 + θ - [1 + θ + θ (1 - e^{- {(x / λ)}^{k}})] e^{- θ (1 - e^{- {(x / λ)}^{k}})}\}, x, k, λ > 0, θ \neq 0, and

(13)

\begin{matrix} f_{T L W} (x) = \frac{k θ^{2} C_{θ}}{λ} {(\frac{x}{λ})}^{k - 1} (2 - e^{- {(x / λ)}^{k}}) e^{- {(x / λ)}^{k} - θ (1 - e^{- {(x / λ)}^{k}})}, x, k, λ > 0, θ \neq 0, \end{matrix}

(14)

respectively, where

C_{θ}

is as in Equation (7). Note that

\begin{matrix} lim_{x \to 0^{+}} f_{T L W} (x) = \{\begin{matrix} \infty, & k < 1, \\ \frac{θ^{2} C_{θ}}{λ}, & k = 1, \\ 0, & k > 1, \end{matrix} and lim_{x \to \infty} f_{T L W} (x) = 0 . \end{matrix}

(15)
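Equations (13)–(15) can be cross-checked numerically. The sketch below is a minimal Python illustration, with hypothetical function names, that evaluates the TLW cdf and pdf so that the pdf can be compared with a finite-difference derivative of the cdf and with the k = 1 limit in (15).

```python
import math

def tlw_const(theta):
    """C_theta as defined after Equation (7)."""
    return 1.0 / (1.0 + theta - math.exp(-theta) - 2.0 * theta * math.exp(-theta))

def tlw_cdf(x, theta, k, lam):
    """TLW cdf, Equation (13)."""
    G = 1.0 - math.exp(-((x / lam) ** k))
    return tlw_const(theta) * (1.0 + theta - (1.0 + theta + theta * G) * math.exp(-theta * G))

def tlw_pdf(x, theta, k, lam):
    """TLW pdf, Equation (14)."""
    z = (x / lam) ** k
    scale = k * theta ** 2 * tlw_const(theta) / lam
    return (scale * (x / lam) ** (k - 1.0) * (2.0 - math.exp(-z))
            * math.exp(-z - theta * (1.0 - math.exp(-z))))

# Central finite difference of the cdf at an illustrative point.
theta, k, lam, x0, h = 1.2, 2.0, 1.5, 1.3, 1e-5
fd_slope = (tlw_cdf(x0 + h, theta, k, lam) - tlw_cdf(x0 - h, theta, k, lam)) / (2.0 * h)
```

The parameter values above are arbitrary and serve only to exercise the formulas.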

The TLW model’s pdf is shown in Figure 1 for various values of

θ, k

, and

λ

. Figure 1 illustrates how the TLW distribution’s density function is flexible and changes in shape depending on the parameter values.

Figure 1. The pdf of the TLW model.

3.2.1. Shapes of the TLW pdf

Considering G and g as given in (12), the Equation (10) of critical points is written as

\begin{matrix} 0 = [g^{'} (x_{0}) - θ g^{2} (x_{0})] (1 + G (x_{0})) + g^{2} (x_{0}) = [g^{'} (x_{0}) - θ g^{2} (x_{0})] [2 - e^{- {(x_{0} / λ)}^{k}}] + g^{2} (x_{0}) . \end{matrix}

As

g^{'} (x) = - g (x) \{k [{(x / λ)}^{k} - 1] + 1\} / x

, the above identity becomes

\begin{matrix} 0 = - g (x_{0}) \{\frac{k [{(x_{0} / λ)}^{k} - 1] + 1}{x_{0}} + θ g (x_{0})\} [2 - e^{- {(x_{0} / λ)}^{k}}] + g^{2} (x_{0}) . \end{matrix}

Since

g (x) = (k / λ) {(x / λ)}^{k - 1} e^{- {(x / λ)}^{k}}

and

g (x_{0}) > 0

for each

x_{0} > 0

, the above identity is equivalently written as

\begin{matrix} A (z_{0}) = B_{θ, k} (z_{0}), \end{matrix}

(16)

where for

z_{0} = {(x_{0} / λ)}^{k}

and

θ \neq 0

, we denote

\begin{matrix} A (z_{0}) \equiv - z_{0} e^{- z_{0}}, B_{θ, k} (z_{0}) \equiv \frac{2}{θ} \frac{z_{0} (1 - e^{- z_{0}})}{2 - e^{- z_{0}}} + τ^{*} and τ^{*} \equiv \frac{1}{θ} (\frac{1 - k}{k}) . \end{matrix}

A simple calculation shows that the function

z_{0} \mapsto B_{θ, k} (z_{0})

is increasing (respectively, decreasing) when

θ > 0

(respectively,

θ < 0

). Furthermore, notice that the function

z_{0} \mapsto A (z_{0})

reaches the minimum value

- 1 / e

at

z_{0} = 1

. Using the graphs of the functions A and

B_{θ, k}

, and varying the parameters

θ

and

τ^{*}

, we can find the points of intersection of both graphs. Therefore, we can compactly classify the number of roots of Equation (16), as indicated in Table 2.

Table 2. Number of roots of equation

A (z_{0}) = B_{θ, k} (z_{0})

in (16) when varying the parameters

θ

and

τ^{*}

.

Based on Table 2, in what follows we divide our analysis into the following cases.

1

If

θ > 0

(a): and $τ^{*} ⩽ - 1 / e$ , then $k > 1$ and, by Table 2, there is a single root, $z_{0} = {(x_{0} / λ)}^{k}$ , of Equation (16). That is, $x_{0} = λ z_{0}^{1 / k}$ , with $k > 1$ , is a single critical point of $f_{T L W}$ . But, by (15), ${lim}_{x \to 0^{+}} f_{T L W} (x) = {lim}_{x \to \infty} f_{T L W} (x) = 0$ for $k > 1$ . Consequently, $x_{0}$ is a single maximum point of the TLW pdf. Hence, for $θ > 0$ and $τ^{*} ⩽ - 1 / e$ , the TLW pdf is unimodal with mode $x_{0}$ .
(b): and $- 1 / e < τ^{*} < 0$ , then $k > 1$ . Following the same steps as in Item 1(a) we have that $f_{T L W}$ is unimodal.
(c): and $τ^{*} ⩾ 0$ , then $k ⩽ 1$ and, by Table 2, there is no root of Equation (16). I.e., there is no critical point of $f_{T L W}$ . But, by (15), ${lim}_{x \to 0^{+}} f_{T L W} (x) = \infty$ for $k < 1$ (and $= θ^{2} C_{θ} / λ$ for $k = 1$ ) and ${lim}_{x \to \infty} f_{T L W} (x) = 0$ . Consequently, for $θ > 0$ and $τ^{*} ⩾ 0$ , the TLW pdf is decreasing.

2

If

θ < 0

(a): and $τ^{*} ⩽ - 1 / e$ , then $k < 1$ . Following the same steps as in Item 1(c) we have that $f_{T L W}$ is decreasing.
(b): and $- 1 / e < τ^{*} < 0$ , then $k < 1$ and, by Table 2, there are two roots, $z_{0} = {(x_{0} / λ)}^{k}$ and $z_{1} = {(x_{1} / λ)}^{k}$ , of Equation (16). In other words, $x_{0} = λ z_{0}^{1 / k}$ and $x_{1} = λ z_{1}^{1 / k}$ , with $k < 1$ , are two critical points of $f_{T L W}$ . Without loss of generality, assume that $x_{0} < x_{1}$ . By (15), ${lim}_{x \to 0^{+}} f_{T L W} (x) = \infty$ , for $k < 1$ , and ${lim}_{x \to \infty} f_{T L W} (x) = 0$ . Consequently, $f_{T L W}$ has a decreasing–increasing–decreasing shape with minimum point $x_{0}$ and maximum point $x_{1}$ .
(c): and $τ^{*} ⩾ 0$ , then $k ⩾ 1$ . Following the same steps as in Item 1(a) we have that $f_{T L W}$ is unimodal.

Table 3 summarizes the shapes of

f_{T L W}

obtained in Items 1 and 2 above.

Table 3. Shapes of TLW pdf when varying the parameters

θ

and

τ^{*}

.

Note that the parameters

θ

and

τ^{*}

obtained from Figure 1 obey the pdf shapes obtained in Table 3.

By way of illustration in Figure 2, we represent the shapes of the TLW pdf shown in Table 3.

Figure 2. Regions of the Cartesian plane

θ τ^{*}

where different forms of the TLW pdf occur.

3.2.2. Stochastic Representation

Let X and Y be two random variables with TLW and UTL distributions, respectively. As

F_{T L W} (x) = F_{U T L} (G (x))

with

G (x) = 1 - e^{- {(x / λ)}^{k}}

, we obtain

\begin{matrix} F_{T L W} (x) = P (X ⩽ x) = P (Y ⩽ G (x)) = P (G^{- 1} (Y) ⩽ x) = P (λ {[- log (1 - Y)]}^{1 / k} ⩽ x), \forall x . \end{matrix}

Therefore, X has the stochastic representation

\begin{matrix} X \overset{d}{=} λ {[- log (1 - Y)]}^{1 / k}, \end{matrix}

with

\overset{d}{=}

being equality in distribution. In addition to generating random numbers, a stochastic representation is useful for determining moments, characteristic functions, quantiles, etc.

3.3. Maximum Likelihood Estimation

Let

x_{1}, \dots, x_{n}

represent the observed values from the TLW model with the pdf given in (14). For the vector of parameters

Θ = {(θ, k, λ)}^{⊤}

, the log-likelihood function is provided by

\begin{matrix} ℓ = ℓ (Θ) & = & n [log θ^{2} + log C_{θ} + log k - k log λ] + (k - 1) \sum_{i = 1}^{n} log x_{i} - \sum_{i = 1}^{n} {(x_{i} / λ)}^{k} \\ & & + \sum_{i = 1}^{n} log [2 - e^{- {(x_{i} / λ)}^{k}}] - θ \sum_{i = 1}^{n} [1 - e^{- {(x_{i} / λ)}^{k}}] . \end{matrix}

(17)

The following are the elements comprising the score vector

U (Θ)

\begin{matrix} U_{θ} & = & n [\frac{2}{θ} - \frac{1 - e^{- θ} + 2 θ e^{- θ}}{1 + θ - e^{- θ} - 2 θ e^{- θ}}] - \sum_{i = 1}^{n} [1 - e^{- {(x_{i} / λ)}^{k}}], \\ U_{k} & = & \frac{n}{k} - n log λ + \sum_{i = 1}^{n} log x_{i} - \frac{1}{λ^{k}} \sum_{i = 1}^{n} x_{i}^{k} (log x_{i} - log λ) + \frac{1}{λ^{k}} \sum_{i = 1}^{n} \frac{x_{i}^{k} (log x_{i} - log λ) e^{- {(x_{i} / λ)}^{k}}}{2 - e^{- {(x_{i} / λ)}^{k}}} \\ & & - \frac{θ}{λ^{k}} \sum_{i = 1}^{n} x_{i}^{k} (log x_{i} - log λ) e^{- {(x_{i} / λ)}^{k}}, \\ U_{λ} & = & - \frac{n k}{λ} + \frac{k}{λ^{k + 1}} \sum_{i = 1}^{n} x_{i}^{k} - \frac{k}{λ^{k + 1}} \sum_{i = 1}^{n} \frac{x_{i}^{k} e^{- {(x_{i} / λ)}^{k}}}{2 - e^{- {(x_{i} / λ)}^{k}}} + \frac{k θ}{λ^{k + 1}} \sum_{i = 1}^{n} x_{i}^{k} e^{- {(x_{i} / λ)}^{k}} . \end{matrix}

The MLEs of the three parameters can be obtained by setting the preceding score equations to zero and solving them simultaneously. Since a closed-form estimator for

Θ

does not appear to be available, (17) can instead be maximized directly, as a multidimensional nonlinear unconstrained function, using a numerical optimization method such as BFGS (a quasi-Newton method), SANN, Nelder–Mead, or CG, to find the maximum likelihood estimates of

Θ = {(θ, k, λ)}^{⊤}

.
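Before handing the log-likelihood to an optimizer, it is useful to confirm that an analytic score agrees with finite differences. The following minimal Python sketch builds the log-likelihood by summing the log of the TLW pdf (14) and differentiates it term by term; the function names, the toy data, and the parameter values are assumptions made for this illustration.

```python
import math

def tlw_loglik(data, theta, k, lam):
    """Sum of log f_TLW(x_i), with f_TLW taken from Equation (14)."""
    d = 1.0 + theta - math.exp(-theta) - 2.0 * theta * math.exp(-theta)
    total = 0.0
    for x in data:
        z = (x / lam) ** k
        total += (math.log(k) + 2.0 * math.log(abs(theta)) - math.log(d) - math.log(lam)
                  + (k - 1.0) * math.log(x / lam) + math.log(2.0 - math.exp(-z))
                  - z - theta * (1.0 - math.exp(-z)))
    return total

def tlw_score(data, theta, k, lam):
    """Analytic gradient (U_theta, U_k, U_lambda) of tlw_loglik."""
    n = len(data)
    d = 1.0 + theta - math.exp(-theta) - 2.0 * theta * math.exp(-theta)
    u_theta = n * (2.0 / theta - (1.0 - math.exp(-theta) + 2.0 * theta * math.exp(-theta)) / d)
    u_k = n / k
    u_lam = -n * k / lam
    for x in data:
        z = (x / lam) ** k
        e = math.exp(-z)
        log_ratio = math.log(x / lam)
        dz = e / (2.0 - e) - 1.0 - theta * e   # derivative of the z-dependent terms
        u_theta -= 1.0 - e
        u_k += log_ratio + dz * z * log_ratio  # uses dz/dk = z log(x / lam)
        u_lam -= dz * k * z / lam              # uses dz/dlam = -k z / lam
    return u_theta, u_k, u_lam

def numeric_score(data, theta, k, lam, h=1e-6):
    """Central finite differences of tlw_loglik, for comparison only."""
    f = tlw_loglik
    return (
        (f(data, theta + h, k, lam) - f(data, theta - h, k, lam)) / (2.0 * h),
        (f(data, theta, k + h, lam) - f(data, theta, k - h, lam)) / (2.0 * h),
        (f(data, theta, k, lam + h) - f(data, theta, k, lam - h)) / (2.0 * h),
    )

toy_data = [0.5, 1.2, 2.0, 0.8, 1.7]  # hypothetical observations
analytic = tlw_score(toy_data, 1.3, 1.8, 1.4)
numeric = numeric_score(toy_data, 1.3, 1.8, 1.4)
```

In practice the negative of `tlw_loglik` would be passed to a general-purpose optimizer, optionally together with the analytic score.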

3.4. Monte Carlo Simulation

By generating n observations from the TLW distribution with varying parameter values, we conduct simulations to assess the performance of the MLEs of the TLW distribution parameters. The BFGS method, as implemented in the R software, is used to estimate the parameter values. The sample sizes considered are n = 20, 50, 100, 150, and 300, and the number of replicates is N = 5000. The simulation results are evaluated using the mean absolute bias (MAB), the mean square error (MSE), and the average estimates (AEs), where for

Θ = {(θ, k, λ)}^{⊤}

we have

M A B (\hat{Θ}) = \frac{1}{N} \sum_{i = 1}^{N} | {\hat{Θ}}_{i} - Θ |, M S E (\hat{Θ}) = \frac{1}{N} \sum_{i = 1}^{N} {({\hat{Θ}}_{i} - Θ)}^{2}, A E (\hat{Θ}) = \frac{1}{N} \sum_{i = 1}^{N} {\hat{Θ}}_{i} .

(18)

The results in Table 4 and Table 5 show that the AEs approach the true values and that the MABs and MSEs decrease toward zero as n increases, which supports the consistency of the MLEs of the TLW parameters.

Table 4. Average estimates from simulations of the TLW distribution.

Table 5. MABs and MSEs from simulations of the TLW distribution.

Using Equation (11), for the Weibull distribution we have

Q_{W} (u) = λ {[- log (1 - u)]}^{\frac{1}{k}}

, implying that the qf of the TLW distribution is

Q_{T L W} (u) = λ {\{- log [2 + \frac{1}{θ} + \frac{1}{θ} W_{- 1} ((u C_{θ}^{- 1} - θ - 1) e^{- θ - 1})]\}}^{\frac{1}{k}} .

The data are generated from

X = λ {\{- log [2 + \frac{1}{θ} + \frac{1}{θ} W_{- 1} ((U C_{θ}^{- 1} - θ - 1) e^{- θ - 1})]\}}^{\frac{1}{k}}, U \sim U (0, 1) .
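The generation formula above can be sketched in Python as follows. This is a minimal illustration under stated assumptions: it takes θ > 0, for which the argument of W lies in (−1/e, 0) and the lower branch W_{−1} applies; the lower branch is computed by a simple bisection written for this example rather than by a library routine; and all function names and parameter values are hypothetical.

```python
import math
import random

def lambert_w_minus1(z, iters=200):
    """Lower branch W_{-1}(z) for -1/e < z < 0, located by bisection on w <= -1."""
    g = lambda w: w * math.exp(w)
    hi, lo = -1.0, -2.0
    while g(lo) < z:          # g rises toward 0 as w -> -infinity; walk left to bracket
        lo *= 2.0
    for _ in range(iters):
        mid = 0.5 * (lo + hi)
        if g(mid) > z:        # mid lies to the left of the root
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

def tlw_cdf(x, theta, k, lam):
    """TLW cdf, Equation (13)."""
    c = 1.0 / (1.0 + theta - math.exp(-theta) - 2.0 * theta * math.exp(-theta))
    G = 1.0 - math.exp(-((x / lam) ** k))
    return c * (1.0 + theta - (1.0 + theta + theta * G) * math.exp(-theta * G))

def tlw_quantile(u, theta, k, lam):
    """Closed-form TLW qf via W_{-1}; this sketch assumes theta > 0."""
    c = 1.0 / (1.0 + theta - math.exp(-theta) - 2.0 * theta * math.exp(-theta))
    w = lambert_w_minus1((u / c - theta - 1.0) * math.exp(-theta - 1.0))
    one_minus_y = 2.0 + 1.0 / theta + w / theta
    return lam * max(0.0, -math.log(one_minus_y)) ** (1.0 / k)

rng = random.Random(2023)  # fixed seed so the draw is reproducible
draws = [tlw_quantile(rng.random(), 0.6, 1.5, 2.0) for _ in range(5)]
```

Feeding each generated value back through the cdf recovers the uniform variate, which gives a convenient self-check of the quantile formula.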

4. The TLW Regression Model with Censored Data and Two Systematic Components

Statistical analysis of lifetimes is an important topic in areas such as medicine, biology, epidemiology, and engineering. Failure time refers to the time until the occurrence of an event of interest, such as death, the appearance of a tumor, the development of a disease, or the breakdown of an electronic component.

We relate the parameters

λ

and k to

v = {(v_{1}, \dots, v_{p})}^{T}

covariates by the logarithm link function

λ_{i} = exp (v_{i}^{T} β_{1}) and k_{i} = exp (v_{i}^{T} β_{2}), i = 1, \dots, n,

respectively, where

β_{1} = {(β_{11}, \dots, β_{1 p})}^{T}

and

β_{2} = {(β_{21}, \dots, β_{2 p})}^{T}

denote the vectors of regression coefficients and

v_{i}^{T} = (v_{i 1}, \dots, v_{i p})

.

The survival function of

X | v

is given by

\begin{matrix} S (x | v) = 1 - c_{θ} \{1 + θ - [1 + θ + ω (x | v)] exp [- ω (x | v)]\}, \end{matrix}

(19)

where

ω (x | v) = θ \{1 - exp [- {(\frac{x}{exp (v^{T} β_{1})})}^{exp (v^{T} β_{2})}]\} .

Equation (19) is referred to as the TLW parametric regression model. This regression model opens new possibilities for fitting many different types of data.

Consider a sample

(x_{1}, v_{1}), \dots, (x_{n}, v_{n})

of n independent observations, where each random response is defined by

x_{i} = \min \{x_{i}^{*}, c_{i}\}

, where

c_{1}, \dots, c_{n}

are the censoring times and

x_{1}^{*}, \dots, x_{n}^{*}

are the observed lifetimes. We assume non-informative censoring such that the observed lifetimes and censoring times are independent. Let F and C be the sets of individuals for which

x_{i}

is the lifetime or censoring, respectively. The total log-likelihood function for

τ = {(θ, β_{1}^{T}, β_{2}^{T})}^{T}

reduces to

\begin{matrix} l (τ) & = & r log (θ^{2} c_{θ}) + \sum_{i \in F} log (\frac{k_{i}}{λ_{i}^{k_{i}}}) + \sum_{i \in F} (k_{i} - 1) log (x_{i}) - \sum_{i \in F} {(\frac{x_{i}}{λ_{i}})}^{k_{i}} + \\ & & \sum_{i \in F} log \{2 - exp [- {(\frac{x_{i}}{λ_{i}})}^{k_{i}}]\} - \sum_{i \in F} q (x_{i} | v_{i}) + \\ & & \sum_{i \in C} log \{1 - c_{θ} \{1 + θ - [1 + θ + q (x_{i} | v_{i})] exp [- q (x_{i} | v_{i})]\}\}, \end{matrix}

(20)

where r is the number of uncensored observations (failures) and

q (x_{i} | v_{i}) = θ \{1 - exp [- {(\frac{x_{i}}{λ_{i}})}^{k_{i}}]\}

. By maximizing the log-likelihood (20), the MLE of the vector of unknown parameters can be calculated. We use the R software to determine

\hat{τ}

.
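The censored log-likelihood (20) can be sketched in Python as follows. This is a minimal illustration, not the code used for the analyses in this paper: the function name is hypothetical, each covariate vector is assumed to include a leading 1 for the intercept, and `delta` is assumed to be 1 for failures and 0 for censored observations.

```python
import math

def tlw_reg_loglik(theta, beta1, beta2, times, delta, covariates):
    """Log-likelihood (20) with log links for lambda_i and k_i."""
    c = 1.0 / (1.0 + theta - math.exp(-theta) - 2.0 * theta * math.exp(-theta))
    total = 0.0
    for x, d, v in zip(times, delta, covariates):
        lam = math.exp(sum(b * vj for b, vj in zip(beta1, v)))
        k = math.exp(sum(b * vj for b, vj in zip(beta2, v)))
        z = (x / lam) ** k
        q = theta * (1.0 - math.exp(-z))
        if d == 1:
            # Failure: contributes the log density.
            total += (math.log(theta ** 2 * c) + math.log(k) - k * math.log(lam)
                      + (k - 1.0) * math.log(x) - z
                      + math.log(2.0 - math.exp(-z)) - q)
        else:
            # Censored: contributes the log survival function (19).
            total += math.log(1.0 - c * (1.0 + theta - (1.0 + theta + q) * math.exp(-q)))
    return total

# Hypothetical toy data: intercept plus one binary covariate.
toy_times = [0.9, 1.6, 2.4, 0.7]
toy_delta = [1, 0, 1, 1]
toy_cov = [(1.0, 0.0), (1.0, 1.0), (1.0, 1.0), (1.0, 0.0)]
toy_value = tlw_reg_loglik(0.6, (0.3, 0.4), (0.2, 0.5), toy_times, toy_delta, toy_cov)
```

Maximizing this function over θ and the regression coefficients gives the estimates; the choice of optimizer is left open in this sketch.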

4.1. Residual Analysis

For the TLW regression model with censored observations, we present two types of residuals to evaluate deviations from the error assumptions and detect outliers. The deviance residuals have been used more frequently in the literature because they take into account the information of censored times. The TLW regression model can also use these residuals. A reliable method for detecting atypical observations and confirming that the fitted model is adequate is to plot the deviance residual against the observed times. It is possible to express the deviance residual as

\begin{matrix} r_{D_{i}} = sign (r_{M_{i}}) \{- 2 [r_{M_{i}} + δ_{i} log (δ_{i} - r_{M_{i}})]\}^{1 / 2}, \end{matrix}

(21)

where

\begin{matrix} r_{M_{i}} = \{\begin{matrix} 1 + log \{1 - c_{\hat{θ}} \{1 + \hat{θ} - [1 + \hat{θ} + \hat{q} (x_{i} | v_{i})] exp [- \hat{q} (x_{i} | v_{i})]\}\} & if δ_{i} = 1, \\ log \{1 - c_{\hat{θ}} \{1 + \hat{θ} - [1 + \hat{θ} + \hat{q} (x_{i} | v_{i})] exp [- \hat{q} (x_{i} | v_{i})]\}\} & if δ_{i} = 0, \end{matrix} \end{matrix}

is the martingale residual,

δ_{i} = 1

means that the observation is uncensored,

δ_{i} = 0

means that the observation is censored and

\hat{q} (x_{i} | v_{i}) = \hat{θ} \{1 - exp [- {(\frac{x_{i}}{{\hat{λ}}_{i}})}^{{\hat{k}}_{i}}]\} .
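Since the martingale residual equals δ_i plus the log of the fitted survival function (19), both residuals can be computed from the fitted survival value alone. The following minimal Python sketch illustrates Equation (21); the function names are hypothetical, and `surv` is assumed to be the fitted survival probability at x_i, strictly between 0 and 1.

```python
import math

def martingale_residual(surv, delta):
    """r_M = delta + log S_hat(x | v)."""
    return delta + math.log(surv)

def deviance_residual(surv, delta):
    """r_D from Equation (21); the log term is only present when delta = 1."""
    r_m = martingale_residual(surv, delta)
    inner = r_m + (math.log(delta - r_m) if delta == 1 else 0.0)
    sign = (r_m > 0) - (r_m < 0)
    # max(...) guards against tiny positive rounding errors in `inner`.
    return sign * math.sqrt(max(0.0, -2.0 * inner))

# Illustrative values: an early failure, a late failure, and a censored time.
r_early = deviance_residual(0.9, 1)
r_late = deviance_residual(0.1, 1)
r_cens = deviance_residual(0.5, 0)
```

Censored observations always have non-positive residuals, while failures can fall on either side of zero.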

4.2. Simulation Study

To verify the accuracy of the MLEs of the TLW regression model, we carried out a simulation study for different censoring percentages and sample sizes

n = 100

, 300, and 500. For each sample size, we carried out N = 1000 replicates and considered the approximate censoring percentages: 0%, 10% and 30%. A covariate

v_{1} \sim

binomial

(1, 0.5)

is included through the following systematic components:

λ_{i} = exp (β_{10} + β_{11} v_{1 i}) and k_{i} = exp (β_{20} + β_{21} v_{1 i}) .

The inverse transformation method is used to obtain the lifetimes

x_{1}, \dots, x_{n}

from the TLW

(λ_{i}, k_{i}, θ)

distribution, and the censoring times

c_{1}, \dots, c_{n}

are determined from a uniform distribution

(0, γ)

, where

γ

controls the censoring percentages. The true values used for generation are

β_{10} = 0.3

,

β_{11} = 0.4

,

β_{20} = 0.2

,

β_{21} = 0.5

, and

θ = 0.6

.

The results are assessed for

{\hat{τ}}^{⊤} = ({\hat{β}}_{10}, {\hat{β}}_{11}, {\hat{β}}_{20}, {\hat{β}}_{21}, \hat{θ})

using the MABs, MSEs, and AEs given in (18), where here

Θ = τ

. The simulation process is as follows:

(i) Generate

v_{1 i} \sim

binomial

(n, 1, 0.5)

;

(ii) Calculate

λ_{i} = exp (β_{10} + β_{11} v_{1 i})

and

k_{i} = exp (β_{20} + β_{21} v_{1 i})

;

(iii) Generate

x_{i}^{*} \sim

TLW

(n, λ_{i}, k_{i}, θ)

;

(iv) Generate

c_{i} \sim uniform (0, γ)

;

(v) Calculate the survival times

x_{i} = \min (x_{i}^{*}, c_{i})

;

(vi) If

x_{i}^{*} < c_{i}

, then

δ_{i} = 1

; otherwise,

δ_{i} = 0

, for

i = 1, \dots, n

, where

δ

is the censoring indicator.

(vii) Calculate the AEs, MABs, and MSEs.

Table 6 displays these values. It is verified that for all scenarios the averages of the estimates approach the true values of the parameters and the MABs and MSEs decrease as the sample size increases. These results illustrate that the estimates are consistent, even at higher censoring percentages.

Table 6. Simulation results of TLW regression models for different censoring percentages (%) with true values:

β_{10} = 0.3

,

β_{11} = 0.4

,

β_{20} = 0.2

,

β_{21} = 0.5

, and

θ = 0.6

.
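Steps (i)–(vi) of the simulation process above can be sketched in Python as follows. This is a minimal illustration under stated assumptions: the UTL variate is obtained by bisection on the cdf (4) and then mapped through the Weibull qf, which is the stochastic representation of the TLW model; the value γ = 10 and all function names are hypothetical and were not taken from the study.

```python
import math
import random

def utl_cdf(y, theta):
    """UTL cdf on [0, 1], Equation (4)."""
    c = 1.0 / (1.0 + theta - math.exp(-theta) - 2.0 * theta * math.exp(-theta))
    return c * (1.0 + theta - (1.0 + theta + theta * y) * math.exp(-theta * y))

def utl_quantile(u, theta, iters=60):
    """Invert the UTL cdf by bisection."""
    lo, hi = 0.0, 1.0
    for _ in range(iters):
        mid = 0.5 * (lo + hi)
        if utl_cdf(mid, theta) < u:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

def simulate_censored_tlw(n, beta1, beta2, theta, gamma, seed=0):
    """Return a list of (x_i, delta_i, v_1i) following steps (i)-(vi)."""
    rng = random.Random(seed)
    rows = []
    for _ in range(n):
        v = 1 if rng.random() < 0.5 else 0                    # (i) binary covariate
        lam = math.exp(beta1[0] + beta1[1] * v)               # (ii) systematic components
        k = math.exp(beta2[0] + beta2[1] * v)
        y = min(utl_quantile(rng.random(), theta), 1.0 - 2.0 ** -53)
        x_star = lam * (-math.log1p(-y)) ** (1.0 / k)         # (iii) TLW lifetime
        c = rng.uniform(0.0, gamma)                           # (iv) censoring time
        x = min(x_star, c)                                    # (v) observed time
        delta = 1 if x_star < c else 0                        # (vi) censoring indicator
        rows.append((x, delta, v))
    return rows

sample = simulate_censored_tlw(500, (0.3, 0.4), (0.2, 0.5), 0.6, gamma=10.0, seed=1)
censored_fraction = 1.0 - sum(d for _, d, _ in sample) / len(sample)
```

Smaller values of γ produce heavier censoring, so γ can be tuned until `censored_fraction` is close to the target percentage.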

5. Data Analysis

In order to demonstrate the superiority of the new distribution over some other models, we use two real datasets originating from different fields. We compare the fits of the TLW model to those of the parent Weibull model (W), the Kumaraswamy–Weibull model (KW) from Cordeiro and Castro (2011) [4], the Weibull–Weibull model (WW) from Alzaatreh et al. [18], the Geometric–Poisson–Weibull model (GPW) from Nadarajah et al. (2013) [19], the Poisson–Weibull model (PW) from Ristic and Nadarajah (2013) [5], the beta-Weibull model (BW) from Eugene et al. (2002) [1], the Marshall–Olkin–Weibull model (MOW) from Marshall and Olkin (1997) [20], and the exponentiated generalized Weibull model (EGW) from Cordeiro et al. (2013) [21]. The cdfs of these models are provided in Appendix B. The parameter estimates are computed by maximizing (17) using the BFGS method available in the AdequacyModel package in the R software [22].

The considered models are compared according to a collection of statistics (AIC, CAIC, BIC, HQIC, minus maximum log-likelihood function (

- ℓ

)) which assess the relative degree of fit of these models to a dataset.

We also performed an application of the TLW regression model considering censored data. We compared different systematic components for the proposed new regression model and the Weibull regression model. In this part we use the RS algorithm in the gamlss package in the R software to maximize the log-likelihood function (20) and we use the AIC and global deviance (GD) statistics to select the most suitable models.

Dataset I: Temperature Dataset

This dataset, reported by Barakat et al. (2014) [23], contains the average July temperatures (°C) for Neuenburg, Switzerland, between 1864 and 1993. The observations are as follows.

19.0	20.1	18.4	17.4	19.7	21.0	21.4	19.2	19.9	20.4	20.9	17.2	20.2
17.8	18.1	15.6	19.4	21.7	16.2	16.4	19.0	20.6	19.0	20.7	15.8	17.7
16.8	17.1	18.1	18.4	18.7	18.7	18.4	19.2	18.0	18.7	20.7	19.4	19.2
17.4	22.0	21.4	19.3	16.8	18.2	16.2	15.9	22.1	17.5	15.3	16.5	17.4
17.0	18.3	18.3	15.3	18.2	21.5	17.0	21.6	18.2	18.1	17.6	18.2	22.6
19.9	17.1	17.2	17.3	19.4	20.1	20.1	17.0	19.4	17.5	16.8	17.0	19.9
18.2	19.2	18.5	20.8	19.5	21.1	15.8	21.3	21.2	18.8	22.3	18.6	16.8
18.2	17.2	18.4	18.7	21.1	16.3	17.4	18.0	19.5	21.2	16.8	17.4	20.7
18.4	19.8	18.7	20.5	18.3	18.2	18.2	19.2	20.2	18.2	17.4	19.2	16.3
17.4	20.3	23.4	19.2	20.2	19.3	19.0	18.8	20.3	19.7	20.7	19.6	18.1

The MLEs and 95% CIs for the model parameters are shown in Table 7. Table 8 provides the competence of the considered models.
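The construction of the confidence intervals is not described here. As a hedged illustration, the sketch below assumes symmetric Wald intervals (estimate plus or minus a standard normal quantile times the standard error); the helper `wald_interval` is ours. With the MLEs and standard errors of Table 7, this assumption reproduces the reported limits for $k$ and $λ$ to about four decimal places.

```python
from statistics import NormalDist


def wald_interval(estimate, std_err, level=0.95):
    """Symmetric Wald interval: estimate +/- z * standard error."""
    z = NormalDist().inv_cdf(0.5 + level / 2.0)
    half_width = z * std_err
    return estimate - half_width, estimate + half_width


# MLEs and standard errors of k and lambda copied from Table 7.
k_low, k_high = wald_interval(3.454494, 0.9554519)
lam_low, lam_high = wald_interval(12.75564, 2.3338198)
print(f"k:      ({k_low:.6f}, {k_high:.6f})")
print(f"lambda: ({lam_low:.6f}, {lam_high:.6f})")
```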

Table 7. Estimates of TLW parameters for dataset I.

Table 8. Competence of the models for dataset I.

According to the adequacy statistics presented in Table 8, the TLW model attains the lowest AIC, CAIC, BIC, HQIC, and minus log-likelihood among all the fitted models. Therefore, it may be a viable option for modeling these data. Figure 3 compares the empirical and fitted distributions of the data, displaying the histogram and fitted pdf, the fitted and empirical cdfs, the P–P plot, and the Q–Q plot, to illustrate graphically the appropriateness of the TLW for modeling these data.

Figure 3. Histogram and fitted pdf, empirical and fitted cdfs, and P–P and Q–Q plots of the TLW model fitted to dataset I.

Dataset II: Breaking Stress of Carbon Fibers

This dataset consists of the breaking stress of 64 single carbon fibers of gauge length 10 mm, reported by Cheng and Traylor (1970) [24]. The observations are as follows.

1.901	2.132	2.203	2.228	2.257	2.35	2.361	2.396	2.397	2.4450	2.454
2.454	2.474	2.518	2.522	2.525	2.532	2.575	2.614	2.616	2.618	2.624
2.659	2.675	2.738	2.74	2.856	2.917	2.928	2.937	2.937	2.977	2.996
3.03	3.125	3.139	3.145	3.22	3.223	3.235	3.243	3.264	3.272	3.294
3.332	3.346	3.377	3.408	3.435	3.493	3.501	3.537	3.554	3.562	3.628
3.852	3.871	3.886	3.971	4.024	4.027	4.225	4.395	5.02

Table 9 displays the MLEs and 95% CIs for the TLW model parameters. According to Table 10, the TLW model attains the lowest AIC, CAIC, BIC, HQIC, and minus log-likelihood among all the fitted models. Therefore, it may be a viable option for modeling these data. Figure 4 compares the empirical and fitted distributions of the data, displaying the histogram and fitted pdf, the fitted and empirical cdfs, the P–P plot, and the Q–Q plot, to illustrate graphically the appropriateness of the TLW for modeling these data.

Table 9. Estimates of TLW parameters for dataset II.

Table 10. Competence of the models for dataset II.

Figure 4. Histogram and fitted pdf, empirical and fitted cdfs, and P–P and Q–Q plots of the TLW model fitted to dataset II.

Dataset III: COVID-19

In this application we consider the regression model for censored data. This dataset refers to patients hospitalized with COVID-19. The disease is caused by the pathogen identified as a new coronavirus, denominated severe acute respiratory syndrome coronavirus-2 (SARS-CoV-2). The epidemiological data were tallied by the Health Information System of the Brazilian government, and are available at https://opendatasus.saude.gov.br/dataset/srag-2020 (accessed on 1 May 2023).

This study involved 195 patients hospitalized in the city of Campinas, state of São Paulo, in May 2020, with infection confirmed by RT-PCR and classified as SARS caused by COVID-19. The survival time consisted of the time in days from the date of first symptoms to the date of evolution of the case, either death (failure) or end of observation (censoring). The censoring percentage was 56.92% and the following variables were considered ($i = 1, \dots, 195$):

$x_{i}$ : observed time (in days);
${cens}_{i}$ : censoring indicator ( $0 =$ censored, $1 =$ observed lifetime);
$v_{i 1}$ : sex ( $1 =$ male, $0 =$ female);
$v_{i 2}$ : age (in years).

There were 110 male patients (56.41%), of whom 42 (38.18%) died, while of the 85 women (43.59%), there were 42 deaths (49.41%). Figure 5a presents the Kaplan–Meier survival curve broken down by sex. It can be seen that men had a higher risk of death. Figure 5b depicts the histogram of the ages, where the greatest frequency was in the category from 50 to 75 years old.
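For readers unfamiliar with the Kaplan–Meier estimator used in Figure 5a, the following minimal sketch shows the product-limit computation for right-censored data, using the same coding of the censoring indicator as above ($0 =$ censored, $1 =$ observed lifetime). The function `kaplan_meier` and the toy sample are ours and purely hypothetical; they are not taken from the COVID-19 dataset.

```python
def kaplan_meier(times, events):
    """Product-limit estimate: list of (t, S(t)) at each distinct failure time.

    events[i] is 1 for an observed lifetime and 0 for a censored one.
    """
    pairs = sorted(zip(times, events))
    at_risk = len(pairs)
    surv = 1.0
    curve = []
    i = 0
    while i < len(pairs):
        t = pairs[i][0]
        deaths = 0
        removed = 0
        # Collect every subject whose observed time equals t.
        while i < len(pairs) and pairs[i][0] == t:
            deaths += pairs[i][1]
            removed += 1
            i += 1
        if deaths > 0:
            surv *= 1.0 - deaths / at_risk
            curve.append((t, surv))
        # Failures and censored subjects both leave the risk set after t.
        at_risk -= removed
    return curve


# Hypothetical toy sample (days, indicator).
toy_times = [2, 3, 3, 5, 8, 8, 9]
toy_events = [1, 1, 0, 1, 0, 1, 0]
toy_curve = kaplan_meier(toy_times, toy_events)
for t, s in toy_curve:
    print(t, round(s, 4))
```

Applying such a function separately to the male and female subsamples would give the two step functions compared in a plot like Figure 5a.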

Figure 5. (a) Kaplan–Meier survival curve for the sex variable ($1 =$ male, $0 =$ female); (b) histogram of the age variable.

We compared the TLW regression model with the Weibull regression model based on the following systematic components:

$\text{Systematic} = \begin{cases} M_{0} : \log (λ_{i}) = β_{10} \text{ and } \log (k_{i}) = β_{20}; \\ M_{1} : \log (λ_{i}) = β_{10} + β_{11} v_{i 1} + β_{12} v_{i 2} \text{ and } \log (k_{i}) = β_{20}; \\ M_{2} : \log (λ_{i}) = β_{10} \text{ and } \log (k_{i}) = β_{20} + β_{21} v_{i 1} + β_{22} v_{i 2}; \\ M_{3} : \log (λ_{i}) = β_{10} + β_{11} v_{i 1} + β_{12} v_{i 2} \text{ and } \log (k_{i}) = β_{20} + β_{21} v_{i 1} + β_{22} v_{i 2} . \end{cases}$

Table 11 reports the values of the selection criteria of the models, in which the $M_{3}$-TLW model attained the lowest AIC and GD values. We also compared this model with the $M_{3}$-Weibull model by means of the residuals in Figure 6. Figure 6a,c illustrate the residuals versus the index of the observations, showing that both models have residuals with random behavior around zero, and no point is outside the interval $(- 3, 3)$. Nevertheless, Figure 6b,d indicate that the TLW model behaved better, with all the points within the simulated envelope. Finally, we illustrate the Kaplan–Meier curves and estimated survival curves in Figure 7 for the TLW model, showing that this model is able to capture the non-proportional curves of this dataset. The results of this model are shown in Table 12. Some conclusions can be obtained as follows.

Table 11. AIC and GD values for TLW and Weibull regression models with different structures for COVID-19 data.

Figure 6. Index plot and normal probability plot with envelope of the deviance residuals from the regression models fitted to the COVID-19 data. (a,b): $M_{3}$-TLW; (c,d): $M_{3}$-Weibull.

Figure 7. Kaplan–Meier survival curve and estimated survival functions from the $M_{3}$-TLW by sex.

Table 12. MLEs, SEs, and p-values for the $M_{3}$-TLW regression fitted to COVID-19 data.

Interpretations for $λ$:

A significant difference exists between men and women in relation to survival time (men have shorter survival). Various other studies have also indicated significant differences between the sexes (see [25,26]);
The survival time declines with advancing age. This result corroborates the findings of several studies that have indicated that older age is a predictor of higher mortality caused by COVID-19 (see [27,28,29]).

Interpretations for k:

A significant difference exists between men and women with regard to the variability in the survival time;
In relation to age, the variability in survival time increased with older age of the patients.
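To make these interpretations concrete, the sketch below evaluates the two systematic components of the $M_{3}$ structure at the point estimates of Table 12. The function `fitted_parameters` is ours, and the 60-year-old patient is a hypothetical covariate profile chosen only for illustration.

```python
import math

# Point estimates copied from Table 12 (M3-TLW fit to the COVID-19 data).
BETA_LAMBDA = (7.8325, -0.419, -0.0467)  # beta_10, beta_11 (sex), beta_12 (age)
BETA_K = (-0.4240, 0.4605, 0.0096)       # beta_20, beta_21 (sex), beta_22 (age)


def fitted_parameters(sex, age):
    """Return (lambda_i, k_i) implied by the log-linear components of M3."""
    log_lambda = BETA_LAMBDA[0] + BETA_LAMBDA[1] * sex + BETA_LAMBDA[2] * age
    log_k = BETA_K[0] + BETA_K[1] * sex + BETA_K[2] * age
    return math.exp(log_lambda), math.exp(log_k)


# Hypothetical patients: a man and a woman, both aged 60.
lam_m, k_m = fitted_parameters(sex=1, age=60)
lam_f, k_f = fitted_parameters(sex=0, age=60)
print("lambda ratio (male / female):", round(lam_m / lam_f, 4))
print("k ratio (male / female):", round(k_m / k_f, 4))
```

Because both components use a log link, each coefficient acts multiplicatively: at any fixed age the male-to-female ratio of $λ_{i}$ equals $e^{β_{11}}$ and that of $k_{i}$ equals $e^{β_{21}}$.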

Dataset IV: Post-harvested

In this application, we consider the regression model for uncensored data. These data refer to the Musa acuminata banana species from a banana plantation in the Philippines. A total of $n = 194$ banana tiers were chosen randomly, and the numerical values of the RGB colors (red, green, and blue) were obtained from images taken by hardware for four banana classes (extra class, class I, class II, and reject), which contain 65, 49, 30, and 50 samples, respectively. The dataset is available in the repository https://data.mendeley.com/datasets/zk3tkxndjw/2 (accessed on 20 May 2023), and more details can be found in [30]. Each banana tier sample was captured against a white background in six different views: front, back, left, right, top, and bottom. Here, we consider the values of B in the front view. Figure 8 displays a boxplot by class, in which differences between the colors according to the class can be observed.

Figure 8. Boxplot of colors by class for the Post-harvested dataset.

The variables considered are ($i = 1, \dots, 194$):

$x_{i}$ : color value;
$v_{i j}$ : banana class (factor with four levels, defined by three dummy variables, $j = 1, 2, 3$ ).

We verified the relationship between colors and classes from the TLW and Weibull models according to the following systematic components:

$\text{Systematic} = \begin{cases} M_{0} : \log (λ_{i}) = β_{10} \text{ and } \log (k_{i}) = β_{20}; \\ M_{1} : \log (λ_{i}) = β_{10} + \sum_{j = 1}^{3} β_{1 j} v_{i j} \text{ and } \log (k_{i}) = β_{20}; \\ M_{2} : \log (λ_{i}) = β_{10} \text{ and } \log (k_{i}) = β_{20} + \sum_{j = 1}^{3} β_{2 j} v_{i j}; \\ M_{3} : \log (λ_{i}) = β_{10} + \sum_{j = 1}^{3} β_{1 j} v_{i j} \text{ and } \log (k_{i}) = β_{20} + \sum_{j = 1}^{3} β_{2 j} v_{i j} . \end{cases}$
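As an illustration of how a four-level factor enters these components, the sketch below builds the three dummy variables with the reject class as the reference level, which is the baseline used when the fitted model is discussed. The ordering of the remaining levels, the helper names, and the coefficient values in the example are hypothetical.

```python
# Assumption: "reject" is the reference level; the order of the other levels is ours.
REFERENCE = "reject"
LEVELS = ("class I", "class II", "extra class")


def dummy_code(banana_class):
    """Map a class label to the dummy vector (v_i1, v_i2, v_i3)."""
    if banana_class != REFERENCE and banana_class not in LEVELS:
        raise ValueError(f"unknown class: {banana_class!r}")
    return tuple(1 if banana_class == level else 0 for level in LEVELS)


def linear_predictor(intercept, slopes, banana_class):
    """Intercept plus the sum over j of slope_j * v_ij."""
    dummies = dummy_code(banana_class)
    return intercept + sum(b * v for b, v in zip(slopes, dummies))


# Hypothetical coefficients, used only to show the mechanics.
example = linear_predictor(1.0, (0.5, -0.2, 0.1), "class I")
print(dummy_code("reject"), dummy_code("class II"), example)
```

With this coding, each slope measures the shift of $\log (λ_{i})$ or $\log (k_{i})$ for one class relative to the reference class.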

Table 13 displays the AIC and GD values for these fitted models; the $M_{3}$-TLW model obtained the lowest values and can therefore be chosen as the best model. In addition, we compare the $M_{3}$-TLW and the $M_{3}$-Weibull models through the quantile residuals (Figure 9). These plots agree with the results of Table 13: for the Weibull model, a high percentage of points lie outside the confidence band of the normal probability plot (Figure 9e), and many points also deviate from the confidence band of the worm plot (Figure 9f).

Table 13. AIC and GD values for TLW and Weibull regression models with different structures for the Post-harvested dataset.

Figure 9. Index plot, normal probability plot with envelope, and worm plot of the quantile residuals from the regression models fitted to the Post-harvested dataset: (a–c): $M_{3}$-TLW; (d–f): $M_{3}$-Weibull.

Finally, Table 14 presents the MLEs, SEs, and p-values of the $M_{3}$-TLW model, in which classes I, II, and extra are compared with the reject class. We can draw the following conclusions: there is a significant difference between the color of class I and the rejects, and its effect is positive, that is, class I presented higher color values. Class II and the extra class do not present a significant difference from the reject class. The colors of the extra class and class I affect the shape of the distribution compared with the color of the reject class.

Table 14. MLEs, SEs, and p-values for the $M_{3}$-TLW regression fitted to the Post-harvested dataset.

6. Conclusions

In this study, we propose a new class of distributions called the truncated Lindley-G (TLG) distribution, with application to the three-parameter truncated Lindley–Weibull (TLW) distribution. Several structural properties of the TLG distribution, including an expansion of the density function, critical points, explicit expressions of the ordinary and incomplete moments, mean deviation, generating function, entropy, and quantile function, are discussed. The parameters of the model are estimated using the maximum likelihood technique. We fitted the TLW model to two sets of data to demonstrate the effectiveness of the proposed distribution. In comparison to the Kumaraswamy–Weibull, Weibull–Weibull, Geometric–Poisson–Weibull, Poisson–Weibull, beta-Weibull, Marshall–Olkin–Weibull, and exponentiated generalized Weibull distributions, the proposed model provided a better fit on both datasets. However, the goodness-of-fit measures for our model were not drastically better than those of the comparison models that are currently used in statistical analyses. Based on this new distribution, we propose a TLW regression model with two systematic components, which is suitable for modeling censored and uncensored data and is illustrated with two further datasets. Several simulation studies are performed for different parameter settings, sample sizes, and censoring percentages. We anticipate the further application of the proposed model in disciplines such as engineering, survival and lifetime data, and economics.

Author Contributions

Conceptualization, M.H., G.M.R., E.M.M.O., R.V. and H.E.; methodology, M.H., G.M.R., E.M.M.O., R.V. and H.E; software, M.H., G.M.R., E.M.M.O., R.V. and H.E.; investigation, M.H., G.M.R., E.M.M.O., R.V. and H.E.; writing—original draft preparation, M.H. and H.E.; writing—review and editing, M.H. and E.M.M.O. All authors have read and agreed to the published version of the manuscript.

Funding

This research received funding from Deanship of Scientific Research at King Khalid University through General Research Project under grant number GRP/206/44, Coordenação de Aperfeiçoamento de Pessoal de Nível Superior–Brasil (CAPES) and Conselho Nacional de Desenvolvimento Científico e Tecnológico—Brasil (CNPq).

Institutional Review Board Statement

Not applicable.

Data Availability Statement

Stated in the text.

Acknowledgments

The authors extend their appreciation to the Deanship of Scientific Research at King Khalid University for funding this work through a General Research Project under grant number GRP/206/44. Also, this study was financed in part by the Coordenação de Aperfeiçoamento de Pessoal de Nível Superior, Brasil (CAPES) and Conselho Nacional de Desenvolvimento Científico e Tecnológico, Brasil (CNPq).

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A

The UTL model has the following properties:

(1): Moments
The UTL distribution’s kth raw moment ( $k = 1, 2, \dots$ ) is given by

$μ_{k}^{'} = θ^{2} C_{θ} [- \frac{e^{- θ}}{θ} + (1 + \frac{k + 1}{θ}) d_{k}],$

where $d_{k} = \int_{0}^{1} x^{k} e^{- θ x} d x$ . Using integration by parts, $d_{k}$ can be calculated recursively by

$d_{k} = \frac{1}{θ} (k d_{k - 1} - e^{- θ}), k = 1, 2, \dots,$

and

$d_{0} = \frac{1}{θ} (1 - e^{- θ}) .$

The first three moments are

$\begin{matrix} μ_{1}^{'} & = & \frac{(2 + θ - 2 e^{- θ} - 3 θ e^{- θ} - 2 θ^{2} e^{- θ}) C_{θ}}{θ}, \\ μ_{2}^{'} & = & \frac{(6 + 2 θ - 6 e^{- θ} - 8 θ e^{- θ} - 5 θ^{2} e^{- θ} - 2 θ^{3} e^{- θ}) C_{θ}}{θ^{2}}, and \\ μ_{3}^{'} & = & \frac{(24 + 6 θ - 24 e^{- θ} - 30 θ e^{- θ} - 18 θ^{2} e^{- θ} - 7 θ^{3} e^{- θ} - 2 θ^{4} e^{- θ}) C_{θ}}{θ^{3}} . \end{matrix}$

The kth incomplete moment of X is given by

$\begin{matrix} I_{X} (t; k) & = & E (X^{k} 1_{\{X \leq t\}}) = \int_{0}^{t} x^{k} f_{U T L} (x) d x = θ^{2} C_{θ} [- \frac{t^{k + 1} e^{- θ t}}{θ} + (1 + \frac{k + 1}{θ}) d_{t, k}], \end{matrix}$

where $d_{t, k} = \int_{0}^{t} x^{k} e^{- θ x} d x$ . Using integration by parts, $d_{t, k}$ can be calculated recursively by

$d_{t, k} = \frac{1}{θ} (k d_{t, k - 1} - t^{k} e^{- θ t}), k = 1, 2, \dots,$

and

$d_{t, 0} = \frac{1}{θ} (1 - e^{- θ t}) .$
(2): Mode
The mode of the UTL distribution is

$\text{Mode} = \begin{cases} \frac{1 - θ}{θ} & \text{if } 0.5 \leq θ \leq 1, \\ 0 & \text{if } θ > 1, \\ 1 & \text{if } θ < 0.5 . \end{cases}$
(3): Quantile Function
The UTL distribution’s qf is

$Q_{U T L} (u) = - 1 - \frac{1}{θ} - \frac{1}{θ} W ((u C_{θ}^{- 1} - θ - 1) e^{- θ - 1}), u \in (0, 1),$

where $W (x)$ is the Lambert function satisfying $W (x) e^{W (x)} = x$ for $x \in [- 1 / e, \infty)$ (see Corless et al. [31] for the definition and properties of the Lambert function).
Therefore, the median of the UTL distribution is simply $M = Q_{U T L} (0.5)$ , that is,

$M = - 1 - \frac{1}{θ} - \frac{1}{θ} W ((0.5 C_{θ}^{- 1} - θ - 1) e^{- θ - 1}) .$
(4): Mean Deviations
The UTL distribution’s mean deviation about the mean $μ = E (X)$ is given by

$\begin{matrix} δ_{1} & = & \int_{0}^{1} | x - μ | f_{U T L} (x) d x \\ & = & \int_{0}^{μ} (μ - x) f_{U T L} (x) d x + \int_{μ}^{1} (x - μ) f_{U T L} (x) d x \\ & = & 2 μ F_{U T L} (μ) - 2 \int_{0}^{μ} x f_{U T L} (x) d x = 2 [μ F_{U T L} (μ) - I_{X} (μ; 1)] \end{matrix}$

and the mean deviation about the median M is

$δ_{2} = \int_{0}^{1} | x - M | f_{U T L} (x) d x = μ - 2 I_{X} (M; 1),$

where $I_{X} (t; k)$ is the kth incomplete moment.
(5): Moment Generating Function
The UTL distribution’s moment generating function (mgf) can be expressed as

$M (t) = \int_{0}^{1} e^{t x} f_{U T L} (x) d x = θ^{2} C_{θ} [\frac{2 t e^{- (θ - t)} - 2 θ e^{- (θ - t)} - e^{- (θ - t)} - t + θ + 1}{t^{2} - 2 θ t + θ^{2}}] .$
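The closed forms in this appendix can be checked numerically. The sketch below is a minimal illustration under stated assumptions: the density is taken as $f_{U T L} (x) = θ^{2} C_{θ} (1 + x) e^{- θ x}$ on $(0, 1)$, as implied by the moment integrals, with $C_{θ}$ obtained from the requirement that the moment of order $k = 0$ equals one; the recursion used for $d_{t, k}$ is $d_{t, k} = (k d_{t, k - 1} - t^{k} e^{- θ t}) / θ$, which follows from integration by parts; only $θ > 0$ is considered, in which case the quantile formula requires the lower branch $W_{- 1}$ of the Lambert function. All function names are ours.

```python
import math


def c_theta(theta):
    # Assumption: constant that makes the k = 0 moment equal to one.
    return 1.0 / (1.0 + theta - (1.0 + 2.0 * theta) * math.exp(-theta))


def pdf(x, theta):
    # Assumption: UTL density implied by the moment integrals of this appendix.
    return theta**2 * c_theta(theta) * (1.0 + x) * math.exp(-theta * x)


def cdf(x, theta):
    # Integral of pdf over (0, x).
    return c_theta(theta) * (1.0 + theta - (1.0 + theta + theta * x) * math.exp(-theta * x))


def simpson(func, a, b, n=2000):
    """Composite Simpson rule with an even number n of sub-intervals."""
    h = (b - a) / n
    total = func(a) + func(b)
    for i in range(1, n):
        total += (4.0 if i % 2 else 2.0) * func(a + i * h)
    return total * h / 3.0


def d(t, k, theta):
    """Integral of x^k exp(-theta x) over (0, t), by recursion on k."""
    value = (1.0 - math.exp(-theta * t)) / theta  # d_{t,0}
    for j in range(1, k + 1):
        value = (j * value - t**j * math.exp(-theta * t)) / theta
    return value


def incomplete_moment(t, k, theta):
    """Integral of x^k f_UTL(x) over (0, t); t = 1 gives the k-th raw moment."""
    bracket = (-t ** (k + 1) * math.exp(-theta * t) / theta
               + (1.0 + (k + 1) / theta) * d(t, k, theta))
    return theta**2 * c_theta(theta) * bracket


def mgf(t, theta):
    """Closed-form moment generating function, for t different from theta."""
    a = theta - t
    num = (2.0 * t - 2.0 * theta - 1.0) * math.exp(-a) - t + theta + 1.0
    return theta**2 * c_theta(theta) * num / a**2


def lambert_w_lower(z):
    """Branch W_{-1} for -1/e < z < 0, found by bisection on w * exp(w) = z."""
    lo, hi = -700.0, -1.0  # w * exp(w) decreases from about 0 to -1/e here
    for _ in range(200):
        mid = 0.5 * (lo + hi)
        if mid * math.exp(mid) > z:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


def quantile(u, theta):
    """Q_UTL(u) from the Lambert-function formula (theta > 0 assumed)."""
    z = (u / c_theta(theta) - theta - 1.0) * math.exp(-theta - 1.0)
    return -1.0 - 1.0 / theta - lambert_w_lower(z) / theta


theta = 1.5  # arbitrary illustrative value
closed_mean = (2 + theta - (2 + 3 * theta + 2 * theta**2) * math.exp(-theta)) * c_theta(theta) / theta
numeric_mean = simpson(lambda x: x * pdf(x, theta), 0.0, 1.0)
recursive_mean = incomplete_moment(1.0, 1, theta)
numeric_partial = simpson(lambda x: x**2 * pdf(x, theta), 0.0, 0.4)
recursive_partial = incomplete_moment(0.4, 2, theta)
numeric_mgf = simpson(lambda x: math.exp(0.7 * x) * pdf(x, theta), 0.0, 1.0)
median = quantile(0.5, theta)
print(closed_mean, numeric_mean, recursive_mean)
print(recursive_partial, numeric_partial)
print(mgf(0.7, theta), numeric_mgf)
print(median, cdf(median, theta))
```

The bisection solver is deliberately simple; a library implementation of the Lambert function could be substituted, provided the appropriate branch is selected.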

Appendix B

-: The cdf of the Kumaraswamy-G model is given by

$F (x) = 1 - {[1 - G^{a} (x)]}^{b}, a, b > 0$
-: The cdf of the Weibull-G model is given by

$F (x) = 1 - exp \{- {[- \frac{log (1 - G (x))}{b}]}^{a}\}, a, b > 0$
-: The cdf of the Geometric-Poisson-G model is given by

$F (x) = \frac{exp [- a + a G (x)] - exp (- a)}{1 - exp (- a) - b + b exp [- a + a G (x)]}, a > 0, 0 < b < 1$
-: The cdf of the Poisson-G model is given by

$F (x) = \frac{1 - exp [- a G^{b} (x)]}{1 - exp (- a)}, a, b > 0$
-: The cdf of the Beta-G model is given by

$F (x) = I_{G (x)} (a, b)$

where $I_{x} (a, b) = \int_{0}^{x} t^{a - 1} {(1 - t)}^{b - 1} d t / B (a, b)$ is the regularized incomplete beta function, and $B (a, b) = \int_{0}^{1} t^{a - 1} {(1 - t)}^{b - 1} d t$ is the beta function.
-: The cdf of the Marshall–Olkin-G model is given by

$F (x) = \frac{G (x)}{a + (1 - a) G (x)}, a > 0$
-: The cdf of the exponentiated generalized-G model is given by

$F (x) = {[1 - {[1 - G (x)]}^{a}]}^{b}, a, b > 0$
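Each generator above transforms a parent cdf value $G (x) \in [0, 1]$ into a new cdf value. The sketch below illustrates this composition for four of the listed families. The two-parameter Weibull parent $G (x) = 1 - e^{- {(x / λ)}^{k}}$ is an assumed parameterization, and the function names and parameter values are ours.

```python
import math


def weibull_cdf(x, shape, scale):
    # Assumption: parent cdf G(x) = 1 - exp(-(x / scale)^shape) for x > 0.
    return 1.0 - math.exp(-((x / scale) ** shape)) if x > 0 else 0.0


def kumaraswamy_g(g, a, b):
    return 1.0 - (1.0 - g**a) ** b


def poisson_g(g, a, b):
    return (1.0 - math.exp(-a * g**b)) / (1.0 - math.exp(-a))


def marshall_olkin_g(g, a):
    return g / (a + (1.0 - a) * g)


def exponentiated_generalized_g(g, a, b):
    return (1.0 - (1.0 - g) ** a) ** b


# Hypothetical parameter values, chosen only to show the composition.
g = weibull_cdf(2.5, shape=3.0, scale=3.0)
kw = kumaraswamy_g(g, a=2.0, b=0.5)
pw = poisson_g(g, a=1.5, b=2.0)
mow = marshall_olkin_g(g, a=0.7)
egw = exponentiated_generalized_g(g, a=2.0, b=0.5)
print(g, kw, pw, mow, egw)
```

Each generator maps $0$ to $0$ and $1$ to $1$, and the Kumaraswamy-G, Marshall–Olkin-G, and exponentiated generalized-G families return the parent cdf when their additional parameters equal one.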

References

Eugene, N.; Lee, C.; Famoye, F. Beta-Normal Distribution and Its Applications. Commun. Stat.-Theory Methods 2002, 31, 497–512.
Alexander, C.; Cordeiro, G.M.; Ortega, E.M.M.; Sarabia, J.M. Generalized Beta-Generated Distributions. Comput. Stat. Data Anal. 2012, 56, 1880–1897.
Nadarajah, S.; Teimouri, M.; Shih, S.H. Modified Beta Distributions. Sankhya B 2014, 76, 19–48.
Cordeiro, G.M.; de Castro, M. A New Family of Generalized Distributions. J. Stat. Comput. Simul. 2011, 81, 883–898.
Ristic, M.M.; Nadarajah, S. A New Lifetime Distribution. J. Stat. Comput. Simul. 2013, 84, 135–150.
Nadarajah, S.; Nassiri, V.; Mohammadpour, A. Truncated-Exponential Skew-Symmetric Distributions. Stat. A J. Theor. Appl. Stat. 2014, 48, 872–895.
Abid, A.H.; Abdulrazak, R.K. [0,1] Truncated Frèchet-G Generator of Distributions. Appl. Math. 2017, 7, 51–66.
Bantan, R.A.; Jamal, F.; Chesneau, C.; Elgarhy, M. Truncated Inverted Kumaraswamy Generated Family of Distributions with Applications. Entropy 2019, 21, 1089.
Aldahlan, M.A. Type II Truncated Fréchet Generated Family of Distributions. Int. J. Math. Its Appl. 2019, 7, 221–228. Available online: https://ijmaa.in/index.php/ijmaa/article/view/285 (accessed on 15 September 2022).
Almarashi, A.M.; Elgarhy, M.; Jamal, F.; Chesneau, C. The Exponentiated Truncated Inverse Weibull Generated Family of Distributions with Applications. Symmetry 2020, 12, 650.
Jamal, F.; Bakouch, H.; Nasir, M.A. A Truncated General-G class of Distributions with Application to Truncated Burr G family. REVSTAT-Stat. J. 2021, 19, 513–530.
Almarashi, A.M.; Jamal, F.; Chesneau, C.; Elgarhy, M. A New Truncated Muth Generated Family of Distributions with Applications. Complexity 2021, 21, 1–4.
ZeinEldin, R.A.; Chesneau, C.; Jamal, F.; Elgarhy, M.; Almarashi, A.M.; Al-Marzouki, S. Generalized Truncated Fréchet Generated Family Distributions and their Applications. Comput. Model. Eng. Sci. 2021, 126, 791–819.
Algarni, A.; Almarashi, A.M.; Jamal, F.; Chesneau, C.; Elgarhy, M. Truncated Inverse Lomax Generated Family of Distributions with Applications to Biomedical Data. J. Med. Imaging Health Inform. 2021, 11, 2425–2439.
Bantan, R.A.; Chesneau, C.; Jamal, F.; Elbatal, I.; Elgarhy, M. The Truncated Burr X-G Family of Distributions: Properties and Applications to Actuarial and Financial Data. Entropy 2021, 23, 1088.
Lindley, D.V. Fiducial Distributions and Bayes’ Theorem. J. R. Stat. Soc. 1958, 20, 102–107. Available online: https://www.jstor.org/stable/2983909 (accessed on 12 August 2020).
AL-Hussaini, E.K.; Ahsanullah, M. Exponentiated Distributions “Part of the book series: Atlantis Studies in Probability and Statistics”; ATLANTISSPS; Atlantis Press: Paris, France, 2015.
Alzaatreh, A.; Lee, C.; Famoye, F. A New Method for Generating Families of Continuous Distributions. Metron 2013, 71, 63–79.
Nadarajah, S.; Cancho, V.G.; Ortega, E.M.M. The Geometric Exponential Poisson Distribution. Stat. Methods Appl. 2013, 22, 355–380.
Marshall, A.W.; Olkin, I. A New Method for Adding a Parameter to a Family of Distributions with Application to the Exponential and Weibull Families. Biometrika 1997, 84, 641–652. Available online: https://www.jstor.org/stable/2337585 (accessed on 3 April 2010).
Cordeiro, G.M.; Ortega, E.M.M.; da Cunha, D.C.C. The Exponentiated Generalized Class of Distributions. J. Data Sci. 2013, 11, 1–27.
Team RC. R: A language and Environment for Statistical Computing; R Foundation for Statistical Computing, Vienna, Austria. 2022. Available online: https://www.r-project.org/ (accessed on 24 June 2022).
Barakat, H.; Nigm, E.; Aldallal, R. Exact Prediction Intervals for Future Current Records and Record Range from any Continuous Distribution. SORT-Stat. Oper. Res. Trans. 2014, 38, 251–270. Available online: https://raco.cat/index.php/SORT/article/view/284044 (accessed on 5 March 2017).
Cheng, R.C.; Traylor, L. Characterization of Material Strength Properties Using Probabilistic Mixture Models. WIT Trans. Model. Simul. 1970, 31, 553–560.
Albitar, O.; Ballouze, R.; Ooi, J.P.; Ghadzi, S.M.S. Risk factors for mortality among COVID-19 patients. Diabetes Res. Clin. Pract. 2020, 166, 1–5.
Liu, Y.; Du, X.; Chen, J.; Jin, Y.; Peng, L.; Wang, H.H.; Zhao, Y. Neutrophil-to-lymphocyte ratio as an independent risk factor for mortality in hospitalized patients with COVID-19. J. Infect. 2020, 81, 6–12.
Giacomelli, A.; Ridolfo, A.L.; Milazzo, L.; Oreni, L.; Bernacchia, D.; Siano, M.; Galli, M. 30-day mortality in patients hospitalized with COVID-19 during the first wave of the Italian epidemic: A prospective cohort study. Pharmacol. Res. 2020, 158, 104931.
Atlam, M.; Torkey, H.; El-Fishawy, N.; Salem, H. Coronavirus disease 2019 (COVID-19): Survival analysis using deep learning and Cox regression model. Pattern Anal. Appl. 2021, 24, 993–1005.
Rodrigues, G.M.; Ortega, E.M.; Cordeiro, G.M.; Vila, R. An extended Weibull regression for censored data: Application for COVID-19 in campinas, Brazil. Mathematics 2022, 10, 3644.
Piedad, E.; Caladcad, J.A. Post-harvested Musa acuminata Banana Tiers Dataset. Data Brief 2023, 46, 108856.
Corless, R.M.; Gonnet, G.H.; Hare, D.E.G.; Jeffrey, D.J.; Knuth, D.E. On the Lambert W function. Adv. Comput. Math. 1996, 5, 329–359.

Figure 1. The pdf of the TLW model.

Figure 2. Regions of the Cartesian plane $θ τ^{*}$ where different forms of the TLW pdf occur.

Table 1. Previous work on TG models.

Model	Author(s)	cdf
Poisson-G	Ristic and Nadarajah (2013) [5]	$\frac{1 - e^{- a G^{b} (x)}}{1 - e^{- a}}$
Truncated-exponential skew-symmetric-G	Nadarajah et al. (2014a) [6]	$\frac{1 - e^{- a G (x)}}{1 - e^{- a}}$
Truncated-Fréchet-G	Abid and Abdulrazak (2017) [7]	$e^{a [1 - G {(x)}^{- b}]}$
Truncated inverted Kumaraswamy-G	Bantan et al. (2019) [8]	$\frac{{[1 - {(1 + G (x))}^{- a}]}^{b}}{{(1 - 2^{- a})}^{b}}$
Type II truncated Fréchet-G (truncated inverse Weibull-G)	Aldahlan et al. (2019) [9]	$1 - e^{1 - {(1 - G (x))}^{- a}}$
Exponentiated truncated inverse Weibull-G	Almarashi et al. (2020) [10]	${[1 - e^{1 - {(1 - G (x))}^{- a}}]}^{b}$
Truncated Burr-G	Jamal et al. (2020) [11]	$\frac{1 - {[1 + G^{c} (x)]}^{- k}}{1 - 2^{- k}}$
Truncated Muth-G	Almarashi et al. (2021) [12]	$\frac{1 - e^{[α G (x) - (e^{α G (x)} - 1) / α]}}{1 - e^{[α - (e^{α} - 1) / α]}}$
Truncated generalized Fréchet-G	ZeinEldin et al. (2021) [13]	$\frac{1 - {[1 - e^{- α / G (x)}]}^{b}}{1 - {(1 - e^{- α})}^{b}}$
Truncated inverse Lomax-G	Algarni et al. (2021) [14]	$1 - 2^{α} {[1 + {(1 - G (x))}^{- 1}]}^{- α}$
Truncated Burr X-G	Bantan et al. (2021) [15]	$\frac{{(1 - e^{- α^{2} G^{2} (x)})}^{θ}}{{(1 - e^{- α^{2}})}^{θ}}$

Table 2. Number of roots of equation $A (z_{0}) = B_{θ, k} (z_{0})$ in (16) when varying the parameters $θ$ and $τ^{*}$.

$θ$ \ $τ^{*}$	≤ $- \frac{1}{e}$	$> - \frac{1}{e}$ ∧ $< 0$	≥0
>0	single root	single root	no root
<0	no root	two roots	single root

Table 3. Shapes of TLW pdf when varying the parameters $θ$ and $τ^{*}$.

$θ$ \ $τ^{*}$	≤ $- \frac{1}{e}$	$> - \frac{1}{e}$ ∧ $< 0$	≥0
>0	Unimodality	Unimodality	Decreasing
<0	Decreasing	Decreasing–increasing–decreasing	Unimodality

Table 4. Average estimates from simulations of the TLW distribution.

Parameters				ME
$θ$	$k$	$λ$	$n$	$\hat{θ}$	$\hat{k}$	$\hat{λ}$
0.5	0.5	0.5	20	0.3927	0.5982	0.5915
			50	0.3959	0.5104	0.5143
			100	0.5818	0.5086	0.4582
			150	0.5782	0.5078	0.4737
			300	0.5052	0.5003	0.5012
0.5	2	2	20	0.3821	2.1788	2.4017
			50	0.3828	2.1724	2.3466
			100	0.5195	2.1554	2.1472
			150	0.4945	2.1159	2.0617
			300	0.4984	2.0195	2.0324
2	2	0.5	20	2.8780	2.8245	0.6463
			50	2.6245	2.2245	0.4403
			100	2.1419	2.0419	0.5388
			150	2.0545	1.9545	0.5036
			300	2.0044	1.9994	0.5004
3	0.5	3	20	2.4702	0.6231	2.2937
			50	2.6202	0.3798	3.2610
			100	2.8369	0.4631	3.2424
			150	2.8535	0.5146	3.1024
			300	2.9823	0.5018	3.0635
2	5	2	20	2.7795	5.9724	1.6501
			50	2.3405	5.4046	2.2949
			100	1.8551	5.1855	1.7808
			150	2.0733	5.0733	2.0985
			300	1.9930	4.9790	2.0104
5	3	3	20	6.1274	3.2987	2.6354
			50	5.2781	3.1288	2.6674
			100	4.8956	3.1146	2.8674
			150	4.9895	3.0985	2.9631
			300	5.0013	3.0043	2.9985
5	4	2	20	4.4533	4.5847	2.8655
			50	5.2474	3.8812	2.4652
			100	4.9521	3.8932	2.4245
			150	5.1124	3.9958	2.1135
			300	4.9821	4.0024	2.0075

Table 5. MABs and MSEs from simulations of the TLW distribution.

Parameters				MAB			MSE
$θ$	$k$	$λ$	$n$	$\hat{θ}$	$\hat{k}$	$\hat{λ}$	$\hat{θ}$	$\hat{k}$	$\hat{λ}$
0.5	0.5	0.5	20	0.1073	0.0982	0.0915	0.4927	0.3251	0.2520
			50	0.1041	0.0104	0.0143	0.1538	0.1607	0.2497
			100	0.0818	0.0086	0.0418	0.1353	0.0656	0.1960
			150	0.0782	0.0078	0.0263	0.0230	0.0421	0.0540
			300	0.0052	0.0003	0.0012	0.0110	0.0215	0.0301
0.5	2	2	20	0.1179	0.1788	0.4018	0.3210	0.4573	0.4200
			50	0.1172	0.1724	0.3466	0.1420	0.2923	0.2584
			100	0.0195	0.1554	0.1472	0.0732	0.0832	0.1453
			150	0.0055	0.1159	0.0617	0.0612	0.0549	0.1087
			300	0.0016	0.0195	0.0324	0.0139	0.0490	0.0359
2	2	0.5	20	0.8780	0.8245	0.1463	0.7810	0.5427	0.7147
			50	0.6245	0.2245	0.0597	0.6531	0.4417	0.6984
			100	0.1419	0.0490	0.0388	0.1456	0.2542	0.1825
			150	0.0545	0.0455	0.0036	0.0574	0.0088	0.0821
			300	0.0044	0.0006	0.0004	0.0035	0.0015	0.0674
3	0.5	3	20	0.5298	0.1231	0.7063	1.0745	0.8945	0.7984
			50	0.3798	0.1202	0.2610	0.6870	0.3017	0.5203
			100	0.1631	0.0369	0.2424	0.2153	0.1465	0.2257
			150	0.1465	0.0146	0.1024	0.1040	0.0896	0.0357
			300	0.0177	0.0018	0.0635	0.0862	0.0651	0.0089
2	5	2	20	0.7795	0.9724	0.3499	0.8691	1.2143	1.1401
			50	0.3405	0.4046	0.2949	0.4041	0.9674	0.5189
			100	0.1449	0.1855	0.2192	0.3540	0.6307	0.5021
			150	0.0733	0.0733	0.0985	0.0957	0.0390	0.1008
			300	0.0070	0.0210	0.0104	0.0068	0.0107	0.0096
5	3	3	20	1.1274	0.2987	0.3646	1.8752	2.0145	1.4571
			50	0.2781	0.1288	0.3326	1.0587	1.5124	0.6501
			100	0.1044	0.1146	0.1326	0.6321	0.8210	0.0893
			150	0.0105	0.0985	0.0369	0.2480	0.6347	0.0101
			300	0.0013	0.0043	0.0015	0.0472	0.0086	0.0054
5	4	2	20	0.5467	0.5847	0.8655	2.1768	1.7456	1.9087
			50	0.2474	0.1188	0.4652	0.8740	1.0157	0.9889
			100	0.0479	0.1068	0.4245	0.6531	0.8751	0.2350
			150	0.1124	0.0042	0.1135	0.0478	0.1450	0.0842
			300	0.0179	0.0024	0.0075	0.0023	0.0541	0.0357

Table 6. Simulation results of TLW regression models for different censoring percentages (%) with true values: $β_{10} = 0.3$, $β_{11} = 0.4$, $β_{20} = 0.2$, $β_{21} = 0.5$, and $θ = 0.6$.

		$n = 100$			$n = 300$			$n = 500$
%	Parameter	AEs	MABs	MSEs	AEs	MABs	MSEs	AEs	MABs	MSEs
0%	$β_{10}$	0.2949	−0.0051	0.0239	0.3018	0.0018	0.0091	0.3056	0.0056	0.0054
	$β_{11}$	0.4077	0.0077	0.0232	0.3994	−0.0006	0.0071	0.3960	−0.0040	0.0044
	$β_{20}$	0.2234	0.0234	0.0137	0.2073	0.0073	0.0046	0.2087	0.0087	0.0026
	$β_{21}$	0.5014	0.0014	0.0260	0.5019	0.0019	0.0086	0.4971	−0.0029	0.0049
	$θ$	0.6323	0.0323	0.2064	0.6241	0.0241	0.0984	0.6286	0.0286	0.0569
10%	$β_{10}$	0.2933	−0.0067	0.0219	0.3012	0.0012	0.0090	0.3018	0.0018	0.0050
	$β_{11}$	0.4062	0.0062	0.0228	0.3990	−0.0010	0.0075	0.3991	−0.0009	0.0041
	$β_{20}$	0.2192	0.0192	0.0138	0.2100	0.0100	0.0051	0.2054	0.0054	0.0030
	$β_{21}$	0.5064	0.0064	0.0283	0.4983	−0.0017	0.0089	0.5028	0.0028	0.0054
	$θ$	0.6188	0.0188	0.1765	0.6225	0.0225	0.0908	0.6144	0.0144	0.0491
30%	$β_{10}$	0.2902	−0.0098	0.0253	0.2997	−0.0003	0.0101	0.3033	0.0033	0.0057
	$β_{11}$	0.4114	0.0114	0.0266	0.3987	−0.0013	0.0088	0.3969	−0.0031	0.0055
	$β_{20}$	0.2313	0.0313	0.0208	0.2093	0.0093	0.0060	0.2072	0.0072	0.0033
	$β_{21}$	0.5005	0.0005	0.0404	0.5013	0.0013	0.0111	0.4980	−0.0020	0.0065
	$θ$	0.6306	0.0306	0.1611	0.6125	0.0125	0.0960	0.6138	0.0138	0.0518

Table 7. Estimates of TLW parameters for dataset I.

	MLE	Std. Err	Inf. 95% CI	Sup. 95% CI
$θ$	−28.44948	28.085548	−32.53708	−25.44727
k	3.454494	0.9554519	1.581842	5.327145
$λ$	12.75564	2.3338198	8.181439	17.32985

Table 8. Competence of the models for dataset I.

Distribution	No. of Estimated Parameters	AIC	CAIC	BIC	HQIC	$- ℓ$
TLW	3	507.124	507.314	515.726	510.619	250.562
W	2	524.667	524.762	530.402	526.998	260.334
KW	4	510.612	510.932	522.082	515.273	251.306
WW	4	528.667	528.987	540.137	533.328	260.334
GPW	4	512.800	513.120	524.270	517.460	252.400
PW	4	513.232	513.552	524.702	517.893	252.616
BW	4	511.523	511.843	522.993	516.184	251.762
MOW	3	513.522	513.713	522.125	517.018	253.761
EGW	4	512.706	513.026	524.176	517.367	252.353

Table 9. Estimates of TLW parameters for dataset II.

	MLE	Std. Err	Inf. 95% CI	Sup. 95% CI
$θ$	−49.54747	3.8976505	−57.18672	−43.9367
k	1.374038	0.4973775	0.399197	2.348880
$λ$	1.029519	0.6568050	−0.257795	2.316833

Table 10. Competence of the models for dataset II.

Distribution	No. of Estimated Parameters	AIC	CAIC	BIC	HQIC	$- ℓ$
TLW	3	118.197	118.597	124.673	120.748	56.098
W	2	129.933	130.130	134.251	131.634	62.967
KW	4	121.642	122.320	130.278	125.044	56.821
WW	4	133.933	134.611	142.569	137.335	62.967
GPW	4	122.118	122.796	130.754	125.520	57.059
PW	4	123.742	124.420	132.377	127.144	57.871
BW	4	121.285	121.963	129.921	124.687	56.643
MOW	3	122.570	122.970	129.047	125.122	58.285
EGW	4	121.883	122.561	130.519	125.285	56.942

Table 11. AIC and GD values for TLW and Weibull regression models with different structures for COVID-19 data.

Model	TLW				Weibull
Model	$M_{0}$	$M_{1}$	$M_{2}$	$M_{3}$	$M_{0}$	$M_{1}$	$M_{2}$	$M_{3}$
AIC	854.947	821.162	828.469	814.707	855.651	823.412	848.348	817.815
GD	848.947	811.162	818.469	800.707	851.651	815.412	840.348	805.815
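
The two rows of Table 11 are linked: if GD denotes the global deviance $-2ℓ$ and AIC $= \mathrm{GD} + 2p$ (an assumption about the definitions, consistent with the values shown), then the number of estimated parameters $p$ in each structure can be read off as $(\mathrm{AIC} - \mathrm{GD})/2$. A minimal sketch follows; the dictionary layout and the helper name `n_params` are illustrative only.

```python
# AIC and GD values copied from Table 11 (COVID-19 data).
aic = {
    "TLW":     {"M0": 854.947, "M1": 821.162, "M2": 828.469, "M3": 814.707},
    "Weibull": {"M0": 855.651, "M1": 823.412, "M2": 848.348, "M3": 817.815},
}
gd = {
    "TLW":     {"M0": 848.947, "M1": 811.162, "M2": 818.469, "M3": 800.707},
    "Weibull": {"M0": 851.651, "M1": 815.412, "M2": 840.348, "M3": 805.815},
}

def n_params(aic_value, gd_value):
    """Parameter count implied by AIC = GD + 2p (assumed definition)."""
    return round((aic_value - gd_value) / 2)

# Implied number of parameters for every distribution and structure.
params = {
    dist: {m: n_params(aic[dist][m], gd[dist][m]) for m in aic[dist]}
    for dist in aic
}

# Structure with the smallest AIC within each distribution.
best = {dist: min(aic[dist], key=aic[dist].get) for dist in aic}

print(params)
print(best)
```

Under this reading, each TLW structure carries exactly one parameter more than the corresponding Weibull structure, which matches the single extra shape parameter $θ$.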

Table 12. MLEs, SEs, and p-values for the $M_{3}$-TLW regression fitted to COVID-19 data.

	MLEs	SEs	p-Values
$β_{10}$	7.8325	0.2995	<0.01
$β_{11}$	−0.419	0.1432	<0.01
$β_{12}$	−0.0467	0.0040	<0.01
$β_{20}$	−0.4240	0.1690	0.01
$β_{21}$	0.4605	0.0939	<0.01
$β_{22}$	0.0096	0.0027	<0.01
$θ$	3.6408	0.3289	<0.01

Table 13. AIC and GD values for TLW and Weibull regression models with different structures for the Post-harvested dataset.

Model	TLW				Weibull
Model	$M_{0}$	$M_{1}$	$M_{2}$	$M_{3}$	$M_{0}$	$M_{1}$	$M_{2}$	$M_{3}$
AIC	1520.226	1491.587	1487.429	1482.161	1519.985	1495.154	1514.562	1486.809
GD	1514.226	1479.587	1475.429	1464.161	1515.985	1485.154	1504.562	1470.809

Table 14. MLEs, SEs, and p-values for the $M_{3}$-TLW regression fitted to the Post-harvested dataset.

	MLEs	SEs	p-Values
$β_{10}$	4.1417	0.0380	<0.01
$β_{11}$	0.1469	0.0471	<0.01
$β_{12}$	−0.0800	0.0482	0.0987
$β_{13}$	0.0354	0.0424	0.4044
$β_{20}$	1.5394	0.1087	<0.01
$β_{21}$	0.4443	0.1825	0.0159
$β_{22}$	0.1994	0.1532	0.1949
$β_{23}$	0.5335	0.1407	<0.01
$θ$	3.2938	0.2847
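
The p-values in Table 14 can be approximated from the MLEs and SEs with a two-sided Wald test, $z = \hat{β}/\mathrm{SE}$ and $p = 2\{1 - Φ(|z|)\}$. This is a sketch under the assumption of a standard normal reference distribution; the tabulated values are slightly larger, so the software used for the table presumably relies on a different reference distribution, which is not stated here. The function name `wald_p_value` is hypothetical.

```python
import math

def wald_p_value(estimate, std_error):
    """Two-sided p-value of z = estimate / std_error under a standard normal (assumed)."""
    z = estimate / std_error
    return math.erfc(abs(z) / math.sqrt(2.0))   # equals 2 * (1 - Phi(|z|))

# (MLE, SE, tabulated p-value) for the rows of Table 14 with an explicit p-value.
table14 = {
    "beta12": (-0.0800, 0.0482, 0.0987),
    "beta13": (0.0354, 0.0424, 0.4044),
    "beta21": (0.4443, 0.1825, 0.0159),
    "beta22": (0.1994, 0.1532, 0.1949),
}

approx = {name: wald_p_value(mle, se) for name, (mle, se, _) in table14.items()}

for name, (mle, se, reported) in table14.items():
    print(f"{name}: z = {mle / se:+.3f}, normal p = {approx[name]:.4f}, reported p = {reported}")
```

On this approximation, the covariate effects $β_{12}$, $β_{13}$, and $β_{22}$ are not significant at the 5% level, in agreement with the tabulated p-values.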

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
