An Extended Weibull Regression for Censored Data: Application for COVID-19 in Campinas, Brazil

Rodrigues, Gabriela M.; Ortega, Edwin M. M.; Cordeiro, Gauss M.; Vila, Roberto

doi:10.3390/math10193644

Open AccessArticle

An Extended Weibull Regression for Censored Data: Application for COVID-19 in Campinas, Brazil

by

Gabriela M. Rodrigues

^1,†

,

Edwin M. M. Ortega

^1,†

,

Gauss M. Cordeiro

^2,*,†

and

Roberto Vila

^3,†

¹

Department of Exact Sciences, University of São Paulo, Piracicaba 13418-900, Brazil

²

Department of Statistics, Federal University of Pernambuco, Recife 50670-901, Brazil

³

Department of Statistics, University of Brasilia, Brasilia 70910-900, Brazil

^*

Author to whom correspondence should be addressed.

^†

These authors contributed equally to this work.

Mathematics 2022, 10(19), 3644; https://doi.org/10.3390/math10193644

Submission received: 10 September 2022 / Revised: 28 September 2022 / Accepted: 29 September 2022 / Published: 5 October 2022

(This article belongs to the Special Issue Current Developments in Theoretical and Applied Statistics)

Download

Browse Figures

Versions Notes

Abstract

:

This work aims to study the factors that increase the risk of death of hospitalized patients diagnosed with COVID-19 through the odd log-logistic regression model for censored data with two systematic components, as well as provide new mathematical properties of this distribution. To achieve this, a dataset of individuals residing in the city of Campinas (Brazil) was used and simulations were performed to investigate the accuracy of the maximum likelihood estimators in the proposed regression model. The provided properties, such as stochastic representation, identifiability, and moments, among others, can help future research since they provide important information about the distribution structure. The simulation results revealed the consistency of the estimates for different censoring percentages and show that the empirical distribution of the modified deviance residuals converge to the standard normal distribution. The proposed model proved to be efficient in identifying the determinant variables for the survival of the individuals in this study, which can help to find more opportune treatments and medical interventions. Therefore, the new model can be considered an interesting alternative for future works that evaluate censored lifetimes.

Keywords:

censored data; COVID-19; odd log-logistic Weibull; regression model

MSC:

62J02; 62N01; 62N03

1. Introduction

In theory of survival models, the distributions are often attributed to time intervals and different structures of regression models have been constructed. Recently, many distributions and regression models have been developed based on extended Weibull distributions, for example, the log-odd log-logistic location-scale regression model [1], the bivariate odd-log-logistic-Weibull regression model [2], the Weibull zero-inflated right-censored regression model [3], the inverted Weibull regression model [4], and the Weibull quantile regression model [5], among many others. The importance of such extensions is remarkable and some important results can be found in the field of medicine. For example, in a study of patients with colorectal cancer, Moamer et al. (2017) [6] assessed the survival and prognostic factors based on the Weibull competing-risks model and showed that the body mass index and some stages of disease influenced survival, and Yoosefi et al. (2018) [7], using exponentiated Weibull distribution, found that the age of patients at diagnosis was the most important influencing factor for increasing survival and reducing the mortality rate. The results of the risks associated with breast cancer using a mixture cure fraction model based on the generalized modified Weibull model can be seen in Naseri et al. (2018) [8] where covariates, such as the number of metastatic lymph nodes and histologic grade, were statistically significant and the estimated cure fraction was 58%. Pavisic et al. (2020) [9] determined the factors that influenced the survival of patients with autosomal dominant familial Alzheimer disease (ADAD) using multilevel mixed-effects Weibull survival models, which proved to be longer for successive generations and in individuals with atypical presentations. In this context, we propose the odd log-logistic Weibull (OLLW) regression model for censored data, which is different from the log-linear regression model addressed by da Cruz et al. (2016) [1].

The new regression model has an extra shape parameter that enables greater flexibility for modeling the risk rate function in the four most common shapes. It is a possible alternative to mixture models since the hazard rate can be bimodal. We define two systematic components for the shape and scale parameters of the Weibull using the logarithmic link function to measure the effects of the covariables. We provide some simulations to evaluate the precision of the maximum likelihood estimators (MLEs) and the empirical distribution of the deviance residuals.

We also present an application for hospitalized patients diagnosed with COVID-19 (SARS-CoV-2 B.1.1.529) in the city of Campinas (Brazil). Although its mortality rate (

0, 4 %

) was lower than other variants in earlier periods of the pandemic (about

4 %

), its transmissibility was considered to be extremely high in Brazil, causing a high number of hospitalizations and deaths (Xavier et al., 2022 [10]). In this context, studies are necessary to investigate the variables that increase the risk of death, which can vary according to the pandemic scenario and demographic and epidemiological factors of each region. Knowledge of the progression of the disease can support more timely and effective medical interventions (see Lu et al. (2021) [11], Giacomelli et al. (2020) [12], and Zheng et al. (2020) [13]. In previous survival analyses, some risk factors were frequently mentioned, such as high age [12,14,15], diabetes [12,16,17], and obesity [12,16,17]. In addition to these, some interesting factors were verified, such as neurological diseases [18] and sex [16,19]. Lu et al. (2021) [11] revealed that lower lymphocyte counts in a hemogram, lowplatelet count and serum albumin, high C-reactive protein level, and renal dysfunction may be risk factors. Nijman et al. (2021) [15] found that immunocompromised patients who used anticoagulants or antiplatelet medication had increased risk of death. Zheng et al. (2020) [13] also found cardiovascular disease, hypertension, and smoking as factors that could greatly affect the prognosis of COVID-19.

The present work aims to study the factors that increase the risk of death of hospitalized patients diagnosed with COVID-19 using the odd log-logistic Weibull regression model and to provide new mathematical properties. Motivated by the pandemic scenarios and given the notable contributions of the Weibull distribution and its extensions, the results obtained with the application are considered the main contributions of this work, whereas information and prior knowledge of the impact of such factors on survival can also be decisive in treatment [20]. In addition, the new mathematical properties provided bring more information and can help future research. In addition, the use of this dataset can motivate the future use of this model in lifetime data, thus showing that it can be an interesting and efficient alternative.

The rest of the paper presents the following topics. Section 2 provides a brief summary of the OLLW distribution and some new mathematical properties. Section 3 defines the OLLW regression model for censored data and presents diagnostic measures and residuals. Some simulations for the new regression model are described in Section 4. The usefulness of our results is illustrated through their application to COVID-19 data in Section 5. Finally, some conclusions are cited in Section 6.

2. New OLLW Properties

The Weibull distribution is mostly used in reliability and lifetime modeling, and it encompasses both increasing and decreasing failure rate functions. Its cumulative distribution function (cdf) is

G (t; η) = 1 - exp [- {(\frac{t}{λ})}^{α}], t ⩾ 0,

(1)

where

α > 0

is the shape,

λ > 0

is the scale, and

η = {(α, λ)}^{⊤}

.

The quantile function (qf) of the Weibull by inverting (1) is

Q_{W} (u; η) = λ {[- log (1 - u)]}^{1 / α}

for

u \in (0, 1)

.

Based on the idea of Gleaton and Lynch (2006) [21], the OLLW cdf

F (t) = F (t; η, τ)

(for

t ⩾ 0

) comes from (1)

F (t) = \frac{{\{1 - exp [- {(\frac{t}{λ})}^{α}]\}}^{τ}}{{\{1 - exp [- {(\frac{t}{λ})}^{α}]\}}^{τ} + {\{exp [- {(\frac{t}{λ})}^{α}]\}}^{τ}},

(2)

where

τ > 0

is an extra shape parameter.

By differentiating (2), the OLLW probability density function (pdf) becomes

f (t) = \frac{τ α t^{α - 1} {\{exp [- {(\frac{t}{λ})}^{α}]\}}^{τ} {\{1 - exp [- {(\frac{t}{λ})}^{α}]\}}^{τ - 1}}{λ^{α} {[{\{1 - exp [- {(\frac{t}{λ})}^{α}]\}}^{τ} + {\{exp [- {(\frac{t}{λ})}^{α}]\}}^{τ}]}^{2}} .

(3)

Let the random variable

T \sim O L L W (λ, α, τ)

have pdf (3). Plots of the pdf of T are reported in Figure 1, thus showing flexibility for modeling skewness, kurtosis, and bimodality.

By inverting (2), the qf of the OLLW distrubution is given in terms of the Weibull counterpart

Q_{OLLW} (u) = Q_{W} (v (u; τ); η),

(4)

where

v (u; τ) = u^{1 / τ} / [u^{1 / τ} + {(1 - u)}^{1 / τ}]

.

We provide below new structural properties of the OLLW distribution.

2.1. Modes

Every mode

t_{0} = t_{0} (λ, α, τ)

of the OLLW satisfies the equation

A (t) = B (t)

, where

A (t) = \frac{(τ + 1) {exp [{(\frac{t}{λ})}^{α}] - 1}^{τ} - τ + 1}{{1 - exp [- {(\frac{t}{λ})}^{α}]} \{1 + {exp [{(\frac{t}{λ})}^{α}] - 1}^{τ}\}}, B (t) = \frac{[{(\frac{t}{λ})}^{α} + 1] α - 1}{α {(\frac{t}{λ})}^{α}} .

By taking

y_{t} = exp [{(t / λ)}^{α}] - 1

,

A (t)

and

B (t)

can be written as

\begin{matrix} A (t) = [(τ + 1) - \frac{2 τ}{y_{t}^{τ} + 1}] \frac{(y_{t} + 1)}{y_{t}}, \\ B (t) = 1 + \frac{(α - 1)}{α log (y_{t} + 1)} . \end{matrix}

Hence, every mode

t_{0}

of the OLLW density satisfies

[(τ + 1) - \frac{2 τ}{y_{t}^{τ} + 1}] \frac{(y_{t} + 1)}{y_{t}} = 1 + \frac{(α - 1)}{α log (y_{t} + 1)} .

It is an arduous task to obtain analytically the roots of this equation. Graphically, it has at most three roots from which the bimodality of the OLLW density is guaranteed (Figure 1).

2.2. Stochastic Representation

Proposition 1.

The stochastic representation of

T \sim O L L W (λ, α, τ)

holds:

T = λ {[log (1 + S)]}^{1 / α},

where S has the Burr Type XII distribution, say

S \sim B U R R (τ, 1)

.

Proof.

Note that the cdf

F (t)

in (2) can be rewritten as

F (t) = \int_{0}^{\frac{G (t; η)}{1 - G (t; η)}} \frac{τ u^{τ - 1}}{{(1 + u^{τ})}^{2}} d u = P (S ⩽ \frac{G (t; η)}{1 - G (t; η)}), S \sim B U R R (τ, 1),

(5)

where

G (t; η)

is given by (1). Since

d G (t; η) / d t > 0

(for

t > 0

), we obtain

d G^{- 1} (t; η) / d t = 1 / [d G (G^{- 1} (t; η); η) / d t] > 0

(for

t > 0

), i.e., the function

t ⟼ G^{- 1} (t; η)

is increasing, hence,

P (G^{- 1} (\frac{S}{1 + S}; η) ⩽ t), t > 0 .

In other words, T and

G^{- 1} (S / (1 + S); η)

are equal in distribution. The proof follows based on the Weibull qf. □

2.3. Closure under Changes of Scale and of Power

Proposition 2.

1.: If $T \sim O L L W (λ, α, τ)$ , then $c T \sim O L L W (c λ, α, τ)$ , $c > 0$ .
2.: If $T \sim O L L W (λ, α, τ)$ , then $T^{k} \sim O L L W (λ^{k}, α / k, τ)$ , $k > 0$ .

Proof.

Let

U (t; λ, α) = G (t; η) / [1 - G (t; τ)] = exp [{(t / λ)}^{α}] - 1

. By (5),

F (t) = P (S ⩽ U (t; λ, α))

, with

S \sim B U R R (τ, 1)

. Since

U (t / c; λ, α) = U (t; c λ, α)

and

U (t^{1 / k}; λ, α) = U (t; λ^{k}, α / k)

, the proof is complete. □

2.4. Identifiability

The concept of identifiability of a distribution means that distinct values of the parameters should correspond to distinct probability distributions: if

(λ_{1}, α_{1}, τ_{1}) \neq (λ_{2}, α_{2}, τ_{2})

, then also

F_{1} (t) \neq F_{2} (t)

,

\forall t > 0

, where

F_{i} (t) = F (t; λ_{i}, α_{i}, τ_{i})

,

i = 1, 2

, is defined by (2).

Proposition 3.

The OLLW distribution is identifiable.

Proof.

Let us suposse that

F (t; λ_{1}, α_{1}, τ_{1}) = F (t; λ_{2}, α_{2}, τ_{2})

,

\forall t > 0

. By (5), it is equivalent to

P (S_{1} ⩽ \frac{G (t; η_{1})}{1 - G (t; η_{1})}) = P (S_{2} ⩽ \frac{G (t; η_{2})}{1 - G (t; η_{2})}), S_{i} \sim B U R R (η_{i}, 1),

where

η_{i} = {(α_{i}, λ_{i})}^{⊤}

and

G (t; η_{i}) / [1 - G (t; η_{i})] = exp [{(t / λ_{i})}^{α_{i}}] - 1

,

i = 1, 2

. For

S \sim B U R R (η, 1)

, it is well-known that

P (S ⩽ s) = 1 - {(1 + s^{τ})}^{- 1}

. So, this equation reduces to

{[\frac{G (t; η_{1})}{1 - G (t; η_{1})}]}^{τ_{1}} = {[\frac{G (t; η_{2})}{1 - G (t; η_{2})}]}^{τ_{2}} .

(6)

Setting

t = λ_{1} {log}^{1 / α_{1}} (2)

, we obtain

{\{exp [{(\frac{λ_{1}}{λ_{2}})}^{α_{2}} {log}^{α_{2} / α_{1}} (2)] - 1\}}^{τ_{2}} = 1 .

Equivalently, we have

{(\frac{λ_{1}}{λ_{2}})}^{α_{2}} {log}^{\frac{α_{2} - α_{1}}{α_{1}}} (2) = 1 .

(7)

Since the only real solutions of

x {log}^{y} (2) = 1

are

x = 1

and

y = 0

, it follows from Equation (7) that

λ_{1} = λ_{2}

and

α_{1} = α_{2}

. Using these identities in (6),

τ_{1} = τ_{2}

, and the proof is complete. □

2.5. Existence of Real Moments

Proposition 4.

If

T \sim O L L W (λ, α, τ)

and

α τ > max {p, - p}

, then

E (T^{p}) ⩽ λ^{p} B (\frac{α τ - p}{α τ}, \frac{α τ + p}{α τ}) .

Proof.

Since

0 < log (1 + s) ⩽ s

we have

{[log (1 + s)]}^{p / α} ⩽ s^{p / α}

. By using this inequality and the stochastic representation of T (see Proposition 1), we obtain

T^{p} = λ^{p} {[log (1 + S)]}^{p / α} ⩽ λ^{p} S^{p / α} .

Taking the expectations on both sides of the above inequality and then using the well-known identity

E (S^{r}) = B (\frac{τ - r}{τ}, \frac{τ + r}{τ}), S \sim B U R R (τ, 1), τ > max {r, - r},

the proof follows. □

2.6. Tail Behavior

The continuous univariate distribution F (on

R

) has an upper light tail if (for

s > 0

)

lim_{x \to \infty} \frac{exp (- s x)}{1 - F (x)} = \infty,

whereas it has an upper heavy tail if (for

s > 0

)

lim_{x \to \infty} \frac{exp (- s x)}{1 - F (x)} = 0 .

Proposition 5.

The OLLW distribution has a transition from heavy-tailed to light-tailed. In other words,

(a): For $0 < α < 1$ , the OLLW distribution has an upper heavy tail.
(b): For $α > 1$ , the OLLW distribution has an upper light tail.
(c): For $α = 1$ , the OLLW distribution does not have a defined tail behavior.

Proof.

A simple algebraic manipulation leads to (for

s > 0

and

α > 0

)

\begin{matrix} lim_{t \to \infty} \frac{exp (- s t)}{1 - F (t)} & = lim_{t \to \infty} \{exp (- s t) + {[exp (- \frac{s t}{τ} + \frac{t^{α}}{λ^{α}}) - exp (- \frac{s t}{τ})]}^{τ}\} \\ = \{\begin{matrix} 0, & 0 < α < 1, \\ 0, & α = 1 and s > τ / λ, \\ 1, & α = 1 and s = τ / λ, \\ \infty, & α = 1 and s < τ / λ, \\ \infty, & α > 1 . \end{matrix} \end{matrix}

This completes the proof. □

3. The OLLW Regression Model

The OLLW regression model is defined by two systematic components for

α_{i}

and

λ_{i}

(for

i = 1, \dots, n

), as follows

Equation added

g_{1} (λ_{i}) = η_{i 1} = x_{i 1}^{⊤} β_{1} and g_{2} (α_{i}) = η_{i 2} = x_{i 2}^{⊤} β_{2},

(8)

where

β_{j} = {(β_{j 0}, \dots, β_{j p})}^{⊤}

(

j = 1, 2

) are vectors of length (

p_{j} + 1

) of unknown coefficients functionally independent,

p_{j}

is the number of explanatory variables related to the jth parameter,

η_{i j}

are the linear predictors, and

x_{i j} = {(v_{i j 1}, \dots, v_{i j p_{j}})}^{⊤}

are observations on

p_{1}

and

p_{2}

known regressors. The functions

g_{1}

and

g_{2}

defined from

R \to R^{+}

should be strictly monotone and at least twice differentiable. The functions satisfy

λ_{i} = g_{1}^{- 1} (x_{i 1}^{⊤} β_{1})

and

α_{i} = g_{2}^{- 1} (x_{i 2}^{⊤} β_{2})

, where

g_{j}^{- 1} (\cdot)

is the inverse function of

g_{j} (\cdot)

. So, in the following sections, we consider the logarithmic link function for

g_{j} (\cdot)

:

Equation updated

λ_{i} = exp (x_{i 1}^{⊤} β_{1}) and α_{i} = exp (x_{i 2}^{⊤} β_{2}) .

The case

α_{i} = 1

leads to the exponential regression model.

Let

T_{i}

and

C_{i}

be the lifetime and censoring time for the ith individual. The survival function of

T_{i}

given

x_{i}

comes from (1) as

S (t | x_{i}) = \frac{{\{exp [- {(\frac{t}{λ_{i}})}^{α_{i}}]\}}^{τ}}{{\{1 - exp [- {(\frac{t}{λ_{i}})}^{α_{i}}]\}}^{τ} + {\{exp [- {(\frac{t}{λ_{i}})}^{α_{i}}]\}}^{τ}} .

(9)

Consider the independent observations

(t_{1}, x_{1}), \dots, (t_{n}, x_{n})

, where

t_{i} = min {T_{i}, C_{i}}

under the independence of

T_{i}

and

C_{i}

. The log-likelihood function for

θ = {(τ, β_{1}^{⊤}, β_{2}^{⊤})}^{⊤}

from Equation (9) is

\begin{matrix} l (θ) & = & r log (τ) + \sum_{i \in F} log (\frac{α_{i}}{λ_{i}^{α_{i}}}) + \sum_{i \in F} (α_{i} - 1) log (t_{i}) + τ \sum_{i \in F} log [κ_{α_{i}, λ_{i}} (t_{i})] + \\ (τ - 1) \sum_{i \in F} log [1 - κ_{α_{i}, λ_{i}} (t_{i})] - 2 \sum_{i \in F} log \{{[1 - κ_{α_{i}, λ_{i}} (t_{i})]}^{τ} + κ_{α_{i}, λ_{i}}^{τ} (t_{i})\} + \\ \sum_{i \in C} log \{\frac{κ_{α_{i}, λ_{i}}^{τ} (t_{i})}{{[1 - κ_{α_{i}, λ_{i}} (t_{i})]}^{τ} + κ_{α_{i}, λ_{i}}^{τ} (t_{i})}\}, \end{matrix}

(10)

where r is the number of failures, F and C refer to the sets of lifetimes and censoring times, respectively, and

κ_{α_{i}, λ_{i}} (t_{i}) = exp [- {(t_{i} / λ_{i})}^{α_{i}}]

.

The maximum likelihood estimate (MLE)

\hat{θ}

of

θ

is found to maximize (10). The gamlss and AdequacyModel packages of the R software and the SAS procedure NLMixed can be used to find

\hat{θ}

. These packages have been widely adopted in many applied statistics papers.

3.1. Checking Model

The diagnosis of anomalies of the fitted regression is important after the parameter estimation. An analysis that can be carried out is based on the influence measures from the exclusion of observations.

The influence of the ith observation on the MLE

{\hat{θ}}_{(i)}

of

θ

when it is deleted is measured by the (maximized) likelihood distance (Cook, 1986 [22])

L D_{i} (θ) = 2 [l (\hat{θ}) - l ({\hat{θ}}_{(i)})] .

The generalized distance (Cook et al., 1988 [23]) is another influence measure

G D_{i} (θ) = {({\hat{θ}}_{(i)} - \hat{θ})}^{⊤} [\ddot{L} (\hat{θ})] ({\hat{θ}}_{(i)} - \hat{θ}),

where

- \ddot{L} (θ)

is the observed information matrix.

The deviance residuals used in survival analysis when there are censored observations (Escobar and Meeker, 1992 [24]) are given by

r_{D_{i}} = \{\begin{matrix} sign ({\hat{r}}_{M_{i}}) \times \\ {\{- 2 [1 + log [\frac{{\hat{κ}}_{α_{i}, λ_{i}}^{τ} (t_{i})}{{[1 - {\hat{κ}}_{α_{i}, λ_{i}} (t_{i})]}^{τ} + {\hat{κ}}_{α_{i}, λ_{i}}^{τ} (t_{i})}] + log \{- log [\frac{{\hat{κ}}_{α_{i}, λ_{i}}^{τ} (t_{i})}{{[1 - {\hat{κ}}_{α_{i}, λ_{i}} (t_{i})]}^{τ} + {\hat{κ}}_{α_{i}, λ_{i}}^{τ} (t_{i})}]\}]\}}^{1 / 2}, & if δ_{i} = 1, \\ sign ({\hat{r}}_{M_{i}}) {\{- 2 log [\frac{{\hat{κ}}_{α_{i}, λ_{i}}^{τ} (t_{i})}{{[1 - {\hat{κ}}_{α_{i}, λ_{i}} (t_{i})]}^{τ} + {\hat{κ}}_{α_{i}, λ_{i}}^{τ} (t_{i})}]\}}^{1 / 2}, & if δ_{i} = 0, \end{matrix}

(11)

where

δ_{i}

is the censoring indicator and

{\hat{r}}_{M_{i}} = δ_{i} + log [\hat{S} (t_{i} | x_{i})], {\hat{κ}}_{α_{i}, λ_{i}} (t_{i}) = exp [- {(\frac{t_{i}}{{\hat{λ}}_{i}})}^{{\hat{α}}_{i}}], {\hat{λ}}_{i} = exp (x_{i}^{⊤} {\hat{β}}_{1}), {\hat{α}}_{i} = exp (x_{i}^{⊤} {\hat{β}}_{2}) .

4. Simulation Study

Monte Carlo simulations examined the precision of the MLEs in the new regression model and evaluated the empirical distribution of the deviance residuals using the function optim in R software for some values of n and censoring the percentages. One thousand replicates were carried out for each configuration. The lifetimes

t_{1}^{*}, \dots, t_{n}^{*}

were generated from the OLLW

(λ_{i}, α_{i}, τ)

distribution and the censoring times

c_{1}, \dots, c_{n}

from a uniform distribution

(0, ν)

, where

ν

controls the censoring percentages. Just two covariates

x_{1} \sim Uniform (0, 1)

and

x_{2} \sim Binomial (1, 0.5)

were included in the systematic componentes:

λ_{i} = exp (β_{10} + β_{11} x_{1 i} + β_{12} x_{2 i}), α_{i} = exp (β_{20} + β_{21} x_{1 i} + β_{22} x_{2 i}), τ_{i} = exp (β_{30}),

(12)

where the true parameter values are taken as

β_{10} = 3

,

β_{11} = 2.5

,

β_{12} = 0.9

,

β_{20} = 2

,

β_{21} = 1.5

,

β_{22} = 0.8

and

β_{30} = 0.3

.

The simulation process follows the six steps:

(i): Generate $x_{i 1} \sim uniform (0, 1)$ and $x_{i 2} \sim binomial (1, 0.5)$ ;
(ii): Calculate $λ_{i}$ , $α_{i}$ and $τ_{i}$ from Equation (12);
(iii): Generate $u_{i} \sim U (0, 1)$ ;
(iv): Repeat previous steps to obtain $t_{i}^{*} = Q_{OLLW} (u_{i})$ from Equation (4).
(v): Generate $c_{i} \sim uniform (0, ν)$ and determine survival times $t_{i} = \min (t_{i}^{*}, c_{i})$ . If $t_{i}^{*} < c_{i}$ , then $δ_{i} = 1$ ; otherwise, $δ_{i} = 0$ (for $i = 1, \dots, n$ );
(vi): Calculate the deviance residuals.

Table 1 reveals that the (Averages) estimates tended to the true parameters and their biases and mean square errors (MSEs) decayed to zero when n became large. So, the consistency of the estimators holds. We also checked the model through the empirical coverage probabilities (CPs) of the 95% confidence intervals of the estimates. Table 2 shows that the CPs were close to the nominal level.

Figure 2 proves that the empirical distribution of the deviance residuals approximated the standard normal. So, the normal probability plot can be used with simulated envelopes.

5. Application to COVID-19 Data

We investigated the risk factors associated with death of diagnosed COVID-19 patients in the city of Campinas, Brazil. The sample was composed of hospitalized patients living in the city of Campinas or the northeastern area of the neighboring city of São Paulo in Brazil’s southeast region (Figure 3). A total of 322 patients infected with the virus (confirmed by RT-PCR screening) and classified as having Severe Acute Respiratory Syndrome 2 (SARS) were included in the study. The model was implemented in the gamlss script in the R software. The dataset and application codes can be accessed at https://github.com/gabrielamrodrigues/OLLW (accessed on 10 September 2022).

From an economic standpoint, Campinas has the eleventh largest municipal gross domestic product (GDP) in the country and was the first Brazilian city other than state capitals to be classified as a metropolis. It thus has significant national influence. In 2011, it was responsible for at least 15% of the nation’s scientific production and is the third-leading Brazilian city in terms of research and development. For these reasons and accuracy of the data, Campinas was selected in this study.

The response time

t_{i}

(in days) is the period from the first symptoms until death due to COVID-19. In this sample, approximately 66.45% of the observations are censored, corresponding to patients who died for other reasons and patients who survived until the end of the study. The associated explanatory variables (for

i = 1, \dots, 322

) are:

{cens}_{i}

: censoring indicator (0 = censored, 1 = time of life observed);

x_{i 1}

: sex (0 = female, 1 = male);

x_{i 2}

: age (in years);

x_{i 3}

: chronic cardiovascular disease (1 = yes, 0 = no or not informed);

x_{i 4}

: asthma (1 = yes, 0 = no or not informed);

x_{i 5}

: diabetes mellitus (1 = yes, 0 = no or not informed);

x_{i 6}

: chronic neurological disease (1 = yes, 0 = no or not informed); and

x_{i 7}

: obesity (1 = yes, 0 = no or not informed).

Descriptive Analysis

As in all statistical studies, we began with exploratory analysis of the data by studying the behavior of the response variable and its respective covariables. The Kaplan–Meier survival curves are presented in Figure 4, where it is possible to observe the existence of a higher risk of death among individuals suffering from diabetes or chronic neurological disease. In addition, Figure 5 clearly shows that patients aged from 65 to 90 years had the highest hospitalization frequency, as expected.

The MLEs and their standard errors (SEs) (in parentheses), as well as the Global Deviance (GD), Akaike Information Criterion (AIC), and Bayesian Information Criterion (BIC) from two fitted distributions to these data are given in Table 3.

The likelihood ratio (LR) statistic for comparing the OLLW and Weibull distributions (

w = 7.9

, p-value

< 0.005

) supports the first distribution. The estimated survival functions in Figure 6 also reveal this fact.

Further, the results from the fitted complete OLLW regression model

λ_{i} = exp (β_{10} + \sum_{j = 1}^{7} β_{1 j} x_{i j}) and α_{i} = exp (β_{20} + \sum_{j = 1}^{7} β_{2 j} x_{i j}), i = 1, \dots, 322,

are reported in Table 4.

The variables age, asthma, diabetes mellitus, and chronic neurological disease are significant (at the level of 5%) for

λ

. For the parameter

α

, the age, asthma, diabetes, chronic neurological disease, and obesity variables are significant and hence the reduced OLLW regression model is

λ_{i} = exp (β_{10} + β_{12} x_{i 2} + β_{15} x_{i 5} + β_{16} x_{i 6}) and α_{i} = exp (β_{20} + β_{22} x_{i 2} + β_{26} x_{i 6} + β_{27} x_{i 7}),

whose estimation results are given in Table 5. Some interpretations on the numbers in this table are addressed at the end of this section.

The influence measures in Section 3.1 are calculated in R and displayed in Figure 7. They show that the 26th and 270th observations (referring to the patients below) are possibly influential:

26th: A 64-year-old woman with comorbidities (cardiovascular disease, diabetes, and obesity) died in 6 days.
270th: An 11-month-old baby with a neurological disease died in 5 days, and is the only patient younger than 1 year.

Figure 8a displays the index plot of the residuals (

r_{D_{i}}

) in Equation (11), thus revealing that they have a random behavior. Figure 8b reports the normal probability plot with a simulated envelope (Atkinson, 1987 [25]), thus revealing that the reduced OLLW regression model is appropriate for these data.

The plots of the empirical and estimated survival functions for the two categorical variables in Figure 9 confirm the adequacy of the fitted regression.

Interpretation for $λ$

The survival time declines when the age increases.
Diabetes mellitus has a significant effect in reducing the survival time of COVID-19 patients.
The patients with chronic neurological disease have a significant reduction in survival time.

Interpretation for $α$

The patient age is also significant in terms of survival time variability.
The variability of survival time depends on whether the patient is obese or not.
The variability of survival time depends on whether the patient has chronic neurological disease or not.

Finally, we obtain

S (t | x_{i})

from Equation (9). In Figure 10, the estimated survival and hazard rates are plotted for the four hypothetical patients described earlier. Figure 10a reveals that patients with diabetes mellitus and chronic neurological diseases have a shorter survival time than those who do not have these diseases. Similarly, Figure 10b shows that patients with diabetes and chronic neurological diseases are at higher risk compared to patients who do not have these pathologies.

We can obtain the survival probabilities and median times from Equations (9) and (4), respectively. Then, we consider

x_{7}

fixed at 0 and

x_{2}

and

x_{6}

, as shown in Table 6. Table 7 and Table 8 show the probability of hospitalized patients surviving 20 days after the first symptom and the median time for some ages, respectively.

6. Conclusions

This work studied the factors that increase the risk of death of hospitalized patients diagnosed with COVID-19 using the odd log-logistic Weibull regression model with two systematic components. Some new general structural properties of this model were provided such as its stochastic representation, identifiability, and moments, among others. A simulation study was carried out to evaluate the proposed regression model, which revealed the consistency of the maximum likelihood estimators and showed that the empirical coverage probabilities were close to the nominal level and that the empirical distribution of the deviance modified residuals approached the standard normal.

The application to COVID-19 data revealed some important results. The older age group was a predictor of a higher death rate from COVID-19, corroborating studies by Giacomelli et al. (2020) [12] and Atlam et al. (2021) [14], and diabetes and obesity were also evidenced in this work as determinants for the survival of infected patients, as discussed in Giacomelli et al. (2020) [12], Albitar et al. (2020) [16], and Noor et al. (2020) [17]. Chronic neurological diseases were also identified as risk factors, but we emphasize that few studies have obtained these results (García-Azorín et al. (2020) [18] and Noor et al. (2020) [17]). Therefore, it is recommended to consider the presence of this comorbidity in future studies in the assessment of mortality risk, as well as verify its significance in other datasets. Several studies have also indicated that men are at greater risk of death (see Albitar et al. (2020) [16] and Liu et al. (2020) [19]). However, no significant differences were found between the sexes. Chronic cardiovascular disease and asthma also did not prove to be determinants for the survival of individuals in this study.

It is suggested that future works verify the current datasets and those from other cities, as well as verify whether the same covariates would be significant in a lifetime analysis.

It is possible to conclude that the proposed regression proved to be efficient in identifying the factors that influenced the survival of individuals in this dataset, which can help more timely and efficient medical interventions. Finally, this model can be considered an interesting alternative for future works that evaluate censored lifetimes.

Author Contributions

Conceptualization, G.M.R., E.M.M.O., G.M.C. and R.V.; methodology, G.M.R., E.M.M.O., G.M.C. and R.V.; software, G.M.R., E.M.M.O., G.M.C. and R.V.; validation, G.M.R., E.M.M.O., G.M.C. and R.V.; formal analysis, G.M.R., E.M.M.O., G.M.C. and R.V.; investigation, G.M.R., E.M.M.O., G.M.C. and R.V.; data curation, G.M.R., E.M.M.O., G.M.C. and R.V.; writing—original draft preparation, G.M.R., E.M.M.O., G.M.C. and R.V.; writing—review and editing, G.M.R., E.M.M.O., G.M.C. and R.V.; visualization, G.M.R., E.M.M.O., G.M.C. and R.V.; supervision, G.M.R., E.M.M.O., G.M.C. and R.V. All authors have read and agreed to the current version of the manuscript.

Funding

This study was financed in part by the Coordenação de Aperfeiçoamento de Pessoal de Nível Superior-Brasil (CAPES) (Finance Code 001).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The authors confirm that the data supporting the findings of this study are available within the article.

Conflicts of Interest

The authors declare no conflict of interest.

References

Da Cruz, J.N.; Ortega, E.M.M.; Cordeiro, G.M. The log-odd log-logistic Weibull regression model: Modelling, estimation, influence diagnostics and residual analysis. J. Stat. Comput. Simul. 2016, 86, 1516–1538. [Google Scholar] [CrossRef]
Da Cruz, J.N.; Ortega, E.M.M.; Cordeiro, G.M.; Suzuki, A.K.; Mialhe, F.L. Bivariate odd-log-logistic-Weibull regression model for oral health-related quality of life. Commun. Stat. Appl. Methods 2017, 24, 271–290. [Google Scholar] [CrossRef] [Green Version]
De Freitas Costa, E.; Schneider, S.; Carlotto, G.B.; Cabalheiro, T.; de Oliveira, M.R., Jr. Zero-inflated-censored Weibull and gamma regression models to estimate wild boar population dispersal distance. Jpn. J. Stat. Data Sci. 2021, 4, 1133–1155. [Google Scholar] [CrossRef]
Al-Dawsari, S.R.; Sultan, K.S. Inverted Weibull Regression Models and Their Applications. Stats 2021, 4, 269–290. [Google Scholar] [CrossRef]
Sánchez, L.; Leiva, V.; Saulo, H.; Marchant, C.; Sarabia, J.M. A new quantile regression model and its diagnostic analytics for a Weibull distributed response with applications. Mathematics 2021, 9, 2768. [Google Scholar] [CrossRef]
Moamer, S.; Baghestani, A.; Pourhoseingholi, M.A.; Hajizadeh, N.; Ahmadi, F.; Norouzinia, M. Evaluation of prognostic factors effect on survival time in patients with colorectal cancer, based on Weibull Competing-Risks Model. Gastroenterol. Hepatol. Bed Bench 2017, 10, 54–59. [Google Scholar]
Yoosefi, M.; Baghestani, A.R.; Khadembashi, N.; Pourhoseingholi, M.A.; Baghban, A.A.; Khosrovirad, A. Survival analysis of colorectal cancer patients using exponentiated Weibull distribution. Int. J. Cancer Manag. 2018, 11, e8686. [Google Scholar] [CrossRef]
Naseri, P.; Baghestani, A.R.; Momenyan, N.; Akbari, M.E. Application of a mixture cure fraction model based on the generalized modified weibull distribution for analyzing survival of patients with breast cancer. Int. J. Cancer Manag. 2018, 11, e62863. [Google Scholar] [CrossRef]
Pavisic, I.M.; Nicholas, J.M.; O’Connor, A.; Rice, H.; Lu, K.; Fox, N.C.; Ryan, N.S. Disease duration in autosomal dominant familial Alzheimer disease: A survival analysis. Neurol. Genet. 2020, 6, e507. [Google Scholar] [CrossRef]
Xavier, D.R.; Morais, I.; Magalhães, M.; Saldanha, R.; Dantas, R.; Barcellos, C.; Stenner, C. Nota Técnica 24 de 10 de Fevereiro de 2022. O avanço da Variante Ômicron, a Resposta das Vacinas e o Risco de Desassistência. 2022. Available online: https://www.arca.fiocruz.br/handle/icict/51252 (accessed on 10 September 2022).
Lu, W.; Yu, S.; Liu, H.; Suo, L.; Tang, K.; Hu, J.; Hu, K. Survival analysis and risk factors in COVID-19 patients. In Disaster Medicine and Public Health Preparedness; Cambridge University Press: Cambridge, UK, 2021; pp. 1–6. [Google Scholar] [CrossRef]
Giacomelli, A.; Ridolfo, A.L.; Milazzo, L.; Oreni, L.; Bernacchia, D.; Siano, M.; Bonazzetti, C.; Covizzi, A.; Schiuma, M.; Passerini, M.; et al. 30-day mortality in patients hospitalized with COVID-19 during the first wave of the Italian epidemic: A prospective cohort study. Pharmacol. Res. 2020, 158, 104931. [Google Scholar] [CrossRef] [PubMed]
Zheng, Z.; Peng, F.; Xu, B.; Zhao, J.; Liu, H.; Peng, J.; Li, Q.; Jiang, C.; Zhou, Y.; Liu, S.; et al. Risk factors of critical and mortal COVID-19 cases: A systematic literature review and meta-analysis. J. Infect. 2020, 81, 16–25. [Google Scholar] [CrossRef]
Atlam, M.; Torkey, H.; El-Fishawy, N.; Salem, H. Coronavirus disease 2019 (COVID-19): Survival analysis using deep learning and Cox regression model. Pattern Anal. Appl. 2021, 24, 993–1005. [Google Scholar] [CrossRef]
Nijman, G.; Wientjes, M.; Ramjith, J.; Janssen, N.; Hoogerwerf, J.; Abbink, E.; van de Maat, J.S. Risk factors for in-hospital mortality in laboratory-confirmed COVID-19 patients in The Netherlands: A competing risk survival analysis. PLoS ONE 2021, 16, e0249231. [Google Scholar] [CrossRef]
Albitar, O.; Ballouze, R.; Ooi, J.P.; Ghadzi, S.M.S. Risk factors for mortality among COVID-19 patients. Diabetes Res. Clin. Pract. 2020, 166, 108293. [Google Scholar] [CrossRef] [PubMed]
Noor, F.M.; Islam, M. Prevalence and associated risk factors of mortality among COVID-19 patients: A meta-analysis. J. Commun. Health 2020, 45, 1270–1282. [Google Scholar] [CrossRef] [PubMed]
García-Azorín, D.; Martínez-Pías, E.; Trigo, J.; Hernández-Pérez, I.; Valle-Peñacoba, G.; Talavera, B.; Simón-Campo, P.; de Lera, M.; Chavarría-Mir, A.; López-Sanz, C.; et al. Neurological comorbidity is a predictor of death in Covid-19 disease: A cohort study on 576 patients. Front. Neurol. 2020, 11, 781. [Google Scholar] [CrossRef] [PubMed]
Liu, Y.; Du, X.; Chen, J.; Jin, Y.; Peng, L.; Wang, H.H.; Zhao, Y. Neutrophil-to-lymphocyte ratio as an independent risk factor for mortality in hospitalized patients with COVID-19. J. Infect. 2020, 81, 6–12. [Google Scholar] [CrossRef]
Yang, J.; Zheng, Y.A.; Gou, X.; Pu, K.; Chen, Z.; Guo, Q.; Ji, R.; Wang, H.; Wang, Y.; Zhou, Y. Prevalence of comorbidities and its effects in patients infected with SARS-CoV-2: A systematic review and meta-analysis. Int. J. Infect. Dis. 2020, 94, 91–95. [Google Scholar] [CrossRef]
Gleaton, J.U.; Lynch, J.D. Properties of generalized log-logistic families of lifetime distributions. J. Probab. Stat. Sci. 2006, 4, 51–64. [Google Scholar]
Cook, R.D. Assesment of local influence (with discussion). J. R. Stat. Soc. 1986, 48, 133–169. [Google Scholar]
Cook, R.D.; Peña, D.; Weisberg, S. The likelihood displacement: A unifying principle for influence measures. Commun. Stat. Theor. Methods 1988, 17, 623–640. [Google Scholar] [CrossRef]
Escobar, L.A.; Meeker, W.Q. Assessing influence in regression analysis with censored data. Biometrics 1992, 48, 507–528. [Google Scholar] [CrossRef] [PubMed]
Atkinson, A.C. Plots, Transformations and Regression: An Introduction to Graphical Methods of Diagnostics Regression Analysis, 2nd ed.; Clarendon Press: Oxford, UK, 1987. [Google Scholar]

Figure 1. Plots of the OLLW density. (a) Changing

τ

,

λ = 1.5

and

α = 5

. (b) Changing

λ

,

α = 5

and

τ = 0.3

.

Figure 1. Plots of the OLLW density. (a) Changing

τ

,

λ = 1.5

and

α = 5

. (b) Changing

λ

,

α = 5

and

τ = 0.3

.

Figure 2. Normal probability plots of

r_{D_{i}}

’s for

n = 100

, 250, and 500, and censoring percentages

0 %

,

10 %

, and

30 %

.

Figure 2. Normal probability plots of

r_{D_{i}}

’s for

n = 100

, 250, and 500, and censoring percentages

0 %

,

10 %

, and

30 %

.

Figure 3. Location of the city of Campinas, São Paulo, Brazil.

Figure 4. Kaplan–Meier survival curves: (a) Sex; (b) Chronic cardiovascular disease; (c) Diabetes mellitus; (d) Obesity; (e) Asthma and (f) Chronic neurological disease.

Figure 5. Histogram of the covariate “age”.

Figure 6. The estimated and empirical survival functions for COVID-19 data.

Figure 7. Index plots for (a)

G D_{i} (θ)

and (b)

L D_{i} (θ)

.

Figure 7. Index plots for (a)

G D_{i} (θ)

and (b)

L D_{i} (θ)

.

Figure 8. (a) Index plot of

r_{D_{i}}

. (b) Normal probability plot for

r_{D_{i}}

with envelope.

Figure 8. (a) Index plot of

r_{D_{i}}

. (b) Normal probability plot for

r_{D_{i}}

with envelope.

Figure 9. Estimated and empirical survival functions: (a) Diabetes mellitus; (b) chronic neurological disease.

Figure 10. (a) Estimated survival functions. (b) Estimated hazard functions.

Table 1. Findings for the averages, biases, and MSEs from the simulated OLLW regression model.

			$n = 100$			$n = 250$			$n = 500$
%	$θ$	Averages	Biases	MSEs	Averages	Biases	MSEs	Averages	Biases	MSEs
$0 %$	$β_{10}$	3.0022	0.0022	0.0002	3.0012	0.0012	0.0001	3.0005	0.0005	0.0000
	$β_{11}$	2.4989	−0.0011	0.0001	2.4999	−0.0001	0.0001	2.5001	0.0001	0.0000
	$β_{12}$	0.8997	−0.0003	0.0001	0.8999	−0.0001	0.0000	0.9002	0.0002	0.0000
	$β_{20}$	2.0359	0.0359	0.1392	2.0065	0.0065	0.0485	2.0038	0.0038	0.0205
	$β_{21}$	1.5078	0.0078	0.0900	1.4925	−0.0075	0.0325	1.5030	0.0030	0.0145
	$β_{22}$	0.7987	−0.0013	0.0270	0.7948	−0.0052	0.0114	0.7996	−0.0004	0.0049
	$β_{30}$	0.2832	−0.0168	0.1230	0.3085	0.0085	0.0436	0.2999	−0.0001	0.0182
$15 %$	$β_{10}$	3.0013	0.0013	0.0002	3.0016	0.0016	0.0001	3.0009	0.0009	0.0000
	$β_{11}$	2.4992	−0.0008	0.0002	2.4993	−0.0007	0.0001	2.4998	−0.0002	0.0000
	$β_{12}$	0.9003	0.0003	0.0001	0.8999	−0.0001	0.0000	0.9001	0.0001	0.0000
	$β_{20}$	2.0807	0.0807	0.1605	2.0052	0.0052	0.0545	2.0020	0.0020	0.0260
	$β_{21}$	1.5020	0.0020	0.1030	1.4899	−0.0101	0.0363	1.5012	0.0012	0.0179
	$β_{22}$	0.7880	−0.0120	0.0344	0.7899	−0.0101	0.0123	0.7964	−0.0036	0.0066
	$β_{30}$	0.2459	−0.0541	0.1369	0.3129	0.0129	0.0487	0.3051	0.0051	0.0240
$45 %$	$β_{10}$	3.0012	0.0012	0.0003	3.0019	0.0019	0.0001	3.0014	0.0014	0.0001
	$β_{11}$	2.4990	−0.0010	0.0003	2.4989	−0.0011	0.0001	2.4994	−0.0006	0.0000
	$β_{12}$	0.9002	0.0002	0.0001	0.9001	0.0001	0.0000	0.9000	−0.0000	0.0000
	$β_{20}$	2.1452	0.1452	0.2034	2.0272	0.0272	0.0874	2.0232	0.0232	0.0402
	$β_{21}$	1.4288	−0.0712	0.1559	1.4671	−0.0329	0.0537	1.4798	−0.0202	0.0287
	$β_{22}$	0.7598	−0.0402	0.0535	0.7766	−0.0234	0.0178	0.7863	−0.0137	0.0092
	$β_{30}$	0.2299	−0.0701	0.1503	0.3110	0.0110	0.0775	0.2950	−0.0050	0.0391

Table 2. CPs for the 95% confidence intervals from the simulated OLLW regression model.

	$0 %$			$10 %$			$30 %$
$n$	100	250	500	100	250	500	100	250	500
$β_{10}$	0.957	0.962	0.970	0.959	0.957	0.966	0.965	0.962	0.967
$β_{11}$	0.952	0.960	0.966	0.957	0.960	0.971	0.937	0.961	0.966
$β_{12}$	0.956	0.968	0.955	0.944	0.962	0.961	0.950	0.958	0.962
$β_{20}$	0.923	0.949	0.964	0.903	0.962	0.958	0.925	0.947	0.943
$β_{21}$	0.947	0.953	0.959	0.948	0.952	0.948	0.969	0.964	0.958
$β_{22}$	0.946	0.933	0.957	0.945	0.952	0.932	0.946	0.962	0.949
$β_{30}$	0.938	0.960	0.968	0.924	0.967	0.969	0.958	0.968	0.952

Table 3. Estimation results.

Model	$λ$	$α$	$τ$	GD	AIC	BIC
OLLW	20.6750	5.6523	0.3113	916.9	922.9	934.2
	(0.9375)	(1.4702)	(0.0880)
Weibull	22.5711	1.7510	1	924.8	928.8	936.3
	(1.4116)	(0.1326)

Table 4. Findings from the complete OLLW regression.

	MLEs	SEs	p-Values		MLEs	SEs	p-Values
$β_{10}$	4.3021	0.0754	<0.0001	$β_{20}$	1.7013	0.0410	<0.0001
$β_{11}$	−0.0974	0.0680	0.1531	$β_{21}$	0.0842	0.0553	0.1288
$β_{12}$	−0.0159	0.0012	<0.0001	$β_{22}$	−0.0055	0.0010	<0.0001
$β_{13}$	0.0772	0.0841	0.3596	$β_{23}$	0.1307	0.0946	0.1679
$β_{14}$	−0.4190	0.1335	0.0019	$β_{24}$	0.3603	0.1134	0.0016
$β_{15}$	−0.2443	0.0881	0.0059	$β_{25}$	−0.2896	0.1160	0.0131
$β_{16}$	−0.3546	0.1468	0.0163	$β_{26}$	−0.3880	0.1677	0.0213
$β_{17}$	−0.0277	0.1147	0.8094	$β_{27}$	0.5095	0.1904	0.0078
$log (τ)$	−0.6933	0.0263
AIC: 895.7497; BIC: 959.9171; GD: 861.7497

Table 5. Findings from the reduced OLLW regression model.

	MLEs	SEs	p-Values
$β_{10}$	4.0861	0.0644	<0.0001
$β_{12}$	−0.0130	0.0012	<0.0001
$β_{15}$	−0.2696	0.0818	0.0011
$β_{16}$	−0.3211	0.1625	0.0490
$β_{20}$	1.4304	0.0364	<0.0001
$β_{22}$	−0.0063	0.0009	<0.0001
$β_{26}$	−0.3622	0.1708	0.0347
$β_{27}$	0.5011	0.2023	0.0138
$log (τ)$	−0.3337	0.0269
AIC: 884.7161; BIC: 918.6871; GD: 866.7161

Table 6. Four selected patients.

Patient	Age	Diabetes Mellitus	Chronic Neurological Disease
A	50	Yes	Yes
B	50	Yes	No
C	50	No	Yes
D	50	No	No

Table 7. Probability of hospitalized patients surviving 20 days after the first symptom.

Age	30	60	90
Patient A	0.47	0.25	0.11
Patient B	0.73	0.43	0.17
Patient C	0.62	0.40	0.22
Patient D	0.85	0.62	0.35

Table 8. Median time for some ages.

Age	30	60	90
Patient A	19.15	12.55	8.17
Patient B	27.65	18.30	12.05
Patient C	25.08	16.43	10.70
Patient D	36.21	23.96	15.78

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Rodrigues, G.M.; Ortega, E.M.M.; Cordeiro, G.M.; Vila, R. An Extended Weibull Regression for Censored Data: Application for COVID-19 in Campinas, Brazil. Mathematics 2022, 10, 3644. https://doi.org/10.3390/math10193644

AMA Style

Rodrigues GM, Ortega EMM, Cordeiro GM, Vila R. An Extended Weibull Regression for Censored Data: Application for COVID-19 in Campinas, Brazil. Mathematics. 2022; 10(19):3644. https://doi.org/10.3390/math10193644

Chicago/Turabian Style

Rodrigues, Gabriela M., Edwin M. M. Ortega, Gauss M. Cordeiro, and Roberto Vila. 2022. "An Extended Weibull Regression for Censored Data: Application for COVID-19 in Campinas, Brazil" Mathematics 10, no. 19: 3644. https://doi.org/10.3390/math10193644

APA Style

Rodrigues, G. M., Ortega, E. M. M., Cordeiro, G. M., & Vila, R. (2022). An Extended Weibull Regression for Censored Data: Application for COVID-19 in Campinas, Brazil. Mathematics, 10(19), 3644. https://doi.org/10.3390/math10193644

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

An Extended Weibull Regression for Censored Data: Application for COVID-19 in Campinas, Brazil

Abstract

1. Introduction

2. New OLLW Properties

2.1. Modes

2.2. Stochastic Representation

2.3. Closure under Changes of Scale and of Power

2.4. Identifiability

2.5. Existence of Real Moments

2.6. Tail Behavior

3. The OLLW Regression Model

3.1. Checking Model

4. Simulation Study

5. Application to COVID-19 Data

Descriptive Analysis

6. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI