Next Article in Journal
Random Walks-Based Node Centralities to Attack Complex Networks
Previous Article in Journal
Fixed/Preassigned-Time Synchronization of Fully Quaternion-Valued Cohen–Grossberg Neural Networks with Generalized Time Delay
Previous Article in Special Issue
Sensitivity of Survival Analysis Metrics
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Expectation-Maximization Algorithm for the Weibull Proportional Hazard Model under Current Status Data

School of Mathematics and Statistics, Shandong University, Weihai 264209, China
*
Author to whom correspondence should be addressed.
Mathematics 2023, 11(23), 4826; https://doi.org/10.3390/math11234826
Submission received: 27 October 2023 / Revised: 22 November 2023 / Accepted: 27 November 2023 / Published: 29 November 2023
(This article belongs to the Special Issue Statistical Methods and Models for Survival Data Analysis)

Abstract

:
Due to the flexibility of the Weibull distribution and the proportional hazard (PH) model, Weibull PH is widely used in survival analysis under right censored data and interval censored data but it is seldom investigated under current status data, partially because there is less information in current status data than in right censored data and interval censored data. This paper considers the Weibull PH model under the current status data and introduces the Poisson latent variables to augment the data, then uses the expectation-maximization (EM) algorithm to obtain the maximum likelihood estimators of the model parameters. The EM algorithm is compared with the Newton–Raphson (NR) algorithm from several perspectives in the simulation studies, and the results show that the proposed method has several highlights, such as computational simplicity, improved convergence stability, and overall estimator results that are either comparable or slightly better in terms of bias. Furthermore, the performance of the Weibull PH model and the semi-parametric PH model is compared under two simulation scenarios, and two standard model selection criteria are used for model selection. The results indicate that the Weibull PH model has significant advantages when failure time follows a Weibull distribution. Lastly, the Weibull PH model along with EM algorithm is applied to lung tumor data and intraocular lens (IOL) calcification data with the aim of assessing the impact of covariates, including environmental factors and gender, on event timing and risk.

1. Introduction

The proportional hazard (PH) model proposed by Cox [1] has gradually gained popularity as one of the most widely used models in survival analysis. The PH model is a semi-parametric model consisting of two parts: the parametric part assuming that the explanatory variables have exponentially multiplicative effects on the hazard function of survival time, and the non-parametric part with an unspecified baseline hazard function. The two-parameter Weibull distribution [2] can have increasing, decreasing, or constant hazard function depending on the shape parameter; thus, it is flexible to describe hazard functions with different shapes. When the baseline hazard function is specified as the Weibull distribution, we obtain the Weibull PH model, which has wide applications in survival analysis and reliability analysis. For example, Alakuş [3] estimated confidence intervals for the survival function of the Weibull PH model with censored survival time data; Gong and Fang [4] investigated the performance of the Weibull PH model, exponential PH model, and 10-piece exponential PH model under interval censored data with different underlying data distributions and censoring patterns, and they advocated the use of a parametric PH model to analyze interval censored data; Sha and Pan [5] used the Weibull PH model to analysis the step-stress accelerated life testing data in a Bayesian framework; Nemati et al. [6] used the Weibull PH model to assess the impact of different factors on the failure rate of cables; Liu and Xie [7] estimated the parameters of the Weibull PH model for right censored data. Few studies on Weibull PH modeling concerned current status data.
In survival analysis, current status censoring also refers to Case I interval censoring, which is an extreme case of interval censoring. It happens if each subject is observed only once and the survival time of interest is known only to be either smaller or greater than the observation time. Sun [8] gave a detailed introduction to the estimation theories and applications for current status data. Current status data often appear along with covariates in cross-sectional studies and tumorigenicity experiments. Under current status censoring, the event times are left censored for events prior to the examination and are otherwise right censored. The main interest in such studies includes estimating the distribution of the lifetime and evaluating the effect of covariates on the lifetime or hazard of failure. A major challenge in the analysis of current status data is that information on lifetime for an individual is limited to the status of the event under consideration at a single monitoring time.
Compared to right censored data, current status data is less informative, and current statistical techniques for right censored data cannot be directly used to analyze current status data. So, it is necessary to develop new techniques to efficiently deal with these kinds of data. Recently, McMahan et al. [9] introduced a two-stage Poisson data augmentation method for estimating a PH model and proportional odds model with current status data. This approach was characterized by its simplicity and high effectiveness; furthermore, it provided closed-form variance estimators. Subsequently, numerous scholars applied this methodology to various models and data types. For example, Wang et al. [10] extended it to a PH model with interval censored data; Zhou et al. [11] discussed the problem of fitting a PH model to interval censored data with missing covariates; Withana et al. [12] analyzed a left-truncated arbitrarily censored data under PH model; Cui and Tee [13] analyzed the Bayesian additive PH model for the current status data. Most of the literature that used this Poisson data augmentation method to study the PH model estimated the cumulative baseline hazard function using splines, such as monotone spline, I-splines, and B-spline. However, it is more reasonable to utilize the corresponding parametric model when the baseline hazard function can be approximated by a flexible parameter distribution.
In this paper, we focus on the Weibull PH model under current status data with the baseline hazard function characterized by the popular Weibull distribution, and discuss the maximum likelihood estimators (MLE) of the model parameters. Instead of directly maximizing the observed likelihood function with the Newton–Raphson (NR) method, we propose a one-step Poisson data augmentation expectation-maximization (EM) algorithm inspired by McMahan et al. [9]. This methodology introduces Poisson latent variables to augment the data; thus, it establishes a missing data structure and produces a simpler complete likelihood. The procedure simplifies the likelihood structure and improves the computation efficiency as well as the convergence speed of the algorithm. Simultaneously, the variance estimators can be provided in closed form by Louis’ method [14]. We evaluate the performance of the EM algorithm through simulation studies involving various sample sizes and censoring ratios, and compare the results with the ones from the NR method. In summary, the EM method in this study yields estimator results comparable to those of the NR method, with the advantage of simplicity and computational convenience. Furthermore, it guarantees that the scale parameter remains positive, contributing to slightly improved convergence. In addition, we compare the Weibull PH model with the I-spline-based semi-parametric PH model through parameter estimation and model selection. Finally, we apply the Weibull PH model with EM algorithm to analyze lung tumor experimental data and IOL calcification data, and obtain some interesting findings.

2. The Proposed Method

2.1. The Weibull PH Model

The PH model is one of the most popular models in survival analysis, with a multiplicative effect of its covariates on the hazard function λ ( t , x ) with
λ t ; x = λ 0 t exp x β ,
where λ 0 ( t ) is the baseline hazard function, x = x 1 , , x p , and β = β 1 , , β p . When the baseline hazard function is unknown, it is often estimated by splines. However, when its distribution is known, the use of the corresponding specific parameter model makes the results more precise. The two-parameter Weibull model is a popular parametric model in survival analysis with the flexible hazard function
λ 0 t = λ γ λ t γ 1 ,
where scale parameter λ > 0 and shape parameter γ > 0 . Obviously, this hazard function is monotone decreasing for γ < 1 and increasing for γ < 1 , and it degenerates to an exponential distribution with a scale parameter λ when γ = 1 . In this paper, we assume that the baseline hazard function comes from the two-parameter Weibull distribution which is given by (1), and discuss the Weibull PH model
λ t ; x = λ γ λ t γ 1 exp x β .
The distribution function and the survival function of the failure time T have the following forms
F t ; x = 1 exp λ t γ exp x β ,
and
S t ; x = exp λ t γ exp x β .
Assume that there are n independently observed samples. Let T i , C i and x i denote the failure time, censoring time, and covariate for subject i, respectively. In current status data, the failure time T i is not directly observable; we only know whether T i is larger than the observation time C i . Let δ i = I T i C i denote the censoring indicator, where I ( · ) is the indicator function. When δ i = 1 , it indicates that the failure time is smaller than the observation time, which is left censored. Conversely, it is right censored in the case δ i = 0 . Let D = D i = C i , δ i , x i , i = 1 , , n denote all observed data and θ = β , γ , λ denote all parameters. Then, the observed data likelihood is given by
L o b s θ | D = i = 1 n F C i ; x i δ i 1 F C i ; x i 1 δ i = i = 1 n 1 exp λ C i γ exp x i β δ i × exp λ C i γ exp x i β 1 δ i .
Correspondingly, the observed data log-likelihood is given by
log L o b s θ | D = i = 1 n δ i log 1 exp λ C i γ exp x i β λ C i γ exp x i β 1 δ i .
The NR method obtains the MLEs of the parameters by directly maximizing the above equation, and the corresponding derivation procedure is shown in Appendix A. It can be seen that the method is complicated in the calculation process, the expression is tedious, and there is no closed-form solution for all three parameters.

2.2. Data Augmentation

Motivated by McMahan et al. [9] and Wang et al. [10], we introduce Poisson latent variables to augment the data and obtain the complete data likelihood by considering the relationship between the PH model and the non-homogeneous Poisson process. According to the idea of Wang et al. [10], consider a non-homogenous Poisson process N ( t ) with mean parameter Λ 0 ( t ) exp ( x β ) , where Λ 0 ( t ) is the cumulative baseline hazard function, and in the Weibull situation, Λ 0 ( t ) = ( λ t ) γ . The lifetime T can be viewed as the time of the first jump of this process, i.e., T = inf { t : N ( t ) > 0 } ; then, the survival function of T is
S ( t ; x ) = P ( T > t ) = P ( N ( t ) = 0 ) = exp { Λ 0 ( t ) exp ( x β ) } = 1 F ( t ; x ) ;
thus, T follows the PH model. This equivalence between the Poisson process and PH model makes the augmentation of the likelihood feasible. In the Weibull PH model, let N i ( t ) denote the latent Poisson process for subject i with mean parameter ( λ C i ) γ exp ( x i β ) , and lifetime T i = inf { t : N i ( t ) > 0 } . We introduce latent variable Z i = N i ( C i ) , then
P ( Z i = 0 ) = P ( N ( C i ) = 0 ) = P ( T i > C i ) = exp { ( λ C i ) γ exp ( x i β ) } = S ( C i ; x i ) ,
and
P ( Z i > 0 ) = P ( N ( C i ) > 0 ) = P ( T i C i ) = 1 exp { ( λ C i ) γ exp ( x i β ) } = F ( C i ; x i ) ;
therefore, Z i follows the Poisson distribution with mean ( λ C i ) γ exp ( x i β ) , that is, Z i Poisson ( ( λ C i ) γ exp ( x i β ) ) , and as a byproduct, we have δ i = I ( Z i > 0 ) .
Based on the latent variable Z = ( Z 1 , , Z n ) , the augmented likelihood has the following form
L * θ | D = i = 1 n δ i I Z i > 0 1 δ i I Z i = 0 × λ C i γ exp x i β Z i exp λ C i γ exp x i β / Z i ! .
Its corresponding log-likelihood is
log L * θ | D = i = 1 n Z i γ log ( λ C i ) + x i β λ C i γ exp x i β log Z i ! .
According to the augmented likelihood, the conditional distribution of Z i can be obtained as
P ( Z i = z i | D , θ ) = 0 , δ i = 0 , λ C i γ exp x i β z i exp λ C i γ exp x i β z i ! , δ i = 1 .
Then, the conditional expectation and conditional variance of Z i can be easily calculated, which are given by
E Z i | θ , D = λ C i γ exp x i β δ i 1 exp { λ C i γ exp x i β } ,
and
Var Z i | θ , D = E Z i | θ , D E Z i | θ , D 2 exp { λ C i γ exp x i β } .

2.3. EM Algorithm

The EM algorithm is an iterative algorithm used to obtain MLEs of model parameters when the model has a missing data structure [15]. The EM algorithm in this paper operates on the augmented likelihood by treating it as the complete data likelihood, where Z is viewed as missing data. The E-step is to compute the expectation of log L * with respect to the conditional distribution of Z given the current iteration values and the observed data. According to Equation (3), the expression for the Q function can be obtained as
Q θ | θ d , D = E log L * θ | θ d , D = i = 1 n u i d γ log ( λ C i ) + x i β λ C i γ exp x i β E log Z i ! ,
where u i d = E Z i | θ d , D is obtained by (6) with θ replaced by θ ( d ) .
The M-step seeks to find θ ( d + 1 ) = arg max θ Q θ | θ d , D . For this purpose, we need to derive the first-order partial derivatives of Q θ | θ d , D with respect to all components of  θ .
Q λ = i = 1 n u i d γ λ γ λ γ 1 C i γ exp x i β , Q γ = i = 1 n u i d log ( λ C i ) λ C i γ log ( λ C i ) exp x i β , Q β = i = 1 n u i d x i λ C i γ exp x i β x i .
Setting these derivatives equal to zero, we can obtain
λ = i = 1 n u i d i = 1 n C i γ d exp x i β d 1 γ d ,
i = 1 n u i d log ( λ C i ) = i = 1 n λ C i γ log ( λ C i ) exp x i β d ,
i = 1 n u i d x i = i = 1 n λ C i γ exp x i β x i .
These equations are considerably simple and easy to solve compared to Appendix A. Specifically, Equation (7) provides an explicit solution for λ and guarantees that λ is positive, while the NR method has no explicit solution for any of the three parameters. Then, we summarize the EM algorithm as follows:
Step 1: 
Initialize θ d = β d , γ d , λ d for d = 0 .
Step 2: 
Calculate λ d + 1 = i = 1 n u i d i = 1 n C i γ d exp x i β d 1 γ d .
Step 3: 
Based on the λ ( d + 1 ) , calculate γ ( d + 1 ) by solving the Equation (8) via NR method.
Step 4: 
Based on λ ( d + 1 ) , γ ( d + 1 ) and NR method, calculate β d + 1 according to the Equation (9), and update d = d + 1 .
Step 5: 
Repeat Steps 2–4 until convergence.
Denote θ ^ as the final convergence value, that is, the MLE of θ .

2.4. Asymptotic Variances and Covariance

According to the asymptotic normality of the MLE, we have θ ^ N 0 , I 1 ( θ ^ ) , where I ( θ ) is the information matrix of the observed likelihood, and Appendix A gives the calculation of I ( θ ) . Within the EM framework, Louis’ method [14] gives a closed-form expression for the observed information matrix in a simple and straightforward way, that is,
I ( θ ) = 2 Q ( θ | θ ^ , D ) θ θ Cov log L * θ θ ,
where Cov log L * θ θ represents Cov log L * θ θ D , θ . The second-order derivatives of Q-function with respect to θ are a symmetric matrix with upper triangle components given by
2 Q 2 λ = i = 1 n u ^ i γ λ 2 + γ γ 1 λ γ 2 C i γ exp x i β , 2 Q 2 γ = i = 1 n λ C i γ log ( λ C i ) 2 exp x i β , 2 Q β β = i = 1 n λ C i γ exp x i β x i x i , 2 Q λ γ = i = 1 n u ^ i 1 λ λ γ 1 C i γ 1 + γ log ( λ C i ) exp x i β , 2 Q λ β = i = 1 n γ λ γ 1 C i γ exp x i β x i , 2 Q γ β = i = 1 n λ C i γ log ( λ C i ) exp x i β x i ,
where u ^ i = λ ^ C i γ ^ exp x i β ^ δ i 1 exp { λ ^ C i γ ^ exp x i β ^ } . To this end, consider the conditional covariance matrix Cov log L * θ θ , whose upper triangle components are given by
Var log L * θ λ = γ λ 2 i = 1 n Var Z i , Var log L * θ γ = i = 1 n Var Z i log ( λ C i ) 2 , Cov log L * θ β = i = 1 n Var Z i x i x i , Cov log L * θ λ , log L * θ γ = γ λ i = 1 n Var Z i log ( λ C i ) , Cov log L * θ λ , log L * θ β = γ λ i = 1 n Var Z i x i , Cov log L * θ γ , log L * θ β = i = 1 n Var Z i x i log ( λ C i ) ,
where Var Z i denotes Var Z i | θ , D given by (6). Apparently, the EM algorithm is simpler and has a more concise expression regarding the computation of the covariance matrix than the NR method in Appendix A.

3. Simulation Study

In this section, a series of simulations are used to evaluate the performance of the proposed method. We compare the proposed EM method with the NR method and compare the Weibull PH model with the semi-parametric PH model.

3.1. Simulation Study I: Compare the Proposed EM Method with the NR Method

This study considers that the failure time T i is generated from the following Weibull PH model
F t ; x i = 1 exp λ t γ exp x i 1 β 1 + x i 2 β 2 ,
where x i = x i 1 , x i 2 , x i 1 Bernoulli 0.5 , x i 2 N 0 , 0 . 5 2 , i = 1 , , n , λ = 3 , γ = 2 .
In the simulation, the sample sizes are set to be n = 30 , 60 , 100, 200 and 400. Both β 1 and β 2 can be taken as 0.5 , 0 and 0.5 , totaling nine parameter combinations, and the left censoring ratio P is set as 0.3 , 0.4 , and 0.5 , respectively. The failure time T i can be obtained by solving F t ; x i = u i , where u i U 0 , 1 , and the censoring time C i obeys U 0 , a , where a is calculated according to the censoring ratio.
Each simulation is repeated 1000 times, with an initial value θ 0 = β 0 , λ 0 , γ 0 = 1 2 , 1 , 1 , where 1 2 = ( 1 , 1 ) . Then, the algorithm is recognized to have reached convergence when max | θ d + 1 θ d | < 0.005 . The bias (Bias), the estimated standard errors (ESE), the standard deviation (SD), and the coverage probabilities (CP) are used as indicators of the goodness of the estimators, where the Bias is the absolute value of the difference between the estimated value and the true value, the ESE is the average of the estimated standard errors obtained by squaring the diagonal elements of the asymptotic covariance matrix computed by Louis’ method, the SD is the standard deviation of the 1000 estimates of β , and the CP is the proportion of the true value that falls into the 95 % confidence interval. Here, we compare the results of the proposed EM method with the ones based on the NR algorithm. All of the above simulations are implemented in R [16]. Table 1, Table 2, Table 3, Table 4 and Table 5 are the estimate results, from which we can draw the following conclusion:
1.
For small sample sizes, such as n = 30 , 60 and 100, both the Bias and ESE of the EM method are generally smaller than those of the NR method. For example, Table 2 presents the estimation results for n = 60 . Among 54 cases, the EM method exhibits smaller biases in 35 cases, and a majority of the ESEs are also smaller than those of the NR method. Similar trends are observed in Table 1 and Table 3.
2.
When the sample size is medium or large, such as n = 200 and 400, the EM method still maintains an advantage in terms of the Bias, with over half of the 54 cases exhibiting smaller Bias. However, when considering ESEs, the EM method performs unsatisfactorily. For instance, when n = 200 , the EM method produces 53.7 % smaller Bias and only 37.0 % smaller ESE.
3.
With the increase in sample size, the Bias and ESE from both methods diminish towards zero, indicating that the performance of both methods improves with larger sample sizes, and the estimates can converge to the true values.
4.
The estimated standard errors and sample standard deviations decrease and approach each other as the sample size increases, suggesting that the asymptotic covariance matrix obtained by Louis’ method performs well for finite sample sizes.
In summary, the estimate results of the EM method are comparable to those of the NR method, and even have an advantage when the sample size is small. In terms of bias, the EM method consistently outperforms the NR method; in terms of SD and ESE, the EM method performs better when the sample size is small, and the NR method slightly outperforms when the sample size is large; in terms of CP, both methods perform well, essentially around 95 % , and the censoring ratio has no impact on the performance of the two methods. With the increase in sample size, the estimation performance of both methods improves.
Remark 1. 
It should be noted that when the small sample size is small, both the EM algorithm and the NR algorithm can sometimes fail to converge, which is similar to the case in Balakrishnan and Mitra [17], and in this case, the simulation data are excluded from the experiment. The EM algorithm fails to converge much less frequently than the NR approach. This is partially due to the fact that the scale parameter λ in the EM algorithm has an analytical solution and is guaranteed to be positive, whereas the NR method relies on numerical computation to solve for all three parameters. When the sample size is large, none of the methods fail to converge.

3.2. Simulation Study II: Compare the Weibull PH Model with the Semi-Parametric PH Model

In this simulation, we compare the Weibull PH model with a semi-parametric PH model proposed by McMahan et al. [9], who utilize a spline-based EM method for estimating the PH model under the current status data. This semi-parametric PH model can be implemented using the R package ICsurv [18]. To ensure a comprehensive comparison, we consider two scenarios:
1.
The failure time T i follows a Weibull distribution, with
F t ; x i = 1 exp λ t γ exp x i 1 β 1 + x i 2 β 2 ,
where λ = 1 , γ = 2 .
2.
Referring to the simulation setting in McMahan et al. [9], the failure time T i follows a non-Weibull distribution, with
F t ; x i = 1 exp Λ 0 ( t ) exp x i 1 β 1 + x i 2 β 2 ,
where Λ 0 ( t ) = log ( 1 + t ) + t 3 / 2 .
In the above equation, x i = x i 1 , x i 2 , x i 1 Bernoulli 0.5 , x i 2 N 0 , 0 . 5 2 , i = 1 , , n . Regression coefficients β 1 and β 2 can be taken as 0.5 , 0 and 0.5 , totaling nine parameter combinations, and the sample size n is specified as 200. The failure time T i can be obtained by solving F t ; x i = u i , where u i U 0 , 1 , and the censoring time C i obeys a truncated exponential distribution Exp ( 1 ) with support ( 0 , 2 ) .
Each simulation is repeated 500 times, with an initial value of θ 0 = β 0 , λ 0 , γ 0 = 1 2 , 1 , 1 for the Weibull PH model, and θ 0 = β 0 , γ 0 = 1 2 , 1 6 for the semi-parametric PH model. In McMahan et al. [9], the I-spline with suitable degree is used to control the smoothness of the hazard function. For comparison, we consider the degree to be 1, 2, and 3, corresponding to the linear, quadratic, and cubic basis function, respectively, and the number of knots is set as 5. The algorithm is recognized to have reached convergence when max | θ d + 1 θ d | < 0.005 . All evaluation criteria are consistent with the Simulation study I. Table 6 and Table 7 show the results.
Table 6 presents the results when the failure times follow the Weibull distribution. In this case, the Weibull PH model performs overwhelmingly superior to the semi-parametric model, with smaller Bias and ESE in the majority of the 54 cases. For instance, when the degree is 2, the proportion with smaller biases from the Weibull PH model is 88.9 % , and almost all ESEs are smaller than those from the semi-parametric PH model. Meanwhile, in the non-Weibull situation, Table 7 indicates the superiority of the semi-parametric PH model. This finding is natural and consistent with expectations.
Simultaneously, we utilize Akaike’s information criterion (AIC) and the Bayesian information criterion (BIC) for model selection and the results are shown in Table 8 and Table 9. Clearly, the Weibull PH model produces smaller AIC and BIC values in all situations, indicating that the Weibull PH model is superior to the semi-parametric PH model in some sense. This may be explained by the fact that the AIC and BIC values prefer a concise model with fewer parameters, while the semi-parametric PH model introduces several splines to describe the smoothness of the unspecified baseline hazard function, which increases the complexity of the model.
Remark 2. 
The function provided by the R package ICsurv [18] sometimes converges too slowly; therefore, in this paper, we add a restriction of 1000 iterations to the simulations using this function.

4. Real Data Analysis

In this section, we use the Weibull PH model to fit two real data, and use the proposed EM method to estimate all unknown parameters.

4.1. Lung Tumor Data

Lung tumor data were obtained by Hoel and Walberg [19] in 1972 from 144 male RFM mice that were examined in a tumorigenicity experiment on lung tumors. The tumorigenicity experiment focused on whether drugs or the environment accelerate the time to tumor onset. In this experiment, the time to tumor onset T i was of interest, but lung tumors were primarily non-lethal and insidious, meaning that the onset of the tumor did not affect the survival of the mice, so the time to tumor onset was not directly observable. Instead, we were only able to observe the time of death and the presence or absence of lung tumors when the mice died. The experiment recorded the death time C i of mice in days, and used indicator variable δ i = I T i C i to note the presence (1) or absence (0) of lung tumors for each subject at the death time.
The experiment placed the mice in two different environments, a conventional environment (CE) and a germ-free environment (GE). There were 96 rats in the CE, and 27 had lung tumors at the time of death. Similarly, there were 48 rats in GE and 35 with lung tumors. The right censoring rate of the tumor onset was approximately 43.1 % . Detailed variable descriptions are shown in Table 10.
In order to estimate the impact of environmental factors on tumor development, we consider the Weibull PH model and use the EM algorithm to estimate the parameters:
λ ^ = 0.001 , γ ^ = 1.988 , β ^ = 0.803 .
Based on Louis’ method, we can easily calculate that the estimated standard deviation of β ^ is 0.247. This results in a p-value of 0.0012 for the comparison of the two groups, meaning that the incidence rates of lung tumors in the two different environments differ significantly. The corresponding hazard function and survival function are
λ t , x = 0.001988 t / 100 0.988 exp 0.803 x ,
and
S t , x = exp t / 100 1.988 exp 0.803 x .
The baseline hazard function λ 0 t = λ γ λ t γ 1 increases with time since γ ^ = 1.988 , indicating that the risk of cancer in mice increases over time. Furthermore, the risk ratio of lung tumors in mice under GE and CE is
λ t , x = 1 λ t , x = 0 = exp 0.803 .
This suggests that the risk of tumor growth in mice in the GE is exp 0.803 2.232 times higher than that in the CE. Sun [8] discussed the non-parametric maximum likelihood estimation (NPMLE) of the survival function based on current status data. Figure 1 presents the estimators of the survival functions for the EM method and the NPMLE method.
It is noteworthy that the estimator of the survival function from the Weibull PH model is very close to that from the NPMLE. Both the EM method and NPMLE method suggest that lung tumors tend to develop earlier in the CE compared to the GE. However, overall, mice in the GE appear to have a higher risk of developing lung tumors than mice in the CE. At the same time, this indicates that the Weibull PH model is appropriate to fit these data. In addition, the EM algorithm can effectively estimate model parameters in the Weibull PH model, which provides a continuous expression for the survival function, enabling the estimation of survival probabilities at any given time. In contrast, the NPMLE method is limited to approximating survival probabilities only at the endpoints of the observation interval.
Furthermore, we compare the Weibull PH model with the semi-parametric PH model, and the results are presented in Table 11. There are slight differences in the regression coefficient estimates from the two methods. Moreover, the Weibull PH model demonstrates smaller variances, and smaller AIC and BIC values compared to the semi-parametric PH model. These findings suggest that the Weibull PH model may be more suitable for modeling the lung tumor data.

4.2. The Data of IOL Calcification

The calcification of hydrogel intraocular lenses (IOL) [20,21] is a rare complication of cataract treatment. At examination time T i for each subject i, an ophthalmologist classified the severity of a patient’s IOL calcification into four grades. We simply consider mild or serious calcification as calcified ( δ i = 1 ) , and no or little calcification as not calcified ( δ i = 0 ) . There are 379 samples, including 142 males and 237 females. The variables are described in Table 12.
We aim to assess the effect of the gender factor on IOL calcification. Assuming that the failure time T i follows the Weibull PH model, we use the proposed EM algorithm to obtain the corresponding parameter estimates as follows:
λ ^ = 0.005 , γ ^ = 0.642 , β ^ = 0.269 .
Meanwhile, the estimated standard error of β ^ is 0.312 , calculated by Louis’ method. This results in a p-value of 0.388 for testing β = 0 according to the asymptotic normality of the MLE, indicating that there is no significant difference between males and females in terms of the time to calcification. The estimator γ ^ = 0.642 suggests that the risk of IOL calcification decreases with time. Although the gender is not significant, it appears that the risk of IOL calcification in males is roughly exp 0.269 0.76 times that of females based on the risk ratio. Figure 2 shows the estimators of the survival functions for the IOL calcification data by the EM method and the NPMLE method [8], respectively. Figure 2 suggests that calcification tends to occur earlier in males, but overall, the risk of calcification is slightly higher in females than in males. In addition, the estimators of the survival function from the Weibull PH model for the IOL data are close to those from the NPMLE, suggesting the suitability of the Weibull PH model for describing the IOL data.
Table 13 summarizes the comparison results of the Weibull PH model and semi-parametric PH models with varying degrees. The estimates of β of these models are relatively close, but the Weibull PH model provides smaller variances, and smaller AIC and BIC values, indicating that it is better suited for this dataset.

5. Conclusions

The Weibull PH model is a commonly used parametric model in fields like reliability analysis and survival analysis. In this paper, we consider the Weibull PH model under current status data. A Poisson-based EM algorithm is developed to estimate the parameters, where the M-step of the EM algorithm is realized by the NR method. Louis’ method is used to estimate the asymptotic variance–covariance matrix, whose diagonal elements are estimators of the variance. Then, the proposed EM method is compared with the NR method through simulations. On the whole, the estimator results of the two methods are close; the bias of the EM algorithm is consistently smaller than that of the NR method, especially when the sample size is small, while the variance of EM is smaller in small samples but larger in larger sample sizes. In addition, the proportion that the true parameter value falls in the 95 % confidence intervals is close for both methods. Moreover, the EM algorithm offers several advantages. It simplifies the process of parameter estimation and covariance matrix estimation. Additionally, the EM algorithm ensures that the critical scale parameter remains positive throughout the estimator process, which is a crucial requirement in the Weibull PH model. The comparison between the Weibull PH model and the semi-parametric PH model reveals that the Weibull PH model demonstrates significant advantages when the failure time follows a Weibull distribution. This is also supported by the smaller values of AIC and BIC for the Weibull PH model. The method and model of this paper are applied to two practical examples. In the lung tumor experiment, the results indicate that the risk of lung tumor increases over time, and the environment significantly influences tumor growth. Specifically, mice in the GE exhibit a higher risk of developing lung tumors compared to those in the CE. In the IOL calcification data, the risk of calcification decreases over time, and gender has no impact on the risk of IOL calcification.
The discussion in this paper is based on current status data; the EM algorithm of the Weibull PH model can be extended to analysis of general interval censored data. In addition, our future research plans include exploring Bayesian analysis for the Weibull PH model.

Author Contributions

Methodology, F.Y., S.C.; Supervision, F.Y.; Writing—original draft, S.C.; Writing—review and editing, F.Y. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the National Science Foundation of Shandong province of China under grant ZR2019MA026.

Data Availability Statement

All datasets in the real data analysis are from Sun [8].

Conflicts of Interest

The authors declare no conflict of interest.

Abbreviations

The following abbreviations are used in this manuscript:
PHProportional hazard model
EMExpectation-maximization algorithm
NRNewton–Raphson method
MLEMaximum likelihood estimator
AICAkaike’s information criterion
BICBayesian information criterion
GEGerm-free environment
CEConventional environment
IOLIntraocular lenses
NPMLENon-parametric maximum likelihood estimation

Appendix A

The NR method is a direct maximization of the observed log-likelihood to obtain MLEs of the parameters. Given the value of θ ( d ) , the estimator for the θ ( d + 1 ) is given by
θ ( d + 1 ) = θ ( d ) 2 log L o b s θ θ θ | θ = θ ( d ) 1 log L o b s θ θ | θ = θ ( d ) ,
where the first-order derivatives with respect to all components of θ are given by
log L o b s θ λ = i = 1 n γ λ γ 1 C i γ exp x i β δ i 1 exp λ C i γ exp x i β 1 , log L o b s θ γ = i = 1 n λ C i γ log ( λ C i ) exp x i β δ i 1 exp λ C i γ exp x i β 1 , log L o b s θ β = i = 1 n λ C i γ exp x i β x i δ i 1 exp λ C i γ exp x i β 1 .
And the second-order derivatives of log L o b s θ with respect to θ are
2 log L o b s θ 2 λ = i = 1 n { γ γ 1 λ γ 2 C i γ exp x i β δ i 1 exp λ C i γ exp x i β 1 γ λ γ 1 C i γ exp x i β 2 δ i exp λ C i γ exp x i β 1 exp λ C i γ exp x i β 2 } , 2 log L o b s θ 2 γ = i = 1 n { λ C i γ log ( λ C i ) 2 exp x i β δ i 1 exp λ C i γ exp x i β 1 λ C i γ log ( λ C i ) exp x i β 2 δ i exp λ C i γ exp x i β 1 exp λ C i γ exp x i β 2 } , 2 log L o b s θ β β = i = 1 n { λ C i γ exp x i β x i x i δ i 1 exp λ C i γ exp x i β 1 λ C i γ exp x i β 2 x i x i δ i exp λ C i γ exp x i β 1 exp λ C i γ exp x i β 2 } , 2 log L o b s θ 2 λ γ = i = 1 n { λ γ 1 C i γ exp x i β δ i 1 exp λ C i γ exp x i β 1 + γ λ γ 1 C i γ log ( λ C i ) exp x i β δ i 1 exp λ C i γ exp x i β 1 γ λ γ 1 C i γ exp x i β λ C i γ log ( λ C i ) exp x i β δ i exp λ C i γ exp x i β 1 exp λ C i γ exp x i β 2 } , 2 log L o b s θ λ β = i = 1 n { γ λ γ 1 C i γ exp x i β x i δ i 1 exp λ C i γ exp x i β 1 γ λ γ 1 C i γ exp x i β λ C i γ exp x i β x i δ i exp λ C i γ exp x i β 1 exp λ C i γ exp x i β 2 } , 2 log L o b s θ γ β = i = 1 n { λ C i γ log ( λ C i ) exp x i β x i δ i 1 exp λ C i γ exp x i β 1 λ C i γ log ( λ C i ) exp x i β λ C i γ exp x i β x i δ i exp λ C i γ exp x i β 1 exp λ C i γ exp x i β 2 } .
Let θ ^ denote MLEs of the NR method. Then, the covariance matrix of θ ^ can be estimated by the inverse of the observation information matrix, that is,
Cov θ ^ ^ = I θ ^ 1 = 2 log L o b s θ 2 λ 2 log L o b s θ λ γ 2 log L o b s θ λ β 2 log L o b s θ λ γ 2 log L o b s θ 2 γ 2 log L o b s θ γ β 2 log L o b s θ λ β 2 log L o b s θ γ β 2 log L o b s θ β β θ = θ ^ 1 .

References

  1. Cox, D.R. Regression models and life-tables. J. R. Stat. Soc. Ser. Stat. Methodol. 1972, 34, 187–202. [Google Scholar] [CrossRef]
  2. Weibull, W. A statistical distribution function of wide applicability. J. Appl. Mech. 1951, 18, 293–297. [Google Scholar] [CrossRef]
  3. Alakuş, K. Confidence intervals estimation for survival function in Weibull proportional hazards regression based on censored survival time data. Sci. Res. Essays 2010, 5, 1589–1594. [Google Scholar]
  4. Gong, Q.; Fang, L. Comparison of different parametric proportional hazards models for interval-censored data: A simulation study. Contemp. Clin. Trials 2013, 36, 276–283. [Google Scholar] [CrossRef]
  5. Sha, N.; Pan, R. Bayesian analysis for step-stress accelerated life testing using Weibull proportional hazard model. Stat. Pap. 2014, 55, 715–726. [Google Scholar] [CrossRef]
  6. Nemati, H.M.; Sant’Anna, A.; Nowaczyk, S.; Jürgensen, J.H.; Hilber, P. Reliability evaluation of power cables considering the restoration characteristic. Int. J. Electr. Power Energy Syst. 2019, 105, 622–631. [Google Scholar] [CrossRef]
  7. Liu, K.; Xie, T. Parameter estimation of Weibull distribution model with covariate under right censored data. J. Syst. Sci. Math. Sci. 2022, 42, 2497–2507. [Google Scholar]
  8. Sun, J. The Statistical Analysis of Interval-Censored Failure Time Data; Springer: New York, NY, USA, 2006. [Google Scholar]
  9. McMahan, C.S.; Wang, L.; Tebbs, J.M. Regression analysis for current status data using the EM algorithm. Stat. Med. 2013, 32, 4452–4466. [Google Scholar] [CrossRef] [PubMed]
  10. Wang, L.; McMahan, C.S.; Hudgens, M.G.; Qureshi, Z.P. A flexible, computationally efficient method for fitting the proportional hazards model to interval-censored data. Biometrics 2016, 72, 222–231. [Google Scholar] [CrossRef]
  11. Zhou, R.; Li, H.; Sun, J.; Tang, N. A new approach to estimation of the proportional hazards model based on interval-censored data with missing covariates. Lifetime Data Anal. 2022, 28, 335–355. [Google Scholar] [CrossRef] [PubMed]
  12. Withana Gamage, P.W.; McMahan, C.S.; Wang, L. A flexible parametric approach for analyzing arbitrarily censored data that are potentially subject to left truncation under the proportional hazards model. Lifetime Data Anal. 2023, 29, 188–212. [Google Scholar] [CrossRef]
  13. Cui, D.; Tee, C. The expectation–maximization approach for Bayesian additive Cox regression with current status data. J. Korean Stat. Soc. 2023, 52, 361–381. [Google Scholar] [CrossRef]
  14. Louis, T.A. Finding the observed information matrix when using the EM algorithm. J. R. Stat. Soc. Ser. Stat. Methodol. 1982, 44, 226–233. [Google Scholar] [CrossRef]
  15. Dempster, A.P.; Laird, N.M.; Rubin, D.B. Maximum likelihood from incomplete data via the EM algorithm. J. R. Stat. Soc. Ser. Stat. Methodol. 1977, 39, 1–22. [Google Scholar] [CrossRef]
  16. R Core Team. R: A Language and Environment for Statistical Computing; R Foundation for Statistical Computing: Vienna, Austria, 2022; Available online: https://www.R-project.org/ (accessed on 31 October 2022).
  17. Balakrishnan, N.; Mitra, D. Left truncated and right censored Weibull data and likelihood inference with an illustration. Comput. Stat. Data Anal. 2012, 56, 4011–4025. [Google Scholar] [CrossRef]
  18. McMahan, C.S.; Wang, L. ICsurv: Semiparametric Regression Analysis of Interval-Censored Data. R Package Version 1.0.1. Available online: http://CRAN.R-project.org/package=ICsurv (accessed on 22 June 2022).
  19. Hoel, D.G.; Walburg, H., Jr. Statistical analysis of survival experiments. J. Natl. Cancer Inst. 1972, 49, 361–372. [Google Scholar] [PubMed]
  20. Yu, A.K.F.; Kwan, K.Y.W.; Chan, D.H.Y.; Fong, D.Y.T. Clinical features of 46 eyes with calcified hydrogel intraocular lenses. J. Cataract. Refract. Surg. 2001, 27, 1596–1606. [Google Scholar] [CrossRef] [PubMed]
  21. Xue, H.; Lam, K.; Li, G. Sieve maximum likelihood estimator for semiparametric regression models with current status data. J. Am. Stat. Assoc. 2004, 99, 346–356. [Google Scholar] [CrossRef]
Figure 1. Estimates of survival functions of time for lung onset tumor by EM method and NPMLE method.
Figure 1. Estimates of survival functions of time for lung onset tumor by EM method and NPMLE method.
Mathematics 11 04826 g001
Figure 2. Estimates of survival functions of time for IOL calcification by EM method and NPMLE method.
Figure 2. Estimates of survival functions of time for IOL calcification by EM method and NPMLE method.
Mathematics 11 04826 g002
Table 1. Comparison of the EM method and the NR method with different censoring ratios at n = 30 .
Table 1. Comparison of the EM method and the NR method with different censoring ratios at n = 30 .
EMNR
BiasESESDCPBiasESESDCP
P = 0.3 β 1 −0.50.08301.84402.23090.9590.08361.91072.45150.968
β 2 −0.50.04841.00891.09130.9690.03281.23681.26330.970
β 1 −0.50.20921.90402.35070.9610.21471.96592.52520.969
β 2 00.02060.92891.05850.9620.02621.01421.13420.971
β 1 −0.50.17911.86372.24180.9750.21321.98502.58680.981
β 2 0.50.18051.06941.13450.9660.21781.55431.35740.970
β 1 00.04291.77082.12920.9660.03921.82992.31990.973
β 2 −0.50.08000.92981.07560.9680.08490.99391.19960.972
β 1 00.00281.83642.24900.9610.02801.91822.51700.973
β 2 00.00070.86480.97200.9630.00110.89161.03090.971
β 1 00.01021.87392.27620.9700.00481.92602.41670.975
β 2 0.50.20840.97391.09880.9660.22331.00441.17070.971
β 1 0.50.24921.85622.25180.9660.24331.90332.34630.972
β 2 −0.50.16100.92481.07190.9610.15840.96281.12880.970
β 1 0.50.13541.86492.31460.9540.14461.91872.46160.964
β 2 00.01220.87760.96510.9590.00950.91471.19970.962
β 1 0.50.24101.92572.31180.9620.28212.01362.60290.969
β 2 0.50.26490.98641.09960.9740.29841.07831.23270.981
P = 0.4 β 1 −0.50.17291.70202.08790.9630.22701.82822.50460.977
β 2 −0.50.18130.84500.97670.9560.22661.06211.30280.972
β 1 −0.50.28321.71862.06280.9640.33931.83982.31690.973
β 2 00.02390.80610.94610.9540.04200.86961.08210.959
β 1 −0.50.11041.72002.03670.9600.12621.81372.24800.973
β 2 0.50.15070.83180.93810.9700.18570.87191.04660.977
β 1 00.10231.67091.96920.9570.13061.78742.24890.973
β 2 −0.50.12770.82590.98910.9560.15440.91961.16360.970
β 1 00.04741.66522.00790.9540.09631.81912.50940.971
β 2 00.03930.79700.90400.9690.05220.86611.09230.976
β 1 00.03901.68451.90400.9680.04031.80562.20310.978
β 2 0.50.16730.83390.96460.9570.21600.89311.14920.966
β 1 0.50.15901.69261.99280.9620.18311.79012.21370.979
β 2 −0.50.18380.82500.92790.9630.19900.87341.04950.975
β 1 0.50.20111.67731.96460.9660.22851.77512.19010.976
β 2 00.01770.79090.89900.9710.03460.84211.04410.977
β 1 0.50.11291.71452.14170.9600.15331.84102.51540.971
β 2 0.50.14990.82640.91150.9720.20680.89211.13310.983
P = 0.5 β 1 −0.50.14041.63811.93550.9560.18611.93372.93880.983
β 2 −0.50.18710.79330.89840.9640.26800.95461.44320.980
β 1 −0.50.21281.63841.85340.9620.29791.92942.62140.982
β 2 00.02530.75820.82980.9590.01060.89361.23740.977
β 1 −0.50.09291.62981.79670.9660.13441.82992.20200.979
β 2 0.50.09890.80580.86080.9660.16510.92461.10440.980
β 1 00.04441.59981.81810.9650.04351.81252.24580.983
β 2 −0.50.18610.79970.88780.9690.23330.93751.23080.983
β 1 00.03041.63571.87620.9560.03641.90372.50940.977
β 2 00.05350.75910.81950.9530.00391.05091.11620.972
β 1 00.03081.61391.77890.9560.03311.83682.31880.979
β 2 0.50.09030.80360.89840.9700.19330.95711.24480.982
β 1 0.50.14481.66891.92460.9640.22511.93092.78920.983
β 2 −0.50.16940.80160.87540.9640.19870.92001.18350.977
β 1 0.50.15841.64081.89340.9620.21161.88512.40360.979
β 2 00.04260.75530.83900.9680.01980.87271.16490.978
β 1 0.50.14991.61961.80700.9700.17701.84452.27990.984
β 2 0.50.09000.80120.86670.9620.17620.91181.16010.977
Table 2. Comparison of the EM method and the NR method with different censoring ratios at n = 60 .
Table 2. Comparison of the EM method and the NR method with different censoring ratios at n = 60 .
EMNR
BiasESESDCPBiasESESDCP
P = 0.3 β 1 −0.50.10741.12251.29000.9440.11251.12611.31280.944
β 2 −0.50.04050.56540.59890.9500.04460.56570.59830.952
β 1 −0.50.04991.11321.19350.9530.05151.11451.20060.954
β 2 00.01550.53020.54850.9660.01230.53040.54910.966
β 1 −0.50.00501.11401.23530.9410.00561.11531.24430.941
β 2 0.50.05640.56860.60860.9510.05480.56930.61390.951
β 1 00.03751.10051.19550.9640.03751.10171.20200.963
β 2 −0.50.08880.56570.59990.9480.09310.56620.60030.947
β 1 00.05071.09711.16010.9570.05151.09781.16460.955
β 2 00.01710.53310.58440.9480.01350.53310.58390.950
β 1 00.00931.10121.22180.9560.00881.10251.23130.956
β 2 0.50.08190.56550.60330.9510.08100.56660.61110.949
β 1 0.50.08671.12261.23700.9510.08771.12491.25100.951
β 2 −0.50.04220.56660.59990.9450.04600.56700.60100.946
β 1 0.50.00161.11341.21370.9480.00061.11421.21960.948
β 2 00.01680.53320.59420.9440.01320.53320.59340.946
β 1 0.50.08521.12581.27440.9440.08891.12891.29050.943
β 2 0.50.07870.56770.61770.9570.07700.56870.62110.953
P = 0.4 β 1 −0.50.11491.01021.13720.9510.12041.01801.16820.950
β 2 −0.50.05690.50980.56210.9490.05980.51320.57380.951
β 1 −0.50.01051.01751.13240.9470.01311.02271.15190.947
β 2 00.00350.47810.52070.9430.00340.48010.52840.944
β 1 −0.50.09451.00701.10290.9440.09771.01101.11910.943
β 2 0.50.04550.50310.53800.9590.04690.50480.54790.959
β 1 00.02981.00271.13160.9470.03151.00891.15530.949
β 2 −0.50.01930.50740.54740.9570.02200.51010.56050.958
β 1 00.00190.99441.11560.9520.00221.00071.14100.951
β 2 00.01020.47800.52530.9460.01110.47990.53210.944
β 1 00.00981.00391.11040.9490.00981.00941.13220.951
β 2 0.50.07180.51000.56160.9460.07300.51200.57160.942
β 1 0.50.06791.02021.21190.9370.07021.02541.23060.936
β 2 −0.50.07920.50760.56210.9480.08120.50980.56820.947
β 1 0.50.06101.00681.10490.9440.06351.01131.12210.944
β 2 00.01650.47580.50960.9440.01620.47690.51300.945
β 1 0.50.06781.01261.11130.9480.06931.01841.13220.948
β 2 0.50.08650.50880.55260.9530.08920.51160.56730.949
P = 0.5 β 1 −0.50.05130.97761.06170.9560.06020.99571.10850.957
β 2 −0.50.09130.49000.51250.9640.08710.49950.54190.963
β 1 −0.50.11440.97470.97520.9640.12700.99081.01870.962
β 2 00.02240.46150.49800.9530.01610.47030.53890.956
β 1 −0.50.08340.97331.05360.9470.09790.99641.13630.949
β 2 0.50.05520.49150.53750.9530.08220.50410.59970.949
β 1 00.02370.96851.06530.9520.02420.98601.11280.951
β 2 −0.50.07630.48700.52340.9580.07220.49650.55810.960
β 1 00.04720.96861.05610.9490.04520.99131.13510.946
β 2 00.00960.46790.51370.9370.00540.47750.55020.934
β 1 00.01810.96811.07310.9540.01890.98991.14380.957
β 2 0.50.05250.49060.52910.9500.07660.50180.58890.949
β 1 0.50.10280.98051.05810.9590.11111.00131.13210.960
β 2 −0.50.07010.48870.52960.9460.06960.50020.57640.949
β 1 0.50.05390.97401.05500.9540.06160.99521.11300.954
β 2 00.04470.46550.51160.9440.03250.47630.55720.945
β 1 0.50.07570.98261.04140.9560.08321.00361.10600.957
β 2 0.50.05860.48840.50830.9640.08300.49890.55450.961
Table 3. Comparison of the EM method and the NR method with different censoring ratios at n = 100 .
Table 3. Comparison of the EM method and the NR method with different censoring ratios at n = 100 .
EMNR
BiasESESDCPBiasESESDCP
P = 0.3 β 1 −0.50.05060.81340.86410.9420.05220.81320.86610.941
β 2 −0.50.04830.42080.44570.9490.05230.42070.44530.950
β 1 −0.50.08670.81400.85130.9540.08780.81370.85330.954
β 2 00.00610.39440.40990.9450.00240.39420.40940.945
β 1 −0.50.03820.81900.87940.9460.03920.81860.88150.946
β 2 0.50.05480.41910.43440.9560.05120.41900.43470.956
β 1 00.01540.80240.85900.9460.01520.80230.86120.946
β 2 −0.50.01470.41770.43490.9470.01880.41750.43420.947
β 1 00.00280.80530.84560.9530.00280.80490.84720.952
β 2 00.01930.39650.41290.9530.01530.39630.41210.953
β 1 00.02580.80610.84180.9460.02600.80560.84360.945
β 2 0.50.03500.41800.44580.9410.03100.41790.44500.941
β 1 0.50.03520.81570.83650.9530.03670.81560.83930.952
β 2 −0.50.02320.41880.43440.9520.02710.41860.43390.953
β 1 0.50.05610.81690.85790.9590.05720.81660.85990.958
β 2 00.00280.39500.40890.9510.00100.39480.40830.951
β 1 0.50.04290.82080.88140.9410.04390.82040.88380.941
β 2 0.50.05510.41900.43110.9480.05150.41890.43100.948
P = 0.4 β 1 −0.50.00330.72810.74290.9590.00520.72900.74970.957
β 2 −0.50.01680.37250.38180.9530.02010.37300.38230.954
β 1 −0.50.09070.73820.76210.9540.09290.73880.76780.952
β 2 00.01960.35450.36690.9500.01610.35460.36740.951
β 1 −0.50.07720.73840.79020.9420.07940.73870.79620.942
β 2 0.50.05240.37360.39380.9430.04870.37380.39580.942
β 1 00.01150.72290.76050.9480.01180.72390.76800.947
β 2 −0.50.03660.37250.38800.9490.03960.37280.38910.950
β 1 00.01520.72600.77760.9450.01570.72670.78480.942
β 2 00.02040.35330.37250.9340.02430.35350.37330.935
β 1 00.00440.72920.78000.9390.00490.72950.78620.938
β 2 0.50.03100.37260.37360.9580.02700.37280.37580.956
β 1 0.50.06860.73470.80410.9440.07110.73560.81090.941
β 2 −0.50.01850.37180.38930.9560.02150.37210.38940.955
β 1 0.50.02150.73900.79080.9490.02340.73970.79700.947
β 2 00.00570.35550.37350.9560.00170.35570.37390.953
β 1 0.50.02660.73700.80720.9310.02890.73780.81670.931
β 2 0.50.03830.37330.39840.9400.03520.37360.40150.939
P = 0.5 β 1 −0.50.00950.70450.71720.9550.00650.70910.73450.955
β 2 −0.50.04100.35550.37320.9490.03520.35770.38070.946
β 1 −0.50.05520.70900.71340.9630.05920.71380.73150.962
β 2 00.00030.34070.34880.9570.00470.34240.35590.951
β 1 −0.50.06060.70580.75530.9390.06620.71090.77830.936
β 2 0.50.03210.35640.37000.9500.04100.35820.38510.942
β 1 00.01540.69720.72540.9490.01530.70140.74140.947
β 2 −0.50.05630.35580.36970.9530.05160.35780.37570.953
β 1 00.00040.69690.71250.9670.00030.70140.73080.966
β 2 00.00080.34020.36690.9310.00390.34170.37470.928
β 1 00.02200.70120.74030.9500.02210.70590.76060.946
β 2 0.50.00080.35600.37610.9460.00580.35750.38890.944
β 1 0.50.01980.70760.77660.9440.02510.71280.79700.944
β 2 −0.50.05250.35710.36790.9550.04780.35930.37530.954
β 1 0.50.03410.70770.74850.9500.03800.71280.77020.948
β 2 00.01000.34040.34240.9570.00540.34180.34940.955
β 1 0.50.01630.70480.72600.9500.02080.70950.74810.948
β 2 0.50.04680.35580.37270.9490.05550.35770.39030.943
Table 4. Comparison of the EM method and the NR method with different censoring ratios at n = 200 .
Table 4. Comparison of the EM method and the NR method with different censoring ratios at n = 200 .
EMNR
BiasESESDCPBiasESESDCP
P = 0.3 β 1 −0.50.01710.55420.56360.9470.01590.55380.56440.947
β 2 −0.50.01710.28810.29790.9490.02100.28790.29770.950
β 1 −0.50.03770.55600.57250.9430.03860.55560.57290.943
β 2 00.01050.27210.28620.9510.00710.27200.28560.951
β 1 −0.50.04070.55490.57840.9390.04140.55440.57880.940
β 2 0.50.02940.28750.29380.9520.02580.28740.29320.951
β 1 00.02770.54740.55680.9470.02780.54710.55750.947
β 2 −0.50.01410.28760.29200.9570.01800.28750.29180.958
β 1 00.02620.54980.57740.9500.02620.54940.57820.950
β 2 00.00690.27280.28000.9430.00350.27260.27960.945
β 1 00.00970.54810.56520.9420.00970.54760.56560.942
β 2 0.50.03500.28710.30660.9410.03110.28690.30590.944
β 1 0.50.02160.55410.54840.9570.02290.55370.54910.956
β 2 −0.50.00950.28660.28730.9560.01340.28640.28710.956
β 1 0.50.01590.55430.58920.9410.01500.55390.58980.941
β 2 00.00360.27120.27920.9440.00020.27110.27880.944
β 1 0.50.02260.55560.58780.9420.02340.55510.58820.942
β 2 0.50.02090.28780.30600.9330.01710.28770.30530.935
P = 0.4 β 1 −0.50.02310.49870.51550.9550.02510.49880.51860.953
β 2 −0.50.02570.25600.26110.9510.03000.25600.26080.950
β 1 −0.50.02110.49580.49250.9550.02260.49560.49500.954
β 2 00.01120.24300.25430.9410.00650.24290.25410.939
β 1 −0.50.02650.49940.50450.9500.02750.49900.50670.948
β 2 0.50.02270.25530.26370.9490.01690.25530.26290.950
β 1 00.01530.48980.50040.9570.01550.48980.50330.956
β 2 −0.50.02360.25650.26490.9450.02780.25650.26440.945
β 1 00.01450.49040.49680.9510.01470.49020.49930.950
β 2 00.00430.24220.24670.9560.00900.24210.24620.955
β 1 00.02690.49280.50210.9530.02710.49250.50420.953
β 2 0.50.01900.25610.26720.9470.01330.25600.26670.948
β 1 0.50.02750.49810.50090.9470.02970.49820.50420.946
β 2 −0.50.01490.25580.26540.9480.01910.25580.26510.948
β 1 0.50.04280.49960.51530.9470.04470.49940.51830.946
β 2 00.01130.24260.25100.9370.00660.24250.25070.937
β 1 0.50.00860.49850.51890.9450.00790.49810.52120.942
β 2 0.50.04420.25590.26410.9470.03830.25580.26340.944
P = 0.5 β 1 −0.50.00270.47530.47870.9600.00050.47670.48680.958
β 2 −0.50.02490.24270.25630.9350.02130.24340.25770.933
β 1 −0.50.05080.47570.47950.9580.05520.47700.48830.956
β 2 00.00210.23190.23330.9550.00210.23230.23560.956
β 1 −0.50.02320.47470.46310.9640.02630.47570.47180.962
β 2 0.50.01530.24240.24920.9420.01490.24270.25420.939
β 1 00.00630.46990.47340.9530.00630.47130.48120.949
β 2 −0.50.02560.24230.24340.9540.02300.24290.24400.953
β 1 00.01950.47100.46120.9640.01970.47220.47000.959
β 2 00.00980.23220.24530.9360.00990.23270.24840.935
β 1 00.01810.47160.47200.9540.01820.47260.48090.950
β 2 0.50.01790.24290.25580.9470.01750.24320.26200.944
β 1 0.50.01710.47440.48780.9500.02070.47580.49530.948
β 2 −0.50.01740.24220.24650.9510.01460.24290.24750.949
β 1 0.50.04290.47660.48200.9600.04700.47790.49110.956
β 2 00.01220.23180.24080.9320.01230.23220.24330.930
β 1 0.50.02570.47520.49590.9530.02830.47620.50550.947
β 2 0.50.02670.24270.23970.9500.02610.24300.24460.946
Table 5. Comparison of the EM method and the NR method with different censoring ratios at n = 400 .
Table 5. Comparison of the EM method and the NR method with different censoring ratios at n = 400 .
EMNR
BiasESESDCPBiasESESDCP
P = 0.3 β 1 −0.50.01200.38640.38990.9500.01310.38610.39030.949
β 2 −0.50.00270.20080.19640.9540.00640.20060.19630.954
β 1 −0.50.00420.38810.40120.9510.00490.38780.40150.951
β 2 00.00020.18990.18100.9660.00330.18980.18080.965
β 1 −0.50.02230.38770.38570.9540.02290.38730.38570.955
β 2 0.50.01950.20120.20490.9370.01610.20120.20460.936
β 1 00.00780.38120.37740.9490.00780.38090.37770.949
β 2 −0.50.00630.20060.20550.9390.01000.20040.20540.938
β 1 00.00630.38000.38490.9510.00630.37980.38510.951
β 2 00.01010.18970.19830.9430.00690.18960.19800.942
β 1 00.01330.38090.37150.9590.01330.38050.37170.960
β 2 0.50.01470.20030.20790.9430.01120.20030.20740.946
β 1 0.50.02030.38600.38420.9480.02130.38580.38470.948
β 2 −0.50.00070.20070.19690.9560.00290.20050.19680.958
β 1 0.50.00100.38720.40310.9360.00170.38690.40340.936
β 2 00.00040.18990.19290.9440.00360.18980.19260.941
β 1 0.50.01710.38730.40300.9430.01780.38690.40320.943
β 2 0.50.02040.20040.20380.9420.01700.20040.20340.942
P = 0.4 β 1 −0.50.02290.34750.36330.9450.02500.34740.36510.945
β 2 −0.50.00280.17850.17870.9540.00750.17850.17840.954
β 1 −0.50.00500.34470.34990.9490.00620.34450.35130.947
β 2 00.00250.16940.17480.9480.00720.16930.17440.950
β 1 −0.50.00390.34500.34490.9480.00440.34470.34570.947
β 2 0.50.01400.17820.17570.9510.00780.17820.17460.955
β 1 00.00530.34050.34110.9590.00520.34030.34280.957
β 2 −0.50.00790.17860.18250.9540.01270.17860.18220.954
β 1 00.00320.34120.34250.9510.00340.34100.34380.951
β 2 00.00130.16970.17250.9380.00340.16970.17220.935
β 1 00.00700.34140.34520.9590.00690.34100.34590.957
β 2 0.50.01740.17840.17950.9560.01120.17840.17840.957
β 1 0.50.01050.34600.34150.9600.01240.34590.34320.960
β 2 −0.50.00540.17870.17760.9490.00070.17870.17730.952
β 1 0.50.00220.34580.34060.9620.00090.34560.34190.962
β 2 00.00960.16970.17850.9390.00480.16960.17820.938
β 1 0.50.00530.34610.35280.9530.00590.34570.35360.952
β 2 0.50.02110.17870.17940.9470.01490.17870.17820.949
P = 0.5 β 1 −0.50.00430.32800.32150.9580.00760.32870.32610.954
β 2 −0.50.01220.16850.17620.9450.01000.16890.17600.942
β 1 −0.50.01970.32860.33050.9530.02330.32910.33610.950
β 2 00.00660.16120.16310.9490.00410.16130.16370.948
β 1 −0.50.01030.32880.34580.9430.01270.32910.35120.941
β 2 0.50.01830.16880.16940.9520.01480.16890.17200.944
β 1 00.00090.32530.33710.9560.00100.32590.34190.954
β 2 −0.50.00740.16870.17360.9450.00510.16910.17350.947
β 1 00.00950.32550.32450.9580.00960.32590.32980.955
β 2 00.00710.16130.16110.9490.00460.16150.16190.949
β 1 00.00550.32590.31670.9650.00560.32610.32180.960
β 2 0.50.00990.16880.16340.9560.00590.16890.16520.956
β 1 0.50.00190.32980.33570.9510.00510.33050.34040.945
β 2 −0.50.00930.16880.17180.9460.00710.16920.17180.945
β 1 0.50.00040.32930.32080.9510.00310.32980.32630.948
β 2 00.00690.16140.17090.9350.00450.16150.17160.936
β 1 0.50.01330.32950.34330.9380.01580.32980.34870.936
β 2 0.50.01470.16890.17380.9500.01120.16910.17670.946
Table 6. Estimates for the Weibull PH model and semi-parametric PH model when the failure time comes from the Weibull distribution.
Table 6. Estimates for the Weibull PH model and semi-parametric PH model when the failure time comes from the Weibull distribution.
Weibull PH ModelSemi-Parametric PH Model
BiasESESDCPBiasESESDCP
degree = 1 β 1 −0.50.00220.57560.56970.9620.03540.60930.61490.958
β 2 −0.50.01370.28440.27960.9480.04960.30900.29870.950
β 1 −0.50.00960.55590.56430.9580.04810.59230.61420.952
β 2 00.00320.27070.27000.9560.00750.29330.28300.964
β 1 −0.50.04350.54710.56610.9520.08660.58520.62430.944
β 2 0.50.01550.27240.26560.9540.04400.29370.28780.962
β 1 00.02030.56960.56490.9600.00860.60310.61040.952
β 2 −0.50.01140.28370.29290.9440.04280.31230.31220.940
β 1 00.00560.55110.54360.9580.00040.58630.59160.948
β 2 00.01230.27000.27530.9520.00490.29360.29610.960
β 1 00.00960.54360.52450.9600.00840.58060.57140.962
β 2 0.50.02420.27280.28120.9640.05730.29560.30970.958
β 1 0.50.03830.57310.60350.9360.00860.60960.64000.944
β 2 −0.50.01210.28310.28440.9660.02210.30740.30500.960
β 1 0.50.00060.55060.55460.9500.02690.58770.60490.944
β 2 00.00850.27080.26380.9580.00260.29250.27750.968
β 1 0.50.02180.55120.53780.9540.07240.59040.59910.948
β 2 0.50.02190.27350.27600.9560.05130.29660.30400.954
degree = 2 β 1 −0.50.04840.57320.59170.9520.07350.62110.64110.946
β 2 −0.50.03860.28480.28270.9540.07590.33990.30850.942
β 1 −0.50.02830.55260.56760.9560.06950.59980.62580.946
β 2 00.00500.27150.27570.9480.01260.31580.29640.944
β 1 −0.50.02480.54680.54940.9700.06500.59940.60570.976
β 2 0.50.02250.27280.26200.9740.04970.32000.28760.974
β 1 00.01270.56690.57390.9600.01200.61780.61210.954
β 2 −0.50.01390.28130.28470.9500.02450.33360.30870.944
β 1 00.02850.54580.54160.9640.03130.59020.58550.966
β 2 00.00260.27020.28150.9540.00980.31620.30030.956
β 1 00.02060.54500.54710.9440.01760.58840.60370.946
β 2 0.50.02930.27290.27170.9520.05920.33170.29060.962
β 1 0.50.05660.57890.59180.9560.09600.64060.63620.956
β 2 −0.50.00350.28520.30380.9420.04020.34850.32330.952
β 1 0.50.01210.55350.54230.9640.04910.59960.59080.958
β 2 00.00160.27060.29130.9360.01000.31740.30960.936
β 1 0.50.06910.54780.56450.9560.10890.60860.61830.952
β 2 0.50.03050.27250.27850.9520.05830.33700.30950.960
degree = 3 β 1 −0.50.00050.57440.59830.9420.02750.62540.63270.948
β 2 −0.50.02810.28430.30430.9380.06450.32960.32490.864
β 1 −0.50.02430.54970.56320.9480.06500.59980.62100.940
β 2 00.00130.27140.28020.9480.02980.33810.30990.914
β 1 −0.50.03560.55840.58480.9420.07440.61600.64350.948
β 2 0.50.01070.27250.27750.9340.01640.31910.29320.884
β 1 00.03410.57460.61040.9320.06700.63320.66440.926
β 2 −0.50.03400.28540.30010.9420.06920.33340.32620.876
β 1 00.04680.54750.56220.9620.08930.60120.61370.956
β 2 00.00910.27130.26930.9620.01890.33550.30090.918
β 1 00.02080.55760.54580.9620.05960.60790.58730.960
β 2 0.50.00920.27250.26680.9580.01980.30940.28090.914
β 1 0.50.01430.57270.57300.9600.01780.63010.61420.964
β 2 −0.50.00270.28310.28790.9460.03740.32520.31050.894
β 1 0.50.01650.54740.55860.9560.02020.59230.61160.964
β 2 00.00210.27210.28960.9360.02690.31560.31900.878
β 1 0.50.02340.57500.62280.9420.02430.63270.66930.940
β 2 0.50.03600.28510.30330.9440.07110.34650.31840.906
Table 7. Estimates for the the Weibull PH model and semi-parametric PH model when the failure time comes from the non-Weibull distribution.
Table 7. Estimates for the the Weibull PH model and semi-parametric PH model when the failure time comes from the non-Weibull distribution.
Weibull PH ModelSemi-Parametric PH Model
BiasESESDCPBiasESESDCP
degree = 1 β 1 −0.50.04510.50060.49660.9600.01830.55080.56530.954
β 2 −0.50.03230.24840.25780.9280.03340.27940.29220.942
β 1 −0.50.04620.48600.43620.9740.03480.53620.51620.962
β 2 00.00190.23970.22910.9660.00270.26470.26250.952
β 1 −0.50.04620.48120.43950.9660.03140.53100.50940.970
β 2 0.50.04270.23990.21460.9720.03890.26790.25710.964
β 1 00.02070.49850.47390.9540.02710.54830.54870.952
β 2 −0.50.03020.24730.25590.9360.04340.27760.29670.942
β 1 00.02650.48780.50020.9600.02730.54090.59060.946
β 2 00.02510.24030.24090.9560.02990.26970.28240.952
β 1 00.01720.48000.45380.9700.02830.52940.53770.952
β 2 0.50.06490.24040.22160.9620.01330.26830.26090.954
β 1 0.50.05350.50070.50030.9540.01590.55190.58060.948
β 2 −0.50.03630.24860.24940.9460.04020.28110.28530.948
β 1 0.50.04680.49020.46900.9660.03240.54360.55500.960
β 2 00.00170.24030.23850.9560.00610.26550.26720.958
β 1 0.50.06750.48260.43420.9700.00270.53010.49680.966
β 2 0.50.05140.24040.21760.9680.02470.26800.25740.962
degree = 2 β 1 −0.50.00860.50370.51000.9560.06160.55800.59180.950
β 2 −0.50.03970.24890.23690.9620.03240.28470.27480.970
β 1 −0.50.07150.48500.41070.9840.00770.53700.48180.974
β 2 00.00230.23950.23490.9420.00180.26830.27080.942
β 1 −0.50.08720.48410.48950.9500.01290.53690.57750.946
β 2 0.50.06280.24110.22080.9580.01660.27220.26730.956
β 1 00.00300.49770.51260.9460.00260.55170.59450.938
β 2 −0.50.02230.24860.24360.9520.05000.28590.28450.952
β 1 00.03110.48850.46410.9680.03850.54170.54570.954
β 2 00.00050.24080.23510.9580.00100.26960.26970.956
β 1 00.00760.48130.43430.9740.01540.53180.51570.962
β 2 0.50.06710.24110.22970.9440.01220.27150.27330.944
β 1 0.50.01770.50270.47980.9740.06960.55720.55920.964
β 2 −0.50.04850.24830.23820.9660.02600.28480.27330.964
β 1 0.50.03690.49120.47340.9600.04810.54590.55380.958
β 2 00.01560.24050.24040.9600.01350.26890.27270.956
β 1 0.50.03530.48410.45870.9500.04670.53900.53360.948
β 2 0.50.05270.24140.22940.9500.02950.27420.27430.958
degree = 3 β 1 −0.50.00450.50260.46900.9740.06970.56290.54970.960
β 2 −0.50.02270.24860.23550.9640.04850.29200.27500.976
β 1 −0.50.02340.48890.44520.9680.06740.54610.52500.954
β 2 00.00630.24010.23140.9580.01230.27230.26730.956
β 1 −0.50.04160.48220.44450.9760.02840.53410.51320.974
β 2 0.50.06900.23990.22760.9480.00790.27360.26920.962
β 1 00.00610.49650.48880.9660.00930.55570.57210.956
β 2 −0.50.02920.24790.24710.9500.04380.29310.28250.958
β 1 00.00630.48610.46070.9640.00900.54300.55250.956
β 2 00.00320.24000.22820.9620.00590.27470.26620.958
β 1 00.02540.47950.44850.9700.03240.53130.52340.968
β 2 0.50.05300.24050.23320.9460.02690.27510.27570.968
β 1 0.50.00300.50400.47610.9820.07020.56480.56470.974
β 2 −0.50.02970.24940.25170.9560.04640.29170.29200.954
β 1 0.50.05790.48790.46140.9720.01630.54140.53520.960
β 2 00.01860.23960.23900.9620.02010.27130.26840.962
β 1 0.50.07410.48330.48010.9560.00000.53610.55900.950
β 2 0.50.04940.24090.20980.9620.02770.27780.25620.974
Table 8. The AIC and BIC of the Weibull PH model and the semi-parametric PH model when the failure time comes from a Weibull distribution.
Table 8. The AIC and BIC of the Weibull PH model and the semi-parametric PH model when the failure time comes from a Weibull distribution.
Weibull PH ModelSemi-Parametric PH Model
AICBICAICBIC
degree = 1 β = ( 0.5 , 0.5 ) 158.0671171.2604162.5652188.9517
β = ( 0.5 , 0 ) 159.3115172.5048164.0589190.4455
β = ( 0.5 , 0.5 ) 153.2519166.4451158.3429184.7294
β = ( 0 , 0.5 ) 158.5114171.7047163.0422189.4287
β = ( 0 , 0 ) 159.2377172.4310164.0106190.3972
β = ( 0 , 0.5 ) 153.1766166.3699158.3069184.6934
β = ( 0.5 , 0.5 ) 159.3043172.4976164.0052190.3918
β = ( 0.5 , 0 ) 158.5679171.7612163.3746189.7611
β = ( 0.5 , 0.5 ) 153.1342166.3275158.1237184.5102
degree = 2 β = ( 0.5 , 0.5 ) 157.5231170.7164163.7349193.4198
β = ( 0.5 , 0 ) 158.5837171.7769165.0200194.7048
β = ( 0.5 , 0.5 ) 153.4395166.6327160.0308189.7156
β = ( 0 , 0.5 ) 159.7065172.8997165.9552195.6400
β = ( 0 , 0 ) 158.5474171.7407164.8709194.5557
β = ( 0 , 0.5 ) 153.4417166.6350159.9141189.5989
β = ( 0.5 , 0.5 ) 157.3260170.5192163.4532193.1381
β = ( 0.5 , 0 ) 159.0321172.2254165.3613195.0462
β = ( 0.5 , 0.5 ) 153.7928166.9861160.1865189.8714
degree = 3 β = ( 0.5 , 0.5 ) 158.0656171.2589166.2319199.2151
β = ( 0.5 , 0 ) 155.7645168.9577164.1663197.1495
β = ( 0.5 , 0.5 ) 157.2065170.3998165.3615198.3447
β = ( 0 , 0.5 ) 157.1589170.3521165.2879198.2711
β = ( 0 , 0 ) 154.8845168.0777163.3521196.3353
β = ( 0 , 0.5 ) 157.3314170.5247165.5741198.5573
β = ( 0.5 , 0.5 ) 158.9620172.1552167.2433200.2265
β = ( 0.5 , 0 ) 153.8931167.0864162.4024195.3856
β = ( 0.5 , 0.5 ) 157.6660170.8592165.8563198.8395
Table 9. The AIC and BIC of the Weibull PH model and the semi-parametric PH model when the failure time does not follow a Weibull distribution.
Table 9. The AIC and BIC of the Weibull PH model and the semi-parametric PH model when the failure time does not follow a Weibull distribution.
Weibull PH ModelSemi-Parametric PH Model
AICBICAICBIC
degree = 1 β = ( 0.5 , 0.5 ) 178.4425191.6358180.6047206.9913
β = ( 0.5 , 0 ) 176.8683190.0616179.8233206.2098
β = ( 0.5 , 0.5 ) 171.7429184.9362174.7805201.1671
β = ( 0 , 0.5 ) 180.2393193.4325182.3858208.7723
β = ( 0 , 0 ) 176.2032189.3964180.6287210.3136
β = ( 0 , 0.5 ) 171.2697184.4630174.6974201.0839
β = ( 0.5 , 0.5 ) 180.0552193.2485182.0399208.4264
β = ( 0.5 , 0 ) 176.5421189.7354179.3505205.7371
β = ( 0.5 , 0.5 ) 170.9358184.1291174.3874200.7740
degree = 2 β = ( 0.5 , 0.5 ) 177.6850190.8783181.5645211.2493
β = ( 0.5 , 0 ) 177.4778190.6711181.9235211.6084
β = ( 0.5 , 0.5 ) 170.4353183.6285175.1660204.8509
β = ( 0 , 0.5 ) 178.6891191.8824182.5802212.2651
β = ( 0 , 0 ) 175.4079188.6011179.8539209.5387
β = ( 0 , 0.5 ) 169.7078182.9011174.7290204.4138
β = ( 0.5 , 0.5 ) 179.2848192.4781182.8814212.5662
β = ( 0.5 , 0 ) 175.9501189.1434180.4436210.1285
β = ( 0.5 , 0.5 ) 169.5194182.7126174.4124204.0973
degree = 3 β = ( 0.5 , 0.5 ) 179.3540192.5473184.6973217.6805
β = ( 0.5 , 0 ) 176.9312190.1245182.8466215.8298
β = ( 0.5 , 0.5 ) 171.5355184.7287178.6081211.5913
β = ( 0 , 0.5 ) 179.2683192.4616184.9653217.9485
β = ( 0 , 0 ) 177.3079190.5011183.3511216.3343
β = ( 0 , 0.5 ) 170.1874183.3807177.2666210.2498
β = ( 0.5 , 0.5 ) 178.5908191.7841183.9859216.9691
β = ( 0.5 , 0 ) 177.4439190.6372183.7068216.6900
β = ( 0.5 , 0.5 ) 170.5001183.6934177.4353210.4184
Table 10. Variable description of lung tumor data.
Table 10. Variable description of lung tumor data.
VariableDescription
x i Environment, GE = 1, CE = 0
δ i Presence of tumor at death time, yes = 1, no = 0
C i The time of death
Table 11. The estimation results of the Weibull PH model and the semi-parametric PH model for lung tumor data, where the value in parentheses represents the degree of the spline basis function in the semi-parametric PH model and the Var ^ denotes the estimated variance of β ^ .
Table 11. The estimation results of the Weibull PH model and the semi-parametric PH model for lung tumor data, where the value in parentheses represents the degree of the spline basis function in the semi-parametric PH model and the Var ^ denotes the estimated variance of β ^ .
β ^ Var ^ AICBIC
Weibull PH model0.8030.061167176
semi-parametric PH model (1)0.8050.144174195
semi-parametric PH model (2)0.8430.148177200
semi-parametric PH model (3)0.8180.157179205
Table 12. Variable description of IOL calcification data.
Table 12. Variable description of IOL calcification data.
VariableDescription
x i Gender, male = 1, female = 0
δ i Calcified or not, calcified = 1, uncalcified = 0
C i Examination time
Table 13. The estimation results of the Weibull PH model and the semi-parametric PH model for the IOL data, where the value in parentheses represents the degree of the spline basis function in the semi-parametric PH model and the Var ^ denotes the estimated variance of β ^ .
Table 13. The estimation results of the Weibull PH model and the semi-parametric PH model for the IOL data, where the value in parentheses represents the degree of the spline basis function in the semi-parametric PH model and the Var ^ denotes the estimated variance of β ^ .
β ^ Var ^ AICBIC
Weibull PH model−0.2690.097287299
semi-parametric PH model (1)−0.2510.130295323
semi-parametric PH model (2)−0.2540.108297328
semi-parametric PH model (3)−0.2560.210299334
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Chen, S.; Yang, F. Expectation-Maximization Algorithm for the Weibull Proportional Hazard Model under Current Status Data. Mathematics 2023, 11, 4826. https://doi.org/10.3390/math11234826

AMA Style

Chen S, Yang F. Expectation-Maximization Algorithm for the Weibull Proportional Hazard Model under Current Status Data. Mathematics. 2023; 11(23):4826. https://doi.org/10.3390/math11234826

Chicago/Turabian Style

Chen, Sisi, and Fengkai Yang. 2023. "Expectation-Maximization Algorithm for the Weibull Proportional Hazard Model under Current Status Data" Mathematics 11, no. 23: 4826. https://doi.org/10.3390/math11234826

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop