Next Article in Journal
On the Solutions of Linear Systems over Additively Idempotent Semirings
Next Article in Special Issue
New Statistical Residuals for Regression Models in the Exponential Family: Characterization, Simulation, Computation, and Applications
Previous Article in Journal
Novel Classes on Generating Functions of the Products of (p,q)-Modified Pell Numbers with Several Bivariate Polynomials
Previous Article in Special Issue
Imputing Missing Data in One-Shot Devices Using Unsupervised Learning Approach
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Modified Cox Models: A Simulation Study on Different Survival Distributions, Censoring Rates, and Sample Sizes

by
Iketle Aretha Maharela
1,
Lizelle Fletcher
1 and
Ding-Geng Chen
1,2,*
1
Department of Statistics, University of Pretoria, Pretoria 0028, South Africa
2
College of Health Solutions, Arizona State University, Phoenix, AZ 85004, USA
*
Author to whom correspondence should be addressed.
Mathematics 2024, 12(18), 2903; https://doi.org/10.3390/math12182903
Submission received: 18 August 2024 / Revised: 11 September 2024 / Accepted: 17 September 2024 / Published: 18 September 2024
(This article belongs to the Special Issue Statistical Simulation and Computation: 3rd Edition)

Abstract

:
The classical Cox model is the most popular procedure for studying right-censored data in survival analysis. However, it is based on the fundamental assumption of proportional hazards (PH). Modified Cox models, stratified and extended, have been widely employed as solutions when the PH assumption is violated. Nevertheless, prior comparisons of the modified Cox models did not employ comprehensive Monte-Carlo simulations to carry out a comparative analysis between the two models. In this paper, we conducted extensive Monte-Carlo simulation to compare the performance of the stratified and extended Cox models under varying censoring rates, sample sizes, and survival distributions. Our results suggest that the models’ performance at varying censoring rates and sample sizes is robust to the distribution of survival times. Thus, their performance under Weibull survival times was comparable to that of exponential survival times. Furthermore, we found that the extended Cox model outperformed other models under every combination of censoring, sample size and survival distribution.

1. Introduction

The classical Cox model is the most commonly used approach for analysing right-censored data [1,2]. It is used across multiple fields of study, such as engineering, education, medicine, etc. [3,4]. Research and development to strengthen Cox model has been continued even since Zheng et al. [5]. However, it is based on the assumption of proportional hazards (PH), which limits its use [6]. As a result, modified versions of the Cox model need to be adapted to circumvent the violation of the PH assumption. Consequently, the stratified and the extended Cox models are two of the most popular extensions of the Cox procedure [7].
The stratified regression model modifies the PH approach by stratifying the explanatory variables, while the extended Cox technique incorporates one or more covariates that vary over time into the Cox model [8]. Numerous studies have employed both approaches as a solution for non-proportional risks. For example, Ata and Sozer [9] applied both models to study lung cancer. Subsequently, Maryma [10] used the two approaches to overcome the violation of proportional hazards when analysing the breastfeeding span in Lampung province. More recently, Purnami et al. [11] investigated factors contributing to improved mental health using the time-dependent Cox model and the stratified procedure, while Seo and Yuk [12] used both extensions of the Cox model to assess fracture risk and osteoporosis in patients undergoing hysterectomy. Moreover, research by Phonskaningtyas [13] used the adjusted Cox approaches to evaluate the impact of spiritual intervention on chronic kidney failure patients. However, the literature suggests that prior studies merely applied the stratified and extended Cox regression approach, they did not assess the adequacy of the models in handling PH violation through simulations [14].
Monte-Carlo simulation studies are a valuable tool in investigating the performance of statistical models [15]. For instance, Mehrotra et al. [16] employed simulations to illustrate the advantages of the two-step unstratified Cox model against the stratified Cox approach. Subsequently, Olaniran and Abdullah [17] also used Monte-Carlo simulations to investigate the efficiency of the newly developed Bayesian extended Cox model to handle non-proportional data against the standard PH model and the extended Cox model. Meanwhile, Adeleke et al. [7] studied only the extended Cox model at varying levels of sample sizes and censoring rates. On the other hand, Ratnaningsih et al. [18] assessed the performance of both modifications against that of the Cox model and the stratified-extended Cox model through Monte-Carlo simulations. Nonetheless, most of the studies did not evaluate the statistical properties of the stratified against the extended Cox procedures under different survival distributions.
Using a singular survival distribution is problematic as several statisticians have pointed out that more flexibility is required when selecting the distribution of survival times such that the simulated data can reflect real data [15,19,20]. Furthermore, Bender et al. [21] claimed that most Cox related simulation studies used the exponential or Weibull distribution for survival time. Hence, in this paper, in addition to assessing the effect of different combinations and censoring and sample sizes on the models, simulations are used to investigate the performance of the extended Cox model against the stratified approach when survival times follow the Weibull distribution versus the exponential distribution.

2. Methods and Simulations Results

2.1. Methodology

The Cox proportional hazards model (in short, the Cox model) quantifies the relationship between several covariates and the hazard rate of an event of interest [22]. Suppose T is a non-negative random variable denoting survival time; then, the Cox model is expressed by
h ( t | X ) = h 0 ( t ) e x p ( β t X ) ,
where X = ( x 1 , x 2 , , x p ) is a vector of time-independent covariates, h ( t | X ) is the hazard function, h 0 ( t ) is the baseline hazard function, and β = ( β 1 , β 2 , , β p ) t denotes a vector of regression coefficients [23]. Suppose that for a sample size of n, data consists of T i , X i , d i , i = 1 , 2 , , n , where T i is the survival time of the i-th subject, X i a vector of explanatory variables, and d i the censoring indicator defined by
d i = 1 if T i C i not censored 0 if T i > C i censored ,
while C i are censoring times [24]. The sum of probabilities of the event of interest time t i over all subjects at risk is indexed by l. Subsequently, R ( t i ) denotes a set of subjects who are at risk at time t i . Therefore, unknown parameters in the Cox model are estimated by maximizing the log partial likelihood function, defined by
L L ( β ) = l o g i = 1 n h 0 ( t i ) e x p ( β t X i ) l ϵ R ( t i ) h 0 ( t i ) e x p ( β t X l ) d i ,
where d i = I ( T i C i ) [25].

2.2. Stratified Cox Model

Stratification entails controlling for predictors that do not meet the PH assumption by dividing the data into strata with different baseline hazard functions [26]. Suppose the covariate that does not satisfy the assumption PH has G levels. Thus, G is the total number of strata, g = 1 , 2 , , G , while i = 1 , 2 , , n g represents the number of subjects in the g-th stratum [25]. The stratified Cox model is defined by
h g ( t | X ) = h 0 g ( t ) e x p ( β t X ) ,
where h g ( t | X ) is the risk function for a subject from the g-th stratum and h 0 g ( t ) is the baseline hazard function for each stratum. Similarly to the Cox model, the partial likelihood function enables inference for the stratified approach in which L g ( β ) is the partial likelihood function from stratum g, defined by
L g ( β ) = i = 1 n g e x p ( β t X i g ) l ϵ R ( t i g ) e x p ( β t X ( l ) ) d i g
where t g i denotes observed time for the i-th subject in stratum g, R ( t g i ) represents subjects in the g-th stratum at risk at time t g i , and X g i is a vector of explanatory variables [26].

2.3. Extended Cox Model

A Cox regression model that includes covariates that vary over time is called an extended Cox model [27]. The model is given by
h ( t | X ( t ) ) = h 0 ( t ) e x p ( β t X + β ( t ) X ( t ) ) ,
where β is a vector of time fixed regression coefficients, β ( t ) a vector of time-varying coefficients and, unlike the Cox regression model, the exponential component in the extended model contains both the time constant X = ( x 1 , x 2 , . , x p 1 ) and time-dependent covariates X ( t ) = ( x 1 ( t ) , x 2 ( t ) , , x p 2 ( t ) ) [8]. Inference for unknown regression coefficients in the extended model is made in the same way as for the Cox model in (1), maximizing the partial likelihood, or better still the log partial likelihood function, to obtain estimates [25]. The formula of the likelihood function is the same as for the PH model, except that the value for time-dependent covariates is assessed for each risk set.

2.4. Simulation Studies

We conducted Monte-Carlo simulations to investigate the simultaneous effect of censoring, sample size, and distribution of survival time on the performance of the Cox, as well as stratified and extended models when the proportional hazards assumption is not satisfied. We adapted the algorithms of [28] to generate right-censored non-proportional data from an extended Cox model that includes one time-dependent predictor. Data were simulated by the time-varying model:
h ( t | X ( t ) ) = h 0 ( t ) e x p ( β t X + β ( t ) z ( t ) ) .
Steps to generate the non-proportional data are as follows:
1.
Covariates: two time-independent covariates, x 1 N ( 0 , 1 ) and x 2 B i n o m ( 0.5 ) , and a single dichotomous time-dependent variable, which is defined as
z ( t ) = 0 , t < t s ( unexposed / untreated ) 1 , t t s ( exposed / treated ) ,
where t s is the time at which z ( t ) changes from untreated to treated.
2.
Potential switching times: the probable exposure time for each subject in a study such that all subjects are likely to switch from unexposed to exposed is generated by
t s = l o g ( μ ) λ e x p ( a 0 + a 1 x 1 + a 2 x 2 )
where λ = 1 is the hazard function for an exponential distribution, and ( a 0 , a 1 , a 2 ) = ( 1 , 1 , 1 ) are regression coefficients [29,30].
3.
Weibull survival times: Weibull survival times are generated by
T = l o g ( u ) λ e x p ( β t X ) 1 / α if l o g ( u ) < λ e x p ( β t X ) t s α l o g ( u ) λ e x p ( β t X ) t s + λ e x p ( β t X + β ( t ) ) t s λ e x p ( β t X ) 1 / α if l o g ( u ) λ e x p ( β t X ) t s α ,
where the shape parameter equals α = 0.5 for a decreasing hazard rate over time.
4.
Exponential survival times: exponential event times are then generated by
T = l o g ( u ) λ e x p ( β t X ) if l o g ( u ) < λ e x p ( β t X ) t s l o g ( u ) λ e x p ( β t X ) t s + λ e x p ( β t X + β ( t ) ) t s λ e x p ( β t X + β ( t ) ) if l o g ( u ) λ e x p ( β t X ) t s
where λ = 1 , β t X = β 1 x 1 + β 2 x 2 , where β 1 = β 2 = 1 , and β ( t ) = 0.5 .
5.
Censoring times: censoring times are generated from the uniform distribution ( 0 , θ ) , where θ is selected to yield the desired censoring rate: 10%, 30%, and 45%.
6.
Data frame: Steps 1 to 4 produces right-censored survival data that constitutes of the observed time Z i = m i n ( T i , C i ) for the ith subject, censoring indicator d i = I ( T i C i ) , invariant covariates X i , and the time-variant variable z ( t ) , the time where the time-varying covariate z ( t ) switches from 0 to 1 t 0 = m i n ( Z i , t s ) . The final dataset consists of ( T i * , d i , X i , z ( t ) ) .
For both distributions, we simulated 10,000 datasets ( m = 10,000) for each combination of factors. Each dataset is analysed using the following statistical models: Cox PH, stratified and extended Cox model. We report the bias, model-based standard errors (Est SE), empirical standard errors (Emp SE), coverage probabilities (Cov 95%), and mean squared errors (MSE) for each model.
The extended Cox model is the true model, since it was used to generate the data. We first examine the simulation results where the event times were generated from the Weibull distribution and then those for the exponential distribution.

2.4.1. Weibull Survival Times

Table 1 offers results at 10% censoring. As expected, the extended Cox model outperformed the other models by having minimal biases, coverage probabilities around 95% and the lowest MSEs for all parameters. In contrast, the misspecified Cox model performed the worst, with significantly higher biases, poor coverage, and the largest MSE values in each sample. The three survival models produced comparable standard error estimates. With regard to the effect of sample size, the biases, standard errors and MSEs of all statistical models decreased with increasing sample size. However, coverage probabilities from the Cox PH and Stratified decreased significantly when the sample size increased. This phenomenon is expected when the decline in coverage results from bias [19]. Thus, the confidence interval narrows in on an incorrect value as the sample size increases.
Table 2 summarizes the results of the three survival regression models when censoring is at 30%. Similarly to the results observed at low censoring (Table 1), the Cox model yielded the highest biases and mean squared errors at each sample size, while the extended Cox model produced minimum bias and MSEs. In addition, the extended Cox model is the only approach that provided coverage close to the nominal value of 95%. The results presented in Table 2 showed a decrease in biases, standard errors, and MSEs for all models with increasing sample size.
Table 3 presents the mean estimates when censoring is 45%. From the table, it can be seen that the extended Cox model provided the best fit for the simulated data sets with the lowest biases, standard errors (Est and Emp) and MSEs compared to PH and the stratified model. Similarly, the model yielded consistent coverage probabilities approaching the nominal 95% for all covariates.
At a high censoring percentage (45%), Table 3 showed a downward trend in biases for all models with larger sample sizes. The biases of the Cox and stratified regression decreased slightly, while the biases of the extended time-varying approach decreased sharply.
In assessing the influence of censoring level when the assumption of proportionality does not hold and survival times follow the Weibull distribution, the results summarized in Table 1, Table 2 and Table 3 indicate that an increase in censoring led to an appreciation in bias, a loss in precision, and accuracy across all models. However, estimates from the extended exhibited robustness to the censoring rate by producing coverage probabilities very close to 95%. Meanwhile, the Cox and stratified approaches consistently brought about coverage probabilities well below 95% at every censoring level. Moreover, just as increasing the sample size led to confidence intervals that targeted the wrong "true" value in misspecified models, an increment in censoring had the same effect on 95% confidence intervals.

2.4.2. Exponential Survival Times

Table 4 contains the results of all models when the censoring level is 10%, and the probability distribution of survival is exponential. Table 4 shows that estimators from the PH and stratified regression approaches had significantly higher biases and MSEs at different sizes of the generated sample than the extended model. The estimates of the time-dependent extended technique produced confidence intervals that resulted in coverage probabilities close to the nominal 95% level. In contrast, the other two models brought coverage probabilities of less than 90%.
Examining the effect of increasing sample size on the three survival models when censoring is low, we observed that the bias of the estimators tends to decrease; the standard errors decreased, leading to estimates with greater precision. Regarding the Cov 95%, the extended Cox regression model estimates induced steady coverage probability that did not sway from the desired nominal value of 95%.
Table 5 gives simulation results when the censoring rate is set at 30% under exponential survival times. Again, we observed that the extended model is exceedingly more efficient about bias, standard errors, Cov 95%, and MSE estimates. Together with the stratified model, the PH model gave rise to inadequate coverage probabilities that not only consistently derailed from the nominal level of 95%, but confidence intervals from the two models narrowed into the wrong value when the sample became larger. Hence, the coverage probabilities decreased by more than 50% when the sample size increased from 50 to 1000. Moreover, the coverage probabilities obtained from the extended model do not vary much (approximated to 95%) for all sample sizes.
Table 6 provides results from the respective models when censoring is at 45%. It is evident from the table that the estimates from the Cox model have the most considerable bias and MSE, while the estimates from the extended model exhibited the most negligible bias and mean squared error. In addition, the misspecified PH model incited the worst coverage, followed by the stratified model.
Assessing the impact of different censoring levels on the estimates from the respective models when survival times follow the exponential distribution and the data were generated in violation of the Cox proportional hazard model assumption. Biases, standard errors, and MSEs increased with increasing censoring levels. Thus, the estimates of the three approaches showed a loss of precision and accuracy.
Generally, for all three models, the results from Table 1, Table 2, Table 3, Table 4, Table 5 and Table 6 established whether duration follows the Weibull or exponential distribution; the respective models produced comparable results. Furthermore, the performance measures displayed similar trends regarding the censoring rate and sample size. All in all, when the assumption of proportional hazards is violated, the Cox, stratified, and extended regression models revealed some robustness to the distribution of survival time.

3. Discussion and Conclusions

Stratification and the inclusion of time-varying covariates are two of the most common modifications of the Cox regression model that aim to solve the problem of non-proportionality. However, a review of the literature has shown that while the two extensions have been widely compared using real data analyses, empirical comparison through comprehensive Monte-Carlo simulations using a wide range of sample sizes and censoring is still lacking. Therefore, we conducted extensive Monte-Carlo simulation studies to evaluate the performance of the two models when the PH assumption is violated.
For non-proportional simulations with survival times following the Weibull distribution, we observed superior performance of the extended Cox model for all combinations of different sample sizes and censoring rates. The finding agrees with the results of [31], where the author found that the extended Cox approach performs satisfactorily at various censoring levels and sample sizes when PH is violated. Meanwhile, the stratified approach fitted similarly to the violated Cox model at all censoring percentages and sample sizes.
Similarly to the performance under Weibull survival times, the Cox extended model showed the best performance with regard to biases, coverage probabilities, and mean squared errors when event times followed the exponential distribution. However, the model’s efficiency is comparable to the misspecified Cox and stratified PH models. Thus, suggesting that the two modified models are inadequate in addressing the problem of non-proportionality [32]. Finally, the three models showed some robustness to the distribution of survival times when the PH assumption is not met. In other words, the models’ estimates obtained when the duration follows the Weibull distribution are comparable to those obtained when the exponential distribution generates the duration. Such is to be expected for semi-parametric models [8].
However, just like the stratified Cox model, the time-varying extended Cox model has some limitations in dealing with non-proportionality. For instance, Dunkler et al. [33] claimed that the model is only beneficial when most variables in a study are time-independent. Subsequently, Olaniran and Abdullah [17] expressed concerns about using the partial likelihood function in estimating the model, as this may lead to a loss of efficiency. Finally, Ratnaningsih et al. [32] argued that using the stratified and extended Cox models separately is inefficient when considering non-proportional hazards. They suggested combining the two models; a stratified-extended Cox model is more appropriate for non-proportionality. A thorough evaluation of the combined model is of interest for future research. Nevertheless, this chapter provides a comprehensive examination of the advantages and disadvantages of the two most common extensions of the Cox model when proportionality is not met.

Author Contributions

Conceptualization, I.A.M. and D.-G.C.; methodology, I.A.M., L.F. and D.-G.C.; formal analysis, I.A.M. and L.F.; writing—original draft, I.A.M.; writing—review and editing, L.F. and D.-G.C.; supervision, L.F. and D.-G.C.; project administration, I.A.M. All authors have read and agreed to the published version of the manuscript.

Funding

This work is partially supported by South Africa National Research Foundation (NRF) and South Africa Medical Research Council (SAMRC) (South Africa DST-NRF-SAMRC SARChI Research Chair in Biostatistics, Grant number 114613).

Data Availability Statement

No new data were created or analyzed in this study.

Acknowledgments

The authors would like to thank the reviewers for their constructive comments which significantly improved the quality of this paper.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Cox, D.R. Regression models and life-tables. J. R. Stat. Soc. Ser. B 1972, 34, 187–202. [Google Scholar] [CrossRef]
  2. Hosmer, D.W.; Lemeshow, S. Applied Survival Analysis: Regression Modelling of Time to Event Data; Wiley: Hoboken, NJ, USA, 2002. [Google Scholar]
  3. Merie, H.E.; Dessie, A.A.; Bizuneh, M.T. Modelling the Transition Process from Higher Education to Employment: The Case of Undergraduates from Debre Markos University. Educ. Res. Int. 2022, 2022, 1119825. [Google Scholar] [CrossRef]
  4. Raoniar, R.; Maqbool, S.; Pathak, A.; Chugh, M.; Maurya, A.K. Hazard-based duration approach for understanding pedestrian crossing risk exposure at signalised intersection crosswalks–A case study of Kolkata, India. Transp. Res. Part F Traffic Psychol. Behav. 2022, 85, 47–68. [Google Scholar] [CrossRef]
  5. Zheng, R.; Wang, J.; Zhang, Y. A hybrid repair-replacement policy in the proportional hazards model. Eur. J. Oper. Res. 2023, 304, 1011–1021. [Google Scholar] [CrossRef]
  6. Orbe, J.; Ferreira, E.; Núñez-Antón, V. Comparing proportional hazards and accelerated failure time models for survival analysis. Stat. Med. 2002, 21, 3493–3510. [Google Scholar] [CrossRef]
  7. Adeleke, K.; Abiodun, A.; Ipinyomi, R. Extended Cox Modelling of Survival Data with Guarantee Time. Malays. J. Appl. Sci. 2018, 3, 21–33. [Google Scholar]
  8. Kleinbaum, D.G.; Klein, M. Evaluating the proportional hazards assumption. In Survival Analysis; Springer: Berlin/Heidelberg, Germany, 2012; pp. 161–200. [Google Scholar]
  9. Ata, N.; Sözer, M.T. Cox regression models with nonproportional hazards applied to lung cancer survival data. Hacet. J. Math. Stat. 2007, 36, 157–167. [Google Scholar]
  10. Maryama, A. Model Regresi Stratified Cox dan Extended Cox untuk Mengatasi Non Proportional Hazard. Ph.D. Thesis, Tesis ITS, Madrid, Spain, 2016. [Google Scholar]
  11. Purnami, S.W.; Arlianni, K.W.; Andari, S.; Sagiran, S.; Khoirunnisa, E.; Widada, W. Influencing factors that improve mental conditions patients with complementary therapy at Nur Hidayah Hospital, Bantul, Yogyakarta. In Proceedings of the BIO Web of Conferences, EDP Sciences, Wuhan, China, 27–28 May 2023; Volume 75, p. 01006. [Google Scholar]
  12. Seo, Y.S.; Yuk, J.S. Osteoporosis and Fracture Risk Following Benign Hysterectomy among Female Patients in Korea. JAMA Netw. Open 2023, 6, e2347323. [Google Scholar] [CrossRef]
  13. Phonskaningtyas, I.C. Pengaruh Hu-Care terhadap Rentang Waktu Kekambuhan Penyakit Gagal Ginjal Kronis di Rumah Sakit Nur Hidayah Bantul Menggunakan Regresi Cox. Ph.D. Thesis, Institut Teknologi Sepuluh Nopember, Surabaya, Indonesia, 2023. [Google Scholar]
  14. Stanley, C.; Molyneux, E.; Mukaka, M. Comparison of performance of exponential, Cox proportional hazards, weibull and frailty survival models for analysis of small sample size data. J. Med Stat. Inform. 2016, 4, 2–3. [Google Scholar] [CrossRef]
  15. Burton, A.; Altman, D.G.; Royston, P.; Holder, R.L. The design of simulation studies in medical statistics. Stat. Med. 2006, 25, 4279–4292. [Google Scholar] [CrossRef]
  16. Mehrotra, D.V.; Su, S.C.; Li, X. An efficient alternative to the stratified cox model analysis. Stat. Med. 2012, 31, 1849–1856. [Google Scholar] [CrossRef] [PubMed]
  17. Olaniran, O.R.; Abdullah, M.A.A. Bayesian analysis of extended cox model with time-varying covariates using bootstrap prior. J. Mod. Appl. Stat. Methods 2020, 18, 7. [Google Scholar] [CrossRef]
  18. Ratnaningsih, D.; Saefuddin, A.; Kurnia, A.; Mangku, I. Stratified-extended cox model in survival modeling of non-proportional hazard. IOP Conf. Ser. Earth Environ. Sci. 2019, 299, 012023. [Google Scholar] [CrossRef]
  19. Morris, T.P.; White, I.R.; Crowther, M.J. Using simulation studies to evaluate statistical methods. Stat. Med. 2019, 38, 2074–2102. [Google Scholar] [CrossRef]
  20. Ngwa, J.S.; Cabral, H.J.; Cheng, D.M.; Gagnon, D.R.; LaValley, M.P.; Cupples, L.A. Generating survival times with time-varying covariates using the Lambert W function. Commun. Stat.-Simul. Comput. 2022, 51, 135–153. [Google Scholar] [CrossRef]
  21. Bender, R.; Augustin, T.; Blettner, M. Generating survival times to simulate Cox proportional hazards models. Stat. Med. 2005, 24, 1713–1723. [Google Scholar] [CrossRef] [PubMed]
  22. Grambsch, P.M.; Therneau, T.M. Proportional hazards tests and diagnostics based on weighted residuals. Biometrika 1994, 81, 515–526. [Google Scholar] [CrossRef]
  23. Klein, J.P.; Moeschberger, M.L. Survival Analysis: Techniques for Censored and Truncated Data; Springer Science & Business Media: Berlin/Heidelberg, Germany, 2005. [Google Scholar]
  24. Allison, P.D. Survival analysis. Rev. Guide Quant. Methods Soc. Sci. 2010, 413, 425. [Google Scholar]
  25. Collett, D. Modelling Survival Data in Medical Research; CRC Press: Boca Raton, FL, USA, 2015. [Google Scholar]
  26. Therneau, T.M.; Grambsch, P.M. The cox model. In Modeling Survival Data: Extending the Cox Model; Springer: Berlin/Heidelberg, Germany, 2000; pp. 39–77. [Google Scholar]
  27. Andersen, P.K.; Borgan, O.; Gill, R.D.; Keiding, N. Statistical Models Based on Counting Processes; Springer Science & Business Media: Berlin/Heidelberg, Germany, 2012. [Google Scholar]
  28. Austin, P.C. Generating survival times to simulate Cox proportional hazards models with time-varying covariates. Stat. Med. 2012, 31, 3946–3958. [Google Scholar] [CrossRef]
  29. Xu, R.; Luo, Y.; Glynn, R.; Johnson, D.; Jones, K.L.; Chambers, C. Time-dependent propensity score for assessing the effect of vaccine exposure on pregnancy outcomes through pregnancy exposure cohort studies. Int. J. Environ. Res. Public Health 2014, 11, 3074–3085. [Google Scholar] [CrossRef]
  30. Zhang, Z. Propensity score method: A non-parametric technique to reduce model dependence. Ann. Transl. Med. 2017, 5, 5–7. [Google Scholar] [CrossRef] [PubMed]
  31. Anjullo, B.B. A Simulation Study to Evaluate the Performance of Extended Cox model in Testing Treatment Effect with Possible Non-proportional Hazards. Int. J. Progress. Sci. Technol. 2018, 10, 284–293. [Google Scholar]
  32. Ratnaningsih, D.J.; Saefuddin, A.; Kurnia, A. Stratified-extended Cox with frailty model for non-proportional hazard: A statistical approach to student retention data from Universitas Terbuka in Indonesia. Thail. Stat. 2021, 19, 209–228. [Google Scholar]
  33. Dunkler, D.; Ploner, M.; Schemper, M.; Heinze, G. Weighted Cox regression using the R package coxphw. J. Stat. Softw. 2018, 84, 1–26. [Google Scholar] [CrossRef]
Table 1. Weibull survival times: simulation results comparing Cox PH, stratified and extended models under non-proportional hazards at 10% censoring, β 1 = β 2 = 1 and β ( t ) = 0.5 .
Table 1. Weibull survival times: simulation results comparing Cox PH, stratified and extended models under non-proportional hazards at 10% censoring, β 1 = β 2 = 1 and β ( t ) = 0.5 .
ModelParameterBiasEst SEEmp SECov 95%MSE
n = 50
Cox PH β 1 0.28260.23970.25640.81650.1456
β 2 0.28060.37210.40330.89350.2413
β t −1.18430.41450.46770.17811.6212
Stratified β 1 0.26500.24960.26270.85870.1392
β 2 0.25720.38220.40550.91220.2305
Extended β 1 0.05730.23910.25030.94950.0659
β 2 0.05300.36510.38800.94270.1533
β t −0.04090.41710.43510.94420.1910
n = 1000
Cox PH β 1 0.19630.04650.04850.00960.0409
β 2 0.19780.07440.07750.24680.0451
β t −1.02780.08110.08780.00001.0641
Stratified β 1 0.19150.04680.04850.01310.0390
β 2 0.19010.07420.07630.27410.0419
Extended β 1 0.00240.04790.04810.94860.0023
β 2 0.00180.07490.07460.95080.0056
β t −0.00150.08640.08690.94980.0075
Est SE = model based standard error, Emp SE = empirical standard error, Cov 95% = coverage probability for 95% confidence intervals, MSE = Mean Squared Error.
Table 2. Weibull survival times: simulation results comparing Cox PH, stratified and extended models under non-proportional hazards at 30% censoring, β 1 = β 2 = 1 and β ( t ) = 0.5 .
Table 2. Weibull survival times: simulation results comparing Cox PH, stratified and extended models under non-proportional hazards at 30% censoring, β 1 = β 2 = 1 and β ( t ) = 0.5 .
ModelParameterBiasEst SEEmp SECov 95%MSE
n = 50
Cox PH β 1 0.30900.26760.28870.83260.1788
β 2 0.30750.41810.45470.89620.3013
β t −1.16950.46530.51340.28171.6312
Stratified β 1 0.28850.27830.29700.87060.1714
β 2 0.27880.42920.45680.91860.2864
Extended β 1 0.06700.26460.2793094740.0825
β 2 0.05740.40810.43370.94640.1914
β t −0.04660.46460.48520.94710.2376
n = 1000
Cox PH β 1 0.21220.05130.05360.01310.0479
β 2 0.21560.08290.08590.26570.0539
β t −1.01580.09090.09600.00001.0409
Stratified β 1 0.20590.05150.05360.01850.0453
β 2 0.20370.08260.08450.30990.0486
Extended β 1 0.00280.05270.05280.95110.0028
β 2 0.00170.08330.08320.94930.0069
β t −0.00150.09590.09660.95070.0093
Est SE = model based standard error, Emp SE = empirical standard error, Cov 95% = coverage probability for 95% confidence intervals, MSE = Mean Squared Error.
Table 3. Weibull survival times: simulation results comparing Cox PH, stratified and extended models under non-proportional hazards at 45% censoring, β 1 = β 2 = 1 and β ( t ) = 0.5 .
Table 3. Weibull survival times: simulation results comparing Cox PH, stratified and extended models under non-proportional hazards at 45% censoring, β 1 = β 2 = 1 and β ( t ) = 0.5 .
ModelParameterBiasEst SEEmp SECov 95%MSE
n = 50
Cox PH β 1 0.33880.30550.33340.84920.2259
β 2 0.33580.45190.52770.90490.3911
β t −1.15660.053350.58520.41011.6802
Stratified β 1 0.31450.31700.34100.88720.2152
β 2 0.30291.44180.56400.92740.4099
Extended β 1 0.08080.29880.31860.94570.1080
β 2 0.06630.46810.49930.94700.2536
β t −0.05290.52820.55510.94710.3109
n = 1000
Cox PH β 1 0.22580.05760.06010.02240.0546
β 2 0.23250.09460.09790.30950.0637
β t −0.99860.10360.10690.00001.0086
Stratified β 1 0.21860.05780.05990.03150.0514
β 2 0.21740.09420.09620.36460.0565
Extended β 1 0.00300.05870.05840.95100.0034
β 2 0.00260.09470.09460.95200.0089
β t −0.00840.10830.10810.95110.0117
Est SE = model based standard error, Emp SE = empirical standard error, Cov 95% = coverage probability for 95% confidence intervals, MSE = Mean Squared Error.
Table 4. Exponential survival times: simulation results comparing Cox PH, stratified and extended models under non-proportional hazards at 10% censoring, β 1 = β 2 = 1 and β ( t ) = 0.5 .
Table 4. Exponential survival times: simulation results comparing Cox PH, stratified and extended models under non-proportional hazards at 10% censoring, β 1 = β 2 = 1 and β ( t ) = 0.5 .
ModelParameterBiasEst SEEmp SECov 95%MSE
n = 50
Cox PH β 1 0.26660.23480.25540.82790.1363
β 2 0.25860.36840.40720.89170.2326
β t −1.93921.14960.57560.00654.0916
Stratified β 1 0.23640.24240.26150.86870.1243
β 2 0.22440.37620.40660.91470.2157
Extended β 1 0.05380.23380.24510.94440.0629
β 2 0.04710.36020.38370.94230.1494
β t −0.03780.47810.49460.94560.2459
n = 1000
Cox PH β 1 0.17060.04520.04670.03350.0313
β 2 0.17050.07330.07680.36530.0349
β t −1.79110.09150.09880.00003.2179
Stratified β 1 0.15190.04560.04710.08480.0253
β 2 0.14990.07350.07570.47300.0282
Extended β 1 0.00240.04720.04700.95020.0022
β 2 0.00170.07440.07410.95320.0055
β t −0.00110.10080.09970.95330.0099
Est SE = model based standard error, Emp SE = empirical standard error, Cov 95% = coverage probability for 95% confidence intervals, MSE = Mean Squared Error.
Table 5. Exponential survival times: simulation results comparing Cox PH, stratified and extended models under non-proportional hazards at 30% censoring, β 1 = β 2 = 1 and β ( t ) = 0.5 .
Table 5. Exponential survival times: simulation results comparing Cox PH, stratified and extended models under non-proportional hazards at 30% censoring, β 1 = β 2 = 1 and β ( t ) = 0.5 .
ModelParameterBiasEst SEEmp SECov 95%MSE
n = 50
Cox PH β 1 0.28070.25840.27950.85010.1569
β 2 0.27180.40730.44650.90150.2732
β t −2.04810.50850.57750.00674.5281
Stratified β 1 0.25520.26720.28590.88710.1469
β 2 0.24160.41740.44660.91890.2578
Extended β 1 0.06220.25440.26580.94770.0745
β 2 0.05270.39490.41530.94720.1752
β t −0.03700.50510.52140.95030.2732
n = 1000
Cox PH β 1 0.18280.04890.05090.03900.0360
β 2 0.18270.08040.08430.38630.0405
β t −1.81890.09580.10350.00003.3194
Stratified β 1 0.16310.04940.05140.08750.0292
β 2 0.16020.08070.08290.49180.0326
Extended β 1 0.00280.05110.05090.95050.0026
β 2 0.00150.08160.08150.95150.0066
β t −0.00130.10610.10540.95400.0111
Est SE = model based standard error, Emp SE = empirical standard error, Cov 95% = coverage probability for 95% confidence intervals, MSE = Mean Squared Error.
Table 6. Exponential survival times: simulation results comparing Cox PH, stratified and extended models under non-proportional hazards at 45% censoring, β 1 = β 2 = 1 and β ( t ) = 0.5 .
Table 6. Exponential survival times: simulation results comparing Cox PH, stratified and extended models under non-proportional hazards at 45% censoring, β 1 = β 2 = 1 and β ( t ) = 0.5 .
ModelParameterBiasEst SEEmp SECov 95%MSE
n = 50
Cox PH β 1 0.31530.29260.32630.85610.2058
β 2 0.30250.46780.52220.90340.3642
β t −2.12120.55740.63560.01234.9034
Stratified β 1 0.28680.30310.33320.89360.1933
β 2 0.26710.48060.52010.92860.3418
Extended β 1 0.07500.28410.30340.94480.0977
β 2 0.05810.44850.47410.94830.2281
β t −0.04280.54870.56910.94980.3257
n = 1000
Cox PH β 1 0.19430.05420.05660.04970.0409
β 2 0.19600.09090.09600.43000.0476
β t −1.85860.10290.11000.00003.4663
Stratified β 1 0.17330.05470.05700.11130.0333
β 2 0.17140.09120.09410.53810.0382
Extended β 1 0.00280.05650.05620.95210.0032
β 2 0.00220.09210.09230.94990.0085
β t −0.00120.11440.11360.95550.0129
Est SE = model based standard error, Emp SE = empirical standard error, Cov 95% = coverage probability for 95% confidence intervals, MSE = Mean Squared Error.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Maharela, I.A.; Fletcher, L.; Chen, D.-G. Modified Cox Models: A Simulation Study on Different Survival Distributions, Censoring Rates, and Sample Sizes. Mathematics 2024, 12, 2903. https://doi.org/10.3390/math12182903

AMA Style

Maharela IA, Fletcher L, Chen D-G. Modified Cox Models: A Simulation Study on Different Survival Distributions, Censoring Rates, and Sample Sizes. Mathematics. 2024; 12(18):2903. https://doi.org/10.3390/math12182903

Chicago/Turabian Style

Maharela, Iketle Aretha, Lizelle Fletcher, and Ding-Geng Chen. 2024. "Modified Cox Models: A Simulation Study on Different Survival Distributions, Censoring Rates, and Sample Sizes" Mathematics 12, no. 18: 2903. https://doi.org/10.3390/math12182903

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop