Next Article in Journal
Pseudo Steady-State Period in Non-Stationary Infinite-Server Queue with State Dependent Arrival Intensity
Next Article in Special Issue
Efficient Estimation and Inference in the Proportional Odds Model for Survival Data
Previous Article in Journal
Spatiality Sensitive Learning for Cancer Metastasis Detection in Whole-Slide Images
Previous Article in Special Issue
Nonparametric Sieve Maximum Likelihood Estimation of Semi-Competing Risks Data
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Generalized Accelerated Failure Time Models for Recurrent Events

1
The Institute of Statistics and Big Data, Renmin University of China, Beijing 100872, China
2
School of Mathematics, Yunnan Normal University, Kunming 650092, China
*
Author to whom correspondence should be addressed.
Mathematics 2022, 10(15), 2662; https://doi.org/10.3390/math10152662
Submission received: 29 June 2022 / Revised: 22 July 2022 / Accepted: 25 July 2022 / Published: 28 July 2022
(This article belongs to the Special Issue Recent Advances in Computational Statistics)

Abstract

:
For analyzing recurrent event data, we consider a generalization of the classical accelerated failure time model. In the proposed approach, the general function is no longer assumed to be a singleton but allowed to be time-varying. This is in the same spirit as in quantile regression and the counting process techniques can be utilized. Theoretical properties such as consistency and asymptotic normality are obtained. The illustration of the methodology using simulation studies and then the application to the bladder cancer data is also given.

1. Introduction

Recurrence event data refer to the situation in which events of interest occur repeatedly over time, and these are wildly used in science and technology, especially in medical research, e.g., epileptic seizures, tumor recurrences, or asthma attack. When the investigators studied the recurrent events, they are often interested in the estimation of covariates on recurrent event times, which can help them perform further predictions. The establishment of models can be approached in many ways; meanwhile, the concept of intensity functions and the counting process are also useful. The nonparametric method used to generalize the intensity estimator for the censored failure time data, independently started by [1,2], also denoted the Nelson–Aalen estimator, is one of the methods widely used by investigators. Refs. [3,4], among others, also studied multiplicative models for the rate and mean functions, an approach which can be used with more general models, including regression. Ref. [5] proposed a generalization of the accelerated failure time model, the so-called accelerated recurrence time model, which resembles quantile regression as well as allows for the evolution of covariate effects.
In this paper, we also extend the quantile regression approach and utilize the counting process techniques. For a time-to-event response T, Ref. [6] proposed that a form may assume Q T ( τ | Z ) = exp X T β 0 for τ ( 0 , 1 ) , where Q T ( τ | Z ) = inf t : P r ( T t | Z ) τ , so Q T ( τ | Z ) can also be seen as the τ -th quantile of T given that the covariates Z and X = ( 1 , Z T ) T . Here, T , τ and t denote the time to event, the quantile level, and a non-negative real number, respectively. Since its introduction, quantile regression has been widely used and researched, mainly in survival analysis. Ref. [7] extended the LAD estimation method to more general quantiles, and can also improve efficiency when the error terms are identically distributed. Ref. [5] developed a new quantile regression approach to the counting process, and Ref. [8] adopted a constant general function in quantile regression which it applied to recurrence event data. In the application and algorithm estimation of quantile regression, efforts have also been made by many people. Ref. [9] proposed a locally weighted censored quantile regression approach that can solve covariate-dependent censoring. Ref. [10] proposed a semi-parametric approach using empirical likelihood to a random effects quantile regression model.
Overall, this work considers a situation wherein the general function of the accelerated failure time model is affected by time, sharing the same spirit of the parametric estimation method, while we will use the nonparametric method to estimate the effect. Then, we developed a new counting process model extended from the quantile regression models of this general function. This method provides some ideas for the development of more diverse estimation methods for the counting process.

2. Model

2.1. Accelerated Recurrent Events Time Model

Defining the R ( · ) = j = 1 I ( T ( j ) · ) , where I ( · ) is the indicator function. The observed counting process N ( · ) = R ( · L ) shows that the observation is limited to follow-up time L and ∧ is the minimization operator, meanwhile, the at-risk process Y ( · ) and Y ( t ) = I ( L t ) . We also have μ Z = E R ( t ) | Z , and Z is a p-vector as the covariates of interest. The original accelerated recurrence time models considered covariates’ effects as time scale changes, which also share the same spirit as quantile regression, and the inverse function is
τ Z ( u ) = inf t : μ Z ( t ) > G ( u ) ,
in a general setting, and G ( u ) = u denotes the time to expected frequency u. However, using a constant to estimate the expected frequency has some limitations. For example, the effect of the intervention of interest factors affecting the occurrence of an event may change over time. In this case, the model which combines a constant and variable coefficient may be more effective. As such, we proposed a new accelerated recurrence time model, and the right side of the inequality in the inverse function is G ( u ) = u · m ( t ) instead of G ( u ) = u , where m ( t ) is the function of time t. Regardless of the form of inverse function, it can be written in the following form:
log τ Z ( u ) = X T β 0 ( u )
As can be seen from the above formula, log τ Z ( u ) should be linear. Compared to the original accelerated recurrence model and the improved model, when the recurrent event T follows a homogeneous Poisson distribution, as shown in Figure 1, the estimated log τ Z ( u ) by both models is approximately linear when the recurrent event is a non-homogeneous Poisson process, which has an intensity function λ 0 ( t ) related to time t. In Figure 2, we compare three different situations, namely when λ 0 ( t ) = b × t and b = 1 , 5 , 10 . As b increases, the estimation of the original accelerated recurrence model tends to be more and more nonlinear, while the proposed improved model still shows a strong linear relationship. In that way, the proposed recurrent event time model can be used in a more general situation.
In this paper, we consider the situation that G ( u ) = u t , then model 2 can be rewritten as
E ( X R exp ( X T β ) exp ( X T β ) u ) = 0
if and only if β = β 0 ( u ) , and the left-hand side should be monotone, then the above equation can be transformed into
β 0 ( u ) = argmin β E j = 1 ( X T β log T ( j ) ) + exp ( X T β ) u .
where a + = a 0 and ∨ denotes the maximization operator. Then, we can come up with the following theorem:
Theorem 1.
For G ( u ) = exp ( X T β ( u ) ) u , if (1) hold and R τ Z ( u ) L Z , define v 2 = v v T for vector v, μ ˙ Z ( · ) is the derivative of μ Z , let F Z ( x ) = E Y ( x ) Z , and assume
(C1) 
E X 2 I L > τ Z ( u ) is non-singular;
(C2) 
μ ˙ Z ( e X T β 0 ( u ) ) F Z ( e X T β 0 ( u ) ) > U for all u ( 0 , U ] .
Then,
β 0 ( u ) = argmin β ψ ( β ; u ) ,
where
ψ ( β ; u ) = E j = 1 M ( X T β log L log T ( j ) ) + exp ( X T β log L ) u .
The proof of this result, as well as the proofs for Theorems 2 and 3 below, are given in Appendix A. Powell extended these results for censored quantile regression when the censoring time is observed [7]. We will then combine the counting process model and apply this method to the recurrence events data.

2.2. The Recurrent Events Model

For the counting process model, we begin with a review of Gill and Andersen’s study [11], which extended the Cox proportional hazards model [12] to a multivariate counting process. The intimate connection of the study on the multivariate counting process and the use of martingale methods can allow us to derive more properties of the statistical estimation and testing procedures. As each component N i ( · ) of a multivariate counting process has a cumulative hazard function Λ T ( · ) , there exist local martingales M i defined by M i ( t ) = N i ( t ) Λ T ( t ) . Since the cumulative hazard function Λ T ( · ) is the integral of the hazard function, then according to [11], the counting process formation of the Cox model can be given by
E N ( t ) Z = E 0 u Y ( s ) λ 0 ( s ) e X T b d s Z , t > 0 ,
where λ 0 ( s ) denotes the baseline hazard function, and this model can accommodate recurrent events data well. Peng and Huang conducted a re-examination work [5], which combined the quantile regression method for survival data with the above counting process model.
E N ( e X T β 0 ( u ) ) Z = E 0 u Y ( e X T β 0 ( s ) ) 1 1 s d s Z , u ( 0 , 1 ) ,
which has a nice monotonicity property. In model 6, the estimated cumulative hazard function Λ ^ T ( x ) = log ( 1 x ) . Huang also showed that a singleton model, where an estimated cumulative hazard function is a constant u, can also be applied in the counting process model [13]. When the singleton model is combined with quantile regression, this extended the following model to a more general situation of recurrent events [8], and the estimation formula is given by
E N ( e X T β 0 ( u ) ) Z = E 0 u Y ( e X T β 0 ( s ) ) g ( s ) d s Z , u ( 0 , 1 ) ,
where g ( s ) 1 . In fact, in model 7, the estimated cumulative hazard function G ( u ) = u , which is a constant. According to Theorem 1 and the above models, we can propose a new counting process model that takes the form
E N ( e X T β 0 ( u ) ) Z = E 0 u Y ( e X T β 0 ( s ) ) d G ( s ) Z , u ( 0 , U ] ,
where in our model, we can also show that
μ Z ( e X T β 0 ( u ) ) = G ( u ) = u t = 0 u g ( s ) d s .
Therefore, we also have an alternative representation of the model that
τ Z ( G ( u ) ) = exp X T β 0 ( u ) , u ( 0 , U ] ,
The proof of this recurrence events setting is also shown in the Appendix A. In the simulation part, we will show that our estimation method performs better when the recurrent event time follows the non-homogeneous Poisson distribution.

2.3. The Proposed Estimation Procedure

From Theorem 1, we can define the objective function
Ψ ( β ; u ) = n 1 i = 1 n j = 1 M i ( X i T β log L i log T i ( j ) ) + exp ( X i T β log L i ) u ) .
Theorem 1 leads to the estimation of β 0 ( u ) , that is β ^ ( u ) = argmin β Ψ ( β ; u ) .
It can be seen that the objective function is not convex. An algorithm was developed to find the local minimizer that is asymptotically equivalent to the global minimizer [13]. Since we proposed the counting process-based model (model 7), then we can propose an equation to estimate β 0 ( u ) :
n 1 / 2 S n ( β , u ) ,
where
S n ( β , u ) = n 1 i = 1 n X i N i ( exp X i T β ( u ) ) 0 u I ( L i exp X i T β ( s ) ) d G ( s ) .
As the results proposed in [11], although M h is a local square integrable martingale, it has the same asymptotic property as a global martingale, and if we let S ( β , u ) = E S n ( β , u ) , then follows the martingale property of M ( · ) given that S ( β 0 , u ) = 0 .
Equation (8) boils down to the estimation of censored quantile regression in [5]. A common method used to predict β 0 ( u ) is the grid-based estimation procedure by denoting β 0 ( u ) as a right-continuous piecewise-constant function that jumps on a grid. More specifically, we define S L ( n ) = 0 = u 0 < u 1 < < u L ( n ) = U , and for our recurrence setting, the U in the grid can be greater than 1. The size of S L ( n ) , denoted by S L ( n ) , is the maximum value of u j u j 1 where j = 1 , , L ( n ) . It is also noteworthy that exp X T β ^ ( 0 ) = 0 . Thus, based on model 9, we can estimate β ^ ( u j ) by sequentially solving the estimating equation
n 1 / 2 i = 1 n X i R i ( exp X i T β ( u j ) ) k = 0 j 1 I ( L i exp X i T β ^ ( u k ) ) u k u k + 1 g ( s ) d s = 0 ,
Since Equation (10) is not continuous, the exact root may not exist. Fygenson and Ritov proposed a generalization solution of β ^ ( u j ) for monotone estimating equations [14]. To find the generalization solution of Equation (10), we need to perform some simple algebraic manipulations, and then the solution-finding problem is equivalent to locating the minimizer of the following L 1 -type convex function:
l j ( h ) = i = 1 n j = 1 I ( T i ( j ) L i ) log T i ( j ) X i T h + R i = 1 n j = 1 I ( T i ( j ) L i ) ( X i ) T h + R i = 1 n 2 X i T h k = 0 j 1 I ( L i exp X i T β ^ ( u k ) ) exp X i T β ^ ( u k ) ( u k + 1 u k ) .
where R is a very large number and j = 1 , , L ( n ) . One can also solve the l j ( h ) by using statistical software, such as the rq ( ) function in R package quantreg or the l 1 fit ( ) function in S-PLUS.

2.4. Asymptotic Properties

In this section, we establish the uniform consistency and weak convergence of the proposed estimator β ^ ( u ) . Firstly, some regularity conditions should be stated.
Define μ ( b ) = E X N ( exp ( X T b ) ) , μ ˜ ( b ) = E X I ( exp ( X T b ) L ) , μ ˜ Z ( x ) = E ( N ( x ) | Z ) , g Z ( x ) = d μ ˜ Z ( x ) / d x , F ( t | Z ) = P r ( x t | Z ) , f ¯ ( x | Z ) = f ( x | Z ) = d F ( x | Z ) / d x . By using simple algebra, we also define B ( b ) = d μ ( b ) / d b T = E X 2 e X T b g Z ( e X T b ) , J ( b ) = d μ ˜ ( b ) / d b T = E X 2 e X T b f ¯ ( e X T b | Z ) . Suppose the assumptions in Theorem 1 hold and β ^ ( u ) is strongly consistent for β 0 ( u ) . Assume the following conditions:
(C1’)
Z and N ( L ) are bounded;
(C2’)
(a) μ ˙ Z ( · ) is bounded and continuous at t = τ Z ( u ) , as well as uniform in Z, and (b) μ ( β 0 ( u ) ) is a Lipschitz function of u;
(C3’)
(a) each component of J ( b ) B ( b ) 1 is uniformly bound on b B ( d ) , where B ( d ) is a neighborhood containing β 0 ( u ) , (b) g Z ( x ) > 0 and E ( Z 2 ) is positive definite;
(C4’)
inf u [ v , U ] eigmin B ( β 0 ( u ) ) > 0 for any v ( 0 , U ] , where eigmin ( · ) denotes the minimum eigenvalue of a matrix.
Condition (C1’) implies the boundedness of covariates and the number of observed events, and (C2’) gives the smoothness of β 0 ( u ) , condition (C3’) sets additional mild assumptions such as the positive definite. Noted that (C4’) is a key condition that ensures the consistency of the proposed estimator. Then, we have the following theorems.
Theorem 2.
Under conditions C1’–C4’, when lim n S L ( n ) = 0 , then
sup u ( v , U ] β ^ ( u ) β 0 ( u ) P 0 ,
where 0 < v < U .
Theorem 3.
Under condition C1’–C4’, when lim n n 1 / 2 S L ( n ) = 0 , then
n 1 / 2 β ^ ( u ) β 0 ( u )
converge weakly to a Gaussian process for u [ v , U ] , where v ( 0 , U ) .
The proofs are also in the Appendix A.

3. Simulation Studies

In this part, we conducted the Monte Carlo simulations to test our model. Combining the methods of Huang and Peng [13], recurrent events were generated from both homogeneous Poisson and non-homogeneous Poisson processes. We also generated two covariates, X 1 and X 2 , following the distribution B e r n o u l l i ( 0.5 ) and U n i f ( 0.5 , 0.5 ) separately. The recurrent event time was generated by:
T ( j ) = exp min ( 1 , T ( j ) 1.5 γ ) X 1 + X 2 T ( j ) / γ , j = 1 , 2 , . . . ,
where in one case, T ( j ) , j = 1 , 2 , was a recurrent event time from the standard homogeneous Poisson process, in other words, the gap times of T ( j ) T ( j 1 ) are independent and identically e x p o n e t i a l ( 1 ) variables; in other cases, T ( j ) , j = 1 , 2 , . . . is a recurrent event time from the non-homogeneous Poisson process with the intensity function λ 0 ( t ) = t . Furthermore, the frailty γ followed the Gamma distribution for the homogeneous Poisson process, we considered two situations, that is the variance of γ was chosen to be 0 and 0.5, for the non-homogeneous Poisson process, and we only consider the variance of γ to be 0. Under our simulation setup, we have
τ Z ( G ( u ) ) = exp log ( u ) + min ( 1 , u / 1.5 ) X 1 + X 2 .
For censoring time L, we generated it from U n i f ( 0 , 12 ) . For each selection of the variance of γ , we generated 500 datasets of sample size n = 100 . Since we adopted the grid-based method to estimate the β , an equally spaced grid on u ( 0 , 3 ] with step size 0.02 was conducted in our simulation.
Figure 3 and Figure 4 are the simulation results for the homogeneous Poisson process from the set-up with a Gamma frailty of variance 0 and 0.5. In the first row, we plot the empirical bias of the proposed estimator β ^ ( u ) (solid lines) and the empirical bias of the Sun’s estimator (dashed lines) [8]. Sun considered the double censored situation while we only consider the right-censored event time to suit a more general situation. In the second row of the plots, we depicted the empirical mean squared error (MSE) versus the expected frequency u. The third row presents the coverage probabilities of 95% confidence intervals obtained from the proposed estimator (solid lines) and Sun’s estimator (dashed lines). We can see that both methods have a slight bias which converges to 0, and the empirical MSE also tends to be stable as u increases. In the homogeneous Poisson process, when the variance of gamma frailty equals 0, the empirical MSE shows less fluctuation compared to the variance that is equal to 0.5.
Figure 5 depicts the same parameters as Figure 3 and Figure 4, while the difference is that the results are from the non-homogeneous Poisson process. We can see that, in the first and second rows of Figure 5, both methods performed quite similarly in terms of the bias and SD. However, from the third row, in terms of the coverage probability, the proposed estimator (solid lines) performed slightly better than the Sun’s estimator (dashed lines), than for the simulation of the non-homogeneous Poisson process.

4. Application to the Bladder Tumor Studies

We applied our proposed estimated method to a well-known bladder tumor study [15]. This dataset was conducted to analyze the effect of two treatments, pyridoxine and thiotepa, based on the recurrence of bladder tumors. A total of 118 subjects were recorded, 48 were treated with placebo, 32 were pyridoxine, and 38 with thiotepa. The covariates also contained the initial number of tumors, the size of the largest initial tumor and others. The maximum observed number of recurrences is 9.
We selected four covariates and the intercept: the two treatment methods and the initial tumor size and number. In Figure 6, we displayed our estimation result of the proposed method (solid lines) and Sun’s method (gray line) [8], surrounded by point-wise Wald 95% confidence intervals (dashed lines). The grid-based estimation method estimated the regression coefficients over [0, 1.6]. The intercept coefficient estimates represent the log time to the expected frequency of the bladder tumor recurrence, consisting of patients who had no pyridoxine and thiotepa treatment. The intercept term indicates that as time increases, the bladder tumor recurrence also increases, which is in line with expectations. The negative non-intercept coefficient estimates pyridoxine and thiotepa show that these two treatments can inhibit tumor recurrence. In contrast, the initial tumor size and number of covariates are positive, suggesting a negative effect on tumor recurrence. From Figure 6, we can see that our method and Sun’s method do not have much difference, and the overall trend is the same.

5. Discussion

In this article, we introduced the accelerated recurrence time model for recurrence events, and then considered a new situation when the expected frequency of the accelerated recurrence time model was affected by the time-varying and combined this model with the counting process model. This new counting process model with a non-singleton general function is similar to Cox’s regression model but can be transformed into quantile regression. This method can also estimate more general situations, such as the non-homogeneous Poisson process.
We can generalize our estimation procedure from double-censored to right-censored situations. In the recurrence events setting, the most popular choice of G ( · ) , which may be G ( u ) = u , as well as G ( u ) = ln ( 1 u ) , we introduce our new estimation method with G ( u ) = u t . The assumption is that a constant hazard is rarely tenable in practical problems. Therefore, in the parametric estimation process, a more general distribution of the hazard function is the Weibull distribution, which is g ( t ) = λ γ t γ 1 for 0 < t < , so we have G ( u ) = u t γ . When γ = 0 , the Weibull distribution function becomes a constant, but resembles the accelerated recurrence time model. When γ = 1 , the model became our proposed method. Thus, after combining the parametric and nonparametric methods, we can make the estimation method more diverse and adapt to different types of datasets.

Author Contributions

Conceptualization, J.X.; Data curation, X.W.; Formal analysis, X.W.; Investigation, X.W. and J.X.; Methodology, J.X. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A

Proof of Theorem 1.
If we want to estimate the β 0 , the left-hand side of the following equation should be monotone,
E ( X R exp ( X T β ) exp ( X T β ) u ) = 0
taking derivative with respect to β on both sides of equation that
μ ˙ Z ( e X T β ( u ) ) F Z ( e X T β 0 ( u ) ) e X T β ( u ) X T d β u e X T β ( u ) X T d β ,
then, under condition (2), the Equation (A1) can be monotone. Meanwhile, straightforward algebra gives
ϕ ( β ; Z , L ) = E j = 1 M ( X T β log L log T ( j ) ) + exp ( X T β log L ) u Z , L = X T β log L μ Z exp ( s ) d s exp ( X T β log L ) u
and for any β β 0 ( u ) ,
E ϕ ( β 0 ( u ) ; Z , L ) = ψ ( β 0 ( u ) ; u ) < ψ ( β ; u ) = E ϕ ( β ( u ) ; Z , L ) ,
in order to eliminate the minimization operator, we consider the following situations:
  • when log L X T β and log L X T β 0 ,
    ϕ ( β 0 ( u ) ; Z , L ) = ϕ ( β ; Z , L ) .
  • when log L > X T β and log L X T β 0 ,
    ϕ ( β 0 ( u ) ; Z , L ) ϕ ( β ; Z , L ) = X T β l o g L μ Z exp ( s ) d s u exp ( l o g L ) exp ( X T β ) = X T β l o g L μ Z exp ( s ) u exp ( s ) d s 0 .
  • when log L > X T β 0 , since exp ( β 0 ( u ) ) is the unique value that minimizes
    x μ Z exp ( s ) d s e x u ,
    considering the monotonicity of μ Z ( · ) , therefore ϕ ( β 0 ( u ) ; Z , L ) ϕ ( β ; Z , L ) . By the non-singularity of E X 2 I L > τ Z ( u ) ,
    E ϕ ( β 0 ( u ) ; Z , L ) I log L > X T β 0 ( u ) E ϕ ( β ; Z , L ) I log L > X T β 0 ( u ) .
Thus, the proof is completed. □
Proof of Theorem 2.
Define α 0 ( u ) = μ ( β 0 ( u ) ) , α ^ ( u ) = μ ( β ^ ( u ) ) , and
B ( d ) = b R p + 1 : inf u ( 0 , U ] μ ( b ) μ ( β 0 ( u ) ) d , A ( d ) = μ ( b ) : b B ( d ) .
Noted that for any b 1 , b 2 B ( d ) ,
( b 1 b 2 ) μ ( b 1 ) μ ( b 2 ) = E ( X T b 1 X T b 2 ) N ( e x p ( X T b 1 ) ) N ( e x p ( X T b 2 ) ) = 0 ,
which only occurs when b 1 = b 2 by condition (C3’). Then, there exists a one-to-one map from B ( d ) to A ( d ) that we denoted by κ ( · ) and κ μ ( b ) = b for any b B ( d ) .
Furthermore, by the properties of the martingale stochastic process
M ( u ) = N ( exp ( X T β 0 ( u ) ) 0 u Y ( e X T β 0 ( s ) ) g ( s ) d s ,
we have
n 1 i = 1 n X i N i ( exp X i T β ^ ( u ) ) = n 1 i = 1 n 0 u X i I exp X i T β ^ ( s ) L i g ( s ) d s + ξ n , k
where ξ n , k = O p ( n 1 ) .
Then, the following notations are defined:
v n ( b ) = n 1 i = 1 n X i N i ( exp X i T b ) μ ( b ) , v ˜ n ( b ) = n 1 i = 1 n X i I ( exp X i T b L i ) μ ˜ ( b ) .
By the Glivenko–Cantelli theorem, we have
sup b v n ( b ) a . s . 0 , sup b v ˜ n ( b ) a . s . 0 .
Since E ( X N ( exp X T β 0 ( u ) ) ) = E ( X 0 u I ( L exp X T β 0 ( s ) ) d G ( u ) ) . Then, α ^ ( u ) α 0 ( u ) is equivalent to
v n ( β ^ ( u ) ) + 0 u v ˜ n β ^ ( s ) d G ( s ) + k = 1 j u k 1 u k μ ˜ ( β ^ ( s ) ) μ ˜ ( β 0 ( s ) ) d G ( s ) + ξ n , k .
By the arguments as [5], the sup u [ u k 1 , u k ) α ^ ( u ) α 0 ( u ) can be bounded almost surely by ε k = 1 + C 4 b n k 1 ( C 1 + ε 0 C 4 b n + C 2 n 1 + C 3 a n ) , where ε 0 = C 3 a n . Given lim n a n = 0 and L ( n ) = U / a n , we have
lim n ( 1 + C 4 b n ) L ( n ) 1 = exp C 4 U / ( 1 U ) ,
where C 1 , C 2 , C 3 , and C 4 are some positive constant. Since C i are arbitrarily and lim n a n = 0 , then we can know that sup u ( 0 , U ] α ^ ( u ) α 0 ( u ) P 0 . By the application of the Taylor expansion of κ α ^ ( u ) around α 0 ( u ) , the conclusion sup u ( v , U ] β ^ ( u ) β 0 ( u ) P 0 can be reached. □
Proof of Theorem 3.
From the arguments made by [13], we have the following lemma:
sup u ( 0 , U ] n 1 / 2 i = 1 n X i N i ( e X i T β ^ ( u ) ) N i ( e X i T β 0 ( u ) ) n 1 / 2 μ ( β ^ ( u ) ) μ ( β 0 ( u ) ) P 0 , sup u ( 0 , U ] n 1 / 2 i = 1 n X i I ( L i e X i T β ^ ( u ) ) I ( L i e X i T β 0 ( u ) ) n 1 / 2 μ ˜ ( β ^ ( u ) ) μ ˜ ( β 0 ( u ) ) P 0 .
and the above lemma implies that
n 1 / 2 S n ( β 0 , u ) = n 1 / 2 μ ( β ^ ( u ) ) μ ( β 0 ( u ) ) 0 u n 1 / 2 μ ˜ ( β ^ ( s ) ) μ ˜ ( β 0 ( s ) ) d G ( s ) + o ( 0 , U ] ( 1 ) = n 1 / 2 μ ( β ^ ( u ) ) μ ( β 0 ( u ) ) 0 u J ( β 0 ( u ) ) B ( β 0 ( u ) ) 1 + o ( 0 , U ] ( 1 ) n 1 / 2 μ ( β ^ ( u ) ) μ ( β 0 ( u ) ) d G ( s ) + o ( 0 , U ] ( 1 )
since lim n n 1 / 2 S L ( n ) = 0 , then n 1 / 2 S n ( β ^ , u ) = o ( 0 , U ] ( 1 ) . Then, we can use the product integration theory to obtain the following equation:
n 1 / 2 μ ( β ^ ( u ) ) μ ( β 0 ( u ) ) = φ n 1 / 2 S n β 0 , u + o ( 0 , U ] ( 1 ) ,
where φ ( g ) ( u ) = 0 u I ( s , u ) d g ( s ) , which is a one-to-one map from F to F . By our definition, F = g : [ 0 , U ] R p + 1 and g is left-continuous with a right limit, g ( 0 ) = 0 . Meanwhile,
I ( s , t ) = π u ( s , t ] I p + 1 + J ( β 0 ( u ) ) B ( β 0 ( u ) ) 1 d G ( u ) .
Since the definition given by [16] is followed, X i N i ( exp X i β 0 ( s ) ) , s ( 0 , U ] is a VC-class. For 0 u I L i exp X i T β 0 ( s ) d G ( s ) , let h x = 0 x I L i exp X i T β 0 ( s ) d G ( s ) , suppose that x 1 x 2 , then
h ( u 1 ) h ( u 2 ) = u 2 u 1 I L i exp X i T β 0 ( s ) d G ( s ) K u 1 u 2 ,
where K = max ( exp X i T β 0 ( s ) , s [ u 1 , u 2 ] ) , this implies that h ( x ) is a Lipschitz in x. The above arguments show that
X i N i ( exp X i β 0 ( s ) ) 0 u I L i exp X i T β 0 ( s ) d G ( s ) , u ( 0 , U ]
is a Donsker class. By the Donsker theorem, we can know that n 1 / 2 S n converge weakly to a Gaussian process, then φ n 1 / 2 S n β 0 , u is also a Gaussian process for φ ( · ) is a linear operator. By the continuous mapping theory and Taylor expansion of κ μ β ^ ( u ) around κ μ β 0 ( u ) , we can know that n 1 / 2 β ^ ( u ) β 0 ( u ) converge weakly to a Gaussian process with a mean of zero and a covariance matrix Σ ( s , t ) = E ζ 1 ( s ) ζ 1 ( t ) , where ζ i ( u ) = B ( β 0 ( u ) ) 1 φ ( s i ) , and s i ( u ) = N i ( exp X i T β ( u ) ) 0 u I ( L i exp X i T β ( s ) ) d G ( s ) . □

Recurrent Event Setting

Suppose model 6 holds, then according to the following definition:
E N ( e X T β 0 ( u ) ) | Z = μ Z ( e X T β 0 ( u ) L ) ,
and the inequality e X T β 0 ( u ) L can also be written as G ( u ) μ Z ( L ) following the definition of τ Z ( u ) . Then
E 0 u Y ( e X T β 0 ( s ) ) g ( s ) d s | Z = E 0 I ( G ( s ) μ Z ( L ) ) d G ( s ) | L , Z .
In model 6, G ( u ) = μ Z X T β 0 ( u ) , so it follows from the above equation that for u ( 0 , U ] ,
E 0 u Y ( e X T β 0 ( s ) ) g ( s ) d s Z = μ Z ( L e X T β 0 ( u ) ) .
Therefore, the model 5 is satisfied.
Suppose that model 5 holds, then taking the derivative with respect to u on both sides of equation in model 5, we have
μ ˙ Z ( e X T β ( u ) ) F Z ( e X T β 0 ( u ) ) e X T β ( u ) X T d β = F Z ( e X T β 0 ( u ) ) g ( u ) d u ,
that is, d μ Z ( e X T β ( u ) ) = g ( u ) d u , then μ Z ( e X T β ( u ) ) = G ( u ) and model 6 are satisfied.

References

  1. Nelson, W. Hazard plotting for incomplete failure data. J. Qual. Technol. 1969, 1, 27–52. [Google Scholar] [CrossRef]
  2. Altshuler, B. Theory for the measurement of competing risks in animal experiments. Math. Biosci. 1970, 6, 1–11. [Google Scholar] [CrossRef]
  3. Lawless, J.F. Regression methods for Poisson process data. J. Am. Stat. Assoc. 1987, 82, 808–815. [Google Scholar] [CrossRef]
  4. Lin, D.Y.; Wei, L.J.; Yang, I.; Ying, Z. Semiparametric regression for the mean and rate functions of recurrent events. J. R. Stat. Soc. Ser. B (Stat. Methodol.) 2000, 62, 711–730. [Google Scholar] [CrossRef] [Green Version]
  5. Peng, L.; Huang, Y. Survival analysis with quantile regression models. J. Am. Stat. Assoc. 2008, 103, 637–649. [Google Scholar] [CrossRef]
  6. Koenker, R.; Bassett, G., Jr. Regression quantiles. Econom. J. Econom. Soc. 1978, 46, 33–50. [Google Scholar] [CrossRef]
  7. Powell, J.L. Censored regression quantiles. J. Econom. 1986, 32, 143–155. [Google Scholar] [CrossRef]
  8. Sun, X.; Peng, L.; Huang, Y.; Lai, H.J. Generalizing quantile regression for counting processes with applications to recurrent events. J. Am. Stat. Assoc. 2016, 111, 145–156. [Google Scholar] [CrossRef]
  9. Wang, H.J.; Wang, L. Locally weighted censored quantile regression. J. Am. Stat. Assoc. 2009, 104, 1117–1128. [Google Scholar] [CrossRef] [Green Version]
  10. Kim, M.O.; Yang, Y. Semiparametric approach to a random effects quantile regression model. J. Am. Stat. Assoc. 2011, 106, 1405–1417. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  11. Andersen, P.K.; Gill, R.D. Cox’s regression model for counting processes: A large sample study. Ann. Stat. 1982, 10, 1100–1120. [Google Scholar] [CrossRef]
  12. Cox, D.R. Regression models and life-tables. J. R. Stat. Soc. Ser. B (Methodol.) 1972, 34, 187–202. [Google Scholar] [CrossRef]
  13. Huang, Y.; Peng, L. Accelerated recurrence time models. Scand. J. Stat. 2009, 36, 636–648. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  14. Fygenson, M.; Ritov, Y. Monotone estimating equations for censored data. Ann. Stat. 1994, 22, 732–746. [Google Scholar] [CrossRef]
  15. Byar, D. The veterans administration study of chemoprophylaxis for recurrent stage i bladder tumours: Comparisons of placebo, pyridoxine and topical thiotepa. In Bladder Tumors and Other Topics in Urological Oncology; Springer: Berlin/Heidelberg, Germany, 1980; pp. 363–370. [Google Scholar]
  16. Vaart, A.V.D.; Wellner, J.A. Weak convergence and empirical processes with applications to statistics. J. R. Stat. Soc.-Ser. A Stat. Soc. 1997, 160, 596–608. [Google Scholar]
Figure 1. Estimation results of homogeneous Poisson process.
Figure 1. Estimation results of homogeneous Poisson process.
Mathematics 10 02662 g001
Figure 2. Estimation results of non-homogeneous Poisson process.
Figure 2. Estimation results of non-homogeneous Poisson process.
Mathematics 10 02662 g002
Figure 3. Simulation results with sample size n = 100 and the set-up with Gamma frailty of variance 0.
Figure 3. Simulation results with sample size n = 100 and the set-up with Gamma frailty of variance 0.
Mathematics 10 02662 g003
Figure 4. Simulation results with sample size n = 100 and the set-up with Gamma frailty of variance 0.5.
Figure 4. Simulation results with sample size n = 100 and the set-up with Gamma frailty of variance 0.5.
Mathematics 10 02662 g004
Figure 5. Simulation results of non-homogeneous Poisson process with sample size n = 100 and the set-up with Gamma frailty of variance 0.
Figure 5. Simulation results of non-homogeneous Poisson process with sample size n = 100 and the set-up with Gamma frailty of variance 0.
Mathematics 10 02662 g005
Figure 6. Bladder data example: coefficient estimates and 95% pointwise confidence.
Figure 6. Bladder data example: coefficient estimates and 95% pointwise confidence.
Mathematics 10 02662 g006
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Wen, X.; Xu, J. Generalized Accelerated Failure Time Models for Recurrent Events. Mathematics 2022, 10, 2662. https://doi.org/10.3390/math10152662

AMA Style

Wen X, Xu J. Generalized Accelerated Failure Time Models for Recurrent Events. Mathematics. 2022; 10(15):2662. https://doi.org/10.3390/math10152662

Chicago/Turabian Style

Wen, Xiaoyi, and Jinfeng Xu. 2022. "Generalized Accelerated Failure Time Models for Recurrent Events" Mathematics 10, no. 15: 2662. https://doi.org/10.3390/math10152662

APA Style

Wen, X., & Xu, J. (2022). Generalized Accelerated Failure Time Models for Recurrent Events. Mathematics, 10(15), 2662. https://doi.org/10.3390/math10152662

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop