Next Article in Journal
The #-Filter Anti-Aliasing Based on Sub-Pixel Continuous Edges
Next Article in Special Issue
Receiver Operating Characteristic Prediction for Classification: Performances in Cross-Validation by Example
Previous Article in Journal
On Minimal and Maximal Hyperidealsin n-ary Semihypergroups
Previous Article in Special Issue
Common Medical and Statistical Problems: The Dilemma of the Sample Size Calculation for Sensitivity and Specificity Estimation
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Definition and Estimation of Covariate Effect Types in the Context of Treatment Effectiveness

Clinical Research Center, Kindai University Hospital, 377-2, Ohno-higashi, Osaka-sayama, Osaka 589–8511, Japan
Mathematics 2020, 8(10), 1657; https://doi.org/10.3390/math8101657
Submission received: 27 August 2020 / Revised: 22 September 2020 / Accepted: 24 September 2020 / Published: 25 September 2020
(This article belongs to the Special Issue Applied Medical Statistics: Theory, Computation, Applicability)

Abstract

:
In some clinical studies, assessing covariate effect types indicating whether a covariate is predictive and/or prognostic is of interest, in addition to the study endpoint. Recently, for a case with a binary outcome, Chiba (Clinical Trials, 2019; 16: 237–245) proposed the new concept of covariate effect type, which is assessed in terms of four response types, and showed that standard subgroup or regression analysis is applicable only in certain cases. Although this concept could be useful for supplementing conventional standard analysis, its application is limited to cases with a binary outcome. In this article, we aim to generalize Chiba’s concept to continuous and time-to-event outcomes. We define covariate effect types based on four response types. It is difficult to estimate the response types from the observed data without making certain assumptions, so we propose a simple method to estimate them under the assumption of independent potential outcomes. Our approach is illustrated using data from a clinical study with a time-to-event outcome.

1. Introduction

In certain clinical studies, it is important to assess covariate effect types to indicate whether a covariate is predictive and/or prognostic in the context of treatment effectiveness. Clark et al. [1] defined a predictive factor as a factor associated with the response or lack of response to a particular therapy, while a prognostic factor is a factor associated with the clinical outcome in the absence of therapy, or with the application of a standard therapy. To assess whether a covariate is predictive and/or prognostic, subgroup or regression analysis is often used [2]. For a case with a binary outcome, on the difference scale, a binary covariate Z is predictive if
δ { Pr ( Y = 1 | X = 1 ,   Z = 1 ) Pr ( Y = 1 | X = 0 ,   Z = 1 ) } { Pr ( Y = 1 | X = 1 ,   Z = 0 ) Pr ( Y = 1 | X = 0 ,   Z = 0 ) }
is not zero, where X is the treatment arm ( X = 1 for the experimental treatment and X = 0 for the control treatment) and Y is the outcome ( Y = 1 for response and Y = 0 for no response). If δ is close to zero, Z is prognostic if
γ Pr ( Y = 1 | X = 0 ,   Z = 1 ) Pr ( Y = 1 | X = 0 ,   Z = 0 )
is not zero.
Recently, for cases with a binary outcome, Chiba [3] proposed assessing covariate effect types based on four response types [4] defined in terms of the potential outcome of Y if X is x , Y ( x ) , rather than δ and γ as above. The four response types are defined as follows:
  • Activated subjects, who would show a response regardless of the treatment they received; that is, { Y ( 1 ) ,   Y ( 0 ) } = ( 1 ,   1 ) ;
  • Causative subjects, who would show a response only if they received the experimental treatment; that is, { Y ( 1 ) ,   Y ( 0 ) } = ( 1 ,   0 ) ;
  • Preventive subjects, who would show a response only if they received the control treatment; that is, { Y ( 1 ) ,   Y ( 0 ) } = ( 0 ,   1 ) ;
  • Inert subjects, who would not show a response regardless of the treatment they received; that is, { Y ( 1 ) ,   Y ( 0 ) } = ( 0 ,   0 ) .
Based on the four response types, measures of the covariate effect types are defined as follows [3]:
η k l Pr { Y ( 1 ) = k ,   Y ( 0 ) = l | Z = 1 } Pr { Y ( 1 ) = k ,   Y ( 0 ) = l | Z = 0 }
for k ,   l = 0 ,   1 . Then, η 10 can be used instead of a current common measure for predicting the effectiveness of the experimental treatment, i.e., δ in (1). η 10 > 0 indicates that the proportion of subjects who would show a response only if they received the experimental treatment is higher in the subgroup where Z = 1 than in the subgroup where Z = 0 . In addition, we can consider predictability for the effectiveness of the control treatment (or the harm caused by the experimental treatment) by η 01 . Instead of a current common measure for a prognostic factor, i.e., γ in (2), η 11 can be used. η 11 > 0 indicates that the proportion of subjects who would show a response regardless of the treatment they received is higher in the subgroup where Z = 1 than in the subgroup where Z = 0 .
Applying the potential outcome Y ( x ) to (1) under the assumption of Y ( x ) X | Z , which indicates that Y ( x ) is independent from X given Z , the following relationship between (1) and (3) is obtained:
δ = [ Pr { Y ( 1 ) = 1 | Z = 1 } Pr { Y ( 0 ) = 1 | Z = 1 } ] [ Pr { Y ( 1 ) = 1 | Z = 0 } Pr { Y ( 0 ) = 1 | Z = } ] = η 10 η 01 .
Similarly, the following relationship holds between (2) and (3):
γ = Pr { Y ( 0 ) = 1 | Z = 1 } Pr { Y ( 0 ) = 1 | Z = 0 } = η 11 + η 01 .
Obviously, if η 01 0 , the currently used definition of covariate effect types does not correspond to the new definition in (3). Except for the special case of η 01 = 0 , under the current definition, the two characteristics of covariate effect cannot be separated from each other. In actual clinical studies, the aim is to assess covariate effect type on the basis of (3) rather than (1) and (2). However, (3) can be applied only when an outcome is binary and not when an outcome is continuous or time-to-event-based. In this article, we aim to generalize (3) to continuous and time-to-event outcomes. We provide a definition of measures of covariate effect types in Section 2 and propose a simple method to estimate those measures in Section 3. In Section 4, our approach is illustrated using data from a clinical study with a time-to-event outcome. Finally, in Section 5, the performance of the proposed approach and directions for future work in this area are discussed.

2. Definition of Measures of Covariate Effect Types

In the following, we mainly discuss a case with a time-to-event outcome T . However, this discussion can also be applied to a case with a continuous outcome as a special case without censored data in which a larger value represents a better response. X and Z are the same as in Section 1; i.e., X is a binary treatment arm and Z is a binary covariate. T ( x ) is the potential outcome of T if X had been set to x . Unfortunately, it is not possible to observe the values of both T ( 1 ) and T ( 0 ) ; i.e., T = T ( 1 ) X + T ( 0 ) ( 1 X ) [5]. We use the stable unit treatment value assumption under which a single version of each treatment is available and there is no interference among subjects [6,7].
When an outcome is time-to-event, the response type { T ( 1 ) ,   T ( 0 ) } cannot be classified into four types unlike in the case of a binary outcome described in Section 1. However, if we set a cut-off point t c , the time-to-event outcome can be regarded as a binary variable; then, we can consider the following four response types.
Definition 1 
(Response type).Using a cut-off point t c , the response type can be classified into the following four types:
  • Activated subjects (type 11 subjects), whose outcomes would be larger than or equal to t c , regardless of the treatment they received; that is, { T ( 1 ) t c ,   T ( 0 ) t c } ;
  • Causative subjects (type 10 subjects), whose outcomes would be larger than or equal to t c if they received the experimental treatment, but would be smaller than t c if they received the control treatment; that is, { T ( 1 ) t c ,   T ( 0 ) < t c } ;
  • Preventive subjects (type 01 subjects), whose outcomes would be smaller than t c if they received the experimental treatment, but would be larger or equal to t c if they received the control treatment; that is, { T ( 1 ) < t c ,   T ( 0 ) t c } ;
  • Inert subjects (type 00 subjects), whose outcomes would be smaller than t c , regardless of the treatment they received; that is, { T ( 1 ) < t c ,   T ( 0 ) < t c } .
All subjects belong to one of these four types, although we cannot know which type a subject belongs to. We denote a proportion of the k l subjects of the total subjects as P k l ( t c ) ; i.e.,
P k l ( t c ) [ Pr { T ( 1 ) t c ,   T ( 0 ) t c } ] k l [ Pr { T ( 1 ) t c ,   T ( 0 ) < t c } ] k ( 1 l ) × [ Pr { T ( 1 ) < t c ,   T ( 0 ) t c } ] ( 1 k ) l [ Pr { T ( 1 ) < t c ,   T ( 0 ) < t c } ] ( 1 k ) ( 1 l ) ,
and the proportion in the stratum with Z = z as P k l , z ( t c ) . Then, P 11 ( t ) + P 10 ( t ) = Pr { T ( 1 ) t } corresponds to the survival probability under the experimental condition and P 11 ( t ) + P 01 ( t ) = Pr { Y ( 0 ) t } is that under the control condition. For a continuous outcome, they correspond to one minus the cumulative density function.
Using P k l , z ( t c ) , we can simply define measures of covariate effect types at the cut-off point t c by
θ k l ( t c ) P k l , 1 ( t c ) P k l , 0 ( t c ) .
This is essentially the same as η k l in (3). Unfortunately, θ k l ( t c ) with a fixed t c cannot be applied as a general definition of measures of covariate types. If there is a medically meaningful cut-off point, the outcome is no longer time-to-event (continuous); rather, it is binary. If the cut-off point t c has no medical significance, it is not valuable to use a fixed cut-off point. P k l ( t c ) and P k l , z ( t c ) with a fixed cut-off point t c may also not be useful. Using these probabilities with t c changed on the interval, we define the restricted mean probability (RMP) as follows.
Definition 2 
(Restricted mean probability).The restricted mean probability for response type on the interval [ t a ,   t b ] , P ¯ k l , is defined by the following formula:
P ¯ k l 1 t b t a t a t b P k l ( t c ) d t c
for k ,   l = 0 ,   1 .
In this definition, t a and t b will usually be set to t a = 0 and t b = t τ , respectively, where t τ is the truncation time; for a continuous outcome, these values will be set to the minimum and maximum values of the observed outcome, respectively. P ¯ k l , z in the stratum with Z = z is defined in the same manner. P ¯ k l is the restricted expectation of P k l ( t c ) on the interval [ t a ,   t b ] . Then, P ¯ k l is interpreted as the mean proportion of type k l subjects when the cut-off point is changed from t a to t b .
It is important to note that P ¯ k l is related to E [ min { max { T ( x ) ,   t a } ,   t b } ] , which is the restricted expectation of the potential outcome T ( x ) on the interval [ t a ,   t b ] .
Lemma 1. 
We have the following relationships between P ¯ k l and E [ min { max { T ( x ) ,   t a } ,   t b } ] :
E [ min { max { T ( 1 ) ,   t a } ,   t b } ] = ( t b t a ) ( P ¯ 11 + P ¯ 10 ) + t a ,   E [ min { max { T ( 0 ) ,   t a } ,   t b } ] = ( t b t a ) ( P ¯ 11 + P ¯ 01 ) + t a .
The proof is given in Appendix A. The difference between these two equations indicates that the restricted average causal effect on the interval [ t a ,   t b ] can be expressed as ( t b t a ) ( P ¯ 10 P ¯ 01 ) . Notably, when t a = 0 and t b = t τ , t τ ( P ¯ 11 + P ¯ 10 ) and t τ ( P ¯ 11 + P ¯ 01 ) correspond to the restricted mean survival times (RMSTs) [8,9,10], which are equal to the areas under the survival curves on the interval [ 0 , t τ ] under the experimental and control conditions, respectively. Similarly, t τ ( P ¯ 01 + P ¯ 00 ) and t τ ( P ¯ 10 + P ¯ 00 ) correspond to the restricted mean time lost [9] under the experimental and control conditions, respectively.
We give a general definition of measures of covariate effect types using P ¯ k l , z , which is the RMP of a given response type in the stratum with Z = z .
Proposition 1. 
We define a measure of covariate effect type as θ ¯ k l P ¯ k l , 1 P ¯ k l , 0 for k ,   l = 0 ,   1 .
The interpretation of θ ¯ k l is similar to that of η k l in (3) for a case with a binary outcome. As a prediction measure, we use θ ¯ 10 and θ ¯ 01 . θ ¯ 10 > 0 implies the subgroup where Z = 1 contains a higher proportion of subjects who would survive longer by receiving the experimental treatment than the subgroup where Z = 0 . Then, Z predicts the effectiveness of the experimental treatment, and we say that Z is “augmented-causative”. If θ ¯ 10 < 0 , we say that Z is “depleted-causative”. In a similar sense, θ ¯ 01 is a prediction measure of the effectiveness of the control treatment (or for the harm associated with proceeding with the experimental treatment).
We note that for a case with a time-to-event (continuous) outcome, we can obtain results similar to (4) and (5) when the outcome is binary. Using Proposition 1 and Lemma 1, the prediction and prognosis measures based on conventional standard analysis, δ and γ , can be expressed as functions of θ ¯ k l on the difference scale.
Corollary 1. 
We have the following equations:
δ ( E 1 , 1 E 0 , 1 ) ( E 1 , 0 E 0 , 0 ) = ( t b t a ) ( θ ¯ 10 θ ¯ 01 ) , γ E 0 , 1 E 0 , 0 = ( t b t a ) ( θ ¯ 11 + θ ¯ 01 ) ,
where E x , z E [ min { max { T ( x ) ,   t a } ,   t b } | Z = z ] .
On the basis of this corollary, θ ¯ 11 in Proposition 1 can be related to the prognosis under the current standard analysis. Similar to the currently used standard analysis, if θ ¯ 10 is close to θ ¯ 01 , which implies that δ is close to zero, Z is prognostic if θ ¯ 11 is not zero. θ ¯ 11 > 0 implies that the mean survival time is longer for subjects in the subgroup where Z = 1 than the subgroup where Z = 0 , regardless of the treatment they received. In this case, we say that Z is “augmented-activated”. In contrast, when θ ¯ 11 < 0 , we say that Z is “depleted-activated”.
Finally, we emphasize that Corollary 1 indicates that the results of the current standard analysis can be properly interpreted when θ ¯ 01 = 0 ; however, this may not be the case when θ ¯ 01 0 . On the other hand, our measure of covariate effect type can assess whether a covariate is predictive and/or prognostic, even when θ ¯ 01 0 .

3. Estimation of Measures of Covariate Effect Types

Unfortunately, θ ¯ k l in Proposition 1 cannot be identified based on the observed data without making certain assumptions, because the joint probability of Y ( 1 ) and Y ( 0 ) cannot be estimated [11,12]. Therefore, we use two assumptions that are often applied in the context of the pairwise comparison based on Pr { T ( x ) > T ( 1 x ) } [13,14,15].
Assumption 1 
(Ignorable treatment assignment).The potential outcome T ( x ) is independent of the treatment arm X ; i.e., T ( x ) X . It is also assumed that T ( x ) X | Z .
Assumption 2 
(Independent potential outcome).Two potential outcomes are independent of each other; i.e., T ( 1 ) T ( 0 ) .
Assumption 1 is often made in randomized trials. Although some authors have discussed methods to infer Pr { T ( x ) > T ( 1 x ) } without Assumption 2 [16,17], the methods are impractical as they tend to be complex and/or require considerable computational effort.
Corollary 2. 
P k l ( t c ) in (6) is identified under Assumptions 1 and 2 as follows:
P k l ( t c ) = [ Pr { T t c | X = 1 } ] k [ Pr { T t c | X = 1 } ] 1 k × [ Pr { T t c | X = 0 } ] l [ Pr { T t c | X = 0 } ] 1 l .
P k l , z ( t c ) in the stratum with Z = z is also identified under Assumptions 1 and 2, thus allowing P ¯ k l , z in Definition 2 and θ ¯ k l in Proposition 1 to be identified.
We propose a method to estimate P ¯ k l in Definition 2 based on the formula in Corollary 2. Let us denote the observed event occurrence or censoring time as t 1 < < t j < < t D in which the time points in both arms are included. Then, we can use the cut-off point t c as a discrete value taking only the same value as t j ; that is, it is sufficient to consider t 1 ,   ,   t D as the cut-off points. We also set the interval [ t a ,   t b ] to [ 0 ,   t τ ] , where t τ   ( t D ) is the truncation time. These settings yield the following formula to estimate P ¯ k l .
Proposition 2. 
For a time-to-event outcome, we estimate P k l , z ( t c ) with truncation time t τ under Assumptions 1 and 2 as follows:
P ^ k l ( t c ) = { S ^ 1 ( t c 1 ) } k { 1 S ^ 1 ( t c 1 ) } 1 k { S ^ 0 ( t c 1 ) } l { 1 S ^ 0 ( t c 1 ) } 1 l ,
where t 0 = 0 and S x ( t c 1 ) Pr { T t c | X = x } is the survival probability estimated using the Kaplan–Meier method. Then, we estimate P ¯ k l by
P ¯ ^ k l = 1 τ c = 1 τ ( t c t c 1 ) P ^ k l ( t c ) .
Censoring is considered when estimating S x ( t c l by the Kaplan–Meier method. In a similar manner, P k l , z ( t c ) and P ¯ k l , z are estimated by applying Proposition 2 to the stratum with Z = z. Thus, θ k l , z ( t c ) , in (7) and θ k l , z in Proposition 1 can be estimated based on the observed data. It is obvious from Proposition 2 that τ ( P ¯ ^ 11 + P ¯ ^ 10 ) and τ ( P ¯ ^ 11 + P ¯ ^ 01 ) are consistent with the estimators of the RMST in the arm with X = x.
To derive the formulas to estimate P k l ( t c ) and P ¯ k l for a continuous outcome without censoring, let us suppose that t1 < ⋯ < tj < ⋯ tD are the observed values of the continuous outcome. The interval [ta,tb] is set to [t1,tD]. Here, we denote the number of subjects taking the value tj in the experimental arm as nj and that in the control arm as mj. Then, P k l ( t j ) and P ¯ k l are estimated by the following formulas.
Proposition 3. 
For a continuous outcome, we estimate P k l ( t c )   a n d   P ¯ k l under Assumptions 1 and 2 by
P ^ k l ( t c ) = ( j c n j n ) k ( j c 1 n j n ) 1 k ( j c m j m ) l ( j c 1 m j m ) 1 l , P ¯ ^ k l = 1 t D t 1 c = 2 D ( t c t c 1 ) P ^ k l ( t c ) ,
where n = j = 1 D n j and m = j = 1 D m j .
Let us denote the outcome values as t j 1 1 ( j 1 = 1 ,   ,   D 1 ) for subjects in the experimental arm and t j 0 0 ( j 0 = 1 ,   ,   D 0 ) for those in the control arm. Then, in relation to Lemma 1, we have Lemma 2 below.
Lemma 2. 
When P ^ k l ( t c )   a n d   P ¯ ^ k l in Proposition 3 are used, the following equations hold:
( t D t 1 ) ( P ¯ ^ 11 + P ¯ ^ 10 ) + t 1 = 1 n j 1 = 1 D 1 n j 1 t j 1 1 , ( t D t 1 ) ( P ¯ ^ 11 + P ¯ ^ 01 ) + t 1 = 1 m j 0 = 1 D 0 m j 0 t j 0 0 .
The proof is given in Appendix B. In this lemma, the right sides correspond to the arithmetic means in the arm with X = x . As the arithmetic mean is a plausible estimate of E{T|X = x}, the left sides are plausible estimates of unrestricted rather than restricted expectation of T ( x ) , E { T ( x ) } , under Assumption 1.

4. Illustration

We illustrate our measure of covariate effect type using the clinical data set of Ohashi and Hamada [18]. The purpose of that study was to determine whether the survival time is longer with use of radiation during surgery ( X = 1 ) than without it ( X = 0 ) in subjects with pancreatic cancer. As it was an observational study, Assumption 1 would not hold. Nevertheless, we analyzed the data under this assumption, so the following analyses are for illustrative purposes only.
We explored the covariate effect type in the context of radiation effectiveness in the pancreatic cancer site, which was subclassified as pancreatic head ( Z = 1 ) or “other” ( Z = 0 ). In the pancreatic head subgroup, 51 subjects received radiation during surgery and 14 subjects did not. In the other subgroup, nine subjects (including one censored subject) received radiation and nine did not. The survival curves in both subgroups are shown in Figure 1. Table 1 summarizes the RMST estimates and differences between arms in both subgroups, with 95% confidence intervals (CIs). For the estimation, the truncation time was set to the maximum observed time, 21.6 (months), because the final survival probability was zero in both arms in both subgroups.
Before applying our approach, we first applied the current standard analysis given by δ and γ in Corollary 1. Using the RMST estimates in Table 1, δ ^ = 4.349 (95% CI: –8.687, –0.011) and γ ^ = 0.106 (95% CI: –2.635, 2.422). As δ ^ < 0 , it was concluded that the cancer site would be a predictive factor, i.e., if the cancer site is not the pancreatic head, surgery is expected to be more effective with radiation than without it.
Next, we applied the approach proposed in this article. Figure 2 shows P ^ k l , z ( t c ) in each subgroup when the cut-off point t c was changed from 0 to 21.6 (months). The proportions of activated and inert subjects decreased and increased monotonically over time, respectively, as shown in Figure 2a,d; this is to be expected given Proposition 2. Figure 2b shows that the proportion of causative subjects is lower in the pancreatic head subgroup than in the other subgroup. Figure 2c shows that the proportion of preventive subjects was small, especially in the other subgroup. Figure 3 shows the differences in the estimated proportions between the two subgroups; i.e., θ ^ k l ( t c ) in (7). In Figure 3, we can clearly see that the cancer site is “depleted-activated” until approximately six months. Then, over the period from 6 to 12 months, the cancer site is “depleted-causative.”
The estimates of P ¯ k l in Definition 2 and θ ¯ k l in Proposition 1 are summarized in Table 2. In the table, θ ¯ ^ 10 = 0.145 implies that the mean proportion of causative subjects in the pancreatic head subgroup on the interval [0, 21.6] (12.9%) was 14.5% lower than that in the other subgroup (27.4%). This indicates that the cancer site is “depleted-causative”. However, for some subjects, the cancer site might be “activated-preventive” because θ ¯ ^ 01 = 0.057 , although the absolute value is smaller than that of θ ¯ ^ 10 (−0.145). The cancer site might also be considered as “depleted-activated” because θ ¯ ^ 11 = 0.062 . In the context of prediction and prognosis, as | θ ¯ ^ 10 | > | θ ¯ ^ 01 | > 0 , it is concluded that the cancer site would be a predictive factor of the effectiveness of radiation, and possibly also of the harm caused by radiation, albeit to a lesser extent.
The results generated using our approach were similar to those obtained via the current standard analysis; however, the magnitude was different. The difference is attributed to θ ¯ ^ 01 = 0.057 0 , where our approach could not rule out, based on the data, a harmful effect of radiation in some subjects, especially in the pancreatic head subgroup. The value of δ ^ estimated by the current standard analysis was −4.439, while it is converted to −0.201 ( 4.349 / 21.6 ); this is 0.057 less than the value of θ ¯ ^ 10 (−0.145) derived by our approach, as seen in Corollary 1. Similarly, γ ^ = 0.106 is converted to −0.005 ( 0.106 / 21.6 ); this is 0.057 larger than the value of θ ¯ ^ 11 (−0.062). As a result, γ ^ is closer to zero than θ ¯ ^ 11 .

5. Conclusion

In this article, we extended the concept of the covariate effect type proposed by Chiba [3] from the case with a binary outcome to cases with time-to-event and continuous outcomes. This was achieved by using all cut-off points on the interval [ t a ,   t b ] , where the respective values of t a and t b are zero and the truncation time in a case with a time-to-event outcome and the minimum and maximum observed values in a case with a continuous outcome. A proportion of the k l subjects for each cut-off point is P k l ( t c ) in (6), and the mean of P k l ( t c ) is the RMP, P ¯ k l , in Definition 2. As discussed in Section 2 and Section 3, the RMP is related to the arithmetic mean, which corresponds to the RMST for a case with a time-to-event outcome.
In some studies, a covariate of interest may be a continuous variable, rather than a binary variable discussed in this article. Unfortunately, our approach cannot be applied for a continuous covariate. However, a continuous covariate can be partitioned into two subgroups by finding the optimal threshold value to split it using a popular method such as receiver operating characteristic (ROC) curve analysis [19]. Recently, partitioning methods based on a combination of multivariate covariates have also been discussed [20]. These methods will facilitate the use of our approach.
Our measures of covariate effect types can easily be calculated by merging two data sets for the experimental and control arms including the rows of times and survival probabilities, which are automatically generated using commercial software such as SAS (SAS Institute, Cary, NC, USA) and R (R Foundation for Statistical Computing, Vienna, Austria). For example, to derive P ^ 11 ( t c ) = S ^ 1 ( t c 1 ) S ^ 0 ( t c 1 ) , we can use a procedure to generate a Kaplan–Meier plot. Then, we obtain respective data sets including { t j 1 1 ,   S ^ 1 ( t c 1 ) } and { t j 0 0 ,   S ^ 0 ( t c 1 ) } . P ^ 11 ( t c ) is derived by merging these two data sets. The RMP estimate, P ¯ ^ 11 , is calculated by summing the areas of rectangles, which are calculated by applying a lag function. This process to estimate P ¯ k l can be applied to a case with a continuous outcome by setting the minimum value of the outcome to zero, i.e., by using t j t 1 . It is also easy to extend the method to observational studies by using a weighted survival curve [21] derived based on the propensity score [22].
A limitation of our approach is the requirement that the assumption of independent potential outcomes (Assumption 2) is used when estimating the measures of covariate effect types. Unfortunately, we cannot verify whether this assumption holds in actual studies based on the observed data. Additional work is necessary to develop a simple method to estimate our measures of covariate effect types without using the assumption of independent potential outcomes.
Current standard analysis is suitable for assessing prediction and prognosis of a covariate when θ ¯ 01 = 0 , which implies that a covariate is not predictable for the effectiveness of the control treatment (or for the harm caused by the experimental treatment). However, when θ ¯ 01 0 , whether the current approach is appropriate is somewhat questionable unlike for our approach. Thus, our approach can supplement the current standard analysis, despite the limitation of requiring the assumption of independent potential outcomes.

Funding

This research was funded by Grant-in-Aid for Scientific Research from Japan Society for the Promotion of Science, grant number 19K11871.

Conflicts of Interest

The author declares no conflict of interest.

Appendix A. Proof of Lemma 1

Let us denote the probability density function of T ( 1 ) = t c as f ( t c ) , and the cumulative density function as F ( t c ) . Then, the left side of the first equation in Lemma 1 can be expressed as follows:
E [ min { max { T ( 1 ) ,   t a } ,   t b } ] = t c f ( t c ) d t c = t a t a f ( t c ) d t c + t a t b t c f ( t c ) d t c + t b t b f ( t c ) d t c = t a [ F ( t c ) ] t a + t a t b [ { t c F ( t c ) } F ( t c ) ] d t c + t b [ F ( t c ) ] t b = t a t b [ 1 Pr { T ( 1 ) t c } ] d t c + t b = t a t b { P 11 ( t c ) + P 10 ( t c ) } d t c + t a = ( t b t a ) ( P ¯ 11 + P ¯ 10 ) + t a .
The second equation in Lemma 1 is derived in a similar manner.

Appendix B. Proof of Lemma 2

Using Proposition 3, P ^ 11 ( t c ) + P ^ 10 ( t c ) = j c n j / n holds. This implies that the value of P ^ 11 ( t c ) + P ^ 10 ( t c ) does not change when applying the cut-off point for the observed outcome value in the control group. Therefore, for P ¯ ^ 11 + P ¯ ^ 10 , it is sufficient to consider only the observed outcome values in the experimental arm. Then, the left side of the first equation in Lemma 2 can be expressed as follows:
( t D t 1 ) ( P ¯ ^ 11 + P ¯ ^ 10 ) + t 1 = j 1 = 2 D 1 { ( t j 1 1 t j 1 1 1 ) j j 1 n j n } + t 1 j 1 n j n = 1 n { j 1 = 1 D 1 ( t j 1 1 j = j 1 D 1 n j ) j 1 = 2 D 1 ( t j 1 1 1 j = j 1 D 1 n j ) } = 1 n { j 1 = 1 D 1 1 ( t j 1 1 j = j 1 D 1 n j ) + n D 1 t D 1 1 j 1 = 2 D 1 1 ( t j 1 1 j = j 1 + 1 D 1 n j ) } = 1 n j = j 1 D 1 n j 1 t j 1 1
The second equation in Lemma 2 is derived in a similar manner.

References

  1. Clark, G.M.; Zborowski, D.M.; Culbertson, J.L.; Whitehead, M.; Savoie, M.; Seymour, L.; Shepherd, F.A. Clinical utility of epidermal growth factor receptor expression for selecting patients with advanced non-small cell lung cancer for treatment with erlotinib. J. Thorac. Oncol. 2006, 1, 837–846. [Google Scholar] [CrossRef] [Green Version]
  2. Ballman, K.V. Biomarker: Predictive or prognostic? J. Clin. Oncol. 2015, 33, 3968–3972. [Google Scholar] [CrossRef] [PubMed]
  3. Chiba, Y. Inference on covariate effect types for treatment effectiveness in a randomized trial with a binary outcome. Clin. Trials 2019, 16, 237–245. [Google Scholar] [CrossRef] [PubMed]
  4. Hernán, M.A.; Robins, J.M. Causal Inference: What If; Chapman & Hall/CRC: Boca Raton, FL, USA, 2020; pp. 58–65. [Google Scholar]
  5. Holland, P.W. Statistics and causal inference. J. Am. Stat. Assoc. 1986, 81, 945–960. [Google Scholar] [CrossRef]
  6. Rubin, D.B. Bayesian inference for causal effects: The role of randomization. Ann. Stat. 1978, 6, 34–58. [Google Scholar] [CrossRef]
  7. Rubin, D.B. Randomization analysis of experimental data: The Fisher randomization test comment. J. Am. Stat. Assoc. 1980, 75, 591–593. [Google Scholar] [CrossRef]
  8. Royston, P.; Parmar, M.K. Restricted mean survival time: An alternative to the hazard ratio for the design and analysis of randomized trials with a time-to-event outcome. BMC Med. Res. Methodol. 2013, 13, 152. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  9. Uno, H.; Claggett, B.; Tian, L.; Inoue, E.; Gallo, P.; Miyata, T.; Schrag, D.; Takeuchi, M.; Uyama, Y.; Zhao, L.; et al. Moving beyond the hazard ratio in quantifying the between-group difference in survival analysis. J. Clin. Oncol. 2014, 32, 2380–2385. [Google Scholar] [CrossRef] [PubMed]
  10. A’Hern, R.P. Restricted mean survival time: An obligatory end point for time-to-event analysis in cancer trials? J. Clin. Oncol. 2016, 34, 3474–3476. [Google Scholar] [CrossRef] [PubMed]
  11. Ding, P.; Dasgupta, T. A potential tale of two-by-two tables from completely randomized experiments. J. Am. Stat. Assoc. 2016, 111, 157–168. [Google Scholar] [CrossRef] [Green Version]
  12. Lu, J.; Ding, P.; Dasgupta, T. Treatment effects on ordinal outcomes: Causal estimands and sharp bounds. J. Educ. Behav. Stat. 2018, 43, 511–539. [Google Scholar] [CrossRef] [Green Version]
  13. Buyse, M. Generalized pairwise comparisons of prioritized outcomes in the two-sample problem. Stat. Med. 2010, 29, 3245–3257. [Google Scholar] [CrossRef] [PubMed]
  14. Péron, J.; Roy, P.; Ozenne, B.; Roche, L.; Buyse, M. The net chance of a longer survival as a patient-oriented measure of treatment benefit in randomized clinical trials. JAMA Oncol. 2016, 2, 901–905. [Google Scholar] [CrossRef] [PubMed]
  15. Chiba, Y. Sharp nonparametric bounds and randomization inference for treatment effects on an ordinal outcome. Stat. Med. 2017, 36, 3966–3975. [Google Scholar] [CrossRef] [PubMed]
  16. Huang, E.J.; Fang, E.X.; Hanley, D.F.; Rosenblum, M. Inequality in treatment benefits: Can we determine if a new treatment benefits the many or the few? Biostatistics 2017, 18, 308–324. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  17. Chiba, Y. Bayesian inference of causal effects for an ordinal outcome in randomized trials. J. Causal Infer. 2018, 6, 20170019. [Google Scholar] [CrossRef]
  18. Ohashi, Y.; Hamada, C. Survival Analysis: Biostatistics with SAS; University of Tokyo Press: Tokyo, Japan, 1995; pp. 247–250. (In Japanese) [Google Scholar]
  19. Heagerty, P.J.; Lumley, T.; Pepe, M.S. Time-dependent ROC curves for censored survival data and a diagnostic marker. Biometrics 2000, 56, 337–344. [Google Scholar] [CrossRef] [PubMed]
  20. Wang, J.; Li, J.; Li, Y.; Wong, W.K. A model-based multithreshold method for subgroup identification. Stat. Med. 2019, 38, 2605–2631. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  21. Cole, S.R.; Hernán, M.A. Adjusted survival curves with inverse probability weights. Comput. Methods Programs Biomed. 2004, 75, 45–49. [Google Scholar] [CrossRef] [PubMed]
  22. Rosenbaum, P.R.; Rubin, D.B. The central role of the propensity score in observational studies for causal effects. Biometrika 1983, 70, 41–55. [Google Scholar] [CrossRef]
Figure 1. Survival curves for the arms with and without radiation during surgery: (a) pancreatic head subgroup; (b) “other” subgroup.
Figure 1. Survival curves for the arms with and without radiation during surgery: (a) pancreatic head subgroup; (b) “other” subgroup.
Mathematics 08 01657 g001
Figure 2. Response type proportions in the pancreatic head and “other” subgroups: (a) activated (type 11) subjects; (b) causative (type 10) subjects; (c) preventive (type 01) subjects; (d) inert (type 00) subjects.
Figure 2. Response type proportions in the pancreatic head and “other” subgroups: (a) activated (type 11) subjects; (b) causative (type 10) subjects; (c) preventive (type 01) subjects; (d) inert (type 00) subjects.
Mathematics 08 01657 g002
Figure 3. Differences in the proportions of type k l subjects between the pancreatic head and “other” subgroups.
Figure 3. Differences in the proportions of type k l subjects between the pancreatic head and “other” subgroups.
Mathematics 08 01657 g003
Table 1. Restricted mean survival times (RMSTs) for the arms with and without radiation, for the pancreatic head and “other” subgroups, and the RMST differences between the two arms.
Table 1. Restricted mean survival times (RMSTs) for the arms with and without radiation, for the pancreatic head and “other” subgroups, and the RMST differences between the two arms.
Subgroup Arm RMST RMST Difference (95% CI 1)
Pancreatic headRadiation5.5081.236 (–1.127, 3.600)
No radiation4.271
OtherRadiation9.9635.585 (1.947, 9.223)
No radiation4.378
1 Confidence intervals (CIs) were derived based on a normal approximation.
Table 2. Restricted mean probabilities (RMPs) of the four response types by subgroups of cancer site and the RMP differences between the two subgroups.
Table 2. Restricted mean probabilities (RMPs) of the four response types by subgroups of cancer site and the RMP differences between the two subgroups.
Response TypeSubgroup RMP   ( P ¯ k l ) RMP   Difference   ( θ ¯ k l )   ( 95 %   CI   1 )
Activated
(Type 11)
Pancreatic head0.126−0.062 (−0.134, 0.017)
Other0.188
Causative
(Type 10)
Pancreatic head0.129−0.145 (−0.267, −0.026)
Other0.274
Preventive
(Type 01)
Pancreatic head0.0720.057 (0.010, 0.119)
Other0.015
Inert
(Type 00)
Pancreatic head0.6730.149 (−0.007, 0.307)
Other0.524
1 Confidence intervals (CIs) were yielded as percentiles for the bootstrap distribution with 2000 samples.

Share and Cite

MDPI and ACS Style

Chiba, Y. Definition and Estimation of Covariate Effect Types in the Context of Treatment Effectiveness. Mathematics 2020, 8, 1657. https://doi.org/10.3390/math8101657

AMA Style

Chiba Y. Definition and Estimation of Covariate Effect Types in the Context of Treatment Effectiveness. Mathematics. 2020; 8(10):1657. https://doi.org/10.3390/math8101657

Chicago/Turabian Style

Chiba, Yasutaka. 2020. "Definition and Estimation of Covariate Effect Types in the Context of Treatment Effectiveness" Mathematics 8, no. 10: 1657. https://doi.org/10.3390/math8101657

APA Style

Chiba, Y. (2020). Definition and Estimation of Covariate Effect Types in the Context of Treatment Effectiveness. Mathematics, 8(10), 1657. https://doi.org/10.3390/math8101657

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop