Next Article in Journal
Dynamic Behavior of a Stochastic Avian Influenza Model with Two Strains of Zoonotic Virus
Previous Article in Journal
Resultant Normal Contact Force-Based Contact Friction Model for the Combined Finite-Discrete Element Method and Its Validation
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Asymptotic Sample Size for Common Test of Relative Risk Ratios in Stratified Bilateral Data

1
College of Mathematics and System Sciences, Xinjiang University, Urumqi 830000, China
2
Department of Biostatistics, University at Buffalo, Buffalo, NY 14214, USA
*
Author to whom correspondence should be addressed.
Mathematics 2023, 11(19), 4198; https://doi.org/10.3390/math11194198
Submission received: 2 September 2023 / Revised: 29 September 2023 / Accepted: 5 October 2023 / Published: 8 October 2023
(This article belongs to the Section Probability and Statistics)

Abstract

:
In medical clinical studies, various tests usually relate to the sample size. This paper proposes several methods to calculate sample sizes for a common test of relative risk ratios in stratified bilateral data. Under the prespecified significant level and power, we derive some explicit formulae and an algorithm of the sample size. The sample sizes of the stratified intra-class model are obtained from the likelihood ratio, score, and Wald-type tests. Under pooled data, we calculate sample size based on the Wald-type test and its log-transformation form. Numerical simulations show that the proposed sample sizes have empirical power close to the prespecified value for given significance levels. The sample sizes from the iterative method are more stable and effective.

1. Introduction

In medical clinical studies, we often encounter observations from patients’ paired organs or parts (e.g., eyes, ears, and arms). The observations are often binary, that is, response or no response. When each patient in the study contributes such paired measurements, the overall outcome can be bilateral, unilateral, or no response(s). We call this type of data bilateral data. A correlation always exists between paired parts’ responses. Several intra-class correlation models have been proposed for analyzing such paired data under a probability model [1,2,3]. Rosner [1] introduced a constant R model with the assumption that the conditional probability of a response at one side of the paired body parts given response at the other side was R times the unconditional probability. Donner [3] proposed a model by assuming that the correlation coefficient was a fixed constant ρ in each group. Based on the three models mentioned above, there have been many achievements, such as asymptotic tests [4,5,6,7,8], exact tests [9,10,11], and confidence intervals [12,13]. However, comparatively little work has been done on the sample size problem of bilateral data.
In practice, sample size is one of the essential factors in designing clinical trials. Up to now, a considerable body of literature has grown up relating to binary data. For example, Tang et al. [8] proposed a power-controlled sample size, which can guarantee a desired power and control type I error rates (TIEs) at the fixed significance level. Qiu et al. [14] studied sample sizes for common tests of disease prevalence. Further, Qiu et al. [15] used an iterative algorithm to calculate the asymptotic sample sizes of common tests for binomial proportions. Sun et al. [16] extended the iterative algorithm to the homogeneity test in stratified unilateral and bilateral data. For more detailed results of sample sizes, we refer to [17,18,19]. Through the above analysis, there are mainly two approaches: sample size formula and algorithms. The former is practical for some explicit test statistics, while the latter often handles complex cases. However, there is little research to evaluate these methods’ performance.
Under Donner’s model, Zhuang et al. [20] proposed several statistics for testing common relative risk ratios across J strata. In this paper, we extend the hypotheses to the general case. Thus, their results are a special case of our results. What is more important, our work focuses on calculating the sample sizes of various test statistics, while Zhuang et al. [20] did not take it into account. The novelty and contribution of our work are shown by three main aspects as follows: (i) the common hypotheses mentioned above are extended to the general case H 0 : δ = δ 0   versus   H a : δ δ 0 , where δ 0 ( 0 , ) . (ii) Under the dependent and independent models, the approximate equations and algorithms of sample size are derived at prespecified power and significance levels. (iii) Numerical simulations are conducted to compare these methods.
The rest of the work is organized as follows. Section 2 reviews data structure and establishes Donner’s model. Several test statistics are derived for a common test of relative risk ratios. Based on these statistics, sample size calculations are proposed in Section 3. In Section 4, simulations show the performance of eight asymptotic sample sizes in terms of empirical power and TIEs. We also compare estimated values with actual sample sizes. A real example is provided to illustrate the proposed methods in Section 5. Conclusions are given in Section 6.

2. Donner’s Model and Common Test

Let M be the total number of patients for a stratified bilateral design. There are M j ( j = 1 , 2 , , J ) patients in the jth stratum. Thus, M = j = 1 J M j . Let m l i j be the number of patients with l responses ( l = 0 , 1 , 2 ) in the ith group ( i = 1 , 2 ) of the jth stratum ( j = 1 , 2 , , J ), and N i j = l = 0 2 m l i j be a fixed total number of patients in the ith group of the jth stratum. Let p l i j ( l = 0 , 1 , 2 , i = 1 , 2 , j = 1 , , J ) be the probabilities of no, unilateral, and bilateral response(s) in the ith group of the jth stratum, where p 0 i j + p 1 i j + p 2 i j = 1 for any fixed i and j. The data structure of the jth stratum is shown in Table 1.
Denote m i j = ( m 0 i j , m 1 i j , m 2 i j ) T and p i j = ( p 0 i j , p 1 i j , p 2 i j ) T ( i = 1 , 2 , j = 1 , , J ). For the jth stratum, m i j ( i = 1 , 2 ) follows a trinomial distribution. Thus, its probability density is expressed as follows:
f ( m i j p i j ) = N i j ! m 0 i j ! m 1 i j ! m 2 i j ! p 0 i j m 0 i j p 1 i j m 1 i j p 2 i j m 2 i j .
Let Z h i j k be an indicator of the kth organ’s response ( k = 1 , 2 ) for the hth patient ( h = 1 , 2 , , N i j ) in the ith group from the jth stratum. If there is a response, Z h i j k = 1 , and 0 otherwise. Under Donner’s model, Pr ( Z h i j k = 1 ) = π i j ( 0 π i j 1 ), and Corr ( Z h i j k , Z h i j ( 3 k ) ) = ρ j ( 0 ρ j 1 ). The correlation coefficient ρ j ( j = 1 , 2 , , J ) is constant in each stratum. The organ responses are affected by other factors such as group (e.g., treatments) and stratum (e.g., age, sex) factors. Through a simple calculation, the probabilities p l i j can be obtained by p 0 i j = ρ j ( 1 π i j ) + ( 1 ρ j ) ( 1 π i j ) 2 , p 1 i j = 2 π i j ( 1 ρ j ) ( 1 π i j ) , p 2 i j = ρ j π i j + ( 1 ρ j ) π i j 2 for i = 1 , 2 , j = 1 , , J .
Let δ j = π 2 j / π 1 j ( j = 1 , , J ) be the relative risk ratio between two groups in the jth stratum. Denote π i = ( π i 1 , , π i J ) T for i = 1 , 2 , and ρ = ( ρ 1 , , ρ J ) T . Suppose that the relative risk ratios from all strata are equal, that is, δ 1 = δ 2 = = δ J δ . The effects of strata include the correlation coefficients ρ j , probabilities p l i j ( l = 0 , 1 , 2 ) , sample sizes M j ( j = 1 , 2 , , J ) , and ratios N 2 j / N 1 j between two groups. Based on the observed data m i j ( i = 1 , 2 , j = 1 , 2 , , J ) , the log-likelihood function can be written as
( π 1 , δ , ρ ) = log j = 1 J i = 1 2 f ( m i j p i j ) = j = 1 J l j ( π 1 j , δ , ρ j ) + log C ,
where C = j = 1 J i = 1 2 N i j ! m 0 i j ! m 1 i j ! m 2 i j ! and
l j ( π 1 j , δ , ρ j ) = m 01 j log [ ρ j ( 1 π 1 j ) + ( 1 ρ j ) ( 1 π 1 j ) 2 ] + m 11 j log [ 2 π 1 j ( 1 ρ j ) ( 1 π 1 j ) ] + m 21 j log [ ρ j π 1 j + ( 1 ρ j ) π 1 j 2 ] + m 02 j log [ ρ j ( 1 π 1 j δ ) + ( 1 ρ j ) ( 1 π 1 j δ ) 2 ] + m 12 j log [ 2 π 1 j δ ( 1 ρ j ) ( 1 π 1 j δ ) ] + m 22 j log [ ρ j π 1 j δ + ( 1 ρ j ) ( π 1 j δ ) 2 ] .
Under the assumption that relative risk ratios from all strata are equal, we propose the hypotheses of the common test as
H 0 : δ = δ 0   versus   H a : δ δ 0 ,
where δ 0 ( 0 , ) . Under H 0 , that is π 2 j = π 1 j δ 0 ( j = 1 , 2 , , J ) , the log-likelihood function can be rewritten as
0 ( π 1 , δ 0 , ρ ) = j = 1 J l 0 j ( π 1 j , δ 0 , ρ j ) + log C ,
where l 0 j is written in Appendix A. Next, we introduce several asymptotic statistics to test the hypotheses (2) and solve the sample size calculations for the prespecified power and significance level based on these statistics.
Let π ^ 1 = ( π ^ 11 , π ^ 12 , , π ^ 1 J ) T , δ ^ , ρ ^ = ( ρ ^ 1 , ρ ^ 2 , , ρ ^ J ) T be the global maximum likelihood estimations (MLEs) of π 1 , δ , ρ , and π ˜ 1 = ( π ˜ 11 , π ˜ 12 , , π ˜ 1 J ) T , ρ ˜ = ( ρ ˜ 1 , ρ ˜ 2 , , ρ ˜ J ) T be the constrained MLEs of π 1 , ρ under H 0 . The global MLEs are the solutions of the following equations
π 1 j = 0 , δ = 0 , ρ j = 0 , j = 1 , 2 , , J .
The constrained MLEs are the solutions of the following equations
0 π 1 j = 0 , 0 ρ j = 0 , j = 1 , 2 , , J .
However, there are no closed-form solutions of Equations (4) and (5). Following Zhuang et al. [20], the global and constrained MLEs under δ = δ 0 can be obtained through the Fisher scoring algorithm.

2.1. Likelihood Ratio Test

Based on the log-likelihood functions (1) and (3), the likelihood ratio test is defined as
T L = 2 [ ( π ^ 1 , δ ^ , ρ ^ ) 0 ( π ˜ 1 , δ 0 , ρ ˜ ) ] = 2 j = 1 J [ l j ( π ^ 1 j , δ ^ , ρ ^ j ) l 0 j ( π ˜ 1 j , δ 0 , ρ ˜ j ) ] .

2.2. Score Test

Denote a score function U = ( j = 1 J l j δ , 0 , , 0 ) . A score test can be expressed by
T S C = U I 1 U T | δ = δ 0 , ρ = ρ ˜ , π 1 = π ˜ 1 ,
where I is a Fisher information matrix with respect to the parameters δ , ρ = ( ρ 1 , ρ 2 , , ρ J ) T , π 1 = ( π 11 , π 12 , , π 1 J ) T . Denote
A j = π 1 j ( 2 ( 1 ρ j ) ( 1 δ π 1 j ) + ρ j ) ρ j ( 1 δ π 1 j ) + ( 1 ρ j ) ( 1 δ π 1 j ) 2 , B j = ( 1 2 δ π 1 j ) δ ( 1 δ π 1 j ) , C j = ( 2 δ π 1 j ( 1 ρ j ) + ρ j ) δ ρ j + ( 1 ρ j ) δ 2 π 1 j .
Further, it can be simplified as
T S C = j = 1 J e j 2 I 1 ( 1 , 1 ) | δ = δ 0 , ρ = ρ ˜ , π 1 = π ˜ 1 ,
where e j = m 02 j A j + m 12 j B j + m 22 j C j , and I 1 ( 1 , 1 ) is the first diagonal element of the inverse matrix of the Fisher information matrix I. See Appendix B for more details.

2.3. Wald-Type Test

Denote β = ( δ , ρ T , π 1 T ) T , and C = ( 1 , 0 , , 0 ) 1 × ( 1 + 2 J ) . Thus, H 0 : δ = δ 0 is equivalent to C β T = δ 0 . We can construct a Wald-type test statistic as
T W = ( β C T δ 0 ) ( C I 1 C T ) 1 ( C β T δ 0 ) | δ = δ ^ , ρ = ρ ^ , π 1 = π ^ 1 ,
where π ^ 1 = ( π ^ 11 , π ^ 12 , , π ^ 1 J ) T , δ ^ and ρ ^ = ( ρ ^ 1 , ρ ^ 2 , , ρ ^ J ) T are global MLEs under H a . Through calculation, it can be simplified as
T W = ( δ ^ δ 0 ) 2 I 1 ( 1 , 1 ) | δ = δ ^ , ρ = ρ ^ , π 1 = π ^ 1 .

2.4. Pooled MLE-Based Wald-Type Test

Suppose that relative risk ratios from all strata are equal regardless of the stratification effects. Thus, we can pool the data from J strata together (Zhuang et al. [20]) and summarize all observations into a 3 × 2 table. The pooled data structure is shown in Table 2. Let M l i ( l = 0 , 1 , 2 ; i = 1 , 2 ) be the total number of subjects with l response(s) in the ith group, and M l + = i = 1 2 M l i be the number of subjects with l responses, where l = 0 , 1 , 2 . M + i ( i = 1 , 2 ) is the total sample size in the ith group.
For the pooled data, let π i ( i = 1 , 2 ) be the response rate of the ith group. Denote the relative risk ratio δ = π 2 / π 1 . If we ignore the relevance between the paired bilateral responses, the MLEs of π i for i = 1 , 2 and δ can be derived as
π ¯ i = M 1 i + 2 M 2 i 2 M + i , δ ¯ = π ¯ 2 π ¯ 1 = ( M 12 + 2 M 22 ) M + 1 ( M 11 + 2 M 21 ) M + 2 .
Based on the assumption that the observations from the paired organs are independent, the estimate of V a r ( π ¯ i ) is given in Tang et al. [9] as
V i = 4 M 0 i M 2 i + M 1 i ( M 0 i + M 2 i ) 4 M + i 3 ,
for i = 1 , 2 . By the delta method, we have E ( δ ¯ ) δ and V a r ( δ ¯ ) [ δ 2 V a r ( π ¯ 1 ) + V a r ( π ¯ 2 ) ] / π 1 2 . The estimate of V a r ( δ ¯ ) is directly obtained by σ ¯ 2 = [ δ ¯ 2 V 1 + V 2 ] / π ¯ 1 2 . A Wald-type test can be given by
T L S = ( δ ¯ δ 0 ) 2 σ ¯ 2 = ( δ ¯ δ 0 ) 2 ( δ ¯ 2 V 1 + V 2 ) / π ¯ 1 2 .

2.5. Pooled MLE-Based Log-Transformation Test

According to the delta method, log ( δ ¯ ) has the mean log ( δ ) and the variance V a r [ log ( δ ¯ ) ] V a r ( δ ¯ ) / δ 2 . In this case, an estimate of V a r [ log ( δ ¯ ) ] is σ ˜ 2 = σ ¯ 2 / δ ¯ 2 = ( V 1 + V 2 / δ ¯ 2 ) / π ¯ 1 2 . The χ 2 test statistic based on pooled MLE-based log-transformation is given by
T l o g = ( log ( δ ¯ ) log ( δ 0 ) ) 2 σ ˜ 2 = ( log ( δ ¯ ) log ( δ 0 ) ) 2 ( V 1 + V 2 / δ ¯ 2 ) / π ¯ 1 2 .
Under H 0 , the test statistics T i ( i = L , S C , W , L S , l o g ) are asymptotically distributed as a chi-square distribution with one degree of freedom. Thus, H 0 should be rejected if the value of the test statistic is larger than χ 1 , 1 α 2 at the significant level α , where χ 1 , 1 α 2 is the 100 ( 1 α ) percentile of the chi-square distribution with one degree of freedom.

3. Sample Size Determination

Determining an appropriate sample size for a statistical test is essential in any clinical trial design. This section proposes the approximate formulae and iterative algorithm to determine sample sizes based on the above test statistics. However, the approximate formula method is not entirely applicable to all statistics, but only some of them. For convenience, denote k j = N 1 j / M + 1 , and t j = N 2 j / N 1 j ( j = 1 , 2 , , J ).

3.1. Asymptotic Sample Size

Based on the statistics above T S C , T L S , and T l o g , we derive the asymptotic sample size under the desired power level. Under the alternative hypothesis, T i ( i = S C , L S , l o g ) asymptotically follows a non-central chi-square distribution with one degree of freedom. The non-central parameter, denoted by τ i , can be obtained by solving the equation: χ 1 , 1 β 2 ( τ i ) = χ 1 , α 2 , where χ 1 , 1 β 2 ( τ i ) is the 100 ( 1 β ) percentile of the non-central χ 2 distribution. When the observed frequencies are replaced by their expected frequencies (i.e., replace m l i j with M + i p l i j for l = 0 , 1 , 2 , i = 1 , 2 and j = 1 , 2 , , J ), denote the solutions from the Equation (5) for π 1 j , ρ j as π ˜ 1 j * , ρ ˜ j * , respectively. The Fisher scoring algorithm can be used to solve π ˜ 1 j * , ρ ˜ j * for j = 1 , 2 , , J .
Theorem 1.
Given the prespecified power 1 β and significance level α, the asymptotic sample size for the score test T S C is expressed by
M S C a j = 1 J ( τ S C + 1 ) / W ( δ 0 , ρ ˜ j * , π ˜ j * ) j = 1 J k j t j P j ( j = 1 J k j t j E j ) 2 × ( k j + k j t j ) ,
where W ( δ , ρ j , π j ) = I 1 ( 1 , 1 ) M + 1 , and
P j = A j 2 p 02 j ( 1 p 02 j ) ) + B j 2 p 12 j ( 1 p 12 j ) + C j 2 p 22 j ( 1 p 22 j ) 2 A j B j p 02 j p 12 j 2 A j C j p 02 j p 22 j 2 B j C j p 12 j p 22 j | δ = δ 0 , π 1 j = π ˜ 1 j * , ρ j = ρ ˜ j * ,
E j = p 02 j A j + p 12 j B j + p 22 j C j | δ = δ 0 , π 1 j = π ˜ 1 j * , ρ j = ρ ˜ j * .
Proof. 
Given a significance level α , the power of the test T S C can be expressed as
P r ( T S C χ 1 , α 2 H a ) = P r ( χ 1 2 ( τ S C ) χ 1 , α 2 ) .
Thus, it follows that
τ S C + 1 = E T S C H a .
We first substitute M 1 j and M 2 j with k j M + 1 and k j t j M + 1 into the Fisher information matrix I and obtain the symmetrical matrix I = M + 1 I s y m . Letting W ( δ , ρ j , π j ) be the first main-diagonal element of the inverse of the matrix I s y m , we have I 1 ( 1 , 1 ) = W ( δ , ρ j , π j ) / M + 1 . The conditional expectation can be written as
E T S C H a = E ( j = 1 J e j ) 2 I 1 ( 1 , 1 ) H a W ( δ 0 , ρ ˜ j * , π ˜ j * ) / M + 1 E ( j = 1 J e j ) 2 H a ,
where
E ( j = 1 J e j ) 2 H a = V a r ( j = 1 J e j H a ) + E ( j = 1 J e j H a ) 2 .
Since e j = m 02 j A j + m 12 j B j + m 22 j C j , the variance of j = 1 J e j is derived through the properties of the trinomial distribution of m l 2 j ( l = 0 , 1 , 2 , j = 1 , 2 , , J ) . Thus,
V a r ( j = 1 J e j ) = V a r ( j = 1 J m 02 j A j + m 12 j B j + m 22 j C j ) = j = 1 J V a r ( m 02 j A j + m 12 j B j + m 22 j C j ) = j = 1 J A j 2 V a r ( m 02 j ) + B j 2 V a r ( m 12 j ) + C j 2 V a r ( m 22 j ) + 2 A j B j C o v ( m 02 j , m 12 j ) + 2 A j C j C o v ( m 02 j , m 22 j ) + 2 B j C j C o v ( m 12 j , m 22 j ) ,
where V a r ( m 02 j ) = N 2 j p 02 j ( 1 p 02 j ) , V a r ( m 12 j ) = N 2 j p 12 j ( 1 p 12 j ) , V a r ( m 22 j ) = N 2 j p 22 j ( 1 p 22 j ) , C o v ( m 02 j , m 12 j ) = N 2 j p 02 j p 12 j , C o v ( m 02 j , m 22 j ) = N 2 j p 02 j p 22 j and C o v ( m 12 j , m 22 j ) = N 2 j p 12 j p 22 j . Substitute the MLEs π ˜ 1 j , ρ ˜ j with π ˜ 1 j * , ρ ˜ j * for j = 1 , 2 , , J . The approximate conditional expectation of e j under H a can be given as E ( e j H a ) M + 1 k j t j E j . The approximate conditional variance of e j under H a is V a r ( e j H a ) M + 1 k j t j P j . According to the Equation (6), we have
τ S C + 1 W ( δ 0 , ρ ˜ j * , π ˜ j * ) ( j = 1 J k j t j P j + M + 1 ( j = 1 J k j t j E j ) 2 ) .
The total sample size of the first group can be solved as
M + 1 ( τ S C + 1 ) / W ( δ 0 , ρ ˜ j * , π ˜ j * ) j = 1 J k j t j P j ( j = 1 J k j t j E j ) 2 .
Thus, the asymptotic sample size of score statistic can be obtained as
M S C a j = 1 J ( τ S C + 1 ) / W ( δ 0 , ρ ˜ j * , π ˜ j * ) j = 1 J k j t j P j ( j = 1 J k j t j E j ) 2 × ( k j + k j t j ) .
The proof is complete. □
For the Wald-type statistic under the pooled model, the sample size formula can be given according to the non-central chi-square distribution under H a . Denote t = M + 2 / M + 1 for the pooled data.
Theorem 2.
Given the prespecified power 1 β and significance level α, the asymptotic sample size for the test T L S is expressed by
M L S a ( 1 + t ) ( τ S L + 1 ) δ ¯ 2 W 1 + W 2 / t ( π ¯ 1 ) 2 δ ¯ 2 W 1 + W 2 / t π ¯ 1 2 ( δ δ 0 ) 2 ,
where W i = p 0 i p 2 i + p 1 i ( p 0 i + p 2 i ) / 4 for i = 1 , 2 .
Proof. 
The test statistic T L S is asymptotically distributed as a non-central χ 2 distribution with one degree of freedom and non-central parameter τ L S under the alternative hypothesis H a . The non-central parameter τ L S can be obtained by solving the equation: χ 1 , 1 β 2 ( τ L S ) = χ 1 , α 2 . Thus, it follows that τ L S + 1 = E T L S H a . Substitute M l 1 and M l 2 with M + 1 p l 1 and k M + 1 p l 2 (l = 0,1,2) for σ ¯ 2 = [ δ ¯ 2 V 1 + V 2 ] / π ¯ 1 2 . Hence,
E T L S H a = E ( δ ¯ δ 0 ) 2 ( δ ¯ 2 V 1 + V 2 ) / π ¯ 1 2 H a E ( ( δ ¯ δ 0 ) 2 H a ) π ¯ 1 2 δ ¯ 2 W 1 / M + 1 + W 2 / ( t M + 1 ) .
Based on the conditional expectation E ( ( δ ¯ δ 0 ) 2 H a ) V a r ( δ ¯ H a ) + E ( δ ¯ δ 0 H a ) 2 , and the conditional variance V a r ( δ ¯ H a ) [ δ ¯ 2 W 1 / M + 1 + W 2 / ( t M + 1 ) ] / π ¯ 1 2 , we have
τ S L + 1 δ ¯ 2 W 1 + W 2 / t π ¯ 1 2 + ( δ δ 0 ) 2 M + 1 ( π ¯ 1 ) 2 δ ¯ 2 W 1 + W 2 / t .
We can obtain the sample size of the first group by
M + 1 ( τ S L + 1 ) δ ¯ 2 W 1 + W 2 / t ( π ¯ 1 ) 2 δ ¯ 2 W 1 + W 2 / t π ¯ 1 2 ( δ δ 0 ) 2 .
Then, the sample size for the statistic T L S is given as
M L S a ( 1 + t ) ( τ L S + 1 ) δ ¯ 2 W 1 + W 2 / t ( π ¯ 1 ) 2 δ ¯ 2 W 1 + W 2 / t π ¯ 1 2 ( δ δ 0 ) 2 .
The proof is complete. □
Next, we present the corresponding sample size formula of the statistic T l o g for the prespecified power at a fixed significance level.
Theorem 3.
Given the prespecified power 1 β and significance level α, the asymptotic sample size for the test T l o g is expressed by
M l o g a ( 1 + t ) ( τ l o g + 1 ) W 1 + W 2 / ( δ ¯ 2 t ) ( π ¯ 1 ) 2 W 1 + W 2 / ( δ ¯ 2 t ) π ¯ 1 2 ( log δ log δ 0 ) 2 ,
where W i = p 0 i p 2 i + p 1 i ( p 0 i + p 2 i ) / 4 for i = 1 , 2 .
Proof. 
The test statistic T l o g is asymptotically distributed as a non-central χ 2 distribution with one degree of freedom and non-central parameter τ l o g under the alternative hypothesis H a . Thus, it follows that
τ l o g + 1 = E T l o g H a .
Substitute M l 1 and M l 2 with M + 1 p l 1 and k M + 1 p l 2 ( l = 0 , 1 , 2 ) in σ ˜ 2 = ( V 1 + V 2 / δ ¯ 2 ) / π ¯ 1 2 . The conditional expectation of T l o g can be obtained
E T l o g H a = E ( ( log ( δ ¯ ) log ( δ 0 ) ) 2 ( V 1 + V 2 / δ ¯ 2 ) / π ¯ 1 2 H a ) = E ( ( log δ ¯ log δ 0 ) 2 H a ) π ¯ 1 2 W 1 / M + 1 + W 2 / ( t M + 1 δ ¯ 2 ) ,
where E ( ( log δ ¯ log δ 0 ) 2 H a ) V a r ( log δ ¯ H a ) + E ( ( l o g δ ¯ log δ 0 ) 2 H a ) . The value of V a r ( log δ ¯ H a ) can be approximated by W 1 / M + 1 + W 2 / ( t M + 1 δ ¯ 2 ) π ¯ 1 2 . Then, we have
τ l o g + 1 ( l o g δ log δ 0 ) 2 + W 1 / M + 1 + W 2 / ( t M + 1 δ ¯ 2 ) π ¯ 1 2 π ¯ 1 2 W 1 / M + 1 + W 2 / ( t M + 1 δ ¯ 2 ) = ( l o g δ log δ 0 ) 2 M + 1 + W 1 + W 2 / ( t δ ¯ 2 ) π ¯ 1 2 π ¯ 1 2 W 1 + W 2 / ( t δ ¯ 2 ) .
We can obtain the sample size of the first group by
M + 1 ( τ l o g + 1 ) W 1 + W 2 / ( δ ¯ 2 t ) ( π ¯ 1 ) 2 W 1 + W 2 / ( δ ¯ 2 t ) π ¯ 1 2 ( log δ log δ 0 ) 2 .
The sample size of T l o g statistic is
M l o g a ( τ l o g + 1 ) W 1 + W 2 / ( δ ¯ 2 t ) ( π ¯ 1 ) 2 W 1 + W 2 / ( δ ¯ 2 t ) π ¯ 1 2 ( log δ log δ 0 ) 2 × ( 1 + t ) .
The result follows. □
Theorems 1–3 provide the sample size formulae according to the asymptotic distribution of the statistics T S C , T L S , and T l o g under the hypothesis H a : δ δ 0 . However, it is not easy to obtain explicit expressions in practical applications. The iterative method can be considered to solve the problem.

3.2. The Iterative Method

The power 1 β of a common test is the probability that the null hypothesis is rejected when the alternative hypothesis is true. Under the alternative hypothesis H a , the asymptotic power of statistic T i ( i = L , S C , W , L S or l o g ) is given by
P r ( T i χ 1 , 1 α 2 H a ) = 1 β ,
where significance level α is fixed. We can usually derive sample size by solving Equation (2) at a given significance level α and power of 1 β . However, we cannot know the exact distribution under H a of some test statistics based on MLEs from the iterative algorithm (e.g., T L and T W ). In this case, we propose an iterative algorithm for calculating the asymptotic sample size for stratified correlated data. The method can compare empirical power with the given power of 1 β for searching sample size. The detailed process of the algorithm is shown as follows:
(i)
Given k j , t j , δ , π 1 j and ρ j , j = 1 , 2 , , J . The initial values of sample size M ˜ ( 0 ) = 0 , the step size d = 1000 and flag f = 1 .
(ii)
The ( t + 1 ) th update of M ˜ is M ˜ ( t + 1 ) = M ˜ ( t ) + d × f . The 10,000 replicates m i j = ( m 0 i j , m 1 i j , m 2 i j ) T are randomly generated under H a , where m i j ( i = 1   or   2 ) follows a trinomial distribution
f ( m i j p i j ) = N i j ! m 0 i j ! m 1 i j ! m 2 i j ! p 0 i j m 0 i j p 1 i j m 1 i j p 2 i j m 2 i j .
(iii)
Calculate empirical power based on random samples generated in step (ii) at a given significance level α . The empirical power can be computed by dividing the number of times rejecting H 0 by 10,000. The empirical power is denoted as p * ( M ˜ ( t + 1 ) ) .
(iv)
Compare p * ( M ˜ ( t + 1 ) ) with given power 1 β . If f × p * ( M ˜ ( t + 1 ) ) < f × ( 1 β ) , return to step (ii). Otherwise, d = 0.1 × d , f = f and return to step (ii).
(v)
Repeat the steps (ii)–(iv) until p * ( M ˜ ( t + 1 ) ) closes to 1 β before d becomes a decimal.
From the test statistics T i , the sample sizes from power approximation method are denoted as M i b , where i is L , S C , W , L S , or l o g . The MATLAB code of the iterative algorithm is available in the Supplementary Materials.

4. Simulation for Asymptotic Power and Sample Size

In this section, numerical simulations are conducted to evaluate the performance of asymptotic sample sizes from our proposed methods.

4.1. Asymptotic Sample Size, Power and TIE

Take J = 3 . Under the null hypothesis H 0 : δ = 1 , we calculate the sample size M S C a , M L S a , M l o g a , M L b , M S C b , M W b , M L S b , and M l o g b for given power 1 β = 0.9 . Considering that the total numbers of strata may be the same or different, we let k = ( 1 / 3 , 1 / 3 , 1 / 3 ) and ( 0.5 , 0.3 , 0.2 ) . The correlation coefficient ρ j = 0.3 (low), or 0.5 (moderate), and the response rates π 1 j = 0.4 (less than 50%), or 0.6 (bigger than 50%). The scenarios of parameter settings in the simulations are listed in Table 3.
Figure 1, Figure 2 and Figure 3 show the estimated sample sizes, empirical powers and TIEs of M S C a , M L S a , M l o g a , M L b , M S C b , M W b , M L S b and M l o g b for the given parameters listed in Table 3. We observe that there are obvious differences among M S C a , M L S a , and M l o g a even though they are all solved by fitting asymptotic χ 2 distributions. Asymptotic sample sizes based on the statistics T L , T S C , and T L S are close under the iterative method. For the five sample sizes from the iterative algorithm, the values of M W b are always lower, and those of M l o g b tend to be bigger. The sample sizes of δ a = 0.7 are smaller than that of δ a = 0.8 . When other parameters are fixed, we can observe two phenomena: (i) the sample sizes become bigger for ρ j = 0.5 than ρ j = 0.3 ; (ii) the sample sizes tend to decrease when π 1 j increases from 0.4 to 0.6. The sample sizes increase as δ a is closer to 1. That is to say, more samples are required when δ = δ a is closer to that under H 0 : δ = δ 0 .
We can calculate the empirical powers and TIEs based on the estimated sample sizes. For every sample size, 10,000 replicates m i j = ( m 0 i j , m 1 i j , m 2 i j ) T are randomly generated under H a . Further, the empirical power can be computed by dividing the number of times rejecting H 0 by 10,000 when the significance level is 0.05. Empirical TIE is calculated through 10,000 replicates randomly generated under δ = δ 0 . The results are listed in Figure 2 and Figure 3. The empirical powers are mostly close to the desired powers of 0.9. Compared with the sample size formula, the iterative algorithm often produces the sample size with empirical power closer to 0.9. In terms of empirical power, there are no significant trend changes for various parameter settings of δ a , k j , t j , π 1 j and ρ j ( j = 1 , 2 , , J ) .
According to Tang et al. [8], a test is robust if its empirical TIE is between 0.04 and 0.06 when the significance level is 0.05, liberal if it is greater than 0.06, and conservative if it is less than 0.04. As shown in Figure 3, the sample size M S C a has better performance in empirical TIEs. For the sample size M S C a , the empirical TIEs are located on [ 0.04 , 0.06 ] under all considered settings. For some cases, the empirical TIEs based on statistics T L S and T l o g are lower than 0.04, which means that T L S and T l o g have conservative TIEs based on sample sizes from the approximate formula or the iterative algorithm. Sample sizes M L b , M S C b , and M W b sometimes have liberal TIEs bigger than 0.06.

4.2. Accuracy

This section evaluates the effectiveness of sample size determinations. Let J = 3 , δ 0 = 1 , and the sample size M + 1 = 100 , 200 , 300 . Other parameters k i , π i j , ρ j are randomly generated under H a : δ = δ a ( δ 0 ) , satisfying 0 < k i < 1 , 0 π i j 1 , 1 ρ j 1 and 0 δ a 1 . Denote k = ( k 1 , k 2 , k 3 ) , t 1 = t 2 = t 3 , π 1 = ( π 11 , π 12 , π 13 ) T , ρ = ( ρ 1 , ρ 2 , ρ 3 ) T . We choose 1000 parameter settings for every sample size. There are 10,000 random replications under every parameter setting. Empirical powers are given at a significance level of 0.05. Further, we calculate the corresponding asymptotic sample sizes for the obtained empirical powers at a significance level α = 0.05 . The boxplots in Figure 4 show all asymptotic sample sizes.
We note that the estimated sample sizes ( M L b , M S C b , M W b , M L S b , and M l o g b ) are close to the true sample size for M + 1 = 100 , 200 , 300 , which performs better than the sample size formula method ( M S C a , M L S a and M l o g a ). In terms of the sample size formula, the sample size from the score statistic is more accurate than the statistics T L S and T l o g . The estimated sample sizes M L S a and M l o g a are always bigger than the true values. Thus, the stratified correlated model performs better for effectively estimating sample size through formula derivation.
Above all, all power-controlled methods can provide optimal estimates because the empirical powers are usually close to the prespecified one. The sample sizes from the iterative algorithm have more satisfactory power. Thus, the iterative method is recommended in practice.

4.3. The Effect of Parameters

Through the iterative algorithm, we further study the effect of parameters on sample sizes. Given a prespecified power of 0.9 and a significance level of 0.05, the hypotheses H 0 : δ 0 = 1 and H 1 : δ a = 0.8 . To study the effect of strata, we take J = 2 , 4 , 6 . The influence of π 1 j , ρ j and t j can be observed according to these cases: (i) π 1 j = 0.3 ( 0.1 ) 0.7 for ρ j = 0.7 and t j = 1 ; (ii) ρ j = 0.1 ( 0.2 ) 0.9 for π 1 j = 0.5 and t j = 1 ; and (iii) t j = 0.2 ( 0.2 ) 1 for π 1 j = 0.5 and ρ j = 0.7 . Under various parameter settings, Figure 5 shows that the asymptotic sample sizes become smaller when π 1 j ( j = 1 , 2 , , J ) increases. The five sample sizes increase with bigger ρ j ( j = 1 , 2 , , J ) . As t j ( j = 1 , 2 , , J ) approaches 1, the values of asymptotic sample sizes will be smaller. This reflects that the balanced design (i.e., N 1 j = N 2 j for j = 1 , 2 , , J ) of two groups needs a smaller sample size for the prespecified power. When the number of strata changes, the sample sizes have similar values for J = 2 , 4 , 6 . Strata number has little effect on sample sizes.

5. A Real Example

Mandel et al. [21] conducted a double-blinded clinical trial to illustrate the proposed sample size considerations. In this trial, children with otitis media with effusion (OME) were stratified by age and randomly assigned to the cefaclor or amoxicillin treatment groups. There are three strata (<2, 2–5 and >5 yr) divided according to age as shown in Table 4.
From Table 4, it is obvious that k 1 = ( 18 + 15 ) / 75 = 33 / 75 , k 2 = ( 22 + 9 ) / 75 = 31 / 75 , k 3 = ( 4 + 7 ) / 75 = 11 / 75 . Let t j = M + 2 / M + 1 = ( 18 + 22 + 4 ) / ( 15 + 9 + 7 ) = 1.4 for j = 1 , 2 , 3 . The Fisher scoring algorithm can derive the global MLEs under H a : δ 1 = δ 2 = δ 3 δ (see in Table 5). The asymptotic sample sizes are obtained based on the global MLEs of parameters ρ j , π 1 j and δ ( j = 1 , 2 , 3 ) .
Table 6 shows the asymptotic sample sizes for the prespecified power 0.8 , 0.9 , and 0.95 under δ 0 = 0.5 , 0.6 . The sample sizes based on T L S have bigger values than other sample size formulae or the iterative methods. The sample sizes from the iterative algorithm are smaller than those from the sample size formula. For δ 0 = 0.5, the sample size 75 in the real example is close to the values of M S C b and M L b even when the power is 0.95. However, only M S C b and M L b for power 0.8 are close to the real sample size for δ 0 = 0.6 . That is to say, the sample size 75 can basically guarantee the power of 0.95 for the test statistics T L and T S C under H 0 : δ = 0.5 . Moreover, when the values of δ under H a and H 0 become closer, we need the larger sample size to keep the power at the fixed significance level. Then, we conduct the common test under the null hypothesis H 0 : δ = 0.5 , 0.6 . The values and p-values of the test statistics are recorded in Table 7. For δ 0 = 0.5, the p-values of five statistics T i ( L , S C , W , L S , l o g ) are 0.0029, 0.0084, 0.0040, 0.0384, and 0.0311, respectively. Obviously, the p-values are lower than 0.05. Thus, we have full evidence to reject H 0 : δ = 0.5 at the significance level 0.05. Under H 0 : δ = 0.6 , there are some p-values smaller than 0.05 but others bigger than 0.05. Among the estimated sample sizes, only M L b and M S C b have close values to real sample sizes given power 0.8. Since p-values of T L and T S C are 0.0373 and 0.0490, the hypothesis H 0 : δ = 0.6 is rejected at the significance level 0.05.

6. Conclusions

This paper mainly investigated the asymptotic sample size for the common test of relative risk ratio in stratified bilateral data. Under Donner’s model, we obtained the sample size formula and proposed the iterative method based on the likelihood ratio statistic T L , score statistic T S C , and Wald-type statistic T W . We also applied the sample size methods to the pooled MLE-based Wald-type statistic T L S and the log-transformation T l o g .
Numerical simulations are designed to evaluate the performance of estimated sample sizes. According to the simulations, the main conclusions lie in five aspects. (i) The estimated sample sizes have empirical power closer to the prespecified power level. (ii) The sample sizes M L b , M S C b , M W b , M L S b and M l o g b , which are obtained by the iterative algorithm, outperform other asymptotic sample sizes because of accurate values and satisfactory power. (iii) The sample size M S C a has more robust TIEs than M L S a and M l o g a from the sample size formula. (iv) The sample sizes are sensitive to π 1 j and ρ j ( j = 1 , 2 , , J ) when other parameters are fixed. (v) The sample sizes under unbalanced designs of two groups are greater than those under balanced designs for given power and significance levels.
This paper mainly focuses on the power-controlled sample size of the common test of relative risk ratios for stratified correlated bilateral data. However, one may want to calculate the necessary sample size such that the width of the ( 1 α ) % confidence interval (CI) does not exceed a prespecified quantity. This type of sample size is called CI-width controlled sample size determination [8]. In the future, we will study the CI-width controlled sample size for the relative risk ratio in stratified paired data. Through numerical simulation, we compared the performance of the proposed methods in terms of empirical TIEs and powers. An example was used to illustrate our proposed methods. However, we have not analyzed the theoretical results of the efficiency of these methods. These problems will be considered in our future works.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/math11194198/s1, the MATLAB code of the iterative algorithm.

Author Contributions

Conceptualization, investigation, project administration, writing—original draft, K.M. and Z.L.; Writing—review and editing, supervision, C.M. All authors have read and agreed to the published version of the manuscript.

Funding

The work was supported by 2022 Innovation Project for Excellent Doctoral Postgraduates of Xinjiang University (XJU2022BS027), the National Natural Science Foundation of China (12061070) and the Natural Science Foundation of Xinjiang Uygur Autonomous Region (2021D01E13).

Data Availability Statement

The real example data is from the article: Mandel, E.; Bluestone, C.; Rockette, H.; Blatter, M.; Reisinger, K.; Wucher, F.; Harper, J. Duration of effusion after antibiotic treatment for acute otitis media: comparison of Cefaclor and Amoxicillin. Pediatr. Infect. Dis. J. 1982, 1, 310–316.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A. Derivation for the Log-Likelihood Function under H0

Under H 0 , the log-likelihood function can be rewritten as
0 ( π 1 , δ 0 , ρ ) = j = 1 J l 0 j ( π 1 j , δ 0 , ρ j ) + log C ,
where
l 0 j ( π 1 j , δ 0 , ρ j ) = m 01 j log [ ρ j ( 1 π 1 j ) + ( 1 ρ j ) ( 1 π 1 j ) 2 ] + m 11 j log [ 2 π 1 j ( 1 ρ j ) ( 1 π 1 j ) ] + m 21 j log [ ρ j π 1 j + ( 1 ρ j ) π 1 j 2 ] + m 02 j log [ ρ j ( 1 π 1 j δ 0 ) + ( 1 ρ j ) ( 1 π 1 j δ 0 ) 2 ] + m 12 j log [ 2 π 1 j δ 0 ( 1 ρ j ) ( 1 π 1 j δ 0 ) ] + m 22 j log [ ρ j π 1 j δ 0 + ( 1 ρ j ) ( π 1 j δ 0 ) 2 ] .

Appendix B. Derivation for Score Statistic

The information matrix I for score test statistic is written as
I = I 11 I 12 I 13 I 12 T I 22 I 23 I 13 T I 23 T I 33 ,
where
I 11 = E ( j = 1 J 2 l j δ 2 ) = j = 1 J N 2 j π 1 j 2 { ( 1 + 2 δ 2 π 1 j 2 ) ( 1 ρ j ) 2 2 δ π 1 j ( 1 ρ j ) ( 2 ρ j ) + 1 ρ j ( 1 δ π 1 j ) + ( 1 ρ j ) ( 1 δ π 1 j ) 2 + 2 ( 1 ρ j ) ( 2 δ 2 π 1 j 2 2 δ π 1 j + 1 ) δ π 1 j ( 1 δ π 1 j ) + 2 δ 2 π 1 j 2 ( 1 ρ j ) 2 + 2 δ π 1 j ρ j ( 1 ρ j ) + ρ j 2 ρ j δ π 1 j + ( 1 ρ j ) ( δ π 1 j ) 2 } = j = 1 J i 11 ( j ) ,
I 12 = E 2 l 1 δ ρ 1 , E 2 l 2 δ ρ 2 , , E 2 l J δ ρ J = { N 21 π 11 ( 1 δ π 11 ) 2 ρ 1 ( 1 δ π 11 ) + ( 1 ρ 1 ) ( 1 δ π 11 ) 2 + ( δ π 11 ) 2 ρ 1 δ π 11 + ( 1 ρ 1 ) ( δ π 11 ) 2 , N 22 π 12 ( 1 δ π 12 ) 2 ρ 2 ( 1 δ π 12 ) + ( 1 ρ 2 ) ( 1 δ π 12 ) 2 + ( δ π 12 ) 2 ρ 2 δ π 12 + ( 1 ρ 2 ) ( δ π 12 ) 2 , N 2 J π 1 J ( 1 δ π 1 J ) 2 ρ J ( 1 δ π 1 J ) + ( 1 ρ J ) ( 1 δ π 1 J ) 2 + ( δ π 1 J ) 2 ρ J δ π 1 J + ( 1 ρ J ) ( δ π 1 J ) 2 } = [ i 12 ( 1 ) , i 12 ( 2 ) , , i 12 ( J ) ] , I 13 = E 2 l 1 π 11 δ , E 2 l 2 π 12 δ , , E 2 l J π 1 J δ = [ E 1 π 11 l 1 δ δ π 11 2 l 1 δ 2 , E 1 π 12 l 2 δ δ π 12 2 l 2 δ 2 , , E 1 π 1 J l J δ δ π 1 J 2 l J δ 2 ] = δ π 11 i 11 ( 1 ) , δ π 12 i 11 ( 2 ) , , δ π 1 J i 11 ( J ) ,
I 22 = diag E 2 l j ρ j = diag { N 1 j π 1 j 2 ( 1 π 1 j ) 2 ρ j ( 1 π 1 j ) + ( 1 ρ j ) ( 1 π 1 j ) 2 + 2 π 1 j ( 1 π 1 j ) 1 ρ j + ( 1 π 1 j ) 2 π 1 j 2 ρ j π 1 j + ( 1 ρ j ) π 1 j 2 + N 2 j δ 2 π 1 j 2 ( 1 δ π 1 j ) 2 ρ j ( 1 δ π 1 j ) + ( 1 ρ j ) ( 1 δ π 1 j ) 2 + 2 π 1 j ( 1 π 1 j ) 1 ρ j + ( 1 δ π 1 j ) 2 ( δ π 1 j ) 2 ρ j δ π 1 j + ( 1 ρ j ) δ π 1 j 2 } , I 23 = diag E 2 l j π 1 j ρ j = diag { N 1 j ( ( 1 π 1 j ) 2 ρ j ( 1 π 1 j ) + ( 1 ρ j ) ( 1 π 1 j ) 2 + π 1 j 2 ρ j π 1 j + ( 1 ρ j ) π 1 j 2 ) + δ π 1 j i 12 ( j ) } , I 33 = diag E 2 l j π 1 j 2 = diag { N 1 j ( ( 1 + 2 π 1 j 2 ) ( 1 ρ j ) 2 2 π 1 j ( 1 ρ j ) ( 2 ρ j ) + 1 ρ j ( 1 π 1 j ) + ( 1 ρ j ) ( 1 π 1 j ) 2 + 2 ( 1 ρ j ) ( 2 π 1 j 2 2 π 1 j + 1 ) π 1 j ( 1 π 1 j ) + 2 π 1 j 2 ( 1 ρ j ) 2 + 2 π 1 j ρ j ( 1 ρ j ) + ρ j 2 π 1 j ρ j + ( 1 ρ j ) π 1 j 2 ) + δ 2 π 1 j 2 i 11 ( j ) } .
The first diagonal element in the inverse matrix I is
I 1 ( 1 , 1 ) = I 11 I 12 , I 13 I 22 I 23 I 23 I 33 1 I 12 T I 13 T 1 .
Thus, the test statistics T S C can be simplified as
T S C = ( j = 1 J e j ) 2 I 1 ( 1 , 1 ) | δ = δ 0 , ρ = ρ ˜ , π 1 = π ˜ 1 ,
where
e j = m 02 j π 1 j ( 2 ( 1 ρ j ) ( 1 δ π 1 j ) + ρ j ) ρ j ( 1 δ π 1 j ) + ( 1 ρ j ) ( 1 δ π 1 j ) 2 + m 12 j ( 1 2 δ π 1 j ) δ ( 1 δ π 1 j ) + m 22 j ( 2 δ π 1 j ( 1 ρ j ) + ρ j ) δ ρ j + ( 1 ρ j ) δ 2 π 1 j .

References

  1. Rosner, B. Statistical methods in ophthalmology: An adjustment for the correlation between eyes. Biometrics 1982, 38, 105–114. [Google Scholar] [CrossRef]
  2. Dallal, G. Paired Bernoulli trials. Biometrics 1988, 44, 253–257. [Google Scholar] [CrossRef]
  3. Donner, A. Statistical methods in ophthalmology: An adjusted chi-square approach. Biometrics 1989, 45, 605–611. [Google Scholar] [CrossRef]
  4. Tang, N.; Tang, M.; Qiu, S. Testing the equality of proportions for correlated otolaryngologic data. Comput. Stat. Data Anal. 2008, 52, 3719–3729. [Google Scholar] [CrossRef]
  5. Pei, Y.; Tang, M.; Wong, W.; Tang, N. Testing equality of correlations of two paired binary responses from two treated groups in a randomized trial. J. Biopharm. Stat. 2011, 21, 511–525. [Google Scholar] [CrossRef]
  6. Mou, K.; Ma, C.; Li, Z. Homogeneity test of relative risk ratios for stratified bilateral data under different algorithms. J. Appl. Stat. 2023, 50, 1060–1077. [Google Scholar] [CrossRef]
  7. Tang, N.; Qiu, S. Homogeneity test, sample size determination and interval construction of difference of two proportions in stratified bilateral-sample designs. J. Stat. Plan. Inference 2012, 142, 1242–1251. [Google Scholar] [CrossRef]
  8. Tang, M.; Tang, N.; Carey, V. Sample size determination for 2-step studies with dichotomous response. J. Stat. Plan. Inference 2006, 136, 1166–1180. [Google Scholar] [CrossRef]
  9. Tang, M.; Tang, N.; Rosner, B. Statistical inference for correlated data in ophthalmologic studies. Stat. Med. 2006, 25, 2771–2783. [Google Scholar] [CrossRef]
  10. Liu, X.; Shan, G.; Tian, L.; Ma, C. Exact methods for testing homogeneity of proportions for multiple groups of paired binary data. Commun. Stat.-Simul. C 2017, 46, 6074–6082. [Google Scholar] [CrossRef]
  11. Shan, G. Exact approaches for testing non-inferiority or superiority of two incidence rates. Stat. Probab. Lett. 2014, 85, 129–134. [Google Scholar] [CrossRef]
  12. Tang, N.; Qiu, S.; Tang, M.; Pei, Y. Asymptotic confidence interval construction for proportion difference in medical studies with bilateral data. Stat. Methods Med. Res. 2011, 20, 233–259. [Google Scholar] [CrossRef] [PubMed]
  13. Pei, Y.; Tang, M.; Wong, W.; Gao, J. Confidence intervals for correlated proportion differences from paired data in a two-arm randomised clinical trial. Stat. Methods Med. Res. 2012, 21, 167–187. [Google Scholar] [CrossRef] [PubMed]
  14. Qiu, S.; Poon, W.; Tang, M. Sample size determination for disease prevalence studies with partially validated data. Stat. Methods Med. Res. 2012, 25, 37–63. [Google Scholar] [CrossRef]
  15. Qiu, S.; Zeng, X.; Tang, M.; Pei, Y. Test procedure and sample size determination for a proportion study using a doublesampling scheme with two fallible classifiers. Stat. Methods Med. Res. 2019, 28, 1019–1043. [Google Scholar] [CrossRef] [PubMed]
  16. Sun, S.; Li, Z.; Jiang, H. Homogeneity test and sample size of risk difference for stratified unilateral and bilateral data. Commun. Stat.-Simul. C 2022, 1–24. [Google Scholar] [CrossRef]
  17. Lloyd, C.; Ripamonti, E. comprehensive open-source library for exact required sample size in binary clinical trials. Contemp. Clin. Trials 2021, 107, 106491. [Google Scholar] [CrossRef]
  18. Tang, N.; Yu, B. Bayesian sample size determination in a three-arm non-inferiority trial with binary endpoints. J. Biopharm. Stat. 2022, 32, 768–788. [Google Scholar] [CrossRef]
  19. Pilz, M. Sample size calculation for one-armed clinical trials with clustered data and binary outcome. Biom. J. 2023, 28, e2300123. [Google Scholar] [CrossRef]
  20. Zhuang, T.; Tian, G.; Ma, C. Homogeneity test of ratio of two proportions in stratified bilateral data. Stat. Biopharm. Res. 2019, 11, 200–209. [Google Scholar] [CrossRef]
  21. Mandel, E.; Bluestone, C.; Rockette, H.; Blatter, M.; Reisinger, K.; Wucher, F.; Harper, J. Duration of effusion after antibiotic treatment for acute otitis media: Comparison of Cefaclor and Amoxicillin. Pediatr. Infect. Dis. J. 1982, 1, 310–316. [Google Scholar] [CrossRef] [PubMed]
Figure 1. Asymptotic sample size for 1 β = 0.9 .
Figure 1. Asymptotic sample size for 1 β = 0.9 .
Mathematics 11 04198 g001
Figure 2. Empirical power (%) based on estimated sample sizes for 1 β = 0.9 .
Figure 2. Empirical power (%) based on estimated sample sizes for 1 β = 0.9 .
Mathematics 11 04198 g002
Figure 3. Empirical TIEs (%) based on estimated sample sizes for 1 β = 0.9 .
Figure 3. Empirical TIEs (%) based on estimated sample sizes for 1 β = 0.9 .
Mathematics 11 04198 g003
Figure 4. Sample sizes under various methods.
Figure 4. Sample sizes under various methods.
Mathematics 11 04198 g004
Figure 5. The effect of the parameter settings for sample size ( δ 0 = 1 , k j = 1 / J ).
Figure 5. The effect of the parameter settings for sample size ( δ 0 = 1 , k j = 1 / J ).
Mathematics 11 04198 g005
Table 1. Data structure in the jth stratum.
Table 1. Data structure in the jth stratum.
Number of Responses ( l ) Group ( i ) Total
1 2
0 m 01 j ( p 01 j ) m 02 j ( p 02 j ) S 0 j
1 m 11 j ( p 11 j ) m 12 j ( p 12 j ) S 1 j
2 m 21 j ( p 21 j ) m 22 j ( p 22 j ) S 2 j
Total N 1 j N 2 j M j
Table 2. Pooled data structure.
Table 2. Pooled data structure.
Number of Responses ( l ) Group ( i ) Total
1 2
0 M 01 = j = 1 J m 01 j M 02 = j = 1 J m 02 j M 0 +
1 M 11 = j = 1 J m 11 j M 12 = j = 1 J m 12 j M 1 +
2 M 21 = j = 1 J m 21 j M 22 = j = 1 J m 22 j M 2 +
Total M + 1 M + 2 M
Table 3. The scenarios of parameter settings in the simulations.
Table 3. The scenarios of parameter settings in the simulations.
Scenario π 1 j ρ j k
i0.40.5(1/3, 1/3, 1/3)
i i 0.40.5(0.5, 0.3, 0.2)
i i i 0.40.3(1/3, 1/3, 1/3)
i v 0.40.3(0.5, 0.3, 0.2)
v0.60.5(1/3, 1/3, 1/3)
v i 0.60.5(0.5, 0.3, 0.2)
v i i 0.60.3(1/3, 1/3, 1/3)
v i i i 0.60.3(0.5, 0.3, 0.2)
Scenarios i v i i i represent various parameter combinations of π 1 j , ρ j and k .
Table 4. Frequency of OME-free ears after treatment.
Table 4. Frequency of OME-free ears after treatment.
Number of OME-Free Ears<2 yr2–5 yr>5 yrTotal
Cefaclor Amoxicillin Cefaclor Amoxicillin Cefaclor Amoxicillin
0811630129
122611012
2821053634
Total18152294775
Table 5. Global MLEs in the real example.
Table 5. Global MLEs in the real example.
AgeStratum ( j ) π ^ 1 j ρ ^ j δ j ^
<2 yr10.3770.7360.937
2–5 yr20.6060.5320.937
>5 yr30.8850.6240.937
Table 6. Asymptotic sample sizes for the real example.
Table 6. Asymptotic sample sizes for the real example.
δ 0 Power M SC a M LS a M log a M L b M SC b M W b M LS b M log b
0.50.8070127674343538653
0.90811538253627210077
0.95881709170748612294
0.60.80129213134777991151120
0.90151259163100110122182154
0.95164285180132132146218192
Table 7. Values and p-values of the test statistics for the real example.
Table 7. Values and p-values of the test statistics for the real example.
δ 0 Result T L T SC T W T LS T log
0.5Value8.84756.95518.26664.28534.6490
p-value0.00290.00840.00400.03840.0311
0.6Value4.33633.87674.91581.65141.7826
p-value0.03730.04900.02660.19880.1818
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Mou, K.; Li, Z.; Ma, C. Asymptotic Sample Size for Common Test of Relative Risk Ratios in Stratified Bilateral Data. Mathematics 2023, 11, 4198. https://doi.org/10.3390/math11194198

AMA Style

Mou K, Li Z, Ma C. Asymptotic Sample Size for Common Test of Relative Risk Ratios in Stratified Bilateral Data. Mathematics. 2023; 11(19):4198. https://doi.org/10.3390/math11194198

Chicago/Turabian Style

Mou, Keyi, Zhiming Li, and Changxing Ma. 2023. "Asymptotic Sample Size for Common Test of Relative Risk Ratios in Stratified Bilateral Data" Mathematics 11, no. 19: 4198. https://doi.org/10.3390/math11194198

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop