Next Article in Journal
Efficient Monitoring of a Parameter of Non-Normal Process Using a Robust Efficient Control Chart: A Comparative Study
Previous Article in Journal
Pricing European Vulnerable Options with Jumps and Stochastic Default Obstacles Barrier under Regime Switching
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Testing the Homogeneity of Differences between Two Proportions for Stratified Bilateral and Unilateral Data across Strata

Department of Biostatistics, The State University of New York at Buffalo, Buffalo, NY 14260, USA
*
Author to whom correspondence should be addressed.
Mathematics 2023, 11(19), 4156; https://doi.org/10.3390/math11194156
Submission received: 2 September 2023 / Revised: 26 September 2023 / Accepted: 2 October 2023 / Published: 3 October 2023
(This article belongs to the Section Probability and Statistics)

Abstract

:
Medical comparative studies often involve collecting data from paired organs, which can produce either bilateral or unilateral data. While many testing procedures are available that account for the intra-class correlation between paired organs for bilateral data, more research needs to be conducted to determine how to analyze combined correlated bilateral and unilateral data. In practice, stratification is often used in analysis to ensure participants are allocated equally to each experimental condition. In this paper, we propose three Maximum Likelihood Estimation (MLE)-based methods for testing the homogeneity of differences between two proportions for stratified bilateral and unilateral data across strata using Donner’s model. We compare the performance of these methods with a model-based method based on Generalized Estimating Equations using Monte Carlo simulations. We also provide a real example to illustrate the proposed methodologies. Our findings suggest that the Score test performs well and offers a valuable alternative to the exact tests in future studies.

1. Introduction

Medical comparative studies often involve collecting either bilateral or unilateral data from subjects. For instance, in ophthalmologic studies, it is meaningful to assume that the information between two eyes from the same subject is generally correlated. Rosner [1] pointed out that the fundamental unit for statistical analysis in ophthalmologic studies is often the eye rather than the person. For example, when an individual contributes both eyes’ information to analysis, such as comparing intraocular pressures between age groups, the values of the two eyes are generally correlated. Therefore, methods of analysis that treat each eye independently are not valid. For bilateral data, Rosner [1] proposed a constant R model to test whether the proportions of affected eyes are the same among the g groups of patients while accounting for the intra-person dependence. Dallal [2] criticized the appropriateness of Rosner’s model, noting that if the trait would almost certainly occur bilaterally with widely differing population-specific prevalences, the model would give a poor fit, and thus they proposed an alternative approach based on compound multinomial sampling. Donner [3] proposed an alternative approach based on a simple adjustment of the standard Pearson chi-square test for the homogeneity of proportions. Neuhaus [4] compared the magnitudes of covariate effects and the differences in the interpretation of the regression coefficients measured by the different classes of approaches. Tang et al. [5] investigated eight procedures for testing the equality of proportions between two groups in correlated otolaryngologic data. The empirical results show that tests based on the approximate unconditional method usually produce better empirical type I error rates than their asymptotic tests. Based on Donner [3]’s model, Lin et al. [6] discussed the relationship between the disease probability and covariates via the logistic regression in ophthalmologic studies and proposed a new minorization-–maximization (MM) algorithm and a fast quadratic lower bound (QLB) algorithm to calculate the MLEs of the vector of regression coefficients. Most recently, Westgate [7] assessed the residual pseudo-likelihood approach with the GLIMMIX procedure in SAS and deemed it a practical and reliable method for estimating the intra-cluster correlation coefficient in binary outcome data from cluster randomized trials. It allows for easy confidence interval construction and offers the advantage of small-sample adjustments to ensure valid statistical inference. Furthermore, the Poisson regression model with a sandwich variance estimator provides covariate-adjusted risk ratios and standard errors for prospective studies with independent binary outcomes. Zou and Donner [8] adjusted the middle term in the sandwich estimator and extended the model to studies with correlated outcomes in longitudinal or cluster randomization studies. Also, Li and Tong [9] stated that sample size formulas for relative risk regression can be applied to the modified Poisson model in cluster-randomized trials without losing efficiency. These formulas also work well with variable cluster sizes when appropriate corrections into the sandwich variance estimator are used, and a minimum of 10 clusters is present.
Obviously, procedures that fail to utilize both bilateral and unilateral data would be less powerful. Researchers have provided some comprehensive studies. Pei et al. [10] studied ten test statistics to test the equality of two proportions and found that Rosner and Wald-type statistics based on dependency models and constrained maximum likelihood estimation perform satisfactorily for small to large samples. For general g 2 groups, Wang and Ma [11] worked on the combined bilateral and unilateral data and developed three types of confidence intervals for the relative ratio. Their simulation results show that score tests are always robust under different configurations with the median near the nominal type I error rate, and they yield higher powers than other test statistics, assuming the paired organs are not completely independent. The score CIs are the closest to the nominal coverage probability with reasonable interval widths on the whole parameter space.
In practice, stratified data analysis has attracted much attention, and several statistical methods have been proposed in clinical trials. Stratification involves dividing subjects into subgroups based on specific characteristics, such as age, gender, disease severity, or other relevant factors. By stratifying subjects, researchers can ensure that each treatment group has a similar distribution of these characteristics, which can help to minimize the impact of these variables on the outcomes of the trial. Nam [12] provided the sample size using the modified homogeneity score method and compared it with the goodness-of-fit method in stratified studies. Li and Tang [13] showed that the modified score test based on the bootstrap resampling is a desirable test procedure for testing the homogeneity of rate ratios in stratified matched–paired designs. Tang and Qiu [14] explored the homogeneity test of differences between two proportions in a stratified bilateral sample design, and a modified score test statistic was proposed. Further, Tang et al. [15] developed ten confidence intervals for risk differences, and notably, eight hybrid confidence intervals do not rely on any model assumption. The simulation results show that the hybrid confidence interval based on Mid-P individual confidence interval is well-controlled but without a closed-form solution for the individual confidence interval. Most recently, Shen et al. [16] introduced a common test and interval estimation of the difference between two proportions on stratified bilateral designs under an equal intra-class correlation coefficients model. This paper is well-motivated but only considers bilateral data. The analysis methods that do not take into account unilateral data are obviously not comprehensive enough.
In the stratified data analysis with two groups (experimental vs. placebo group), the homogeneity test of the difference of two proportions is used to see whether the differences of the group effects are the same across strata. When the differences across strata are significantly different, we can conclude that a confounding effect exists. If insignificant, we can combine the stratified data into a pooled two-group dataset. Based on the previous discussion, we apply a homogeneity test of the differences between two proportions across strata into stratified bilateral and unilateral data. In this article, focusing on Donner’s common ρ model, we will derive the constrained and unconstrained MLEs and develop three MLE-based testing procedures in Section 2. In addition, a model-based method treats the measurements from paired organs of each subject as repeated observations, which will also be provided for comparison with the MLE-based methods. Generalized estimating equations (GEEs) will be briefly introduced as the theoretical underpinning of how the model-based method handles correlated binary outcomes. In Section 3, Monte Carlo simulations are conducted to evaluate the test performances concerning the Type I error rates and powers through various configurations. In Section 4, a real example from an otolaryngological study is shown using the proposed method. Discussions are presented in Section 5.

2. Method

2.1. MLE-Based Method

For two groups, suppose there are J strata and total M = M j subjects that contribute bilateral data and N = N j subjects that contribute unilateral data. Let m l i j and n l i j be the number of bilateral and unilateral patients having l ( l = 0 , 1   or   2 ) response(s) in the ith ( i = 1 , 2 ) group of the jth ( j = 1 , , J ) stratum. For each stratum, we denote m + i j = m 0 i j + m 1 i j + m 2 i j ,   n + i j = n 0 i j + n 1 i j , where i = 1 , 2 , m l + j = m l 1 j + m l 2 j ,   n l + j = n l 1 j + n l 2 j , where l = 0 , 1   or   2 . And M j = m + 1 j + m + 2 j , N j = n + 1 j + n + 2 j , where j = 1 , , J . The data structure in the jth stratum is shown in Table 1.
For bilateral data, we denote Z i j h k = 1 as a cured(or affected) event of the kth eye ( k = 1 , 2 ) of the hth patient in the ith group from jth strata; otherwise, Z i j h k = 0 . Under Donner’s model, given jth stratum, P ( Z i j k h = 1 ) = π i j , where π i j is the event proportion of ith group in jth stratum, and C o r r ( Z i j h 1 ,   Z i j h 2 ) = ρ j , where 1 ρ j 1 is the common correlation in jth stratum. Then, the probabilities of an expected response for none, one, and both paired organs are
p Z 0 i j = ρ j ( 1 π i j ) + ( 1 ρ j ) ( 1 π i j ) 2 ,
p Z 1 i j = 2 π i j ( 1 π i j ) ( 1 ρ j ) ,
p Z 2 i j = ρ j π i j + ( 1 ρ j ) π i j 2 .
For unilateral data, similar to the approach used for bilateral data, we define Y i j h = 1 as a cured(or affected) event of any one of the paired organs that contributes response of the hth patient in the ith group from the jth strata, otherwise, Y i j h = 0 . And P ( Y i j h = 1 ) = π i j . Then, we have
p Y 0 i j = 1 π i j ,
p Y 1 i j = π i j .
Further, the joint probability function f Z Y is
f Z Y = j = 1 J f Z j Y j = j = 1 J i = 1 2 m + i j ! m 0 i j ! m 1 i j ! m 2 i j ! n + i j ! n 0 i j ! n 1 i j ! p Z 0 i j m 0 i j p Z 1 i j m 1 i j p Z 2 i j m 2 i j p Y 0 i j n 0 i j p Y 1 i j n 1 i j .
Let d j = π 2 j π 1 j ,   j = 1 , , J , be the risk difference between the two groups with combined bilateral and unilateral data in the jth stratum. Our interest is in testing the homogeneity of the proportion differences across strata, which is
H 0 :   d 1 = d 2 = · · · = d J     v s     H 1 :   d s d t   for some   s t ,   where   1 s , t J .

2.1.1. Maximum Likelihood Estimates

Unconstrained MLE

Under no constraints, the corresponding log-likelihood l ( π 1 , π 2 , ρ ) can be expressed as
l ( π 1 , π 2 , ρ ) = j = 1 J l j ( π 1 j , π 2 j , ρ j ) + log C ,
where
l j ( π 1 j , π 2 j , ρ j ) = i = 1 2 m 0 i j log [ ρ j ( 1 π i j ) + ( 1 ρ j ) ( 1 π i j ) 2 ]   + m 1 i j log [ 2 π i j ( 1 π i j ) ( 1 ρ j ) ]   + m 2 i j log [ ρ j π i j + ( 1 ρ j ) π i j 2 ]   + n 0 i j log ( 1 π i j ) + n 1 i j log π i j .
We differentiate l j with respect to π i j ’s and ρ j ’s, then set the equations equal to zero to obtain the unconstrained MLEs.
l π i j = 0 ,     l ρ j = 0 ,     j = 1 , , J .
There is no closed-form solution, so we apply a two-step approach to estimate the parameters. First, we can simplify the above equation as a fourth-order polynomial of π i j and then solve the π i j . The solution is simplified by Wang and Ma [17]; that is,
π ^ i j = b b 2 3 c ( 2 M j + N j ) ( cos ( θ ) 3 sin ( θ ) ) ( 2 M j + N j ) ( 3 ρ j 3 ) ,
where
D i j = ( m 0 i j ,   m 1 i j ,   m 2 i j ,   n 0 i j ,   n 1 i j ) T , b = ρ ( 3 M j + N j + n 1 + j ) ( 2 , 3 , 4 , 1 , 2 ) D i j T , c = ρ 2 ( M j + n 1 + j ) ρ ( 2 , 4 , 4 , 1 , 3 ) D i j T + m 1 + j + 2 m 2 + j + n 1 + j , θ = 1 3 arccos ( 2 b 3 9 ( 2 M j + N j ) b c + 27 ( 2 M j + N j ) 2 ( m 1 + j + m 2 + j + n 1 + j ) ( 1 ρ j ) ρ j 2 ( b 2 3 c ( 2 M j + N j ) ) 3 / 2 ) .
Second, the estimate of ρ j can be updated using the Fisher Scoring algorithm, which is
ρ j ( t + 1 ) = ρ j ( t ) ( 2 l ρ j 2 ( π 1 j ( t ) ,   π 2 j ( t ) ,   ρ j ( t ) ) ) 1 l ρ j ( π 1 j ( t ) ,   π 2 j ( t ) ,   ρ j ( t ) ) .
We repeat the above two steps until convergence. Then, the risk difference d ^ j can be obtained by π ^ 2 j π ^ 1 j for each stratum. Therefore, we denote the unconstrained MLEs by ( π ^ 1 j ,   d ^ j ,   ρ ^ j ) ,   j = 1 , , J .

Constrained MLEs

Under the null hypothesis d 1 = d 2 = · · · = d J = d , π 2 j can be expressed as π 1 j + d for jth strata. Then, we have
l 0 ( π 1 , d , ρ ) = j = 1 J l 0 j ( π 1 j , ρ j , d ) + log C ,
where
l 0 j ( π 1 j , ρ j , d ) = m 01 j log [ ρ j ( 1 π 1 j ) + ( 1 ρ j ) ( 1 π 1 j ) 2 ]   + m 11 j log [ 2 π 1 j ( 1 π 1 j ) ( 1 ρ j ) ]   + m 21 j log [ ρ j π 1 j + ( 1 ρ j ) π 1 j 2 ]   + n 01 j log ( 1 π 1 j ) + n 11 j log π 1 j   + m 02 j log [ ρ j ( 1 ( π 1 j + d ) ) + ( 1 ρ j ) ( 1 ( π 1 j + d ) ) 2 ]   + m 12 j log [ 2 ( π 1 j + d ) ( 1 ( π 1 j + d ) ) ( 1 ρ j ) ]   + m 22 j log [ ρ j ( π 1 j + d ) + ( 1 ρ j ) ( π 1 j + d ) 2 ]   + n 02 j log ( 1 ( π 1 j + d ) ) + n 12 j log ( π 1 j + d ) .
The MLEs satisfy
l 0 π 1 j = 0 ,     l 0 ρ j = 0 ,   and   l 0 d = 0 ,     j = 1 , , J .
There is no closed-form solution, so we apply the two-step iterative methods. First, we update d using the Newton–Raphson method in each stratum. Second, we apply the Fisher Scoring method to estimate π 1 j and ρ j . The procedures are as follows:
  • Start with the initial value d ( 0 ) = 1 / J j = 1 J d ^ j , π 1 j ( 0 ) = 1 / J j = 1 J π ^ 1 j , ρ j ( 0 ) = 1 / J j = 1 J ρ ^ j , where d ^ j ,   π ^ 1 j ,   a n d   ρ ^ j are unconstrained MLEs.
  • Update d by
    d ( t + 1 ) = d ( t ) 1 j = 1 J 2 l j d 2 ( π 1 j ( t ) ,   ρ j ( t ) ,   d ( t ) ) × j = 1 J l j d ( π 1 j ( t ) ,   ρ j ( t ) ,   d ( t ) )
    .
  • Update π 1 j and ρ j by
    π 1 j ( t + 1 ) ρ j ( t + 1 ) = π 1 j ( t ) ρ j ( t ) + I 1 1 ( π 1 j ( t ) ,   ρ j ( t ) ) × l 0 π 1 j l 0 ρ j d = d ( t + 1 ) ,   π 1 j = π 1 j ( t ) ,   ρ j = ρ j ( t ) ,
    where I 1 is the Fisher Information matrix with respect to π 1 j and ρ j , which is given in Appendix A,
  • Repeat Steps 2 and 3 until convergence.
After completing the above procedures, we obtain the constrained MLEs defined as ( π ^ 1 j , H 0 ,   d ^ H 0 ,   ρ ^ j , H 0 ) , j = 1 , , J .

2.1.2. Test Methods

Likelihood Ratio Test

The Likelihood Ratio test statistics are
T L = 2 [ l ( π ^ 1 ,   d ^ ,   ρ ^ ) l ( π ^ 1 , H 0 ,   d ^ H 0 ,   ρ ^ H 0 ) ] = 2 j = 1 J [ l j ( π ^ 1 j ,   d ^ j ,   ρ ^ j ) l j ( π ^ 1 j , H 0 ,   d ^ H 0 ,   ρ ^ j , H 0 ) ] .
Under the null hypothesis, the LR test statistic asymptotically follows a Chi-square distribution with J 1 degree of freedom.

Wald Test

The null hypothesis H 0 :   d 1 = d 2 = · · · = d J can be rewritten as π 21 π 11 = π 22 π 12 = · · · = π 2 J π 1 J , which can also be expressed as C β T = 0 , where β = [ d 1 ,   π 11 ,   ρ 1 ,   d 2 ,   π 12 ,   ρ 2 ,   · · ·   ,   d J ,   π 1 J ,   ρ J ] and
C = 1   0   0 1   0   0               0 0   0   0     1   0   0 1   0   0               0                             0           1   0   0 1   0   0 ( J 1 ) × 3 J .
The Wald test statistic for testing H 0 has the form
T W = ( β C T ) ( C I 1 C T ) 1 ( C β T ) | β = β ^ = [ d ^ H 0 , π ^ 11 , ρ ^ 1 , d ^ H 0 , π ^ 12 , ρ ^ 2 , , d ^ H 0 , π ^ 1 J , ρ ^ J ] ,
where I is the Fisher Information matrix for β , which is given in Appendix B. And β ^ is the unconstrained MLE of β . Under the null hypothesis, the Wald test statistic is asymptotically distributed as a Chi-square distribution with J 1 degree of freedom. Moreover, if we want to test the alternative hypothesis H a :   d s d t   for any   s t ,   where   1 s , t J , the Wald test statistic becomes
T W a = ( β c T ) ( c I 1 c T ) 1 ( c β T ) | β = β ^
where c = ( 0 , 0 , , 0 , 1 , 0 , 0 , 0 , 1 , 0 , 0 , , 0 ) with ( 1 , 0 , 0 ) in the ( 3 s 2 ) th element to ( 3 s ) th element, ( 1 , 0 , 0 ) in the ( 3 t 2 ) th element to ( 3 t ) th element and 0 otherwise. T W a is asymptotically distributed as a Chi-square distribution with 1 degree of freedom.

Score Test

The Score test statistic for testing the homogeneity of proportion difference is
T S = U I 1 ( d , π 1 , ρ ) U T | d = d ^ H 0 , π 1 = π ^ 1 , H 0 , ρ = ρ ^ H 0 ,
where the score function U is
U = ( U 1 , , U J ) = ( l d , 0 , 0 , , l d , 0 , 0 )
since π 1 and ρ are nuisance parameters. The Score test statistic also follows a Chi-square distribution with J 1 degree of freedom and H 0 should be rejected at the significant level α when T S > χ J 1 , 1 α 2 .

2.2. Model-Based Method

The Generalized Estimating Equations (GEEs) are proposed by Liang and Zeger [18], which are often used to estimate the parameters of a generalized linear model with a possible unknown correlation between outcomes. This approach estimates model parameters by specifying possible working correlation structures to account for the within-subject correlations by iteratively solving a system of equations based on quasi-likelihood distributional assumptions. SAS has algorithms for fitting specified models using GEE. The GENMOD procedure fits generalized linear models. The REPEATED statement under PROC GENMOD invokes the GEE method and specifies the correlation structure. In our case, we specify “unstructured” as the structure of the working correlation matrix. Our interest is in testing the homogeneity of the differences between two proportions across strata. For binary data, we treat the measurements of paired organs from each subject as repeated observations, the binary outcome of individual organs as response variables, and the “group × strata” as a predictor variable. We assign zero to the organ without disease and one to the organ with a disease, and we treat the paired organs with no difference.

3. Simulation Study

We evaluate the testing performance by comparing the three MLE-based and model-based methods using the type I error rate and power. For the type I error rate, we generate Monte Carlo simulations with a balanced sample size m . 1 j = m . 2 j = n . 1 j = n . 2 j = 25, 50, and 100 and unbalanced sample size ( m . 1 j = m . 2 j ,   n . 1 j = n . 2 j ) = (60,40), where j = 1 , , J , and specifically, J = 2, 4 and 8. We also consider ρ = 0.2, 0.5, 0.7, and π 1 =0.3, 0.5. Then, we test the homogeneity of the risk differences across strata under the null hypothesis H 0 :   d 1 = d 2 = = d J = d by choosing d = 0, 0.1, and 0.2. In each configuration, 10,000 random samples are generated under the null hypothesis, and empirical type I error rates are calculated as t h e   n u m b e r   o f   r e j e c t i o n s / 10 , 000 . The calculations are based on converged cases of 10,000 replicates. At a 5% significant level, Table 2, Table 3 and Table 4 report the empirical type I error rates based on all proposed configurations for J = 2, 4, and 8, respectively.
Following Tang et al. [5], a test is considered liberal if the ratio of the empirical type I error to the nominal type I error is greater than 1.2 (i.e., empirical type I error > 0.06 for α = 0.05), conservative if the ratio is smaller than 0.8 (i.e., empirical type I error < 0.04 for α = 0.05), and robust otherwise. From the tables, the GEE method is robust for all scenarios. The Wald test performs well when J = 2, but becomes liberal when J increases. The Likelihood ratio test is liberal when π 1 is large as well as a small or unbalanced sample size. The Score test becomes liberal for some cases with a larger π 1 , d, and ρ under an unbalanced sample size. Moreover, the Score test is robust when d = 0, 0.1. All of the tests perform better as the sample size increases. The program’s running time also needs to be considered to evaluate the above methodologies. Even if the results obtained using the GEE method are satisfactory, its running time is much longer than that of the MLE-based methods.
We also conduct simulations with parameters chosen at random, except for the above parameter settings. For each of J = 2 , 4 , 8 and sample size m = n = 25 , 50 , 100 , the parameters ρ , π 1 , and d are generated from a uniform distribution for 1000 configurations. The empirical type I error rates are calculated after 10,000 replications for each configuration. The Boxplots in Figure 1 summarize the results. The plot shows that the likelihood ratio test performs worst. When J and the sample size are small, the LR test behaves liberally. The Wald test also becomes inflated when J increases. The Score test shows a relatively stable type I error control, and its plots are closest to the nominal level α = 0.05. All three tests behave better when given a larger sample size and a smaller J. In general, the Score test behaves satisfactorily.
Next, we investigate the performance of power for the four tests. The settings of m ,   n ,   ρ ,   a n d   π 1 are the same when calculating the empirical type I error. Furthermore, the parameter settings of the alternative hypothesis are listed in Table 5. Table 6, Table 7 and Table 8 report the empirical powers for the same scenarios for J = 2, 4, 8, respectively. We observe that powers increase when the sample size increases. Also, when the difference of d in the alternative hypothesis increases, the powers increase significantly. In some settings, the likelihood ratio test’s powers are inflated because their empirical type I error rates are greater than the nominal level. Moreover, the powers of the three MLE-based methods are very close, and the model-based test gets the lowest power. Overall, the Score test is recommended because of its satisfactory type I error control and good performance in terms of power.

4. Real Example

We demonstrate the performance of the four mentioned methods by considering the data from a double-blinded clinical trial by Mandel et al. [19]. The study compared cefaclor and amoxicillin for treating acute otitis media with effusion in 214 children (293 ears), who received either unilateral or bilateral tympanocentesis and were randomly assigned to either Cefaclor or Amoxicillin group. After 14 days of treatment, the number of cured ears was recorded at the end of the treatment period. The data are summarized in Table 9 according to the stratum determined based on the children’s age.
According to the data above, the constrained MLEs and unconstrained MLEs of parameters are listed in Table 10. The test statistics and the p-values of the proposed methods are reported in Table 11. All the p-values are greater than the nominal level α = 0.05 , and we conclude that there is no evidence to claim the corresponding d varies across strata for all methods. However, we notice that the unconstrained MLEs of d for different strata are vastly different, which may be due to the small sample size for the groups.

5. Conclusions

In this article, we compare three test procedures with the model-based method to test the homogeneity of differences between two proportions for stratified bilateral and unilateral data across stratum. The three MLE-based methodologies are illustrated in detail, especially the procedures used to compute the constrained and unconstrained MLEs of parameters. According to the simulation results, the Score test shows a relatively stable type I error control and good performance in terms of power. The Wald test behaves well when the strata J = 2, and the Likelihood ratio test works well when the difference d = 0. All four tests are close to the nominal level when the sample size is set to 100. The model-based method is robust and performs well on power, sometimes even more robustly than the Score test. However, this GEE for repeated measures analysis is developed without the explicit form of the test statistics and is very time-consuming. The explicit form of the Score test statistic is useful for its simplicity and further development of the exact test. If we wish to conduct further research to determine the exact test for small sample cases, this model-based method will be nearly impossible because of the extensive calculations involved. Also, in fact, many studies in clinical ophthalmology focus only on the eye instead of a person with paired organs. For example, Saba and Walter [20] worked on a retrospective consecutive case series of 626 eyes (543 patients) with nvAMD treated with 1438 Brolucizumab injections. Some of these patients contributed one eye and some contributed two eyes, and it would be imprudent to ignore the correlation between two eyes from the same patient in a statistical analysis. Therefore, our proposed methodologies related to the comprehensive data structure can be applied to such cases to compare drug efficacy. In summation, the Score test is worth a more in-depth investigation.

Author Contributions

Conceptualization, X.Z. and C.M.; Methodology, X.Z. and C.M.; Writing—Original Draft Preparation, X.Z.; Writing—Review & Editing, X.Z. and C.M. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

The data presented in this study are openly available in reference [19].

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A

Fisher Information for Given d

Under the null hypothesis, we update π 1 j and ρ j by the Fisher-Scoring algorithm. The Fisher information matrix for jth strata is
I 1 ( π 1 j , ρ j , d ) = I 11 ( j ) I 12 ( j ) I 12 ( j ) I 22 ( j ) = m + 1 j A 1 + m + 2 j A 2 + A 3 m + 1 j B 1 + m + 2 j B 2 m + 1 j B 1 + m + 2 j B 2 m + 1 j C 1 + m + 2 j C 2
where
A 1 = ( π 1 j 1 ) ( ρ j 1 ) + π 1 j π 1 j ρ j + 1 π 1 j 1 2 ( π 1 j 1 ) ( ρ j 1 ) + 2 π 1 j ( ρ j 1 ) π 1 j ( π 1 j ρ j 2 π 1 j + ρ j ( π 1 j 1 ) ) 2 π 1 j 2 π 1 j ρ j ( π 1 j 1 ) + ( ρ j 1 ) ( ( π 1 j 1 ) ( ρ j 1 ) π 1 j + π 1 j ρ j + 1 ) π 1 j ρ j π 1 j + 1 ;
A 2 = ( ( ρ j 1 ) ( 2 d + 2 π 1 j ) + 2 ( ρ j 1 ) ( d + π 1 j 1 ) ) ( 1 2 ( d + π 1 j ) ) ( d + π 1 j 1 ) ( d + π 1 j ) ( 2 d + 2 π 1 j ρ j ( d + π 1 j ) ρ j ( d + π 1 j 1 ) ) 2 ( d + π 1 j ) 2 ρ j ( d + π 1 j ) ( d + π 1 j 1 ) + ρ j ( d + π 1 j ) π 1 j d + ( ρ j 1 ) ( d + π 1 j 1 ) + 1 d + π 1 j 1 ( ρ j 1 ) ( ρ j ( d + π 1 j ) π 1 j d + ( ρ j 1 ) ( d + π 1 j 1 ) + 1 ) d + π 1 j ρ j ( d + π 1 j ) 1 ;
A 3 = n + 2 j ( d + π 1 j 1 ) ( d + π 1 j ) + n + 1 j ( π 1 j 1 ) π 1 j ;
B 1 = 2 ( π 1 j 1 ) ( ρ j 1 ) + 2 π 1 j ( ρ j 1 ) ρ j 1 + π 1 j ( ( π 1 j 1 ) ( ρ j 1 ) π 1 j + π 1 j ρ j + 1 ) π 1 j ρ j π 1 j + 1 π 1 j ( π 1 j 1 ) ( π 1 j ρ j 2 π 1 j + ρ j ( π 1 j 1 ) ) π 1 j 2 π 1 j ρ j ( π 1 j 1 ) ;
B 2 = ( ρ j 1 ) ( 2 d + 2 π 1 j ) + 2 ( ρ j 1 ) ( d + π 1 j 1 ) ρ j 1 ( d + π 1 j ) ( ρ j ( d + π 1 j ) π 1 j d + ( ρ j 1 ) ( d + π 1 j 1 ) + 1 ) d + π 1 j ρ j ( d + π 1 j ) 1 + ( d + π 1 j ) ( d + π 1 j 1 ) ( 2 d + 2 π 1 j ρ j ( d + π 1 j ) ρ j ( d + π 1 j 1 ) ) ( d + π 1 j ) 2 ρ j ( d + π 1 j ) ( d + π 1 j 1 ) ;
C 1 = π 1 j 2 ( π 1 j 1 ) π 1 j ρ j π 1 j + 1 2 π 1 j ( π 1 j 1 ) ρ j 1 π 1 j 2 ( π 1 j 1 ) 2 π 1 j 2 π 1 j ρ j ( π 1 j 1 ) ;
C 2 = ( d + π 1 j ) 2 ( d + π 1 j 1 ) ( d + π 1 j ρ j ( d + π 1 j ) 1 ) ( 2 d + 2 π 1 j ) ( d + π 1 j 1 ) ( ρ j 1 ) ( d + π 1 j ) 2 ( d + π 1 j 1 ) 2 ( ( d + π 1 j ) 2 ρ j ( d + π 1 j ) ( d + π 1 j 1 ) ) .

Appendix B

Fisher Information Matrix for Wald Test and Score Test

The Fisher information matrix I is a 3 J × 3 J block diagonal matrix
I = I 1 0 0 0 I 2 0 0 I J ,
where the diagonal elements
I j = I 11 j I 12 j I 13 j I 12 j I 22 j I 23 j I 13 j I 23 j I 33 j ,
with
I 11 j = I 12 j = n + 2 j ( d + π 1 j 1 ) ( d + π 1 j ) + m + 2 j [ ( ( ρ j 1 ) ( 2 d + 2 π 1 j ) + 2 ( ρ j 1 ) ( d + π 1 j 1 ) ) ( 2 ( d + π 1 j ) 1 ) ( d + π 1 j 1 ) ( d + π 1 j ) ( 2 d + 2 π 1 j ρ j ( d + π 1 j ) ρ j ( d + π 1 j 1 ) ) 2 ( d + π 1 j ) 2 ρ j ( d + π 1 j ) ( d + π 1 j 1 ) + ( ρ j ( d + π 1 j ) π 1 j d + ( ρ j 1 ) ( d + π 1 j 1 ) + 1 ) d + π 1 j 1 ( ρ j 1 ) ( ρ j ( d + π 1 j ) π 1 j d + ( ρ j 1 ) ( d + π 1 j 1 ) + 1 ) d + π 1 j ρ j ( d + π 1 j ) 1 ] ;
I 13 j = m + 2 j [ ( ρ j 1 ) ( 2 d + 2 π 1 j ) + 2 ( ρ j 1 ) ( d + π 1 j 1 ) ρ j 1 ( d + π 1 j ) ( ρ j ( d + π 1 j ) π 1 j d + ( ρ j 1 ) ( d + π 1 j 1 ) + 1 ) d + π 1 j ρ j ( d + π 1 j ) 1 + ( d + π 1 j ) ( d + π 1 j 1 ) ( 2 d + 2 π 1 j ρ j ( d + π 1 j ) ρ j ( d + π 1 j 1 ) ) ( d + π 1 j ) 2 ρ j ( d + π 1 j ) ( d + π 1 j 1 ) ] ;
I 22 j = n + 2 j ( d + π 1 j 1 ) ( d + π 1 j ) + n + 1 j ( π 1 j 1 ) π 1 j + m + 2 j [ ( ( ρ j 1 ) ( 2 d + 2 π 1 j ) + 2 ( ρ j 1 ) ( d + π 1 j 1 ) ) ( 2 ( d + π 1 j ) 1 ) ( d + π 1 j 1 ) ( d + π 1 j ) ( 2 d + 2 π 1 j ρ j ( d + π 1 j ) ρ j ( d + π 1 j 1 ) ) 2 ( d + π 1 j ) 2 ρ j ( d + π 1 j ) ( d + π 1 j 1 ) + ρ j ( d + π 1 j ) π 1 j d + ( ρ j 1 ) ( d + π 1 j 1 ) + 1 d + π 1 j 1 ( ρ j 1 ) ( ρ j ( d + π 1 j ) π 1 j d + ( ρ j 1 ) ( d + π 1 j 1 ) + 1 ) d + π 1 j ρ j ( d + π 1 j ) 1 ] + m + 1 j [ ( π 1 j 1 ) ( ρ j 1 ) π 1 j + π 1 j ρ j + 1 π 1 j 1 ( π 1 j ρ j 2 π 1 j + ρ j ( π 1 j 1 ) ) 2 π 1 j 2 π 1 j ρ j ( π 1 j 1 ) ( 2 ( π 1 j 1 ) ( ρ j 1 ) + 2 π 1 j ( ρ j 1 ) ) ( 2 π 1 j 1 ) ( π 1 j 1 ) π 1 j + ( ρ j 1 ) ( ( π 1 j 1 ) ( ρ j 1 ) π 1 j + π 1 j ρ j + 1 ) π 1 j ρ j π 1 j + 1 ] ;
I 23 j = m + 2 j [ ( ρ j 1 ) ( 2 d + 2 π 1 j ) + 2 ( ρ j 1 ) ( d + π 1 j 1 ) ρ j 1 ( d + π 1 j ) ( ρ j ( d + π 1 j ) π 1 j d + ( ρ j 1 ) ( d + π 1 j 1 ) + 1 ) d + π 1 j ρ j ( d + π 1 j ) 1 + ( d + π 1 j ) ( d + π 1 j 1 ) ( 2 d + 2 π 1 j ρ j ( d + π 1 j ) ρ j ( d + π 1 j 1 ) ) ( d + π 1 j ) 2 ρ j ( d + π 1 j ) ( d + π 1 j 1 ) ] + m + 1 j [ 2 ( π 1 j 1 ) ( ρ j 1 ) + 2 π 1 j ( ρ j 1 ) ρ j 1 + π 1 j ( ( π 1 j 1 ) ( ρ j 1 ) π 1 j + π 1 j ρ j + 1 ) π 1 j ρ j π 1 j + 1 π 1 j ( π 1 j 1 ) ( π 1 j ρ j 2 π 1 j + ρ j ( π 1 j 1 ) ) π 1 j 2 π 1 j ρ j ( π 1 j 1 ) ] ;
I 33 j = m + 1 j π 1 j 2 ( π 1 j 1 ) π 1 j ρ j π 1 j + 1 2 π 1 j ( π 1 j 1 ) ρ j 1 π 1 j 2 ( π 1 j 1 ) 2 π 1 j 2 π 1 j ρ j ( π 1 j 1 ) + m + 2 j [ ( d + π 1 j ) 2 ( d + π 1 j 1 ) d + π 1 j ρ j ( d + π 1 j ) 1 ( 2 d + 2 π 1 j ) ( d + π 1 j 1 ) ρ j 1 ( d + π 1 j ) 2 ( d + π 1 j 1 ) 2 ( d + π 1 j ) 2 ρ j ( d + π 1 j ) ( d + π 1 j 1 ) ] .

References

  1. Rosner, B. Statistical methods in ophthalmology: An adjustment for the intraclass correlation between eyes. Biometrics 1982, 38, 105–114. [Google Scholar]
  2. Dallal, G.E. Paired Bernoulli trials. Biometrics 1988, 44, 253–257. [Google Scholar]
  3. Donner, A. Statistical methods in ophthalmology: An adjusted chi-square approach. Biometrics 1989, 45, 605–611. [Google Scholar]
  4. Neuhaus, J.M. Statistical methods for longitudinal and clustered designs with binary responses. Stat. Methods Med. Res. 1992, 1, 249–273. [Google Scholar]
  5. Tang, N.; Tang, M.; Qiu, S. Testing the equality of proportions for correlated otolaryngologic data. Comput. Stat. Data Anal. 2008, 52, 3719–3729. [Google Scholar]
  6. Lin, Y.; Zhang, Y.; Tian, G.; Ma, C. Fast QLB algorithm and hypothesis tests in logistic model for ophthalmologic bilateral correlated data. J. Biopharm. Stat. 2021, 31, 91–107. [Google Scholar]
  7. Westgate, P.M. A readily available improvement over method of moments for intra-cluster correlation estimation in the context of cluster randomized trials and fitting a GEE-type marginal model for binary outcomes. Clin. Trials 2019, 16, 41–51. [Google Scholar]
  8. Zou, G.Y.; Donner, A. Extension of the modified Poisson regression model to prospective studies with correlated binary data. Stat. Methods Med. Res. 2013, 22, 661–670. [Google Scholar]
  9. Li, F.; Tong, G. Sample size estimation for modified Poisson analysis of cluster randomized trials with a binary outcome. Stat. Methods Med. Res. 2021, 30, 1288–1305. [Google Scholar]
  10. Pei, Y.; Tang, M.; Guo, J. Testing the equality of two proportions for combined unilateral and bilateral data. Commun.-Stat.—Simul. Comput. 2008, 37, 1515–1529. [Google Scholar]
  11. Wang, K.; Ma, C. Interval estimation for risk ratio with combined unilateral and bilateral correlated data based on the equal dependence model. arXiv 2021, arXiv:2110.15468. [Google Scholar]
  12. Nam, J. Homogeneity score test for the intraclass version of the kappa statistics and sample-size determination in multiple or stratified studies. Biometrics 2003, 59, 1027–1035. [Google Scholar] [CrossRef]
  13. Li, H.; Tang, N. Homogeneity test of rate ratios in stratified matched-pair studies. Biom. J. 2011, 53, 614–627. [Google Scholar]
  14. Tang, N.; Qiu, S. Homogeneity test, sample size determination and interval construction of difference of two proportions in stratified bilateral-sample designs. J. Stat. Plan. Inference 2012, 142, 1243–1251. [Google Scholar]
  15. Tang, N.; Qiu, S.; Tang, M.; Zou, G.; Yu, D. Simultaneous confidence intervals of risk differences in stratified paired designs. J. Biopharm. Stat. 2013, 23, 361–377. [Google Scholar]
  16. Shen, X.; Ma, C.; Yuen, K.C.; Tian, G. Common risk difference test and interval estimation of risk difference for stratified bilateral correlated data. Stat. Methods Med Res. 2019, 28, 2418–2438. [Google Scholar]
  17. Ma, C.; Wang, H. Testing the Equality of Proportions for Combined Unilateral and Bilateral Data Under Equal Intraclass Correlation Model. Stat. Biopharm. Res. 2023, 15, 608–617. [Google Scholar]
  18. Liang, K.; Zeger, S.L. Longitudinal data analysis using generalized linear models. Biometrika 1986, 73, 13–22. [Google Scholar]
  19. Mandel, E.M.; Bluestone, C.D.; Rockette, H.E.; Blatter, M.M.; Reisinger, K.S.; Wucher, F.P.; Harper, J. Duration of effusion after antibiotic treatment for acute otitis media: Comparison of cefaclor and amoxicillin. Pediatr. Infect. Dis. J. 1982, 1, 310–316. [Google Scholar]
  20. Saba, N.J.; Walter, S.D. Efficacy, Safety, and Durability of Brolucizumab: An 8-Month Post-Marketing Surveillance Analysis. Clin. Ophthalmol. 2023, 17, 2791–2802. [Google Scholar] [CrossRef]
Figure 1. Box-plots of simulation results in terms of empirical sizes for 1000 randomly chosen parameters ( ρ , π , d) settings when J = 2 , 4 , 8 and m = n = 25 , 50 , 100 for the three MLE-based tests.
Figure 1. Box-plots of simulation results in terms of empirical sizes for 1000 randomly chosen parameters ( ρ , π , d) settings when J = 2 , 4 , 8 and m = n = 25 , 50 , 100 for the three MLE-based tests.
Mathematics 11 04156 g001
Table 1. Data structure for combined bilateral and unilateral data in the jth stratum.
Table 1. Data structure for combined bilateral and unilateral data in the jth stratum.
Number of Responses in EventGroup (i)Total
12
0 m 01 j m 02 j m 0 + j
1 m 11 j m 12 j m 1 + j
2 m 21 j m 22 j m 2 + j
Total m + 1 j m + 2 j M j
0 n 01 j n 02 j n 0 + j
1 n 11 j n 12 j n 1 + j
Total n + 1 j n + 2 j N j
Table 2. Under J = 2, the empirical type I error rates for testing H 0 : d 1 = d 2 = d corresponding to α = 5 % based on 10,000 replicates.
Table 2. Under J = 2, the empirical type I error rates for testing H 0 : d 1 = d 2 = d corresponding to α = 5 % based on 10,000 replicates.
d = 0d = 0.1d = 0.2
mn ρ π 1 T L T W T S T G T L T W T S T G T L T W T S T G
25250.20.35.515.385.385.305.135.275.074.925.055.104.925.23
0.55.235.375.025.095.845.165.145.2310.085.156.165.29
0.50.35.785.735.655.225.245.395.175.185.265.365.165.00
0.55.175.294.985.425.435.365.195.465.405.085.065.13
0.70.35.205.275.135.275.245.405.144.895.265.385.185.18
0.55.425.475.185.405.655.715.495.075.795.705.565.50
50500.20.35.865.885.795.254.924.994.885.235.025.095.005.04
0.55.655.535.405.045.425.245.185.107.655.205.805.35
0.50.35.155.185.124.895.565.605.525.225.225.295.164.78
0.55.175.175.005.095.235.125.125.075.785.435.385.23
0.70.35.415.445.374.835.005.114.995.175.155.195.115.38
0.55.265.235.095.265.155.094.975.305.625.485.435.11
1001000.20.35.465.465.445.254.864.904.864.605.015.045.004.56
0.55.545.495.434.535.275.235.194.935.855.265.394.86
0.50.35.195.205.155.275.105.145.095.025.255.285.235.37
0.55.275.225.204.994.864.804.785.515.775.395.375.11
0.70.35.185.195.174.784.854.884.855.144.934.974.904.83
0.55.185.205.114.855.275.205.165.204.644.574.554.75
60400.20.35.395.335.315.095.245.305.235.135.375.425.325.05
0.55.094.984.884.725.435.345.225.037.325.375.655.09
0.50.35.335.335.275.115.185.225.155.155.255.335.215.00
0.55.225.175.065.355.174.934.944.866.115.385.684.87
0.70.35.255.225.214.725.265.305.205.125.095.084.995.23
0.55.085.065.055.195.915.425.654.956.505.096.365.09
Table 3. Under J = 4, the empirical type I error rates for testing H 0 : d 1 = = d 4 = d corresponding to α = 5 % based on 10,000 replicates.
Table 3. Under J = 4, the empirical type I error rates for testing H 0 : d 1 = = d 4 = d corresponding to α = 5 % based on 10,000 replicates.
d = 0d = 0.1d = 0.2
mn ρ π 1 T L T W T S T G T L T W T S T G T L T W T S T G
25250.20.35.575.795.355.195.515.965.384.875.556.015.385.16
0.55.245.614.735.076.405.645.224.9811.464.926.075.37
0.50.35.815.935.604.835.145.604.975.495.515.985.435.56
0.55.265.574.815.385.295.554.875.366.056.065.695.53
0.70.35.225.404.965.535.465.965.294.925.546.085.434.87
0.55.155.484.745.235.675.945.355.205.585.745.255.24
50500.20.35.375.365.225.345.335.485.194.605.175.495.104.84
0.55.425.344.954.985.565.435.124.718.165.215.495.06
0.50.35.655.725.575.065.225.465.085.215.305.525.215.72
0.55.455.475.114.765.175.194.934.985.245.214.995.18
0.70.35.465.535.374.515.225.455.144.885.185.435.135.23
0.55.335.345.025.025.425.525.114.735.495.385.134.75
1001000.20.35.105.105.064.764.784.834.734.635.025.064.995.20
0.55.125.094.934.735.175.104.964.905.574.784.915.08
0.50.35.335.385.274.614.975.044.945.095.105.315.065.35
0.54.924.894.744.924.734.724.614.985.515.525.345.03
0.70.34.704.744.644.715.275.345.215.115.245.425.255.11
0.54.884.874.735.245.125.204.994.745.205.125.075.30
60400.20.34.985.054.865.195.655.955.575.055.515.825.395.05
0.55.515.365.015.335.425.284.995.207.125.125.325.02
0.50.35.465.555.364.835.265.545.165.025.335.655.274.93
0.55.375.384.985.485.575.535.275.086.515.766.224.81
0.70.35.445.535.374.725.675.945.604.645.275.585.305.05
0.55.355.455.135.345.635.555.365.167.205.336.955.27
Table 4. Under J = 8, the empirical type I error rates for testing H 0 : d 1 = = d 8 = d corresponding to α = 5 % based on 10,000 replicates.
Table 4. Under J = 8, the empirical type I error rates for testing H 0 : d 1 = = d 8 = d corresponding to α = 5 % based on 10,000 replicates.
d = 0d = 0.1d = 0.2
mn ρ π 1 T L T W T S T G T L T W T S T G T L T W T S T G
25250.20.35.515.885.125.645.015.884.715.005.466.455.405.10
0.55.455.624.475.236.795.925.135.2113.674.525.875.02
0.50.36.006.335.704.995.526.225.215.385.806.635.524.85
0.55.796.075.055.105.826.095.045.486.116.115.485.31
0.70.35.966.365.635.075.886.775.715.105.516.425.305.08
0.55.535.834.855.375.706.135.175.095.896.385.325.25
50500.20.35.886.085.574.916.136.565.985.135.055.514.874.75
0.55.074.884.304.835.455.314.754.999.875.285.685.10
0.50.35.375.585.245.205.045.464.925.415.085.694.984.85
0.55.305.344.645.435.495.384.994.945.365.264.844.98
0.70.35.295.575.085.125.155.505.055.035.465.895.475.46
0.55.395.244.855.035.325.394.785.215.595.494.995.00
1001000.20.35.355.485.244.985.025.154.934.865.315.625.285.07
0.55.545.465.145.195.085.094.795.375.714.914.885.34
0.50.35.365.455.224.885.155.345.144.805.115.255.034.83
0.54.954.824.584.815.385.285.075.074.934.904.745.38
0.70.35.525.655.414.775.085.325.004.725.335.555.335.17
0.55.575.535.234.974.794.824.614.965.195.224.904.81
60400.20.35.475.635.255.065.445.775.375.155.205.705.054.92
0.55.595.234.654.985.295.164.594.927.934.935.265.08
0.50.35.345.575.215.005.295.705.164.835.395.765.275.08
0.55.585.444.875.315.545.434.995.245.785.335.225.38
0.70.35.655.855.505.225.135.534.995.605.285.765.224.73
0.55.165.404.965.045.896.085.645.137.415.877.085.12
Table 5. Parameter settings of the alternative hypothesis for computing powers.
Table 5. Parameter settings of the alternative hypothesis for computing powers.
Number of Strata
H a J = 2 J = 4 J = 8
( d 1 , d 2 ) ( d 1 , d 2 , d 3 , d 4 ) ( d 1 , d 2 , d 3 , d 4 , d 5 , d 6 , d 7 , d 8 )
H a 1 (0, 0.1)(0, 0.1, 0, 0.1)(0, 0.1, 0, 0.1, 0, 0.1, 0, 0.1)
H a 2 (0, 0.15)(0.05, 0.1, 0.15, 0.2)(0.05, 0.1, 0.15, 0.2, 0.05, 0.1, 0.15, 0.2)
H a 3 (0, 0.2)(0, 0.15, 0, 0.15)(0, 0.15, 0, 0.15, 0, 0.15, 0, 0.15)
Table 6. Simulation results of the empirical powers for testing H a under J = 2.
Table 6. Simulation results of the empirical powers for testing H a under J = 2.
d = 0, 0.1d = 0, 0.15d = 0, 0.2
mn ρ π 1 T L T W T S T G T L T W T S T G T L T W T S T G
25250.20.315.6715.7515.4214.4426.6726.9426.4125.1342.3742.6042.0639.79
0.513.5213.6913.2912.5725.1125.1424.5822.9640.3639.9539.3436.77
0.50.313.0913.1812.9712.3123.3823.6423.1422.4537.9038.3637.6336.77
0.512.4012.6312.2311.9822.3722.6022.1421.6835.2135.5135.0234.18
0.70.313.1013.2112.9612.6122.0022.2921.7521.6435.3035.6635.0834.62
0.511.7512.0911.6311.4220.8921.2920.7920.4532.2132.6531.9831.85
50500.20.323.5123.6023.3722.4545.1645.3144.9942.6168.7568.9568.6566.02
0.521.6121.7721.4719.8741.4141.6641.2039.0765.8065.9165.5562.86
0.50.322.3422.4122.1822.0541.9042.0741.8441.1563.9164.1663.7562.71
0.519.4019.6119.3218.9438.2138.5838.1637.1259.9560.2359.8258.49
0.70.320.2620.3920.1119.8638.9339.0638.8738.2160.7160.9760.5359.92
0.518.3718.6318.3018.1036.0836.3235.9635.6256.8757.1556.8056.15
1001000.20.341.6041.6641.5439.4674.4774.5174.4171.5893.6393.6793.5892.00
0.536.7336.8836.6434.2770.0070.0569.9266.8491.2791.3191.2589.44
0.50.338.7238.8038.6337.4269.2769.3269.2267.9390.1790.2290.1289.66
0.533.9534.1133.8933.3564.0264.1264.0163.2787.4187.4887.4686.86
0.70.335.2835.3235.2134.8165.8766.0165.7765.0588.5288.6088.4888.16
0.531.5031.5831.4731.4160.7560.9060.7360.5285.4685.5785.4185.11
60400.20.324.5324.6024.4023.1648.6848.7548.4246.3171.7571.8671.6968.42
0.522.6422.8922.5121.2944.8345.0044.7042.0767.1967.3266.9163.79
0.50.321.5521.6321.4720.8841.9942.1841.8640.8164.8565.0964.7163.58
0.520.7020.8120.6420.2838.5438.7938.5137.8660.6960.7660.5459.38
0.70.321.0921.1920.9920.5239.4139.6339.2938.7961.1761.4161.0160.14
0.518.9518.7818.7618.4035.7335.6035.5635.0258.3558.0558.1157.24
Table 7. Simulation results of the empirical powers for testing H a under J = 4.
Table 7. Simulation results of the empirical powers for testing H a under J = 4.
d = (0, 0.1, 0, 0.1)d = (0.05, 0.1, 0.15, 0.2)d = (0, 0.15, 0, 0.15)
mn ρ π 1 T L T W T S T G T L T W T S T G T L T W T S T G
25250.20.316.4216.8215.9015.5619.4120.6218.7418.7232.0133.2531.1431.09
0.515.2016.0114.5914.4920.2019.3918.4517.4830.0330.6928.9128.96
0.50.315.0815.5614.6314.2017.3318.5016.9216.7328.9430.1528.3427.90
0.513.6114.5013.2813.1616.5117.2716.1315.7725.4026.4925.0124.55
0.70.314.2514.9913.8413.6916.9018.1116.4316.7126.4027.4125.9325.73
0.512.7713.6012.5212.5115.7216.4615.2615.1124.6326.1624.1224.13
50500.20.329.2729.5928.9428.3333.2333.9332.9032.6259.4459.9758.9658.40
0.525.6626.2925.2225.0131.7631.9431.0430.8052.9953.7352.6752.77
0.50.325.8526.3525.5225.1430.5531.2030.2030.2052.6653.2452.3251.74
0.523.2223.9023.0022.6728.3128.8928.1027.7947.1248.1346.8846.65
0.70.324.0624.4223.8023.6227.4728.2627.2527.1649.8350.4449.5549.45
0.521.5622.1021.4121.3626.7327.4026.6226.3844.7645.4344.5444.20
1001000.20.352.9253.0752.6952.4061.4561.8361.2961.2888.8889.0488.7788.76
0.547.2247.6847.1246.9659.7360.0359.4959.5584.7884.9784.6584.64
0.50.347.1247.3747.0046.5255.6956.0155.5555.1483.5683.7083.4583.14
0.541.7342.2041.6141.4951.6252.0151.5551.5879.2979.5279.2479.33
0.70.343.7744.0243.6043.3452.0152.4451.8951.8181.0681.2680.9880.66
0.538.6839.0638.5838.5549.9950.4749.9749.8374.8975.1674.7474.80
60400.20.329.3729.8029.1328.6935.8036.4735.5335.3360.4460.9160.1359.94
0.526.5627.3126.2126.2933.2633.7232.6232.2655.8756.5855.4055.33
0.50.326.2026.6425.9025.5730.9231.8630.6730.7253.9654.6253.7353.27
0.522.8323.3122.6422.4428.3628.8028.1427.7348.8649.4848.7848.26
0.70.323.7624.3723.5523.2528.1728.9827.9327.8949.8550.3149.5348.94
0.521.6922.2321.4021.3127.8427.5927.4226.7345.3145.6144.8644.54
Table 8. Simulation results of the empirical powers for testing H a under J = 8.
Table 8. Simulation results of the empirical powers for testing H a under J = 8.
d = (0, 0.1, 0, 0.1,d = (0.05, 0.1, 0.15, 0.2,d = (0, 0.15, 0, 0.15,
0, 0.1, 0, 0.1)0.05, 0.1, 0.15, 0.2)0, 0.15, 0, 0.15)
mn ρ π 1 T L T W T S T G T L T W T S T G T L T W T S T G
25250.20.320.6721.9119.6019.5924.7627.0023.8824.1544.5546.4243.3743.43
0.519.6321.2918.8718.3825.3924.8822.9221.6040.7642.5539.1538.63
0.50.318.6420.0317.8717.5622.3624.3621.6221.4240.2642.4339.2538.95
0.516.7118.4916.1716.2320.1022.0219.4319.0335.3337.7734.3634.38
0.70.318.0219.4217.2217.1620.7822.7420.0620.0237.1938.9536.1535.77
0.515.4717.3314.8614.7819.8921.4819.3719.0032.3134.9131.6431.42
50500.20.339.9540.7539.3938.8548.7349.8947.9747.9478.2979.1077.8477.86
0.533.9335.2333.3332.9945.2645.9244.4143.5373.7574.4673.2372.95
0.50.334.8835.6434.4634.0242.1443.4941.5641.5471.1071.7170.6070.15
0.530.2031.5029.8829.5939.3740.7639.0538.7164.8265.9164.5264.34
0.70.332.7133.5432.1531.8438.8140.1138.4738.2768.0568.9467.5567.08
0.528.4429.4828.0127.9836.3637.5136.0235.6261.8762.9061.6461.31
1001000.20.373.1473.4272.9172.5481.0881.4880.9680.8698.6498.7098.6198.61
0.565.7366.4065.4665.2579.0479.4178.7578.6297.4997.5797.4797.48
0.50.365.2565.6365.0764.2575.2975.9275.0374.8896.8896.9496.8496.81
0.558.4258.9458.1557.8972.4772.9972.5072.3894.8094.9794.8094.65
0.70.362.1762.4962.0161.3272.3372.9372.1371.9695.7295.8195.6795.52
0.554.7955.4554.6854.5967.5568.0567.3566.9393.0193.2092.9993.01
60400.20.341.6042.4540.9940.7550.1351.2849.5649.0980.6981.2080.2780.01
0.537.2938.4736.7936.4446.7347.5845.9545.0974.6775.6674.1674.20
0.50.335.6436.4435.1634.8242.0843.4741.6041.4873.2174.0372.8672.53
0.531.6733.0531.4030.9739.9740.9439.5439.1067.3868.4067.0166.88
0.70.333.2934.1832.7932.2939.2140.5938.7738.4269.5970.3669.2968.70
0.529.5530.6029.1328.9137.5037.7037.0736.0662.5563.3162.0161.53
Table 9. The number of children whose ears have been cured across age strata (Group1: Cefaclor; Group2: Amoxicillin).
Table 9. The number of children whose ears have been cured across age strata (Group1: Cefaclor; Group2: Amoxicillin).
Age Groups
<2 Years2–5 Years6 Years
Number of OME-Free Ears121212
Bilateral08116301
1226110
28210536
Unilateral03224141111
191072287
Table 10. MLEs of parameters based on the data with 3 strata.
Table 10. MLEs of parameters based on the data with 3 strata.
Constrained MLEUnconstrained MLE
Age Group       ρ ^ π ^ 1 d ^       ρ ^ π ^ 1 d ^
<2 years0.76700.49980.06440.75250.5929−0.1315
2–5 years0.55180.46590.06440.55990.40110.2119
≥ 6 years0.74070.48260.06440.74230.49740.0358
Table 11. Statistics and p-values of different proposed methods.
Table 11. Statistics and p-values of different proposed methods.
T L T W T S T G
Statistics4.71324.82494.66595.0600
p-value0.09470.08960.09700.0796
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Zhang, X.; Ma, C. Testing the Homogeneity of Differences between Two Proportions for Stratified Bilateral and Unilateral Data across Strata. Mathematics 2023, 11, 4156. https://doi.org/10.3390/math11194156

AMA Style

Zhang X, Ma C. Testing the Homogeneity of Differences between Two Proportions for Stratified Bilateral and Unilateral Data across Strata. Mathematics. 2023; 11(19):4156. https://doi.org/10.3390/math11194156

Chicago/Turabian Style

Zhang, Xueqing, and Changxing Ma. 2023. "Testing the Homogeneity of Differences between Two Proportions for Stratified Bilateral and Unilateral Data across Strata" Mathematics 11, no. 19: 4156. https://doi.org/10.3390/math11194156

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop