Next Article in Journal
Decentralized Energy Swapping for Sustainable Wireless Sensor Networks Using Blockchain Technology
Previous Article in Journal
Inference and Optimal Design on Partially Accelerated Life Tests for the Power Half-Logistic Distribution Under Adaptive Type II Progressive Censoring
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Generalized Inference for Mediation Analysis

by
Xiaoyang Ma
1,
David P. MacKinnon
2,
Thomas Mathew
3,
Brian Agan
4,5,
George Luta
1,* and
Ionut Bebu
6
1
Department of Biostatistics, Bioinformatics, and Biomathematics, Georgetown University, Washington, DC 20007, USA
2
Department of Psychology, Arizona State University, Tempe, AZ 85281, USA
3
Department of Mathematics and Statistics, University of Maryland Baltimore Country, Baltimore, MD 21202, USA
4
Infectious Disease Clinical Research Program, Department of Preventive Medicine and Biostatistics, Uniformed Services University of the Health Sciences, Bethesda, MD 20814, USA
5
Henry M. Jackson Foundation for the Advancement of Military Medicine, Inc., Bethesda, MD 20817, USA
6
Department of Biostatistics and Bioinformatics, George Washington University, Rockville, MD 20852, USA
*
Author to whom correspondence should be addressed.
Mathematics 2025, 13(3), 396; https://doi.org/10.3390/math13030396
Submission received: 10 December 2024 / Revised: 15 January 2025 / Accepted: 22 January 2025 / Published: 25 January 2025
(This article belongs to the Special Issue Recent Advances in Statistical Analysis and Applications)

Abstract

:
Mediation analysis has a long history and its use in applied research has been increasing. Studying mediators can help improve our understanding of mechanisms relating independent and dependent variables. The objective of this paper is to compare different methods to construct confidence intervals for the mediation effect for the one-mediator and two-mediator models. For the one-mediator model, we evaluated the generalized pivotal quantity (GPQ) method, the PRODCLIN method, bootstrap methods, the Sobel method, the Goodman method, and the Monte Carlo method. For the two-mediator model, we evaluated a new GPQ method, bootstrap methods, the Sobel method, the Goodman method, and the Monte Carlo method. Simulation studies compared the performance of the methods for sample sizes of 50, 100, and 200. The results of the simulation studies indicated that, for the simple traditional mediation models under consideration, the GPQ method performed well when compared with the other methods. Future work should consider the extension of the GPQ method to causal mediation analysis involving more complex models with multiple mediators.

1. Introduction

The use of mediation analysis in many research fields has increased because it provides a method to investigate the process by which an independent variable affects a dependent variable, see [1,2]. For example, the use of mediation analysis for treatment studies provides a way to investigate how a treatment achieved its effects. As mediation analysis is increasingly applied in research studies, the construction of confidence intervals for quantities of interest (e.g., mediated effects) with good coverage properties is becoming very important. Although there are many methods available to construct confidence intervals for the simplest case of a single-mediator model, their extensions to more complex cases involving multiple mediators are either difficult (e.g., the PRODCLIN method) or computationally intensive (e.g., bootstrap methods). Statistical methods that can be easily generalized to more complex mediation models while having reduced computational requirements are needed. The Monte Carlo method is simpler to use than other existing methods because it only requires the availability of estimates for the parameters of interest together with the corresponding estimated variance-covariance matrix. The Monte Carlo method for mediation was introduced in [3] and was called the empirical M-method. This Monte Carlo method, which involves simulations from the assumed joint multivariate normal distribution for the vector of parameter estimators, has been shown in simulation studies to perform better than the traditional methods that are based on the multivariate delta method [4]. The relatively good performance of the Monte Carlo method, together with the fact that it does not require the original data (only summary statistics), makes it a very competitive method for mediation analysis; see [5].
In this paper, we use the concept of a generalized pivotal quantity (GPQ) to construct confidence intervals for the mediated effect for the single-mediator model and the multiple-mediator model. While having the advantages of the Monte Carlo method mentioned above, and being comparable to it with respect to computational demands, the GPQ approach has the additional advantage of not being an ad hoc approach, but rather being based on the theory of generalized inference (see [6]), with well-established asymptotic properties for the resulting generalized confidence intervals (see [7]).
There is only one previous publication (that we are aware of) where generalized inference methods have been applied to construct confidence intervals for the mediated effect for the simplest case involving one-mediator and linear regression models [8]. A simulation study similar to the one from [3] was performed to compare the performance of the GPQ method with other methods: Sobel, Goodman, PRODCLIN, percentile bootstrap, and the bias-corrected and accelerated (BCa) bootstrap. The Sobel and Goodman methods are asymptotic approaches; the first one is based on the use of the multivariate delta method, while the second is based on the use of the second-order Taylor series and exact variance. Based on the results of their simulations, the authors concluded that the GPQ method and the percentile bootstrap method outperform the other methods for small and medium sample sizes, with the GPQ method also being more stable for small sample sizes. The proposed new GPQ method for the multiple-mediator models is the main contribution of this paper.
A brief description of the GPQ approach is provided in Appendix A of this paper. Since its introduction in [6], the GPQ methodology has been successfully used by a growing number of researchers to obtain improved confidence intervals for several parametric problems for which traditional approaches are either unavailable or provide unsatisfactory results [9]. In particular, the GPQ methodology has been applied for constructing confidence intervals for ratios of parameters, see [10], where the GPQ approach has been used successfully for the situation involving a ratio of regression parameters. The asymptotic accuracy of the GPQ methodology has been established in [7]; however, satisfactory small sample performance has been noted in many situations, for example in [10]. An added appeal of the GPQ methodology is that if a vector GPQ is available for a vector parameter, then a GPQ can be constructed for any function of the vector parameter by simply using the corresponding function of the vector GPQ. We will take advantage of this very useful property in this paper. For a detailed discussion of the GPQ methodology, along with numerous applications, we refer to the books by Weerahandi [9] and Meeker [11].
This paper is organized as follows. After briefly describing the single-mediator model, we use the GPQ for the linear regression model to construct a GPQ for the single-mediator model. We then describe an extension of the GPQ approach to the situation involving two mediators. We assess the performance of the GPQ-based confidence interval for the mediated effect for the single-mediator model using a simulation study similar to those employed in [3,8]. We then illustrate the application of the new GPQ method using a hypothetical data set, and assess the performance of the GPQ-based confidence interval for the mediated effect for the two-mediator model using a simulation study. We end with a discussion where we propose other extensions of the GPQ approach.

2. Materials and Methods

2.1. Single Mediator

2.1.1. The Model

Consider an independent variable Z, a dependent variable Y, and a mediator M of the effect of Z on Y. Then, one can consider the following two regression models (where for simplicity we drop the index used to identify each observation):
M = δ 1 + α Z + ϵ 1
Y = δ 2 + τ Z + β M + ϵ 2 ,
where ϵ i N ( 0 , σ i 2 ) , i = 1 , 2 are independent error terms. The mediated (indirect) effect of Z mediated through M is given by α β , while the total effect is τ + α β .

2.1.2. A GPQ for the Linear Regression Model

The following construction follows the approach described in [10].
Consider the linear regression model
y = X γ + ϵ ,
where y is the n × 1 dependent variable vector, X is an n × p design matrix (assumed of full column rank), and ϵ N ( 0 , σ 2 I p ) . Using the usual estimators
γ ^ = ( X X ) 1 X y , σ ^ 2 = ( y X γ ^ ) ( y X γ ^ ) / ( n p ) ,
one has
Z = ( X X ) 1 / 2 σ ( γ ^ γ ) N ( 0 , I p ) U 2 = ( n p ) σ ^ 2 σ 2 χ ( n p ) 2 T = n p U Z t ( n p ) ,
where χ ( n p ) 2 denotes the chi-square distribution with n p degrees of freedom (df), and, Z and U 2 are independent. Let γ ^ obs and σ ^ obs 2 denote the observed values of γ ^ and σ ^ 2 , respectively, and define
T γ = γ ^ obs σ ^ obs ( X X ) 1 / 2 σ σ ^ ( X X ) 1 / 2 σ ( γ ^ γ ) = γ ^ obs σ ^ obs ( X X ) 1 / 2 n p U Z = γ ^ obs σ ^ obs ( X X ) 1 / 2 T ,
T σ 2 = ( n p ) · σ ^ o b s 2 U 2 ,
where Z, U 2 , and T are defined in (4); ( X X ) 1 / 2 denotes the positive definite square root of X X ; and ( X X ) 1 / 2 is the inverse of ( X X ) 1 / 2 . T γ is a GPQ for the entire vector γ , and therefore g ( T γ ) provides a GPQ for any function g ( γ ) , while T σ 2 is a GPQ for σ 2 .

2.1.3. A GPQ for the Mediated Effect for the Single-Mediator Model

Using (5), one can construct GPQs for the two regression models in (1) and (2). More precisely, if T γ 1 denotes the GPQ for the parameters in (1), then a GPQ for α is given by T γ 1 ( 2 ) , the second component of T γ 1 . Similarly, GPQs for τ and β are given by T γ 2 ( 2 ) and T γ 2 ( 3 ) , the second and the third components of T γ 2 , a GPQ for the parameters in (2).
Then, the GPQs for the mediated (indirect) effect and the total effect are given by
T α β = T γ 1 ( 2 ) T γ 2 ( 3 ) , T τ + α β = T γ 2 ( 2 ) + T γ 1 ( 2 ) T γ 2 ( 3 ) .
The algorithm used to construct the generalized confidence intervals is as follows:
(i)
Generate a random sample T 1 , , T m in (5) from the t distribution with n p df for each GPQ.
(ii)
Construct T α β 1 , , T α β m .
(iii)
Choose the ϕ th quantile and 1 ϕ th quantile of the distribution of T α β to form the lower and upper bounds of 100 ( 1 2 ϕ ) % .
A similar algorithm can be used to construct a GPQ-based confidence interval for the mediation proportion, the ratio between the mediated effect and the total effect.

2.2. Extension to Multiple Mediators

The extension described in this paper is the use of the GPQ method for situations involving multiple mediators. For simplicity, we only consider the case of two mediators. The linear regression models are
M 1 = δ 1 + α 1 Z + ϵ 1
M 2 = δ 2 + α 2 Z + ϵ 2
Y = δ 3 + τ Z + β 1 M 1 + β 2 M 2 + ϵ 3
where ϵ i N ( 0 , σ i 2 ) , i = 1 , , 3 are independent error terms.
Then, the effect of Z on Y mediated through M i is α i β i , i = 1 , 2 ; the (total) effect mediated through the two mediators is α 1 β 1 + α 2 β 2 ; and the difference between the two mediated effects is α 1 β 1 α 2 β 2 . The GPQs for all these quantities of interest can be obtained similarly to the one-mediator model case, by first constructing the GPQs for the regression parameters from models (8) to (10).
Using (5), if T γ 1 denotes the GPQ for (8), then a GPQ for α 1 is given by T γ 1 ( 2 ) , the second component of T γ 1 . Similarly, a GPQ for α 2 from (9) is given by T γ 2 ( 2 ) ; GPQs for τ , β 1 , and β 2 are given by T γ 3 ( 2 ) , T γ 3 ( 3 ) , and T γ 3 ( 4 ) , the second, third, and fourth components, respectively, of T γ 3 , the GPQ from (10).
Then, the GPQs for the mediated effect and the total effect are given by
T α 1 β 1 + α 2 β 2 = T γ 1 ( 2 ) T γ 3 ( 3 ) + T γ 2 ( 2 ) T γ 3 ( 4 ) , T τ + α 1 β 1 + α 2 β 2 = T γ 4 ( 2 ) + T γ 1 ( 2 ) T γ 3 ( 3 ) + T γ 2 ( 2 ) T γ 3 ( 4 ) .

3. Results

3.1. Illustrative Example

As an illustrative example for the case of two mediators, we use data from a study of how two different ways of processing a list of words affect subsequent recall of the words (see [12] for more details on this study). The data set contains 44 observations. The independent variable Z is a random assignment of participants to either make images of words or to repeat the words over and over. The two mediators were participants’ self-reported imagery ( M 1 ) and self-reported repetition of the words ( M 2 ) when hearing the word list. The dependent variable, Y, was the number of words recalled. The Z variable was binary and M 1 , M 2 , and Y were treated as continuous variables for the sake of illustration. The research hypothesis is that the effect on word recall would be mediated through the use of imagery when memorizing the word list. The point estimates for the mediated effects are 2.447 for imagery and −0.875 for repetition, respectively. The 95% CI for the effect mediated through imagery is (0.655, 4.590), while the 95% CI for the effect mediated through repetition is (−2.775, 0.896). The 95% CI for the effect mediated through the two mediators is (−1.019, 4.298), while the 95% CI for the difference between the two mediated effects, namely α 1 β 1 α 2 β 2 , is (0.764, 6.115). Figure 1 is a forest plot that shows the 95% CI for the effect mediated through the two mediators by using five different methods. We note that the point estimates are very close, and the GPQ-based 95% CI is the widest CI among the five CIs.

3.2. Simulation Study

3.2.1. Single-Mediator Model

To evaluate the performance of the GPQ method and compare it with the other available methods, we performed a simulation study where we simulated data using the two regression models from (1) and (2). We used 16 combinations of values for α and β as performed before in [3,8] ( α and β take the values: 0, 0.14, 0.39, and 0.59). The study involved 5000 simulated data sets; for each simulated data set, the bootstrap methods used 5000 resamples, and the GPQ and Monte Carlo methods used 5000 simulations each. The sizes of the simulated data sets were 50, 100, and 200, respectively. The independent variable Z was simulated as a standard normal. The intercepts δ 1 and δ 2 from (1) and (2) were set as 0 and the coefficient τ from (2) was set as 0.5. We considered a total of 16 scenarios and compared the performance of the GPQ, Sobel, PRODCLIN, Monte Carlo, Goodman, percentile bootstrap, and BCa bootstrap methods. The results of the simulations, specifically the simulated coverage probability and average interval length for 95% confidence intervals, are presented in Appendix A, Table A1, Table A2 and Table A3. We used the R package RMediation (see [13]) for the Goodman and PRODCLIN methods.
We have considered an empirical coverage of 94.5% or more to be acceptable from a practical point of view. Table 1 provides the percentage of simulated coverage probabilities that are above the 94.5% level across the 16 scenarios. Our simulation results are similar to those from [8] in the sense that the GPQ method had more instances than the other methods when the simulated coverage was above the 94.5% level, especially for the small sample sizes. For the larger sample size, n = 200 , the Sobel and Goodman methods had larger percentages than the GPQ method. However, the lowest coverage probabilities for the Sobel and Goodman methods were 90.64% and 93.18%, respectively, for the situation when both parameters were 0.14, while the lowest coverage probability for the GPQ method was 94.34%. This may indicate that the GPQ method is more stable than the other methods. As shown before, and also as seen in our simulation study, the GPQ method tended to have the largest average interval length.

3.2.2. Multiple Mediators Model

To evaluate the performance of the GPQ method for the multiple-mediator model and compare it with other methods, we performed a simulation study where we simulated data using the three regression models from (8) to (10). We used combinations of values for α 1 , α 2 , β 1 , and β 2 as performed in the previous section. Therefore, there are 256 scenarios in total. The study involved 5000 simulated data sets, the bootstrap methods used 5000 resamples, and the GPQ method involved 5000 simulations. The sample sizes of the simulated data sets were 50, 100, and 200, respectively. The independent variable was simulated as a standard normal. The intercepts δ 1 , δ 2 , and δ 3 from (8) to (10) were set as 0 and the coefficient τ from (2) was set as 0.5. We compared the performance of the GPQ, Sobel, Monte Carlo, percentile bootstrap, and BCa bootstrap methods. The results for 16 scenarios, selected out of the total of 256 scenarios, including the simulated coverage probability and average interval length for 95% confidence intervals, are presented in Appendix A, Table A4, Table A5 and Table A6.
Table 2 provides a summary of the results for the 256 scenarios regarding the percentage of simulated coverage probabilities that are above the 94.5% level. The GPQ method performed better than the other methods for the case of two mediators. We can see that, regardless of the sample size, the GPQ method had the largest percentages of scenarios where the simulated coverage probability was above 94.5 % , while the other methods had much smaller percentages.
Table 3 provides a summary of the results across the 256 scenarios, separately for each sample size, regarding the number of scenarios when the specific method had the largest average interval length of the 95% confidence intervals. When the sample size is 50, the number of scenarios for the GPQ method is higher than that for the BCa bootstrap method, while when the sample sizes are 100 and 200, the BCa bootstrap method has the highest number of scenarios. As shown in Table 2, the GPQ method provides good coverage probabilities relative to the nominal level, being competitive with the other approaches for all simulation setups. The cost of that performance is a small increase in average interval length, as seen from the results from Appendix A, Table A4, Table A5 and Table A6.

4. Discussion

We have described GPQ methods to construct confidence intervals for mediation effects for the case of a single-mediator model, and also for the more complex case of two mediators. While the GPQ method for the simple case is not novel, being proposed before in [8] albeit using a different mathematical presentation, the extension to the multiple-mediator model is new. Our simulation study has provided results consistent with those from [8], showing that for the single-mediator model, the GPQ-based confidence intervals have better coverage performance than the alternative methods, although at the price of being somewhat conservative. Based on the results of our simulation studies, we recommend the use of the GPQ method for both the one-mediator and the two-mediator models because it is simpler to use and offers improved coverage.
The proposed new GPQ method for the two-mediator model is an important step in extending the use of the GPQ methods for mediation analysis. Our current work is somewhat limited by focusing solely on a simple two-mediator model. This is a direct consequence of our goal to provide a comprehensive evaluation, involving 256 simulation scenarios, of a simple (but practically important) model involving two mediators (which are allowed to be correlated) as a proof-of-concept study.
Future work will include the extension of the GPQ approach to causal mediation analysis for more complex models with multiple mediators involving sequential mediation, interactions between exposure and mediators, interactions between mediators, a multilevel structure, and the evaluation of interventional mediation effects (see [14,15]), and to meta-analysis of mediated effects.

Author Contributions

Conceptualization, D.P.M., G.L. and I.B.; methodology, X.M., D.P.M., T.M., G.L. and I.B.; software, X.M. and I.B.; formal analysis, X.M.; investigation, D.P.M. and B.A.; writing—original draft preparation, X.M., D.P.M. and I.B.; writing—review and editing, D.P.M., T.M., B.A., G.L. and I.B.; visualization, X.M. All authors have read and agreed to the published version of the manuscript.

Funding

This project has been funded in part by the National Institute on Drug Abuse, National Institutes of Health, under award R37DA09757, the National Institute of Allergy and Infectious Diseases, National Institutes of Health, under Inter-Agency Agreement Y1-AI-5072, and the Defense Health Program, U.S. Department of Defense, under award HU0001190002.

Data Availability Statement

A folder containing the R Code and the data set used for this study can be downloaded as a Zip folder from https://shorturl.at/8QYF0, accessed on 20 January 2025.

Conflicts of Interest

The views expressed are those of the authors and do not reflect the official views of the Uniformed Services University of the Health Sciences, the Henry M. Jackson Foundation for the Advancement of Military Medicine, Inc., the National Institutes of Health, the Department of Health and Human Services, the Department of Defense, the Defense Health Agency, the Departments of the Army, Navy or Air Force, or the U.S. Government. Mention of trade names, commercial products, or organizations does not imply endorsement by the U.S. Government.

Abbreviations

The following abbreviations are used in this manuscript:
BCabias-corrected and accelerated
CIconfidence interval
dfdegrees of freedom
GPQgeneralized pivotal quantity

Appendix A

Appendix A.1. Generalized Pivotal Quantity

Let X = ( X 1 , , X n ) be a random sample from a distribution defined based on a parameter of interest θ , and a (possibly vector) nuisance parameter δ , and let x denote the observed value of X. A generalized pivotal quantity (GPQ) for the parameter of interest θ is a function of X, x, θ , and δ , denoted by T ( X , x , θ , δ ) , such that (1) given x, the distribution of T ( X , x , θ , δ ) is free of unknown parameters and (2) T ( x , x , θ , δ ) , the observed value of T ( X , x , θ , δ ) , is free of the parameter δ . The GPQ is usually defined such that T ( x , x , θ , δ ) = θ .
Confidence intervals for the parameter of interest θ are obtained using the percentiles of T ( X , x , θ , δ ) . In light of property (1), one can use a Monte Carlo approach when they are not available in closed form. See [6,9] for more details on generalized inference. The correct asymptotic coverage probability of the generalized confidence intervals was established in [7].
An important property of a GPQ is that if T ( X , x , θ , δ ) is a GPQ for θ , then a GPQ for h ( θ ) is given by h ( T ( X , x , θ , δ ) ) for any scalar-valued function h ( θ ) of θ .
The asymptotic performance of generalized confidence intervals was investigated by [16]; they have shown that the intervals have guaranteed coverage probability asymptotically. Furthermore, they have also noted the relation to fiducial inference. Numerous applications and a detailed review are available in [7].

Appendix A.2. Simulation Results

Table A1, Table A2 and Table A3 show the simulation results for the single-mediator model. Table A4, Table A5 and Table A6 show selected simulation results for the two-mediator model.
Table A1. Simulated results using 5000 simulations for the 95% confidence intervals for the mediated effect for the single-mediator model (n = 50).
Table A1. Simulated results using 5000 simulations for the 95% confidence intervals for the mediated effect for the single-mediator model (n = 50).
Coverage ProbabilityAverage Interval Length
α β GPQSobelProdMCGoodmanPercentileBCaGPQSobelProdMCGoodmanPercentileBCa
0.000.000.99781.00000.99780.99741.00000.99780.99160.14920.10290.14570.14560.13640.14760.1595
0.000.140.99660.99980.99520.99520.99980.99260.97800.16790.12650.16350.16340.15490.16550.1776
0.000.390.97360.99040.96620.96700.99140.95940.92600.26560.23660.25970.25950.25300.26130.2708
0.000.590.95500.97620.94820.94920.97820.94180.90880.36780.34230.36520.36510.35910.36770.3733
0.140.000.99560.99880.99480.99460.99920.99220.97660.16920.12780.16420.16420.15580.16610.1781
0.140.140.99280.95460.98920.99020.99880.98560.94040.18540.14840.17930.17920.17210.18080.1942
0.140.390.95300.92720.95600.95500.97520.94840.91620.27590.24990.27000.26990.26560.27170.2835
0.140.590.95340.95400.94480.94520.95820.93800.92020.37840.35500.37040.37020.36610.37400.3819
0.390.000.96540.98760.96700.96740.99160.96400.92700.26690.23790.26110.26100.25420.26330.2728
0.390.140.95580.94600.94840.94840.97080.94140.91600.27880.25280.27140.27130.26690.27360.2858
0.390.390.95080.91920.93640.93740.92260.93120.93980.34440.32660.33720.33710.33830.34080.3568
0.390.590.95280.93320.95160.95060.94180.94200.95000.42810.41100.42120.42100.42230.42430.4379
0.590.000.94880.96860.94800.94840.97360.94140.91220.37300.34730.36710.36690.36080.37020.3759
0.590.140.95600.95360.94480.94380.96000.93680.91580.37900.35550.37270.37260.36830.37570.3831
0.590.390.95080.93480.94360.94420.93880.93940.94280.43370.41610.42200.42180.42300.42570.4388
0.590.590.95120.93140.94260.94320.93860.93320.93880.50200.48540.49190.49170.49400.49540.5095
Table A2. Simulated results using 5000 simulations for the 95% confidence intervals for the mediated effect for the single-mediator model (n = 100).
Table A2. Simulated results using 5000 simulations for the 95% confidence intervals for the mediated effect for the single-mediator model (n = 100).
Coverage ProbabilityAverage Interval Length
α β GPQSobelProdMCGoodmanPercentileBCaGPQSobelProdMCGoodmanPercentileBCa
0.000.000.99881.00000.99860.99881.00000.99820.99260.07170.05020.07080.07080.06630.07140.0773
0.000.140.99040.99820.98980.99020.99840.98740.97040.08920.07200.08810.08800.08420.08900.0948
0.000.390.95080.97560.94900.94900.97880.94740.91740.17020.15990.16830.16820.16520.16920.1717
0.000.590.94280.95420.94100.94080.95800.93900.92160.24530.23650.24260.24250.24000.24390.2452
0.140.000.99140.99860.99080.99040.99900.98940.96960.09000.07270.08890.08880.08500.08970.0955
0.140.140.97560.90340.97260.97320.99060.96220.90820.10340.08980.10220.10210.09970.10320.1101
0.140.390.94760.94020.94440.94400.95040.94040.92920.17750.16910.17550.17550.17410.17630.1807
0.140.590.95480.95700.95260.95260.96020.94820.93740.25230.24440.24940.24930.24780.25070.2532
0.390.000.95500.97860.95320.95340.98100.95020.92080.17070.16050.16880.16870.16570.16990.1725
0.390.140.95140.94420.94920.94920.95280.94480.93520.17840.17000.17640.17640.17500.17770.1822
0.390.390.95060.93300.94880.94880.93820.94260.94960.22900.22400.22650.22640.22760.22730.2341
0.390.590.94960.93900.94820.94800.94100.94480.94620.28990.28460.28680.28670.28740.28790.2930
0.590.000.95000.95900.94580.94540.96120.94180.92740.24800.23910.24510.24490.24250.24600.2477
0.590.140.94840.95040.94560.94540.95400.93980.92960.25210.24410.24920.24910.24750.25080.2533
0.590.390.95000.93840.94700.94780.94120.94520.94620.29040.28500.28730.28710.28790.28860.2939
0.590.590.95300.94160.95000.95040.94380.94680.94800.34050.33520.33670.33650.33760.33770.3431
Table A3. Simulated results using 5000 simulations for the 95% confidence intervals for the mediated effect for the single-mediator model (n = 200).
Table A3. Simulated results using 5000 simulations for the 95% confidence intervals for the mediated effect for the single-mediator model (n = 200).
Coverage ProbabilityAverage Interval Length
α β GPQSobelProdMCGoodmanPercentileBCaGPQSobelProdMCGoodmanPercentileBCa
0.000.000.99880.99960.99880.99880.99960.99860.99520.03540.02500.03520.03520.03300.03560.0385
0.000.140.98020.99680.97980.97980.99760.97880.95360.05160.04460.05130.05130.04960.05190.0545
0.000.390.95060.96300.94960.94920.96580.94800.92820.11420.11040.11360.11360.11220.11450.1152
0.000.590.95140.95700.95100.95060.95940.94680.94140.16830.16530.16740.16740.16650.16880.1692
0.140.000.98080.99680.98060.98040.99740.97860.95500.05180.04490.05150.05150.04970.05200.0545
0.140.140.94340.90640.94320.94440.93180.94240.94220.06300.05870.06270.06270.06240.06330.0669
0.140.390.94880.94480.94820.94660.94980.94240.94080.11960.11690.11910.11900.11860.11970.1215
0.140.590.95020.95460.95020.95000.95580.94920.94180.17290.17030.17210.17200.17150.17310.1740
0.390.000.94360.96000.94240.94040.96280.94380.92580.11440.11070.11380.11380.11250.11460.1153
0.390.140.94880.94500.94600.94520.94880.93960.93740.12010.11730.11950.11950.11910.12050.1221
0.390.390.95420.94640.95320.95360.94760.95360.95580.15740.15590.15670.15660.15720.15750.1603
0.390.590.95340.94800.95260.95300.94840.95160.95240.20030.19850.19930.19920.19950.20050.2026
0.590.000.95040.95660.94880.94880.95780.94960.94040.16920.16610.16830.16820.16730.16950.1700
0.590.140.94480.94560.94400.94340.94680.94220.93600.17270.17010.17180.17170.17120.17310.1740
0.590.390.95380.94860.95320.95200.94960.94880.95140.20010.19840.19910.19900.19940.20060.2026
0.590.590.94900.94560.94940.95000.94680.94460.94580.23580.23410.23460.23450.23500.23600.2382
Table A4. Simulated results using 5000 simulations for the 95% confidence intervals for the mediated effect for the two-mediator model (n = 50).
Table A4. Simulated results using 5000 simulations for the 95% confidence intervals for the mediated effect for the two-mediator model (n = 50).
Coverage ProbabilityAverage Interval Length
α 1 β 1 α 2 β 2 GPQSobelMCPercentileBCaGPQSobelMCPercentileBCa
0.000.000.000.000.99840.99960.99920.99620.99200.21360.15600.20900.21230.2178
0.000.000.390.390.96160.93460.95820.95220.95680.38120.35050.37170.37410.3852
0.000.140.000.590.96840.97640.96480.95860.94120.40860.37360.39870.40330.4069
0.000.390.590.590.95780.94680.94820.94940.94780.57690.55020.56380.56410.5728
0.140.000.590.000.96700.97400.94880.95640.93400.41600.38070.40580.40900.4132
0.140.390.000.000.98180.96900.97840.96920.94920.31920.28000.31010.31480.3217
0.140.390.590.390.95760.93960.94960.94520.94560.52110.49440.50770.51450.5236
0.140.590.140.590.96180.95720.94880.94820.94040.53710.50710.52460.52770.5329
0.390.000.000.000.97920.98680.97860.97420.95380.31460.27320.30480.30990.3155
0.390.000.590.390.96240.94920.95000.94500.94260.51850.48990.50800.51220.5197
0.390.390.590.590.94860.93640.95100.94660.95060.61760.59310.60450.60740.6183
0.390.590.140.140.95980.94300.95300.94880.95120.47400.44710.46190.46480.4748
0.590.000.590.140.94780.95040.94880.94640.93040.54340.51210.52720.53460.5383
0.590.140.390.000.96000.96260.95260.95040.93600.47620.44360.46260.46820.4721
0.590.390.000.140.96300.94700.95320.95120.95140.46970.44160.46010.46250.4720
0.590.590.590.590.95200.94120.94880.94460.94600.71880.69420.70430.71060.7216
Table A5. Simulated results using 5000 simulations for the 95% confidence intervals for the mediated effect for the two-mediator model (n = 100).
Table A5. Simulated results using 5000 simulations for the 95% confidence intervals for the mediated effect for the two-mediator model (n = 100).
Coverage ProbabilityAverage Interval Length
α 1 β 1 α 2 β 2 GPQSobelMCPercentileBCaGPQSobelMCPercentileBCa
0.000.000.000.000.99920.99980.99900.99880.99540.10180.07540.10110.10150.1042
0.000.000.390.390.96300.94340.95300.95360.95980.24020.23090.23760.23840.2439
0.000.140.000.590.95300.96100.95300.95340.94200.26370.25170.25900.26130.2627
0.000.390.590.590.95680.95200.95120.94840.94780.38360.37520.37770.38030.3838
0.140.000.590.000.95980.96640.94780.95620.94400.26360.25150.26140.26370.2650
0.140.390.000.000.96320.95360.96580.95440.94360.19250.17930.19100.19160.1949
0.140.390.590.390.95700.94700.95160.94720.94420.34390.33550.34090.34140.3453
0.140.590.140.590.95560.95580.95020.94200.93940.35720.34750.35270.35400.3560
0.390.000.000.000.96560.97660.96100.95920.94120.18680.17180.18450.18510.1870
0.390.000.590.390.95600.95100.94980.94480.94280.34060.33150.33670.33910.3424
0.390.390.590.590.95240.94500.95560.94960.94940.41350.40580.40920.41130.4159
0.390.590.140.140.95100.94000.95300.94400.94700.30950.30120.30530.30810.3128
0.590.000.590.140.95480.95700.94440.94760.93980.35630.34590.35300.35470.3559
0.590.140.390.000.95500.95580.95000.95040.94420.30720.29650.30430.30640.3080
0.590.390.000.140.95040.93920.94760.95060.95060.30450.29590.30140.30590.3104
0.590.590.590.590.95560.94840.95320.94920.94980.48490.47720.47790.48220.4867
Table A6. Simulated results using 5000 simulations for the 95% confidence intervals for the mediated effect for the two-mediator model (n = 200).
Table A6. Simulated results using 5000 simulations for the 95% confidence intervals for the mediated effect for the two-mediator model (n = 200).
Coverage ProbabilityAverage Interval Length
α 1 β 1 α 2 β 2 GPQSobelMCPercentileBCaGPQSobelMCPercentileBCa
0.000.000.000.000.99860.99960.99820.99960.99580.05030.03770.04980.05020.0515
0.000.000.390.390.95380.94060.95100.95280.95660.16140.15850.16010.16180.1643
0.000.140.000.590.95140.95580.95460.94820.93820.17670.17250.17580.17660.1772
0.000.390.590.590.95600.95440.94920.95020.95060.26240.25970.26100.26240.2638
0.140.000.590.000.95020.95420.95260.94620.93720.17760.17350.17680.17830.1786
0.140.390.000.000.95500.95140.95340.95000.94460.12550.12100.12520.12580.1273
0.140.390.590.390.94920.94480.95240.95060.94980.23430.23160.23330.23510.2367
0.140.590.140.590.95580.95480.95000.94820.94520.24420.24100.24340.24460.2455
0.390.000.000.000.95560.96320.95260.95700.94660.12050.11510.11990.12040.1211
0.390.000.590.390.95060.94920.95380.94900.94500.23210.22910.23010.23240.2336
0.390.390.590.590.95380.94720.95720.95360.95180.28400.28160.28290.28420.2860
0.390.590.140.140.95140.94420.94800.94520.94800.21080.20810.20990.21040.2123
0.590.000.590.140.95460.95520.94880.94880.94360.24340.24000.24220.24370.2442
0.590.140.390.000.94840.95080.95340.94840.94320.20840.20480.20770.20970.2103
0.590.390.000.140.95880.95060.94920.94800.94700.20760.20480.20680.20840.2104
0.590.590.590.590.95340.95000.95400.94940.95080.33410.33180.33280.33460.3362

References

  1. MacKinnon, D.P. Introduction to Statistical Mediation Analysis; Routledge: London, UK, 2008. [Google Scholar]
  2. VanderWeele, T. Explanation in Causal Inference: Methods for Mediation and Interaction; Oxford University Press: Oxford, UK, 2015. [Google Scholar]
  3. MacKinnon, D.P.; Lockwood, C.M.; Williams, J. Confidence limits for the indirect effect: Distribution of the product and resampling methods. Multivar. Behav. Res. 2004, 39, 99–128. [Google Scholar] [CrossRef]
  4. Sobel, M.E. Asymptotic confidence intervals for indirect effects in structural equation models. Sociol. Methodol. 1982, 13, 290–312. [Google Scholar] [CrossRef]
  5. Preacher, K.J.; Selig, J.P. Advantages of Monte Carlo confidence intervals for indirect effects. Commun. Methods Meas. 2012, 6, 77–98. [Google Scholar] [CrossRef]
  6. Weerahandi, S. Generalized Confidence Intervals. J. Am. Stat. Assoc. 1993, 88, 899–905. [Google Scholar] [CrossRef]
  7. Hannig, J.; Iyer, H.; Patterson, P. Fiducial generalized confidence intervals. J. Am. Stat. Assoc. 2006, 101, 254–269. [Google Scholar] [CrossRef]
  8. Yu, C.Y.; Tsai, J.R. A comparison of approaches to interval estimation of mediated effects in linear regression models. Commun. Stat.-Simul. Comput. 2012, 41, 1301–1316. [Google Scholar] [CrossRef]
  9. Weerahandi, S. Exact Statistical Methods for Data Analysis; Springer Science & Business Media: Berlin/Heidelberg, Germany, 2003. [Google Scholar]
  10. Bebu, I.; Seillier-Moiseiwitsch, F.; Mathew, T. Generalized confidence intervals for ratios of regression coefficients with applications to bioassays. Biom. J. J. Math. Methods Biosci. 2009, 51, 1047–1058. [Google Scholar] [CrossRef]
  11. Meeker, W.Q.; Hahn, G.J.; Escobar, L.A. Statistical Intervals: A Guide for Practitioners and Researchers; John Wiley & Sons: Hoboken, NJ, USA, 2017; Volume 541. [Google Scholar]
  12. MacKinnon, D.P.; Valente, M.J.; Wurpts, I.C. Benchmark validation of Statistical Models: Application to mediation analysis of imagery and memory. Psychol. Methods 2018, 23, 654–671. [Google Scholar] [CrossRef] [PubMed]
  13. Tofighi, D.; MacKinnon, D.P. RMediation: An R package for mediation analysis confidence intervals. Behav. Res. Methods 2011, 43, 692–700. [Google Scholar] [CrossRef]
  14. VanderWeele, T.J.; Vansteelandt, S. Mediation analysis with multiple mediators. Epidemiol. Methods 2014, 2, 95–115. [Google Scholar] [CrossRef] [PubMed]
  15. Vansteelandt, S.; Daniel, R.M. Interventional effects for mediation analysis with multiple mediators. Epidemiology 2017, 28, 258–265. [Google Scholar] [CrossRef]
  16. Hannig, J. On generalized fiducial inference. Stat. Sin. 2009, 19, 491–544. [Google Scholar]
Figure 1. Point estimates with 95% confidence intervals for the effect on word recall mediated through the two mediators (imagery and repetition) using five different methods: generalized pivotal quantity (GPQ), Sobel, Monte Carlo, percentile bootstrap, and bias-corrected and accelerated (BCa) bootstrap.
Figure 1. Point estimates with 95% confidence intervals for the effect on word recall mediated through the two mediators (imagery and repetition) using five different methods: generalized pivotal quantity (GPQ), Sobel, Monte Carlo, percentile bootstrap, and bias-corrected and accelerated (BCa) bootstrap.
Mathematics 13 00396 g001
Table 1. The percentage of simulated coverage probabilities for the 16 scenarios which are above the 94.5% level. The seven methods used were: generalized pivotal quantity (GPQ), Sobel, PRODCLIN (Prod), Monte Carlo (MC), Goodman, percentile bootstrap, and bias-corrected and accelerated (BCa) bootstrap.
Table 1. The percentage of simulated coverage probabilities for the 16 scenarios which are above the 94.5% level. The seven methods used were: generalized pivotal quantity (GPQ), Sobel, PRODCLIN (Prod), Monte Carlo (MC), Goodman, percentile bootstrap, and bias-corrected and accelerated (BCa) bootstrap.
GPQSobelProdMCGoodmanPercentileBCa
n = 50 100.00%68.75%68.75%75.00%75.00%43.75%25.00%
n = 100 93.75%56.25%87.50%87.50%75.00%56.25%43.75%
n = 200 81.25%87.50%81.25%81.25%93.75%62.50%43.75%
Table 2. The percentage of simulated coverage probabilities for the 256 scenarios which are above the 94.5% level.
Table 2. The percentage of simulated coverage probabilities for the 256 scenarios which are above the 94.5% level.
GPQSobelMCPercentileBCa
n = 50 100.00%74.61%99.61%88.28%51.17%
n = 100 100.00%84.77%98.83%89.45%46.09%
n = 200 98.83%89.84%96.88%91.80%63.28%
Table 3. The number of scenarios (out of 256 scenarios for each sample size) where the method had the largest average interval length of the 95% confidence intervals.
Table 3. The number of scenarios (out of 256 scenarios for each sample size) where the method had the largest average interval length of the 95% confidence intervals.
GPQSobelMCPercentileBCa
n = 50 147000109
n = 100 53000203
n = 200 5000251
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Ma, X.; MacKinnon, D.P.; Mathew, T.; Agan, B.; Luta, G.; Bebu, I. Generalized Inference for Mediation Analysis. Mathematics 2025, 13, 396. https://doi.org/10.3390/math13030396

AMA Style

Ma X, MacKinnon DP, Mathew T, Agan B, Luta G, Bebu I. Generalized Inference for Mediation Analysis. Mathematics. 2025; 13(3):396. https://doi.org/10.3390/math13030396

Chicago/Turabian Style

Ma, Xiaoyang, David P. MacKinnon, Thomas Mathew, Brian Agan, George Luta, and Ionut Bebu. 2025. "Generalized Inference for Mediation Analysis" Mathematics 13, no. 3: 396. https://doi.org/10.3390/math13030396

APA Style

Ma, X., MacKinnon, D. P., Mathew, T., Agan, B., Luta, G., & Bebu, I. (2025). Generalized Inference for Mediation Analysis. Mathematics, 13(3), 396. https://doi.org/10.3390/math13030396

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop