1. Introduction
Mediation analysis is widely applied in biomedical research to understand how treatments impact outcomes through various biological features. Baron and Kenny first introduced the concept of mediation within the structural equation modeling framework in their 1986 paper [
1]. Another widely adopted approach is causal mediation analysis within the counterfactual framework. Both approaches have undergone substantial development, enabling the handling of low-dimensional [
2,
3,
4,
5,
6,
7] and high-dimensional mediators [
8,
9,
10,
11,
12].
The advancement of next-generation sequencing technology has deeply impacted the scope of available biological data, which has enabled detailed profiling of biological datasets such as microbial communities and genetic variations. This progress has resulted in complex, high-dimensional datasets, such as a vast number of operational taxonomic units (OTUs) in microbiome studies and comprehensive gene expression profiles in genetic research. These datasets raise unique challenges for mediation analysis due to their high dimensionality, non-linear interactive effects between features, and complex inherent structures, such as the hierarchical taxonomic structure in the microbiome and gene–pathway relationships.
Many studies have highlighted the roles of the human microbiome and gene expression as potential mediators in the path from clinical exposures to disease outcomes [
13,
14,
15,
16,
17]. The composition of the human microbiome is dynamic and influenced by many factors, such as diet and medication, refs. [
18,
19,
20,
21], and it is associated with several diseases, including obesity, type 2 diabetes, and cardiovascular disease [
22,
23,
24]. Similarly, gene expression plays a critical role in understanding disease mechanisms, as the expression of genes influences disease outcomes by regulating biological processes such as immune responses and metabolism [
25,
26].
A key challenge in analyzing both microbiome and gene expression as mediators arises from their hierarchical structures and the complex interactive relationships among features. In the microbiome, OTUs within the same taxonomic group often interact and function collaboratively, while OTUs from different taxa may function more independently. These relationships often involve non-linear interactions, which add complexity to accurately estimating mediation effects. Similarly, gene expression data present gene pathway structures, where genes within the same pathway are often correlated and work coordinately. These structural relationships, combined with potential non-linear interactions, add complexity to mediation analysis.
Most existing high-dimensional mediation methods primarily focus on dimension reduction but do not explicitly capture complex interactive mediation effects. For example, Zhang et al. employed a regularization-based method to assess the mediation effects of DNA methylation markers between environmental exposures and health outcomes [
10]. Similarly, Zhao et al. introduced the pathway LASSO for analyzing genetic and brain mediators [
11]. Chén et al. developed the directions of the mediation method for neuroimaging data [
12]. Huang and Pan proposed a principle component analysis-based approach to transform high-dimensional gene expression mediators into conditionally independent marginal mediation models to analyze the genomic contribution to patient survival in glioblastoma multiforme [
27]. Although these methods provide useful tools for high-dimensional mediation analysis, they often fail to capture the complex interactions and hierarchical structures inherent in biological datasets, which are critical for understanding mediation effects comprehensively.
In contrast, CNNs naturally capture complex interactions and hierarchical structures and have been widely adopted to analyze high-dimensional biomedical data. For example, the ReGeNNe method employs CNNs to incorporate the biological clustering of genes within pathways and capture interactions between pathways for disease prediction [
28]. Similarly, the TaxoNN applies CNNs to account for inherent correlations and non-linear relationships in microbiome data to enhance disease prediction accuracy [
29].
To leverage these strengths for causal mediation, we introduce Med-CNN, a novel high-dimensional causal mediation model that employs CNNs to transform high-dimensional biological data into an integrative mediation metric (IMM) [
29,
30]. By synthesizing features within networks, Med-CNN derives network-specific values, which are then concatenated to predict an IMM. This approach captures the aggregate impact of the network in mediating the treatment-to-outcome relationship and effectively addresses the complexities of high-dimensional mediation analysis. Our method combines the predictive power of deep learning with the theoretical framework of mediation analysis, which allows Med-CNN to learn the inherent data structures among complex biological datasets.
2. Results
2.1. Simulation Results
In the first simulation, we evaluated the performance of Med-CNN under both null and alternative mediation effect scenarios. The results are shown in
Table 1. Across all scenarios, our approach showed consistently low bias and standard deviation (SD) for both the natural indirect effect (NIE) and natural direct effect (NDE). In the null scenarios (Scenarios 1–3), where there was no mediation effect, the bias for NIE ranged from −0.08 to −0.002, and the NDE ranged from 0.04 to 0.09. The SD values for the NIE were between 0.11 and 1.15, while for the NDE, they ranged from 0.18 to 1.15. In particular, Scenario 1 (
) showed the smallest SD, as the absence of both effects reduced the complexity of the parameter estimation. In contrast, Scenario 2 (
,
) had a larger SD compared to Scenario 3 (
,
). This is due to the fact that in Scenario 2,
introduced multicollinearity between IMM and
A, leading to greater variability in the estimated coefficients. In the alternative scenario (Scenario 4), where mediation effects were present, we examined the model’s performance across a range of
values from
to 5. For the NIE, the bias ranged from −0.22 to 0.56, with SD values between 0.68 and 1.74. For the NDE, the bias remained within −0.58 to 0.20, with the SD values ranging from 0.24 to 1.47.
In Simulation 2, we compared the performance of Med-CNN with the spectral decomposition-based method (SD-based) and the regularization-based approach (Reg-based) under the non-zero mediation effect scenario [
10,
27]. The results are summarized in
Table 2. In the linear feature–network setting, the SD-based method achieved the lowest bias, with values of 0.24 for the NIE and −0.11 for the NDE and SDs of 0.75 and 0.29, respectively. These results indicate that the SD-based method performs well in linear conditions due to its ability to transform and simplify the mediators into a set of linear combinations of features that explain most of the variability, which aligns closely with the structure of the data. The Med-CNN showed moderate estimation accuracy with a bias of 0.56 for the NIE and −0.58 for the NDE, with SDs of 1.74 and 1.47, respectively. While it did not outperform the SD-based method, it still achieved better results compared to the Reg-based method. In the non-linear feature–network setting, Med-CNN had the lowest bias and SD, with a bias of 0.17 for the NIE and −0.19 for the NDE. These results demonstrate Med-CNN’s efficiency in estimating mediation effects under non-linear conditions compared to both SD-based and Reg-based methods.
In the third simulation (
Table 3), we examined the impact of feature interactions on the performance of our approach in comparison to the SD-based and Reg-based methods. In the linear feature–network setting, Med-CNN achieved the highest accuracy with a bias of 0.42 for the NIE and −0.44 for the NDE, with SDs of 0.81 and 0.46, respectively. While the SD-based method had slightly lower SDs (0.72 for the NIE and 0.86 for the NDE), its much larger bias (6.68 for the NIE and −6.54 for the NDE) makes it less accurate. The Reg-based method showed both higher bias (12.82 for the NIE and −12.05 for the NDE) and higher SDs (5.20 for the NIE and 5.10 for the NDE). These results suggest that Med-CNN has a better balance between bias and variability in estimating mediation effects under linear conditions. In the non-linear feature–network setting, Med-CNN maintained low bias values (0.27 for the NIE and −0.29 for the NDE) and SDs (0.75 for the NIE and 0.39 for the NDE), which outperformed the other two methods in accounting for interactions. The SD-based method had large bias values (13.27 for the NIE and −13.14 for the NDE) under this non-linear condition, suggesting that it has difficulty handling complex feature interactions due to its design for linear relationships. The Reg-based method, however, showed improved performance, with a bias of 0.81 for the NIE and −0.76 for the NDE.
In the fourth simulation, we evaluated convergence thresholds of 1%, 0.1%, and 0.01% to determine the optimal setting for Med-CNN training under scenarios with a non-zero NIE.
Table 4 presents the results, showing that both the 0.1% and 0.01% thresholds demonstrated similar performance in terms of bias and SD for the NIE and the NDE estimates. However, the 0.01% threshold produced the lowest bias and SD values, which suggests slightly better accuracy. Although the 0.1% threshold may require fewer iterations, we recommend the 0.01% threshold as the optimal convergence criterion for our model due to its better accuracy in parameter estimation.
In the fifth simulation, we assessed scalability by evaluating Med-CNN’s performance across sample sizes of 500, 1000 and 1500. The results, presented in
Table 5, showed that as the sample size increased, Med-CNN demonstrated reduced bias and smaller SD values, highlighting its improved stability and performance for large-scale datasets.
2.2. Real Data Analysis Results
The results of the real data analysis and the graphical illustration are shown in
Table 6 and
Figure 1. We found that the estimated NIE came out to 0.06, indicating that the vaginal microbiome mediates 24% of the effect from the exposure to the outcome.
Previous studies have established that the relative abundance of some of the vaginal microbiome varies among different ethnic groups and could be associated with vaginal pH levels [
31,
32,
33,
34,
35]. Our results align with the existing studies, supporting the potential role of the microbiome concerning ethnicity and vaginal pH levels. By quantifying the mediation effect of the entire set of vaginal OTUs on pH levels, our analysis provides a more comprehensive understanding of the microbiome’s role in this context.
3. Discussion
In this study, we introduced Med-CNN, a deep-learning approach for estimating mediation effects within high-dimensional biological data. Our novel approach simplified the process of estimating the mediation effects in complex biological structures by effectively integrating multiple networks into an integrative mediation metric while accounting for the interactive effects among biological features within these networks. Through comprehensive simulation studies, we demonstrated that Med-CNN produces accurate and stable estimates of the NIE and NDE across varying effect sizes, sample sizes, and complexity levels, validating its reliability in different scenarios. Additionally, our real-data application illustrated Med-CNN’s ability to capture the overall mediation effect of the microbiome while accounting for hierarchical and interactive relationships among OTUs. Unlike traditional methods that often focus on individual OTUs and ignore the complex hierarchical structure and interactions within the microbiome, Med-CNN provides a comprehensive view of how OTUs collectively mediate the relationship between ethnicity and vaginal pH levels.
Our simulations provided additional insights into Med-CNN’s performance under different null hypothesis scenarios. We observed a fluctuation in the SDs of the NIE estimates ranging from 0.11 to 1.15. This variability highlights how the setup of the causal mediation model can either inflate or reduce the variability in mediation effect estimates. For example, in the first null hypothesis scenario (), where both the exposure-to-mediator and mediator-to-outcome pathways were disrupted, the parameter space was simplified, resulting in a smaller SD. In contrast, when only the mediator-to-outcome pathway was zero (), the multicollinearity between IMM and A resulted in greater variability compared to the third scenario (), where an exposure-to-mediator pathway was absent.
Med-CNN demonstrated clear advantages over SD-based and Reg-based approaches, particularly in scenarios with non-linear relationships or feature interactions within the network. In simpler cases with no interactions and purely linear relationships among features, the SD-based method slightly outperformed Med-CNN in terms of bias and SD estimation. This is likely due to the alignment of the SD-based architecture with linear data structures, which enables it to reduce dimensionality efficiently while capturing most of the variability. Nevertheless, Med-CNN consistently outperformed the Reg-based approach and excelled in scenarios with non-linearity or complex interactions. This highlights Med-CNN’s strength in handling the complexities of high-dimensional biological datasets.
Our study also has limitations that should be addressed in future work. The current causal mediation model focused on accurately estimating the mediation effects; however, we did not incorporate confounders or conduct formal hypothesis testing with p-values and confidence intervals to quantitatively assess the statistical validity of our results. Incorporating confounders into the Med-CNN is feasible, but it presents challenges related to model convergence and computational complexity. Confidence intervals can be approximated using standard errors (SE) derived from the delta method (details in
Appendix D) under the assumption that IMM is known, but this assumption may underestimate the SE of NIE. Alternatively, bootstrap procedures could be employed for empirical confidence intervals. However, developing stringent hypothesis testing for Med-CNN requires further methodology development, particularly in exploring the asymptotic properties of the estimates. In addition, we did not include a feature importance mechanism to identify specific mediators driving the mediation effect, as our primary objective was to estimate the overall mediation effect. Incorporating feature importance analysis could provide deeper biological insights and improve the interpretability of results.
Although Med-CNN handles data sparsity and noise through normalization during preprocessing, missing data remains a common challenge in real-world studies. Future efforts could focus on integrating automated handling of missing data directly into the model pipeline. This would further enhance the applicability of Med-CNN in practical settings.
Despite these limitations, Med-CNN is an innovative deep-learning approach in high-dimensional mediation analysis. By leveraging convolutional layers, it is well suited for large-scale datasets with complex biological structures, including gene–pathway relationships in gene expression studies, voxel-level spatial dependencies in neuroimaging, and hierarchical correlations in microbiome research. For instance, Med-CNN could model causal mediation in gene expression data by treating pathways as structured networks of genes, capturing the interactions within the pathways. This novel approach not only captures the non-linear interactive effects between the biological features but also effectively integrates their inherent structural complexities. Future directions will focus on integrating confounders, implementing hypothesis testing frameworks, refining the architecture to incorporate feature importance analysis, and improving the model’s adaptability to practical challenges. As a deep-learning model that bridges statistical mediation models with high-dimensional biological research, Med-CNN provides a powerful tool to advance the understanding of mediation mechanisms in complex biological studies.
4. Materials
and Methods
4.1. Review of Natural Direct and Indirect Effects
In this section, we briefly review the mediation model for continuous outcomes and continuous mediators proposed by VanderWeele [
36]. Let
A be the exposure,
M be the mediator,
Y be the outcome, and
X be the confounder. The relationship between
A,
M, and
Y is shown in
Figure 2. The counterfactual variables are introduced to define the effects to be estimated and to state the assumptions needed for estimation. For example,
represents a subject’s counterfactual outcome if the exposure
A were set at level
a. Similarly,
denotes the subject’s counterfactual mediator value if exposure were set to
a.
denotes the counterfactual outcome value when the exposure is set to
a and the mediator
M is fixed at a specific value
m.
To identify mediation effects, several assumptions must hold. First, there should be no unmeasured confounding in the exposure–outcome relationship when conditioning on X. This implies that all variables influencing A and Y are measured and included in X so that . Second, there should be no unmeasured confounding in the exposure–mediator relationship when conditioning on X. This means that all common causes of A and M are adjusted via X to ensure . Third, for the mediator–outcome relationship, it is necessary to condition on A and X so that . This ensures that no unmeasured confounders affect both M and Y. Fourth, there is no mediator–outcome confounder that is affected by the exposure. This condition ensures that there are no variables acting as confounders for the M–Y relationship that are influenced by A and can be expressed as , where is the mediator under an alternative exposure .
In addition to these assumptions, consistency and positivity assumptions are also required. The consistency assumption ensures that the observed outcomes correspond to the counterfactual outcomes under the same exposure and mediator levels. The positivity assumption requires that every level of A and M has a nonzero probability for any values of X. Mathematically, this can be expressed as and for all , and m.
If all the above assumptions hold, then we have
We are interested in three mediation effects: the NDE, NIE, and the total effect (TE). The NDE measures the amount of outcome change if the exposure of an individual changes from
to
with the mediator set to the level it would have taken if the exposure had been
(i.e.,
). It captures the direct effect from
. The NIE is the difference in outcome for an individual with exposure
and a mediator set to the value it would have taken at the level of exposure
versus the individual whose mediator would have taken the level of exposure
(i.e.,
). It captures the indirect effect through
. The TE is the difference in outcome for an individual having their exposure change from
to
(i.e.,
). The TE can be decomposed into the NIE and NDE:
If
Y and
M are continuous and
A is binary, the mediation effects are modeled through the following regression equations:
The NDE and NIE can be estimated with the following equations:
While VanderWeele’s causal mediation model provides a solid foundation for estimating mediation effects, it has several limitations that may restrict its applicability in certain contexts. One key challenge is satisfying the identifying assumptions, particularly in observational studies where unmeasured confounding could violate these assumptions and bias the results. Additionally, the model assumes linear relationships between variables, which may not hold in biomedical research datasets that are often complex and involve interactions among features. Furthermore, the model is not designed to handle high-dimensional datasets, such as those encountered in microbiome studies, where the number of mediators can vastly exceed the sample size, and interactions among mediators are prevalent. Although the challenge of satisfying identifying assumptions is a common limitation in mediation analysis and remains difficult to address, the other limitations highlight the need for advanced methods, such as Med-CNN, that can tackle non-linear relationships and high-dimensional data.
4.2. Proposed Neural-Network-Based Mediation Model: Med-CNN
4.2.1. Framework of Med-CNN Algorithm
We introduce Med-CNN, denoted as
, a novel deep-learning algorithm designed for mediation analysis in high-dimensional biological datasets with inherent hierarchical structures. For example, in the human microbiome, OTUs have inherent correlations within taxonomic levels and non-linear relationships between OTUs. CNNs address these challenges by leveraging their ability to model spatial dependence through convolutional layers and activation functions that capture non-linear effects. To illustrate the biological relevance of mediation analysis in this context, consider the role of dietary habits in influencing gut inflammation through changes in gut microbiome composition. Studies such as Turpin et al. have shown that Mediterranean diets rich in fiber, polyphenols, and omega-3 fatty acids promote beneficial gut microbiota, which in turn produce anti-inflammatory metabolites like short-chain fatty acids [
37]. These microbiome changes mediate the relationship between diet and gut inflammation.
In the context of mediation analysis, we consider a study with n subjects indexed by i, where . Dietary habit serves as the binary exposure variable, denoted by , and gut inflammation is the continuous outcome, which is denoted by . The mediators in this analysis are the high-dimensional features of the gut microbiome, which are denoted as for the i-th subject. These features are organized into K distinct networks based on taxonomic levels and organized according to similarities observed from microbiome sequencing data. Each network k () contains microbiome features. Specifically, denotes the j-th feature in the k-th network for the i-th subject, where . The vector of microbiome features for the k-th network in the i-th subject is represented as .
To integrate these high-dimensional features, we introduced the IMM, denoted as , which serves as a composite score that aggregates information across all networks. This IMM is computed through the Med-CNN model, which employs an integrated CNN model. Specifically, each microbiome network is analyzed by a network-specific CNN model designed to extract important biological features. For each network, these extracted features are condensed into a final hidden layer, which is denoted as . The values from all networks are then concatenated to form a composite layer, which is used to predict the .
The Med-CNN framework is trained with an objective function that incorporates the outcome model, which ensures that the IMM is not only a summary of the biological features but also optimized to minimize the loss of the underlying mediation model. This approach ensures that the IMM is directly connected to the outcome, rather than being just an unsupervised dimensionality reduction technique. Once the IMM is computed, it is further incorporated into a single mediator model to estimate mediation parameters.
Figure 3 provides an overview of our proposed algorithm.
4.2.2. Network-Specific CNN Model
To enhance the efficiency of the CNN framework and effectively capture the relationship among biological features, we employed a preprocessing step suggested by Sharma et al. [
29]. This approach involves reordering the input biological features based on their inter-correlations within each network. Specifically, for the
biological features identified within a network, we construct a
correlation matrix, which is defined as
This matrix is then summarized into a vector
, where each element of
represents a composite correlation coefficient for each feature computed by
The vector
is then arranged in descending order and denoted by
. The input biological features are then reordered in accordance with the sequence in
to preserve the inherent relationships within each network and potentially enhance the efficiency of the CNN input. This preprocessing step has been validated in TaxoNN, where correlation-based reordering improved prediction accuracy compared to alternative ordering strategies [
29].
Each network-specific CNN model is structured to include two 1D convolutional layers, with each followed by a pooling layer. The initial convolutional layer contains 32 filters with a kernel size of 5 and a stride of 1. The subsequent layer contains 64 filters and maintains the same kernel size. These hyperparameters were chosen based on their demonstrated effectiveness in prior studies involving high-dimensional microbiome datasets [
29]. The Rectified Linear Unit activation function was applied to both convolutional layers to introduce non-linearity [
38]. The output from the final pooling layer was then flattened and processed through a dense layer with a linear activation function to generate the network-specific value. A graphical illustration of the neural network’s layer for Network 1 is shown in
Figure 4.
4.2.3. Loss Function and Parameter Estimation
Our algorithm employs a loss function
, inspired by Nath et al. [
30], and which is formulated as
This function is designed to optimize the mediation regression based on a simplified version of VanderWeele’s mediation model, with the mediation relationships expressed as
In our Med-CNN algorithm, we utilized the as the mediator in these regression models. Our model explicitly omits confounders and the interaction between exposure and mediator to reduce computational demands and demonstrate the model’s capabilities.
An iterative method, analogous to the Expectation–Maximization algorithm, was employed to fit our machine learning model and estimate the mediation parameters. This method consists of two steps. In the first step, we estimate the IMM by training the Med-CNN by assuming that the mediation parameters (
,
,
,
, and
) are known. Under this assumption, the loss function is simplified as
where
is a function of known parameters, leaving
as the only unknown. This allows the CNN model to predict the IMM by optimizing the mediation regression and minimizing the loss function.
The second step focuses on optimizing the coefficients in the regression models (
10) and (11) assuming the IMM is known. The normalized predicted IMM is incorporated into the mediation regressions to update the parameter estimates. This step enables the estimation of mediation model parameters and effects. This iterative training and updating process continues until the percentage change in each of the five parameters falls below the prespecified criteria. The convergence for each iteration is calculated using the following equation:
Here, G represents the estimated parameters at iteration l. Convergence is achieved when the percentage change for all mediation parameters (, , , , and ) falls below the specified threshold. This convergence formula evaluates the relative improvement of the estimated parameters across iterations, avoiding scale-related issues and providing a consistent stopping criterion applicable across all parameters.
4.3. Simulation Studies
4.3.1. Simulation Setup and Scenarios
We conducted five simulation scenarios to evaluate the performance of the proposed method. In the simulation studies, the exposure variable
followed a Bernoulli distribution with a probability of 0.5. The true IMM
and the outcome
were generated according to Equations (
10) and (11), with error terms
and
following normal distributions
, respectively. Each network was modeled as a linear function of the true IMM. Network-specific biological features were incorporated as linear functions in all simulations, while non-linear relationships were additionally explored in the second and third simulation scenarios. The Med-CNN model then predicted the IMM, which is denoted
. Further details of data generation can be found in
Appendix A.
The first simulation scenario aimed to validate the general functionality and performance of the Med-CNN model across various null and alternative hypothesis scenarios. Specifically, we evaluated whether Med-CNN produced stable and accurate estimates of mediation effects under the following four mediation effect conditions: (1) no mediation effects (
), (2) no effects of the mediator on the outcome (
but
), (3) no effect of the exposure on the mediator
but
, and (4) a non-zero mediation effect (
). In the non-zero mediation effect scenario, we further evaluated the influence of varying the effect size of the exposure on the mediator by setting
to values of
, 1, 3, and 5. For other model parameters, we set
. The sample size was fixed at 1000, and we considered 5 networks with
, and 140 biological features in each network. This setup extends TaxoNN’s 4-network design for microbiome analysis to incorporate additional biological complexity [
29].
The second simulation scenario compared the performance of our proposed method with the SD-based and Reg-based approaches under the scenario of non-zero NIE. The SD-based method, developed by Huang and Pan, uses spectral decomposition to address high-dimensional mediators by transforming the correlated mediators into uncorrelated components. This transformation simplifies the estimation process by using a series of low-dimensional regression models [
27]. The Reg-based method, introduced by Zhang et al., employs sure independent screening to reduce the number of potential mediators, followed by applying the minimax concave penalty method to select significant mediators [
10]. We evaluated the performance of these methods under both linear and non-linear relationships between biological features and networks. Details of the feature–network relationships generation are provided in
Appendix A.
In the third simulation, we assessed the performance of our method alongside the SD-based method and Reg-based method in the presence of interactions among biological features within each network under the scenario of non-zero NIE. We aimed to explore how each method handles additional complexities introduced by these interactions. To simulate these biological interactions, we selected features at intervals. Specifically, we simulated every fifth feature (e.g., 5th, 10th, 15th, etc.) to represent interacting pairs. For each pair, we modified the value of a subsequent feature by adding a term that represents the product of the interacting features’ value, which was scaled by a predefined interaction strength of 2. Further details on the generation of these interactions can be found in
Appendix A.
In the fourth simulation, we aimed to determine the optimal convergence threshold for Med-CNN training under scenarios with a non-zero NIE. Convergence thresholds of 1%, 0.1%, and 0.01% were evaluated to explore the trade-off between computational efficiency and the precision of parameter estimates. These thresholds were chosen to reflect varying degrees of precision, with stricter thresholds providing finer parameter convergence at the cost of increased computational time.
Lastly, we conducted a simulation scenario to evaluate the scalability of Med-CNN with increasing sample sizes. We simulated datasets with sample sizes ranging from 500 to 1500 and assessed Med-CNN’s performance in estimating mediation effects.
For each simulation, we assessed the model’s precision by calculating the bias and the SD of the estimates for both the NIE and NDE across simulations. These metrics provide a comprehensive assessment of the accuracy and variability of the mediation estimates across simulations. SEs associated with these estimates are reported in the
Appendix B.
4.3.2. Model Specification and Evaluation Criteria
For the training of Med-CNN, the dataset was partitioned into a training set and a testing set with a 70/30 split. This split is a commonly used standard in deep learning to balance sufficient data for model training and adequate data for testing. Specifically, of the subjects were allocated to the training set and to the testing set. The simulation was conducted 100 times, with each iteration up to a maximum of 50 updates to refine the model. The updates would stop earlier if the convergence criteria were met. From each iteration, the mediation effects were estimated.
The simulations were conducted using Python 3.8.13 and R 4.2.2 on the SciNet-SOSCIP joint GPU cluster Mist server with one node allocated for each simulation [
39,
40]. The server was equipped with IBM Power9 cores, 256 GB of RAM, and 4 NVIDIA V100-SMX2-32GB GPUs. The computation time of each iteration of the Med-CNN algorithm is summarized in
Table 4.
4.4. Real-Data Study
To assess the performance of our proposed method, we applied our method to vaginal microbiome data collected from a study of reproductive-age women [
31]. The study contains 394 participants from North America who were recruited with informed consent from clinical sites at the University of Maryland School of Medicine and Emory University. Each participant completed the questionnaire and provided vaginal swabs. The vaginal microbiome data were obtained using pyrosequencing of barcoded 16S rRNA genes, resulting in 305 OTUs.
Our objective was to investigate whether the vaginal microbiome mediates variations in vaginal pH levels across different ethnicity groups. Four ethnicity groups (white (24.6%), black (26.4%), Hispanic (24.6%), and Asian (24.4%)) were dichotomized into white versus others. Vaginal pH levels were a continuous variable ranging from 4 to 7. For the vaginal microbiome, we classified OTUs into four networks based on their phylum-level classifications to incorporate the taxonomic hierarchy within the data. The top three phyla with the highest number of OTUs were individually selected, while the remaining phyla were aggregated into the “other” network. Specifically, the major networks for Med-CNN were identified as Firmicutes, Proteobacteria, Actinobacteria, and others. Additional details regarding the OTUs in each network can be found in
Appendix C.
Author Contributions
Conceptualization, Y.L., D.S. and W.X.; methodology, Y.L.; software, Y.L. and Z.Z.; writing—original draft preparation, Y.L.; writing—review and editing, Z.Z., O.S., D.S. and W.X. All authors have read and agreed to the published version of the manuscript.
Funding
This research was funded by the Natural Sciences and Engineering Research Council of Canada (NSERC Grant RGPIN-2024-06081).
Informed Consent Statement
Informed consent was obtained from all subjects involved in the study.
Data Availability Statement
The vaginal microbiome data analyzed during the study are from publicly accessible bacterial 16S rRNA gene sequences, which are available at the National Center for Biotechnology Information Short Read Archive (SRA022855).
Conflicts of Interest
The authors declare no conflicts of interest.
Abbreviations
The following abbreviations are used in this manuscript:
Med-CNN | Mediation CNN |
OTUs | Operational Taxonomic Units |
CNNs | Convolutional Neural Networks |
IMM | Integrative Mediation Metric |
SD | Standard Deviation |
NIE | Natural Indirect Effect |
NDE | Natural Direct Effect |
SD-based | Spectral Decomposition-based method |
Reg-based | Regularization-based approach |
SE | Standard Error |
TE | Total Effect |
Appendix A. Data Generation
We generate
,
M,
,
,
for
as follows:
where
,
,
,
,
,
,
, and
.
In the second simulation, the non-linear relationship between feature–network is as follows:
where
,
,
, and
.
In the third simulation, the interactions between features within the network are generated as follows:
where
represents the modified feature value after incorporating interaction;
and
are selected interacting features which chosen from every fifth feature within the network; and
is a constant that defines the strength of the interaction term.
Appendix B. Standard Errors of Simulation Results Across All Scenarios
Table A1.
Results of Simulation 1, evaluating the model performance across four mediation effect scenarios with standard errors.
Table A1.
Results of Simulation 1, evaluating the model performance across four mediation effect scenarios with standard errors.
Scenarios | NIE SE | NDE SE |
---|
(1) | | 0.01 | 0.02 |
(2) | | 0.02 | 0.04 |
(3) | | 0.56 | 0.03 |
| −5 | 0.62 | 0.12 |
| −3 | 0.58 | 0.05 |
(4) | −1 | 0.57 | 0.03 |
| 1 | 0.56 | 0.04 |
| 3 | 0.57 | 0.07 |
| 5 | 0.63 | 0.15 |
Table A2.
Results of Simulation 2, assessing the performance of Med-CNN in comparison to SD-based and Reg-based approaches in a non-zero mediation effect scenario with standard errors.
Table A2.
Results of Simulation 2, assessing the performance of Med-CNN in comparison to SD-based and Reg-based approaches in a non-zero mediation effect scenario with standard errors.
Feature–Network | Relationship | NIE SE | NDE SE |
---|
Linear | Med-CNN | 0.63 | 0.15 |
| SD-based | 0.78 | 0.28 |
| Reg-based | 5.54 | 5.55 |
Non-linear | Med-CNN | 0.61 | 0.06 |
| SD-based | 2.36 | 2.64 |
| Reg-based | 0.87 | 0.58 |
Table A3.
Results of Simulation 3, evaluating the performance of Med-CNN model in handling interactions between biological features compared to SD-based and Reg-based approaches under a non-zero mediation effect scenario with standard errors.
Table A3.
Results of Simulation 3, evaluating the performance of Med-CNN model in handling interactions between biological features compared to SD-based and Reg-based approaches under a non-zero mediation effect scenario with standard errors.
Feature–Network | Relationship | NIE SE | NDE SE |
---|
Linear | Med-CNN | 0.61 | 0.09 |
| SD-based | 0.61 | 0.70 |
| Reg-based | 4.73 | 4.74 |
Non-linear | Med-CNN | 0.61 | 0.07 |
| SD-based | 2.73 | 2.92 |
| Reg-based | 0.88 | 0.58 |
Table A4.
Results of Simulation 4, assessing the performance of Med-CNN across three convergence thresholds: 1%, 0.1%, 0.01% with standard errors.
Table A4.
Results of Simulation 4, assessing the performance of Med-CNN across three convergence thresholds: 1%, 0.1%, 0.01% with standard errors.
Convergence Thresholds | NIE SE | NDE SE |
---|
1% | 0.62 | 0.23 |
0.1% | 0.63 | 0.16 |
0.01% | 0.63 | 0.15 |
Table A5.
Results of Simulation 5, assessing the performance of Med-CNN across different sample sizes: 500, 1000, 1500 with standard errors.
Table A5.
Results of Simulation 5, assessing the performance of Med-CNN across different sample sizes: 500, 1000, 1500 with standard errors.
Sample Sizes | NIE SE | NDE SE |
---|
500 | 1.25 | 0.51 |
1000 | 0.63 | 0.15 |
1500 | 0.42 | 0.08 |
Appendix C. OTUs in Each Network
Appendix D. Estimating the Standard Error of NIE and NDE by Delta Method
By the delta method, the standard error of the NIE and NDE can be estimated by
where the NIE and NDE are estimated by
and
and
are the derivative of the NIE and NDE with respective to the
s and
s:
The
value is the variance–covariance matrix from the mediation model:
References
- Baron, R.M.; Kenny, D.A. The moderator–mediator variable distinction in social psychological research: Conceptual, strategic, and statistical considerations. J. Personal. Soc. Psychol. 1986, 51, 1173. [Google Scholar] [CrossRef] [PubMed]
- Pearl, J. Direct and indirect effects. In Probabilistic and Causal Inference: The Works of Judea Pearl; ACM: New York, NY, USA, 2022; pp. 373–392. [Google Scholar]
- Imai, K.; Keele, L.; Yamamoto, T. Identification, inference and sensitivity analysis for causal mediation effects. Stat. Sci. 2010, 25, 51–71. [Google Scholar] [CrossRef]
- VanderWeele, T.J.; Vansteelandt, S. Odds ratios for mediation analysis for a dichotomous outcome. Am. J. Epidemiol. 2010, 172, 1339–1348. [Google Scholar] [CrossRef] [PubMed]
- VanderWeele, T.; Vansteelandt, S. Mediation analysis with multiple mediators. Epidemiol. Methods 2014, 2, 95–115. [Google Scholar] [CrossRef]
- Imai, K.; Yamamoto, T. Identification and sensitivity analysis for multiple causal mechanisms: Revisiting evidence from framing experiments. Political Anal. 2013, 21, 141–171. [Google Scholar] [CrossRef]
- Boca, S.M.; Sinha, R.; Cross, A.J.; Moore, S.C.; Sampson, J.N. Testing multiple biological mediators simultaneously. Bioinformatics 2014, 30, 214–220. [Google Scholar] [CrossRef] [PubMed]
- Daniel, R.M.; De Stavola, B.L.; Cousens, S.; Vansteelandt, S. Causal mediation analysis with multiple mediators. Biometrics 2015, 71, 1–14. [Google Scholar] [CrossRef] [PubMed]
- Jérolon, A.; Baglietto, L.; Birmelé, E.; Alarcon, F.; Perduca, V. Causal mediation analysis in presence of multiple mediators uncausally related. Int. J. Biostat. 2021, 17, 191–221. [Google Scholar] [CrossRef]
- Zhang, H.; Zheng, Y.; Zhang, Z.; Gao, T.; Joyce, B.; Yoon, G.; Zhang, W.; Schwartz, J.; Just, A.; Colicino, E.; et al. Estimating and testing high-dimensional mediation effects in epigenetic studies. Bioinformatics 2016, 32, 3150–3154. [Google Scholar] [CrossRef]
- Zhao, Y.; Luo, X. Pathway Lasso: Pathway estimation and selection with high-dimensional mediators. Stat. Its Interface 2022, 15, 39–50. [Google Scholar] [CrossRef]
- Chén, O.Y.; Crainiceanu, C.; Ogburn, E.L.; Caffo, B.S.; Wager, T.D.; Lindquist, M.A. High-dimensional multivariate mediation with application to neuroimaging data. Biostatistics 2018, 19, 121–136. [Google Scholar] [CrossRef] [PubMed]
- Latorre, M.; Krishnareddy, S.; Freedberg, D.E. Microbiome as mediator: Do systemic infections start in the gut? World J. Gastroenterol. 2015, 21, 10487. [Google Scholar] [CrossRef]
- Schulz, M.D.; Atay, Ç.; Heringer, J.; Romrig, F.K.; Schwitalla, S.; Aydin, B.; Ziegler, P.K.; Varga, J.; Reindl, W.; Pommerenke, C.; et al. High-fat-diet-mediated dysbiosis promotes intestinal carcinogenesis independently of obesity. Nature 2014, 514, 508–512. [Google Scholar] [CrossRef]
- Taur, Y.; Pamer, E.G. Microbiome mediation of infections in the cancer setting. Genome Med. 2016, 8, 40. [Google Scholar] [CrossRef] [PubMed]
- Lutz, S.M.; Hokanson, J.E. Genetic influences on smoking and clinical disease. Understanding behavioral and biological pathways with mediation analysis. Ann. Am. Thorac. Soc. 2014, 11, 1082–1083. [Google Scholar] [CrossRef] [PubMed]
- Teng, M.S.; Hsu, L.A.; Wu, S.; Sun, Y.C.; Juan, S.H.; Ko, Y.L. Association of CDH13 genotypes/haplotypes with circulating adiponectin levels, metabolic syndrome, and related metabolic phenotypes: The role of the suppression effect. PLoS ONE 2015, 10, e0122664. [Google Scholar] [CrossRef]
- Wu, G.D.; Chen, J.; Hoffmann, C.; Bittinger, K.; Chen, Y.Y.; Keilbaugh, S.A.; Bewtra, M.; Knights, D.; Walters, W.A.; Knight, R.; et al. Linking long-term dietary patterns with gut microbial enterotypes. Science 2011, 334, 105–108. [Google Scholar] [CrossRef]
- Lewis, J.D.; Chen, E.Z.; Baldassano, R.N.; Otley, A.R.; Griffiths, A.M.; Lee, D.; Bittinger, K.; Bailey, A.; Friedman, E.S.; Hoffmann, C.; et al. Inflammation, antibiotics, and diet as environmental stressors of the gut microbiome in pediatric Crohn’s disease. Cell Host Microbe 2015, 18, 489–500. [Google Scholar] [CrossRef] [PubMed]
- Kurilshikov, A.; Wijmenga, C.; Fu, J.; Zhernakova, A. Host genetics and gut microbiome: Challenges and perspectives. Trends Immunol. 2017, 38, 633–647. [Google Scholar] [CrossRef]
- Kim, D.; Zeng, M.Y.; Núñez, G. The interplay between host immune cells and gut microbiota in chronic inflammatory diseases. Exp. Mol. Med. 2017, 49, e339. [Google Scholar] [CrossRef] [PubMed]
- Turnbaugh, P.J.; Hamady, M.; Yatsunenko, T.; Cantarel, B.L.; Duncan, A.; Ley, R.E.; Sogin, M.L.; Jones, W.J.; Roe, B.A.; Affourtit, J.P.; et al. A core gut microbiome in obese and lean twins. Nature 2009, 457, 480–484. [Google Scholar] [CrossRef]
- Qin, J.; Li, Y.; Cai, Z.; Li, S.; Zhu, J.; Zhang, F.; Liang, S.; Zhang, W.; Guan, Y.; Shen, D.; et al. A metagenome-wide association study of gut microbiota in type 2 diabetes. Nature 2012, 490, 55–60. [Google Scholar] [CrossRef] [PubMed]
- Koeth, R.A.; Wang, Z.; Levison, B.S.; Buffa, J.A.; Org, E.; Sheehy, B.T.; Britt, E.B.; Fu, X.; Wu, Y.; Li, L.; et al. Intestinal microbiota metabolism of L-carnitine, a nutrient in red meat, promotes atherosclerosis. Nat. Med. 2013, 19, 576–585. [Google Scholar] [CrossRef]
- Soskic, B.; Cano-Gamez, E.; Smyth, D.J.; Ambridge, K.; Ke, Z.; Matte, J.C.; Bossini-Castillo, L.; Kaplanis, J.; Ramirez-Navarro, L.; Lorenc, A.; et al. Immune disease risk variants regulate gene expression dynamics during CD4+ T cell activation. Nat. Genet. 2022, 54, 817–826. [Google Scholar] [CrossRef]
- Britt, E.C.; John, S.V.; Locasale, J.W.; Fan, J. Metabolic regulation of epigenetic remodeling in immune cells. Curr. Opin. Biotechnol. 2020, 63, 111–117. [Google Scholar] [CrossRef] [PubMed]
- Huang, Y.T.; Pan, W.C. Hypothesis test of mediation effect in causal mediation model with high-dimensional continuous mediators. Biometrics 2016, 72, 402–413. [Google Scholar] [CrossRef] [PubMed]
- Sharma, D.; Xu, W. ReGeNNe: Genetic pathway-based deep neural network using canonical correlation regularizer for disease prediction. Bioinformatics 2023, 39, btad679. [Google Scholar] [CrossRef] [PubMed]
- Sharma, D.; Paterson, A.D.; Xu, W. TaxoNN: Ensemble of neural networks on stratified microbiome data for disease prediction. Bioinformatics 2020, 36, 4544–4550. [Google Scholar] [CrossRef] [PubMed]
- Nath, T.; Caffo, B.; Wager, T.; Lindquist, M.A. A machine learning based approach towards high-dimensional mediation analysis. NeuroImage 2023, 268, 119843. [Google Scholar] [CrossRef] [PubMed]
- Ravel, J.; Gajer, P.; Abdo, Z.; Schneider, G.M.; Koenig, S.S.; McCulle, S.L.; Karlebach, S.; Gorle, R.; Russell, J.; Tacket, C.O.; et al. Vaginal microbiome of reproductive-age women. Proc. Natl. Acad. Sci. USA 2011, 108, 4680–4687. [Google Scholar] [CrossRef] [PubMed]
- Shishpal, P.; Patel, V.; Singh, D.; Bhor, V.M. pH Stress Mediated Alteration in Protein Composition and Reduction in Cytotoxic Potential of Gardnerella vaginalis Membrane Vesicles. Front. Microbiol. 2021, 12, 723909. [Google Scholar] [CrossRef] [PubMed]
- Serrano, M.G.; Parikh, H.I.; Brooks, J.P.; Edwards, D.J.; Arodz, T.J.; Edupuganti, L.; Huang, B.; Girerd, P.H.; Bokhari, Y.A.; Bradley, S.P.; et al. Racioethnic diversity in the dynamics of the vaginal microbiome during pregnancy. Nat. Med. 2019, 25, 1001–1011. [Google Scholar] [CrossRef] [PubMed]
- Fettweis, J.M.; Brooks, J.P.; Serrano, M.G.; Sheth, N.U.; Girerd, P.H.; Edwards, D.J.; Strauss, J.F., III; Jefferson, K.K.; Buck, G.A.; Consortium, V.M.; et al. Differences in vaginal microbiome in African American women versus women of European ancestry. Microbiology 2014, 160, 2272. [Google Scholar] [CrossRef]
- Beamer, M.A.; Austin, M.N.; Avolia, H.A.; Meyn, L.A.; Bunge, K.E.; Hillier, S.L. Bacterial species colonizing the vagina of healthy women are not associated with race. Anaerobe 2017, 45, 40–43. [Google Scholar] [CrossRef] [PubMed]
- Valeri, L.; VanderWeele, T.J. Mediation analysis allowing for exposure–mediator interactions and causal interpretation: Theoretical assumptions and implementation with SAS and SPSS macros. Psychol. Methods 2013, 18, 137. [Google Scholar] [CrossRef] [PubMed]
- Turpin, W.; Dong, M.; Sasson, G.; Garay, J.A.R.; Espin-Garcia, O.; Lee, S.H.; Neustaeter, A.; Smith, M.I.; Leibovitzh, H.; Guttman, D.S.; et al. Mediterranean-like dietary pattern associations with gut microbiome composition and subclinical gastrointestinal inflammation. Gastroenterology 2022, 163, 685–698. [Google Scholar] [CrossRef] [PubMed]
- Nair, V.; Hinton, G.E. Rectified linear units improve restricted boltzmann machines. In Proceedings of the 27th International Conference on Machine Learning (ICML-10), Haifa, Israel, 21–24 June 2010; pp. 807–814. [Google Scholar]
- Python Core Team. Python: A Dynamic, Open Source Programming Language; Python Software Foundation: Wolfeboro Falls, NH, USA, 2019. [Google Scholar]
- R Core Team. R: A Language and Environment for Statistical Computing; R Foundation for Statistical Computing: Vienna, Austria, 2021. [Google Scholar]
Figure 1.
The DAG of the real-data example. A is ethnicity, is the predicted integrative mediation metric, Y is the vaginal pH level.
Figure 1.
The DAG of the real-data example. A is ethnicity, is the predicted integrative mediation metric, Y is the vaginal pH level.
Figure 2.
Mediation structure with a single mediator. In this model, A denotes the exposure variable, M is the mediator, Y is the outcome variable, and X is the confounder.
Figure 2.
Mediation structure with a single mediator. In this model, A denotes the exposure variable, M is the mediator, Y is the outcome variable, and X is the confounder.
Figure 3.
Illustration of the Med-CNN framework among K networks. Here, a ‘network’ is defined as a set of biologically related features organized based on their functional or structural relationships, such as OTUs clustered by phylum level or gene expressions aggregated into pathways. Each network contains features, where k ranges from 1 to K.
Figure 3.
Illustration of the Med-CNN framework among K networks. Here, a ‘network’ is defined as a set of biologically related features organized based on their functional or structural relationships, such as OTUs clustered by phylum level or gene expressions aggregated into pathways. Each network contains features, where k ranges from 1 to K.
Figure 4.
Illustration of the neural network layers for Network 1, with denoting the network-specific value associated with Network 1. All networks follow a similar architecture.
Figure 4.
Illustration of the neural network layers for Network 1, with denoting the network-specific value associated with Network 1. All networks follow a similar architecture.
Table 1.
Results of Simulation 1, evaluating the model performance across four mediation effect scenarios: (1) no mediation effects (), (2) no effects of the mediator on the outcome ( but ), (3) no effect of the exposure on the mediator but , and (4) a non-zero mediation effect (). In Scenario (4), we further assessed the impact of varying the effect size of on model performance. Reported values are derived from training datasets using a deep learning model.
Table 1.
Results of Simulation 1, evaluating the model performance across four mediation effect scenarios: (1) no mediation effects (), (2) no effects of the mediator on the outcome ( but ), (3) no effect of the exposure on the mediator but , and (4) a non-zero mediation effect (). In Scenario (4), we further assessed the impact of varying the effect size of on model performance. Reported values are derived from training datasets using a deep learning model.
| NIE | NDE |
---|
Scenarios | True Value | Bias | SD | True Value | Bias | SD |
Null
| | | | | | |
(1) | | 0 | −0.002 | 0.11 | 2 | 0.04 | 0.18 |
(2) | | 0 | −0.05 | 1.15 | 2 | 0.09 | 1.15 |
(3) | | 0 | −0.08 | 0.67 | 2 | 0.06 | 0.24 |
Alternative ()
| | | | | | |
| −5 | −25 | 0.53 | 1.13 | 2 | −0.53 | 0.80 |
| −3 | −15 | −0.16 | 0.78 | 2 | 0.14 | 0.38 |
(4) | −1 | −5 | −0.22 | 0.68 | 2 | 0.20 | 0.24 |
| 1 | 5 | 0.09 | 0.70 | 2 | −0.11 | 0.26 |
| 3 | 15 | 0.38 | 0.89 | 2 | −0.38 | 0.51 |
| 5 | 25 | 0.56 | 1.74 | 2 | −0.58 | 1.47 |
Table 2.
Results of Simulation 2, assessing the performance of Med-CNN in comparison to SD-based and Reg-based approaches in a non-zero mediation effect scenario (). Reported values are based on training datasets using a deep learning model.
Table 2.
Results of Simulation 2, assessing the performance of Med-CNN in comparison to SD-based and Reg-based approaches in a non-zero mediation effect scenario (). Reported values are based on training datasets using a deep learning model.
Feature–Network | | NIE | NDE |
---|
Relationship
|
Method
|
True Value
|
Bias
|
SD
|
True Value
|
Bias
|
SD
|
---|
Linear | Med-CNN | 25 | 0.56 | 1.74 | 2 | −0.58 | 1.47 |
| SD-based | 25 | 0.24 | 0.75 | 2 | −0.11 | 0.29 |
| Reg-based | 25 | 12.73 | 4.94 | 2 | −12.59 | 4.89 |
Non-linear | Med-CNN | 25 | 0.17 | 0.73 | 2 | −0.19 | 0.34 |
| SD-based | 25 | 2.01 | 1.25 | 2 | −1.87 | 1.23 |
| Reg-based | 25 | 1.03 | 0.90 | 2 | −0.90 | 0.70 |
Table 3.
Results of Simulation 3, evaluating the performance of Med-CNN model in handling interactions between biological features compared to SD-based and Reg-based approaches under a non-zero mediation effect scenario. Reported values are based on results from training datasets using a deep learning model.
Table 3.
Results of Simulation 3, evaluating the performance of Med-CNN model in handling interactions between biological features compared to SD-based and Reg-based approaches under a non-zero mediation effect scenario. Reported values are based on results from training datasets using a deep learning model.
Feature–Network | | NIE | NDE |
---|
Relationship
|
Method
|
True Value
|
Bias
|
SD
|
True Value
|
Bias
|
SD
|
---|
Linear | Med-CNN | 25 | 0.42 | 0.81 | 2 | −0.44 | 0.46 |
| SD-based | 25 | 6.68 | 0.72 | 2 | −6.54 | 0.86 |
| Reg-based | 25 | 12.82 | 5.20 | 2 | −12.05 | 5.10 |
Non-linear | Med-CNN | 25 | 0.27 | 0.75 | 2 | −0.29 | 0.39 |
| SD-based | 25 | 13.27 | 2.39 | 2 | −13.14 | 2.47 |
| Reg-based | 25 | 0.81 | 0.98 | 2 | −0.76 | 0.74 |
Table 4.
Results of Simulation 4 assessing the performance of Med-CNN across three convergence thresholds: 1%, 0.1%, 0.01%.
Table 4.
Results of Simulation 4 assessing the performance of Med-CNN across three convergence thresholds: 1%, 0.1%, 0.01%.
Convergence | NIE | NDE | Computation |
---|
Thresholds
|
True Value
|
Bias
|
SD
|
True Value
|
Bias
|
SD
|
Time (min)
|
---|
1% | 25 | 2.23 | 4.32 | 2 | −2.23 | 4.19 | 11.60 |
0.1% | 25 | 0.75 | 2.19 | 2 | −0.76 | 1.94 | 18.53 |
0.01% | 25 | 0.56 | 1.74 | 2 | −0.58 | 1.47 | 18.63 |
Table 5.
Results of Simulation 5, assessing the performance of Med-CNN across different sample sizes: 500, 1000, 1500.
Table 5.
Results of Simulation 5, assessing the performance of Med-CNN across different sample sizes: 500, 1000, 1500.
Sample Sizes | NIE | NDE |
---|
True Value
|
Bias
|
SD
|
True Value
|
Bias
|
SD
|
---|
500 | 25 | 3.48 | 3.82 | 2 | −3.46 | 3.68 |
1000 | 25 | 0.56 | 1.74 | 2 | −0.58 | 1.47 |
1500 | 25 | 0.13 | 0.84 | 2 | −0.26 | 0.67 |
Table 6.
Estimated effects from the real data example.
Table 6.
Estimated effects from the real data example.
Scenarios | Effect | Standard Error |
---|
NIE | 0.06 | 0.001 |
NDE | −0.30 | 0.008 |
| Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).