Next Article in Journal
Complete Chloroplast Genome of Krascheninnikovia ewersmanniana: Comparative and Phylogenetic Analysis
Previous Article in Journal
Integrated Analysis of Transcriptome and Metabolome Reveals Differential Responses to Alternaria brassicicola Infection in Cabbage (Brassica oleracea var. capitata)
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Statistical Genetic Approaches to Investigate Genotype-by-Environment Interaction: Review and Novel Extension of Models

1
Department of Human Genetics, School of Medicine, University of Texas Rio Grande Valley, Brownsville, TX 78520, USA
2
South Texas Diabetes and Obesity Institute, School of Medicine, University of Texas Rio Grande Valley, Brownsville, TX 78520, USA
3
Department of Family Medicine, School of Medicine, University of Texas Rio Grande Valley, Brownsville, TX 78520, USA
*
Author to whom correspondence should be addressed.
Genes 2024, 15(5), 547; https://doi.org/10.3390/genes15050547
Submission received: 25 March 2024 / Revised: 17 April 2024 / Accepted: 19 April 2024 / Published: 25 April 2024
(This article belongs to the Special Issue Statistical Genetics of Human Complex Traits)

Abstract

:
Statistical genetic models of genotype-by-environment (G×E) interaction can be divided into two general classes, one on G×E interaction in response to dichotomous environments (e.g., sex, disease-affection status, or presence/absence of an exposure) and the other in response to continuous environments (e.g., physical activity, nutritional measurements, or continuous socioeconomic measures). Here we develop a novel model to jointly account for dichotomous and continuous environments. We develop the model in terms of a joint genotype-by-sex (for the dichotomous environment) and genotype-by-social determinants of health (SDoH; for the continuous environment). Using this model, we show how a depression variable, as measured by the Beck Depression Inventory-II survey instrument, is not only underlain by genetic effects (as has been reported elsewhere) but is also significantly determined by joint G×Sex and G×SDoH interaction effects. This model has numerous applications leading to potentially transformative research on the genetic and environmental determinants underlying complex diseases.

1. Introduction

There is presently a robust body of approaches to modeling the genotype-by-environment (G×E) interaction in statistical genetics developed from within a linear mixed model framework, which can be divided into genotype-by-dichotomous environment [1,2,3,4,5,6,7,8] and genotype-by-continuous environment [1,2,7,9,10,11,12,13] interaction models. The former class has been applied to sex (male/female) [1,4,5,6,7], disease affection status (affected/unaffected), or presence/absence of environmental exposures (e.g., smoking/non-smoking, basal/high cholesterol–high-fat diet) [2]. The latter class has been applied to continuous environments such as the age continuum [7,9], physical activity levels and sedentary behavior [10], and socioeconomic status variables [8,13] (e.g., socioeconomic index, household income, or education levels) among others. It would be advantageous to biomedical investigations to combine the two approaches to address interesting questions like a dichotomous sex-specific response to a continuous index of social determinants of health (SDoH), as we do here, or disease affection-status-specific responses to continuous environmental exposures, for example. Deeper reflection shows a plethora of invigorating, potentially transformative research that could be carried out using such a joint interaction model. Here, we review the G × E interaction models for dichotomous and continuous environments and demonstrate for the first time that they can be integrated into a single unified model. The results of our review show that the joint model can uncover important patterns and phenomena that neither model can elucidate alone.
Since the work of Sir Michael Marmot and colleagues beginning in the seventies, SDoH has come to the forefront of biomedical studies aimed at elucidating the causal mechanisms underlying disease [14,15,16,17]. Not surprisingly, the role of SDoH subsequently became an active area of research and application in mental health, particularly depression [18,19,20,21,22,23,24,25,26]. In physiological research, stress in general and SDoH-associated stress have been linked to depression, mainly by the hypothalamus–pituitary–adrenal (HPA) axis [27,28,29,30], which is the primary physiological system that deals with stress [31,32]. We theorize that SDoH may be conceptualized as a complex environment [33] in which the genetic factors underlying depression respond dynamically. That is, there is an observable genotype-by-environment (G × E) interaction between the genetic basis of depression on the one hand and the (composite multivariate) SDoH environment on the other. The SDoH environment may trigger our genetically based stress response system, which may lead to mental health disorders such as depression or anxiety. Moreover, there is an additional layer of complexity to consider because there is now ample evidence of sex-specific genetic effects underlying depression [34,35,36,37]. Thus, it is theoretically possible that the sex and SDoH environments may jointly influence depression outcomes in non-trivial ways. Our novel joint interaction model seeks to investigate how the genetic effects underlying depression may be jointly influenced by the sex and SDoH environments.

2. Materials and Methods

The University of Texas Rio Grande Valley IRB approved the study protocol. All participants provided informed consent before participating in the study. We evaluated 522 Mexican American participants recruited from the community in an ongoing genetic study for the presence of obesity, diabetes, hypertension, hyperlipidemia, and depression.
The Beck Depression Inventory II (BDI-II) was used to assess the degree of depressive symptoms over two weeks [12,38]. The BDI-II is a reliable screening tool for assessing the severity of depression, and it can be administered in both Spanish and English. The BDI-II assesses the severity of depression and is an acceptable screening instrument for depression when administered in both Spanish and English [39,40,41].
For our measure of SDoH, we used the survey of health-related social needs screen developed by Billioux et al. (2017)—known as the Accountable Health Communities Health-Related Social Needs (AHC HRSN) screen—comprising ten questions covering housing instability (2 questions), food insecurity (2 questions), transportation needs (1 question), utility needs (1 question), and interpersonal safety (4 questions) [42]. The responses to the ten questions were summed and then divided by 10.
Our main variable of interest in this report is depression as measured by the BDI-II screening instrument. The dichotomous and continuous environments of interest are respectively gender and a derived SDoH environment described in greater detail below.

2.1. Statistical Genetic Models and Inference

2.1.1. Polygenic Model

For a generic phenotype vector y, assumed to follow a multivariate normal distribution, we write the following:
y = X β + g + e
where X is a matrix of covariates augmented at the left by a column of 1s, β is a vector of the intercept parameter and corresponding regression coefficients, and g and e are unobserved random genetic and environmental effects, respectively [43,44,45]. The phenotypic covariance matrix, denoted by Σ, is given as follows:
Σ = K σ g 2 + I σ e 2 ,
where K and I, respectively, denote the genetic relationship and identity matrices, and σ g 2 and σ e 2 are correspondingly the additive genetic and environmental variance components. In statistical genetics, this base model is called the polygenic or additive genetic model [43,44,45,46]. In our initial polygenic models, we accounted for age, sex, age-squared, age-by-sex, and age-squared-by-sex as covariates, and then subjected the residuals to an inverse normalization transformation to induce agreement with the normality assumption [44]. The polygenic model is used to obtain estimates of trait heritability, defined as the ratio of the additive genetic variance to the total phenotypic variance, h 2 = σ g 2 σ g 2 + σ e 2 = σ g 2 σ p 2 , and as a model reference point upon which more complex models can be elaborated.
Using the polygenic model, the initial SDoH variable (the AHC HRSN score) was found to have a moderate heritability ( h 2 = 0.47; p = 4 × 10−6). Given our objective of analyzing SDoH strictly as an environment, we, therefore, employed a best linear unbiased prediction (BLUP) approach to extract the genetic effects that then yielded a predicted value of the environmental effects, which, in turn, can be thought of as a representation of the SDoH variable no longer confounded by genetic effects [45,47]. We term this latter BLUP-derived variable the SDoH Index (SDHI) and use it as our SDoH environmental variable.

2.1.2. Modeling the Genotype-by-Environment Interaction for Discrete and Continuous Environments

For a sample of related individuals, assuming fully uncorrelated genetic and environmental effects, the polygenic model posits that the phenotypic covariance is decomposable into additive genetic and residual environmental variance components and that inter-individual covariances are determined strictly by the additive genetic variance weighted by the genetic relatedness coefficient (see the polygenic covariance equation). The latter feature of the polygenic model makes two implicit assumptions regarding the genetic covariance: that the pairwise genetic correlation is unity and that the additive genetic variance is homogeneous. Explicitly modeling these assumptions is key to our approach to modeling the G×E interaction.
For the simplest case of contrasting two different environments, the G × E variance is zero if the following two conditions are simultaneously true: homogeneity in the additive genetic variance, σ g 1 2 = σ g 2 2 = σ g 2 , where σ g 1 2 and σ g 2 2 are the additive genetic variances in environments 1 and 2 (for example, male and female status for a G × Sex model or unaffected and affected status for a G × dichotomous disease model), respectively; the genetic correlation ( ρ g ) is one across environments, ρ g = 1 [1,6,10,12]. Rejection of either or both is evidence that the phenotypic response to the environment has a genetic basis.
We formulate the G × E model for discrete environments in terms of the G × Sex model, with which our group has had great success in elucidating interesting G × Sex or G × dichotomous environment interaction effects (for more detail, see the Appendix A) [4,6]. Under the G × Sex model, the total phenotypic covariance can be decomposed into (1) a within-female polygenic model; (2) a within-male polygenic model; and (3) the across-sex additive genetic covariance. The G × Sex model has parameters σ g f 2 , σ g m 2 , σ e f 2 , σ e m 2 for the sex-specific additive genetic and environmental variances and ρ G f , m for the across-sex genetic correlation.
We can extend this theory to an environmental spectrum to model G × E for arbitrary continuous environments as opposed to two levels of the environmental variable (for more detail, see the Appendix A). To this end, we employ variance and correlation functions [10,12,13], which we define as follows:
σ g 2 = e x p α g + γ g q i q ¯ ,   and   ρ g = e x p λ g q i q j ,
where the additive genetic variance is reparameterized as an exponential function of the value of the environmental variable q for the i t h individual, q i , scaled against the sample mean, q ¯ , and where the genetic correlation is reparameterized as an exponential decay function of the difference of environmental variables for any pair of individuals i and j , and where α g , γ g , and λ g are parameters to be estimated. The statistical null hypotheses under the reparameterizations for variance homogeneity and genetic correlation stationarity at unity, respectively, are given as γ g = 0 and λ g = 0 . To guard against model misspecification bias, we also model the residual environmental variance as a function of the environment in the same way as the additive genetic variance. Thus, the G × SDHI model has parameters α g , γ g , α e , γ e , and λ g .

2.1.3. Joint Genotype-by-Environment Interaction for Discrete and Continuous Environments

We now have the elements to construct a model to allow for a joint accounting of the G × E interaction under both a dichotomous environment and a continuous environment (sex and SDHI in the current case, respectively). The full model (detailed in the Appendix A) is decomposed into five equations specifying the within-female variance (1) and (2) covariance, the within-male variance (3) and (4) covariance, and across-sex covariance functions (all functions of the SDHI environment). The joint model has 11 total parameters, namely α g f , γ g f , λ g f , α e f , γ e f , α g m , γ g m , λ g m , α e m , γ e m , and ρ G f , m .

2.2. Statistical Inferential Theory

For the basic polygenic model, denote the parameter vector by θ = β , σ g 2 , σ e 2 and the residuals vector by ε = y X β . On assuming multivariate normality of the phenotype vector, the log-likelihood function is as follows:
l n L θ | y , X = 1 2 N l n 2 π + l n Σ + ε Σ 1 ε
The statistical genetics package SOLAR was used to obtain the model likelihoods, maximum likelihood estimates (MLEs) of model parameters, and their standard errors (SEs) [48]. Hypothesis tests were performed by way of the likelihood ratio test (LRT) statistic, which is given as follows:
Λ = 2 l n L θ N l n L θ ^ ,
where for the simplest example, θ N denotes the parameter vector for which a single parameter is constrained to 0 and all other parameters are free to be estimated at their MLEs, and θ ^ denotes the fully unconstrained parameter vector. In this case, the LRT is distributed as a chi-square random variable with degrees of freedom (df) given by the difference of the number of constrained and unconstrained parameters, which in this simplest case is 1 df: Λ ~ χ 1 2 .
It is necessary at this point to distinguish between so-called standard and non-standard conditions. Under standard conditions, the null hypothesis is not on a boundary of the acceptable parameter space, in which case the usual asymptotics for the limiting distribution of the LRT hold. For example, a regression coefficient, being essentially a slope term, takes values on the real line and, more to the point, the null hypothesis of β = 0 is not on a boundary. Under non-standard conditions, however, the null hypothesis is on a boundary of the parameter space, in which case the asymptotic limiting distribution for a 1 parameter difference can be shown to be given by a 1 2 : 1 2 mixture of a chi-square random variable with a point mass of 0, denoted by χ 0 2 and χ 1 2 [44,49,50,51,52,53]. Thus, for this non-standard case, we have Λ ~ 1 2 χ 0 2 + 1 2 χ 1 2 . For example, testing h 2 = 0 is such a 1-parameter testing scenario where the LRT follows this last mixture distribution.
Our first step is to establish that there is a genetic basis for the trait of interest, which amounts to testing and rejecting h 2 = 0 . Moving forward from this point, we advocate a staged hypothesis testing approach that our team has successfully developed and implemented in analyses of the gene-by-environment interaction [10,12,13]. In the first stage, we compared the polygenic model to the G × Sex and to the G × SDHI models. Both interaction models are 5-parameter models. Thus, there is an overall 3-parameter difference between the polygenic model (with parameters σ g 2 and σ e 2 ) and either G × E model. As made clear below in the next stage, 2 of the 3 parameters making up the difference may be considered to be under standard conditions, whereas the remaining parameter can be shown to be under a non-standard condition. Given that, in general, chi-squares are additive, these considerations give rise to Λ ~ χ 1 2 + χ 1 2 + 1 2 χ 0 2 + 1 2 χ 1 2 = 1 2 χ 2 2 + 1 2 χ 3 2 .
In the second stage, we examine the specific sources of potential G × E. In particular, we examine the null hypothesis of additive genetic variance homogeneity ( σ g f 2 = σ g m 2 and γ g = 0 under G × Sex and G × SDHI, respectively) and a genetic correlation equal to 1 ( ρ G f , m = 1 and λ g = 0 under G × Sex and G × SDHI, respectively). We note that these reparameterizations and their resulting hypothesis tests are predicated on the principle that the likelihood function is invariant under one-to-one reparameterization [46,54,55,56,57,58]. Under the null, the G × Sex model reduces to the polygenic model, whereas the G × SDHI model reduces to a reparameterized polygenic model: e x p α g + e x p α e . The additive genetic variance homogeneity null under the G × Sex model is a standard scenario because it is algebraically equivalent to testing the null hypothesis that their difference equals 0, σ g f 2 σ g m 2 = 0 , where this difference may take values on the real line and is thus not on a boundary. Regarding the G × SDHI model, γ g is essentially a slope term on the log-linear scale and, similar to the case for regression coefficients, the null hypothesis of γ g = 0 is thus not on a boundary. Therefore, for either model, Λ ~ χ 1 2 . As for the null hypothesis of a genetic correlation equal to 1 under the G × Sex model, this is clearly on the right boundary of the acceptable parameter space for any correlation coefficient, which takes values in the closed interval 1 , + 1 . As for the G × SDHI model, it happens that λ g = 0 is on the left boundary of the permissible parameter space for the exponential decay function, which corresponds to the right boundary of the genetic correlation coefficient because for λ g = 0 we have ρ G = e x p λ g q i q j = e 0 = 1 . Thus, for both cases, we have Λ ~ 1 2 χ 0 2 + 1 2 χ 1 2 .
Under the joint G × Sex and G × SDHI model, we now advocate a third stage, where we test the joint model against whichever of the two 5-parameter models (G × Sex or G × SDHI) has the higher likelihood (this difference need not be significant). The joint model has 11 total parameters, namely α g f , γ g f , λ g f , α e f , γ e f , α g m , γ g m , λ g m , α e m , γ e m , and ρ G f , m . Therefore, there is a 6-parameter difference, whereupon focusing on the parameters relevant for G × E testing (or environmental variance heterogeneity), we have 2 parameters on a boundary ( λ g f and λ g m ) and 4 not on a boundary ( γ g f , γ g m , γ e f , and γ e m ). The sum of 4 χ 1 2 variables gives χ 4 2 . Further, to jointly determine the mixture distribution for λ g f and λ g m we sum their individual mixture distributions as follows: 1 2 χ 0 2 + 1 2 χ 1 2 + 1 2 χ 0 2 + 1 2 χ 1 2 = 1 4 χ 0 2 + 1 2 χ 1 2 + 1 4 χ 2 2 . Therefore, by comparing the joint G × E model to the G × Sex model, we find Λ ~ χ 4 2 + 1 4 χ 0 2 + 1 2 χ 1 2 + 1 4 χ 2 2 = 1 4 χ 4 2 + 1 2 χ 5 2 + 1 4 χ 6 2 .
The fourth stage would then consist of the individual tests with limiting distributions given by their corresponding parameters under either the G × Sex or G × SDHI models. Note that although ρ G f , m was not relevant in deriving the LRT mixture distribution for the comparison of the joint model to the G × Sex model (because this is the only parameter present under both models), it is necessary in this last stage to test if it is significant by using the same procedure given earlier for the G × Sex model.
Before concluding this section, we note that two models, namely the G × SDHI and joint interaction models, had a parameter with a standard error (SE) greater than its corresponding maximum likelihood estimate (MLE). Likelihood theory shows that the MLE of a parameter should be greater than twice its SE if it is significant [57]. Given this principle, we elected to constrain the parameter in question (where the MLE was less than the SE) to its null value following our previously published G × E investigations where this issue was addressed [10,12,13]. To be sure, we also formally tested the parameter by the appropriate test discussed above and confirmed it to be non-significant. In the ensuing sections of this study, such models are referred to as a reduced (abbreviated Red.) version of the model. More importantly, the resulting LRT distributions were accordingly modified by one less parameter following the principles detailed above.

2.3. Comparison of Sex-Specific Additive Genetic Variance Functions

In order to enable statistical comparison of the sex-specific additive genetic variance functions at fixed values of the SDHI continuum (minimum, mean, and maximum), we computed their sampling variances using a second-order Taylor expansion approximation for a multivariable non-linear function (see Appendix A) [59].
These sampling variances were then used to compute the adjusted confidence intervals for a two-sided hypothesis test [60]. We note that we have adjusted our confidence interval to correspond to a two-sided test of the hypothesis of no difference [61,62,63]. It can be shown that the proper confidence level corresponding to a significance level of α 0.05 is conservatively given as 84%, yielding a confidence interval of 8% and 92% for the lower and upper bounds, respectively. We also performed a modified Wald test [46,54,56,57,64] denoted by W , as follows:
W = σ g f q 2 σ g m q 2 2 V a r σ g f q 2 + V a r σ g m q 2
where σ g f q 2 and σ g m q 2 are the sex-specific additive genetic variances for females and males, respectively, expressed as a function of a fixed SDHI value, denoted by q , and where V a r σ g f q 2 and V a r σ g m q 2 are their respective variance approximations. Given that we are testing if the difference is different from 0, W ~ χ 1 2 .

3. Results

In Table 1, we report the demographic characteristics of the study sample by sex. The heritabilities of both BDI-II and AHC HRSN, each estimated under a polygenic model, were both found to be significant (Table 2). In all analyses, the focal variable is BDI-II. Given that we were interested in using the AHC HRSN score as a measure of the SDoH environment, we used BLUP to statistically extract its genetic effects, at which point we termed it the SDoH index (SDHI).
We proceeded to separately test the performance of the G × Sex and G × SDoH models of BDI-II against the polygenic model for this variable. From the formal comparison in Table 3, we infer that the G × Sex model provides a significantly better fit to the data than the polygenic model. Thus, in Table 4, we proceeded to test the individual hypotheses of homogeneity in the additive genetic and residual environmental variances, and of the genetic correlation equal to 1. We found that there is significant G × Sex interaction due to the cross-sex genetic correlation being significantly different from 1. There was also evidence of significant residual variance heterogeneity.
Similarly, we found that the G × SDoH model performed significantly better than the polygenic model in explaining the data (Table 5). We thus proceeded to test the slopes of the additive genetic and residual environmental variance functions, and whether the genetic correlation function was different from 1 (Table 6). We found that there was evidence of G × SDoH interaction due to both additive genetic variance heterogeneity and a genetic correlation function that decays away from 1 with increasing differences in the environmental index, SDHI (Figure 1 and Figure 2).
Up to this point, we have demonstrated evidence of G × Sex and G × SDoH interactions. However, each interaction analysis was conducted separately from the other, and supposing that our investigation ended here, we could not be certain if both significant interactions hold when considered together. Therefore, we performed a joint interaction model analysis to address whether both interactions are still important when considered together. We selected the G × Sex interaction model to be the baseline model to assess the significance of the joint interaction model because it had a higher likelihood than the G × SDoH interaction model. On comparing the joint interaction model to the G × Sex interaction model, we found that it had a significantly better fit to the data (Table 7). We further investigated the potential significance of the sex-specific variance and correlation functions and the cross-sex genetic correlation (Table 8). We found that there was still evidence of the G × Sex interaction in this case due to the genetic correlation being significantly less than 1. There was evidence of the G × SDoH interaction in males due to additive genetic variance heterogeneity.
Our final analysis compared the sex-specific additive genetic variance functions at the minimum, mean, and maximum SDHI values (Table 9; Figure 3). The male and female additive genetic variance functions significantly differed at the minimum SDHI value but not at the mean or maximum. Consistent with the finding of male-specific additive genetic variance heterogeneity (Table 8), the adjusted confidence intervals for the male-specific additive genetic variances do not show overlap when comparing the variances at the minimum to the mean SDHI values and when comparing the variances at the mean to the maximum SDHI values, indicating a sustained and significant increase in the male-specific additive genetic variance with increasing SDHI (Table 9; Figure 3).

4. Conclusions

In this study, we developed a novel model to jointly account for genotype-by-environment interactions for dichotomous and continuous environments. In particular, we applied this joint interaction model to account for the G × Sex and G × SDoH interaction influencing depression. A motivating factor for developing this model is that it allows us to establish if both types of interaction are important independent of one another, similar to the rationale underlying multivariate logistic regression models. We were able to show that there is G × SDoH interaction but only in males and that there is G × Sex interaction due to the cross-sex genetic correlation being significantly different from 1, which indicates that depression is underlain or influenced by different sets of genes in males and females. This model has potentially critical applications in medical research. For example, the dichotomous environment component of the model could be used to model the affected and unaffected states of a given disease of interest, while the continuous environment component could be used to investigate how the genetic response of individuals in both states may be different across a continuous environment of interest, such as physical activity or sedentary behavior. Whatever the case, we are confident that this novel model can lead to fruitful investigations on the genetic basis of response to the environment.

Author Contributions

Conceptualization, E.G.M. and D.O.; Methodology, V.P.D. and J.B.; Software, V.P.D. and J.B.; Formal analysis, V.P.D. and J.B.; Investigation, V.P.D., E.G.M., M.A., D.O., J.B. and S.W.-B.; Resources, J.B. and S.W.-B.; Data curation, S.L.; Writing—original draft, V.P.D., E.G.M. and M.A.; Writing—review & editing, V.P.D., E.G.M., M.A., S.L., D.O. and J.B.; Visualization, E.G.M.; Supervision, E.G.M. and S.W.-B.; Project administration, S.W.-B.; Funding acquisition, J.B. and S.W.-B. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by a grant from the Knapp Community Care Foundation to Dr. Sarah Williams-Blangero. This work was also funded by the following NIH grants: R01 AG058464, U54 HG013247, P30 AG066546, P30 AG059305, and U19 AG076581.

Institutional Review Board Statement

The study was conducted in accordance with the Declaration of Helsinki, and approved by the Institutional Review Board of the University of Texas Rio Grande Valley.

Informed Consent Statement

Informed consent was obtained from all subjects involved in the study.

Data Availability Statement

The raw data supporting the conclusions of this article will be made available by the authors on request.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A. On the Statistical Genetic Models

Appendix A.1. Polygenic Model

For a generic phenotype vector y, assumed to follow a multivariate normal distribution, we write the following:
y = X β + g + e
where X is a matrix of covariates augmented at the left by a column of 1s, β is a vector of the intercept parameter and corresponding regression coefficients, and g and e are unobserved random genetic and environmental effects, respectively [43,44,45]. The phenotypic covariance matrix, denoted by Σ, is given as follows:
Σ = K σ g 2 + I σ e 2 ,
where K and I, respectively, give genetic relationship and identity matrices, and σ g 2 and σ e 2 are correspondingly the additive genetic and environmental variance components. In statistical genetics, this base model is called the polygenic or additive genetic model [43,44,45,46]. It is instructive at this point to provide the following scalar phenotypic covariance equation, which specifies the elements of the covariance matrix for all possible i j -comparisons:
σ p y i , y j = k i j σ g 2 + δ i j σ e 2 = σ g 2 + σ e 2     i = j ; k i j = 1 ; δ i j = 1 k i j σ g 2     i j ; δ i j = 0
where σ p y i , y j denotes the phenotypic covariance, k i j is the i j -th element in K, and δ i j = 1     i = j 0     i j denotes the Kronecker delta [65,66]. The top case on the right-hand side gives the diagonal elements of the covariance matrix under the polygenic model, and the bottom case gives the off-diagonal elements.

Appendix A.2. Modeling the Genotype-by-Environment Interaction for Discrete and Continuous Environments

For a sample of related individuals, assuming fully uncorrelated genetic and environmental effects, the polygenic model posits that the phenotypic covariance is decomposable into additive genetic and residual environmental variance components and that inter-individual covariances are given strictly by the additive genetic variance weighted by the genetic relatedness coefficient (see the polygenic covariance equation). The latter feature of the polygenic model makes two implicit assumptions regarding the genetic covariance: that the pairwise genetic correlation is unity and that the additive genetic variance is homogeneous. Explicitly modeling these assumptions is key to our approach to modeling the G × E interaction.
For the simplest case of contrasting two different environments, the G × E variance is zero if the following two conditions are simultaneously true: homogeneity in the additive genetic variance, σ g 1 2 = σ g 2 2 = σ g 2 , where σ g 1 2 and σ g 2 2 are the additive genetic variances in environments 1 and 2 (for example, male and female status for a G × Sex model or unaffected and affected status for a G × dichotomous disease model), respectively; the genetic correlation ( ρ g ) is one across environments: ρ g = 1 . Denoting the G × E variance as σ g Δ 2 , we have the following expression:
σ g Δ 2 = σ g 1 2 + σ g 2 2 2 ρ g σ g 1 σ g 2     σ g 1 2 σ g 2 2 2 σ g 2 1 ρ g     σ g 1 2 = σ g 2 2 = σ g 2
There is G × E evidence if either null hypothesis is rejected [1,6,10,12]. Rejection of either or both is evidence that the phenotypic response to the environment has a genetic basis.
We formulate the G × E model for discrete environments in terms of the G × Sex model, with which our group has had great success in elucidating interesting G × Sex or G × dichotomous environment interaction effects [4,6]. Let there be an indicator variable specifying the sex of individuals in the study, denoted by s i , as follows:
s i = 1     i Z 0     i Z
where Z is the set of males or, for generic clinical dichotomous variables, individuals in the “1-class” or affected class. The phenotypic covariance can be decomposed under the G × Sex model as follows:
σ p y i , y j = k i j σ g f 2 + δ i j σ e f 2     s i = s j = 0 k i j σ g m 2 + δ i j σ e m 2     s i = s j = 1 k i j σ g f σ g m ρ G f , m     s i = 1   a n d   s j = 0   o r   s i = 0   a n d   s j = 1
Note that the decomposition covers three general classes, which in descending order are the (1) within-female polygenic model; (2) within-male polygenic model; and (3) the across-sex additive genetic covariance. To write the matrix model, let there be a n × 1 indicator vector s with elements s i , as defined above. To construct incidence matrices for pairwise comparisons consisting of both individuals in the male class and both individuals in the female class, we write, respectively, M n × n = s n x 1 s 1 × n and F n × n = r n x 1 r 1 × n , where r = 1 s and 1 is a vector of 1s [4]. For the across-class pairwise comparisons, we write C n × n = s n x 1 r 1 × n + r n x 1 s 1 × n [4]. The covariance matrix for this model may be given as follows:
Σ = K F σ g f 2 + M σ g m 2 + C σ g f σ g b ρ G f , m + I F σ e f 2 + M σ e f 2 ,
where denotes the Hadamard matrix multiplication operator.
We can extend this theory to an environmental spectrum to model G × E for arbitrary continuous environments as opposed to two levels of the environmental variable. To this end, we employ variance and correlation functions [10,12,13], which we define as follows:
σ g 2 = e x p α g + γ g q i q ¯ ,   and   ρ g = e x p λ g q i q j ,
where the additive genetic variance is reparameterized as an exponential function of the value of the environmental variable q for the i t h individual, q i , scaled against the sample mean, q ¯ , and where the genetic correlation is reparameterized as an exponential decay function of the difference of environmental variables for any pair of individuals i and j , and where α g , γ g , and λ g are parameters to be estimated. These functions can be interpreted as the variance and correlation functions of a Gaussian stationary stochastic process [8,13,67,68,69,70], where the index variable of the stochastic process is the SDHI environment. The statistical null hypotheses under the reparameterizations for variance homogeneity and genetic correlation stationarity at unity, respectively, are given as γ g = 0 and λ g = 0 . To guard against model misspecification bias, we also model the residual environmental variance as a function of the environment in the same way as the additive genetic variance. The phenotypic covariance function, with components given by the variance and covariance functions of a Gaussian stationary stochastic process, can be partitioned as follows:
σ p y i , y j = σ g 2 + σ e 2     i = j ;   δ i j = 1 ;   q i = q j σ i σ j ρ g     i j ;   δ i j = 0 ;   q i q j = e x p α g + γ g q i q ¯ + e x p α e + γ e q i q ¯     i = j ;   δ i j = 1 ;   q i = q j e x p α g + γ g q i q ¯ 1 2   e x p α g + γ g q j q ¯ 1 2     × e x p λ g q i q j     i j ;   δ i j = 0 ;   q i q j = e x p α g + γ g q i q ¯ + e x p α e + γ e q i q ¯     i = j ;   δ i j = 1 ;   q i = q j e x p α g + 1 2 γ g q i + q j 2 q ¯ λ g q i q j     i j ;   δ i j = 0 ; q i q j
We define a genetic covariance matrix Ψ = ψ i j , with elements given as follows:
ψ i j = e x p α g + γ g q i q ¯     i = j ;   δ i j = ; q i = q j e x p α g + 1 2 γ g q i + q j 2 q ¯ λ g q i q j     i j ;   δ i j = 0 ;   q i q j
We posit a diagonal matrix Ω = d i a g ω i i with diagonal elements containing the residual environmental variance function, ω i i = e x p α e + γ e q i q ¯ . The covariance matrix for this G × E model for continuous environments is then given as follows:
Σ = K Ψ + Ω

Appendix A.3. Joint Genotype-by-Environment Interaction for Discrete and Continuous Environments

We now have the elements to construct a model to allow for a joint accounting of the G × E interaction under both a dichotomous environment and a continuous environment (sex and SDHI in the current case, respectively). A necessary first step is to posit sex-specific variance and correlation functions subscripted by f for females and m for males. Further, we specify the cross-sex genetic covariance function (as opposed to the separate within-sex genetic covariance functions), denoted by Δ , as follows:
Δ = e x p α g f + 1 2 γ g f q i + q j 2 q ¯ λ g f q i q j 1 2
× e x p α g m + 1 2 γ g m q i + q j 2 q ¯ λ g m q i q j 1 2
= e x p 1 2 α g f + α g m + 1 2 γ g f + γ g m q i + q j 2 q ¯ λ g f + λ g m q i q j
We can write the scalar covariance equation specifying all possible i j -comparisons as follows:
σ p y i , y j = e x p α g f + γ g f q i q ¯ + e x p α e f + γ e f q i q ¯     i = j ; k i j = 1 ; δ i j = 1 ; s i = s j = 0 k i j e x p α g f + 1 2 γ g f q i + q j 2 q ¯ λ g f q i q j     i j ;   δ i j = 0 ; q i q j   ;   s i = s j = 0 e x p α g m + γ g m q i q ¯ + e x p α e m + γ e m q i q ¯     i = j ; k i j = 1 ; δ i j = 1 ; s i = s j = 1 k i j e x p α g m + 1 2 γ g m q i + q j 2 q ¯ λ g m q i q j     i j ;   δ i j = 0 ; q i q j   ;   s i = s j = 1 k i j Δ ρ G f , m     i j ;   δ i j = 0 ; q i q j   ; s i = 1   a n d   s j = 0   o r   s i = 0   a n d   s j = 1
In descending order, this scalar covariance equation specifies the variance and covariance for the within-female comparisons (the first two cases), the variance and covariance for the within-male comparisons (the next two cases), and the cross-sex covariance. The matrix model, with obvious subscripts for the sex-specific matrices, is given as follows:
Σ = K F Ψ f + M Ψ m + C Δ ρ G f , m + F Ω f + M Ω m
Comparison of sex-specific additive genetic variance functions.
In order to enable statistical comparison of the sex-specific additive genetic variance functions at fixed values of the SDHI continuum (minimum, mean, and maximum), we computed their sampling variances using a second-order Taylor expansion approximation for a multivariable non-linear function [59]. The sampling variance approximation, denoted by D f y , may be written as follows:
D f y a Σ y a 1 4 t r A Σ y 2
where y = α s γ s ( s = f   or   m ), f y = e x p α s + γ s q i q ¯ ; a is the vector of first partial derivatives evaluated at the maximum, a = f y y | y = μ = e x p α s + γ s q i q ¯ q i q ¯ e x p α s + γ s q i q ¯ ; A is the Hessian matrix of second partial derivatives evaluated at the maximum, A = 2 f y y y | y = μ = e x p α s + γ s q i q ¯ q i q ¯ e x p α s + γ s q i q ¯ q i q ¯ e x p α s + γ s q i q ¯ q i q ¯ 2 e x p α s + γ s q i q ¯ ; and where Σ y is the asymptotic covariance matrix of the parameters (which can be computed from the output in SOLAR) [59]. The full approximation contains two more terms that were not included in this study due to our focus being on the immediate vicinity of the MLEs.

References

  1. Blangero, J. Statistical genetic approaches to human adaptability. Hum. Biol. 2009, 81, 523–546. [Google Scholar] [CrossRef] [PubMed]
  2. Viel, K.R.; Warren, D.M.; Buil, A.; Dyer, T.D.; Howard, T.E.; Almasy, L. A comparison of discrete versus continuous environment in a variance components-based linkage analysis of the COGA data. BMC Genet. 2005, 6, S57. [Google Scholar] [CrossRef] [PubMed]
  3. Avery, C.L.; Freedman, B.I.; Kraja, A.T.; Borecki, I.B.; Miller, M.B.; Pankow, J.S.; Arnett, D.; Lewis, C.E.; Myers, R.H.; Hunt, S.C.; et al. Genotype-by-sex interaction in the aetiology of type 2 diabetes mellitus: Support for sex-specific quantitative trait loci in Hypertension Genetic Epidemiology Network participants. Diabetologia 2006, 49, 2329–2336. [Google Scholar] [CrossRef] [PubMed]
  4. Winnier, D.A.; Rainwater, D.L.; Cole, S.A.; Williams, J.T.; Dyer, T.D.; Blangero, J.; MacCluer, J.W.; Mahaney, M.C. Sex-specific QTL effects on variation in paraoxonase 1 (PON1) activity in Mexican Americans. Genet. Epidemiol. 2007, 31, 66–74. [Google Scholar] [CrossRef] [PubMed]
  5. North, K.E.; Franceschini, N.; Borecki, I.B.; Gu, C.C.; Heiss, G.; Province, M.A.; Arnett, D.K.; Lewis, C.E.; Miller, M.B.; Myers, R.H.; et al. Genotype-by-Sex Interaction on Fasting Insulin Concentration: The HyperGEN Study. Diabetes 2007, 56, 137–142. [Google Scholar] [CrossRef]
  6. Diego, V.P.; de Chaves, R.N.; Blangero, J.; de Souza, M.C.; Santos, D.; Gomes, T.N.; dos Santos, F.K.; Garganta, R.; Katzmarzyk, P.T.; Maia, J.A. Sex-specific genetic effects in physical activity: Results from a quantitative genetic analysis. BMC Med. Genet. 2015, 16, 58. [Google Scholar] [CrossRef]
  7. Poveda, A.; Chen, Y.; Brändström, A.; Engberg, E.; Hallmans, G.; Johansson, I.; Renström, F.; Kurbasic, A.; Franks, P.W. The heritable basis of gene-environment interactions in cardiometabolic traits. Diabetologia 2017, 60, 442–452. [Google Scholar] [CrossRef]
  8. Diego, V.P.; Manusov, E.G.; Mao, X.; Almeida, M.; Peralta, J.M.; Curran, J.E.; Mahaney, M.C.; Göring, H.; Blangero, J.; Williams-Blangero, S. Metabolic syndrome traits exhibit genotype-by-environment interaction in relation to socioeconomic status in the Mexican American family heart study. Front. Genet. 2024, 15, 1240462. [Google Scholar] [CrossRef]
  9. Glahn, D.C.; Kent, J.W., Jr.; Sprooten, E.; Diego, V.P.; Winkler, A.M.; Curran, J.E.; McKay, D.R.; Knowles, E.E.; Carless, M.A.; Göring, H.H.; et al. Genetic basis of neurocognitive decline and reduced white-matter integrity in normal human brain aging. Proc. Natl. Acad. Sci. USA 2013, 110, 19006–19011. [Google Scholar] [CrossRef]
  10. Arya, R.; Farook, V.S.; Fowler, S.P.; Puppala, S.; Chittoor, G.; Resendez, R.G.; Mummidi, S.; Vanamala, J.; Almasy, L.; Curran, J.E.; et al. Genetic and environmental (physical fitness and sedentary activity) interaction effects on cardiometabolic risk factors in Mexican American children and adolescents. Genet. Epidemiol. 2018, 42, 378–393. [Google Scholar] [CrossRef]
  11. Pittner, K.; Bakermans-Kranenburg, M.J.; Alink, L.R.A.; Buisman, R.S.M.; van den Berg, L.J.M.; Block, L.H.C.G.C.C.-d.; Voorthuis, A.; Elzinga, B.M.; Lindenberg, J.; Tollenaar, M.S.; et al. Estimating the Heritability of Experiencing Child Maltreatment in an Extended Family Design. Child Maltreatment 2020, 25, 289–299. [Google Scholar] [CrossRef]
  12. Manusov, E.G.; Diego, V.P.; Sheikh, K.; Laston, S.; Blangero, J.; Williams-Blangero, S. Non-alcoholic Fatty Liver Disease and Depression: Evidence for Genotype × Environment Interaction in Mexican Americans. Front. Psychiatry 2022, 13, 936052. [Google Scholar] [CrossRef] [PubMed]
  13. Diego, V.P.; Manusov, E.G.; Mao, X.; Curran, J.E.; Göring, H.; Almeida, M.; Mahaney, M.C.; Peralta, J.M.; Blangero, J.; Williams-Blangero, S. Genotype-by-socioeconomic status interaction influences heart disease risk scores and carotid artery thickness in Mexican Americans: The predominant role of education in comparison to household income and socioeconomic index. Front. Genet. 2023, 14, 1132110. [Google Scholar] [CrossRef]
  14. Marmot, M.G.; Rose, G.; Shipley, M.; Hamilton, P.J. Employment grade and coronary heart disease in British civil servants. J. Epidemiol. Community Health 1978, 32, 244–249. [Google Scholar] [CrossRef] [PubMed]
  15. Marmot, M.G.; Stansfeld, S.; Patel, C.; North, F.; Head, J.; White, I.; Brunner, E.; Feeney, A.; Marmot, M.G.; Smith, G.D. Health inequalities among British civil servants: The Whitehall II study. Lancet 1991, 337, 1387–1393. [Google Scholar] [CrossRef]
  16. Langenberg, C.; Shipley, M.J.; Batty, G.D.; Marmot, M.G. Adult Socioeconomic Position and the Association Between Height and Coronary Heart Disease Mortality: Findings From 33 Years of Follow-Up in the Whitehall Study. Am. J. Public Health 2005, 95, 628–632. [Google Scholar] [CrossRef]
  17. Stringhini, S.; Batty, G.D.; Bovet, P.; Shipley, M.J.; Marmot, M.G.; Kumari, M.; Tabak, A.G.; Kivimäki, M. Association of Lifecourse Socioeconomic Status with Chronic Inflammation and Type 2 Diabetes Risk: The Whitehall II Prospective Cohort Study. PLoS Med. 2013, 10, e1001479. [Google Scholar] [CrossRef] [PubMed]
  18. Shim, R.S.; Ye, J.; Baltrus, P.; Fry-Johnson, Y.; Daniels, E.; Rust, G. Racial/ethnic disparities, social support, and depression: Examining a social determinant of mental health. Ethn. Dis. 2012, 22, 15–20. [Google Scholar]
  19. Allen, J.; Balfour, R.; Bell, R.; Marmot, M. Social determinants of mental health. Int. Rev. Psychiatry 2014, 26, 392–407. [Google Scholar] [CrossRef]
  20. Shim, R.; Koplan, C.; Langheim, F.J.; Manseau, M.W.; Powers, R.A.; Compton, M.T. The social determinants of mental health: An overview and call to action. Psychiatr. Ann. 2014, 44, 22–26. [Google Scholar] [CrossRef]
  21. World Health Organization. Social Determinants of Mental Health; World Health Organization: Geneva, Switzerland, 2014. [Google Scholar]
  22. Carod-Artal, F.J. Social Determinants of Mental Health. In Global Mental Health: Prevention and Promotion; Bährer-Kohler, S., Carod-Artal, F.J., Eds.; Springer International Publishing: Cham, Switzerland, 2017; pp. 33–46. [Google Scholar] [CrossRef]
  23. Alegría, M.; NeMoyer, A.; Falgàs Bagué, I.; Wang, Y.; Alvarez, K. Social Determinants of Mental Health: Where We Are and Where We Need to Go. Curr. Psychiatry Rep. 2018, 20, 95. [Google Scholar] [CrossRef]
  24. Jeste, D.V.; Koh, S.; Pender, V.B. Perspective: Social Determinants of Mental Health for the New Decade of Healthy Aging. Am. J. Geriatr. Psychiatry 2022, 30, 733–736. [Google Scholar] [CrossRef] [PubMed]
  25. Jeste, D.V.; Pender, V.B. Social Determinants of Mental Health: Recommendations for Research, Training, Practice, and Policy. JAMA Psychiatry 2022, 79, 283–284. [Google Scholar] [CrossRef] [PubMed]
  26. Shim, R.S.; Compton, M.T. The Social Determinants of Mental Health: Psychiatrists’ Roles in Addressing Discrimination and Food Insecurity. Focus 2020, 18, 25–30. [Google Scholar] [CrossRef] [PubMed]
  27. Tafet, G.E.; Nemeroff, C.B. The links between stress and depression: Psychoneuroendocrinological, genetic, and environmental interactions. J. Neuropsychiatry Clin. Neurosci. 2016, 28, 77–88. [Google Scholar] [CrossRef] [PubMed]
  28. LeMoult, J. From stress to depression: Bringing together cognitive and biological science. Curr. Dir. Psychol. Sci. 2020, 29, 592–598. [Google Scholar] [CrossRef] [PubMed]
  29. McEwen, B.S.; Akil, H. Revisiting the stress concept: Implications for affective disorders. J. Neurosci. 2020, 40, 12–21. [Google Scholar] [CrossRef] [PubMed]
  30. McEwen, B.S. Protective and damaging effects of stress mediators: Central role of the brain. Dialogues Clin. Neurosci. 2022, 8, 367–381. [Google Scholar] [CrossRef] [PubMed]
  31. McEwen, B.S. Physiology and Neurobiology of Stress and Adaptation: Central Role of the Brain. Physiol. Rev. 2007, 87, 873–904. [Google Scholar] [CrossRef]
  32. McEwen, B.S. Neurobiological and Systemic Effects of Chronic Stress. Chronic Stress 2017, 1, 2470547017692328. [Google Scholar] [CrossRef]
  33. Wilkinson, R.G.; Marmot, M. Social Determinants of Health: The Solid Facts; World Health Organization: Geneva, Switzerland, 2003. [Google Scholar]
  34. Kang, H.-J.; Park, Y.; Yoo, K.-H.; Kim, K.-T.; Kim, E.-S.; Kim, J.-W.; Kim, S.-W.; Shin, I.-S.; Yoon, J.-S.; Kim, J.H. Sex differences in the genetic architecture of depression. Sci. Rep. 2020, 10, 9927. [Google Scholar] [CrossRef] [PubMed]
  35. Labonté, B.; Engmann, O.; Purushothaman, I.; Menard, C.; Wang, J.; Tan, C.; Scarpa, J.R.; Moy, G.; Loh, Y.-H.E.; Cahill, M. Sex-specific transcriptional signatures in human depression. Nat. Med. 2017, 23, 1102–1111. [Google Scholar] [CrossRef] [PubMed]
  36. Seney, M.L.; Glausier, J.; Sibille, E. Large-scale transcriptomics studies provide insight into sex differences in depression. Biol. Psychiatry 2022, 91, 14–24. [Google Scholar] [CrossRef] [PubMed]
  37. Silveira, P.P.; Pokhvisneva, I.; Howard, D.M.; Meaney, M.J. A sex-specific genome-wide association study of depression phenotypes in UK Biobank. Mol. Psychiatry 2023, 28, 2469–2479. [Google Scholar] [CrossRef] [PubMed]
  38. Lee, K.; Kim, D.; Cho, Y. Exploratory Factor Analysis of the Beck Anxiety Inventory and the Beck Depression Inventory-II in a Psychiatric Outpatient Population. J. Korean Med. Sci. 2018, 33, e128. [Google Scholar] [CrossRef]
  39. Penley, J.A.; Wiebe, J.S.; Nwosu, A. Psychometric properties of the Spanish Beck Depression Inventory-II in a medical sample. Psychol. Assess. 2003, 15, 569–577. [Google Scholar] [CrossRef] [PubMed]
  40. Wiebe, J.S.; Penley, J.A. A psychometric comparison of the Beck Depression Inventory-II in English and Spanish. Psychol. Assess. 2005, 17, 481–485. [Google Scholar] [CrossRef] [PubMed]
  41. Wang, Y.P.; Gorenstein, C. Psychometric properties of the Beck Depression Inventory-II: A comprehensive review. Braz. J. Psychiatry 2013, 35, 416–431. [Google Scholar] [CrossRef] [PubMed]
  42. Billioux, A.; Verlander, K.; Anthony, S.; Alley, D. Standardized screening for health-related social needs in clinical settings: The accountable health communities screening tool. In NAM Perspectives; National Academy of Medicine: Washington, DC, USA, 2017. [Google Scholar]
  43. Lange, K. Mathematical and Statistical Methods for Genetic Analysis; Springer: Berlin/Heidelberg, Germany, 2002; Volume 488. [Google Scholar]
  44. Blangero, J.; Diego, V.P.; Dyer, T.D.; Almeida, M.; Peralta, J.; Kent, J.W., Jr.; Williams, J.T.; Almasy, L.; Göring, H.H. A kernel of truth: Statistical advances in polygenic variance component models for complex human pedigrees. Adv. Genet. 2013, 81, 1–31. [Google Scholar] [CrossRef]
  45. Diego, V.P.; Kent, J.W.; Blangero, J. Familial Studies: Genetic Inferences. In International Encyclopedia of the Social & Behavioral Sciences, 2nd ed.; Wright, J.D., Ed.; Elsevier: Oxford, UK, 2015; pp. 715–724. [Google Scholar] [CrossRef]
  46. Sorensen, D.; Gianola, D. Likelihood, Bayesian and MCMC Methods in Quantitative Genetics; Springer: Berlin/Heidelberg, Germany, 2002. [Google Scholar]
  47. Quillen, E.E.; Voruganti, V.S.; Chittoor, G.; Rubicz, R.; Peralta, J.M.; Almeida, M.A.; Kent, J.W., Jr.; Diego, V.P.; Dyer, T.D.; Comuzzie, A.G.; et al. Evaluation of estimated genetic values and their application to genome-wide investigation of systolic blood pressure. BMC Proc. 2014, 8, S66. [Google Scholar] [CrossRef]
  48. Almasy, L.; Blangero, J. Multipoint quantitative-trait linkage analysis in general pedigrees. Am. J. Hum. Genet. 1998, 62, 1198–1211. [Google Scholar] [CrossRef] [PubMed]
  49. Self, S.G.; Liang, K.-Y. Asymptotic Properties of Maximum Likelihood Estimators and Likelihood Ratio Tests under Nonstandard Conditions. J. Am. Stat. Assoc. 1987, 82, 605–610. [Google Scholar] [CrossRef]
  50. Dominicus, A.; Skrondal, A.; Gjessing, H.K.; Pedersen, N.L.; Palmgren, J. Likelihood ratio tests in behavioral genetics: Problems and solutions. Behav. Genet. 2006, 36, 331–340. [Google Scholar] [CrossRef] [PubMed]
  51. Visscher, P.M. A note on the asymptotic distribution of likelihood ratio tests to test variance components. Twin. Res. Hum. Genet. 2006, 9, 490–495. [Google Scholar] [CrossRef] [PubMed]
  52. DasGupta, A. Asymptotic Theory of Statistics and Probability; Springer: Berlin/Heidelberg, Germany, 2008; Volume 180. [Google Scholar]
  53. Giampaoli, V.; Singer, J.M. Likelihood ratio tests for variance components in linear mixed models. J. Stat. Plan. Inference 2009, 139, 1435–1448. [Google Scholar] [CrossRef]
  54. Azzalini, A. Statistical Inference Based on the Likelihood; Routledge: London, UK, 2017. [Google Scholar]
  55. Edwards, A.W.F. Likelihood: An Account of the Statistical Concept of Likelihood and Its Application to Scientific Inference; Johns Hopkins University Press: Baltimore, MA, USA, 1992. [Google Scholar]
  56. Held, L.; Bové, D.S. Likelihood and Bayesian Inference. Statistics for Biology and Health; Springer: Berlin/Heidelberg, Germany, 2020. [Google Scholar]
  57. Pawitan, Y. All Likelihood: Statistical Modelling and Inference Using Likelihood; Oxford University Press: Oxford, UK, 2001. [Google Scholar]
  58. Severini, T.A. Likelihood Methods in Statistics; Oxford University Press: Oxford, UK, 2000. [Google Scholar]
  59. Neudecker, H.; Trenkler, G. On the approximate variance of a nonlinear function of random variables. In Contributions to Probability and Statistics: Applications and Challenges; World Scientific: Singapore, 2006; pp. 172–177. [Google Scholar]
  60. Nakagawa, S.; Cuthill, I.C. Effect size, confidence interval and statistical significance: A practical guide for biologists. Biol. Rev. 2007, 82, 591–605. [Google Scholar] [CrossRef] [PubMed]
  61. Julious, S.A. Using confidence intervals around individual means to assess statistical significance between two means. Pharm. Stat. 2004, 3, 217–222. [Google Scholar] [CrossRef]
  62. Knol, M.J.; Pestman, W.R.; Grobbee, D.E. The (mis)use of overlap of confidence intervals to assess effect modification. Eur. J. Epidemiol. 2011, 26, 253–254. [Google Scholar] [CrossRef] [PubMed]
  63. Maghsoodloo, S.; Huang, C.-Y. Comparing the overlapping of two independent confidence intervals with a single confidence interval for two normal population parameters. J. Stat. Plan. Inference 2010, 140, 3295–3305. [Google Scholar] [CrossRef]
  64. Molenberghs, G.; Verbeke, G. Likelihood Ratio, Score, and Wald Tests in a Constrained Parameter Space. Am. Stat. 2007, 61, 22–27. [Google Scholar] [CrossRef]
  65. Elston, R.; Rao, D. Statistical modeling and analysis in human genetics. Annu. Rev. Biophys. Bioeng. 1978, 7, 253–286. [Google Scholar] [CrossRef] [PubMed]
  66. Lange, K.; Westlake, J.; Spence, M.A. Extensions to pedigree analysis III. Variance components by the scoring method. Ann. Hum. Genet. 1976, 39, 485–491. [Google Scholar] [CrossRef] [PubMed]
  67. Kirkpatrick, M.; Heckman, N. A quantitative genetic model for growth, shape, reaction norms, and other infinite-dimensional characters. J. Math. Biol. 1989, 27, 429–450. [Google Scholar] [CrossRef] [PubMed]
  68. Pletcher, S.D.; Geyer, C.J. The genetic analysis of age-dependent traits: Modeling the character process. Genetics 1999, 153, 825–835. [Google Scholar] [CrossRef]
  69. Jaffrézic, F.; Pletcher, S.D. Statistical models for estimating the genetic basis of repeated measures and other function-valued traits. Genetics 2000, 156, 913–922. [Google Scholar] [CrossRef]
  70. Pletcher, S.D.; Jaffrézic, F. Generalized character process models: Estimating the genetic basis of traits that cannot be observed and that change with age or environmental conditions. Biometrics 2002, 58, 157–162. [Google Scholar] [CrossRef]
Figure 1. (A) Additive genetic variance function. (B) Genetic correlation function. The dotted line represents the expected value of the genetic correlation under the null. The red en dash line represents the SDHI value at which the genetic correlation has decayed to half the maximum value.
Figure 1. (A) Additive genetic variance function. (B) Genetic correlation function. The dotted line represents the expected value of the genetic correlation under the null. The red en dash line represents the SDHI value at which the genetic correlation has decayed to half the maximum value.
Genes 15 00547 g001
Figure 2. The genetic covariance function showing how the additive genetic variance as a function of SDHI and the genetic correlation as a function of SDHI differences jointly influence the overall covariance.
Figure 2. The genetic covariance function showing how the additive genetic variance as a function of SDHI and the genetic correlation as a function of SDHI differences jointly influence the overall covariance.
Genes 15 00547 g002
Figure 3. Sex-specific additive genetic variances as functions of SDH Index (SDHI) at the minimum, mean, and maximum SDHI values. The horizontal dashed black lines demonstrate no overlap between the confidence intervals for the male-specific additive genetic variances at the minimum and the mean and at the mean and the maximum. The different color en dash lines indicate the adjusted confidence intervals corresponding to the sex-specific colors (pink for females and blue for males in this case).
Figure 3. Sex-specific additive genetic variances as functions of SDH Index (SDHI) at the minimum, mean, and maximum SDHI values. The horizontal dashed black lines demonstrate no overlap between the confidence intervals for the male-specific additive genetic variances at the minimum and the mean and at the mean and the maximum. The different color en dash lines indicate the adjusted confidence intervals corresponding to the sex-specific colors (pink for females and blue for males in this case).
Genes 15 00547 g003
Table 1. Demographic characteristics of the sample.
Table 1. Demographic characteristics of the sample.
TraitFemales N = 389Males N = 133p-Value
MeanSDMeanSD
Age44.33 14.7645.9615.700.2936
BDI-II19.88 15.9925.6719.993.1 × 10−4
AHA HRSN0.100.140.070.120.0025
Table 2. Heritability analysis of BDI-II and AHC HRSN screening on residualized normalized data.
Table 2. Heritability analysis of BDI-II and AHC HRSN screening on residualized normalized data.
TraitHeritabilityStandard ErrorSample Sizep-Value
BDI-II0.370.145217.8 × 10−6
AHC HRSN0.400.135216.6 × 10−4
Table 3. Testing the G × Sex interaction model against the polygenic model for BDI-II.
Table 3. Testing the G × Sex interaction model against the polygenic model for BDI-II.
TraitModelLn LikelihoodChi-Squarep-Value
BDI-IIPolygenic−247.71636.975292.8 × 10−8
G × Sex−229.228
Table 4. Testing the critical parameters of the G × Sex interaction model for BDI-II.
Table 4. Testing the critical parameters of the G × Sex interaction model for BDI-II.
TraitModel Ln LikelihoodChi-Squarep-Value
BDI-IIAdditive genetic variance homogeneity −229.3170.1780.67
Residual environmental variance homogeneity−233.4658.4731.0 × 10−3
Constrained genetic correlation across sex−234.85011.2404.0 × 10−4
G × Sex interaction model−229.230
The LRT statistic for this comparison is distributed as a 50:50 mixture of chi-squares with 2 and 3 degrees of freedom.
Table 5. Testing the G × SDHI interaction model against the polygenic model for BDI-II.
Table 5. Testing the G × SDHI interaction model against the polygenic model for BDI-II.
TraitModelLn LikelihoodChi-Squarep-Value
BDI-IIPolygenic−241.40122.0269.6 × 10−6
Red. G × E−230.388
Table 6. Testing the critical parameters of the G × SDHI interaction model for BDI-II.
Table 6. Testing the critical parameters of the G × SDHI interaction model for BDI-II.
TraitModelLn LikelihoodChi-Squarep-Value
BDI-IIConstrained genetic slope−237.64614.5171.3 × 10−4
Constrained environmental slope---------
Constrained genetic correlation decay−233.5096.2456.2 × 10−3
Red. G × E interaction model−230.388
The distributions of the LRT statistics are a chi-square with 1 df and a 50:50 mixture of a chi-square with a point mass at 0 and a chi-square with 1 df. “---” indicates the parameter constrained to the null hypothesis under the reduced G×E interaction model.
Table 7. Testing the joint G × Sex and G × SDHI interaction model against the G × Sex for BDI-II.
Table 7. Testing the joint G × Sex and G × SDHI interaction model against the G × Sex for BDI-II.
TraitModelLn LikelihoodChi-Squarep-Value
BDI-IIG × Sex−229.22842.739961.8 × 10−8
Red. G × E−207.858
Table 8. Testing the critical parameters of the joint G × Sex and G × SDHI interaction model for BDI-II.
Table 8. Testing the critical parameters of the joint G × Sex and G × SDHI interaction model for BDI-II.
TraitModelLn LikelihoodChi-Squarep-Value
BDI-IIConstrained genetic slope in females−208.1820.647 0.42
Constrained environmental slope in females−207.9420.1690.68
Constrained genetic correlation decay in females−208.5761.4360.12
Constrained genetic slope in males−216.24816.7804.2 × 10−5
Constrained environmental slope in males---------
Constrained genetic correlation decay in males−208.280.844330.18
Constrained across-sex genetic correlation−211.6427.5684022.0 × 10−3
Red. G × E interaction model−207.858
“---” indicates the parameter constrained to the null hypothesis under the reduced G×E interaction model.
Table 9. Comparison of sex-specific additive genetic variances at the minimum, mean, and maximum SDHI values (95% confidence interval at the lower and upper bounds).
Table 9. Comparison of sex-specific additive genetic variances at the minimum, mean, and maximum SDHI values (95% confidence interval at the lower and upper bounds).
SexAdditive Genetic VarianceAdjusted
Lower Bound *
Adjusted
Upper Bound
Wald
Statistic
p-Value
Min. SDHI
Females0.6180.3470.8906.5270.011
Males0.0530.0130.092
Mean SDHI
Females0.9440.4371.4511.5500.213
Males0.4290.3540.503
Max. SDHI
Females1.5253−0.11853.1690.64850.4206
Males4.63780.85098.4247
* See text on the adjusted lower and upper bounds.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Diego, V.P.; Manusov, E.G.; Almeida, M.; Laston, S.; Ortiz, D.; Blangero, J.; Williams-Blangero, S. Statistical Genetic Approaches to Investigate Genotype-by-Environment Interaction: Review and Novel Extension of Models. Genes 2024, 15, 547. https://doi.org/10.3390/genes15050547

AMA Style

Diego VP, Manusov EG, Almeida M, Laston S, Ortiz D, Blangero J, Williams-Blangero S. Statistical Genetic Approaches to Investigate Genotype-by-Environment Interaction: Review and Novel Extension of Models. Genes. 2024; 15(5):547. https://doi.org/10.3390/genes15050547

Chicago/Turabian Style

Diego, Vincent P., Eron G. Manusov, Marcio Almeida, Sandra Laston, David Ortiz, John Blangero, and Sarah Williams-Blangero. 2024. "Statistical Genetic Approaches to Investigate Genotype-by-Environment Interaction: Review and Novel Extension of Models" Genes 15, no. 5: 547. https://doi.org/10.3390/genes15050547

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop