Hostname: page-component-89b8bd64d-mmrw7 Total loading time: 0 Render date: 2026-05-10T08:23:51.457Z Has data issue: false hasContentIssue false

Simulated nonlinear genetic and environmental dynamics of complex traits

Published online by Cambridge University Press:  03 March 2022

Michael D. Hunter*
Affiliation:
Department of Human Development and Family Studies, The Pennsylvania State University, University Park, PA, USA
Kevin L. McKee
Affiliation:
Center for Neuroscience, University of California, Davis, Davis, CA, USA
Eric Turkheimer
Affiliation:
Department of Psychology, University of Virginia, Charlottesville, VA, USA
*
Corresponding author: Michael D. Hunter, email: mhunter.ou@gmail.com
Rights & Permissions [Opens in a new window]

Abstract

Genetic studies of complex traits often show disparities in estimated heritability depending on the method used, whether by genomic associations or twin and family studies. We present a simulation of individual genomes with dynamic environmental conditions to consider how linear and nonlinear effects, gene-by-environment interactions, and gene-by-environment correlations may work together to govern the long-term development of complex traits and affect estimates of heritability from common methods. Our simulation studies demonstrate that the genetic effects estimated by genome wide association studies in unrelated individuals are inadequate to characterize gene-by-environment interaction, while including related individuals in genome-wide complex trait analysis (GCTA) allows gene-by-environment interactions to be recovered in the heritability. These theoretical findings provide an explanation for the “missing heritability” problem and bridge the conceptual gap between the most common findings of GCTA and twin studies. Future studies may use the simulation model to test hypotheses about phenotypic complexity either in an exploratory way or by replicating well-established observations of specific phenotypes.

Information

Type
Regular Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution, and reproduction in any medium, provided the original work is properly cited.
Copyright
© The Author(s), 2022. Published by Cambridge University Press
Figure 0

Table 1. Model variables and user-input simulation parameters

Figure 1

Figure 1. Components of the dynamical gene-by-environment simulation, adding complexity from top to bottom.

Figure 2

Figure 2. Example simulated monozygotic twin pair phenotypes (red) with identical genomes, identical initial conditions, but different dynamic environmental factors (blue). The Twin 2 phenotype (bottom) converges to the additive genetic expectation. The Twin 1 phenotype (top) does not converge to the additive genetic expectation, but cycles instead. Gene-by-environment interactions result in (1) divergent phenotypic outcomes, (2) different developmental dynamics, with Twin 1 (top) showing periodic change, and (3) deviation from the expected phenotypic outcome under an additive genetic model (dashed line).

Figure 3

Figure 3. Phase portraits of simulated monozygotic twin pair from Figure 2. Twin 1 converges to a limit cycle, as shown by the trajectory re-tracing nearly the same, looping path repeatedly. Twin 2 converges to a fixed point, as shown by the trajectory settling to a single value.

Figure 4

Figure 4. Manhattan, cumulative variance explained by genes, and QQ plots for Study 1. Rows have different kinds of outcome measures. Columns have different conditions.

Figure 5

Figure 5. Cross-validated R2 for polygenic risk scores created by adding SNPs in order of statistical significance. A point is highlighted in red if the corresponding SNP effect is statistically significant at the Bonferroni corrected level. Note that R2 is theoretically between 0 and 1, but the vertical axis range is restricted.

Figure 6

Figure 6. GCTA Simulations with gene-by-environment interaction. Ninety-five percent confidence intervals are shown as dotted lines.