The Power to Detect Cultural Transmission in the Nuclear Twin Family Design With and Without Polygenic Risk Scores and in the Transmitted–Nontransmitted (Alleles) Design

Susanne Bruins; Conor V. Dolan; Dorret I. Boomsma

doi:10.1017/thg.2020.76

The Power to Detect Cultural Transmission in the Nuclear Twin Family Design With and Without Polygenic Risk Scores and in the Transmitted–Nontransmitted (Alleles) Design

Published online by Cambridge University Press: 16 October 2020

Susanne Bruins ,

Conor V. Dolan and

Dorret I. Boomsma

Show author details

Susanne Bruins: Affiliation:
Department of Biological Psychology, Vrije Universiteit Amsterdam, Amsterdam, The Netherlands Amsterdam Public Health Research Institute, Amsterdam, The Netherlands
Conor V. Dolan*: Affiliation:
Department of Biological Psychology, Vrije Universiteit Amsterdam, Amsterdam, The Netherlands Amsterdam Public Health Research Institute, Amsterdam, The Netherlands
Dorret I. Boomsma: Affiliation:
Department of Biological Psychology, Vrije Universiteit Amsterdam, Amsterdam, The Netherlands Amsterdam Public Health Research Institute, Amsterdam, The Netherlands
*: Author for correspondence: Conor V. Dolan, Email: c.v.dolan@vu.nl

Article contents

Abstract
The NTF Design
The T–NT PRS Design
Power
Results
Discussion
Supplementary material
References

Abstract

We compare the power of two different approaches to detect passive genotype–environment (GE) covariance originating from cultural and genetic transmission operating simultaneously. In the traditional nuclear twin family (NTF) design, cultural transmission is estimated from the phenotypic covariance matrices of the mono- and dizygotic twins and their parents. Here, phenotyping is required in all family members. A more recent method is the transmitted–nontransmitted (T–NT) allele design, which exploits measured genetic variants in parents and offspring to test for effects of nontransmitted alleles from parents. This design requires two-generation genome-wide data and a powerful genome-wide association study (GWAS) for the phenotype in addition to phenotyping in offspring. We compared the power of both designs. Using exact data simulation, we demonstrate three points: how the power of the T–NT design depends on the predictive power of polygenic risk scores (PRSs); that when the NTF design can be applied, its power to detect cultural transmission and GE covariance is high relative to T–NT; and that, given effect sizes from contemporary GWAS, adding PRSs to the NTF design does not yield an appreciable increase in the power to detect cultural transmission. However, it may be difficult to collect phenotypes of parents and the possible importance of gene × age interaction, and secular generational effects can cause complications for many important phenotypes. The T–NT design avoids these complications.

Keywords

cultural transmission nontransmitted alleles genetic nurture nuclear twin family design power

Information

Type: Articles
Information: Twin Research and Human Genetics , Volume 23 , Issue 5 , October 2020 , pp. 265 - 270

DOI: https://doi.org/10.1017/thg.2020.76 [Opens in a new window]
Creative Commons: This is an Open Access article, distributed under the terms of the Creative Commons Attribution-NonCommercial-NoDerivatives licence (http://creativecommons.org/licenses/by-nc-nd/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is unaltered and is properly cited. The written permission of Cambridge University Press must be obtained for commercial re-use or in order to create a derivative work.
Copyright: © The Author(s) 2020. Published by Cambridge University Press

This study aimed to compare the power of the nuclear twin family (NTF) design and the transmitted–nontransmitted (T–NT) alleles design to detect genotype–environment (GE) covariance due to cultural transmission. The classical NTF design uses the implied phenotypic covariance matrices of monozygotic (MZ) and dizygotic (DZ) twins and their parents, and the T–NT design exploits measured genetic variants (GVs) in parents and offspring, to estimate genetic and cultural transmission.

Simultaneous genetic and cultural transmission leads to passive GE covariance (Plomin et al., Reference Plomin, DeFries and Loehlin1977). Passive GE covariance occurs when parental genotypes influence the rearing environment of their offspring, which is sometimes referred to as cultural transmission (e.g., Cavalli-Sforza & Feldman, Reference Cavalli-Sforza and Feldman1973; Eaves, Reference Eaves1976a, Reference Eaves1976b; Eaves et al., Reference Eaves, Last, Martin and Jinks1977; Fulker, Reference Fulker, Weir, Eisen, Goodman and Namkoong1988; Maes et al., Reference Maes, Neale, Kendler, Martin, Heath and Eaves2006). Because the offspring inherits half of each parent’s alleles, and the offspring is subject to influences of the rearing environment shaped indirectly by parents’ genotypes, a covariance arises between genotypic and environmental influences. In addition to passive GE covariance (on which we focus in this article), evocative and active GE covariance are also distinguished (Plomin et al., Reference Plomin, DeFries and Loehlin1977). The latter arises when an individual’s behavior and preferences are influenced by the individual’s genotype, and the individual actively chooses and creates environments that suit their behavior and preferences. The former occurs when an individual’s actions, influenced by the individual’s genotype, systematically evoked certain responses from the individual’s environment.

GE covariance is of substantive interest as it is thought to be important in cognitive development (Cheesman et al., Reference Cheesman, Hunjan, Coleman, Ahmadzadeh, Plomin, McAdams, Eley and Breen2020; Scarr & McCartney, Reference Scarr and McCartney1983; Zavala et al., Reference Zavala, Beam, Finch, Gatz, Johnson, Kremen, Neiderhiser, Pedersen and Reynolds2018) and in the development of behavioral problems (Bornovalova et al., Reference Bornovalova, Cummings, Hunt, Blazei, Malone and Iacono2014; Harold et al., Reference Harold, Leve, Elam, Thapar, Neiderhiser, Natsuaki, Shaw and Reiss2013; Jaffee & Price, Reference Jaffee and Price2007; Rutter & Silberg, Reference Rutter and Silberg2002). It is of statistical interest, as the correct interpretation of model parameters in the classical twin design hinges on the assumption of no GE covariance (Keller et al., Reference Keller, Medland and Duncan2010). For instance, (unmodeled) covariance between A (additive genetic variable) and C (shared environmental variable) biases the estimate of the shared environmental variance (σ_C) in the classical twin ACE model (Eaves et al., Reference Eaves, Last, Martin and Jinks1977; Keller et al., Reference Keller, Medland and Duncan2010; Keller et al., Reference Keller, Medland, Duncan, Hatemi, Neale, Maes and Eaves2009; Purcell, Reference Purcell2002).

While there are various designs and models that allow for the estimation of GE covariance (e.g., Carey, Reference Carey1986; Dolan et al., Reference Dolan, de Kort, van Beijsterveldt, Bartels and Boomsma2014; Dolan et al., Reference Dolan, Huijskens, Minică, Neale and Boomsma2020; Eaves et al., Reference Eaves, Last, Martin and Jinks1977), we focus on two designs here that specifically assess cultural transmission. The NTF design (Keller et al., Reference Keller, Medland, Duncan, Hatemi, Neale, Maes and Eaves2009) extends the classical twin design by including the parents of the twins. In the NTF design, the family environment is defined as an environment shared between all family members that arises due to cultural transmission. Additionally, one can estimate the variance due to sibling shared environment (environment shared between members of a twin pair or sibling pair), variance due to nonadditive genetic effects, unshared environment (all variance due to influences unique to the individual, including measurement error), genotype–family environment covariance, and phenotypic assortative mating.

A more recent design to detect GE covariance stemming from cultural transmission is the T–NT alleles design. In this design, cultural transmission is manifested in the effect on the offspring phenotype of parents’ nontransmitted alleles (Bates et al., Reference Bates, Maher, Medland, McAloney, Wright, Hansell and Gillespie2018; Kong et al., Reference Kong, Thorleifsson, Frigge, Vilhjalmsson, Young, Thorgeirsson, Benonisdottir, Oddsson, Halldorsson, Masson, Gudbjartsson, Helgason, Bjornsdottir, Thorsteinsdottir and Stefansson2018). This method can be applied to parent–offspring trios but can easily be extended to include multiple offspring, including twins. As parents pass on half of their alleles to their offspring, the polygenic risk score (PRS) of the offspring is a function of the transmitted alleles from both parents. For the other nontransmitted half of the parental alleles, a nontransmitted PRS can be calculated. In a linear regression, if genetic transmission is the only pathway of transmission from parent to offspring, the regression coefficient in the regression of offspring phenotype on the nontransmitted PRS should be zero. Rejection of the (null) hypothesis that the regression coefficient is zero suggests that the parental genotype has an indirect effect on the offspring phenotype, that is, that cultural transmission is present.

Our aim is to compare the power of the NTF design and the T–NT design to detect cultural transmission and to assess whether the addition of PRS to the NTF design improves the power to detect cultural transmission in this design. The article is organized as follows. First, we present the classical NTF design, the T–NT design, and the NTF design including PRS. Second, we outline our strategy with respect to simulation and power analysis. Third, we present the results of our simulation studies and discuss the implications.

The NTF Design

The relationship between the total additive genetic variance inferred in the classical NTF design and the variance explained by the PRS is as follows. Suppose that there are $$M$$ GVs contributing to the variance of phenotype $$Ph$$ , and that we have measured all $$M$$ relevant GVs. For convenience (but without loss of generality) assume also that the GVs are in gametic phase (linkage) equilibrium, we have, for individual $$i$$ :

$$\eqalign{ P{h_i} = {b_0} + {b_1}G{V_{1i}} + {b_2}G{V_{2i}} + {b_3}G{V_{3i}} + \ldots + {b_M}G{V_{Mi}} + {e_i} \cr {\rm{\sigma }}_{Ph}^2 = b_1^2{\rm{\sigma }}_{GV1}^2 + b_2^2{\rm{\sigma }}_{GV2}^2 + b_3^2\sigma _{GV3}^2 + \ldots + {\rm{}}b_M^2{\rm{\sigma }}_{GVM}^2 + {\rm{\sigma }}_E^2 \cr {\rm{\sigma }}_{Ph}^2 = g_1^2 + g_2^2 + g_3^2 + \ldots + g_M^2 + {\rm{\sigma }}_E^2 = \sum\nolimits_{m = 1}^M {g_m^2} + {\rm{\sigma }}_E^2, \cr} $$

where $$g_m^2 = b_m^2var\left( {G{V_m}} \right)$$ , that is, the additive genetic variance due to the $$m$$ ’th genetic variant ( $$G{V_m})$$ , $$\mathop \sum \nolimits_{m = 1}^M g_m^2$$ is the total additive genetic variance, and $${\rm{\sigma }}_E^2$$ is the residual ( $$e$$ ) variance. The additive genetic variance as estimated in the NTF design (or the classical twin design) is an estimate of $$\mathop \sum \nolimits_{m = 1}^M g_m^2$$ . This estimate of genetic variance may be biased if assumptions of the design are violated. Since the PRS is based on a subset ( $$T$$ ) of GVs, the total additive genetic variance ( $${\rm{\sigma }}_A^2$$ ) is the sum of the variance captured by the PRS based on these $$T$$ measured alleles ( $${\rm{\sigma }}_{PRS}^2$$ ), and the remaining latent additive genetic variance ( $${\rm{\sigma }}_{AL}^2$$ ):

$${\rm{\sigma }}_{PRS}^2 = \sum\nolimits_{t = 1}^T {g_{\rm{t}}^2} ,{\rm{\sigma }}_A^2 = \sum\nolimits_{m = 1}^M {g_{\rm{m}}^2} = {\rm{\sigma }}_{AL}^2 + \sum\nolimits_{t = 1}^T {g_{\rm{t}}^2} ,$$

and the proportion of explained additive genetic variance is $${R_A}^2 = {{{\rm{\sigma }}_{PRS}^2} \over {{\rm{\sigma }}_A^2}}$$ . Since the total additive genetic variance $${\rm{\sigma }}_A^2$$ is identified by the implied covariance between MZ and DZ twins and their parents, $${\rm{\sigma }}_{AL}^2$$ and $${\rm{\sigma }}_{PRS}^2$$ are $${\rm{\sigma }}_{AL}^2 = {\rm{\sigma }}_A^2 \times (1 - {R^2})$$ and $${\rm{\sigma }}_{PRS}^2 = {\rm{\sigma }}_A^2 \times {R^2}$$ , respectively. Note that this decomposition of the additive genetic variance into observed and latent components assumes that the observed genetic variance is not inflated by noise in the PRS.

The path diagrammatic representation of the classical NTF design, given random mating, is given in Figure 1. Using path tracing (Wright, Reference Wright1920), we can deduct the model-implied variances and covariances. In the NTF design, we then have

$${\rm{\sigma }}_{Ph}^2 = {a^2}{\rm{\sigma }}_A^2 + {s^2}{\rm{\sigma }}_S^2 + {e^2}{\rm{\sigma }}_E^2 + {f^2}{\rm{\sigma }}_F^2+2afw,$$

where the $$q$$ and $$x$$ in Figure 1 equal $${\rm{\sigma }}_A^2$$ and $${\rm{\sigma }}_F^2,$$ respectively. While $$w$$ is the covariance between the latent variables $$A$$ and $$F$$ , the term $$2afw$$ is the total contribution of the covariance between genotype $$A$$ and family environment $$F$$ to the phenotypic variance. Given the scaling in Figure 1 (based on Keller et al., Reference Keller, Medland, Duncan, Hatemi, Neale, Maes and Eaves2009), we have

Fig. 1. Path diagram of the classical nuclear twin family (NTF) design given random mating (in the parameterization and notation of Keller et al., Reference Keller, Medland, Duncan, Hatemi, Neale, Maes and Eaves2009). The circles denote latent variables; the squares are observed/measured phenotypic values. Single-headed arrows are paths; double-headed arrows indicate covariances. Solid lines are free parameters; dashed lines are fixed parameters. Dashed paths between parents–offspring A are fixed to .5. Note: A, additive genetic; F, family environment due to cultural transmission; S, sibling environment shared between twins; E, unshared environment; Ph, phenotype; m, cultural transmission; w, covariance between family environment and additive genetic variable. Constraints include $${\rm{\sigma }}_A^2 = 1,{\rm{\sigma }}_S^2 = 1,{\rm{\sigma }}_E^2 = 1,\;f = 1$$ .

$${\rm{\sigma }}_S^2 = 1$$ , $${\rm{\sigma }}_E^2 = 1$$ , $${\rm{\sigma }}_A^2 = 1$$ , $$f = 1$$ , so the equation can be rewritten as

$${\rm{\sigma }}_{Ph}^2 = {a^2} + {s^2} + {e^2} + {\rm{\sigma }}_F^2+2aw$$

We find the variance of the family environment $${\rm{\sigma }}_F^2$$ and the genotype–family environment covariance $$w$$ by $${\rm{\sigma }}_F^2 = 2\left( {{m^2}{{\rm{\sigma }}^2}} \right)$$ and $$w = m\left( {a{\rm{\sigma }}_A^2 + wf} \right) = {{a{\rm{\sigma }}_A^2m} \over {1 - fm}} = {{am} \over {1 - m}},$$ given $${\rm{\sigma }}_A^2 = 1$$ and $$f = 1$$ . Note that $$w \ne 0$$ and $${\rm{\sigma }}_F^2>0$$ only if there is cultural transmission, that is, $$m \ne 0$$ .

The NTF design, including PRSs, is depicted in Figure 2. Using path tracing and given the same scaling and constraints as in the classical NTF design, the complete model-implied covariance structure is

$$\mathop \sum \nolimits= \left[ {\matrix{ {{\rm{\sigma }}_{fa}^2} \quad\,\,\,\,\,{{{\rm{\sigma }}_{mofa}}} \quad{{{\rm{\sigma }}_{T1fa}}} \quad{{{\rm{\sigma }}_{T2fa}}} \cr {{{\rm{\sigma }}_{famo}}} \quad{{\rm{\sigma }}_{mo}^2} \quad{{{\rm{\sigma }}_{T1mo}}} \quad{{{\rm{\sigma }}_{T2mo}}} \cr {{{\rm{\sigma }}_{faT1}}} \quad\,{{{\rm{\sigma }}_{moT1}}} \quad{{\rm{\sigma }}_{T1}^2} \quad{{{\rm{\sigma }}_{T2T1}}} \cr {{{\rm{\sigma }}_{faT2}}} \quad\,{{{\rm{\sigma }}_{moT2}}} \quad{{{\rm{\sigma }}_{T1T2}}} \quad{{\rm{\sigma }}_{T2}^2} \cr } } \right],$$

where $$fa$$ , $$mo$$ , $$T1$$ and $$T2$$ stand for the phenotypic (co)variance of father, mother, twin 1 and twin 2, respectively. Variances are assumed to be equal, and (genotypic as well as cultural) transmission is assumed to be equal for both parents, such that

$$\scale 80% {{\mathop \sum \nolimits_{MZ}} = \left[ {\matrix{ {{\rm{\sigma }}_{Ph}^2} \,\,\,\,\,\,\,\,\,\,{{{\rm{\sigma }}_{P,P}}} \,\,\,\,{{{\rm{\sigma }}_{P,MZ}}} \,\,\,\,{{{\rm{\sigma }}_{P,MZ}}} \cr {{{\rm{\sigma }}_{P,P}}} \,\,\,\,\,\,\,\,\,\,{{\rm{\sigma }}_{Ph}^2} \,\,\,\,{{{\rm{\sigma }}_{P,MZ}}} \,\,\,\,{{{\rm{\sigma }}_{P,MZ}}} \cr {{{\rm{\sigma }}_{P,MZ}}} \,\,\,\,{{{\rm{\sigma }}_{P,MZ}}} \,\,\,\,{{\rm{\sigma }}_{Ph}^2} \,\,\,\,{{{\rm{\sigma }}_{MZ,MZ}}} \cr {{{\rm{\sigma }}_{P,MZ}}} \,\,\,\,{{{\rm{\sigma }}_{P,MZ}}} \,\,\,\,{{{\rm{\sigma }}_{MZ,MZ}}} \,\,\,\,{{\rm{\sigma }}_{Ph}^2} \cr } } \right],{{\mathop \sum \nolimits_{DZ}} = \left[ {\matrix{ {{\rm{\sigma }}_{Ph}^2} \,\,\,\,\,\,\,\,{{{\rm{\sigma }}_{P,P}}} \,\,\,\,{{{\rm{\sigma }}_{P,DZ}}} \,\,\,\,{{{\rm{\sigma }}_{P,DZ}}} \cr {{{\rm{\sigma }}_{P,P}}} \,\,\,\,\,\,\,{{\rm{\sigma }}_{Ph}^2} \,\,\,\,{{{\rm{\sigma }}_{P,DZ}}} \,\,\,\,{{{\rm{\sigma }}_{P,DZ}}} \cr {{{\rm{\sigma }}_{P,DZ}}} \,\,\,\,{{{\rm{\sigma }}_{P,DZ}}} \,\,\,\,{{\rm{\sigma }}_{Ph}^2} \,\,\,\,{{{\rm{\sigma }}_{DZ,DZ}}} \cr {{{\rm{\sigma }}_{P,DZ}}} \,\,\,\,{{{\rm{\sigma }}_{P,DZ}}} \,\,\,\,{{{\rm{\sigma }}_{DZ,DZ}}} \,\,\,\,{{\rm{\sigma }}_{Ph}^2} \cr } } \right]},$$

where the total phenotypic variance is $${\rm{\sigma }}_{Ph}^2 = a_1^2 + a_2^2 + x + 2{a_1}{w_1} + 2{a_2}{w_2} + {s^2} + {e^2}$$ , the covariance between parents is $${{\rm{\sigma }}_{P,P}} = x$$ , $$\;$$ the covariance between parent and (twin) offspring is $${{\rm{\sigma }}_{P,T}} = {1 \over 2}\left( {{a_1} + {w_1}} \right) + {1 \over 2}\left( {{a_2} + {w_2}} \right) + m{{\rm{\sigma }}^2},$$ the covariance between MZ twins is $${{\rm{\sigma }}_{MZ,MZ}} = a_1^2 + a_2^2 + x + 2{a_1}{w_1} + 2{a_2}{w_2} + {s^2}$$ and the covariance between DZ twins is $${{\rm{\sigma }}_{DZ,DZ}} = \cr {1 \over 2}a_1^2 + {1 \over 2}a_2^2 + x + 2{a_1}{w_1} + 2{a_2}{w_2} + {s^2}}$$ (equations and matrices adapted from Keller et al., Reference Keller, Medland, Duncan, Hatemi, Neale, Maes and Eaves2009).

Fig. 2. Path diagram of the nuclear twin family design including PRS, again assuming random mating in the latent model. The circles denote latent variables; the squares are observed/measured values. Single-headed arrows are paths; double-headed arrows indicate covariances. Solid lines are free parameters; dashed lines are fixed parameters. Dashed paths between parents–offspring A_L and PRS are fixed to .5. Note: A_L, latent additive genetic; PRS, observed (transmitted) additive genetic; F, family environment due to cultural transmission; S, sibling environment shared between twins; E, unshared environment; Ph, phenotype; m, cultural transmission path; w1 and w2, covariance between family environment and latent and observed additive genetic variables. Constraints include $${\rm{\sigma }}_{AL}^2 = 1,{\rm{\sigma }}_{PRS}^2 = 1,{\rm{\sigma }}_S^2 = 1,{\rm{\sigma }}_E^2 = 1,\;f = 1$$ .

The T–NT PRS Design

In the T–NT design, the phenotype is regressed on the transmitted and nontransmitted PRS, such that for individual $$i$$ , we have

$${{Ph}^2 = {b0} + {b1} \times {PRS_{T{i}} + {b2}\times {PRS_{NTi}}+{\epsilon_{i}},$$

where $$PR{S_{Ti}}$$ is the PRS that was transmitted from parents to individual $$i$$ , and $$PR{S_{NTi}}$$ is the nontransmitted PRS of individual $$i$$ , based on the alleles that were not transmitted to individual $$i$$ . While the transmitted PRS in the T–NT design is equal to the PRS in the NTF design, the relation between the nontransmitted PRS in the T–NT design and the cultural transmission effects in the NTF design is more complicated. In the NTF design, cultural transmission processes are captured by the family environment, the genotype–family environment covariance, and the cultural transmission from the parental phenotype to the family environment itself. In the T–NT design, however, cultural transmission is solely represented by the regression of the offspring phenotype on the nontransmitted parental PRS.

Power

Model identification of the NTF is well established, and the addition of PRS does not pose an identification problem. However, within an identified model, the power in tests concerning the parameters is an open question. We conducted power analysis in the NTF models using exact data simulation (van der Sluis et al., Reference van der Sluis, Dolan, Neale and Posthuma2008). Specifically, the power to detect cultural transmission was calculated as the power to reject a misspecified model in which $$\;m = 0$$ , when in truth, $$m$$ took values of $$m = .05$$ , $$m = .10$$ , $$m = .15$$ and $$\;m = .20$$ . Data were simulated for various parameter settings of $${a^2},\;{s^2}\;$$ and $${e^2}$$ (and $$m = .05-.20$$ ). Detailed model parameters and variance decomposition per scenario are given in Table 1. The PRSs were simulated such that the PRS explained 10% of the additive genetic variance (i.e., $${R_A}^2 = .10$$ ). Since the noncentrality parameter is linearly related to sample size, the NCP was weighted for the number of families ( $$N$$ ), and power was calculated for $$N = 100 - 10.000$$ (with an MZ/DZ family ratio of 1).

Table 1. Parameter settings and variance components for 12 data simulations

Note: $${\rm{\sigma }}_{Ph}^2$$ is the total phenotypic variance given specified parameter settings. $${\rm{\sigma }}_A^2,\;{\rm{\sigma }}_F^2,\;{\rm{\sigma }}_S^2$$ , $${\rm{\sigma }}_E^2$$ and $${{\rm{\sigma }}_{A,F}}$$ indicate the phenotypic variance that is explained by genotype, family environment, sibling environment, unshared environment and genotype–environment covariance, respectively. This decomposition is based on $${\rm{\sigma }}_{Ph}^2 = {a^2} + {s^2} + {e^2} + {\rm{\sigma }}_F^2 + 2aw$$ . Standardized variance components are in parentheses (proportions of phenotypic variance). Sim, simulation scenario; Par. set., Parameter settings. Parameter settings represent unstandardized input parameters, where $${e^2} = .50$$ and $${s^2} = .20$$ over all simulations.

Given these parameter settings, we compared the power to detect cultural transmission in the NTF design and T–NT design, given identical parameter settings and sample sizes, using ordinary (i.e., not exact) data simulation. To compare power of the T–NT design with that of the NTF design, we must ensure that the sample sizes are comparable. To do so, we determined the number of families for which, in the NTF design + PRS model, the power to reject $$m = 0$$ was .80, given specific parameter settings. We performed a general estimation equations (GEEs; e.g., Minică et al., Reference Minică, Boomsma, Vink and Dolan2014) analysis on the phenotype and (transmitted and nontransmitted) PRS data of both members of twin pairs. GEE automatically adjusts the standard errors and test statistic for the dependency. Since the twins are related, the effective $$N$$ is defined as $$ = {{2N} \over {1 + {\rm{\rho }}}}$$ , where ρ is the twin correlation. For instance, if we find that the NTF design + PRS requires 1000 twin families (i.e., 2000 individual twins) to reject $$m = 0$$ with a power of .80, we use $$N = 2000$$ for the GEE analysis. If then, for example, $${{\rm{\rho }}_{MZ}} = .6$$ and $${{\rm{\rho }}_{DZ}} = .4$$ , we effectively have $$NE = {{1000} \over {1.6}} + {{1000} \over {1.4}} \approx 1339.3$$ unrelated individuals. Therefore, the NCP from the GEE was corrected for the effective number of unrelated individuals.

For the NTF model, we used the NCP to calculate the power to detect cultural transmission. Given exact data simulation, the NCP equals the value of the log-likelihood ratio test statistic in the test of $$m = 0$$ . For the simulations involving the T–NT model, we used ordinary simulation (not exact), so that the NCP is the average test statistic of the regression coefficient of the nontransmitted PRS, minus the degrees of freedom, which was obtained by running 5000 replications. The test statistic in the GEE is a robust z statistic. Since asymptotically, $${z^2} = {{\rm{\chi }}^2}$$ , the squared test statistics follows a (central) $${{\rm{\chi }}^2}$$ distribution (1 df), under the null hypothesis. Power and required sample sizes are reported given $${\rm{\alpha }} = .05$$ . Analyses were conducted in R (version 3.5.1; R Development Core Team, 2018). Structural equation modeling was performed in OpenMx (Neale et al., Reference Neale, Hunter, Pritikin, Zahery, Brick, Kirkpatrick, Estabrook, Bates, Maes and Boker2016), using the NPSOL optimizer. For the GEE modeling, the R-package gee was used (Carey et al., Reference Carey, Lumley and Ripley2012). R-scripts for the simulations and model fitting are provided in the Supplementary Materials on the Cambridge Core website.

Results

The transmitted and nontransmitted PRS are uncorrelated, and the regression coefficients of the nontransmitted PRS increased with increasing values of $$\;m$$ , indicating that the simulated PRS indeed captured the cultural transmission effects. As expected, the regression coefficient of the transmitted PRS also increases with increasing $$m$$ , reflecting the presence of cultural transmission. Model information and power calculations for the three models are given in Table 2. As can be seen from Table 2, the power to reject $$m = 0$$ does not differ substantially between the classical NTF design and the NTF design with PRS, but the NTF designs have greater power to reject $$m = 0$$ than the T–NT design. Figure 3 displays the power to detect the cultural transmission effects of scenario 7 over a range of sample sizes (N = 100−10,000).

Table 2. Required sample sizes and power for NTF design, NTF design + PRS and T–NT design

Note: NTF, nuclear twin family; T–NT, transmitted–nontransmitted; Sim, simulation scenario; pow, power; −2LL, −2 log-likelihood; Z, z statistic of b; nfam, number of families; NE, effective number of unrelated individuals (rounded to the nearest integer); .80 is the effective number of unrelated individuals required for a power of .80 (rounded to the nearest integer).

Fig. 3. Power plot of the power to detect genotype–environment correlation due to cultural transmission, in three models. This represent scenario 7, where genotype–environment correlation due to cultural transmission explains 15% of the phenotypic variance ( $$2{{\rm{\sigma }}_{AL,F}} = .14$$ and $$2{{\rm{\sigma }}_{PRS,F}} = .02$$ ). In the nuclear twin family (NTF) design, N stands for number of families, while in the transmitted–nontransmitted (T–NT) design, N is the effective number of unrelated individuals. A power of .80 is achieved at N = 330 in the NTF + PRS design and at N = 4209 in the T–NT design.

Since the power of the T–NT design is expected to increase as the additive genetic variance explained by the PRS increases, we tested a model with $${a^2} = .8$$ , $${s^2} = .2$$ , $${e^2} = .5$$ and $$m = .15$$ , in which the PRS explained either 50% or 100% of the additive genetic variance. We recognize that such high $${R^2}$$ values are unrealistic; hence, this scenario was designed to inform us as to power contributions as genome-wide association study power increases. As can be seen from Table 3, when the $${R^2}$$ of the PRS increases, the power to detect cultural transmission also increases, in both the NTF design + PRS and the T–NT design.

Table 3. Required sample sizes and power given unstandardized input parameters $${a^2} = .8$$ , $${s^2} = .2$$ , $${e^2} = .5$$ , $$m = .15$$ and $${R^2} = \;.5$$ (sim. A) or $${R^2} = \;1$$ (sim B)

Note: NTF, nuclear twin family; T–NT, transmitted–nontransmitted; Sim, simulation scenario; pow, power; −2LL, −2 log-likelihood; Z, z statistic of b; nfam is the number of families, NE, effective number of unrelated individuals (rounded to the nearest integer), .80, effective number of unrelated individuals required for a power of .80 (rounded to the nearest integer). For simulation A, parameters are $${\rm{\sigma }}_{AL}^2 = .40\;\left( {.21} \right)$$ , $${\rm{\sigma }}_{PRS}^2 = .40\;\left( {.21} \right)$$ , $${\rm{\sigma }}_F^2 = .08\;\left( {.05} \right)$$ , $${\rm{\sigma }}_S^2 = .20\;\left( {.11} \right)$$ , $${\rm{\sigma }}_E^2 = .50\;\left( {.27} \right)$$ , $$2{{\rm{\sigma }}_{AL,F}} = .14\;\left( {.08} \right)$$ , 2 $${{\rm{\sigma }}_{PRS,F}} = .14\;\left( {.08} \right)$$ , $${\rm{\sigma }}_{Ph}^2 = 1.87$$ . For simulation B, parameters are $${\rm{\sigma }}_{AL}^2 = 0$$ , $${\rm{\sigma }}_{PRS}^2 = .80\;\left( {.43} \right)$$ , $${\rm{\sigma }}_F^2 = .08\;\left( {.05} \right)$$ , $${\rm{\sigma }}_S^2 = .20\;\left( {.11} \right)$$ , $${\rm{\sigma }}_E^2 = .50\;\left( {.27} \right)$$ , $$2{{\rm{\sigma }}_{AL,F}} = 0$$ , 2 $${{\rm{\sigma }}_{PRS,F}} = .28\;\left( {.15} \right)$$ , $${\rm{\sigma }}_{Ph}^2 = 1.87$$ .

Discussion

The present aim was to compare the power of the NTF and T–NT designs to detect cultural transmission. In addition, we tested the benefits of incorporating PRS in the NTF design. The classical NTF design is well powered to detect cultural transmission in sample sizes common in twin studies. Unless the PRS explains a large portion of the additive genetic variance, inclusion of PRS in the design did not result in an appreciable improvement in power. Compared to the T–NT design, the NTF design has greater power, given comparable sample sizes and parameter settings, at the cost, of course, of phenotyping the parents, issues of cultural change across phenotyping, and so forth. The T–NT design requires much larger samples to detect cultural transmission effects. The difference in power is due to the fact that, by definition, the T–NT design only captures part of the transmission effects due to the regression of the nontransmitted PRS being an imperfect representation of the total cultural transmission effect. First, cultural transmission effects are also captured in the transmitted PRS. Second, currently most PRSs only explain a (relatively) small proportion of the total additive genetic variance (e.g., Baselmans et al., Reference Baselmans, Yengo, van Rheenen and Wray2020). Therefore, the extent to which GE covariance can be captured by PRSs is proportional to the amount of genetic variance captured by the PRSs. For example, if the total passive GE covariance accounts for 27% of the phenotypic variance, a PRS explaining 10% of the additive genetic variance will capture only 3% of the phenotypic variance due to the passive GE covariance (Table 2, simulation 12).

The T–NT design is an ingenious addition to the designs suitable to detect passive GE covariance. At present, work is underway to specify T–NT design as a structural equation model (e.g., Balbona et al., Reference Balbona, Kim and Keller2020; Bates et al., Reference Bates, Maher, Medland, McAloney, Wright, Hansell and Gillespie2018; Kim et al., Reference Kim, Balbona and Keller2020), which will increase its flexibility and scope. For instance, Balbona et al. (Reference Balbona, Kim and Keller2020) and Kim et al. (Reference Kim, Balbona and Keller2020) proposed extensions of the T–NT design in which family environment was modeled as a latent variable. In addition, Kim et al. (Reference Kim, Balbona and Keller2020) proposed that existing bias in the model might be due to unmodeled latent additive genetic variance.

In conclusion, when the PRSs explain a relatively small part of the total additive genetic variance, the incorporation of PRS in the NTF design does not provide any benefits to the power to detect cultural transmission. The classical NTF design is itself relatively well powered to detect cultural transmission, and a decent sample of nuclear twin families is currently more informative with respect to cultural transmission than is measured genotypic information. However, it must be borne in mind that our results are based on phenotypes that are subject to random mating. This assumption may not hold for many phenotypes. While the NTF design can accommodate primary phenotypic assortative mating by the addition of a co-path between parents, the T–NT design does not require random mating, given that assortative mating only occurred in the parental (but not grandparental) generation. In addition, it is often difficult to collect phenotypes of parents, and even when we can, the possible importance of gene × age interaction and secular generational effects can cause complications for many important phenotypes. The T–NT design avoids these complications.

Supplementary material

To view supplementary material for this article, please visit https://doi.org/10.1017/thg.2020.76.

Acknowledgments

DIB acknowledges Royal Netherlands Academy of Science Professor Award (PAH/6635).

References

Balbona, J. V., Kim, Y., & Keller, M. C. (2020). Estimation of parental effects using polygenic scores. BioRvix. http://doi.org/10.1101/2020.08.11.247049.Google Scholar

Baselmans, B. M., Yengo, L., van Rheenen, W., & Wray, N. R. (2020). Risk in relatives, heritability, SNP-based heritability and genetic correlations in psychiatric disorders: A review. Biological Psychiatry. http://doi.org/10.1016/j.biopsych.2020.05.034 CrossRef Google Scholar PubMed

Bates, T. C., Maher, B. S., Medland, S. E., McAloney, K., Wright, M. J., Hansell, N. K. & Gillespie, N. A. (2018). The nature of nurture: Using a virtual-parent design to test parenting effects on children’s educational attainment in genotyped families. Twin Research and Human Genetics, 21, 73–83.CrossRef Google Scholar PubMed

Bornovalova, M., Cummings, J., Hunt, E., Blazei, R., Malone, S., & Iacono, W. (2014). Understanding the relative contributions of direct environmental effects and passive genotype–environment correlations in the association between familial risk factors and child disruptive behavior disorders. Psychological Medicine, 44, 831–844.CrossRef Google Scholar PubMed

Carey, G. (1986). Sibling imitation and contrast effects. Behavior Genetics, 16, 319–341.CrossRef Google Scholar PubMed

Carey, V. J., Lumley, T., & Ripley, B. D. (2012). gee: Generalized Estimation Equation Solver, R package version 4.13-18. http://CRAN.R-project.org/package=gee Google Scholar

Cavalli-Sforza, L. L., & Feldman, M. W. (1973). Cultural versus biological inheritance: phenotypic transmission from parents to children. (A theory of the effect of parental phenotypes on children’s phenotypes). American Journal of Human Genetics, 25, 618.Google Scholar

Cheesman, R., Hunjan, A., Coleman, J. R., Ahmadzadeh, Y., Plomin, R., McAdams, T. A., Eley, T. C., & Breen, G. (2020). Comparison of adopted and nonadopted individuals reveals gene–environment interplay for education in the UK Biobank. Psychological Science, 31, 582–591.CrossRef Google Scholar PubMed

Dolan, C. V, Huijskens, R. C. A., Minică, C. C., Neale, M. C., & Boomsma, D. I. (2020). Incorporating polygenic risk scores in the ACE twin model to estimate A-C covariance. Manuscript submitted for publication.Google Scholar

Dolan, C. V., de Kort, J. M., van Beijsterveldt, T. C., Bartels, M., & Boomsma, D. I. (2014). GE covariance through phenotype to environment transmission: An assessment in longitudinal twin data and application to childhood anxiety. Behavior Genetics, 44, 240–253.CrossRef Google Scholar PubMed

Eaves, L. (1976a). A model for sibling effects in man. Heredity, 36, 205–214.CrossRef Google Scholar

Eaves, L. (1976b). The effect of cultural transmission on continuous variation. Heredity, 37, 41–57.CrossRef Google Scholar PubMed

Eaves, L. J., Last, K., Martin, N. G., & Jinks, J. L. (1977). A progressive approach to non-additivity and genotype–environmental covariance in the analysis of human differences. British Journal of Mathematical and Statistical Psychology, 30, 1–42.CrossRef Google Scholar

Fulker, D. W. (1988). Genetic and cultural transmission in human behavior. In Weir, B. S., Eisen, E. J., Goodman, M. M., & Namkoong, G. (Eds.), Proceedings of the second international conference on quantitative genetics (pp. 318–340). Sinauer Associates.Google Scholar

Harold, G. T., Leve, L. D., Elam, K. K., Thapar, A., Neiderhiser, J. M., Natsuaki, M. N., Shaw, D. S. & Reiss, D. (2013). The nature of nurture: Disentangling passive genotype–environment correlation from family relationship influences on children’s externalizing problems. Journal of Family Psychology, 27, 12–21.CrossRef Google Scholar PubMed

Jaffee, S. R., & Price, T. S. (2007). Gene–environment correlations: A review of the evidence and implications for prevention of mental illness. Molecular Psychiatry, 12, 432–442.CrossRef Google Scholar PubMed

Keller, M. C., Medland, S. E., & Duncan, L. E. (2010). Are extended twin family designs worth the trouble? A comparison of the bias, precision, and accuracy of parameters estimated in four twin family models. Behavior Genetics, 40, 377–393.CrossRef Google Scholar PubMed

Keller, M. C., Medland, S. E., Duncan, L. E., Hatemi, P. K., Neale, M. C., Maes, H. H., & Eaves, L. J. (2009). Modeling extended twin family data I: Description of the Cascade model. Twin Research and Human Genetics, 12, 8–18.CrossRef Google Scholar PubMed

Kim, Y., Balbona, J.V., & Keller, M.C. (2020). Nature of nurture analysis via structural equation modeling. Paper presented at the meeting of Behavior Genetics Association.Google Scholar

Kong, A., Thorleifsson, G., Frigge, M. L., Vilhjalmsson, B. J., Young, A. I., Thorgeirsson, T. E., Benonisdottir, S., Oddsson, A., Halldorsson, B. V., Masson, G., Gudbjartsson, D. F., Helgason, A., Bjornsdottir, G., Thorsteinsdottir, U., Stefansson, K. (2018). The nature of nurture: Effects of parental genotypes. Science, 359, 424–428.CrossRef Google Scholar PubMed

Maes, H. H., Neale, M. C., Kendler, K. S., Martin, N. G., Heath, A. C., & Eaves, L. J. (2006). Genetic and cultural transmission of smoking initiation: An extended twin kinship model. Behavior Genetics, 36, 795–808.CrossRef Google Scholar PubMed

Minică, C. C., Boomsma, D. I., Vink, J. M., & Dolan, C. V. (2014). MZ twin pairs or MZ singletons in population family-based GWAS? More power in pairs. Molecular Psychiatry, 19, 1154–1155.CrossRef Google Scholar PubMed

Neale, M. C., Hunter, M. D., Pritikin, J. N., Zahery, M., Brick, T. R., Kirkpatrick, R. M., Estabrook, R., Bates, T., Maes, H., & Boker, S. M. (2016). OpenMx 2.0: Extended structural equation and statistical modeling. Psychometrika, 81, 535–549.CrossRef Google Scholar PubMed

Plomin, R., DeFries, J. C., & Loehlin, J. C. (1977). Genotype–environment interaction and correlation in the analysis of human behavior. Psychological Bulletin, 84, 309–322.CrossRef Google Scholar PubMed

Purcell, S. (2002). Variance components models for gene–environment interaction in twin analysis. Twin Research and Human Genetics, 5, 554–571.CrossRef Google Scholar PubMed

R Development Core Team. (2018). R: A language and environment for statistical computing. R Foundation for Statistical Computing. https://www.r-project.org Google Scholar

Rutter, M., & Silberg, J. (2002). Gene–environment interplay in relation to emotional and behavioral disturbance. Annual Review of Psychology, 53, 463–490.CrossRef Google Scholar PubMed

Scarr, S., & McCartney, K. (1983). How people make their own environments: A theory of genotype→ environment effects. Child Development, 424–435.Google Scholar PubMed

van der Sluis, S., Dolan, C. V., Neale, M. C., & Posthuma, D. (2008). Power calculations using exact data simulation: A useful tool for genetic study designs. Behavior Genetics, 38, 202–211.CrossRef Google Scholar PubMed

Wright, S. (1920). The relative importance of heredity and environment in determining the piebald pattern of guinea-pigs. Proceedings of the National Academy of Sciences of the United States of America, 6, 320–332.CrossRef Google Scholar PubMed

Zavala, C., Beam, C. R., Finch, B. K., Gatz, M., Johnson, W., Kremen, W. S., Neiderhiser, J. M., Pedersen, N. L., & Reynolds, C. A. (2018). Attained SES as a moderator of adult cognitive performance: Testing gene–environment interaction in various cognitive domains. Developmental Psychology, 54, 2356–2370.CrossRef Google Scholar PubMed

Fig. 1. Path diagram of the classical nuclear twin family (NTF) design given random mating (in the parameterization and notation of Keller et al., 2009). The circles denote latent variables; the squares are observed/measured phenotypic values. Single-headed arrows are paths; double-headed arrows indicate covariances. Solid lines are free parameters; dashed lines are fixed parameters. Dashed paths between parents–offspring A are fixed to .5. Note: A, additive genetic; F, family environment due to cultural transmission; S, sibling environment shared between twins; E, unshared environment; Ph, phenotype; m, cultural transmission; w, covariance between family environment and additive genetic variable. Constraints include $${\rm{\sigma }}_A^2 = 1,{\rm{\sigma }}_S^2 = 1,{\rm{\sigma }}_E^2 = 1,\;f = 1$$.

Fig. 2. Path diagram of the nuclear twin family design including PRS, again assuming random mating in the latent model. The circles denote latent variables; the squares are observed/measured values. Single-headed arrows are paths; double-headed arrows indicate covariances. Solid lines are free parameters; dashed lines are fixed parameters. Dashed paths between parents–offspring AL and PRS are fixed to .5. Note: AL, latent additive genetic; PRS, observed (transmitted) additive genetic; F, family environment due to cultural transmission; S, sibling environment shared between twins; E, unshared environment; Ph, phenotype; m, cultural transmission path; w1 and w2, covariance between family environment and latent and observed additive genetic variables. Constraints include $${\rm{\sigma }}_{AL}^2 = 1,{\rm{\sigma }}_{PRS}^2 = 1,{\rm{\sigma }}_S^2 = 1,{\rm{\sigma }}_E^2 = 1,\;f = 1$$.

Table 1. Parameter settings and variance components for 12 data simulations

Table 2. Required sample sizes and power for NTF design, NTF design + PRS and T–NT design

Fig. 3. Power plot of the power to detect genotype–environment correlation due to cultural transmission, in three models. This represent scenario 7, where genotype–environment correlation due to cultural transmission explains 15% of the phenotypic variance ($$2{{\rm{\sigma }}_{AL,F}} = .14$$ and $$2{{\rm{\sigma }}_{PRS,F}} = .02$$). In the nuclear twin family (NTF) design, N stands for number of families, while in the transmitted–nontransmitted (T–NT) design, N is the effective number of unrelated individuals. A power of .80 is achieved at N = 330 in the NTF + PRS design and at N = 4209 in the T–NT design.

Table 3. Required sample sizes and power given unstandardized input parameters $${a^2} = .8$$, $${s^2} = .2$$, $${e^2} = .5$$, $$m = .15$$and$${R^2} = \;.5$$(sim. A) or$${R^2} = \;1$$(sim B)

Bruins et al. supplementary material

Bruins et al. supplementary material 1

File 466.6 KB

Bruins et al. supplementary material

Bruins et al. supplementary material 2

PDF 509.8 KB

Article contents

The Power to Detect Cultural Transmission in the Nuclear Twin Family Design With and Without Polygenic Risk Scores and in the Transmitted–Nontransmitted (Alleles) Design

Abstract

Keywords

Information

The NTF Design

The T–NT PRS Design

Power

Results

Discussion

Supplementary material

Acknowledgments

References

Bruins et al. supplementary material

Bruins et al. supplementary material

Save article to Kindle

Save article to Dropbox

Save article to Google Drive

Reply to: Submit a response

Your details

You have entered the maximum number of contributors

Conflicting interests