Hostname: page-component-89b8bd64d-7zcd7 Total loading time: 0 Render date: 2026-05-13T12:12:04.921Z Has data issue: false hasContentIssue false

Operating Characteristics of Statistical Methods for Detecting Gene-by-Measured Environment Interaction in the Presence of Gene-Environment Correlation under Violations of Distributional Assumptions

Published online by Cambridge University Press:  13 January 2015

Carol A. Van Hulle*
Affiliation:
Waisman Center, University of Wisconsin-Madison, Madison, WI, USA
Paul J. Rathouz
Affiliation:
Department of Biostatistics and Medical Informatics, University of Wisconsin School of Medicine and Public Health, Madison, WI, USA
*
address for correspondence: Carol Van Hulle, University of Wisconsin-Madison, 1500 Highland Ave, Madison, WI 53705, USA. E-mail: vanhulle@waisman.wisc.edu

Abstract

Accurately identifying interactions between genetic vulnerabilities and environmental factors is of critical importance for genetic research on health and behavior. In the previous work of Van Hulle et al. (Behavior Genetics, Vol. 43, 2013, pp. 71–84), we explored the operating characteristics for a set of biometric (e.g., twin) models of Rathouz et al. (Behavior Genetics, Vol. 38, 2008, pp. 301–315), for testing gene-by-measured environment interaction (GxM) in the presence of gene-by-measured environment correlation (rGM) where data followed the assumed distributional structure. Here we explore the effects that violating distributional assumptions have on the operating characteristics of these same models even when structural model assumptions are correct. We simulated N = 2,000 replicates of n = 1,000 twin pairs under a number of conditions. Non-normality was imposed on either the putative moderator or on the ultimate outcome by ordinalizing or censoring the data. We examined the empirical Type I error rates and compared Bayesian information criterion (BIC) values. In general, non-normality in the putative moderator had little impact on the Type I error rates or BIC comparisons. In contrast, non-normality in the outcome was often mistaken for or masked GxM, especially when the outcome data were censored.

Information

Type
Articles
Copyright
Copyright © The Author(s) 2015 
Figure 0

TABLE 1 Bivariate Variance Components Models for Latent Component-by-Measured Environment (GxM) Interaction

Figure 1

TABLE 2 Models and Data Generation Mechanism (DGM) Parameter Values used in Data Simulation

Figure 2

FIGURE 1 Example of distribution of simulated data after ordinalizing (left panel) or censoring (right panel). Data were ordinalized by grouping the top 2%, the bottom 30%, and evenly dividing the remaining scores. Data were censored by replacing scores in the bottom 30% with the value of the 30th percentile.Note: For left panel M = 1.8, SD = 1.6, Skew = 0.2, Kurtosis = -1.3; for right panel M = 0.4, SD = 1.2, Skew = 1.6, Kurtosis = 3.2.

Figure 3

TABLE 3 Ordinalized Data Simulation: Percent of Simulated LRT Statistics under the Null Hypothesis Exceeding Critical Value for Pairs of Nested Models Based on 2,000 Replicates of n = 1,000

Figure 4

TABLE 4 Censored Data Simulation: Percent of LRT Statistics under the Null Hypothesis Exceeding Critical Value for Pairs of Nested Models Based on 2,000 Replicates of n = 1,000

Figure 5

TABLE 5 Ordinalized Data Simulation: Percent Each Model is Favored Via Comparison of BIC valuesa for All Pairwise Comparisons, and Percent Each Model has Lowest BIC: n = 1,000

Figure 6

TABLE 6 Censored Data Simulation: Percent Each Model is Favored via Comparison of BIC Valuesa for All Pairwise Comparisons, and Percent Each Model has Lowest BIC: n = 1,000

Supplementary material: File

Van Hulle and Rathouz supplementary material

Table S1 and Figure S1

Download Van Hulle and Rathouz supplementary material(File)
File 24.3 KB