Hostname: page-component-6766d58669-h8lrw Total loading time: 0 Render date: 2026-05-17T18:12:43.392Z Has data issue: false hasContentIssue false

Sibling Comparison Designs: Addressing Confounding Bias with Inclusion of Measured Confounders

Published online by Cambridge University Press:  27 September 2019

Gretchen R. B. Saunders*
Affiliation:
Department of Psychology, University of Minnesota, Minneapolis, MN, USA
Matt McGue
Affiliation:
Department of Psychology, University of Minnesota, Minneapolis, MN, USA
Stephen M. Malone
Affiliation:
Department of Psychology, University of Minnesota, Minneapolis, MN, USA
*
Author for correspondence: Gretchen R. B. Saunders, Email: saund247@umn.edu

Abstract

Genetically informative research designs are becoming increasingly popular as a way to strengthen causal inference with their ability to control for genetic and shared environmental confounding. Co-twin control (CTC) models, a special case of these designs using twin samples, decompose the overall effect of exposure on outcome into a within- and between-twin-pair term. Ideally, the within-twin-pair term would serve as an estimate of the exposure effect controlling for genetic and shared environmental factors, but it is often confounded by factors not shared within a twin-pair. Previous simulation work has shown that if twins are less similar on an unmeasured confounder than they are on an exposure, the within-twin-pair estimate will be a biased estimate of the exposure effect, even more biased than the individual, unpaired estimate. The current study uses simulation and analytical derivations to show that while incorporating a covariate related to the nonshared confounder in CTC models always reduces bias in the within-pair estimate, it will be less biased than the individual estimate only in a narrow set of circumstances. The best case for bias reduction in the within-pair estimate occurs when the within-twin-pair correlation in exposure is less than the correlation in the confounder and the twin-pair correlation in the covariate is high. Additionally, the form of covariate inclusion is compared between adjustment for only one’s own covariate value and adjustment for the deviation of one’s own value from the covariate twin-pair mean. Results show that adjusting for the deviation from the twin-pair mean results in equal or reduced bias.

Figure 0

Fig. 1. Causal diagram shown for one twin-pair (subscripts of 1 and 2 represent each twin). Variables X, Y and C represent the exposure, outcome and unmeasured confounder, respectively. Z represents the measured covariate. ${\beta _{YX}}$ is the true causal effect of exposure on outcome. ${\rm{\;}}{\beta _{ZC}}$ is the effect of the confounder on the covariate. Double-headed arrows represent familial factors that cause aggregation of phenotypes within families.

Figure 1

Fig. 2. Results recreated from Frisell et al. (2012). Blue lines denote the exposure estimate from individual-level models, while red lines denote the exposure estimate from CTC models. The true causal effect is 0 (${\beta _{YX}} = 0$). The within-twin-pair correlations in the exposure and the confounder are ${\rho _X}$ and ${\rho _C}$, ${\rm{\;}}$ respectively. For each scenario ${\rho _C}$ = 0.5, while ${\rho _X}$ varies between 0.3, 0.5 and 0.7. The bias in the individual-level effect and the within-twin-pair effect does not depend on ${\beta _{ZC}}$, the effect of the confounder on the covariate, because the covariate is not included in these models.

Figure 2

Fig. 3. Exposure effect estimates with the inclusion of a covariate from individual-level and within-pair models when (A) the within-pair correlation in the exposure is less than the within-pair correlation in the confounder; (B) the within-pair correlation in the exposure equals the within-pair correlation in the confounder; (C) the within-pair correlation in the exposure is more than the within-pair correlation in the confounder. For each scenario ${\rho _C}{\rm{\;}}$ = 0.5, while ${\rho _X}$ varies between 0.3, 0.5 and 0.7 (consistent with Figure 2). Additionally, each column represents a different value of ${\rho _Z}$, the within-pair correlation in the covariate. ${\rm{\;}}{\beta _{ZC}}$ is the effect of the confounder on the covariate. Blue lines denote the exposure estimate from individual-level models, red lines denote the exposure estimate from CTC models as specified in equation 4 and green lines denote the exposure estimate from CTC models as specified in equation 3. Solid lines denote the exposure effect estimate with covariate inclusion, while dashed lines denote the same estimate without covariate inclusion. The true causal exposure effect is 0 (${\beta _{YX}} = 0$).

Supplementary material: File

Saunders et al. supplementary material

Saunders et al. supplementary material 1

Download Saunders et al. supplementary material(File)
File 468.9 KB
Supplementary material: PDF

Saunders et al. supplementary material

Saunders et al. supplementary material 2

Download Saunders et al. supplementary material(PDF)
PDF 446.3 KB