Hostname: page-component-77f85d65b8-9nbrm Total loading time: 0 Render date: 2026-03-29T02:36:01.901Z Has data issue: false hasContentIssue false

Variance Components Models for Analysis of Big Family Data of Health Outcomes in the Lifelines Cohort Study

Published online by Cambridge University Press:  04 April 2019

Nino Demetrashvili
Affiliation:
Department of Epidemiology, University of Groningen, University Medical Center Groningen, Groningen, The Netherlands Department of Medical Statistics, National Center for Disease Control and Public Health, Tbilisi, Georgia
Nynke Smidt
Affiliation:
Department of Epidemiology, University of Groningen, University Medical Center Groningen, Groningen, The Netherlands
Harold Snieder
Affiliation:
Department of Epidemiology, University of Groningen, University Medical Center Groningen, Groningen, The Netherlands
Edwin R. van den Heuvel
Affiliation:
Department of Epidemiology, University of Groningen, University Medical Center Groningen, Groningen, The Netherlands Department of Mathematics and Computer Science, Eindhoven University of Technology, Eindhoven, The Netherlands
Ernst C. Wit*
Affiliation:
Bernoulli Institute, University of Groningen, Groningen, The Netherlands Institute of Computational Science, Università della Svizzera italiana, Lugano, Switzerland
*
Author for correspondence: Ernst C. Wit, Email: e.c.wit@rug.nl

Abstract

Large multigenerational cohort studies offer powerful ways to study the hereditary effects on various health outcomes. However, accounting for complex kinship relations in big data structures can be methodologically challenging. The traditional kinship model is computationally infeasible when considering thousands of individuals. In this article, we propose a computationally efficient alternative that employs fractional relatedness of family members through a series of founding members. The primary goal of this study is to investigate whether the effect of determinants on health outcome variables differs with and without accounting for family structure. We compare a fixed-effects model without familial effects with several variance components models that account for heritability and shared environment structure. Our secondary goal is to apply the fractional relatedness model in a realistic setting. Lifelines is a three-generation cohort study investigating the biological, behavioral, and environmental determinants of healthy aging. We analyzed a sample of 89,353 participants from 32,452 reconstructed families. Our primary conclusion is that the effect of determinants on health outcome variables does not differ with and without accounting for family structure. However, accounting for family structure through fractional relatedness allows for estimating heritability in a computationally efficient way, showing some interesting differences between physical and mental quality of life heritability. We have shown through simulations that the proposed fractional relatedness model performs better than the standard kinship model, not only in terms of computational time and convenience of fitting using standard functions in R, but also in terms of bias of heritability estimates and coverage.

Information

Type
Articles
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution, and reproduction in any medium, provided the original work is properly cited.
Copyright
© The Author(s) 2019
Figure 0

Fig. 1. Family example consisting of 13 members.

Figure 1

Table 1. Algorithm for Construction of a Fractional Relatedness Matrix

Figure 2

Table 2. Counts of Family Sizes for the Original Set of 91,759 Participants and the Remaining 89,353 Participants after Removing Incomplete Records

Figure 3

Table 3. Estimates of variance components, ICC and its confidence interval for outcomes MCS and PCS

Figure 4

Fig. 2. BMI effects for MCS surrounded by confidence intervals.

Figure 5

Fig. 3. BMI effects for PCS surrounded by confidence intervals.

Figure 6

Table 4. Conditional F Tests for BMI, Age, and Sex of Selected Models

Figure 7

Table 5. Estimates of Coefficients for BMI, Age and Sex of Selected Models

Figure 8

Table 6. Comparison of Heritability Parameters and Computational Time between Kinship (lmekin Function) and M3 (lme Function) Models

Figure 9

Fig. 4. Overlapping histograms for comparison of estimated heritabilities across M3 and kinship models in setting 3: vertical bar (0.13) shows true heritability, light grey histogram (left) and smoothed plain line show distribution of estimated heritabilities in M3 model and dark grey histogram (right) and smoothed dashed line show distribution of estimated heritabilities in kinship model.