Hostname: page-component-89b8bd64d-5bvrz Total loading time: 0 Render date: 2026-05-07T23:57:41.215Z Has data issue: false hasContentIssue false

Longitudinal Reading Measures and Genome Imputation in the National Child Development Study: Prospects for Future Reading Research

Published online by Cambridge University Press:  10 March 2023

Elinor C. Bridges
Affiliation:
School of Philosophy, Psychology and Language Sciences, University of Edinburgh, Edinburgh, United Kingdom
N. William Rayner
Affiliation:
Institute of Translational Genomics, Helmholtz Zentrum München — German Research Center for Environmental Health, Neuherberg, Germany Oxford Centre for Diabetes, Endocrinology and Metabolism, Radcliffe Department of Medicine, University of Oxford, Oxford, United Kingdom Wellcome Centre for Human Genetics, Nuffield Department of Medicine, University of Oxford, Oxford, United Kingdom
Hayley S. Mountford
Affiliation:
School of Philosophy, Psychology and Language Sciences, University of Edinburgh, Edinburgh, United Kingdom
Timothy C. Bates
Affiliation:
School of Philosophy, Psychology and Language Sciences, University of Edinburgh, Edinburgh, United Kingdom
Michelle Luciano*
Affiliation:
School of Philosophy, Psychology and Language Sciences, University of Edinburgh, Edinburgh, United Kingdom
*
Author for correspondence: Michelle Luciano, E-mail: michelle.luciano@ed.ac.uk

Abstract

Reading difficulties are prevalent worldwide, including in economically developed countries, and are associated with low academic achievement and unemployment. Longitudinal studies have identified several early childhood predictors of reading ability, but studies frequently lack genotype data that would enable testing of predictors with heritable influences. The National Child Development Study (NCDS) is a UK birth cohort study containing direct reading skill variables at every data collection wave from age 7 years through to adulthood with a subsample (final n = 6431) for whom modern genotype data are available. It is one of the longest running UK cohort studies for which genotyped data are currently available and is a rich dataset with excellent potential for future phenotypic and gene-by-environment interaction studies in reading. Here, we carry out imputation of the genotype data to the Haplotype Reference Panel, an updated reference panel that offers greater imputation quality. Guiding phenotype choice, we report a principal components analysis of nine reading variables, yielding a composite measure of reading ability in the genotyped sample. We include recommendations for use of composite scores and the most reliable variables for use during childhood when conducting longitudinal, genetically sensitive analyses of reading ability.

Information

Type
Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2023. Published by Cambridge University Press on behalf of International Society for Twin Studies
Figure 0

Table 1. Breakdown of number of participants sampled on each array in the NCDS after removal of exclusions and duplications, and number of SNPs sequenced in each dataset

Figure 1

Table 2. Number and overall percentage of SNPs that were removed from each array due to failure to pass quality control steps

Figure 2

Table 3. Number and overall percentage of individuals removed from each array due to failure to pass quality control steps

Figure 3

Table 4. Percentage and number of SNPs with genomewide imputation R2 scores greater than 0.8 for each array

Figure 4

Table 5. Descriptive statistics of continuous reading variables in the full sample and matched genotyped subsample

Figure 5

Fig. 1. Correlation Heatmap Between All Reading Variables in the Full Sample (A) and Subsample (B).

Figure 6

Table 6. Principal component loadings and communality from a principal components analysis

Figure 7

Fig. 2. Eigenvalues generated from A parallel analysis of principal components in the full NCDS sample suggesting that three components should be retained.

Figure 8

Table 7. Interfactor correlations of the three principal components generated by principal components analysis of the full NCDS dataset

Figure 9

Table 8. Cronbach’s alpha to show reliability of scale of overall and age-specific reading composites in the full sample and subsample

Figure 10

Fig. 3. Correlation heat map of age-specific reading composites and reading measures in adulthood in the full sample (A) and subsample (B). Note: Age 16 composite does not include Can Cope.

Supplementary material: File

Bridges et al. supplementary material

Bridges et al. supplementary material

Download Bridges et al. supplementary material(File)
File 82 KB