Hostname: page-component-89b8bd64d-dvtzq Total loading time: 0 Render date: 2026-05-09T19:30:04.597Z Has data issue: false hasContentIssue false

Including copy number variation in association studies to predict genotypic values

Published online by Cambridge University Press:  01 June 2010

M. P. L. CALUS*
Affiliation:
Animal Breeding and Genomics Centre, Wageningen UR Livestock Research, 8200 AB, Lelystad, The Netherlands
D. J. DE KONING
Affiliation:
Division of Genetics and Genomics, Roslin Institute and Royal (Dick) School of Veterinary Studies, University of Edinburgh, Roslin EH25 9PS, UK
C. S. HALEY
Affiliation:
Division of Genetics and Genomics, Roslin Institute and Royal (Dick) School of Veterinary Studies, University of Edinburgh, Roslin EH25 9PS, UK MRC Human Genetics Unit, Western General Hospital, Crewe Road, Edinburgh EH4 2XU, UK
*
*Corresponding author: Animal Breeding and Genomics Centre, Wageningen UR Livestock Research, P. O. Box 65, 8200 AB, Lelystad, The Netherlands. Tel: 31 320 238265. Fax: 31 320 293591. e-mail: mario.calus@wur.nl
Rights & Permissions [Opens in a new window]

Summary

The objective of this study was to investigate, both empirically and deterministically, the ability to explain genetic variation resulting from a copy number polymorphism (CNP) by including the CNP, either by its genotype or by a continuous derivation thereof, alone or together with a nearby single nucleotide polymorphism (SNP) in the model. This continuous measure of a CNP genotype could be a raw hybridization measurement, or a predicted CNP genotype. Results from simulations showed that the linkage disequilibrium (LD) between an SNP and CNP was lower than LD between two SNPs, due to the higher mutation rate at the CNP loci. The model R2 values from analysing the simulated data were very similar to the R2 values predicted with the deterministic formulae. Under the assumption that x copies at a CNP locus lead to the effect of x times the effect of 1 copy, including a continuous measure of a CNP locus in the model together with the genotype of a nearby SNP increased power to explain variation at the CNP locus, even when the continuous measure explained only 15% of the variation at the CNP locus.

Information

Type
Research Papers
Copyright
Copyright © Cambridge University Press 2010
Figure 0

Fig. 1. The simulated marker map, with one CNP locus with only 2 alleles (CNP2), one CNP locus with two or more alleles (CNPm), and SNP loci at respectively 0·0, 0·1, 0·5, 1·0 and 2·0 cM distance with 10 SNPs at each locus.

Figure 1

Table 1. Distribution of number of segregating alleles at simulated CNPm loci

Figure 2

Fig. 2. Average frequencies of alleles across all CNP2 loci and CNPm loci with 2 alleles in generation 4000.

Figure 3

Table 2. Average minor allele frequencies (MAFs) across segregating loci, in ascending order

Figure 4

Table 3. Estimated r2 values between an SNP and an SNP, CNP2 or CNPm loci, and the predicted r2 between two bi-allelic loci according to Sved (1971), located at different distances

Figure 5

Fig. 3. Deterministic R2 values obtained for models including CNP phenotypes and SNP genotypes assuming different r2 values between CNP and SNP loci, as a function of h2 of the CNP phenotypes.

Figure 6

Table 4. Realized and predicted model R2 values for different models averaged across 1000 replicates

Figure 7

Table 5. MSEP for different models, averaged across 1000 replicates

Figure 8

Fig. 4. Simulated CNP phenotypes, assuming a heritability of 0·25, plotted against the CNP genotypes for 500 individuals with a CNP locus with allele frequencies for 0, 1, 2, 3 and 4 copies as in Table 1 for CNPm(3) loci.