Mating strategies with genomic information reduce rates of inbreeding in animal breeding schemes without compromising genetic gain

H. Liu; M. Henryon; A. C. Sørensen

doi:10.1017/S1751731116001786

Mating strategies with genomic information reduce rates of inbreeding in animal breeding schemes without compromising genetic gain

Published online by Cambridge University Press: 17 August 2016

H. Liu ,

M. Henryon and

A. C. Sørensen

Show author details

H. Liu*: Affiliation:
Department of Molecular Biology and Genetics, Center for Quantitative Genetics and Genomics, Aarhus University, PO Box 50, 8830 Tjele, Denmark
M. Henryon: Affiliation:
Seges, Danish Pig Research Centre, Axeltorv 3, 1609 Copenhagen V, Denmark School of Animal Biology, University of Western Australia, 35 Stirling Highway, Crawley, WA 6009, Australia
A. C. Sørensen: Affiliation:
Department of Molecular Biology and Genetics, Center for Quantitative Genetics and Genomics, Aarhus University, PO Box 50, 8830 Tjele, Denmark
*: † E-mail: Huiming.liu@mbg.au.dk

Article contents

Abstract
Implications
Introduction
Material and methods
Results
Discussion
Supplementary Material
References

Abstract

We tested the hypothesis that mating strategies with genomic information realise lower rates of inbreeding (∆F) than with pedigree information without compromising rates of genetic gain (∆G). We used stochastic simulation to compare ∆F and ∆G realised by two mating strategies with pedigree and genomic information in five breeding schemes. The two mating strategies were minimum-coancestry mating (MC) and minimising the covariance between ancestral genetic contributions (MCAC). We also simulated random mating (RAND) as a reference point. Generations were discrete. Animals were truncation-selected for a single trait that was controlled by 2000 quantitative trait loci, and the trait was observed for all selection candidates before selection. The criterion for selection was genomic-breeding values predicted by a ridge-regression model. Our results showed that MC and MCAC with genomic information realised 6% to 22% less ∆F than MC and MCAC with pedigree information without compromising ∆G across breeding schemes. MC and MCAC realised similar ∆F and ∆G. In turn, MC and MCAC with genomic information realised 28% to 44% less ∆F and up to 14% higher ∆G than RAND. These results indicated that MC and MCAC with genomic information are more effective than with pedigree information in controlling rates of inbreeding. This implies that genomic information should be applied to more than just prediction of breeding values in breeding schemes with truncation selection.

Keywords

mating strategies genomic selection genetic gain inbreeding genetic contributions

Information

Type: Research Article
Information: animal , Volume 11 , Issue 4 , April 2017 , pp. 547 - 555

DOI: https://doi.org/10.1017/S1751731116001786 [Opens in a new window]
Creative Commons: This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution, and reproduction in any medium, provided the original work is properly cited
Copyright: © The Animal Consortium 2016

Implications

We showed that mating strategies (minimum-MC and mating by minimising covariance between ancestral contributions) with genomic information realise lower rates of inbreeding than with pedigree information without compromising rates of genetic gain when performing truncation selection on predicted genomic-breeding values. Application of these mating strategies in animal breeding schemes is feasible in practice. They simply use the same information as for genomic prediction to pair the parents more appropriately without any extra cost and logistic constraints, which is an improvement on pedigree information. This implies that in animal breeding schemes with truncation selection, genomic data should be applied to more than just prediction of breeding values.

Introduction

Choosing appropriate mating strategies in animal breeding reduces rates of inbreeding without compromising rates of genetic gain (Caballero et al., Reference Caballero, Santiago and Toro1996). They do so by distributing genetic contributions of ancestors more evenly across mating pairs, which improves the genetic structures of breeding populations (Sonesson and Meuwissen, Reference Sonesson and Meuwissen2000). Two mating strategies that were developed for breeding schemes using phenotypic and pedigree data without genomic information are minimum-coancestry mating (MC) and mating by minimising the covariance between ancestral contributions (MCAC) (Wright, Reference Wright1921; Henryon et al., Reference Henryon, Sorensen and Berg2009). These strategies are generally recommended to realise lower rates of inbreeding in breeding schemes without genomic information (Caballero et al., Reference Caballero, Santiago and Toro1996; Meuwissen, Reference Meuwissen and Luo2007; Henryon et al., Reference Henryon, Sorensen and Berg2009; Nirea et al., Reference Nirea, Sonesson, Woolliams and Meuwissen2012). The lower rates of inbreeding realised by MC and MCAC can be explained using the theory of long-term genetic contributions (Woolliams and Thompson, Reference Woolliams, Bijma and Willanueva1994; Grundy et al., Reference Grundy, Villanueva and Woolliams1998). The theory proposes that the minimum rate of inbreeding, given a pre-defined rate of genetic gain, is realised when the long-term genetic contributions of the ancestors stabilise to an exact threshold-linear relationship with their Mendelian-sampling term: genetic contributions of ancestors are zero below a threshold Mendelian-sampling term and increase linearly with the value of Mendelian-sampling terms above the threshold (Lindgren and Matheson, Reference Lindgren and Matheson1986; Grundy et al., Reference Grundy, Villanueva and Woolliams1998; Woolliams et al., Reference Woolliams and Thompson2002, Woolliams, Reference Woolliams2006 and Reference Woolliams2007). MC and MCAC realise low rates of inbreeding by increasing the independence and reducing confounding between genetic contributions of ancestors. They disperse the contributions within breeding populations and increase the number of ancestors that contribute to each descendent (Woolliams et al., Reference Woolliams and Thompson2002; Sørensen et al., Reference Sun, VanRaden, O’Connell, Weigel and Gianola2005; Henryon et al., Reference Henryon, Berg and Sorensen2014). This enables selection to align the ancestors closer to the exact threshold-linear relationship and reduce rates of inbreeding (Woolliams and Thompson, Reference Woolliams, Bijma and Willanueva1994; Woolliams et al., Reference Woolliams and Thompson2002). The challenge in breeding schemes without genomic information is that there is a limit to the level of independence that can be achieved by MC and MCAC with pedigree information. Therefore, if we are to further reduce rates of inbreeding with MC and MCAC, we need information that enables us to make ancestral genetic contributions more independent so that genetic contributions can be more dispersed across the population.

Genomic information may enable MC and MCAC to disperse genetic contributions within breeding populations more effectively than pedigree information. One way to achieve this is by replacing the pedigree-relationship matrices in MC and MCAC with genomic-relationship matrices (Sun et al., Reference Speed and Balding2013; Henryon et al., Reference Henryon, Berg and Sorensen2014). Genomic-relationship matrices provide more accurate estimates of relationships between individuals by tracing Mendelian segregation of chromosome segments (Hayes et al., Reference Hayes, Visscher and Goddard2009). This should further increase the independence and dispersion of genetic contributions within breeding populations, enabling selection to align the ancestors even closer to the exact threshold-linear relationship and bringing about further reductions in rates of inbreeding. Based on this background information, we reasoned that MC and MCAC with genomic information realises lower rates of inbreeding than MC and MCAC with pedigree information without compromising rates of genetic gain. We tested this hypothesis by stochastic simulation. We simulated MC and MCAC in five breeding schemes with different family structures and heritabilities. We measured inbreeding as homozygosity due to identity-by-descent. Genetic gain was measured as increases in true breeding values.

Material and methods

Experimental design

We used stochastic simulation to compare rates of inbreeding and genetic gain realised by MC and MCAC with pedigree and genomic information in five breeding schemes. We also simulated random mating (RAND) as a reference point. Generations were discrete, animals were truncation-selected for a single trait that was controlled by 2000 quantitative trait loci (QTL), and the trait was observed for all selection candidates before selection. The criterion for selection was genomic-breeding values predicted by a ridge-regression model using 8257 markers. The five breeding schemes differed for mating ratio (1, 2 or 6 dams/sire), litter size (10, 20 or 60 offspring/litter) and heritability of the trait (0.1 or 0.4). Each combination of breeding scheme and mating strategy was run for 20 discrete generations and replicated 100 times.

Mating strategies

Minimum-coancestry mating

Selected sires and dams were paired by minimising the average coancestry of the proposed matings. Pedigree-additive relationship A or the genomic relationship G were created after (Meuwissen and Luo, Reference Meuwissen1992) and (Yang et al., Reference Yang, Benyamin, McEvoy, Gordon, Henders, Nyholt, Madden, Heath, Martin and Montgomery2010). The dimensions of A and G were N _s×N _d, where N _s and N _d are the number of selected sires and dams, and the elements ${1 \over 2}{\mib{A}} _{{ij}} $ and ${1 \over 2}{\mib{G}} _{{ij}} $ are expected coefficients of coancestry of individuals i and j. These coefficients are equivalent to the inbreeding coefficients of their offspring. Minimum-MC were performed using the algorithm described in Henryon et al. (Reference Henryon, Sorensen and Berg2009).

Mating by minimising the covariance between ancestral contributions

Selected sires and dams were paired by minimising the sum of absolute values of the covariances between ancestral genetic contributions. The algorithm used to pair the sires and dams by MCAC with pedigree information is described in the Appendix of Henryon et al. (Reference Henryon, Sorensen and Berg2009). We made one modification to the algorithm when MCAC with genomic information was used to pair sires and dams: matrices L and C were computed by decomposing the genomic-additive relationship matrix, G , between ancestors and their descendants, where L is a normed lower triangular matrix describing the genetic contribution from ancestors to their descendants and C represents genetic contributions from the ancestors to allocated matings. Computation of C is presented in the Appendix (Supplementary Material S1).

Random mating

Selected sires and dams were paired randomly.

Breeding schemes

The five breeding schemes are shown in Table 1, where scheme 1 is the hierarchical-breeding scheme that was used by Henryon et al. (Reference Henryon, Sorensen and Berg2009). In total, 20 sires were selected in each breeding scheme. Each sire was mated to one (schemes 2 and 3), two (schemes 4 and 5) and six (scheme 1) dams. In schemes 1 and 2, 1200 offspring were generated in each generation; 400 offspring were generated in schemes 3 to 5. The heritability of the trait under selection was 0.1 in schemes 1 to 4. It was 0.4 in scheme 5.

Table 1 Details of the simulation of the five breeding schemes with different population structure (the number of selected dams and litter size) and heritability

N _s=number of selected sires.

N _d=number of selected dams.

Simulation

Simulations were carried out in three stages (Figure 1). In the first stage, we generated a single founder population. This founder population was used as the basis for the subsequent stages. In the second stage, we generated base population, which was run for 100 replicates. In the third stage, we generated selected population, which was run for 100 replicates.

Figure 1 A summary of simulations. Simulations were carried out in the following three stages. In the first stage, 8257 markers and 2000 quantitative trait loci were generated by simulating a single founder population with a Fisher–Wright inheritance model. The founder population had an effective population size of 200 animals and 2000 generations, which was created to obtain desirable level of linkage disequilibrium between simulated loci. In the second stage, the base animals (in generation 0) were generated by choosing 20 sires and N _d dams from the last generation of the founder population. Two thousand identical-by-descent (IBD) markers were used to trace each base animal’s contribution to their descendant generations and infer IBD status relative the base population. In total, 20 sires and N _d dams in generation 0 were used to produce N _total offspring in generation 1. In the third stage, from generation 1 to 19, all N _total selection candidates were both genotyped and phenotyped before selection. In each generation, 20 sires and N _d dams were truncation-selected using breeding values predicted from a Ridge-Regression model and were mated to produce N _total offspring.

Founder population and genetic architecture

The 8257 markers and 2000 QTL were generated by simulating a founder population with a Fisher–Wright inheritance model. The population had an effective population size of 200 animals and 2000 generations. Simulation of the genomes has been described in Liu et al. (Reference Liu, Meuwissen, Sørensen and Berg2015). The simulated genome consisted of four 1 Morgan long chromosomes, on which 10 000 loci were equally distributed, resulting in 40 000 loci across the genome. The offspring inherited alleles at these loci from their parents following Mendel’s rules allowing for mutation and recombinations. Mutation was recurrent at a rate of 2.5×10⁻⁵ per locus per meiosis (Son et al., Reference Son, Kang, Kim, Park and Lee2014; Liu et al., Reference Liu, Meuwissen, Sørensen and Berg2015). Recombinations per chromosome were sampled from a Poisson distribution with a mean equal to the length of the chromosome in Morgan and were randomly placed along the chromosome assuming a uniform distribution. In generation −1, the average (±SD) linkage disequilibrium (LD) between neighbouring loci was R ²=0.27 (±0.32), and the allele frequency distribution followed a U-shaped distribution, with ~30.2% of the loci fixed. Average R ² between neighbouring loci was in the range of the estimation of LD in pig breeds (Badke et al., Reference Badke, Bates, Ernst, Schwab and Steibel2012).

Among all segregating loci, every second one with a minor allele frequency (MAF) >0.05 were used as markers. So in total, there were 8257 markers used for genomic prediction and mating strategies with genomic information in the simulated breeding schemes. Among the remaining segregating loci, 2000 loci with MAF>0.01 were used as QTL. The QTL allelic effects were assumed to follow a gamma distribution with a shape parameter of 1.48. This was the shape parameter derived for the distribution of QTL effects in pigs (Hayes and Goddard, Reference Hayes and Goddard2001).

Base population

We randomly sampled 20 sires and N _d dams from 200 animals in the last generation of the founder population, where N _d were the number of animals that were assigned to be sires and dams (Table 1). The base population produced the first generation of offspring (generation 1).

Selected population

In generation 1 to 19, 20 sires and N _d were selected from N _total animals and mated to produce N _total offspring in each breeding scheme (Table 1). Offspring produced in generation 20 were the result of 19 generations of selection. The offspring in each generation inherited alleles at markers and QTL from their parents following Mendel’s rules allowing for recombinations. The simulation of recombination was the same as for the founder population.

Tracking identity-by-descent

In total, 2000 identical-by-descent (IBD) markers were randomly distributed across the genome. These IBD markers were not involved in selection, but were assigned unique alleles to each base animal. They were used to trace each base animal’s contribution to their descendant generations and infer IBD status relative the base population. So within each locus of a descendant, each IBD marker allele could be traced directly back to the base animal from which it was derived. Any homozygous locus at IBD markers was an inbred locus. Inbreeding at each IBD marker was defined as the probability that two alleles at that locus from a randomly selected animal in the population are IBD.

Phenotypes

The phenotype of the trait for the ith base animal, y _i, was calculated as y _i=α _i+e _i, where α _i is the base animal’s true additive-genetic value and e _i is its residual environmental value. The true additive-genetic value was calculated as the sum of 4000 QTL effects. The effects of those QTL were scaled to achieve an initial genetic variance equal to the heritability, that is $$a_{j} \,\,{\equals}\,\,a_{j}^{\prime} \,{\times}\,\sqrt {{{h^{2} } \over {\mathop{\sum}\nolimits_{k\,{\equals}\,1}^n {2p_{k} (1{\minus}p_{k} )a_{k}^{{\prime2}} } }}} $$ , where h ² is 0.1 or 0.4, subscripts k (j) denote QTL k (j), p _k (p _j) is the frequency of the ‘1’ allele of QTL k (j) and $a_{k}^{\prime} \,(a_{k}^{\prime} )$ is the substitution effect of QTL k (j) before being scaled. The true breeding value for each animal was obtained by summing the allelic effects at each QTL. The additive QTL variance explained all additive-genetic variance $\big( {\sigma _{\alpha }^{2} \,{\equals}\,\sigma _{{qtl}}^{2} } \big)$ . The environmental values were sampled from the distribution $N\left( {0,\,\sigma _{e}^{2} \,{\equals}\,1{\minus}h^{2} } \right)$ . As a result, the phenotypes of the trait in the base population had a mean of 0 and a SD of 1. In descendant generations, QTL and tags were sampled according to principles of Mendelian inheritance. The environmental variance was constant through the simulation, such that genetic variance and heritability decreased over the course of generations of selection due to Bulmer-effect (Bulmer, Reference Bulmer1971).

Genomic prediction

Genomic-breeding values were predicted by fitting the ridge-regression model presented in Liu et al. (Reference Liu, Meuwissen, Sørensen and Berg2015). The breeding value g _i for animal i was defined as a parametric linear regression on marker covariates x _ij of the form $g_{i} \,\,{\equals}\,\,\mathop{\sum}\nolimits_{j\,{\equals}\,1}^p {x_{{ij}} \beta _{j} } $ , such that $y_{i} \,{\equals}\,\mu {\plus}\mathop{\sum}\nolimits_{j\,{\equals}\,1}^p {x_{{ij}} \beta _{j} {\plus}e_{i} } $ , where y _i is the phenotypic record of an animal from generation u and u−1, μ is the intercept, x _ij takes the value 0, 1 or 2 for animal i and locus j, and $\{ {\mib \beta } _{{\mib{j}} } \} _{{j\,{\equals}\,{\bf{1}} }}^{p} $ is a vector of marker effects (j=1, 2, …, p markers). Gaussian assumptions for model residuals were applied, that is the joint distribution of model residuals was assumed to follow $ N(0,\sigma _{e}^{2} )$ . The likelihood function yields

$$p\left( {y{\rm \mid}\mu ,\,g,\,\sigma _{e}^{2} } \right)\,\,{\equals}\,\,\prod\nolimits_{i\,{\equals}\,1}^n {N\left( {y_{i} \! \mid \! \mu {\plus}\mathop{\sum}\limits_{j\,{\equals}\,1}^p {x_{{ij}} \beta _{j} ,\sigma _{e}^{2} } } \right)} $$

where $N\big( {y_{i} \!\mid\! \mu {\plus}\mathop{\sum}\nolimits_{j\,{\equals}\,1}^p {x_{{ij}} \beta _{j} ,\sigma _{e}^{{\rm 2}} } } \big)$ is a normal density for random variable y _i centered at $\mu {\plus}\mathop{\sum}\nolimits_{j\,{\equals}\,1}^p {x_{{ij}} \beta _{j} } $ and with variance $\sigma _{e}^{{\rm 2}} $ . A common variance was assigned to all marker effects, that is $$\beta _{j} \sim N\big( {\beta _{j} \!\mid \!0,\,\sigma _{\beta }^{2} } \big)$$ . The predicted breeding value $\hat{g}_{i} $ used for selection was defined as $$\hat{g}_{i} \,{\equals}\,\mathop{\sum}\nolimits_{j\,{\equals}\,1}^p {x_{{ij}} \hat{\beta }_{j} } $$ .

Assessment criteria

We present the rates of inbreeding and genetic gain realised by MC, MCAC and RAND with pedigree and genomic information. The inbreeding coefficient was calculated for each individual as the proportion of IBD markers that are homozygous. Rates of inbreeding, ∆F, were calculated as 1−e ^β, where β is slope of the linear regression of ln(1−F _u) on u and F _u is the mean inbreeding coefficient for animals born at generation u, as used in Nirea et al. (Reference Nirea, Sonesson, Woolliams and Meuwissen2012). We did this transformation because theoretically the mean of inbreeding coefficients in a cohort becomes a linear function to the generation after data transformation, so that ∆F is constant across the generations (Benoit, Reference Benoit2011). Genetic gain at generation u was calculated as the difference in average true breeding values between generation u and u−1 (1<u<20). We calculated ∆F using F _u and rates of genetic gain (∆G) using true breeding values in generation 5 to 20. The difference between mating strategies with respect to ΔF and ∆G were tested for significance using Tukey’s HSD (honest significant difference, P<0.05).

We also present findings that provide insight into the mechanisms that underlie any difference in ΔF or ΔG between mating strategies with genomic and pedigree information:

• The inbreeding coefficient (F _u) over time.
• The genetic variance over time. The genetic variance was calculated as the variance of true breeding values in each generation.
• Genetic contributions.
- – The number of ancestors making a genetic contribution to the offspring in generation 20. We presented the average number of ancestors from generations 0 to 19 that made a genetic contribution to the offspring in generation 20 based on matrix C as described in Appendix (Supplementary Material S1) for genomic MCAC.
- – The deviation of long-term genetic contributions from the exact linear relationship between long-term genetic contributions and Mendelian-sampling terms. The Mendelian-sampling term was calculated as the difference between an animal’s true breeding value and the mean true breeding values of the parents of the animal. Long-term genetic contributions were computed based on both pedigree and genomic information (Henryon et al., Reference Henryon, Sorensen and Berg2009; Supplementary Material S1). Then the deviation was computed as the standard deviation of residuals from a linear regression of genetic contributions on Mendelian-sampling terms using the ancestors in generations 0 to 19 that made a genetic contribution to the offspring in generation 20.

All results are presented as means (and standard deviations across replicates) of the 100 simulation replicates.

Results

Rate of inbreeding

MC and MCAC with genomic information realised lower ΔF than MC and MCAC with pedigree information (Table 2). MC and MCAC with genomic information realised 6% to 22% less ΔF than with pedigree information in our five breeding schemes, which varied for mating ratio, litter size and heritability.

Table 2 Average rate of inbreeding (ΔF) realised by generation 5 to 20 in each of the pig breeding schemes

Pedigree MC=minimum-coancestry mating with pedigree information; genomic MC=minimum coancestry mating with genomic information; pedigree MCAC=mating by minimising the covariance between ancestral genetic contributions with pedigree information; genomic MCAC=mating by minimising the covariance between ancestral genetic contributions with genomic information; RAND=random mating.

The SD of means of 100 replicates of ΔF were <0.0044.

^a,b,cValues within a row with different superscripts differ significantly (Tukey’s honest significant difference, P<0.05).

MC and MCAC using the same information source realised similar ΔF (Table 2). In turn, MC and MCAC with genomic and pedigree information realised 20% to 44% less ΔF than RAND.

The reduction in ΔF by using MC and MCAC with genomic information instead of pedigree information was larger when mating ratio, litter size and heritability were smaller in our breeding schemes. Mating ratio had the largest impact on the reduction in ΔF. The reduction in ΔF by using genomic information was 6% to 11% in scheme 1 with mating ratio of 6 dams/sire. It increased to 16% to 18% in scheme 2 with mating ratio of 1 dam/sire. Likewise, the reduction in ΔF was 9% to 13% in scheme 4 with mating ratio of 2 dams/sire. It increased to 17% to 22% in scheme 3 with mating ratio of 1 dam/sire. On the other hand, litter size and heritability had the smallest impact on the reduction in ΔF. The reduction in ΔF was increased from 16% to 18% to 17% to 22%, when the number of offspring per litter was decreased from 60 in scheme 2 to 20 in scheme 3. The reduction in ΔF was increased from 7% to 9% to 9% to 13%, when the heritability was decreased from 0.4 in scheme 5 to 0.1 in scheme 4.

Rates of genetic gain

MC and MCAC with genomic and pedigree information realised similar ΔG within each of the five breeding schemes (Table 3). In turn, MC and MCAC with genomic and pedigree information realised 1% to 14% more ΔG than RAND.

Table 3 Average rate of genetic gain (ΔG) realised by different mating strategies at generation 5 to 20 in each of the pig breeding schemes

The SD of means of 100 replicates of ΔG were <0.001.

^a,b,c,dValues within a row with different superscripts differ significantly (Tukey’s honest significant difference, P<0.05).

The following sections present findings that provide insight into the mechanisms that underlie the differences in ΔF between mating strategies with genomic and pedigree information.

Inbreeding over time

MC and MCAC with genomic information realised less inbreeding over time than MC and MCAC with pedigree information. This is illustrated for scheme 1 (Figure 2a). In this scheme, the onset of inbreeding with genomic and pedigree information was delayed until generation 3. After the onset of inbreeding, MC and MCAC with genomic information realised 6% to 11% less ΔF than with pedigree information. The result of delaying the onset of inbreeding and reducing ΔF with genomic information was that at generation 20, MC and MCAC with genomic information realised 3% to 8% less inbreeding than with pedigree information. In breeding schemes 2 to 5, MC and MCAC with genomic information realised 4% to 18% less inbreeding than with pedigree information at generation 20 (results not shown).

Figure 2 (a) Inbreeding coefficient and (b) genetic variance in each generation of selection in breeding scheme 1. Pedigree MC=minimum-coancestry mating with pedigree information; genomic MC=minimum coancestry mating with genomic information; pedigree MCAC=mating by minimising the covariance between ancestral genetic contributions with pedigree information; genomic MCAC=mating by minimising the covariance between ancestral genetic contributions with genomic information; RAND=random mating.

MC and MCAC with genomic and pedigree information realised less inbreeding over time than RAND. This was also illustrated for scheme 1, where the onset of inbreeding with RAND occurred one generation earlier than MC and MCAC with both genomic and pedigree information (generation 2 v. 3, Figure 2a). After the onset of inbreeding, MC and MCAC with genomic and pedigree information realised 24% to 32% less ΔF than RAND in scheme 1 (Table 2). In breeding schemes 2 to 5, MC and MCAC with genomic and pedigree information realised 19% to 41% less inbreeding than with RAND at generation 20 (results not shown).

Genetic variance

MC and MCAC with genomic information maintained more genetic variance than with pedigree information. This is illustrated for scheme 1 (Figure 2b). The genetic variance maintained by MC and MCAC with genomic information (0.020) at generation 20 was 11% higher than with pedigree information (0.018). The variance maintained in RAND was only 0.014.

Genetic contributions

MC and MCAC with genomic information had two impacts on realised genetic contributions of ancestors to their descendants. First, MC and MCAC with genomic information resulted in more ancestors making long-term genetic contributions to descendants in the populations than pedigree information (Table 4). With genomic information, the number of ancestors from generation 0 to 19 that made a genetic contribution to the offspring in generation 20 of each breeding scheme ranged from 11 to 47 per generation. The number of ancestors was 10 to 44 per generation with pedigree information. With RAND, the number of ancestors making contributions was only 8 to 41 per generation. Second, the genomic information reduced the standard deviation of residuals from the linear regression of genomic-based genetic contributions on Mendelian-sampling terms (Table 5). The standard deviation of residuals with genomic information ranged from 0.05 to 0.09 by using MC and MCAC at generation 20 in all breeding schemes. This was 2% to 7% less than with pedigree information. RAND generated highest standard deviation of residuals with genomic information, which ranged from 0.07 to 0.13 in all breeding schemes.

Table 4 Average number of ancestors in generations 0 to 19 that made a genetic contribution to offspring in generation 20 in all breeding schemes

The SD of means of 100 replicates of the number of ancestors making a genetic contribution were <0.42.

Table 5 Mean of SD of residuals from a linear regression of genetic contributions on Mendelian-sampling terms for the ancestors in generations 0 to 19 that made a genetic contribution to the offspring in generation 20

The SD of means of 100 replicates of SD_ped were <0.0024 and those of SD_gen were <0.0041.

¹ The deviation of long-term genetic contributions from exact linear relationship with true Mendelian-sampling terms. It was achieved by presenting the SD of residuals from the linear regression of pedigree-based genetic contributions on Mendelian-sampling terms.

² The deviation of long-term genetic contributions from exact linear relationship with true Mendelian-sampling terms. It was achieved by presenting the SD of residuals from the linear regression of genomic-based genetic contributions on Mendelian-sampling terms.

Discussion

Our results supported the hypothesis that MC and MCAC with genomic information realise lower rates of inbreeding than MC and MCAC with pedigree information without compromising rates of genetic gain. The 6% to 22% reduction in ΔF by using genomic information with these mating strategies was realised over a range of breeding schemes using truncation selection. These reductions are worthwhile in practical breeding schemes because they can be achieved without extra costs and logistical constraints. All of the information required to implement MC and MCAC with genomic information is available from genomic prediction, which was the initial reason for implementing genomic information in breeding schemes. The mating strategies use genomic information to pair the parents more appropriately. They disperse genetic contributions of ancestors more widely, allowing selection to align the ancestors closer to the exact threshold-linear relationship. This is clearly an improvement on pedigree information. Therefore, genomic information should be applied to more than just prediction of breeding values in breeding schemes with truncation selection.

As we proposed, MC and MCAC with genomic information realised lower ΔF than with pedigree information for two reasons. First, MC and MCAC with genomic information increased independence and reduced confounding between the genetic contributions of ancestors. Second, this increased independence enabled selection to increase the contributions of ancestors with the largest Mendelian-sampling terms. The contributions of ancestors stabilised closer to the exact linear relationship between the long-term genetic contributions and Mendelian-sampling terms of the ancestors. It resulted in more ancestors making long-term contributions to descendant animals. We verified this with our analyses of genetic contributions. This underlying mechanism, linking truncation selection to genetic gain and inbreeding applies across a broad range of breeding schemes, species and genetic architectures. So, we recommend that MC or MCAC with genomic information become the method-of-choice for any breeding scheme with truncation selection because it should always realise lower ΔF.

Although we expect these mechanisms to apply to any breeding scheme, species and genetic architecture, the reduction in ΔF by using genomic information was largest with small mating ratio, litter size and heritability in our breeding schemes. Mating ratio had the largest impact on the reduction in ΔF because it affects the independence of genetic contributions of ancestors. With small mating ratio, there was more dependence between contributions of sires and dams to be broken up by MC and MCAC. In contrast, there was less dependence between the contributions with large mating ratio, and therefore, there was less to be gained by using MC and MCAC with genomic information. On the other hand, litter size and heritability had the smallest impact on the reduction in ΔF. With small litter size, dependence between contributions of sires and dams were larger, because there were fewer reshuffled chromosome segments inherited from the parents to the offspring. In contrast, with large litter size, many recombinations occurred and more combinations of chromosome segments were created in the offspring. Then genomic selection can select animals carrying diverse combinations of chromosome segments to mate subsequently, allowing for new degrees of freedom to be generated for allocating mates. In this case, there was less to be gained by using genomic information compared with small litter size. With low heritability, the selection was less accurate, so that more independence was required for subsequent changing of genetic contributions. More independence can be achieved by using genomic information. Therefore, although MC and MCAC with genomic information should always realise lower ΔF, we will gain most benefit by implementing MC and MCAC with genomic information in breeding schemes when there is less independence between genetic contributions of ancestors or where more independence is required.

Not only did MC and MCAC with genomic information reduce ΔF, they did so without compromising genetic gain for two reasons. First, MC and MCAC with genomic information reduced the deviations of genetic contributions from the threshold-linear relationship without changing the slope of the relationship. It resulted in the same selection intensity and the same accuracy realised by MC and MCAC with pedigree and genomic information. Second, we are able to maintain larger genetic variance by using MC and MCAC with genomic information, so we can obtain as much genetic gain as using MC and MCAC with pedigree information (Woolliams et al., Reference Woolliams, Pong-Wong and Villanueva1999). Therefore, by applying MC and MCAC with genomic information, we are able to guarantee a breeding programme with less inbreeding without sacrificing any benefit.

Although mating with genomic information can better manage ΔF than with pedigree information, it was surprising to find similar ΔF and ΔG realised by MC and MCAC, even though they use different criteria to allocate mates. MC improves the family structure by minimising the variance of ancestral genetic contributions for a randomly chosen offspring, whereas MCAC minimises the covariance of ancestral genetic contributions (Caballero et al., Reference Caballero, Santiago and Toro1996; Sonesson and Meuwissen, Reference Sonesson and Meuwissen2000; Henryon et al., Reference Henryon, Sorensen and Berg2009; Nirea et al., Reference Nirea, Sonesson, Woolliams and Meuwissen2012). The reason that the differences in ΔF between MC and MCAC were small was that the mating strategy that controls the variance of genetic contributions also controls covariance between genetic contributions (Supplementary Tables S1 and S2 and Supplementary Figure S1). This means that the mechanisms underlying MC and MCAC are similar even though these two mating strategies applied different methodologies to achieve lower ΔF. Therefore, it does not matter which one to use in practical breeding schemes with truncation selection and genomic information.

There are numerous ways to estimate realised coancestry and genetic contributions directly from the marker data (Speed and Balding, Reference Sørensen, Berg and Woolliams2015). The G matrix we estimated was genome-wise averages of single-SNP statistics, which was similar to VanRaden’s G matrices in that it did not take lengths of the genomic regions shared between individuals into account (VanRaden, Reference VanRaden2008). Edwards (Reference Edwards2015) proposed measures of coancestry based on models for the joint distribution of markers at the haplotype level. These measures can estimate genealogical relatedness more accurately than vanRaden’s G particularly for distantly related individuals. The use of haplotype methods can track the way the alleles inherited from parents to offspring more accurately, which may further increase the independence between the contributions of sires and dams compared with G matrices. If this is the case, then we would expect that MC and MCAC with relationship matrices estimated using haplotype methods will generate even lower rates of inbreeding than using G matrices.

Acknowledgements

This study was financed by Green Development and Demonstration, VikingGenetics (Denmark), Danish Agency for Science, Technology and Innovation (Project number 09-067559) and the Centre for Genomic Selection in Animals and Plants (GenSAP) funded by The Danish Council for Strategic Research.

Supplementary Material

To view supplementary material for in this article, please visit http://dx.doi.org/10.1017/S1751731116001786

References

Badke, YM, Bates, RO, Ernst, CW, Schwab, C and Steibel, JP 2012. Estimation of linkage disequilibrium in four US pig breeds. BMC Genomics 13, 24.Google Scholar

Benoit, K 2011. Linear regression models with logarithmic transformations. London School of Economics and Political Science 1–8. Available at http://www.kenbenoit.net/courses/ME104/logmodels2.pdfGoogle Scholar

Bulmer, MG 1971. The effect of selection on genetic variability. The American Naturalist 105, 201–211.Google Scholar

Caballero, A, Santiago, E and Toro, MA 1996. Systems of mating to reduce inbreeding in selected populations. Animal Science 62, 431–442.CrossRef Google Scholar

Edwards, D 2015. Two molecular measures of relatedness based on haplotype sharing. BMC Bioinformatics 16, 383.CrossRef Google Scholar PubMed

Grundy, B, Villanueva, B and Woolliams, JA 1998. Dynamic selection procedures for constrained inbreeding and their consequences for pedigree development. Genetic Research 72, 159–168.CrossRef Google Scholar

Hayes, BJ and Goddard, ME 2001. The distribution of the effects of genes affecting quantitative traits in livestock. Genetic Selection Evolution 33, 209–229.Google Scholar

Hayes, BJ, Visscher, PM and Goddard, ME 2009. Increased accuracy of artificial selection by using the realised relationship matrix. Genetics Research 91, 47–60.Google Scholar

Henryon, M, Berg, P and Sorensen, AC 2014. Animal-breeding schemes using genomic information need breeding plans designed to maximise long-term genetic gains. Livestock Science 166, 38–47.Google Scholar

Henryon, M, Sorensen, AC and Berg, P 2009. Mating animals by minimising the covariance between ancestral contributions generates less inbreeding without compromising genetic gain in breeding schemes with truncation selection. Animal 3, 1339–1346.CrossRef Google Scholar PubMed

Lindgren, D and Matheson, AC 1986. An algorithm for increasing the genetic quality of seed from orchards by using the better clones in higher proportions. Silvae Genetica 35, 173–177.Google Scholar

Liu, H, Meuwissen, TH, Sørensen, AC and Berg, P 2015. Upweighting rare favourable alleles increases long-term genetic gain in genomic selection programs. Genetics Selection Evolution 47, 3.Google Scholar

Meuwissen, THE 2007. Operation of conservation schemes. In Utilisation and conservation of farm animals genetic resources (ed. H Oldenbroek), pp. 167–193. Wageningen Academic, the Netherlands.CrossRef Google Scholar

Meuwissen, THE and Luo, Z 1992. Computing inbreeding coefficients in large populations. Genetics Selection Evolution 24, 305–313.Google Scholar

Nirea, KG, Sonesson, AK, Woolliams, JA and Meuwissen, TH 2012. Effect of non-random mating on genomic and BLUP selection schemes. Genetics Selection Evolution 44, 11.Google Scholar

Son, J, Kang, H, Kim, J, Park, J and Lee, D 2014. Accuracy of genomic prediction in a simulated pig populations. Proceedings, 10th World Congress of Genetics Applied to Livestock Production, 18–22 August 2014, Vancouver, BC, Canada.Google Scholar

Sonesson, AK and Meuwissen, TH 2000. Mating schemes for optimum contribution selection with constrained rates of inbreeding. Genetics Selection Evolution 32, 231–248.Google Scholar

Sørensen, AC, Berg, P and Woolliams, JA 2005. The advantage of factorial mating under selection is uncovered by deterministically predicted rates of inbreeding. Genetics Selection Evolution 37, 57–81.CrossRef Google Scholar PubMed

Speed, D and Balding, DJ 2015. Relatedness in the post-genomic era: is it still useful? Nature Review Genetics 16, 33–44.Google Scholar

Sun, C, VanRaden, PM, O’Connell, JR, Weigel, KA and Gianola, D 2013. Mating programs including genomic relationships and dominance effects. Journal of Dairy Science 96, 8014–8023.Google Scholar

VanRaden, PM 2008. Efficient methods to compute genomic predictions. Journal Dairy Science 91, 4414–4423.Google Scholar

Woolliams, JA 2006. Designs and evaluations for managing genetic diversity in breeding programs, Proceedings of the 8th World Congress on Genetics Applied to Livestock Production, 13–18 August 2006, Belo Horizonte, MG, Brazil, Communication no. 30-01.Google Scholar

Woolliams, JA 2007. Genetic contributions and inbreeding. In Utilisation and conservation of farm animal genetic resources (ed. K Oldenbroek), pp. 147–165. Wageningen Academic Publishers, Wageningen, the Netherlands.Google Scholar

Woolliams, JA, Bijma, P and Willanueva, B 1999. Expected genetic contributions and their impact on gene flow and genetic gain. Genetics 153, 1009–1020.Google Scholar

Woolliams, JA, Pong-Wong, R and Villanueva, B 2002. Strategic optimisation of short- and long-term gain and inbreeding in MAS and non-MAS schemes. Proceedings of the 7th World Congress on Genetics Applied to Livestock Production, 19–23 August 2002, Montpelier, VT, France, pp. 155-–162.Google Scholar

Woolliams, JA and Thompson, R 1994. A theory of genetic contributions. Proceedings of the 5th World Congress on Genetics Applied to Livestock Production, 7–12 August 1994, Guelph, ON, Canada, pp. 127–134.Google Scholar

Wright, S 1921. Systems of mating. Genetics 6, 111–178.Google Scholar

Yang, J, Benyamin, B, McEvoy, BP, Gordon, S, Henders, AK, Nyholt, DR, Madden, PA, Heath, AC, Martin, NG, Montgomery, GW et al 2010. Common SNPs explain a large proportion of the heritability for human height. Nature Genetics 42, 565–569.Google Scholar

Table 1 Details of the simulation of the five breeding schemes with different population structure (the number of selected dams and litter size) and heritability

Figure 1 A summary of simulations. Simulations were carried out in the following three stages. In the first stage, 8257 markers and 2000 quantitative trait loci were generated by simulating a single founder population with a Fisher–Wright inheritance model. The founder population had an effective population size of 200 animals and 2000 generations, which was created to obtain desirable level of linkage disequilibrium between simulated loci. In the second stage, the base animals (in generation 0) were generated by choosing 20 sires and Nd dams from the last generation of the founder population. Two thousand identical-by-descent (IBD) markers were used to trace each base animal’s contribution to their descendant generations and infer IBD status relative the base population. In total, 20 sires and Nd dams in generation 0 were used to produce Ntotal offspring in generation 1. In the third stage, from generation 1 to 19, all Ntotal selection candidates were both genotyped and phenotyped before selection. In each generation, 20 sires and Nd dams were truncation-selected using breeding values predicted from a Ridge-Regression model and were mated to produce Ntotal offspring.

Table 2 Average rate of inbreeding (ΔF) realised by generation 5 to 20 in each of the pig breeding schemes

Table 3 Average rate of genetic gain (ΔG) realised by different mating strategies at generation 5 to 20 in each of the pig breeding schemes

Table 4 Average number of ancestors in generations 0 to 19 that made a genetic contribution to offspring in generation 20 in all breeding schemes

Liu supplementary material

Liu supplementary material 1

File 111 KB

Article contents

Mating strategies with genomic information reduce rates of inbreeding in animal breeding schemes without compromising genetic gain

Abstract

Keywords

Information

Implications

Introduction

Material and methods

Experimental design

Mating strategies

Minimum-coancestry mating

Mating by minimising the covariance between ancestral contributions

Random mating

Breeding schemes

Simulation

Founder population and genetic architecture

Base population

Selected population

Tracking identity-by-descent

Phenotypes

Genomic prediction

Assessment criteria

Results

Rate of inbreeding

Rates of genetic gain

Inbreeding over time

Genetic variance

Genetic contributions

Discussion

Acknowledgements

Supplementary Material

References

Liu supplementary material

Save article to Kindle

Save article to Dropbox

Save article to Google Drive

Reply to: Submit a response

Your details

You have entered the maximum number of contributors

Conflicting interests