Hostname: page-component-89b8bd64d-4ws75 Total loading time: 0 Render date: 2026-05-07T01:04:56.472Z Has data issue: false hasContentIssue false

Improving the inference of population genetic structure in the presence of related individuals

Published online by Cambridge University Press:  16 May 2014

SILVIA T. RODRÍGUEZ-RAMILO
Affiliation:
Departamento de Mejora Genética Animal, Instituto Nacional de Investigación y Tecnología Agraria y Alimentaria, Ctra. La Coruña Km. 7,5. 28040, Madrid, Spain
MIGUEL A. TORO
Affiliation:
Departamento de Producción Animal, Escuela Técnica Superior de Ingenieros Agrónomos, Universidad Politécnica de Madrid, 28040, Madrid, Spain
JINLIANG WANG
Affiliation:
Institute of Zoology, Zoological Society of London, London NW1 4RY, UK
JESÚS FERNÁNDEZ*
Affiliation:
Departamento de Mejora Genética Animal, Instituto Nacional de Investigación y Tecnología Agraria y Alimentaria, Ctra. La Coruña Km. 7,5. 28040, Madrid, Spain
*
* Corresponding author: Departamento de Mejora Genética Animal, Instituto Nacional de Investigación y Tecnología Agraria y Alimentaria, Ctra. La Coruña Km. 7,5, 28040, Madrid, Spain. E-mail: jmj@inia.es
Rights & Permissions [Opens in a new window]

Summary

It is well known that the presence of related individuals can affect the inference of population genetic structure from molecular data. This has been verified, for example, on the unsupervised Bayesian clustering algorithm implemented in the software STRUCTURE. This methodology assumes, among others, Hardy–Weinberg and linkage equilibrium within subpopulations. The existence of groups of close relatives, such as full-sib families, may prevent these assumptions to be fulfilled, causing the algorithm to work suboptimally. The purpose of this study was to evaluate the effect of the presence of related individuals on a different methodology (implemented in CLUSTER_DIST) for population genetic structure inference. This approach arranges individuals to maximize the genetic distance between groups and does not make Hardy–Weinberg and linkage equilibrium assumptions. We study the robustness of this approach to the presence of close relatives in a sample using simulated scenarios involving combinations of several factors, including the number of subpopulations, the level of differentiation between them, the number, size and type (full or half-sibs) of families in a sample, and the type and number of molecular markers available for clustering analysis. Results indicate that the methodology that maximizes the genetic distance between subpopulations is less influenced by the presence of related individuals than the program STRUCTURE. Therefore, the former can be used, in combination with the program STRUCTURE, to analyse population genetic structure when related individuals are suspected to be present in a sample.

Information

Type
Research Paper
Copyright
Copyright © Cambridge University Press 2014 
Figure 0

Fig. 1. Proportion of replicates where STRUCTURE (left panels) and CLUSTER_DIST (right panels) infer K = 3 when n = 3 in the presence of full-siblings. Dashed and solid lines indicate 10 and 20 microsatellites, respectively. Triangles represent one family, squares indicate two families in the same subpopulation, and circles represent two families in different subpopulations.

Figure 1

Fig. 2. Proportion of replicates where STRUCTURE (left panels) and CLUSTER_DIST (right panels) infer K = 3 when n = 3 in the presence of full-siblings. Dashed and solid lines indicate 100 and 200 SNPs, respectively. Triangles represent one family, squares indicate two families in the same subpopulation, and circles represent two families in different subpopulations.

Figure 2

Fig. 3. Proportion of replicates where STRUCTURE (left panels) and CLUSTER_DIST (right panels) infer K = 3 when n = 3 in the presence of half-siblings, using 10 microsatellites. Triangles represent one family, squares indicate two families in the same subpopulation, and circles represent two families in different subpopulations.

Supplementary material: PDF

Rodríguez-Ramilo Supplementary Material

Figures S1, S2 and S3

Download Rodríguez-Ramilo Supplementary Material(PDF)
PDF 302.2 KB