Hostname: page-component-6766d58669-76mfw Total loading time: 0 Render date: 2026-05-18T18:02:49.148Z Has data issue: false hasContentIssue false

What do artificial neural networks tell us about the genetic structure of populations? The example of European pig populations

Published online by Cambridge University Press:  27 April 2009

NATACHA NIKOLIC
Affiliation:
Laboratoire de Génétique Cellulaire (UMR 444), INRA-ENVT, BP 52627, 31326 Castanet Tolosan Cedex, France
YOUNG-SEUK PARK
Affiliation:
Department of Biology, Kyung Hee University, Dongdaemun-gu, Seoul 130-701, Korea
MAGALI SANCRISTOBAL
Affiliation:
Laboratoire de Génétique Cellulaire (UMR 444), INRA-ENVT, BP 52627, 31326 Castanet Tolosan Cedex, France
SOVAN LEK
Affiliation:
Laboratoire Evolution de la Diversité Biologique (UMR 5274), CNRS, 118 route de Narbonne, 31400 Toulouse Cedex 4, France
CLAUDE CHEVALET*
Affiliation:
Laboratoire de Génétique Cellulaire (UMR 444), INRA-ENVT, BP 52627, 31326 Castanet Tolosan Cedex, France
*
*Corresponding author. Claude Chevalet, Laboratoire de Génétique Cellulaire, INRA-Toulouse, BP 52627, 31326 Castanet Tolosan Cedex, France. Tel: +33 5 61 28 51 17. Fax: +33 5 61 28 53 08. e-mail: claude.chevalet@toulouse.inra.fr
Rights & Permissions [Opens in a new window]

Summary

General and genetic statistical methods are commonly used to deal with microsatellite data (highly variable neutral genetic markers). In this paper, the self-organizing map (SOM) that belongs to the unsupervised artificial neural networks (ANNs) was applied to analyse the structure of 58 European and two Chinese pig populations (Sus scrofa) including commercial lines, local breeds and cosmopolitan breeds. Results were compared with other unsupervised classification or ordination methods such as factorial correspondence analysis, hierarchical clustering from an allele sharing distance and the Bayesian genetic model and with principal components analysis and neighbour joining from allelic frequencies and genetic distances between populations. Like other methods, SOMs were able to classify individuals according to their breed origin and to visualize similarities between breeds. They provided additional information on the within- and between-population diversity, allowed differences between similar populations to be highlighted and helped differentiate different groups of populations.

Information

Type
Paper
Copyright
Copyright © Cambridge University Press 2009
Figure 0

Fig. 1. Classification of European pigs generated by SOM. (a) SOM tree of the eight clusters defined in Table 1. (b) Classification of pig populations on the SOM map. The contents of the eight SOM clusters are detailed in Table 1. The repartition of the breeds on this map is shown in Figure 2. (c) U-matrix map. This map is made up of ‘mini-cells’ corresponding to both the output neurones and to the links between adjacent cells. Black or dark mini-cells indicate links between unrelated or distant cells and hence limits between clusters. The main breeds corresponding to clusters are Landrace LR, Large White LW, Pietrain PI for clusters 1, 3 and 4, respectively, Meishan for cluster 6 and Duroc and Hampshire for cluster 8.

Figure 1

Table 1. Distribution of pig populations in the different clusters (1–8) given by SOM, STRUCTURE (St), AS and NJ methods

Figure 2

Fig. 2. Repartition on the SOM map of European pig breeds and populations. (a) Repartition on the SOM map of cosmopolitan and local breeds: breeds are designated by their two-letter codes. Examples of the dispersion of individuals from a single national population and from a single commercial line are shown for the Landrace (cluster 1) and the Large White (cluster 3) breeds. The two large dotted circles correspond to pairs of similar breeds that are spread in the same region of the SOM map (RE-NI and AS-BS). (b) Repartition of the four synthetic lines and of populations that are spread in various places on the SOM map. IR stands for the Icelandic Landrace population.

Figure 3

Fig. 3. Relationship between heterozygosity and relative SOM diversity in cosmopolitan and local breeds. Abscissa: expected heterozygosity. Ordinates: relative SOM diversity defined as the ratio of Nc (the number of SOM cells occupied by individuals of the population) to the total number of cells in the corresponding cluster. (a) National populations and commercial lines of cosmopolitan breeds Landrace, Large White and Piétrain (clusters 1, 3 and 4). The Scandinavian Landrace populations (triangles) that show a low heterozygosity behave like the commercial lines. (b) Local breeds (clusters 5, 7 and 8).

Figure 4

Fig. 4. PCA ordination. Two projections of combinations of the first four components C1, C2, C3 and C4 are shown. Populations are designated by the two-letter code of the breed. IR stands for the Icelandic Landrace population. (a) Coordinates are the combinations C2−C1/2 and C3+C4/2. Note the external position of the Chinese Meishan breed (MS) and of the Tia Meslan synthetic (TM), the intermediate position of the Icelandic Landrace between the Landrace and the Large White clusters, and the position of British Lop inside the Landrace cluster. (b) Coordinates are the combinations C1 and C3. Note the general similarity between this topology and that proposed by SOM (Figs 1c and 2a).