Hostname: page-component-6766d58669-88psn Total loading time: 0 Render date: 2026-05-17T11:04:04.188Z Has data issue: false hasContentIssue false

Plant variety selection using interaction classes derived from factor analytic linear mixed models: models with information on genetic relatedness

Published online by Cambridge University Press:  27 February 2025

Alison Barbara Smith*
Affiliation:
Mixed Models and Experimental Design Lab, School of Mathematics and Applied Statistics, University of Wollongong, Wollongong, NSW, Australia
Arun S. K. Shunmugam
Affiliation:
Agriculture Victoria Research, Grains Innovation Park, Horsham, VIC, Australia
David G. Butler
Affiliation:
Mixed Models and Experimental Design Lab, School of Mathematics and Applied Statistics, University of Wollongong, Wollongong, NSW, Australia
Brian R. Cullis
Affiliation:
Mixed Models and Experimental Design Lab, School of Mathematics and Applied Statistics, University of Wollongong, Wollongong, NSW, Australia
*
Corresponding author: Alison Barbara Smith; Email: alismith@uow.edu.au
Rights & Permissions [Opens in a new window]

Abstract

The concept of interaction classes (iClasses) for multi-environment trial data was introduced to address the problem of summarising variety performance across environments in the presence of variety by environment interaction (VEI). The approach involves the fitting of a factor analytic linear mixed model (FALMM), with the resultant estimates of factor loadings being used to form groups of environments (iClasses) that discriminate varieties with different patterns of VEI. It is then meaningful to summarise variety performance across environments within iClasses. The iClass methodology was developed with respect to a FALMM in which the genetic effects for different varieties were assumed independent. This was done for pedagogical reasons but it was pointed out that the accuracy of variety selection is greatly enhanced by considering the genetic relatedness of varieties, either via ancestral or genomic information. The focus of the current paper is therefore to extend the iClass approach for FALMMs which incorporate such information. In addition, a measure of stability of variety performance across iClasses is defined. The utility of the approach for variety selection is illustrated using a multi-environment trial dataset from the lentil breeding programme operated by Agriculture Victoria.

Information

Type
Crops and Soils Research Paper
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited
Copyright
© The Author(s), 2025. Published by Cambridge University Press
Figure 0

Table 1. Multi-environment trial dataset for L0, L1, L2 and L3 stage selection decisions in 2023: number of trials included from each stage and year. Note that, prior to 2023, the entries in different stages were grown in separate trials, whereas in 2023, the L1, L2 and L3 entries were grown together in the same trial

Figure 1

Figure 1. Factor analytic regression model for hypothetical example: fitted values for three varieties (v1, v2 and v3) from model with three factors and nine environments. Fitted values (represented as points) are plotted against estimated factor loadings for individual factors (panels (a), (b) and (c)). The slopes of the regression lines are the empirical best linear unbiased predictions of factor scores for each variety. The residual maximum likelihood estimates of the (rotated) loadings are shown along the bottom axis, and the associated environments are labelled along the top axis.

Figure 2

Table 2. Summary of model fits when factor analytic models of order ${k_a}$ and ${k_e}$ used for additive and non-additive (independent) variety effects, respectively. Note that an order of zero means no factors were fitted so corresponds to a diagonal variance structure; the missing order for ${k_a}$ means that additive variety effects were not included in the model. The residual log-likelihoods and Akaike Information Criteria are provided as differences from model M1. The number of genetic variance parameters is given for each model. For all models with non-zero ${k_a}$ and ${k_e}$, the final columns in the table show the percentage of additive genetic variance accounted for by ${k_a}$ additive factors; percentage of non-additive genetic variance accounted for by ${k_e} = 1$ non-additive factor; percentage of total genetic variance accounted for by all $k = {k_a} + {k_e}$ factors

Figure 3

Table 3. Number of environments in each interaction class for classes based on all 5 factors (top half of table) and classes based on the first 4 factors in order of percentage variance accounted for (bottom half of table)

Figure 4

Table 4. Summary of information used to define interaction classes for a subset of 23 environments: rotated residual maximum likelihood estimates of loadings ($ \times 1000$) for each factor, ordered on variance accounted for (additive factors 1, 2, 3 and 4, non-additive factor 1); interaction classes based on all five factors and first four factors. The 23 environments cover all 13 interaction classes when five factors used, with two randomly chosen environments within each class (apart from the final three classes, each of which only contained one environment)

Figure 5

Figure 2. Rotated residual maximum likelihood estimates of loadings for first and second additive factors. Environments with positive estimated loadings in the second factor have been labelled.

Figure 6

Figure 3. Plot of interaction class overall performance (t/ha) for three varieties that are probes for presence of the disease Botrytis Grey Mould. The variety PBA BOLT is a susceptible variety, whereas ALB TERRIER and PBA JUMBO2 are tolerant varieties. The number of environments in each iClass and their associated mean yield (t/ha) is given along the top axis. The dashed horizontal lines are the grand means of the common variety by environment effects across all environments for these varieties.

Figure 7

Figure 4. Estimated genetic correlations for total common variety by environment effects for all pairs of environments summarised on an interaction class basis for (a) classes based on all five factors and (b) classes based on first four factors. The value listed in each cell is the mean of all pairwise estimated correlations between environments in the interaction class. The interaction class labels include the associated numbers of environments in parentheses. The colour scale corresponds to the mean values.

Figure 8

Figure 5. Empirical best linear unbiased predictions of additive factor scores for the 125 L3 varieties grown in 2023. Varieties of interest have been labelled with their names and two key varieties have been plotted in red. In each panel, the y-axis corresponds to the first factor and the x-axis to the second, third and fourth additive factors for (a), (b) and (c) and the non-additive factor for (d).

Figure 9

Figure 6. Stability across interaction classes (excluding those linked to the disease Botrytis Grey Mould) for the 125 L3 varieties grown in 2023: interaction classes defined using (a) all five factors and (b) first four factors. The y-axis in each plot is the grand mean of the common variety by environment effects (for each variety) across environments and the x-axis is the square root of the between interaction class mean square from the one-way analysis of variance on those effects. Points are coloured according to the p-value for the variance ratio from the analysis of variance. Varieties of interest have been labelled with their names.

Figure 10

Table 5. Stability across interaction classes (excluding those linked to the disease Botrytis Grey Mould) for the 125 L3 varieties grown in 2023: p-values of variance ratios (ratio of between to within interaction class mean square) from analyses of variance tabulated for interaction classes defined using first four factors and all five factors

Figure 11

Figure 7. Plots of interaction class overall performance (t/ha) (excluding classes linked to the disease Botrytis Grey Mould) for (a) four varieties comprising ALB TERRIER and three test lines and (b) three varieties comprising ALB TERRIER and two other commercial varieties. The number of environments in each interaction class and their associated mean yield (t/ha) is given along the top axis. The dashed horizontal lines are the grand means of the common variety by environment effects for these varieties.

Figure 12

Table 6. Interaction class overall performance (t/ha) and factor scores for ALB TERRIER and PBA HURRICANE XT. Also given are the differences between the two varieties (ALB TERRIER and PBA HURRICANE XT for interaction class overall performance and the reverse order for factor scores.)

Supplementary material: File

Smith et al. supplementary material 1

Smith et al. supplementary material
Download Smith et al. supplementary material 1(File)
File 2 MB
Supplementary material: File

Smith et al. supplementary material 2

Smith et al. supplementary material
Download Smith et al. supplementary material 2(File)
File 12.5 MB
Supplementary material: File

Smith et al. supplementary material 3

Smith et al. supplementary material
Download Smith et al. supplementary material 3(File)
File 4.5 MB