Hostname: page-component-6766d58669-zlvph Total loading time: 0 Render date: 2026-05-22T03:49:46.591Z Has data issue: false hasContentIssue false

Analysis of a genetically structured variance heterogeneity model using the Box–Cox transformation

Published online by Cambridge University Press:  25 February 2011

YE YANG
Affiliation:
Department of Genetics and Biotechnology, Faculty of Agricultural Sciences, Aarhus University, DK-8830 Tjele, Denmark
OLE F. CHRISTENSEN
Affiliation:
Department of Genetics and Biotechnology, Faculty of Agricultural Sciences, Aarhus University, DK-8830 Tjele, Denmark
DANIEL SORENSEN*
Affiliation:
Department of Genetics and Biotechnology, Faculty of Agricultural Sciences, Aarhus University, DK-8830 Tjele, Denmark
*
*Corresponding author. email: Sorensen@humo.dk
Rights & Permissions [Opens in a new window]

Summary

Over recent years, statistical support for the presence of genetic factors operating at the level of the environmental variance has come from fitting a genetically structured heterogeneous variance model to field or experimental data in various species. Misleading results may arise due to skewness of the marginal distribution of the data. To investigate how the scale of measurement affects inferences, the genetically structured heterogeneous variance model is extended to accommodate the family of Box–Cox transformations. Litter size data in rabbits and pigs that had previously been analysed in the untransformed scale were reanalysed in a scale equal to the mode of the marginal posterior distribution of the Box–Cox parameter. In the rabbit data, the statistical evidence for a genetic component at the level of the environmental variance is considerably weaker than that resulting from an analysis in the original metric. In the pig data, the statistical evidence is stronger, but the coefficient of correlation between additive genetic effects affecting mean and variance changes sign, compared to the results in the untransformed scale. The study confirms that inferences on variances can be strongly affected by the presence of asymmetry in the distribution of data. We recommend that to avoid one important source of spurious inferences, future work seeking support for a genetic component acting on environmental variation using a parametric approach based on normality assumptions confirms that these are met.

Information

Type
Research Papers
Copyright
Copyright © Cambridge University Press 2011
Figure 0

Table 1. MC estimates of posterior means and 95% posterior interval (95% P.I.) for λ and ρ with increasing number of repeated records per individual

Figure 1

Table 2. MC estimates of posterior means (first row for each model), 95% posterior intervals (second row for each model) and of 95% Monte Carlo confidence intervals (third row for each model) of chosen parameters based on models M and Mλ, for the rabbit litter size data (top) and on models M and Mλ for the pig data set (bottom).

Figure 2

Figure 1. Top: Monte Carlo estimates of σa2 and σa*2 (left) and σp2 and σp*2 (right) under Mλ for rabbit data. The thick lines represent the prior scaled inverted chi-square densities with degrees of freedom ν=5 and scale parameters S_{\sigma _{a}^{\setnum{2}} } \equals 0{\cdot}492, S_{\sigma _{{a^{ \ast } }}^{\setnum{2}} } \equals 0{\cdot}096, S_{\sigma _{p}^{\setnum{2}} } \equals 0{\cdot}264 and S_{\sigma _{{p^{ \ast } }}^{\setnum{2}} } \equals 0{\cdot}072 under Mλ. Bottom: as above, with ν=5 and S_{\sigma _{a}^{\setnum{2}} } \equals 0{\cdot}124, S_{\sigma _{{a^{ \ast } }}^{\setnum{2}} } \equals 0{\cdot}024, S_{\sigma _{p}^{\setnum{2}} } \equals 0{\cdot}066 and S_{\sigma _{{p^{ \ast } }}^{\setnum{2}} } \equals 0{\cdot}018.

Figure 3

Figure 2. Top: Monte Carlo estimates of σa2 and σa*2 (left) and σp2 and σp*2 (right) under Mλ for pig data. The thick lines represent the prior scaled inverted χ2 densities with degrees of freedom ν=5 and scale parameters S_{\sigma _{a}^{\setnum{2}} } \equals 0{\cdot}972, S_{\sigma _{{a^{ \ast } }}^{\setnum{2}} } \equals 0{\cdot}054, S_{\sigma _{p}^{\setnum{2}} } \equals 0{\cdot}36 and S_{\sigma _{{p^{ \ast } }}^{\setnum{2}} } \equals 0{\cdot}036 under Mλ. Bottom: as above, with ν=5 and scale parameters S_{\sigma _{a}^{\setnum{2}} } \equals 0{\cdot}243, S_{\sigma _{{a^{ \ast } }}^{\setnum{2}} } \equals 0{\cdot}0135, S_{\sigma _{p}^{\setnum{2}} } \equals 0{\cdot}09 and S_{\sigma _{{p^{ \ast } }}^{\setnum{2}} } \equals 0{\cdot}009.

Figure 4

Figure 3. Histograms of posterior predictive realization of T\lpar {\bf z}^{\lpar \lambda \equals \lambda _{\setnum{0}} \rpar } \comma \theta \rpar \minus T\lpar {\bf z}_{rep} \comma \theta \rpar , designed to test residual skewness for the rabbit data (first two from left) and pig data (last two from left). The first and third figures correspond to analyses on the original scale; the second and fourth to analyses on the transformed scale.

Figure 5

Figure 4. Left: plot of the posterior means of the vector a under M (aλ=1) versus the posterior means of the vector a under Mλ (aλ=1·393). Right: plot of the posterior means of the vector a* under M (aλ=1*) versus the posterior means of the vector a* under Mλ (aλ=1·393*) in pig litter size data.

Figure 6

Figure 5. Relationship between variance and mean (expressions (8) and (9)) when only ai changes and ai* is kept constant (top) and when ai* changes and ai is kept constant (bottom), for three values of λ.