Skip to main content
    • Aa
    • Aa
  • Get access
    Check if you have access via personal or institutional login
  • Cited by 39
  • Cited by
    This article has been cited by the following publications. This list is generated based on data provided by CrossRef.

    Knight, Keith 2014. Wiley StatsRef: Statistics Reference Online.

    Neves, Haroldo HR Carvalheiro, Roberto and Queiroz, Sandra A 2012. A comparison of statistical methods for genomic selection in a mice population. BMC Genetics, Vol. 13, Issue. 1, p. 100.

    Wang, Xin Yang, Zefeng and Xu, Chenwu 2015. A comparison of genomic selection methods for breeding value prediction. Science Bulletin, Vol. 60, Issue. 10, p. 925.

    Knight, Keith 2015. Wiley StatsRef: Statistics Reference Online.

    Zhou, Yao Isabel Vales, M. Wang, Aoxue and Zhang, Zhiwu 2016. Systematic bias of correlation coefficient may explain negative accuracy of genomic prediction. Briefings in Bioinformatics, p. bbw064.

    SILLANPÄÄ, MIKKO J. 2011. On statistical methods for estimating heritability in wild populations. Molecular Ecology, Vol. 20, Issue. 7, p. 1324.

    Denis, Marie and Bouvet, Jean-Marc 2013. Efficiency of genomic selection with models including dominance effect in the context of Eucalyptus breeding. Tree Genetics & Genomes, Vol. 9, Issue. 1, p. 37.

    Logsdon, B. A. Gentles, A. J. Miller, C. P. Blau, C. A. Becker, P. S. and Lee, S.-I. 2015. Sparse expression bases in cancer reveal tumor drivers. Nucleic Acids Research, Vol. 43, Issue. 3, p. 1332.

    Desta, Zeratsion Abera and Ortiz, Rodomiro 2014. Genomic selection: genome-wide prediction in plant improvement. Trends in Plant Science, Vol. 19, Issue. 9, p. 592.

    Shariati, Mohammad M Sørensen, Peter and Janss, Luc 2012. A two step Bayesian approach for genomic prediction of breeding values. BMC Proceedings, Vol. 6, Issue. Suppl 2, p. S12.

    Xu, S. Zhu, D. and Zhang, Q. 2014. Predicting hybrid performance in rice using genomic best linear unbiased prediction. Proceedings of the National Academy of Sciences, Vol. 111, Issue. 34, p. 12456.

    Rubanovich, A. V. and Khromov-Borisov, N. N. 2016. Genetic risk assessment of the joint effect of several genes: Critical appraisal. Russian Journal of Genetics, Vol. 52, Issue. 7, p. 757.

    Knight, Keith 2013. Encyclopedia of Environmetrics.

    Li, Hengde Wang, Jingwei and Bao, Zhenmin 2015. A novel genomic selection method combining GBLUP and LASSO. Genetica, Vol. 143, Issue. 3, p. 299.

    Pryce, J.E. Haile-Mariam, M. Verbyla, K. Bowman, P.J. Goddard, M.E. and Hayes, B.J. 2010. Genetic markers for lactation persistency in primiparous Australian dairy cows. Journal of Dairy Science, Vol. 93, Issue. 5, p. 2202.

    Felipe, Vivian PS Okut, Hayrettin Gianola, Daniel Silva, Martinho A and Rosa, Guilherme JM 2014. Effect of genotype imputation on genome-enabled prediction of complex traits: an empirical study with mice data. BMC Genetics, Vol. 15, Issue. 1,

    Usai, M Graziano Carta, Antonello and Casu, Sara 2012. Alternative strategies for selecting subsets of predicting SNPs by LASSO-LARS procedure. BMC Proceedings, Vol. 6, Issue. Suppl 2, p. S9.

    Branicki, Wojciech Liu, Fan van Duijn, Kate Draus-Barini, Jolanta Pośpiech, Ewelina Walsh, Susan Kupiec, Tomasz Wojas-Pelc, Anna and Kayser, Manfred 2011. Model-based prediction of human hair color using DNA variants. Human Genetics, Vol. 129, Issue. 4, p. 443.

    Park, Minsu Kim, Tae-Hun Cho, Eun-Seok Kim, Heebal and Oh, Hee-Seok 2014. Genomic Selection for Adjacent Genetic Markers of Yorkshire Pigs Using Regularized Regression Approaches. Asian-Australasian Journal of Animal Sciences, Vol. 27, Issue. 12, p. 1678.

    Jiménez-Montero, J.A. González-Recio, O. and Alenda, R. 2013. Comparison of methods for the implementation of genome-assisted evaluation of Spanish dairy cattle. Journal of Dairy Science, Vol. 96, Issue. 1, p. 625.


LASSO with cross-validation for genomic selection

  • M. GRAZIANO USAI (a1), MIKE E. GODDARD (a2) (a3) and BEN J. HAYES (a3)
  • DOI:
  • Published online: 01 February 2010

We used a least absolute shrinkage and selection operator (LASSO) approach to estimate marker effects for genomic selection. The least angle regression (LARS) algorithm and cross-validation were used to define the best subset of markers to include in the model. The LASSO–LARS approach was tested on two data sets: a simulated data set with 5865 individuals and 6000 Single Nucleotide Polymorphisms (SNPs); and a mouse data set with 1885 individuals genotyped for 10 656 SNPs and phenotyped for a number of quantitative traits. In the simulated data, three approaches were used to split the reference population into training and validation subsets for cross-validation: random splitting across the whole population; random sampling of validation set from the last generation only, either within or across families. The highest accuracy was obtained by random splitting across the whole population. The accuracy of genomic estimated breeding values (GEBVs) in the candidate population obtained by LASSO–LARS was 0·89 with 156 explanatory SNPs. This value was higher than those obtained by Best Linear Unbiased Prediction (BLUP) and a Bayesian method (BayesA), which were 0·75 and 0·84, respectively. In the mouse data, 1600 individuals were randomly allocated to the reference population. The GEBVs for the remaining 285 individuals estimated by LASSO–LARS were more accurate than those obtained by BLUP and BayesA for weight at six weeks and slightly lower for growth rate and body length. It was concluded that LASSO–LARS approach is a good alternative method to estimate marker effects for genomic selection, particularly when the cost of genotyping can be reduced by using a limited subset of markers.

Corresponding author
*Corresponding author. Settore Genetica e Biotecnologie, AGRIS-Sardegna, Loc. Bonassai, Km 18·6 S. S. Sassari-Fertilia, 07040, Olmedo (SS), Italy. Tel: +39 079387318. Fax: +39-079389450. e-mail:
Linked references
Hide All

This list contains references from the content that can be linked to their source. For a full set of references and notes please see the PDF or HTML where available.

G. de Los Campos , H. Naya , D. Gianola , J. Crossa , A. Legarra , E. Manfredi , K. Weigel & J. M. Cotes (2009). Predicting quantitative traits with regression models for dense molecular markers and pedigree. Genetics 182, 375385.

S. D. Foster , A. P. Verbyla & W. S. Pitchford (2007). Incorporating LASSO effects into a mixed model for quantitative trait loci detection. Journal of Agricultural, Biological and Environmental Statistics 12, 300314.

D. Gianola , R. L. Fernando & A. Stella (2006). Genomic-assisted prediction of genetic value with semiparametric procedures. Genetics 173, 17611776.

D. Gianola , G. de Los Campos , W. G. Hill , E. Manfredi & R. Fernando (2009). Additive genetic variability and the Bayesian alphabet. Genetics 183, 347363.

B. J. Hayes & M. E. Goddard (2001). The distribution of the effects of genes affecting quantitative traits in livestock. Genetics, Selection, Evolution 33, 209229.

A. Legarra , C. Robert-Granie , E. Manfredi & J. M. Elsen (2008). Performance of genomic selection in mice. Genetics 180, 611618.

M. S. Lund , G. Sahana , D. J. de Koning , G. Su & Ö. Carlborg (2009). Comparison of analyses of the QTLMAS XII common dataset I: genomic selection. BMC Proceedings 3, S1.

T. Park & G. Casella (2008). The Bayesian LASSO. Journal of the American Statistical Association 103, 681686.

S. Sanna , A. U. Jackson , R. Nagaraja , C. J. Willer , W. M. Chen , L. L. Bonnycastle , H. Shen , N. Timpson , G. Lettre , G. Usala , P. S. Chines , H. M. Stringham , L. J. Scott , M. Dei , S. Lai , G. Albai , L. Crisponi , S. Naitza , K. F. Doheny , E. W. Pugh , Y. Ben-Shlomo , S. Ebrahim , D. A. Lawlor , R. N. Bergman , R. M. Watanabe , M. Uda , J. Tuomilehto , J. Coresh , J. N. Hirschhorn , A. R. Shuldiner , D. Schlessinger , F. S. Collins , G. Davey Smith , E. Boerwinkle , A. Cao , M. Boehnke , G. R. Abecasis & K. L. Mohlke (2008). Common variants in the GDF5-UQCC region are associated with variation in human height. Nature Genetics 40, 198203.

W. Valdar , L. C. Solberg , D. Gauguier , W. O. Cookson , J. N. P. Rawlins , R. Mott & J. Flint (2006). Genetic and environmental effects on complex traits in mice. Genetics 174, 959984.

N. Yi & S. Xu (2008). Bayesian LASSO for quantitative trait loci mapping. Genetics 179, 1045–55.

Recommend this journal

Email your librarian or administrator to recommend adding this journal to your organisation's collection.

Genetics Research
  • ISSN: 0016-6723
  • EISSN: 1469-5073
  • URL: /core/journals/genetics-research
Please enter your name
Please enter a valid email address
Who would you like to send this to? *