Hostname: page-component-6766d58669-88psn Total loading time: 0 Render date: 2026-05-14T20:56:29.791Z Has data issue: false hasContentIssue false

Least absolute shrinkage and selection operator regression used to select important features when predicting wheat yield from various genotype groups

Published online by Cambridge University Press:  06 November 2024

Muhuddin Rajin Anwar*
Affiliation:
NSW Department of Primary Industries, Wagga Wagga Agricultural Institute, Wagga Wagga, NSW, Australia Gulbali Institute (Agriculture, Water and Environment), Charles Sturt University, Wagga Wagga, NSW, Australia
Livinus Emebiri
Affiliation:
NSW Department of Primary Industries, Wagga Wagga Agricultural Institute, Wagga Wagga, NSW, Australia
Ryan H. L. Ip
Affiliation:
Department of Mathematical Sciences, Auckland University of Technology, Auckland, New Zealand School of Computing and Mathematics, Charles Sturt University, Wagga Wagga, NSW, Australia
David J. Luckett
Affiliation:
Gulbali Institute (Agriculture, Water and Environment), Charles Sturt University, Wagga Wagga, NSW, Australia
Yashvir S. Chauhan
Affiliation:
Department of Agriculture and Fisheries (DAF), Kingaroy, QLD, Australia
Ketema T. Zeleke
Affiliation:
Gulbali Institute (Agriculture, Water and Environment), Charles Sturt University, Wagga Wagga, NSW, Australia School of Agricultural, Environmental and Veterinary Sciences, Charles Sturt University, Wagga Wagga, NSW, Australia
*
Corresponding author: Muhuddin Rajin Anwar; Email: muhuddin.anwar@dpi.nsw.gov.au

Abstract

Bread wheat and durum wheat genotypes were grown in field experiments at two locations in New South Wales, Australia across several years and using two sowing times (‘early’ v. ‘late’). Genotypes were grouped based on genetic similarity. Grain yield, grain size, soil characteristics and daily weather data were collected. The weather data were used to calculate water and heat stress indices for four key growth periods around flowering. Least absolute shrinkage and selection operator (LASSO) was used to predict grain yield and to identify the most influential features (a combination of index and growth period). A novel approach involving the crop water supply–demand ratio effectively summarized water relations during growth. LASSO predicted grain yield quite well (adjusted R2 from 0.57 to 0.98), especially in a set of durum genotypes. However, the addition of other important variables such as lodging score, disease incidence, weed incidence and insect damage could have improved modelling results. Growth period 2 (30 days pre-flowering up to flowering) was the most sensitive for yield loss from heat stress and water stress for most features. Although one group of bread wheat genotypes was more sensitive to water stress (drought) in period 3 (20 days pre-flowering to 10 days post-flowering). Evapotranspiration was a significant positive feature but only in the vegetative phase (pre-flowering, period 1). This study confirms the usefulness of LASSO modelling as a technique to make predictions that could be used to identify genotypes that are suitable candidates for further investigation by breeders for their stress-tolerance ability.

Information

Type
Crops and Soils Research Paper
Copyright
Copyright © The Author(s), 2024. Published by Cambridge University Press

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

Article purchase

Temporarily unavailable

Supplementary material: File

Anwar et al. supplementary material

Anwar et al. supplementary material
Download Anwar et al. supplementary material(File)
File 21.5 KB