Hostname: page-component-89b8bd64d-dvtzq Total loading time: 0 Render date: 2026-05-06T16:13:53.276Z Has data issue: false hasContentIssue false

Deep learning–based detection and quantification of weed seed mixtures

Published online by Cambridge University Press:  21 November 2024

Shahbaz Ahmed*
Affiliation:
Postdoctoral Researcher, Department of Crop and Soil Sciences, Washington State University, Pullman, WA, USA
Samuel R. Revolinski
Affiliation:
Assistant Professor, University of Kentucky, Department of Plant & Soil Science, Lexington, KY, 40503
P. Weston Maughan
Affiliation:
Graduate Research Assistant, Department of Crop and Soil Sciences, Washington State University, Pullman, WA, USA
Marija Savic
Affiliation:
Research Associate, Department of Crop and Soil Sciences, Washington State University, Pullman, WA, USA
Jessica Kalin
Affiliation:
Research Associate, Department of Crop and Soil Sciences, Washington State University, Pullman, WA, USA
Ian C. Burke
Affiliation:
Professor, Department of Crop and Soil Sciences, Washington State University, Pullman, WA, USA
*
Corresponding author: Shahbaz Ahmed; Email: shahbaz.ahmed@wsu.edu
Rights & Permissions [Opens in a new window]

Abstract

Knowledge of weed seeds present in the soil seedbank is important for understanding population dynamics and forecasting future weed infestations. Quantification of the weed seedbank has historically been laborious, and few studies have attempted to quantify seedbanks on the scale required to make management decisions. An accurate, efficient, and ideally automated method to identify weed seeds in field samples is needed. To achieve sufficient precision, we leveraged YOLOv8, a machine learning object detection to accurately identify and count weed seeds obtained from the soil seedbank and weed seed collection. The YOLOv8 model, trained and evaluated using high-quality images captured with a digital microscope, achieved an overall accuracy and precision exceeding 80% confidence in distinguishing various weed seed species in both images and real-time videos. Despite the challenges associated with species having similar seed morphology, the application of YOLOv8 will facilitate rapid and accurate identification of weed seeds for the assessment of future weed management strategies.

Information

Type
Rapid Communication
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2024. Published by Cambridge University Press on behalf of Weed Science Society of America
Figure 0

Table 1. List of species with instances used in a study to develop a machine learning program for weed seed identification.

Figure 1

Table 2. Example of bounding box data from.txt output generated by labelImg during labeling process.

Figure 2

Figure 1. labelImg-generated bounding boxes drawn around weed seeds belonging to different classes. Each colored box and number corresponds to a distinct class of weeds. Bounding boxes aid in identifying and localizing weed seeds within images, assisting in the accurate classification and detection of various weed species. Understanding the representation of bounding boxes is fundamental for interpreting object detection results.

Figure 3

Figure 2. The precision–recall curve generated during evaluation of weed species detection model. The x axis represents the recall, which measures the proportion of true positive detections out of all actual positive instances. The y axis represents precision, indicating the proportion of true positive detections out of all predicted positive instances. Each point on the curve corresponds to a different threshold used for classification, with the curve representing the trade-off between precision and recall. A higher area under the curve (AUC) indicates improving model performance in distinguishing between weed species. Abbreviations: LOLMU, Lolium perenne L. ssp. multiflorum (Lam.) Husnot; SINAR, Sinapis arvensis (L.) Andrz. ex Besser; GALAP, Galium aparine L.; DESSO-SSYAL, Descurainia sophia (L.) Webb ex Prantl/Sisymbrium altissimum L.; CHEAL, Chenopodium album L.; ANTCO, Anthemis cotula L.; KCHSC, Bassia scoparia (L.) A.J. Scott; LACSE, Lactuca serriola L.; AMARE, Amaranthus spp.; VLPMY, Vulpia myuros (L.) C.C. Gmel.; BROTE, Bromus tectorum L.; SASKT, Salsola tragus (L.) Scop.; Brassica spp.; TRZAX, Triticum aestivum L.; LAMAM, Lamium spp.

Figure 4

Figure 3. Evolution of performance metric precision, recall, mean average precision calculated at an intersection over union threshold of 0.50 (mAP50), and the average of the mean average precision calculated at varying intersection over union thresholds, ranging from 0.50 to 0.95 (mAP50-95) over the course of training epochs from 0 to 100. The x axis represents each epoch, indicating the progress of the model during training. The y axis showcases the improvement in performance metrics with each epoch, providing insights into how the model’s accuracy, recall rate, and mean average precision vary throughout the training process. Understanding these metrics is crucial for assessing the effectiveness and progress of the object detection model.

Figure 5

Figure 4. Example detection results generated by the model, with bounding boxes drawn around different weed seeds. Each bounding box indicates the location and size of a detected seed with a confidence score. The confidence score is a measure of the model’s certainty in its prediction, ranging from 0 to 1, with 1 indicating complete certainty. In this case, for example, a score of 0.95 suggests a very high level of confidence in the accuracy of the detection and identification.

Figure 6

Figure 5. Normalized confusion matrix of seed classification. Rows represent predictions of species while columns represent the true species. Total incidences of miss classification are divided by the number of incidences of each class. Abbreviations: LOLMU, Lolium perenne L. ssp. multiflorum (Lam.) Husnot; SINAR, Sinapis arvensis (L.) Andrz. ex Besser; GALAP, Galium aparine L.; DESSO-SSYAL, Descurainia sophia (L.) Webb ex Prantl/Sisymbrium altissimum L.; CHEAL, Chenopodium album L.; ANTCO, Anthemis cotula L.; KCHSC, Bassia scoparia (L.) A.J. Scott; LACSE, Lactuca serriola L.; LACSE-SONOL, Lactuca serriola L./Sonchus oleraceus L.; AMARE, Amaranthus spp.; VLPMY, Vulpia myuros (L.) C.C. Gmel.; BROTE, Bromus tectorum L.; SASKT, Salsola tragus (L.) Scop.; Brassica spp.; TRZAX, Triticum aestivum L.; LAMAM, Lamium spp.; SOLDU, Solanum spp.; CIRVU Cirsium vulgare (Savi) Ten.