Hostname: page-component-5db58dd55d-8mwbx Total loading time: 0 Render date: 2026-06-01T05:32:24.059Z Has data issue: false hasContentIssue false

Evaluating hyperspectral and machine learning approaches to classify biocontrol-induced damage on water hyacinth (Pontederia crassipes)

Published online by Cambridge University Press:  09 January 2026

Usman Mohammed
Affiliation:
Institute of Food and Agricultural Sciences, Indian River Research and Education Center, Fort Pierce, FL, USA
Stephen Lantin
Affiliation:
Department of Agricultural and Biological Engineering, University of Florida, Gainesville, FL, USA
Moses Chilenje
Affiliation:
Department of Agricultural and Biological Engineering, University of Florida, Gainesville, FL, USA
Aditya Singh
Affiliation:
Department of Agricultural and Biological Engineering, University of Florida, Gainesville, FL, USA
Carey Minteer*
Affiliation:
Institute of Food and Agricultural Sciences, Indian River Research and Education Center, Fort Pierce, FL, USA
*
Corresponding author: Carey Minteer; Email: c.minteerkillian@ufl.edu
Rights & Permissions [Opens in a new window]

Abstract

Water hyacinth (Pontederia crassipes Mart.) is a free-floating aquatic plant native to South America that has spread to nearly 50 countries, becoming one of the world’s most invasive aquatic weeds. In Florida, the biocontrol agents Neochetina eichhorniae and Neochetina bruchi were released in 1970s, while Megamelus scutellaris was released in 2010. Assessing the impact of these biocontrol agents is crucial in evaluating efficacy, distribution, and overall progress in management efforts. The traditional survey and monitoring methods used to evaluate the impact of biocontrol present numerous challenges in data acquisition, especially in remote areas and aquatic habitats. This study aimed to detect damage caused by Neochetina spp. and M. scutellaris on P. crassipes using hyperspectral remote sensing. Plants were exposed to varying levels of Neochetina spp. and M. scutellaris herbivory for 2 and 4 wk under laboratory conditions. After the exposure period, the plants were scanned using a visible and near-infrared hyperspectral imaging system. Two classification algorithms, partial least-squares discriminant analysis (PLS-DA) and support vector machine (SVM) were employed for classification. SVM achieved high classification accuracy at both low and high damage levels, with overall training and validation accuracies of 84.9% and 78.79%, respectively, while PLS-DA only achieved high classification accuracy at high damage levels, with overall training and validation accuracies of 56.3% and 60.38%. Based on the observed performance metrics, both algorithms demonstrated improved classification accuracy as damage increased over time. The results indicated that hyperspectral remote sensing can be used to monitor and assess biocontrol agents damage on P. crassipes.

Information

Type
Research Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2026. Published by Cambridge University Press on behalf of Weed Science Society of America
Figure 0

Table 1. Treatments with number of adults released per planta.

Figure 1

Figure 1. Image processing flowchart, from data acquisition and processing through model training and validation. PLS-DA, partial least-squares discriminant analysis; SPOT, Scanning Plant IoT Facility; SVM, support vector machine.

Figure 2

Figure 2. Damaged Pontederia crassipes plants exposed for 2 wk to varying levels of Megamelus scutellaris, Neochetina spp., and their combination (MN). Note that an image of the same plant is shown in the control row for illustrative purposes.

Figure 3

Figure 3. Damaged Pontederia crassipes plants exposed for 4 wk to varying levels of Megamelus scutellaris, Neochetina spp., and their combination (MN). Note that an image of the same plant is shown in the control row for illustrative purposes.

Figure 4

Figure 4. Spectral reflectance of Megamelus scutellaris and Neochetina spp. damage on Pontederia crassipes. Left: 2 wk of exposure; right: 4 wk of exposure. MN, combined treatment of M. scutellaris and Neochetina spp.

Figure 5

Table 2. Partial least-squares discriminant analysis (PLS-DA) and support vector machine (SVM) training and validation accuracy for 2 and 4 wk of exposure and the general modela.

Figure 6

Table 3. Per-class precision, recall, and F1 scores for partial least-squares discriminant analysis (PLS-DA) models based on 2 and 4 wk of exposure and the general modela.

Figure 7

Table 4. Per-class precision, recall, and F1 scores for support vector machine (SVM) models based on 2 and 4 wk of exposure and the general modela.

Figure 8

Figure 5. Training and validation accuracy vs. number of principal component analysis (PCA) components for 2 wk exposure: (A) partial least-squares discriminant analysis (PLS-DA) and (B) support vector machine (SVM).

Figure 9

Figure 6. Training and validation accuracy vs. number of principal component analysis (PCA) components for 4 wk of exposure: (A) partial least-squares discriminant analysis (PLS-DA) and (B) support vector machine (SVM).

Figure 10

Figure 7. Training and validation accuracy vs. number of principal component analysis (PCA) components for the general models: (A) partial least-squares discriminant analysis (PLS-DA) and (B) support vector machine (SVM). The general model integrates both datasets for a comprehensive accuracy assessment.

Figure 11

Figure 8. Score plots for 2 wk of exposure: (A) partial least squares (PLS) and (B) principal component analysis (PCA).

Figure 12

Figure 9. Score plots for 4 wk of exposure: (A) partial least squares (PLS) and (B) principal component analysis (PCA).

Figure 13

Figure 10. Score plots for the general model: (A) partial least squares (PLS) and (B) principal component analysis (PCA). The general model integrates both datasets for a comprehensive accuracy assessment.