Hostname: page-component-6766d58669-zlvph Total loading time: 0 Render date: 2026-05-16T10:35:37.888Z Has data issue: false hasContentIssue false

Operational uncertainty in machine learning based debris block detection in urban waterways

Published online by Cambridge University Press:  02 March 2026

Christopher Rowlatt
Affiliation:
Institute for Mathematical Innovation, University of Bath, UK
Andrew Paul Barnes
Affiliation:
Computer Science, University of Bath, UK
Simon Dooley
Affiliation:
Cardiff City, UK
Thomas Rodding Kjeldsen*
Affiliation:
University of Bath, UK
*
Corresponding author: Thomas Rodding Kjeldsen; Email: trk23@bath.ac.uk
Rights & Permissions [Opens in a new window]

Abstract

This study investigates the use of machine learning based image classification techniques to detect debris blocking of urban waterways. Using a dataset comprising 1089 labelled CCTV images of a trash screen located in Cardiff, UK and a comprehensive re-sampling approach, we investigate not only the ability of selected machine learning algorithms to correctly identify images, but also to evaluate the uncertainty of these algorithms conditional on the datasets presented to them. For each candidate model, we considered two datasets: an imbalanced dataset and an under-sampled dataset. The results demonstrate that the performance of a simple logistic regression model was broadly comparable to that of more advanced machine learning models such as vision transformers. The best performing models (vision transformers and logistic regression) achieved an accuracy of more than 80%, while the NetRes50 model achieved an accuracy in the low 70%. This is an important result that opens the possibility for implementing these techniques as part of an operational real-time flood warning system utilising already existing cameras.

Information

Type
Research Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2026. Published by Cambridge University Press
Figure 0

Figure 1. Catchment of the Nant Y Forest watercourse (red polyline) and the location of the CCTV installation in Tongwynlais (red triangle). Examples image classification labels of (a) low risk, (b) high risk and (c) unknown risk.

Figure 1

Figure 2. Demonstration of the (b) cropping and (c) reduction and normalisation of an (a) original image.

Figure 2

Table 1. Summary of imbalanced and under-sampled datasets

Figure 3

Figure 3. Boxplot of balanced accuracy score for the imbalanced and under-sampled datasets for each model.

Figure 4

Figure 4. Boxplot of precision score of the high-risk category for the imbalanced and under-sampled datasets for each model.

Figure 5

Figure 5. Boxplot of recall score of the high-risk category for the imbalanced and under-sampled datasets for each model.

Figure 6

Figure 6. Boxplot of the percentage of images labelled as high risk of future blockage that are classified as low risk for the imbalanced and under-sampled datasets for each model.

Figure 7

Figure 7. Confusion matrices for the imbalanced (b, right) and under-sampled (a, left) datasets for the LogReg model. The numerical values given are the percentage of the total number of images in each category. The accuracy score is given at the top of each plot.

Figure 8

Figure 8. Confusion matrices for the imbalanced (b, right) and under-sampled (a, left) datasets for the ResNet50 model. The numerical values given are the percentage of the total number of images in each category. The accuracy score is given at the top of each plot.

Figure 9

Figure 9. Confusion matrices for the imbalanced (b, right) and under-sampled (a, left) datasets for the ViT-B-16 model. The numerical values given are the percentage of the total number of images in each category. The accuracy score is given at the top of each plot.

Figure 10

Figure 10. Confusion matrices for the imbalanced (b, right) and under-sampled (a, left) datasets for the ViT-L-16 model. The numerical values given are the percentage of the total number of images in each category. The accuracy score is given at the top of each plot.

Figure 11

Table A1. Default hyperparameters for the logistic regression model of SK-learn (Pedregosa et al. 2011)

Figure 12

Table A2. Default hyperparameter values for the MLP, ResNet50 and ViT models of PyTorch (Paszke et al. 2019)

Author comment: Operational uncertainty in machine learning based debris block detection in urban waterways — R0/PR1

Comments

We are pleased to submit our manuscript on the use of machine learning models for detecting debris blocking of urban rivers from CCTV images. The use of AI/ML models for real-time management of flood risk and vital – yet unassuming and often overlooked - infrastructure components such as culverts is a mostly unexplored area. However, mitigating flood risk as well as securing the health and safety of maintenance crews tasked with cleaning these rivers are important challenges to cash-strapped city authorities across the world; and challenges that will be exacerbated by the twin challenges of climate change and increasing urbanisation. Thus, by demonstrating the successful and robust performance of AI/ML model applied to CCTV images not necessarily collected for the purpose of modelling has the potential for unlocking wider interest in the application of ML/AI to address flood risk management problems.

Review: Operational uncertainty in machine learning based debris block detection in urban waterways — R0/PR2

Conflict of interest statement

I do not have competing interests.

Comments

There should be a clear tabular description of the datasets.

There should be a description of how datasets were labeled into three categories.

The machine learning problem formulation is weak.

There is a lack of description available to explain how the input space was prepared for the ML models.

There is no description of the model’s robustness.

There are no guardrails in place to prevent models from being overfitting.

A literature review suggests that there have been works on this application. A comparative analysis should be made with the other works. The literature review appears to lack a diversity of works done in this application space.

The nobility of this work is in question, as works in this application space have already been done. There are similar works has been done on trash screen blockage classification. Apart from trying several machine learning methods how this work brings an unclear novel continuation.

Review: Operational uncertainty in machine learning based debris block detection in urban waterways — R0/PR3

Conflict of interest statement

Reviewer declares none.

Comments

Dear authors,

Thank you for submitting this manuscript. I think that your work is tackling an important issue in an interesting way, and the paper is well structured and easy to read. However, while the approach is interesting, there are two big weaknesses that I see in your paper:

(1) The fact that you only use images from a single camera make the results lack in genericity. From these experiments, it is hard to draw a strong conclusion on how your model(s) would behave on new cameras (and fields of view) without needing any kind of retraining, which I assume is the intended application.

(2) The decision to not perform any kind of hyperparameter tuning raise some concerns about the reliability of your results. CNNs and ViTs particularly are known to need proper fine-tuning to work at their best.

My decision is ‘Minor Revision’ as I think that (1) can be addressed relatively easily in your Discussion/Conclusion section, and (2) can be easily addressed with additional experiments for which you already have the code.

I have added my additional comments below.

Good luck with the paper.

P5 - 84 : I don’t think that Streftaris et al & Vandaele et al. use the same dataset.

P7-8 - 118-120 : the standardization process could be explained more clearly

P8 - 128-136 : Have you explored upsampling instead of downsampling? You could try to augment the classes with less observations using typical data augmentation techniques.

P9-10 - Machine learning algorithms section : You could add a paragraph about the logistic regression model. Also, for the MLP and LR model, do you use all the image pixels, or do you preprocess the image (downscaling, feature extraction,...)? This is worth mentioning even if you don’t.

P11 : Evaluation section: the metrics formulas could be added to help the readers

P12 - 216 : Can you introduce the IQR acronym?

P11-21 Results section:

- As I said in my general comments, while this results section is well outlined, I think that if you are not considering hyper-parameter tuning with a training/validation set from a single camera, you are really narrowing the scope of your observations.

- I think that you should also analyze the images that were incorrectly detected. Can you explain why you think your models failed at classifying? I am a bit surprised by the relatively low Balanced Accuracy scores reported. From the images shown and given the fact you only have one camera, I would have expected results close to 100%, but maybe the problem is inherently hard.

P21-22 Conclusion section: as I commented above, I think that you should also explain what you think the current experiments say about the future practical implementation of your models.

Recommendation: Operational uncertainty in machine learning based debris block detection in urban waterways — R0/PR4

Comments

The manuscript is evaluating the uncertainty associated with the ML model for detecting debris blockage on trash screen. The application could support more effective management of drainage system. The Reviewers have assessed the manuscript and provided suggestions for improving the quality of the manuscript. Please revise the manuscript to address the comments raised by the Reviewers.

Decision: Operational uncertainty in machine learning based debris block detection in urban waterways — R0/PR5

Comments

No accompanying comment.

Author comment: Operational uncertainty in machine learning based debris block detection in urban waterways — R1/PR6

Comments

As requested we have added: ‘Author Contribution Statement’, ‘Financial Support’, ‘Conflict of Interest Statement’.

Figures uploaded as individual files. Note multiple figures are composed of an a and b image.

Review: Operational uncertainty in machine learning based debris block detection in urban waterways — R1/PR7

Conflict of interest statement

Reviewer declares none.

Comments

Dear authors,

Thank you for your revisions and addressing my comments. I have reviewed the updated version of the manuscript, and I am satisfied with the clarifications and changes that you have made.

Based on these improvements, I recommend to accept this paper.

Best regards

Review: Operational uncertainty in machine learning based debris block detection in urban waterways — R1/PR8

Conflict of interest statement

Reviewer declares none.

Comments

Review of the manuscript “Operational uncertainty in machine-learning based debris

block detection in urban waterways”

This paper examines how machine-learning image classification models can detect debris blocking in urban waterways using CCTV footage. Using 1089 labeled images from a trash screen in Cardiff, the study compares different algorithms on both imbalanced and undersampled datasets. It evaluates not only accuracy but also model uncertainty. Results show that simple models perform nearly as well as advanced ones, with top models exceeding 80% accuracy, supporting the potential use of such methods in real-time flood warning systems.

The authors should carefully review the paper to address the provided comments. The comments are as below:

1. The abstract states that the best-performing model achieved 80% accuracy, but it does not specify which model reached this performance.

2. The dataset includes only 1,095 CCTV images, which is relatively small for training deep learning models. This limitation should be highlighted more clearly, as it significantly affects the overall performance and generalizability of the results.

3. The manuscript mentions that each image is cropped to focus on the front of the trash screen. The authors should clarify whether this cropping was performed manually or automatically, as this has direct implications for the automation and scalability of the monitoring system.

4. The authors reduced image resolution to 224×224 pixels. This substantial down sampling may limit the extraction of detailed visual features, particularly for models like ViT. The rationale for this choice should be explained in more detail

5. Deep models such as ResNet50 and ViT were trained for only 10 epochs, which may be insufficient for convergence or optimal performance. The authors should justify why only 10 epochs were used and consider discussing whether additional training could improve the results.

Recommendation: Operational uncertainty in machine learning based debris block detection in urban waterways — R1/PR9

Comments

The authors have revised the manuscript to address the Reviewers' comments in the previous round. Nevertheless, there are still several major concerns pointed out by the Reviewers that require further improvement.

Decision: Operational uncertainty in machine learning based debris block detection in urban waterways — R1/PR10

Comments

No accompanying comment.

Author comment: Operational uncertainty in machine learning based debris block detection in urban waterways — R2/PR11

Comments

No accompanying comment.

Recommendation: Operational uncertainty in machine learning based debris block detection in urban waterways — R2/PR12

Comments

The authors have revised the manuscript to address the comments raised by the Reviewers. The manuscript is now in a good shape for the publication in the journal.

Decision: Operational uncertainty in machine learning based debris block detection in urban waterways — R2/PR13

Comments

No accompanying comment.