Hostname: page-component-89b8bd64d-r6c6k Total loading time: 0 Render date: 2026-05-07T00:52:46.512Z Has data issue: false hasContentIssue false

Streamflow prediction using artificial neural networks and soil moisture proxies

Published online by Cambridge University Press:  16 January 2025

Robert Edwin Rouse*
Affiliation:
Department of Engineering, University of Cambridge, Cambridge CB2 1PZ, UK
Doran Khamis
Affiliation:
UK Centre for Ecology & Hydrology, Wallingford OX10 8BB, UK
Scott Hosking
Affiliation:
British Antarctic Survey, Cambridge CB3 0ET, UK The Alan Turing Institute, London NW1 2DB, UK
Allan McRobie
Affiliation:
Department of Engineering, University of Cambridge, Cambridge CB2 1PZ, UK
Emily Shuckburgh
Affiliation:
Department of Computer Science and Technology, University of Cambridge, Cambridge CB3 0FD, UK
*
Corresponding author: Robert Edwin Rouse; Email: rer44@cam.ac.uk

Abstract

Machine learning models have been used extensively in hydrology, but issues persist with regard to their transparency, and there is currently no identifiable best practice for forcing variables in streamflow or flood modeling. In this paper, using data from the Centre for Ecology & Hydrology’s National River Flow Archive and from the European Centre for Medium-Range Weather Forecasts, we present a study that focuses on the input variable set for a neural network streamflow model to demonstrate how certain variables can be internalized, leading to a compressed feature set. By highlighting this capability to learn effectively using proxy variables, we demonstrate a more transferable framework that minimizes sensing requirements and that enables a route toward generalizing models.

Information

Type
Application Paper
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Open Practices
Open materials
Copyright
© The Author(s), 2025. Published by Cambridge University Press
Figure 0

Figure 1. Graphical representation of a single neuron unit and an arbitrary neural network with $ l $ layers.

Figure 1

Figure 2. Elevation, land use, and geology maps for the three study catchments; corresponding keys represent the proportion of each catchment that falls under the respective subcategories. Note that the catchments are shown at different scales for visibility. Adapted from the National River Flow Archive (UK Centre for Ecology and Hydrology, 2022).

Figure 2

Table 1. Pearson correlation coefficients between soil moisture-level variables and antecedent proxies, averaged over the three study catchments

Figure 3

Table 2. MLP model performance in terms of Nash-Sutcliffe efficiency for meteorological input sequence length of 28 days (28), 7 days (7), 7 days with soil moisture (7 + SM), and 7 days with proxy variables (7 + P) (bold typeface indicates highest performance)

Figure 4

Figure 3. Predictions against observations for the Severn at Haw Bridge from the test set of predictions generated using different feature sets as inputs to the MLP model.

Figure 5

Figure 4. Comparative streamflow time series for the Severn at Haw Bridge in the year, 2012, with both predictions and observations using different feature sets as inputs to the MLP model.

Figure 6

Figure 5. Violin plots of percentage relative error for each of the 10$ {}^{th} $ percentile bands of flow magnitude (with the upper bound marked on the scale and the lower bound being the preceding upper bound to the left) between observations and predictions for the Severn at Haw Bridge using different feature sets as inputs to the MLP model.

Figure 7

Table 3. Average model sensitivity to individual soil moisture and antecedent proxy inputs using positive and negative perturbations for the two different model setups

Figure 8

Figure 6. Average network sensitivity to daily meteorological variables under a positive perturbation (left) and negative perturbation (right), with logarithmic scales on the y axis.

Supplementary material: File

Rouse et al. supplementary material

Rouse et al. supplementary material
Download Rouse et al. supplementary material(File)
File 3.2 MB

Author comment: Streamflow prediction using artificial neural networks and soil moisture proxies — R0/PR1

Comments

Dear Professor Monteleoni,

We wish to submit an original research article, entitled Streamflow Prediction Using Artificial Neural Networks & Soil Moisture Proxies, for consideration by Environmental Data Science. We confirm that this work is original and has not been published elsewhere, nor is it currently under consideration for publication elsewhere.

In this paper, we demonstrate how internal variables of hydrological systems can be represented within the feature set of a machine learning model, resulting in a modelling approach more easily applied to different geographies where data availability might not support complex modelling approaches whilst also enabling practitioners to hold a higher level of confidence in machine learning models.

Given the potential impacts of climate change, both in terms of flooding and water resources, being able to rapidly develop modelling approaches with greater global applicability and, subsequently, to generate projections of said impacts under different scenarios is of high importance and likely to be of interest to hydrologists and decision makers around the world. This paper helps to provide a parsimonious modelling foundation that we might hope to build upon to generalise between catchments and generate hydrological predictions in the absence of data.

We have no conflicts of interest to disclose. Please address all correspondence concerning this manuscript to me at rer44@cam. ac.uk.

Thank you for your consideration of this manuscript.

Yours Sincerely,

Robert Edwin Rouse,

Doran Khamis,

Scott Hosking,

Allan McRobie,

Emily Shuckburgh

Review: Streamflow prediction using artificial neural networks and soil moisture proxies — R0/PR2

Conflict of interest statement

Reviewer declares none.

Comments

This is interesting study where authors show that incorporation of soil moisture and antecedent proxies in a machine learning model (e.g., MLP) may enhance forecasting accuracy, and it may be applicable for operational forecasting. However, authors have assumed stationarity in watershed characteristics, and utilized mostly reanalysis datasets for meteorological parameters (spatial resolution is not mentioned). They have not accounted uncertainty in their forecasting. for overestimation and underestimation of peak flows were not discussed properly.

Review: Streamflow prediction using artificial neural networks and soil moisture proxies — R0/PR3

Conflict of interest statement

Reviewer declares none.

Comments

This paper addresses challenges in hydrological modeling with Machine Learning (ML), particularly focusing on transparency and variable selection. Utilizing data from the National River Flow Archive and weather forecasts, the study refines input variables for a neural network streamflow model, specifically a Multi-Layer Perceptron (MLP). It demonstrates how certain variables, such as soil moisture, can be internalized, thereby reducing the feature set. By showcasing effective learning with proxy variables, the paper proposes a transferable framework that minimizes data requirements and facilitates model generalization. This approach holds promise for developing universally applicable hydrological ML models capable of assessing the impacts of climate change on global hydrological systems.

The paper aligns with the journal’s scope by exploring Machine Learning (ML) techniques in hydrological modeling, which involves novel data sources and data-intensive methods. The findings not only advance understanding of ML applications in hydrological modeling but also offer solutions for improving model generalization and reducing data requirements, making it suitable for publication in EDS. The paper exhibits a good level of clarity in writing and organization. It is well-structured, logically presented, and written in a clear and concise manner, facilitating easy comprehension of the research content. The authors effectively adhere to the page limit, ensuring the paper efficiently conveys its research findings without unnecessary repetition or verbosity.

Detailed Suggestions:

Incorporating a study location figure would be beneficial for readers to understand the catchment properties and terrain of the study area. Since the paper did not discuss the terrain of the catchments or its potential impact on the model results, a visual representation of the study area’s topography and catchment boundaries would provide valuable context. This could help readers assess the relevance of the study findings to other hydrological settings with similar terrain characteristics and identify potential limitations or challenges associated with extrapolating the model to different geographic regions.

Provide a detailed explanation in the methodology section regarding why the Multi-Layer Perceptron (MLP) model was chosen over other potential models for streamflow prediction. Consider discussing the specific advantages of MLP in this context, such as its ability to capture nonlinear relationships or its performance in similar hydrological modeling studies. Additionally, discussing any limitations or drawbacks of alternative models that led to their exclusion could enhance the justification for selecting MLP.

While Root Mean Square Error (RMSE) and Nash-Sutcliffe Efficiency (NSE) are commonly used metrics for model evaluation, consider incorporating additional metrics to provide a more comprehensive assessment of model performance. For instance, metrics like the Kling-Gupta Efficiency or Percent Bias can offer insights into different aspects of the model’s predictive accuracy and bias. Including a wider range of metrics would enhance the robustness of the model evaluation and increase confidence in the reported results.

Recommendation: Streamflow prediction using artificial neural networks and soil moisture proxies — R0/PR4

Comments

The two reviewers find the paper of interest but have a few suggestions. Reviewer 1 raises points related to stationarity and uncertainty. At the very least, the authors need to discuss these in the paper to the satisfaction of this reviews. Reviewer 2 has substantive comments regarding developing a detailed case study, describing the specifics of the ML models, and adapting standard metrics from the literature. The authors need to address these comments in depth.

Decision: Streamflow prediction using artificial neural networks and soil moisture proxies — R0/PR5

Comments

No accompanying comment.

Author comment: Streamflow prediction using artificial neural networks and soil moisture proxies — R1/PR6

Comments

Dear Professor Monteleoni

REF: Manuscript - Streamflow Prediction Using Artificial Neural Networks & Soil Moisture Proxies

We wish to submit an original research article, entitled Streamflow Prediction Using Artificial Neural Networks & Soil Moisture Proxies, for consideration by Environmental Data Science. We confirm that this work is original and has not been published elsewhere, nor is it currently under consideration for publication elsewhere.

In this paper, we demonstrate how internal variables of hydrological systems can be represented within the feature set of a machine learning model, resulting in a modelling approach more easily applied to different geographies where data availability might not support complex modelling approaches whilst also enabling practitioners to hold a higher level of confidence in machine learning models.

Given the potential impacts of climate change, both in terms of flooding and water resources, being able to rapidly develop modelling approaches with greater global applicability and, subsequently, to generate projections of said impacts under different scenarios is of high importance and likely to be of interest to hydrologists and decision makers around the world. This paper helps to provide a parsimonious modelling foundation that we might hope to build upon to generalise between catchments and generate hydrological predictions in the absence of data.

We have no conflicts of interest to disclose. Please address all correspondence concerning this manuscript to me at rer44@cam. ac.uk.

Thank you for your consideration of this manuscript.

Yours Sincerely,

Robert Edwin Rouse, Doran Khamis, Scott Hocking, Allan McRobie, Emily Shuckburgh

Review: Streamflow prediction using artificial neural networks and soil moisture proxies — R1/PR7

Conflict of interest statement

Reviewer declares none.

Comments

I am satisfied with the authors' responses and the revisions made to the manuscript. The inclusion of a study location figure and detailed commentary on catchment characteristics significantly enhances the contextual understanding for readers. The expanded explanation for choosing the Multi-Layer Perceptron model and the addition of a broader range of evaluation metrics provide a more comprehensive justification and assessment of the model’s performance. These improvements address my concerns effectively and contribute to the overall quality and clarity of the paper.

Review: Streamflow prediction using artificial neural networks and soil moisture proxies — R1/PR8

Conflict of interest statement

Reviewer declares none.

Comments

This revised version is better in terms of clarity and details. Since scope of this paper is to develop a reliable ML model for hydrological forecasting and authors have not accounted uncertainty in model prediction, I encourage to add ‘Limitation of Study/ or Future Scope’ in the manuscript. Their existing model is unable to forecast peak and base flow discharge between 7 to 28 days. Therefore, such section will strengthen the acceptability of the developed model.

Recommendation: Streamflow prediction using artificial neural networks and soil moisture proxies — R1/PR9

Comments

No accompanying comment.

Decision: Streamflow prediction using artificial neural networks and soil moisture proxies — R1/PR10

Comments

No accompanying comment.

Author comment: Streamflow prediction using artificial neural networks and soil moisture proxies — R2/PR11

Comments

Dear Professor Monteleoni,

REF: Manuscript - Streamflow Prediction Using Artificial Neural Networks & Soil Moisture Proxies

We wish to submit an original research article, entitled Streamflow Prediction Using Artificial Neural Networks & Soil Moisture Proxies, for consideration by Environmental Data Science. We confirm that this work is original and has not been published elsewhere, nor is it currently under consideration for publication elsewhere.

In this paper, we demonstrate how internal variables of hydrological systems can be represented within the feature set of a machine learning model, resulting in a modelling approach more easily applied to different geographies where data availability might not support complex modelling approaches whilst also enabling practitioners to hold a higher level of confidence in machine learning models.

Given the potential impacts of climate change, both in terms of flooding and water resources, being able to rapidly develop modelling approaches with greater global applicability and, subsequently, to generate projections of said impacts under different scenarios is of high importance and likely to be of interest to hydrologists and decision makers around the world. This paper helps to provide a parsimonious modelling foundation that we might hope to build upon to generalise between catchments and generate hydrological predictions in the absence of data.

We have no conflicts of interest to disclose. Please address all correspondence concerning this manuscript to me at rer44@cam. ac.uk.

Thank you for your consideration of this manuscript.

Yours Sincerely,

Robert Edwin Rouse, Doran Khamis, Scott Hosking, Allan McRobie, Emily Shuckburgh

Review: Streamflow prediction using artificial neural networks and soil moisture proxies — R2/PR12

Conflict of interest statement

Reviewer declares none.

Comments

I recommend for accepting the paper in its current form.

Best wishes

Recommendation: Streamflow prediction using artificial neural networks and soil moisture proxies — R2/PR13

Comments

No accompanying comment.

Decision: Streamflow prediction using artificial neural networks and soil moisture proxies — R2/PR14

Comments

No accompanying comment.