Hostname: page-component-5db58dd55d-l8wb7 Total loading time: 0 Render date: 2026-06-01T19:14:54.053Z Has data issue: false hasContentIssue false

Reflective error: a metric for assessing predictive performance at extreme events

Published online by Cambridge University Press:  24 April 2025

Robert Edwin Rouse*
Affiliation:
Department of Engineering, University of Cambridge, Cambridge, UK Department of Applied Mathematics and Theoretical Physics, University of Cambridge, Cambridge, UK
Henry Moss
Affiliation:
Department of Applied Mathematics and Theoretical Physics, University of Cambridge, Cambridge, UK
Scott Hosking
Affiliation:
AI Lab at British Antarctic Survey, Cambridge, UK The Alan Turing Institute, London, UK
Allan McRobie
Affiliation:
Department of Engineering, University of Cambridge, Cambridge, UK
Emily Shuckburgh
Affiliation:
Department of Computer Science and Technology, University of Cambridge, Cambridge, UK
*
Corresponding author: Robert Edwin Rouse; Email: rer44@cam.ac.uk

Abstract

When using machine learning to model environmental systems, it is often a model’s ability to predict extreme behaviors that yields the highest practical value to policy makers. However, most existing error metrics used to evaluate the performance of environmental machine learning models weigh error equally across test data. Thus, routine performance is prioritized over a model’s ability to robustly quantify extreme behaviors. In this work, we present a new error metric, termed Reflective Error, which quantifies the degree at which our model error is distributed around our extremes, in contrast to existing model evaluation methods that aggregate error over all events. The suitability of our proposed metric is demonstrated on a real-world hydrological modeling problem, where extreme values are of particular concern.

Information

Type
Methods Paper
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Open Practices
Open materials
Copyright
© The Author(s), 2025. Published by Cambridge University Press
Figure 0

Figure 1. Normally distributed dataset of synthetic observations with fitted probability density and reflective weighting functions (left) with the predictions versus observations for the two perturbation scenarios (center and right).

Figure 1

Figure 2. Synthetic temporally variant dataset with predictions and observations shown about the mean function and within 2 standard deviations with anomalous observation highlighted (left); and the overall probability distribution fitted to the data with normalized histogram of the observations (right).

Figure 2

Figure 3. Maps of elevation, land use, and geology for the Avon at Bathford and Exe at Pixton (note that the catchments are shown at different scales for visibility). Keys corresponding to each map represent the proportion of each within respective subcategories. Adapted from the National River Flow Archive UK Centre for Ecology & Hydrology (2022).

Figure 3

Figure 4. Parameters for a log-normal probability density function fitted to a 365-day rolling window of streamflow observations for the River Avon and the River Exe.

Figure 4

Figure 5. Predicted and observed streamflow time series in the year 2012 for the River Avon and River Exe, with predictions generated using a basic artificial neural network model.

Figure 5

Figure 6. Performance in terms of NSE and RE for different $ \alpha $ and $ \beta $ pairs in the Reflective Loss Function used for training a neural network on streamflow data from the Rivers Avon ((a) and (b)) & Exe ((c) and (d)).

Figure 6

Figure 7. Focused time series for winter periods for the River Avon showing observations and predictions generated from the Neural Network model using both the MSE Loss and RE Loss functions.