Search

Reflective error: a metric for assessing predictive performance at extreme events
Robert Edwin Rouse, Henry Moss, Scott Hosking, Allan McRobie, Emily Shuckburgh
Journal:

Environmental Data Science / Volume 4 / 2025

Published online by Cambridge University Press:

24 April 2025, e26
- Article
- - You have access
  - Open access
- PDF
- HTML
- Export citation
When using machine learning to model environmental systems, it is often a model’s ability to predict extreme behaviors that yields the highest practical value to policy makers. However, most existing error metrics used to evaluate the performance of environmental machine learning models weigh error equally across test data. Thus, routine performance is prioritized over a model’s ability to robustly quantify extreme behaviors. In this work, we present a new error metric, termed Reflective Error, which quantifies the degree at which our model error is distributed around our extremes, in contrast to existing model evaluation methods that aggregate error over all events. The suitability of our proposed metric is demonstrated on a real-world hydrological modeling problem, where extreme values are of particular concern.

Improving the reproducibility in geoscientific papers: lessons learned from a Hackathon in climate science
Part of
- Climate Informatics 2024
Alejandro Coca-Castro, Anne Fouilloux, Ricardo Barros Lourenço, Andrew McDonald, Yuhan Rao, J. Scott Hosking
Journal:

Environmental Data Science / Volume 4 / 2025

Published online by Cambridge University Press:

16 January 2025, e6
- Article
- - You have access
  - Open access
- PDF
- HTML
- Export citation
In this paper, we explore the crucial role and challenges of computational reproducibility in geosciences, drawing insights from the Climate Informatics Reproducibility Challenge (CICR) in 2023. The competition aimed at (1) identifying common hurdles to reproduce computational climate science; and (2) creating interactive reproducible publications for selected papers of the Environmental Data Science journal. Based on lessons learned from the challenge, we emphasize the significance of open research practices, mentorship, transparency guidelines, as well as the use of technologies such as executable research objects for the reproduction of geoscientific published research. We propose a supportive framework of tools and infrastructure for evaluating reproducibility in geoscientific publications, with a case study for the climate informatics community. While the recommendations focus on future CIRCs, we expect they would be beneficial for wider umbrella of reproducibility initiatives in geosciences.

Streamflow prediction using artificial neural networks and soil moisture proxies
Robert Edwin Rouse, Doran Khamis, Scott Hosking, Allan McRobie, Emily Shuckburgh
Journal:

Environmental Data Science / Volume 4 / 2025

Published online by Cambridge University Press:

16 January 2025, e5
- Article
- - You have access
  - Open access
- PDF
- HTML
- Export citation
Machine learning models have been used extensively in hydrology, but issues persist with regard to their transparency, and there is currently no identifiable best practice for forcing variables in streamflow or flood modeling. In this paper, using data from the Centre for Ecology & Hydrology’s National River Flow Archive and from the European Centre for Medium-Range Weather Forecasts, we present a study that focuses on the input variable set for a neural network streamflow model to demonstrate how certain variables can be internalized, leading to a compressed feature set. By highlighting this capability to learn effectively using proxy variables, we demonstrate a more transferable framework that minimizes sensing requirements and that enables a route toward generalizing models.

Environmental sensor placement with convolutional Gaussian neural processes
Tom R. Andersson, Wessel P. Bruinsma, Stratis Markou, James Requeima, Alejandro Coca-Castro, Anna Vaughan, Anna-Louise Ellis, Matthew A. Lazzara, Dani Jones, Scott Hosking, Richard E. Turner
Journal:

Environmental Data Science / Volume 2 / 2023

Published online by Cambridge University Press:

03 August 2023, e32
- Article
- - You have access
  - Open access
- PDF
- HTML
- Export citation
Environmental sensors are crucial for monitoring weather conditions and the impacts of climate change. However, it is challenging to place sensors in a way that maximises the informativeness of their measurements, particularly in remote regions like Antarctica. Probabilistic machine learning models can suggest informative sensor placements by finding sites that maximally reduce prediction uncertainty. Gaussian process (GP) models are widely used for this purpose, but they struggle with capturing complex non-stationary behaviour and scaling to large datasets. This paper proposes using a convolutional Gaussian neural process (ConvGNP) to address these issues. A ConvGNP uses neural networks to parameterise a joint Gaussian distribution at arbitrary target locations, enabling flexibility and scalability. Using simulated surface air temperature anomaly over Antarctica as training data, the ConvGNP learns spatial and seasonal non-stationarities, outperforming a non-stationary GP baseline. In a simulated sensor placement experiment, the ConvGNP better predicts the performance boost obtained from new observations than GP baselines, leading to more informative sensor placements. We contrast our approach with physics-based sensor placement methods and propose future steps towards an operational sensor placement recommendation system. Our work could help to realise environmental digital twins that actively direct measurement sampling to improve the digital representation of reality.

Search Results

Refine search

Refine search

Actions for selected content:

4 results

Reflective error: a metric for assessing predictive performance at extreme events

Improving the reproducibility in geoscientific papers: lessons learned from a Hackathon in climate science

Streamflow prediction using artificial neural networks and soil moisture proxies

Environmental sensor placement with convolutional Gaussian neural processes

Search Results

Refine search

Refine search

Actions for selected content:

Save Search

4 results

Reflective error: a metric for assessing predictive performance at extreme events

Improving the reproducibility in geoscientific papers: lessons learned from a Hackathon in climate science

Streamflow prediction using artificial neural networks and soil moisture proxies

Environmental sensor placement with convolutional Gaussian neural processes