Hostname: page-component-77f85d65b8-45ctf Total loading time: 0 Render date: 2026-03-28T14:11:11.083Z Has data issue: false hasContentIssue false

Environmental sensor placement with convolutional Gaussian neural processes

Published online by Cambridge University Press:  03 August 2023

Tom R. Andersson*
Affiliation:
British Antarctic Survey, NERC, UKRI, Cambridge, United Kingdom
Wessel P. Bruinsma
Affiliation:
Microsoft Research AI4Science, Amsterdam, The Netherlands
Stratis Markou
Affiliation:
University of Cambridge, Cambridge, United Kingdom
James Requeima
Affiliation:
Vector Institute, Toronto, ON, Canada
Alejandro Coca-Castro
Affiliation:
The Alan Turing Institute, London, United Kingdom
Anna Vaughan
Affiliation:
University of Cambridge, Cambridge, United Kingdom
Anna-Louise Ellis
Affiliation:
Met Office, London, United Kingdom
Matthew A. Lazzara
Affiliation:
University of Wisconsin-Madison, Madison, WI, USA Madison Area Technical College, Madison, WI, USA
Dani Jones
Affiliation:
British Antarctic Survey, NERC, UKRI, Cambridge, United Kingdom
Scott Hosking
Affiliation:
British Antarctic Survey, NERC, UKRI, Cambridge, United Kingdom The Alan Turing Institute, London, United Kingdom
Richard E. Turner
Affiliation:
Microsoft Research AI4Science, Amsterdam, The Netherlands University of Cambridge, Cambridge, United Kingdom
*
Corresponding author: Tom R. Andersson; Email: tomand@bas.ac.uk

Abstract

Environmental sensors are crucial for monitoring weather conditions and the impacts of climate change. However, it is challenging to place sensors in a way that maximises the informativeness of their measurements, particularly in remote regions like Antarctica. Probabilistic machine learning models can suggest informative sensor placements by finding sites that maximally reduce prediction uncertainty. Gaussian process (GP) models are widely used for this purpose, but they struggle with capturing complex non-stationary behaviour and scaling to large datasets. This paper proposes using a convolutional Gaussian neural process (ConvGNP) to address these issues. A ConvGNP uses neural networks to parameterise a joint Gaussian distribution at arbitrary target locations, enabling flexibility and scalability. Using simulated surface air temperature anomaly over Antarctica as training data, the ConvGNP learns spatial and seasonal non-stationarities, outperforming a non-stationary GP baseline. In a simulated sensor placement experiment, the ConvGNP better predicts the performance boost obtained from new observations than GP baselines, leading to more informative sensor placements. We contrast our approach with physics-based sensor placement methods and propose future steps towards an operational sensor placement recommendation system. Our work could help to realise environmental digital twins that actively direct measurement sampling to improve the digital representation of reality.

Information

Type
Application Paper
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2023. Published by Cambridge University Press
Figure 0

Figure 1. We have two context sets: ERA5 temperature anomaly observations and 6 gridded auxiliary fields, and we wish to make probabilistic predictions for temperature anomaly over a vertical line of target points (blue dotted line in left-most panel). In the ConvGNP, a SetConv layer fuses the context sets into a single gridded encoding (Supplementary Figure B1; Gordon et al., 2020). A U-Net (Ronneberger et al., 2015) takes this encoded tensor as input and outputs a gridded representation, which is interpolated at target points $ {\boldsymbol{X}}^{(t)} $ and used to parameterise the mean and covariance of a multivariate Gaussian distribution over $ {\boldsymbol{y}}^{(t)} $. The output mean vector $ \boldsymbol{\mu} $ is shown as a black line, with 10 Gaussian samples overlaid in grey. The heatmap of the covariance matrix $ \boldsymbol{K} $ shows the magnitude of spatial covariances, with covariance decreasing close to temperature anomaly context points.

Figure 1

Figure 2. (a) ERA5 2 m temperature anomaly on January 1, 2018; (b–d) ConvGNP samples with ERA5 temperature anomaly context points at Antarctic station locations (black circles). Comparing colours within the black circles across plots shows that the ConvGNP interpolates context observations.

Figure 2

Figure 3. Prior covariance function, $ k\left({\boldsymbol{x}}_1,{\boldsymbol{x}}_2\right) $, with $ {\boldsymbol{x}}_1 $ fixed at the white plus location and $ {\boldsymbol{x}}_2 $ varying over the grid. Plots are shown for three different $ {\boldsymbol{x}}_1 $-locations (the Ross Ice Shelf, the South Pole, and East Antarctica) for the 1st of June. The most prominent section of the Transantarctic Mountains is indicated by the red dashed line in (b).

Figure 3

Figure 4. Mean metric values versus number of context points on the test set. The joint negative log-likelihood (NLL) is normalised by the number of targets. Error bars are standard errors.

Figure 4

Figure 5. Maps of acquisition function values $ \alpha \left({\boldsymbol{x}}_i^{(s)}\right) $ for the initial $ k=1 $ greedy iteration. The initial context set $ {\boldsymbol{X}}^{(c)} $ is derived from real Antarctic station locations (black circles). Running the sensor placement algorithm for $ K=10 $ sensor placements results in the proposed sensor placements $ {\boldsymbol{X}}^{\ast } $ (white pluses). Each pixel is $ 100\times $$ 100 $ km.

Figure 5

Figure 6. Correlation between model-based and oracle acquisition functions, $ \boldsymbol{\alpha} \left({\boldsymbol{X}}^{(s)}\right) $ and $ {\boldsymbol{\alpha}}_{\mathrm{oracle}}\left({\boldsymbol{X}}^{(s)}\right) $. Error bars indicate the 2.5–97.5% quantiles from 5,000 bootstrapped correlation values, computed by resampling the 1,365 pairs of points with replacement, measuring how spatially consistent the correlation is across the search space $ {\boldsymbol{X}}^{(s)} $.

Figure 6

Figure 7. Performance metrics on the sensor placement test data versus number of stations revealed to the models. Results are averaged over 243 dates in 2018–2019, with targets defined on a 100 km grid over Antarctica. For simplicity, we only plot the model-based criterion that targets the plotted metric. The GP baselines are shown on the marginal negative log-likelihood (NLL) and RMSE panels. For the joint NLL, the GP baselines perform far worse than the ConvGNP and are not shown. The confidence interval of Random is the standard error from 5 random placements.

Figure 7

Figure 8. Accounting for sensor placement cost using multi-objective Pareto optimisation, maximising the ConvGNP’s DeltaVar (a proxy for informativeness) and minimising ContextDist (a proxy for cost). (a) Scatter plot showing all pairs of informativeness and cost values. (b) Heatmap of Pareto rank. The rank-1 Pareto set is highlighted in red for both plots.

Supplementary material: File

Andersson et al. supplementary material
Download undefined(File)
File 8.6 MB