Hostname: page-component-89b8bd64d-5bvrz Total loading time: 0 Render date: 2026-05-06T13:33:44.005Z Has data issue: false hasContentIssue false

Modeling and simulating spatial extremes by combining extreme value theory with generative adversarial networks

Published online by Cambridge University Press:  13 April 2022

Younes Boulaguiem*
Affiliation:
Geneva School of Economics and Management, University of Geneva, Geneva, Switzerland
Jakob Zscheischler
Affiliation:
Department of Computational Hydrosystems, Helmholtz Centre for Environmental Research – UFZ, Leipzig, Germany Climate and Environmental Physics, University of Bern, Bern, Switzerland Oeschger Centre for Climate Change Research, University of Bern, Bern, Switzerland
Edoardo Vignotto
Affiliation:
Geneva School of Economics and Management, University of Geneva, Geneva, Switzerland
Karin van der Wiel
Affiliation:
R&D weather and climate modelling, Royal Netherlands Meteorological Institute, De Bilt, The Netherlands
Sebastian Engelke
Affiliation:
Geneva School of Economics and Management, University of Geneva, Geneva, Switzerland
*
*Corresponding author. E-mail: younes.boulaguiem@unige.ch

Abstract

Modeling dependencies between climate extremes is important for climate risk assessment, for instance when allocating emergency management funds. In statistics, multivariate extreme value theory is often used to model spatial extremes. However, most commonly used approaches require strong assumptions and are either too simplistic or over-parameterized. From a machine learning perspective, generative adversarial networks (GANs) are a powerful tool to model dependencies in high-dimensional spaces. Yet in the standard setting, GANs do not well represent dependencies in the extremes. Here we combine GANs with extreme value theory (evtGAN) to model spatial dependencies in summer maxima of temperature and winter maxima in precipitation over a large part of western Europe. We use data from a stationary 2000-year climate model simulation to validate the approach and explore its sensitivity to small sample sizes. Our results show that evtGAN outperforms classical GANs and standard statistical approaches to model spatial extremes. Already with about 50 years of data, which corresponds to commonly available climate records, we obtain reasonably good performance. In general, dependencies between temperature extremes are better captured than dependencies between precipitation extremes due to the high spatial coherence in temperature fields. Our approach can be applied to other climate variables and can be used to emulate climate models when running very long simulations to determine dependencies in the extremes is deemed infeasible.

Information

Type
Application Paper
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2022. Published by Cambridge University Press
Figure 0

Figure 1. Discriminator’s architecture and values of hyperparameters used for training. lrelu stands for the leaky relu function and its argument corresponds to the value of the slope when $ x<0 $; drop stands for drop-out (see Section 3.2) and its argument corresponds to the drop-out probability; F stands for the kernel’s dimension, and S for the value of the stride; FC stands for fully connected, and sigmoid for the sigmoid activation function.

Figure 1

Figure 2. Generator’s architecture and values of hyper-parameters used for training. lrelu stands for the leaky relu function and its argument corresponds to the value of the slope when $ x<0 $; drop stands for drop-out (see Section 3.2) and its argument corresponds to the drop-out probability; F stands for the kernel’s dimension, and S for the value of the stride; BN stands for batch-normalization (see Section 3.2); FC stands for fully connected.

Figure 2

Figure 3. The generalized extreme value distribution parameters estimated for each grid point for temperature (a–c) and precipitation (d–f) extremes. On the left (a,d) the mean parameter $ \mu $, in the middle (b,e) the scale parameter $ \sigma $, and on the right (c,f) the shape parameter $ \xi $.

Figure 3

Figure 4. Bivariate plots of temperature extremes based on train and test sets and simulated through the different presented methods. Shown are selected pairs of locations with varying tail dependence. Columns from left to right: train, test, evtGAN, DCGAN and Brown–Resnick. From top to bottom: weak tail dependence (a–e), mild tail dependence (f–j), and strong tail dependence (k–o). The colors represent the empirically estimated density: the spectrum goes from yellow to gray, reflecting a decrease in density.

Figure 4

Figure 5. Bivariate plots of precipitation extremes based on train and test sets and simulated through the different presented methods. Shown are selected pairs of locations with varying tail dependence. Columns from left to right: train, test, evtGAN, DCGAN, and Brown–Resnick. From top to bottom: weak tail dependence (a–e), mild tail dependence (f–j), and strong tail dependence (k–o). The colors represent the empirically estimated density: the spectrum goes from yellow to gray, reflecting a decrease in density.

Figure 5

Figure 6. Scatter plots of the extremal correlations for temperature extremes (a–c) and precipitation extremes (d–f) between $ 100 $ randomly selected locations. The x-axes always show the estimates based on the test set. Estimates on the y-axes are based on the train set (a,d), on the output of evtGAN (b,d), and on the Brown–Resnick model. The colors represent the empirically estimated density: the spectrum goes from yellow to gray, reflecting a decrease in density.

Figure 6

Figure 7. Spectral distributions for a threshold of 0.95 for selected pairs of locations with varying tail dependence for temperature (a–c) and precipitation (d–f). (a,d) weak tail dependence, (b,e) mild tail dependence, (c,f) strong tail dependence. Shown are kernel density estimates of the evtGAN (red), the Brown–Resnick model (blue) and bars for the ground truth (magenta).

Figure 7

Figure 8. Model performance versus training epochs for different sample sizes in evtGAN for temperature (a,c,e) and precipitation (b,d,f) extremes. A number of observations equal to $ {n}_{\mathrm{test}}=N-{n}_{\mathrm{train}} $ were sampled from evtGAN, where $ N=2000 $ for temperature, and $ N=1999 $ for precipitation. The mean $ {l}^2 $ norms for train (black) and test set (red) are defined as $ {C}_{\mathrm{te}}={\left\Vert {\chi}_{\mathrm{evtG}}-{\chi}_{\mathrm{te}}\right\Vert}_2 $, $ {C}_{\mathrm{tr}}={\left\Vert {\chi}_{\mathrm{evtG}}-{\chi}_{\mathrm{tr}}\right\Vert}_2 $, where $ {\chi}_{\mathrm{evtG}} $, $ {\chi}_{\mathrm{te}} $, and $ {\chi}_{\mathrm{tr}} $ denote the vectors of extremal correlations calculated on the samples from evtGAN, the test set and the train set, respectively.