Hostname: page-component-6766d58669-7fx5l Total loading time: 0 Render date: 2026-05-16T23:35:33.231Z Has data issue: false hasContentIssue false

Predicting years with extremely low gross primary production from daily weather data using Convolutional Neural Networks

Published online by Cambridge University Press:  13 April 2022

Aris Marcolongo*
Affiliation:
Mathematical Institute, University of Bern, Bern, Switzerland Climate and Environmental Physics, University of Bern, Bern, Switzerland
Mykhailo Vladymyrov
Affiliation:
Mathematical Institute, University of Bern, Bern, Switzerland Theodor Kocher Institute, University of Bern, Bern, Switzerland
Sebastian Lienert
Affiliation:
Climate and Environmental Physics, University of Bern, Bern, Switzerland Oschger Centre for Climate Change Research, University of Bern, Bern, Switzerland
Nadav Peleg
Affiliation:
Institute of Earth Surface Dynamics, University of Lausanne, Lausanne, Switzerland
Sigve Haug
Affiliation:
Mathematical Institute, University of Bern, Bern, Switzerland
Jakob Zscheischler
Affiliation:
Climate and Environmental Physics, University of Bern, Bern, Switzerland Theodor Kocher Institute, University of Bern, Bern, Switzerland Department of Computational Hydrosystems, Helmholtz Centre for Environmental Research – UFZ, Leipzig, Germany
*
*Corresponding author. E-mail: aris.marcolongo@math.unibe.ch

Abstract

Understanding the meteorological drivers of extreme impacts in social or environmental systems is important to better quantify current and project future climate risks. Impacts are typically an aggregated response to many different interacting drivers at various temporal scales, rendering such driver identification a challenging task. Machine learning–based approaches, such as deep neural networks, may be able to address this task but require large training datasets. Here, we explore the ability of Convolutional Neural Networks (CNNs) to predict years with extremely low gross primary production (GPP) from daily weather data in three different vegetation types. To circumvent data limitations in observations, we simulate 100,000 years of daily weather with a weather generator for three different geographical sites and subsequently simulate vegetation dynamics with a complex vegetation model. For each resulting vegetation distribution, we then train two different CNNs to classify daily weather data (temperature, precipitation, and radiation) into years with extremely low GPP and normal years. Overall, prediction accuracy is very good if the monthly or yearly GPP values are used as an intermediate training target (area under the precision-recall curve AUC $ \ge $ 0.9). The best prediction accuracy is found in tropical forests, with temperate grasslands and boreal forests leading to comparable results. Prediction accuracy is strongly reduced when binary classification is used directly. Furthermore, using daily GPP during training does not improve the predictive power. We conclude that CNNs are able to predict extreme impacts from complex meteorological drivers if sufficient data are available.

Information

Type
Application Paper
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Open Practices
Open data
Copyright
© The Author(s), 2022. Published by Cambridge University Press
Figure 0

Figure 1. Geographic locations of the three sites considered in this work. Sites are named according to their climate type.

Figure 1

Figure 2. (a) The surface coverage by plant functional type, as defined by the Bern-LPX vegetation model. (b) Box-plot (across years) of monthly gross primary production (GPP) values. (c) Normalized density of yearly GPP, according to which a year is defined as extreme or nonextreme.

Figure 2

Figure 3. CNN-M model architecture. Rectangles represent 2D or 1D tensors according to the reported dimensions. The input corresponds to meteorological data from the year considered plus around half of the previous one, for a total of 544 days. The output of the network is shown in red. Neurons $ {M}_i,i=1\dots 12 $ correspond to monthly gross primary production (GPP) values of current year. In regression mode the monthly predictions are summed up to obtain the yearly GPP, denoted as $ {\hat{Y}}^{GPP} $. In classification mode, a neuron predicts the probability for the year to be an extreme as well, indicated with $ {\hat{P}}^{GPP}. $ When both classification and regression modes are active CNN-M is in multitasking mode.

Figure 3

Figure 4. CNN-D model architecture. The same notation of Figure 3 is followed. The output tensor contains daily gross primary production (GPP) values from current year plus around half of previous one (544 days), $ {\hat{Y}}^{GPP} $ is the yearly GPP (sum over last 365 days).

Figure 4

Figure 5. Distributions of rescaled residuals of yearly gross primary production (GPP) for models in exclusive regression (Reg.) mode, with reported normalized root mean squared error (NRMSE). $ {\hat{Y}}^{GPP} $ indicates the predicted yearly GPP and $ {Y}^{GPP} $ its exact value, evaluated for the test set. For CNN-M, when $ \alpha ={\alpha}_m $/$ {\alpha}_y $ the regression task fits monthly/yearly GPP values. For CNN-D, $ \alpha $ changes during training from $ {\alpha}_d $ to $ {\alpha}_y $, fitting daily/yearly GPP values at the beginning/end of training.

Figure 5

Figure 6. Daily preditions of GPP from CNN-D (black points), superposed with the exact behavior (red lines), for one random year selected from the test set.

Figure 6

Figure 7. Precision-recall (PR) curves for different architectures. Corresponding area under curve (AUC) scores are reported on the legends. See caption of Figure 5 and Section 3.2.3 for definition of the different $ \alpha $ s. (a) Models based on CNN-M in classification or multitasking mode. Classification is equivalent to multitasking without regression task ($ \beta =0 $, see Equation (5)). For these settings the classification output neuron can be directly used to evaluate the PR curves. (b) Models in exclusive regression (Reg.) mode. PR curves are obtained varying a cutoff on the predicted yearly GPP. With this procedure, the PR curves provide a common metric to compare all models.

Figure 7

Figure 8. For the indicated model in multitasking mode, we plot predicted versus exact yearly GPP on the test set. Colors identify correct (blue) and missclassified (red) examples when a fixed threshold of $ 0.5 $ on the predicted probability is applied. Dashed lines indicate the quantile used to define extreme and normal years.

Supplementary material: PDF

Marcolongo et al. supplementary material

Marcolongo et al. supplementary material

Download Marcolongo et al. supplementary material(PDF)
PDF 1.2 MB