Hostname: page-component-89b8bd64d-b5k59 Total loading time: 0 Render date: 2026-05-08T21:37:44.879Z Has data issue: false hasContentIssue false

Identifying compound weather drivers of forest biomass loss with generative deep learning

Published online by Cambridge University Press:  12 February 2024

Mohit Anand*
Affiliation:
Helmholtz Centre for Environmental Research—UFZ, Department of Compound Environmental Risks, Leipzig, Germany Technische Universität Dresden, Dresden, Germany
Friedrich J. Bohn
Affiliation:
Helmholtz Centre for Environmental Research—UFZ, Department of Computational Hydrosystems, Leipzig, Germany
Gustau Camps-Valls
Affiliation:
Image Processing Laboratory (IPL), Universitat de València, València, Spain
Rico Fischer
Affiliation:
Helmholtz Centre for Environmental Research—UFZ, Department of Ecological Modelling, Leipzig, Germany
Andreas Huth
Affiliation:
Helmholtz Centre for Environmental Research—UFZ, Department of Ecological Modelling, Leipzig, Germany
Lily-belle Sweet
Affiliation:
Helmholtz Centre for Environmental Research—UFZ, Department of Compound Environmental Risks, Leipzig, Germany
Jakob Zscheischler
Affiliation:
Helmholtz Centre for Environmental Research—UFZ, Department of Compound Environmental Risks, Leipzig, Germany Technische Universität Dresden, Dresden, Germany
*
Corresponding author: Mohit Anand; Email: mohit.anand@ufz.de

Abstract

Globally, forests are net carbon sinks that partly mitigates anthropogenic climate change. However, there is evidence of increasing weather-induced tree mortality, which needs to be better understood to improve forest management under future climate conditions. Disentangling drivers of tree mortality is challenging because of their interacting behavior over multiple temporal scales. In this study, we take a data-driven approach to the problem. We generate hourly temperate weather data using a stochastic weather generator to simulate 160,000 years of beech, pine, and spruce forest dynamics with a forest gap model. These data are used to train a generative deep learning model (a modified variational autoencoder) to learn representations of three-year-long monthly weather conditions (precipitation, temperature, and solar radiation) in an unsupervised way. We then associate these weather representations with years of high biomass loss in the forests and derive weather prototypes associated with such years. The identified prototype weather conditions are associated with 5–22% higher median biomass loss compared to the median of all samples, depending on the forest type and the prototype. When prototype weather conditions co-occur, these numbers increase to 10–25%. Our research illustrates how generative deep learning can discover compounding weather patterns associated with extreme impacts.

Information

Type
Application Paper
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2024. Published by Cambridge University Press
Figure 0

Figure 1. Illustration of the methodology. ERA5 data are used to calibrate AWE-GEN. AWE-GEN is used to generate realistic weather conditions. A VAE is used to learn low-dimensional weather representations and generate prototypes. Based on simulated weather conditions, FORMIND simulates forest dynamics, including mortality and biomass loss. FORMIND-derived biomass loss is used to select weather prototypes associated with high biomass loss. The explainable AI method used here consists of the training of VAE together with the generation of weather prototypes relevant for high biomass loss induced by forest mortality.

Figure 1

Figure 2. (a) Monthly mean temperature (red) and precipitation (blue) of the bias-adjusted ERA5 (dotted line) and simulated by AWE-GEN (continuous line). Correlation between monthly means across all three variable combinations for all months for the bias-adjusted ERA5 data (b) and data simulated by AWE-GEN (c).

Figure 2

Figure 3. The VAE architecture with encoder and decoder blocks. The encoder takes in $ 36\times 3 $ dimensional weather variables and outputs two 32-dimensional vectors of mean and log variance of a normal distribution, which define the latent space. The decoder takes samples in the latent variable space associated with specific weather variables in the input space (distinguished by different colors) to generate reconstructions of each of the individual weather variables. Three passes of the decoder are required to reconstruct all three weather variables.

Figure 3

Figure 4. Probability density estimate of biomass loss (left axis), along with three-year average standardized temperature (red), radiation (yellow) and precipitation (blue) (right axis). Vertical lines indicate the median, mean, and 90th percentile of the biomass loss distribution.

Figure 4

Figure 5. Composite of monthly standardized weather anomalies associated with the 90th percentile of biomass loss (left) for beech (a), pine (c), and spruce (e). Coefficients of logistic regression without structure variables (right) for beech (b), pine (d), and spruce (f).

Figure 5

Table 1. Performance metrics for the logistic regression with weather variables only ($ {\mathbf{x}}_d $) and with structure variables ($ {\mathbf{x}}_d $ and $ {\mathbf{x}}_s $)

Figure 6

Table 2. Training, validation and test losses and mean standard errors (MSE) for $ \beta $-VAE

Figure 7

Figure 6. Absolute mean difference between samples in the latent dimensions representative of three-year periods leading to extreme and nonextreme BL for beech (a), pine (c), and spruce (e). The latent dimensions are sorted in descending order with respect to the absolute mean difference. Box plot of three latent dimensions with the highest absolute mean difference associated with nonextreme (orange) and extreme BL (purple) for beech (b), pine (d), and spruce (f).

Figure 8

Table 3. Logistic regression with$ {z}_{14} $, $ {z}_{22} $and$ {z}_{12} $

Figure 9

Figure 7. Weather prototypes. Note that the y-axis ranges differ between the univariate prototypes (first three rows) and the compound prototypes (last two rows).

Figure 10

Figure 8. Biomass loss density plot for all samples (top), samples contributing to univariate prototypes (2nd to 4th row) and samples contributing to compound prototypes (last two rows) for beech (left), pine (middle), and spruce (right). Vertical dashed and dash-dotted lines represent the median and 90th percentile in each distribution.

Figure 11

Table 4. Percentage increase in the median and the 90th percentile biomass loss for different weather prototypes and different types of forests

Figure 12

Figure A1. Pearson’s correlation coefficients among all latent dimensions.