Hostname: page-component-89b8bd64d-j4x9h Total loading time: 0 Render date: 2026-05-12T13:45:14.896Z Has data issue: false hasContentIssue false

Uncertainty bounds for long-term causal effects of perturbations in spatiotemporal systems

Published online by Cambridge University Press:  03 July 2025

Kevin Debeire*
Affiliation:
Deutsches Zentrum für Luft- und Raumfahrt (DLR), Institut für Physik der Atmosphäre, Oberpfaffenhofen, Germany
Andreas Gerhardus
Affiliation:
Deutsches Zentrum für Luft- und Raumfahrt (DLR), Institut für Datenwissenschaften, Jena, Germany
Renée Bichler
Affiliation:
Deutsches Zentrum für Luft- und Raumfahrt (DLR), German Remote Sensing Data Center, Oberpfaffenhofen, Germany Institute of Geography, University of Augsburg , Augsburg, Germany
Jakob Runge
Affiliation:
Deutsches Zentrum für Luft- und Raumfahrt (DLR), Institut für Datenwissenschaften, Jena, Germany Faculty of Electrical Engineering and Computer Science, Technische Universität Berlin , Berlin, Germany now at: Center for Scalable Data Analytics and Artificial Intelligence (ScaDS.AI), Technische Universität Dresden, Dresden, Germany
Veronika Eyring
Affiliation:
Deutsches Zentrum für Luft- und Raumfahrt (DLR), Institut für Physik der Atmosphäre, Oberpfaffenhofen, Germany Institute of Environmental Physics (IUP), University of Bremen , Bremen, Germany
*
Corresponding author: Kevin Debeire; Email: kevin.debeire@dlr.de

Abstract

In time-dependent systems, autoregressive models are frequently employed to investigate the interactions between variables of interest in fields such as climate science, macroeconomics, and neuroscience. Typically, these variables are aggregated from smaller-scale variables into large-scale variables, for instance, representing modes of climate variability in climate science. A key aspect of these models is estimating the long-term effects of external perturbations, once the system stabilizes. Our primary contribution is an explicit formula for quantifying these long-term effects on small-scale variables, which is directly estimable from the model’s linear coefficients and aggregation weights. This improves traditional autoregressive models by providing a localized understanding of the system behavior. We conduct a series of numerical experiments to evaluate the performance of various methods to estimate perturbation effects from data. Our second contribution is the derivation of the asymptotic properties of these estimators under suitable assumptions. These asymptotic properties can be leveraged for uncertainty quantification. In a numerical experiment, we compare the uncertainty ranges of the proposed asymptotic-based approach with four bootstrap-based methods. Finally, we apply our methods to investigate the effects of economic activities on air pollution in Northern Italy, demonstrating their ability to reveal local effects. Our novel approach provides a comprehensive framework for analyzing the impacts of perturbations on both large- and small-scale variables, thereby enhancing our understanding of complex systems. Our research has implications for various disciplines where the study of perturbation effects is crucial for understanding and predicting systems’ behavior.

Information

Type
Methods Paper
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Open Practices
Open materials
Copyright
© The Author(s), 2025. Published by Cambridge University Press
Figure 0

Figure 1. (A) Example of large-scale variables $ {X}_0,{X}_1,{X}_2 $ following a VAR model. Arrows indicate direct linear time dependencies between the large-scale variables defined in $ \mathbf{A} $, where $ {\mathbf{A}}_{ij}\left(\tau \right) $ indicates the effect of variable $ {X}_j $ on variable $ {X}_i $ at time lag $ \tau $. The large-scale variables $ {X}_0,{X}_1,{X}_2 $ are aggregates of the small-scale variables $ \mathbf{y} $ defined on the spatial grid (gray grid) with weights $ {\mathbf{W}}_{\mathbf{0}},{\mathbf{W}}_{\mathbf{1}},{\mathbf{W}}_{\mathbf{2}} $, respectively. Each large-scale variable $ {X}_i $ can be calculated from the small-scale variables $ \mathbf{y} $ thanks to the weights vector $ {\mathbf{W}}_{\mathbf{i}} $ such that $ {X}_i={{\mathbf{W}}_{\mathbf{i}}}^{\prime}\mathbf{y} $. (B) An example of an external forcing of intensity $ f $ with spatial weights $ \mathbf{b} $ is applied for all time $ t\ge {t}_0 $. The forcing is constant over time, but nonuniform spatially. (C) An example of the effects of the forcing once the system reaches equilibrium (LREs). Note that the aggregation weights are obtained solely via standard dimensionality reduction methods (e.g., PCA or PCA-Varimax) or prior knowledge and are not optimized based on the linear coefficient estimation. In particular, there is no feedback from the estimation of the linear coefficients to the determination of these weights.

Figure 1

Table 1. Summary of the synthetic data experiments

Figure 2

Table 2. Numerical experiments’ default setup

Figure 3

Table 3. Parameters values for Experiment 1

Figure 4

Figure 2. Comparison of four methods to estimate the sensitivity mentioned in Section 3.1.1. The subfigures illustrate how the evaluation metrics (y-axis) vary for the different methods evaluated based on the (A) number of time samples available, (B) number of modes, (C) number of links between the modes, (D) strength of the autocorrelation of each mode, (E) resolution of the spatial grid, and (F) covariant noise strength. The evaluation metric considered is the absolute error (AE) between the ground truth and estimated sensitivity. The mean absolute error (MAE) is represented by the solid lines, and the shaded areas depict the 90% range of the AE metric across the 250 realizations.

Figure 5

Figure 3. Mean absolute error between true sensitivity and estimated sensitivity. True weights are known and the linear coefficient matrices are estimated with PCMCI–CCM or a VAR model. The Mean Absolute error is averaged over 100 random SAVAR models.

Figure 6

Figure 4. Average CI size of the sensitivity for five different CI methods. Bootstrap-based confidence intervals are computed over 100 bootstrap realizations. The average CI sizes are calculated over the same 100 random SAVAR models of Figure 3. The true weights are known but the linear coefficient matrices are estimated with (A) a VAR model or (B) PCMCI–CCM. To provide an estimation of the variability of the CI size, black error bars indicate the one standard deviation of the CI size over the 100 SAVAR models.

Figure 7

Figure 5. Empirical significance level for the 90% CI shown in Figure 4, for (A) a VAR model or (B) PCMCI–CCM. For each CI method, the empirical significance level is calculated as the frequency of the confidence interval containing the true sensitivity across 100 random SAVAR models. The gray dashed line indicates the 90% significant level. The colormap used in this plot is the same as the one used in Figure 4.

Figure 8

Figure 6. Comparison of the CI methods mentioned in Section 3.1.2 to build a confidence interval of the sensitivity ($ y $-axis) for varying time sample size ($ x $-axis) for one particular realization of an SAVAR model (seed 13). We plot the 90% CI for five different methods indicated in the legend. The analytical sensitivity is known and indicated by the gray dashed line. The estimated sensitivity is given by the black solid line. To estimate the sensitivity, we used the ground truth mode weights and the linear coefficient matrix estimated with VAR estimation (A) or with PCMCI–CCM (B). The different bootstrap-based confidence intervals are obtained with 100 bootstrap realizations. We calculated the asymptotic-based 90% CI using 1.645 standard deviations of the asymptotic normal distribution.

Figure 9

Figure 7. Topographic map of the study area in the Po Valley in northern Italy (Latitudes: 44°–46°, Longitudes: 7°–14°). Sources: EU-DEM (2016), EUROSTAT (2020), Natural Earth (2023), and Open-StreetMap (2023); EPSG: 4326.

Figure 10

Figure 8. Loadings of the five first PCA-Varimax components of $ {\mathrm{NO}}_2 $ pollution over northern Italy.

Figure 11

Figure 9. Time series of included variables after linear de-trending. In black solid lines, the time series of the first five PCA-Varimax components of $ {\mathrm{NO}}_2 $ pollution in northern Italy. In orange solid lines, the time series of the two economic variables (VA-EG and FCE-HWE).

Figure 12

Table 4. Estimated linear coefficients after the two-step procedure PCMCI+ (selection of causal predictors for each target) and CCM (least squares method to estimate linear dependencies). T-statistics are shown in parentheses. The other values represent the coefficients. All coefficients which are not shown in the table have a value of zero

Figure 13

Figure 10. (A) and (B) Long-run effects of economic variables (VA-EG and FCE-HWE) on $ {\mathrm{NO}}_2 $ pollution. (C) and (D) Statistically significant long-run effects of economic variables (VA-EG and FCE-HWE) on $ {\mathrm{NO}}_2 $ pollution in northern Italy. Here, the confidence interval is obtained using the asymptotic distribution of the long-run effects and with a one-sided significance level of 10%. (E) and (F) Statistically significant long-run effects of economic variables (VA-EG and FCE-HWE) on $ {\mathrm{NO}}_2 $ pollution in northern Italy. Here, the confidence intervals are obtained with a bootstrap of the residuals and Hall’s percentile interval and with a one-sided significance level of 10% and 1000 bootstrap realizations.

Figure 14

Figure B1. Comparison of four methods to estimate the long-run effects mentioned in Section 3.1.1. The different subfigures represent the change of the evaluation metrics (y-axis) for the different methods evaluated as a function of the (A) number of time samples available, (B) number of modes, (C) number of links between the modes, (D) strength of the autocorrelation of each mode, (E) resolution of the spatial grid, and (F) strength of the covariant noise. The evaluation metric is the relative absolute error (RAE) between the ground truth and estimated long-run effects. The mean RAE (MRAE) is represented by the solid lines, and the shaded areas depict the 90% range of the MRAE metric across the 250 realizations.

Figure 15

Figure B2. Same as Figure 6 but with a different seed (here seed 12). Comparison of the confidence interval methods mentioned in Section 3.1.2 to build a 90% confidence interval of the sensitivity ($ y $-axis) for varying time sample size ($ x $-axis) for one particular realization of an SAVAR model (seed 12). We estimate the linear coefficient matrix with a VAR model (A) or with PCMCI–CCM (B).