Hostname: page-component-77f85d65b8-g4pgd Total loading time: 0 Render date: 2026-04-19T17:20:00.044Z Has data issue: false hasContentIssue false

Modeling stratospheric polar vortex variation and identifying vortex extremes using explainable machine learning

Published online by Cambridge University Press:  21 November 2022

Zheng Wu*
Affiliation:
Institute for Atmospheric and Climate Science, ETH Zürich, Zürich, Switzerland
Tom Beucler
Affiliation:
Institute of Earth Surface Dynamics, University of Lausanne, Lausanne, Switzerland
Enikő Székely
Affiliation:
Swiss Data Science Center, ETH Zürich and EPFL, Lausanne, Switzerland
William T. Ball
Affiliation:
Department of Geoscience and Remote Sensing, TU Delft, Delft, The Netherlands
Daniela I.V. Domeisen
Affiliation:
Institute for Atmospheric and Climate Science, ETH Zürich, Zürich, Switzerland Institute of Earth Surface Dynamics, University of Lausanne, Lausanne, Switzerland
*
*Corresponding author. E-mail: zheng.wu@env.ethz.ch

Abstract

The winter stratospheric polar vortex (SPV) exhibits considerable variability in magnitude and structure, which can result in extreme SPV events. These extremes can subsequently influence weather in the troposphere from weeks to months and thus are important sources of surface predictability. However, the predictability of the SPV extreme events is limited to 1–2 weeks in state-of-the-art prediction systems. Longer predictability timescales of SPV would strongly benefit long-range surface prediction. One potential option for extending predictability timescales is the use of machine learning (ML). However, it is often unclear which predictors and patterns are important for ML models to make a successful prediction. Here we use explainable multiple linear regressions (MLRs) and an explainable artificial neural network (ANN) framework to model SPV variations and identify one type of extreme SPV events called sudden stratospheric warmings. We employ a NN attribution method to propagate the ANN’s decision-making process backward and uncover feature importance in the predictors. The feature importance of the input is consistent with the known precursors for extreme SPV events. This consistency provides confidence that ANNs can extract reliable and physically meaningful indicators for the prediction of the SPV. In addition, our study shows a simple MLR model can predict the SPV daily variations using sequential feature selection, which provides hints for the connections between the input features and the SPV variations. Our results indicate the potential of explainable ML techniques in predicting stratospheric variability and extreme events, and in searching for potential precursors for these events on extended-range timescales.

Information

Type
Application Paper
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2022. Published by Cambridge University Press
Figure 0

Table 1. Performance metrics for the regression task of the MLR using 300 modes as predictors (left columns) and the MLR using the five best modes identified by the SFS procedure (right columns) over the training and test data.

Figure 1

Figure 1. The low-pass filtered standardized $ U{10}_{60} $ daily anomalies of the test data for the target ERA-interim values, which is our “truth” (blue), the predicted values obtained from the MLR using 300 modes (orange), and from the MLR using the five best modes selected by the SFS procedure (green). Each panel shows the results of one set of test data (8-year) from the 60 experiments. Note that the test data in the 60 experiments are different and here we only show 6 experiments as examples.

Figure 2

Figure 2. The spatial patterns of the most selected modes of (upper row) $ Z100 $ and (bottom row) $ U50 $ by the forward SFS procedure. The numbers in the parenthesis show the explained variance by each mode. The two panels in the red box show the third mode of $ Z100 $ and second mode of $ U50 $, which represents larger variability of the variables but are rarely selected by SFS.

Figure 3

Table 2. Performance metrics for the classification task of the ANN (left columns) and the logistic regression baseline (right columns) over the training and test data using 2D $ Z100 $ spatial patterns as input.

Figure 4

Figure 3. The relevance heat maps of $ Z100 $ for correctly identified SSW events by the ANN models. (a) The composite of heat maps for correctly identified SSWs in all 60 experiments; (b) the heat map of the SSW event on March 4, 1981; and (c) the heat map of the SSW event on February 21, 1989.