Hostname: page-component-89b8bd64d-nlwjb Total loading time: 0 Render date: 2026-05-07T05:08:01.389Z Has data issue: false hasContentIssue false

Identifying probabilistic weather regimes targeted to a local-scale impact variable

Published online by Cambridge University Press:  21 November 2024

Fiona R. Spuler*
Affiliation:
Department of Meteorology, University of Reading, Reading, UK
Marlene Kretschmer
Affiliation:
Department of Meteorology, University of Reading, Reading, UK Leipzig Institute for Meteorology, University of Leipzig, Leipzig, Germany
Yevgeniya Kovalchuk
Affiliation:
Centre for Advanced Research Computing, University College London, London, UK
Magdalena Alonso Balmaseda
Affiliation:
European Centre for Medium-Range Weather Forecasts, Reading, UK
Theodore G. Shepherd
Affiliation:
Department of Meteorology, University of Reading, Reading, UK
*
Corresponding author: Fiona R. Spuler; Email: f.r.spuler@pgr.reading.ac.uk

Abstract

Large-scale atmospheric circulation patterns, so-called weather regimes, modulate the occurrence of extreme events such as heatwaves or extreme precipitation. In their role as mediators between long-range teleconnections and local impacts, weather regimes have demonstrated potential in improving long-term climate projections as well as sub-seasonal to seasonal forecasts. However, existing methods for identifying weather regimes are not specifically designed to capture the relevant physical processes responsible for variations in the impact variable in question. This paper introduces a novel probabilistic machine learning method, RMM-VAE, for identifying weather regimes targeted to a local-scale impact variable. Based on a variational autoencoder architecture, the method combines non-linear dimensionality reduction with a prediction task and probabilistic clustering in one coherent architecture. The new method is applied to identify circulation patterns over the Mediterranean region targeted to precipitation over Morocco and compared to three existing approaches: two established linear methods and another machine-learning approach. The RMM-VAE method identifies regimes that are more predictive of the target variable compared to the two linear methods, both in terms of terciles and extremes in precipitation, while also improving the reconstruction of the input space. Further, the regimes identified by the RMM-VAE method are also more robust and persistent compared to the alternative machine learning method. The results demonstrate the potential benefit of the new method for use in various climate applications such as sub-seasonal forecasting, and illustrate the trade-offs involved in targeted clustering.

Information

Type
Methods Paper
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Open Practices
Open data
Copyright
© The Author(s), 2024. Published by Cambridge University Press
Figure 0

Figure 1. Illustration of selected methodological choices for dimensionality reduction and clustering based on Hannachi et al. (2017) and Murphy (2022). The methods highlighted in green are applied in this paper and described in more detail in Section 3. Methods highlighted with a star refer to joint dimensionality reduction methods between two high-dimensional spaces.

Figure 1

Figure 2. Extended winter precipitation (November–March) over the selected region over Morocco based on ERA5 reanalysis data from 1940 to 2022. Left-side: daily mean precipitation. Right-side: 95th percentile of daily precipitation.

Figure 2

Figure 3. Schematic diagram of the R-VAE + k-means method based on the architecture developed by Zhao et al. (2019a). The input data $ x $ is passed through the encoder network, shown here in blue, which outputs both an estimate of the latent space $ z $ and a prediction of the scalar target variable $ t $. The target variable is then used to predict back into the latent space, thereby targeting the dimensionality reduction. The reduced space is subsequently clustered using k-means.

Figure 3

Figure 4. Different variational autoencoder models represented as probabilistic graphical models using plate notation. The inference model corresponds to the encoder and the generative model to the decoder of the architecture. a) A standard VAE with input variable $ x $, latent variable $ z $ and prior $ {\Phi}_z $ on the parameters $ \mu $ and $ \sigma $ of the multivariate Gaussian distribution of $ z $. b) The R-VAE method with an additional target variable $ t $, and c) the RMM-VAE method with probabilistic cluster assignment $ c $ regularized by the prior $ {\pi}_k $. In all panels, dashed lines indicate the inference model and solid lines the generative model.

Figure 4

Figure 5. Schematic diagram of the proposed RMM-VAE approach. In contrast to the R-VAE method, the encoder network, shown again in blue, outputs not only an estimate of the latent space $ z $ and a prediction of the scalar target variable $ t $, but also a probabilistic cluster assignment of the data point $ {c}_k $. The method thereby combines a regression VAE (R-VAE) with probabilistic clustering using mixture models (MM) in a coherent statistical model.

Figure 5

Table 1. Overview of methods for the identification of weather regimes and associated parameter choices

Figure 6

Figure 6. Identified weather regimes (top rows) and corresponding odds ratios of extreme precipitation (bottom rows) for the four different methods with the number of clusters specified as k = 5. The regime frequencies are given in percent. The odds ratio of extreme precipitation corresponds to the ratio of the probability of the climatological 95th percentile of precipitation at the grid cell conditional on that weather regime, divided by the unconditional probability of 95th percentile of precipitation (i.e., 0.05). The weather regimes are ordered in decreasing order of total precipitation during the days assigned to this cluster by the respective method.

Figure 7

Figure 7. Precipitation clusters computed on precipitation reanalysis data without pre-processing using the k-means clustering algorithm for k = 8 (top rows) and corresponding z500 anomalies (bottom rows).

Figure 8

Figure 8. Ranked Probability Skill Score of an empirical prediction of total precipitation over Morocco using the weather regimes, shown for different numbers of weather regimes k. a) Skill score for the prediction of the tercile of the precipitation distribution, and b) Skill score for extreme precipitation, defined as a binary prediction above or below the 95th percentile. For probabilistic clustering, the skill score is computed using the most likely cluster at the given data point. The higher the RPSS, the more predictive the weather regimes are of precipitation over Morocco.

Figure 9

Figure 9. Regime persistence and separability. a) Distribution of mean persistence across $ k=5 $ regimes. Violin plots show the kernel density estimation of the distribution, the distribution median (white point) as well as the interquartile range (black box). The ranking of different methods in terms of persistence remains the same for different choices of $ k $. b) Silhouette score for a range of cluster numbers k. The silhouette score defined as the mean silhouette coefficient (b-a)/max(a,b), where a is the average intra-cluster distance and b is the average inter-cluster distance that is the average distance between all clusters.

Figure 10

Figure 10. Visualization of the 10-dimensional latent spaces in two dimensions using t-distributed stochastic nearest-neighbor embedding (t-SNE). Embedded data points are coloured according to the value of the target variable, total mean precipitation (top row), and according to the cluster they are subsequently assigned to (bottom row).

Figure 11

Figure 11. Distribution of the reconstruction loss, assessed using the mean squared error (MSE) between original input data and reconstructed data for all data points. The thin line in the boxes corresponds to the mean of the distribution, while the boxes extend to quartiles of the dataset. The whiskers extend to points that lie within 1.5 inter-quartile ranges of the opposite quartile, so the lower quartile in the case of the upper bound of the whiskers and the upper quartile in the case of the lower bound of the whiskers. Observations outside this range are displayed as black points.

Figure 12

Figure 12. Gridded and normalized z500 anomalies, as detailed in Section 3, on an example day 1940-01-04, showing the original data on the left and the reconstructions using different methods on the right.

Figure 13

Figure A-1. 50 subsamples containing 80% of data points each are created, and cluster centres are computed. The two sets of cluster centres are then paired by matching centres with the lowest Anomaly Correlation Coefficient (ACC). The lowest of these maximum ACC values is recorded, corresponding to the ACC of the least well-correlated cluster pair.

Figure 14

Figure A-2. Sample time series of cluster assignment in different methods.

Figure 15

Table A-1. Glossary of selected statistical and machine learning terminology based on Murphy (2022)

Author comment: Identifying probabilistic weather regimes targeted to a local-scale impact variable — R0/PR1

Comments

Dear Professor Manteleoni,

We would like to submit the following original contribution for consideration at Environmental Data Science: Identifying probabilistic weather regimes targeted to a local-scale impact variable.

Even though the relationship between weather regimes and extremes in local impact variables such as precipitation is a key motivation for their investigation, there are no comprehensive methods available for identifying probabilistic weather regimes targeted to a specific impact variable.

This motivated us to develop RMM-VAE, a new machine learning method that combines a regression task with a variational autoencoder architecture that integrates a mixture model, and hence probabilistic clustering, in the regularization of its latent space. To demonstrate the performance of the RMM-VAE method, we apply it to the task of identifying weather regimes over the Mediterranean region in extended winter targeted to precipitation over Morocco, which serves as our target variable here, and compare it to two linear, and one other machine learning method.

We are submitting this paper for consideration to Environmental Data Science because we believe this contribution to be relevant to both the machine learning as well as the climate and weather science communities, and value the positioning of the journal in this space, as well as support the journal’s commitment to open data and publishing standards. Please note that we have successfully submitted a brief outline of the proposed method as a short, 4-pages, paper to the workshop at NeurIPS 2023: Tackling Climate Change with Machine Learning.

Sincerely,

Fiona Spuler, Marlene Kretschmer, Yevgeniya Kovalchuk, Magdalena Balmaseda and Ted Shepherd

Contact details:

Fiona Spuler

f.r.spuler@pgr.reading.ac.uk

Brian Hoskins Building 2U06

University of Reading, Earley Gate, Whiteknights Rd, Reading RG6 6ET

Reading, United Kingdom

Review: Identifying probabilistic weather regimes targeted to a local-scale impact variable — R0/PR2

Conflict of interest statement

Reviewer declares none.

Comments

Review of “Identifying probabilistic weather regimes targeted to a local scale impact variable” by Spuler at al.

Recommendation: Major Revision

This paper presents a new machine learning method for identifying weather regimes oriented towards a scalar target variable. The novelty of the method lies in integrating the different objectives of identifying regimes on the one hand and robust clustering and prediction of a target variable on the other hand.

I am writing this review as one who is not an expert in machine learning, but can appreciate its application to real-world physical problems. As such, my suggestions are aimed at broadening the reach of the paper.

As written, this paper will be understood only by those who are already quite proficient in machine learning and the associated statistical mathematics. This is a pity, since the technique is interesting enough to be more widely appreciated by readers of Environmental Data Science. Equally important, the physical problem to which the new technique is applied (rainfall over Morocco and the associated larger-scale circulation patterns) is given very short shrift until very late in the paper. The physical problem is barely mentioned in the introduction, while the actual results of the new method are interpreted in the physical sense only very late in the paper (in section 4.5).

My suggestions for improving the paper to make it far more accessible are as follows:

(1) The following discussion in Section 4.5: “…literature on extreme precipitation events in the Western Mediterranean which highlights dynamically driven moisture flux from the Atlantic as a key driver …show that geopotential height anomalies associated with extreme precipitation over the Western Mediterranean, … are associated with an alignment of the subtropical jet with the African coastline and anomalous southwesterly surface to mid-tropospheric flow which leads to large-scale ascending motions and instability over the Western Mediterranean region” should be moved to much earlier in the paper, in Section 1.4 (Contribution). Further this discussion should be expanded (along the lines taken much, much, later in the paper) to describe the techniques previously tried to forecast Moroccan rainfall using the circulation, and their success (or lack of it).

(2) Throughout the discussion of the technique, define all terms used that may not be familiar to all readers. Some examples are: “latent variable”, ‘priors”, “encoders”, “auto-encoders”, “latent space”, “variational auto-encoders” and “regularization”, “inference model”, “generative model.” Also make sure to define all of the many mathematical variables appearing in the equations (2)-(5). Although most of the variables are defined in the captions to Figures 3 and 4, they need also to be defined in the paper (then the captions can be shortened). The authors should consider foregoing the many equations, and describing their essence in words

More minor concerns:

(1) In Section 1.1, The paper of Cassou (2008) already referenced, should be mentioned in the context of cluster analysis and MJO-teleconnections. Also in this section, the authors might be interested in a paper that uses cluster analysis of the Indian region summer circulation to reconstruct local and regional cycles (Straus, 2022).

(2) Section 2.2: Leave out equation (1): since none of the terms are defined, it doesn’t add anything to the paper.

(3) Section 3.1 – choice of dividing Z500 by its standard deviation at every gid point breaks the quasi-geostrophic relationship between Z500 and the horizontal wind field. The authors should discuss this.

(4) In Section 4.1, Figure 6 needs to be explained better in the text. What are the axes in the figure? What is the “perplexity” mentioned in the figure caption?

(5) Figure 7 – caption needs expanding – that are the black blobs ?

Reference:

Straus, D.M. Preferred intra-seasonal circulation patterns of the Indian summer monsoon and active-break cycles. Clim Dyn 59, 1415–1434 (2022). https://doi.org/10.1007/s00382-021-06047-6

Review: Identifying probabilistic weather regimes targeted to a local-scale impact variable — R0/PR3

Conflict of interest statement

I co-authored publications with one of the co-authores ( M. Balmaseda) in the past 3 years, but I feel able to neutrally assess this manuscript and this relationship did not affect my recommendation.

Comments

Review of manuscript EDS-2023-0075 by Spuler et al. entitled “Identifying probabilistic weather regimes targeted to a localscale impact variable”

The manuscript at hand explores different methods to identify large-scale circulation patterns that affect precipitation over Morocco in extended winter. It introduces a new machine learning method, RMM-VAE, which aims to identify patterns targeted to an impact variable, here Moroccan precipitation. The novel method is compared to classical PCA, CCA, and simpler version of the ML method and assessed in terms of reconstruction of the input data, ability to “predict” the target variable, and persistence, robustness, and separability of the clusters. RMM-VAE does a very good job in reconstructing the input data and in identifying patterns which represent Moroccan precipitation very well. However, the method is less stable in terms of identifying persistent and well-separated circulation patterns, compared to classical PCA, probably due to fact that these patterns are less rooted in dynamics.

The study is very well-written and structured. The choice and presentation of figures is appropriate and well-understandable. The study aims to bridge data- and domain science - an increasing challenge in the era of machine learning - and will provide an interesting contribution to the field. My only concern is that it could even do a bit more in emphasising the need for including domain knowledge in developing new ML methods and I provide some suggestions below. I particularly have the impression, that the tone of the manuscript is that the novel ML method RMM-VAE was superior to classical approaches which likely detect physically more meaningful (and therefore more predictable patters, e.g. PCA). Indeed, your results show, that clusters identified by PCA separate best and persist longer than those targeted on an impact variable. This is likely due to the fact that PCA identifies dynamical modes (in a physics sense) and is less of a mere categorisation compared to targeted circulation types. This matters for the application of such regimes, as - if rooted in physics - patterns are likely more predictable. Some rewording to keep a better balance and more discussion of how domain knowledge could help in also identifying the predictable modes would help. Overall I am happy with the manuscript and recommend to accept with minor revisions, provided my concern above, two other main comments below are addressed. In addition I provide some further remarks for consideration.

1. When exploring how well-suited the different methods are in distinguishing precipitation events and distribution you are talking of “prediction”. However, you do not mean prediction in the classical NWP forecasting sense, and thus the wording might be misleading. You must stress more that you are exploring the ability of the methods of representing distinct precipitation patterns. Or in simple words, if there is a robust signal in precipitation in the different clusters, and not a huge intra-cluster precipitation variability. Furthermore, different methods to assess this could be explored or should be mentioned. E.g. Schiemann and Frei (2010) systematically assessed a bulk of cicrulation type classification from the COST733 in terms of their ability to represent Alpine precipitation using the Brier Skill Score.

2. In the discussion of the methods suitability for forecasting extremes page 17, line 47ff page 18 l 45) it should be stressed more the finding of Bloomfield et al. 2020, that TCTs compared to WRs only perform well in the first 2 weeks, while WRs (PCA) performs better on S2S / extended-range time scales. As WRs are more rooted in dynamics and can be seen as dynamical systems with a distinct life cycle they are more predictable in a NWP system. Their link to teleconnection then aids models in S2S forecasting (e.g. Yiou and Nogaj 2004, Beerli and Grams 2019). The ascpect of the TCTs / targeted regimes more rooted in the actual impact vs. the classical weather regimes rooted in dynamcis with certain physical processes driving them with consequences for extended-range predictability should be stressed more and could be discussed as a topic for future research. References for the general predictability of regimes, and their link to teleconnections are e.g. Faranda et al. 2017, Hochman et al. 2021, Ferranti et al. 2015, Büeler et al. 2021.

Further comments:

- The terminology “probablilistic” weather regime in title, abstract, and elsewhere is confusing to me and could be removed or required more explanation

- page 1 line 32 predicting -> representing

- Section 1.1 (or later Section 1.3) could link a bit more directly to the Morrocan precipitation in focus of this study and that weather regimes in general are useful to describe weather variability on S2S time scales. E.g. are Yiou and Nogaj (2004) are an early reference for that and Pasquier et al. 2019 specifically also show atmospheric river modulation during regimes being important for Iberian / North African precipitation.

- Section 1.3: The tone implies as if you would provide a method which remedies all these gaps, but this is not entirely true. Some more modest wording would be appropriate.

-Section 1.2: Some discussion of the need for persistence and how it might (or might not) be considered is missing in the methods overview. In Section 1.1 you mention (line 11 p1) that persistence is important for regimes. In this section you ignore this important requirement.

-page 4 line 3-7: Dorrington et al. 2023, 2024 provide such an approach (similar to Rouges 2023) and proove its usefulness in forecasting. This should be acknowledged but the need for exploring other clustering methods more specifically including the target variable in the regime definition deduced as an additional motivation for your study.

- Section 2.1 page 5: perhaps not only stating it is the most common but also why: make a statement that this is most common as European regimes are little sensitive to details of the methodological choice (and here regimes have been studied a lot), but in other world regions a more exact method might be beneficial -> this is an additional motivation for your paper!

Section 3 page 9

- line 16: directly showing the data domain would help

- line 20: there is a problem regarding regime stability across decades if you are using such a long data period 1940-2022. The decadal regime variability must be discussed here or elsewhere, and in the discussion and how this might affect your results. See e.g. Dorrington and Strommen 2020.

- line 24: how reliable is precipitation data in the pre-satellite era? Make a statement that you consider precipitation in the ERA5-“model” climate.

- line 27: low-pass filter e.g. Lanczos, would be more appropriate than running mean.

- line 45-46: based on which criteria are k=5 a reasonable choice?

- Table 1: why 0.25° for precip, is the resolution different to other ERA5 data?

Section 4 page 10-12

- Discussion of reduced space in Section 4.1 (page 11, line 33). Make clearer that R-VAE and RMM-VAE disentangle /organise the dimensions better in this specific 2D space. A mere visual inspection of Figure 6 indicates that PCA and CCA also do a good job in disentangling mean precipitation into clusters (e.g. cluster 1 and high precip values). Why make it sound like a problem?

- page 11, line 40: RMSE or MSE? -> MSE

- page 13 line 32: Here the wording about “prediction” becomes confusing. All the analysis is based on reanalysis data, not on forecast data. So the wording in this section would be easier to make clear you explore how well the different methods are able to link the regime to actual precipation events, or how robustly they represent surface weather modulation. It does not give information, yet, on how well these regimes are forecast.

- Section 4.4 : There is a trade off with PCA apparently identifying well separated, and physically meaningful regimes, vs. targeted methods identifying regimes, which well explain surface weather modulation.

Open question which type in the end is better predicted in NWP models. This trade off should be emphasised more and not with one sentence only (page 18, line 44-45) in the discussion and a side note (page 18, line 2). E.g. Beerli and Grams 2019 nicely showed how regimes link to large-scale surface weather and their modulation by the stratospheric teleconnection adds surface weather prediction.

- Appendix B: You mention that the metrics are sensitive to a larger geographical region. Does it also change the patterns? One of your arguments, is that VAE methods would be statistically more consistent, thus one would expect less sensitivity, instability in regime detection. if domain is changed (more stable detection) can you show, that PCA is more sensitive to shift in domain than e.g. RMM-VAE?

References to consider:

Beerli, R., and C. M. Grams, 2019: Stratospheric modulation of the large-scale circulation in the Atlantic–European region and its implications for surface weather events. Q.J.R. Meteorol. Soc., 145, 3732–3750, doi:10.1002/qj.3653.

Büeler, D., L. Ferranti, L. Magnusson, J. F. Quinting, and C. M. Grams, 2021: Year-round sub-seasonal forecast skill for Atlantic–European weather regimes. Q. J. R. Meteorol. Soc., 147, 4283–4309, doi:10.1002/qj.4178.

Dorrington, J., and K. J. Strommen, 2020: Jet Speed Variability Obscures Euro-Atlantic Regime Structure. Geophysical Research Letters, 47, e2020GL087907, doi:https://doi.org/10.1029/2020GL087907.

Dorrington, J., C. Grams, F. Grazzini, L. Magnusson, and F. Vitart, 2024a: Domino: A new framework for the automated identification of weather event precursors, demonstrated for European extreme rainfall. Quarterly Journal of the Royal Meteorological Society, 150, 776–795, doi:10.1002/qj.4622.

Dorrington, J., M. Wenta, F. Grazzini, L. Magnusson, F. Vitart, and C. Grams, 2024b: Precursors and pathways: Dynamically informed extreme event forecasting demonstrated on the historic Emilia-Romagna 2023 flood. EGUsphere, 1–27, doi:10.5194/egusphere-2024-415.

Faranda, D., G. Messori, and P. Yiou, 2017: Dynamical proxies of North Atlantic predictability and extremes. Scientific Reports, 7, 41278, doi:10.1038/srep41278.

Ferranti, L., S. Corti, and M. Janousek, 2015: Flow-dependent verification of the ECMWF ensemble over the Euro-Atlantic sector. Q.J.R. Meteorol. Soc., 141, 916–924, doi:10.1002/qj.2411.

Hochman, A., G. Messori, J. F. Quinting, J. G. Pinto, and C. M. Grams, 2021: Do Atlantic-European Weather Regimes Physically Exist? Geophys. Res. Lett., 48, e2021GL095574, doi:10.1029/2021GL095574.

Pasquier, J. T., S. Pfahl, and C. M. Grams, 2019: Modulation of Atmospheric River Occurrence and Associated Precipitation Extremes in the North Atlantic Region by European Weather Regimes. Geophys. Res. Lett., 46, 1014–1023, doi:10.1029/2018GL081194.

Schiemann, R., and C. Frei, 2010: How to quantify the resolution of surface climate by circulation types: An example for Alpine precipitation. Physics and Chemistry of the Earth, Parts A/B/C, 35, 403–410, doi:10.1016/j.pce.2009.09.005.

Yiou, P., and M. Nogaj, 2004: Extreme climatic events and weather regimes over the North Atlantic: When and where? Geophys. Res. Lett., 31, doi:10.1029/2003GL019119.

Recommendation: Identifying probabilistic weather regimes targeted to a local-scale impact variable — R0/PR4

Comments

Many thanks for submitting this interesting and relevant paper, and thank you for your patience while we sought reviewers.

Both reviewers have commented positively on the substance of the manuscript but have asked for some additions and slight restructuring to help orient readers who might be less familiar with the domain or the methods. These are fairly minor ‘major revisions’ but please be sure to address each of the reviewers' comments in full.

Decision: Identifying probabilistic weather regimes targeted to a local-scale impact variable — R0/PR5

Comments

No accompanying comment.

Author comment: Identifying probabilistic weather regimes targeted to a local-scale impact variable — R1/PR6

Comments

Dear Editor,

We would like to re-submit the following original contribution for consideration at Environmental Data Science: Identifying probabilistic weather regimes targeted to a local-scale impact variable after responding to the initial reviewer comments. We believe that the comments of the two reviewers significantly improved the paper, in particular in terms of accessibility and ability to reach a wider audience.

As stated in the initial submission, we are submitting this paper for consideration to Environmental Data Science because we believe this contribution to be relevant to both the machine learning as well as the climate and weather science communities, and value the positioning of the journal in this space, as well as support the journal’s commitment to open data and publishing standards. Please note that we have successfully submitted a brief outline of the proposed method as a short, 4-pages, paper to the workshop at NeurIPS 2023: Tackling Climate Change with Machine Learning.

Sincerely,

Fiona Spuler, Marlene Kretschmer, Yevgeniya Kovalchuk, Magdalena Balmaseda and Ted Shepherd

Review: Identifying probabilistic weather regimes targeted to a local-scale impact variable — R1/PR7

Conflict of interest statement

No competing intetrests

Comments

I think the paper can be accepted in its current form. The clarity has been greatly improved. One minor point that could be cleared up is the Caption to Figure 11. The expression “...points that lie within 2.5 inter-quartile ranges of the opposite quartile” seems obscure and should be explained better.

Review: Identifying probabilistic weather regimes targeted to a local-scale impact variable — R1/PR8

Conflict of interest statement

Reviewer declares none.

Comments

Review of revised manuscript EDS-2023-0075 by Spuler et al. entitled “Identifying probabilistic weather regimes targeted to a localscale impact variable”

I thank the authors for the thorough revision of the manuscript based on both reviewers' comments. I particularly appreciate the in-depth explanations for choices made, especially regarding statistical terminology vs. suggested (NWP) domain-specific terminology.

Having once more read the entire manuscript, with the additional explanations, I do no longer struggle about the meaning of terminology, when switching between domain and data science.

In terms of dynamical interpretation I still have few minor comment regarding the interpretation of the identified regimes by the different methods.

- In Figure 6, the fact that CCA k-means accounts for the spatial pattern in precipitation nicely reveals that there are different drivers for extreme precipitation affecting different regions.

While this is discussed in the Conclusion / Disuccsion somewhat, from a dynamical perspective this is a very important result! The discussion here on page 17 lines 7-18 (_review_diff.pdf Version), leaves it as a side note. I think here and in the discussion this result deserves a bit more attention. Also RMM-VAE v1 seems to catch up some of the signal, so thoughts on how it could be improved more to account for spatial heterogeneity and picking up patterns with various locations of the cut-off would be nice.

- page 237 lines 18ff (_review_diff.pdf Version): Discussion of Figure 10: I wonder how much the fact that PCA does seem to not separate too well, is simply due to the way data is shown is. Comparing pixels with high values of precipitation to the cluster separation suggests that high precip data points all cluster in the dark violet cluster. However, as they are spread across the latent space the figure appears noisy.

Overall I am happy with the revised manuscript, its balanced tone, and find an important piece of work which helps stressing the need for collaboration across the domain, statistical, and data sciences.

I recommend the manuscript to accept as is although authors might still consider my remaining minor comment regarding different dynamical drivers of extreme precipitation.

Recommendation: Identifying probabilistic weather regimes targeted to a local-scale impact variable — R1/PR9

Comments

Thank you for fully responding the reviewers comments and concerns. As part of the final preparation of the manuscript please consider addressing the reviewers' final minor comments

Decision: Identifying probabilistic weather regimes targeted to a local-scale impact variable — R1/PR10

Comments

No accompanying comment.