Hostname: page-component-89b8bd64d-shngb Total loading time: 0 Render date: 2026-05-07T18:45:48.744Z Has data issue: false hasContentIssue false

Modeling of spatial extremes in environmental data science: time to move away from max-stable processes

Published online by Cambridge University Press:  15 January 2025

Raphaël Huser*
Affiliation:
Statistics Program, Computer, Electrical and Mathematical Sciences and Engineering (CEMSE) Division, King Abdullah University of Science and Technology (KAUST), Thuwal, Saudi Arabia
Thomas Opitz
Affiliation:
INRAE, Biostatistics and Spatial Processes (BioSP, UR546), Avignon, France
Jennifer L. Wadsworth
Affiliation:
School of Mathematical Sciences, Fylde College, Lancaster University, Lancaster, UK
*
Corresponding author: Raphaël Huser; Email: raphael.huser@kaust.edu.sa

Abstract

Environmental data science for spatial extremes has traditionally relied heavily on max-stable processes. Even though the popularity of these models has perhaps peaked with statisticians, they are still perceived and considered as the “state of the art” in many applied fields. However, while the asymptotic theory supporting the use of max-stable processes is mathematically rigorous and comprehensive, we think that it has also been overused, if not misused, in environmental applications, to the detriment of more purposeful and meticulously validated models. In this article, we review the main limitations of max-stable process models, and strongly argue against their systematic use in environmental studies. Alternative solutions based on more flexible frameworks using the exceedances of variables above appropriately chosen high thresholds are discussed, and an outlook on future research is given. We consider the opportunities offered by hybridizing machine learning with extreme-value statistics, highlighting seven key recommendations moving forward.

Information

Type
Position Paper
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2025. Published by Cambridge University Press
Figure 0

Figure 1. Left: Observational grid (gray) of the UK Climate Projections (UKCP) daily precipitation data, showing a particular horizontal transect (red) and a $ 32\times 32 $ subregion (blue). Middle: Illustration of underlying daily random fields $ {Y}_i\left(\boldsymbol{s}\right) $ (gray) for the UKCP data over the summers 1981–2000 along the selected transect, with three of the daily fields that contribute to the pointwise annual maximum for the years 1988 (yellow), 1990 (red), and 1996 (blue). Right: Illustration of underlying random fields $ {Y}_i\left(\boldsymbol{s}\right) $ (gray) for the UKCP data over the summers 1981–2000 along the selected transect, with the pointwise annual maximum $ {Z}_n\left(\boldsymbol{s}\right) $ for the years 1988 (yellow), 1990 (red), and 1996 (blue), corresponding to the three individual extreme events shown in the middle panel. In each case, there is no $ j=1,\dots, n $ such that $ {Z}_n\left(\boldsymbol{s}\right)={Y}_j\left(\boldsymbol{s}\right) $ for all $ \boldsymbol{s} $.

Figure 1

Figure 2. For a pair of variables $ \left\{Y\left({\boldsymbol{s}}_1\right),Y\left({\boldsymbol{s}}_2\right)\right\} $ from various extreme-value models, illustration of the domains (colored areas) over which each model is meant to provide accurate tail probability approximations. Left: max-stable process; Middle: Pareto process for various aggregation functionals, $ r $; and Right: Spatial conditional extremes process, for each conditioning variable.

Figure 2

Figure 3. Examples of estimates of $ {\chi}_D(u) $ for four environmental datasets. Solid black lines represent point estimates, whereas dashed black lines are approximate $ 95\% $ pointwise confidence intervals based on block bootstrapped estimates. From left to right: UKCP daily precipitation data from Figure 1 at $ d=1024 $ sites (within the blue subregion shown in the left panel of Figure 1), with the model fit in blue (based on the Huser and Wadsworth, 2019 model fitted using a pretrained neural Bayes estimator); gridded conditionally simulated E-OBS Irish summer temperature data at $ d=178 $ locations; (detrended) Red Sea surface temperature data at $ d=144 $ locations in the Gulf of Aqaba; daily mean windspeed at $ d=7 $ locations in the Vaucluse “département” in France.

Figure 3

Figure 4. Three of the five estimated parameters obtained from local fits of the anisotropic Huser and Wadsworth (2019) copula model in all subgrids of size $ 32\times 32 $ ($ d=1024 $) within the study domain for the UKCP precipitation example from Figure 1. The plots display, for each local subgrid, the estimated range parameter (left), smoothness parameter (middle), and shape parameter (right) controlling the asymptotic dependence type; the two anisotropy parameters (stretch and rotation) are not shown for brevity. All of these estimates were obtained in a few minutes in total by reusing the pretrained censored neural Bayes estimators from Richards et al. (2024).

Author comment: Modeling of spatial extremes in environmental data science: time to move away from max-stable processes — R0/PR1

Comments

Dear Editor,

We thank you for the invitation to write this opinion piece about environmental data science and extremes. In this paper, we discuss the severe limitations of max-stable processes, which are currently widely used in environmental applications. After strongly arguing against their systematic use in environmental extreme data science, we discuss more appropriate modeling framework based on peaks-over-thresholds, as well as the use of modern hybrid statistical machine learning approaches that enhance the modeling and inference of extreme events.

We hope you will appreciate our contributions, and look forward to hearing from you.

With thanks and best wishes,

Raphael Huser (on behalf of all authors)

Review: Modeling of spatial extremes in environmental data science: time to move away from max-stable processes — R0/PR2

Conflict of interest statement

Reviewer declares none.

Comments

Summary:

This position paper offers a nice and concise overview of the major limitations associated with utilizing the class of max-stable process models for spatial extremes, along with presenting several viable alternative models and methods. Such a contribution is timely and holds significant interest, particularly considering the rapidly expanding focus on modeling spatially-natured climate and weather extremes across several research communities. Below, I provide some comments and suggestions aimed at enhancing accessibility and appreciation for a broader audience.

Major comment:

Given the potentially diverse background of the readers this position paper aims to reach, I believe it would be highly beneficial to use real data as much as possible for selected variables where spatial extreme analysis is to be performed. Specifically, Fig. 1 could be replaced by real data (or a gridded data product to avoid spatial gaps). I believe daily precipitation could be a suitable variable to illustrate the point that pointwise maximum is unlikely to be obtained from the original data, as well as making the main message for Fig. 3.

In general, I believe it would be beneficial to focus on one (or more) representative running examples, similar to what Davison et al. (2012) did. Specifically, the selected running example(s) can be used to not only demonstrate the named limitations but also illustrate the benefits of using alternative methods in Section 3.

Ref:

Davison, Anthony C., Simone A. Padoan, and Mathieu Ribatet. ``Statistical modeling of spatial extremes.‘’ (2012): 161-186.

Minor comments:

1. page 2, lines 8-11, ``massive high-quality environmental data products based on observation and simulation...‘’

Should provide some representative references for each data type here.

2. page 2, line 19, ``in order to provide sound extrapolation into the tail of the distribution...‘’

The authors may want to make this more concise to ensure that the interdisciplinary readership understands precisely what it means.

3. page 2, line 29 ``and subasymptotic forms of tail dependence‘’

Provide a pointer to the relevant subsection in this paper.

4. page 3, line 20, ``has already realized some of their limitations‘’

Add the reference to Huang et al. (2021) here. Specifically, Figs. 1 and 2 in that paper concretely demonstrate the time-mismatch issue in the context of concurrent extremes, which can easily arise in a spatial setting.

Ref:

Huang, Whitney K., Adam H. Monahan, and Francis W. Zwiers. “Estimating concurrent climate extremes: A conditional approach.” Weather and Climate Extremes 33 (2021): 100332.

5. page 5, Fig. 1

Related to the major point I arise, it would be more comprehensible, especially for the dominant scientists who are the ‘users’ for applying statistical models and methods, to utilize real data applications such as extreme precipitation, temperature, or wind speed.

6. The dividing line between the limitations ``The rigidity of stability‘’ and ``Models only for limited sub-classes of possible tail structures‘’ is not that clear to me, and the authors may consider combining them into one section.

7. page 7

I find the argument of ``Lack of physically realistic models‘’ a bit weak on the physical side, and it would benefit from elaborating on the potential use of physically-meaningful stochastic partial differential equation models in modeling extremes to make it more accessible to a wider audience.

8. page 8, lines 25-26 ``they often remain quite expensive to apply in moderate-to-high dimensions in terms of number of spatial locations‘’

It would be ideal to provide a numerical range to give domain users an idea of what ``moderate-to-high dimensions‘’ might mean in terms of spatial locations.

Review: Modeling of spatial extremes in environmental data science: time to move away from max-stable processes — R0/PR3

Conflict of interest statement

Reviewer declares none.

Comments

As the manuscript’s title suggests, the objective of this paper is argue for the field to move away from MSP based analyses and towards other approaches. To this end, the authors highlight the drawbacks of MSP based modeling approaches and then propose alternative approaches.

The authors' thesis is important and interesting, and may be of interest to Environmetal Data Science readers in general. After addressing the comments below the paper should be considered for publication.

Comments:

* It is important to keep the target audience in mind for a paper such as this, because readers from differing backgrounds may receive different take home messages. Readers from the extremes community are likely well aware of many of the drawbacks that you mention, and would likely be supportive of rigorous model checking to ensure that the assumed model is a reasonable fit for the data. However, applied scientists may not be able to follow all aspects of the paper and may miss some of the important take away messages.

For example, I often collaborate with hydrologists and engineers for the purpose of modeling environmental extremes, and I have found that is still not uncommon for researchers like this to rely on subjective approaches, such as regional frequency analysis. I am a bit concerned that readers from this community may read this paper and abandon extremes based approaches in general. I would urge you to revise your paper to keep this perspective in mind.

* For applied researchers to embrace an approach such as MACHINE, they will need statistical software packages. One reason for the ubiquity of MSP based modeling approaches is the fitmaxstab function in the SpatialExtremes R package (as you mention in the paper). What software do you recommend for applied researchers to make use of your proposed approaches? If well established user friendly packages do not exist yet you should mention this as well.

* It may also be worth pointing out that other spatial extremes based approaches may be appropriate in certain circumstances. For example, if pointwise return level estimates are the desired output, a latent process model (or the so called spatial GEV model) may work well. This is important to point out as well.

Minor Comments:

* Maybe define MSP to denote `max stable process' at the beginning since you use it so often.

* Maybe mention that you propose a seven step procedure in the abstract. This seems to be an important take away of the paper.

* Define AI at first use (first page).

* l18-24: you use ‘hand’ three times in two sentences. This reads a bit awkward. Could you reword without losing anything?

* p4 l18: can you reword to avoid having to make the citation possessive?

* p4 l20: change `boosted' to something else (maybe buoyed?).

* p5 l29: should not be possessive

* p7 l40: artifacts is misspelled

* p8 l43: here you use double quotes whereas you’ve previously used single quotes (need to be consistent)

* p9 l12: artifacts is misspelled again

* p14 l46: I’m not sure if you mean systemic or systematic?

Recommendation: Modeling of spatial extremes in environmental data science: time to move away from max-stable processes — R0/PR4

Comments

Both reviewers highlighted the relevance of the manuscript but also pointed out several aspects that need to be addressed by the authors before it can be accepted for publication in EDS. Based on their feedback, my recommendation is for a minor revision.

Decision: Modeling of spatial extremes in environmental data science: time to move away from max-stable processes — R0/PR5

Comments

No accompanying comment.

Author comment: Modeling of spatial extremes in environmental data science: time to move away from max-stable processes — R1/PR6

Comments

Dear Editors, and Reviewers,

We have now revised our opinion paper, and have thoroughly addressed all comments raised by the reviewers.

We hope you will appreciate our efforts to make the paper more accessible to a broad applied audience, and that you will find it satisfactory for publication in Environmental Data Science.

We have attached our point-by-point response, as well as a Diff file highlighting all our changes.

We look forward to hearing back from you, and thank you for your consideration.

With many thanks and best wishes,

Raphaël Huser (on behalf of all authors)

Review: Modeling of spatial extremes in environmental data science: time to move away from max-stable processes — R1/PR7

Conflict of interest statement

Reviewer declares none.

Comments

This version has sufficiently addressed my comments and, in my opinion, should be accepted for publication.

Review: Modeling of spatial extremes in environmental data science: time to move away from max-stable processes — R1/PR8

Conflict of interest statement

Reviewer declares none.

Comments

Thank you for adequately addressing my concerns.

Recommendation: Modeling of spatial extremes in environmental data science: time to move away from max-stable processes — R1/PR9

Comments

Both reviewers were highly satisfied with the revised manuscript, so I am pleased to recommend it for publication in EDS.

Decision: Modeling of spatial extremes in environmental data science: time to move away from max-stable processes — R1/PR10

Comments

No accompanying comment.