Exploring the nonstationarity of coastal sea level probability distributions

Abstract Studies agree on a significant global mean sea level rise in the 20th century and its recent 21st century acceleration in the satellite record. At regional scale, the evolution of sea level probability distributions is often assumed to be dominated by changes in the mean. However, a quantification of changes in distributional shapes in a changing climate is currently missing. To this end, we propose a novel framework quantifying significant changes in probability distributions from time series data. The framework first quantifies linear trends in quantiles through quantile regression. Quantile slopes are then projected onto a set of four orthogonal polynomials quantifying how such changes can be explained by independent shifts in the first four statistical moments. The framework proposed is theoretically founded, general, and can be applied to any climate observable with close-to-linear changes in distributions. We focus on observations and a coupled climate model (GFDL-CM4). In the historical period, trends in coastal daily sea level have been driven mainly by changes in the mean and can therefore be explained by a shift of the distribution with no change in shape. In the modeled world, robust changes in higher order moments emerge with increasing $ {\mathrm{CO}}_2 $ concentration. Such changes are driven in part by ocean circulation alone and get amplified by sea level pressure fluctuations, with possible consequences for sea level extremes attribution studies.


Introduction
Regional and global sea levels are affected by both natural climate variations and anthropogenic climate change, with possible repercussions on densely populated coastal communities of great concern.Anthropogenic global warming recently motivated a number of studies characterizing rates and causes of sea level rise since the early 20th century.A reconstruction of global sea level trends since 1900 has been recently proposed by Dangendorf et al. (2019) and Frederikse et al. (2020).Both studies show a robust increase in mean sea level with trends of 1:6 ± 0:4 and 1:56± 0:33 mm/year over the periods 1900-2015 and 1900-2018, respectively.Such global trends are not constant over time and marked by an acceleration over the recent three decades, as quantified by the 3:1± 0:3 mm/year trend measured by altimetry since 1993 (WCRP Global Sea Level Budget Group, 2018.This acceleration has been mainly ascribed to an increase in ocean heat uptake driven by changes in Southern Hemisphere westerlies, as well as increased mass loss over Greenland (Dangendorf et al., 2019;Frederikse et al., 2020;Fox-Kemper et al., 2021).
Globally, the causes of trends in sea level since 1900 are reasonably well understood, with the largest contribution coming from glaciers (52%), followed by ocean thermal expansion (32%) and Greenland ice sheet mass loss (29%) plus a negative contribution coming from the land water storage (Frederikse et al., 2020).Regionally, on the other hand, sea level can be strongly affected by local variability in patterns of ocean circulation and water masses such as those caused by fluctuations in winds, ocean heating, and moisture fluxes (Han et al., 2017;Hamlington et al., 2020).Such dynamical changes can potentially mask sea level trends in regions dominated by large multidecadal variability.A known example is the observed large trend in sea level in the western Pacific ocean since 1990, mainly reflecting phases of the Pacific decadal oscillation (PDO) (Merrifield et al., 2012;Zhang and Church, 2012;Han et al., 2017).Changes not linked to ocean and atmosphere dynamics also play a role.An example is the vertical land movement in some areas as a consequence of glacial isostatic adjustment (GIA) since the last ice age, causing relatively large differences in long-term trends across basins (Tamisiea, 2011;Caron et al., 2018;Wang et al., 2021).Notwithstanding the large number of possible contributions, the sea level budget at tide gauges since 1958 has been recently closed in Wang et al. (2021).The authors identified sterodynamic sea level change as the main contributors to sea level rise in many locations, and GIA in few others (Wang et al., 2021) (as detailed in Gregory et al., 2019, sterodynamic changes arise from changes in ocean density and ocean circulation).
Trends in sea level at long time scales also impact changes in weather-like extremes.Recently, many studies have focused on quantification of current and future changes in sea level extremes driven by storms, tides, and waves (Buchanan et al., 2017;Wahl et al., 2017;Rasmussen et al., 2018;Sweet et al., 2020;Tebaldi et al., 2021).Such studies aim to characterize tails of sea level distributions in recent periods or in future scenarios via extreme value theory (EVT) (Coles, 2001) and often quantify changes in extremes solely as a function of changes in distributional mean (Tebaldi et al., 2021).A shift in the mean sea level is in fact recognized to be the primary driver of changes in tails of the distributions (Vousdoukas et al., 2018;Sreeraj et al., 2022.Few studies also explored extreme value analysis by considering changes in the median and width of the fitted generalized extreme value distributions.Examples range from the work of Wong et al. (2022), where the authors explored nonstationarity of extreme value statistics covarying with different climate variables, to Lee et al. (2017) quantifying links between frequencies of sea level extremes and global mean temperature.Similarly, Grinsted et al. (2013) investigated relationships between changes in extreme value analysis in storm surges and different predictors, from the PDO pattern to quasi-biennial oscillation among others.
While recent studies have focused on trends in the mean sea level or on large extremes through EVT, there has been less work quantifying how shapes of sea level probability distributions have been changing in the observational record and how they may change in a warming climate.This is no easy task, as quantifying trends in distributions and their significance under internal climate variability is not a welldefined problem and many measures could be adopted.Le Cozannet et al. (2019) studied changes in sea level distributions in climate models projections focusing on changes in the cumulative distribution functions.Another option would be to directly estimate changes in moments.A useful, comprehensive methodology was recently proposed by McKinnon et al. (2016) where the authors proposed a unified framework to study both linear changes in quantiles and moments of summer temperature time series.Changes in quantiles can be quantified through the quantile regression approach used by Bassett (1978) and Koenker and Hallock (2001).Temporal changes in quantiles of the distributions can be further summarized through projections onto few polynomials linking changes in quantiles to changes in statistical moments.McKinnon et al. (2016) empirically showed that Legendre polynomials may be suitable functions for this purpose.Motivated by quantifying changes in sea level rise and inspired by McKinnon et al. (2016), here we propose an alternative/complementary route to study temporal changes in probability distributions by building upon McKinnon et al. (2016) in two ways.First, based on the work of Cornish and Fisher (1937), we provide a general, analytical relationship between time changes in quantiles and the first four moments of a distribution.This analytical relationship allows us to build a theoretically founded methodology to explore changes in distributions.We show how the framework allows us to diagnose time changes in distributions as sums of independent shifts in the first four statistical moments.Second, we study the significance of such changes in the presence of internal climate variability by accounting for multiple testings (Benjamini and Hochberg, 1995).This approach limits the number of false positives in the inference step to obtain trustworthy results even in the presence of a large number of significance tests (James et al., 2013).Importantly, such linear framework is shown to work well even in case of weak nonlinearities, with few examples shown later with synthetic data in Section 3.4 and in Appendices F and G.
We apply the proposed framework to daily tide gauges data in the 1970-2017 period, for which many locations have reliable data, and in a few locations with longer records.We therefore quantify changes in sea level distributions and assess their significance under the internal variability of the system.
Observational results are then complemented by output from the GFDL-CM4 climate model (Held et al., 2019.We first focus on the historical experiment and compare it with observations.We then investigate a transient experiment with 1% CO 2 increase per year, starting from a preindustrial global CO 2 concentration and extending up to quadrupling after 140 years.Crucially, the modeled trends can be further decomposed into different sea level components.We therefore explore sea level rise as a result of ocean circulation only and when fluctuations in the atmospheric pressure at the sea surface (i.e., sea level pressure [SLP]) are included.
The paper is organized as follows.In Section 2, we present the data and the sea level decomposition adopted in this study.Section 3 presents the framework to study shifts in distributions.Results are presented in Section 4, and Section 5 concludes the paper.

Observational tide gauge record
We focus on local observations of daily sea level from tide gauge data provided by the University of Hawaii Sea Level Center (Caldwell et al., 2018) (https://uhslc.soest.hawaii.edu/).We first consider the period 1970-2017 and keep only tide gauges with less than 20% missing values.Data gaps are not filled.This step reduces the number of time series considered from 116 to 94.The time range 1970-2017 was chosen as a compromise between retaining high-quality, daily sea level observations and sampling a large portion of coastal areas (e.g., many tide gauges records in Japan do not start before 1969).We then investigate distributional shifts for tide gauges with more than 80 years of data and less than 20% of missing values, which amounts to 28 available tide gauges.Such long periods translate in a more robust statistical inference of changes in quantiles.For all tide gauges, the daily climatology is removed from each record by subtracting to each day its average over the whole period of interest.Details on the tide gauges preprocessing are further discussed in Appendix A. Start and end recording dates of tide gauges with records longer than 80 years are presented in Appendix B.

GFDL-CM4 climate model: Strengths and limitations
We consider outputs from the GFDL-CM4 model (Held et al., 2019).The ocean component is the MOM6 ocean model (Adcroft et al., 2019) with horizontal grid spacing of 0.25°and 75 vertical layers.With this grid spacing, the model cannot resolve many coastal bays and harbors.Nonetheless, the horizontal grid spacing of 0.25°allows to realistically represent the sloping of continental shelves and its sharp deepening at the boundaries with the open oceans.Such important feature cannot be represented in traditional 1°c oupled models.Accurate representation of coastal geometry is key to trustworthy simulations of storm surges and coastal sea level as discussed in Yin et al. (2020).
The atmospheric/land component is the AM4 model (Zhao, 2018a,b) with a horizontal grid spacing of roughly 1°and 33 vertical layers.Despite its relatively coarse grid spacing, the model simulates the physical characteristics of tropical cyclones reasonably well, allowing one to study their frequency and changes under different forcings (Zhao, 2018a).However, strong hurricanes (i.e., category 4 and 5) are not simulated, hence their impact on sea level extremes cannot be assessed in our study (Zhao, 2018a;Yin et al., 2020).
A drawback in CM4 is the missing contribution to sea level rise caused by melting of land ice due to the lack of an ice sheet model, similarly to most coupled models in the Coupled Model Intercomparison Project-phase 6 (CMIP6) (Eyring et al., 2016).Tides are not simulated and the associated tide surges (Rego and Li, 2018), driven by constructive interactions of tides and storm surges, are absent from CM4. Climate-unrelated factors such as GIA and terrestrial water storage, both relevant to sea level (mean) trends (Wang et al., 2021), are also not simulated by this model.As shown by Winton et al. (2020), CM4 is a high climate sensitivity model, and so it is likely too sensitive to anthropogenic forcing.A thorough investigation of CM4-model biases in terms of sea level variability was presented in Yin et al. (2020).Despite some of its limitations, the authors showed that the GFDL-CM4 model can serve as a useful framework to explore changes in sea level statistics under different forcing scenarios.It thus directly serves our interests in the present study.
The focus here is on two simulations.First, we consider one historical experiment in CM4, from 1970 to 2014 with external forcings consistent with observations.Second, we focus on an idealized experiment with a 1% CO 2 increase per year (hereafter "1pctCO2").This experiment simulates the climate system under a 1% increase of CO 2 per year for 150 years, from preindustrial global CO 2 concentrations up to quadrupling at year 140.We assume that the anthropogenic signal of sea level rise emerges during the first 50 years of the simulation (as shown by Yin et al., 2020 for the US east coast) and focus only on the last 100 years.The 1pctCO2 experiment is especially useful since under an incremental, yearly increase in CO 2 concentration, sea level statistics are expected to change almost linearly (at least in the last 100 years).This linear change allows us to leverage the framework proposed in Section 3 and to explore the emergence of changes in distributional shapes in coastal sea levels.
The model output is remapped to a uniform 0.5°grid and only grid cells in the latitudinal range À60 °,60 °½ are considered.In both simulations, outputs are daily averages.We focus only on coastal locations, accounting for 6,318 time series in each case.

Sea level decomposition
Following Gill and Niiler (1973), Yin et al. (2010), and Griffies and Greatbatch (2012), we decompose the sea level time tendency in a hydrostatic and Boussinesq ocean according to the following sea level budget where Δ refers to a temporal increment relative to an initial time.Variables η, P b , P a, and ρ , respectively, represent the sea level, bottom, and surface pressure and density at time t and longitude, latitude, vertical position x, y,z ð Þ.We make the Boussinesq approximation, which accounts for the constant reference density ρ 0 (e.g., Section 2.4 of Vallis, 2017).
In Eq. ( 1), P b is the ocean bottom pressure, so that ΔP b = ρ 0 g ð Þ measures the change in sea level associated with changes in water column mass.Changes in water column mass arise from the convergence of mass via ocean currents as well as water crossing the ocean free surface via e16-4 Fabrizio Falasca et al.
precipitation, evaporation, river runoff, and sea ice formation/melt.The SLP, P a , is caused by atmospheric and sea ice loading.The contribution from P a is referred to as the inverse barometer (e.g., Ponte, 2006), whereby increases in P a lead to a decrease in sea level.The final term in the budget (1) is the local steric effect, which is measured by the depth integral of changes to in situ ocean density.For example, a decrease in column integrated density, such as through ocean warming or freshening, leads to a sea level rise from local steric expansion.
Changes from bottom pressure and the local steric effect are associated with ocean dynamics and ocean density and are referred to as sterodynamic sea level changes (Gregory et al., 2019).Following the CMIP6 convention detailed in Appendix H of Griffies et al. (2016), the sea level diagnostic "zos" measures the regional sterodynamic changes by setting the global mean to zero at each diagnostic time step, with Griffies et al. (2016) referring to zos as the dynamic sea level.There have been many studies of changes to the global mean sea level (e.g., Bittermann et al., 2017;Le Bars et al., 2017;Frederikse et al., 2020;Palmer et al., 2021; among many others), with Yin et al. (2020) discussing the global thermosteric sea level rise in the CM4 model used here.When comparing models to observations, the global thermosteric sea level must be added to the dynamic sea level at every location in the ocean.However, our interest concerns changes in higher order moments that are independent of changes in the global mean sea level.Therefore, we focus our analysis on the dynamic sea level and inverse barometer only.
We furthermore focus on the anomaly patterns of the inverse barometer (i.e, we remove the daily climatology).Note that the inverse barometer contribution is not explicitly simulated by CMIP5 models.We thus diagnose this term offline by saving the SLP from the atmospheric model.As for tide gauges, we remove the climatological daily cycle from all fields before performing our analysis.

Methodology
In this work, we aim to quantify linear changes in sea level distributions in the observational records and climate model output.The methodology is general and can be potentially applied to any climate observable with close-to-linear changes in distributions.
Given a sea level time series, we apply quantile regression to measure linear trends in quantiles (Koenker and Bassett, 1978;Portnoy and Koenker, 1997;Koenker and Hallock, 2001).We then introduce a set of orthogonal functions and project the quantile slopes onto such functions, in order to quantify how changes in quantiles are driven by changes in the first four statistical moments.Crucially, slopes in the proposed polynomials are orthogonal to each other by construction, allowing us to decouple different sources of distributional changes.Here, we present the three main steps in the framework: (a) quantile regression, (b) projection onto polynomials, and (c) statistical significance test.A schematic of the proposed framework is presented in Figure 1.

Quantile regression
½ , of a random variable X returns a value x such that p ×100 ð Þ% of the values are less than x.For example, the 0.95 quantile Q X 0:95 ð Þ is a real number x such that 95% of the values of the random variable, X, are smaller than Þ= p, representing the probability p that X would take a value less than or equal to x.In what follows, we simplify the notation and refer to Q X p ð Þ as q p and to cumulative functions as F x ð Þ. Quantile regression allows us to estimate temporal changes in the conditional median (i.e., q 0:5 ) or any other quantile of a time-dependent distribution (Koenker and Bassett, 1978;Koenker and Hallock, 2001).Here, we focus on the case of linear regression.For each quantile, q p , we need to fit two parameters: an intercept β 0 q p À Á and a slope β 1 q p À Á (see Figure 1).Differently from linear regression for which we minimize the mean square error, the quantile regression process places asymmetric weights on positive and negative residuals (Koenker and Hallock, 2001).
Figure 1.Schematic of the proposed framework illustrated using a synthetic time series.We generate a time series g sampled from a time-dependent Beta distribution P s,t ð Þ (see Section 3.4).Temporal changes in statistical moments are computed analytically a priori and are chosen to come solely from the variance (second moment) and kurtosis (fourth moment).Step (a): we apply quantile regression for the range of quantiles q p , with p∈ 0:05,0:95 ½ every dp = 0:05 for a total of 19 slopes.In panel (a), we show the quantile regression for p = 0:95,0:5 and 0.05.Note that q = q p ð Þ and q p is used only for convenience.
Step (b): we project the 19 quantile slopes onto a set of four orthogonal polynomials (see panel b.3).Step (c): we quantify the statistical significance of coefficients dm i dt quantifying how independent changes in moments explain changes in quantiles computed in step (a) ( dm i dt ; with i∈ 1,4 ½ representing changes in mean, variance, skewness, and kurtosis, respectively).Note that in Section 3.3 we refer to dm i dt as a i to simplify the notation.Significant changes at the 95% level come for this synthetic time series solely from the second and fourth moments, as known from analytical results.Additional synthetic tests are presented in Section 3.4.
Formally, given n observations at times t i , written as g , and the assumption of linear relationship for each quantile q p t ð Þ,p ∈ 0,1 ½ , then the goal is to minimize the following cost function: arg min where the "check function" ½ , assigns asymmetric weights to residuals.For a given time series, we apply quantile regression for the range of quantiles q p with p ∈ 0:05,0:95 ½ every dp = 0:05 for a total of 19 slopes (as done in McKinnon et al., 2016).This number of slope was found to be sufficient to discover the right changes in moments, as shown later on in the next section.An example of quantile regression for p = 0:95,0:5 and 0:05 is shown in Figure 1a.
In practice, quantile regression studies depend on the existence of very fast algorithms (Chernozhukov et al., 2022).Portnoy and Koenker (1997) proposed a precise and time-efficient minimization method based on the Frisch-Newton algorithm.However, the computation time is still rather long in the case of multiple quantile regressions and bootstrap inference.To this end, very recently Chernozhukov et al. (2022) showed that the computation of many quantile regressions can be (vastly) accelerated by exploiting nearby quantile solutions.This method was essential for the completion of our work, especially when analyzing the climate model outputs, which included 12,636 time series each with 36,500 time steps.The algorithm can be found in the quantile regression package "quantreg" implemented in R as "quantile regression fitting via interior point methods" (i.e., method pfnb) (Koenker, 2022).In the case of sea level, each quantile q p has dimensionality of q p Â Ã = length ½ , so that the slopes have dimensions of

Linking changes in quantiles to changes in statistical moments
The quantile regression step allows us to quantify slopes in N quantiles.This quantification can come at the expense of interpretability, especially when N is large and when analyzing more than one time series.
To facilitate analysis and interpretation, we quantify how changes in quantiles of a distribution are driven by changes in the mean, variance, skewness, and kurtosis, therefore defining a single framework to study changes in quantiles and moments.All throughout the paper, we refer to the mean, variance, skewness, and kurtosis as m 1 ,m 2 ,m 3 ,m 4 ð Þfor practical convenience and report their mathematical expression in Appendix C. A possible way to link changes in quantile to changes in moments was proposed in McKinnon et al. (2016).The authors empirically derived a set of polynomials by observing how quantiles of a distribution change when changing its moments one at a time.The lack of orthogonality of such functions motivated the authors to adopt Legendre polynomials, which are orthogonal by construction and share similar shapes to the derived functions.Inspired by McKinnon et al. (2016), here we approach the problem from a theoretical point of view.We take advantage of the work of Cornish and Fisher (1937) and define a unified framework useful to study and explore temporal changes in both quantiles and moments of a distribution.

Stationary process
The starting point of our derivation is the Cornish-Fisher expansion (Cornish and Fisher, 1937;Wallace, 1958;Fisher and Cornish, 1960;Hill and Davis, 1968).Cornish and Fisher (1937) derived an asymptotic expansion expressing any quantile of a distribution as a function of its cumulants. 1Studies often focus on a modified version written in terms of the first four distributional moments.Furthermore, in practical applications higher order moments show large sensitivity to sampling fluctuations, and Bekki et al. (2009) showed that a fourth-order truncation often allows for very accurate estimations of quantiles.
Given a stationary distribution with mean m 1 , variance m 2 , skewness m 3 , and excess kurtosis m 4 parameters, the following asymptotic relation holds (Cornish and Fisher, 1937;Bekki et al., 2009): where q p is the quantile of the "true" distribution and z p is the quantile of a standard normal N 0, 1 ð Þ at p∈ 0,1 ½ .2In other words, z p is the inverse Þdυ of a Gaussian distribution with zero mean and unit variance.Importantly, q p , m 1, and ffiffiffiffiffi ffi m 2 p have the same dimension, whereas z p , m 3, and m 4 are nondimensional.A synthetic and a first climate-related test are discussed in Appendix D.
In the case of a normal distributions, m 3 = m 4 = 0 and the expansion (3) reduces to The inclusion of the third and fourth moments, m 3 and m 4 , allows us to approximate any quantile of a nonnormal distribution as a function of a normal (Gaussian) distribution.This formula is a powerful tool as it can provide asymptotic estimations of arbitrarily large quantiles, otherwise difficult from data alone.It is useful for distributions that do not show large differences from a normal distribution with the domain of validity further discussed by Maillard (2012) and Amédée-Manesme et al. (2019).

Nonstationary process
Here, we aim to understand and quantify how temporal changes in individual quantiles are driven by changes in statistical moments.In theory, each quantile q p is time dependent through all moments of the distribution.Here, we consider dependence only up to the first four moments: Here, m 1 ,m 2 ,m 3, and m 4 refer to the mean, variance, skewness, and excess kurtosis.We focus on linear changes in time and therefore write Each term ∂q p ∂m i in Eq. ( 5) can be further evaluated by differentiating Eq. (3): We assume relatively small deviation from Gaussian behavior and focus only on first-order changes in Eq. ( 6).This is a reasonable assumption as the dynamics of many climate observables often show a strong Gaussian component, especially when focusing on anomalies (after removing climatologies).In case of (relatively) large deviation from Gaussianity, the framework is still useful at first order as suggested by tests in Appendix D.1.In other words, we evaluate Eq. ( 6) locally at the point in the neighborhood of a Gaussian distribution.Therefore, we relate the slopes β 1 q p À Á computed in the quantile regression step in Eq. ( 2) to changes in moments as: Here, the set of polynomials, b i p ð Þ, are defined by evaluating the terms Such functions are scaled Hermite polynomials of the function z p and defined in the range p ∈ 0,1 ½ .The polynomials in Eq. ( 8) satisfy and so they are orthogonal to each other.We depict these four functions in Figure 1b. 3q.( 7) allows us to examine how time changes in distributional quantiles are driven by time changes in the first four moments.We note that apart from b 1 p ð Þ, all polynomials in Eq. ( 8) differ from the ones in McKinnon et al. (2016).As an example, an important difference is in the b 2 p ð Þ function, quantifying changes in variance.In the case of a drifting Gaussian distribution, b 2 p ð Þ quantifies the exact link between changes in moments and quantiles.In the "bulk" of the distribution, the function b 2 p ð Þ derived in Eq. (8) gives negative(positive) corrections to the correspondent function in McKinnon et al. (2016) for p greater (smaller) than p = 0:5.Furthermore, b 2 p ð Þ shows how changes in variance also drive changes in distributional tails, with quantiles diverging to þ∞ À∞ ð Þ when p ! 1 ð Þ0.The slopes β 1 q p À Á appearing in Eq. ( 7) can be efficiently estimated through the quantile regression step as shown in Eq. (2).The polynomials b i p ð Þ have been derived in Eq. ( 8) and follow from Cornish and Fisher (1937).Therefore, changes in moments, dm i dt , can be computed as the least-squares solution of Eq. ( 7).An example is shown in Figure 1b for a Beta distribution with changes coming exclusively from variance and kurtosis (i.e., m 2 and m 4 ).dm 1 dt is equivalent to a linear regression.We refer to the coefficients dm i dt as changes (or slopes) in moments.However, such inferred changes are orthogonal to each other, and this property may not be the case for changes in the true moments.The orthogonality constraint offers a useful and interpretable way to decompose independent sources of shifts in a distribution.
In the case of sea level, the dimensionality in changes in moments dm i dt ; i ∈ 1, 4 ½ is as follows:

Statistical significance
The significance test is performed via the bootstrapping methodology (Chernick et al., 2011).For a given dataset (i.e., model's output or tide gauges), we consider the j-th time series, x j t ð Þ, and estimate the statistical significance of each coefficient a i = dm i dt , i ∈ 1,4 ½ .We do so by permuting (with replacement) x j t ð Þ B times and recomputing the slopes in moments a i at each iteration.We choose a value of B = 1000 and show the sensitivity of such a choice for one time series in Figure F2 (results do not change on average for randomly chosen time series).This method allows to construct a distribution approximating the null distribution under the null hypothesis of stationarity.For this resampling approach, we use block-bootstrapping, with blocks of one season to satisfy the assumption of statistical independence required in the bootstrapping test (Chernick et al., 2011).The robustness on this choice of block size is mentioned in Section 4 and detailed in Appendix B. The specific blockbootstrapping considered is the moving block-bootstrapping that allows for overlapping blocks as described in Künsch (1989).
We consider the following two significance tests.
1.When dealing with one or few time series, we consider a two-tailed test and deem slopes as significant at level α if in the 1 À α 2 À Á % (or α 2 %) of the upper (or lower) tails of the bootstrapped distribution.We choose α = 0:05 (i.e., 95% significance level).2. When considering many time series, we face a multiple testing problem and more and more false positives are expected in the statistical inference (James et al., 2013.To control the ratio of false positives, we adopt the false discovery rate (FDR) test proposed by Benjamini and Hochberg (1995).For a given time series x j t ð Þ and a corresponding slope a i , i ∈ 1, 4 ½ , we compute a p-value p i .We perform this computation for all M time series in the dataset under investigation.We then rank all M p-values in the ascending order and keep only the first m < M so that p m is the largest pvalue such that p m <ϕ m M , where ϕ is the FDR.Note that the FDR is usually denoted by the letter "q".Here, we use ϕ to avoid confusions with quantiles q in the quantile regression step.In plain words, a FDR of 10% would imply that no more than 10% of the rejected null hypotheses are false positives.This method has been shown to be an efficient test to control on average the ratio of false positives arising from multiple testing (Benjamini and Hochberg, 1995) and found applications in climate science (Ventura et al., 2004;Wilks, 2006;Fountalis et al., 2018;Runge et al., 2019).
In our case, this step requires to approximate a p-value from a bootstrapped distribution.Given the j-th time series x j t ð Þ and a slope a i = dm i dt , i ∈ 1, 4 ½ , we denote a * 1 i , a * 2 i , …, a * B i as the B bootstrapped slopes.We define the correspondent p-value of a i as p i = , where and zero otherwise (James et al., 2013).

Synthetic tests
In this section, we test the proposed framework on two time-dependent distributions with known changes in moments.We consider two time-dependent processes sampled from a Gaussian and Beta distribution and display tests in Figure 2.
• We define a time-dependent Gaussian process P x,t ð Þ= 1 ffiffiffiffiffiffi ffi Þ 2 2bt defined over x ∈ ℝ with mean m 1 = at and variance m 2 = bt.a,b ∈ ℝ and t represents the time vector.We choose a = b = 5.In this case, we expect distributional changes to be driven exclusively by the mean and variance.
• We consider a stationary Beta distribution Þ , with Γ being the Gamma function (Davis, 1959) We then introduce a "drift" in this distribution by prescribing α = at and β = bt and choose a = 1 and b = 2.This defines a process with constant mean but time-dependent variance, skewness, and kurtosis.For these parameters, the exact moments are then In both cases, we create a time-dependent process g in the temporal range t ∈ 1, 3 ½ .In this temporal range, changes can be weakly nonlinear but still far from strong nonlinearities.Specifically, at each time step dt = 10 À4 we random sample from the two distributions P s,t ð Þ.We then apply the proposed framework to (a) estimate quantile slopes β 1 q p À Á and (b) compute the respective changes in moments dm i dt .By random sampling at each dt, consecutive time steps are independent by definition and block sizes in the boostrapping procedure are equal to 1 time step.Results for the Gaussian and Beta distributions cases are shown in Figure 2. The method correctly identifies the statistical moments driving changes in the distributions.We note that the proposed linear framework is still valid in the presence of weak nonlinearities such as for m 2 ,m 3, and m 4 for the process sampled from the Beta distribution.
Tests for a process with prescribed changes in only the variance, m 2 , and kurtosis, m 4 , are shown in the schematic in Figure 1.

Changes in coastal sea level distributions from tide gauges
Changes in sea level distributions measured by tide gauges for the period 1970-2017 are summarized in Figure 3 and Table 1.The statistical significance test used is the FDR (see Section 3.3) with ϕ = 0:05.The p-values are computed from the block-bootstrapped distributions (Section 3.3), and a sensitivity analysis on the chosen block-size is shown in Appendix E.
The main significant changes in the observed coastal sea level distributions come from a shift in the mean.Contributions coming from higher order moments are less significant and sporadic (see Table 1).Among the 92 tide gauges with significant changes in the mean, the average slope is 2.08 mm/year (see Table 1).However, if only the positive significant slopes are considered (78 of the 92 tide gauges), then the slope increases to 2.92 mm/year.
Drivers of mean changes in tide gauges have been discussed in Wang et al. (2021), where the authors identified stereodynamic sea level changes (i.e., changes driven by currents, temperature, and salinity) as the main contributors of the observed trends.Large trends in the US east coast mainly come from the combined effect of sterodynamic and GIA contributions.Downward sea level trends are driven mainly by GIA (e.g., Baltic Sea [Weisse et al., 2021]), terrestrial water storage, or mass loss of land ice (e.g., Alaska coastline).Statistically significant changes are marked with filled "circles," whereas insignificant changes are marked with "X"s.Statistical significance is computed using the FDR test with threshold ϕ = 0:05 .We further analyze tide gauges with (a) more than 80 years of data and (b) less than 20% of missing values, with results presented in Figure 4 and Table 1.The record spanned in such locations is reported in Appendix B. The significance test adopted is FDR.Also in this case, we observe significant changes in the distributional mean but not in higher order moments.As an exception, two records from Panama (i.e., Cristobal and Balboa) show a significant change in variance.Such changes can be partially explained by very large anomalous El Niño and La Niña events since 1980, as shown in Appendix E.
We conclude that, independently of the period analyzed, changes in tide gauge measured coastal sea level distributions can be characterized by a shift in the mean with no statistically significant change in shape.

GFDL-CM4 climate model
We quantify changes in sea level distributions in the historical experiment with CM4, focusing on the dynamic sea level plus inverse barometer (i.e., η dyn þ η ib ) during the period 1970-2014.Results are shown in Figure 5 and significant changes are reported in Table 2.In this period, roughly 43% of the coastal area experiences significant changes in the mean sea level.Among such changes, positive sea level rise is detected mainly along the US east coast, North and East Africa, as well as Europe and Oceania.Negative trends are simulated along coastlines in East Asia, western North America, and almost the whole South American coast.Importantly, GFDL-CM4 does not exhibit changes in the shapes of distributions in the historical period, which agrees with our tide gauge analysis (Table 2).
We next explore changes in statistics in the last 100 years of the 1pctCO2 experiment with results given in Figure 6 and Table 2. To investigate the main physical drivers of distributional shifts, we first analyze the dynamic sea level η dyn (Figure 6, left column) and then consider the effect of the inverse barometer, that is, η = η dyn þ η ib (Figure 6, right column).We observe that in both cases a large number of coastal locations experience a significant change in the mean of the distribution (Figure 6a,b).The fraction of significant changes is slightly lower when the inverse barometer effect is included ($ 67% vs $79%) but still much larger than the 43% found in the historical experiment.One difference with the historical experiment is the emergence of mean sea level changes in the South and South-East part of Africa, Indonesia, and India.Additionally, many negative trends in mean sea level found in the historical experiment switch sign, especially in the North-West America, East Asia (i.e., Sea of Japan, East and South China sea) and East Africa.
A major difference with both the observational dataset (Section 4.1) and the CM4 historical experiment is the emergence of changes in higher order moments in the 1pctCO2 experiment.This emergence shows that changes in shapes of daily sea level distributions are indeed possible, at least in simulations as CO 2 increases.We detect coherent positive significant changes in variance along the Japan coast (Figure 6c,d).In addition, when also the inverse barometer is included, we see the emergence of positive changes in variance in part of Indonesia and in North-and South-East Africa and East India (Figure 6d).Adding the inverse barometer component always implies a larger fraction of significant changes in second, third, and fourth moments.This result holds for shifts in variance and it is much more evident in the case of third and fourth moments as shown in Table 2. Spatially coherent changes in skewness include positive trends around North Indonesia and in the Philippines and Northern Europe (Figure 6f).Interestingly, we also detect many significant negative changes in skewness, mainly in Japan coasts, North East US, virtually all coasts in the Mediterranean Sea and North East Africa (Figure 6f).
Figure 5. Projection onto mean, variance, skewness and kurtosis functions for the GFDL-CM4 historical experiment.We consider the dynamic sea level and inverse barometer (i.e., η dyn þ η ib ) contributions in the period 1970-2014.Statistical significance is computed using the FDR test with threshold ϕ = 0:05, respectively, for the mean, variance, skewness, and kurtosis slopes.The global thermosteric contribution is not included in the analysis.Only statistical significant slopes are reported.Future work will further address possible causes of such higher order changes.Hereafter, we focus mainly on drivers of shifts in kurtosis and on large, spatially coherent changes in the Mediterranean sea and in Northern Europe.
Shifts in the kurtosis functions are negligible in the dynamic sea level contributions and detected only when the inverse barometer is included (Table 2).The largest, spatially coherent domain with changes in kurtosis is found in the Mediterranean, as shown in Figure 6h.This area is marked by significant (negative) changes in both skewness and kurtosis functions.To further explore such distributional shifts, we considered the time series with largest changes in kurtosis in the Mediterranean and analyzed it separately.Results are shown in Appendix G.The histograms for the first and last 40 years of data (see Figure G1c,d) show a decrease in skewness from À0:06 to À0:37 and an excess kurtosis from 0.83 to 0.03 (closer to the Gaussian case of 0).
Such changes in skewness and kurtosis in the Mediterranean come solely from changes in SLP anomalies (see Figure 6g,h).This result indicates a drop in frequency of (large) negative SLP anomalies (see Eq. 1) and points to a large decrease in frequency of hurricanes in the Mediterranean, the so-called Figure 6.Projection onto mean, variance, skewness, and kurtosis functions for the GFDL-CM4 experiment with 1% CO 2 yearly increase for 100 years.(a,c,e,g) Dynamic sea level only (i.e., η dyn ).(b,d,f, h) Dynamic sea level and inverse barometer contributions (i.e., η dyn þ η ib ).Statistical significance is computed using the FDR test with threshold ϕ = 0:05 respectively for the mean, variance, skewness, and kurtosis slopes.The global thermosteric contribution is not included in the analysis.Only statistical significant slopes are reported.
Changes in kurtosis are present in regions outside the Mediterranean sea.The majority of such changes are (a) found to be negative and (b) coming only from the inverse barometer effect (Figure 6h).This possibly points out to changes in the statistics of intense convective systems.This seems to be large in agreement with Priestley and Catto (2022) where the authors quantified a robust decrease in cyclone numbers, independent of the season, in CMIP6 models projections.On the other hand, the strongest cyclones are projected to have higher intensities (mean SLP and vorticity) and larger tropospheric wind speed (with changes dependent on different seasons and hemispheres) (Priestley and Catto, 2022).
A large number of coastal location experience positive changes in skewness in Northern Europe (North sea and Baltic sea).Such changes are already present in the dynamic sea level field, η dyn , and show no qualitative changes when the inverse barometer is included.Positive changes in skewness in the dynamic sea level field possibly indicate a shift to larger frequency of storm surges.This is in agreement with Gaslikova et al. (2013), where the authors studied changes in North sea storm surges in future climate scenarios (i.e., 1961-2100 period) by comparing changes in the 99th percentiles of water levels in 30-year windows.Gaslikova et al. (2013) used a hydrodynamical model and quantified a (small) increase in frequency of such extreme events, consistent with an increase in frequency of intense westerly winds.An increase in frequency of positive large sea level rise anomalies (positive slopes in skewness) in that region is then consistent with such increase in wind forcing.Importantly, such anthropogenic signals are superimposed on large, decadal oscillations leading to uncertainties coming from the system's internal variability.
Spatially coherent changes in extreme sea levels over the Mediterranean sea and Northern Europe can be in part linked to projected changes in atmospheric storm track activity and cyclone intensities over Europe as shown by Pinto et al. (2007).The authors analyzed ensembles of climate change projections and identified particularly strong reductions of cyclone intensities in subtropical areas such as in the Mediterranean sea and a large increase in extreme surface winds in Great Britain, North, and Baltic seas.Importantly, the role of internal variability was shown to be important and large differences were identified among ensemble members (Pinto et al., 2007).Such changes were found to mostly hold in the new generation of climate models, and Priestley and Catto (2022) quantified a decrease in cyclone activity over the Mediterranean in CMIP6 future projections, together with changes in the North Atlantic storm track.

Conclusions
An in-depth assessment of changes in shapes of sea level distributions in observations and in climate simulations has not been addressed in the current literature.In fact, a large focus in the last Intergovernmental Panel on Climate Change report has been on changes in the mean of the distributions (Fox-Kemper et al., 2021).Furthermore, changes in sea level extremes through EVT are often quantified solely as a function of changes in the distributional mean (Tebaldi et al., 2021).This motivated us to address the following questions regarding changes in sea level probability distributions: • Are changes in higher order moments in observations (a) significant but small compared to the shift in the mean or (b) consistent with the climate system's internal variability?• How do shapes of sea level distribution evolve under CO 2 forcing at centennial time scales, at least in a climate model framework?Which sea level component (i.e., sterodynamic and inverse barometer effects) is responsible for such changes?
Our work addressed and answered these questions from a novel statistical point of view by exploring changes in sea level distributions in tide gauges and in the GFDL-CM4 model.
To achieve these goals, we introduced a novel and general statistical framework to study changes in both quantiles and moments of a distribution and quantify their significance under the system's internal variability.Importantly, shifts in moments are captured by suitable orthogonal functions, therefore capturing independent sources of distributional changes.The framework is inspired by the methodology proposed in McKinnon et al. (2016) but differs in terms of polynomials and statistical tests.Here, the choice of polynomials follows from the rigorous theoretical work of Cornish and Fisher (1937) and Fisher and Cornish (1960).Their theory allows us to derive a precise formalism to investigate changes in both quantiles and moments.Additionally, the recent work of Chernozhukov et al. (2022) allows for very fast computation of quantile regression, thus making the methodology feasible even when dealing with thousands of time series each with tens of thousands of time steps, as commonly found with climate model applications.In the case of a drifting Gaussian distribution, the first and second polynomials quantify the exact link between changes in moments and quantiles.The third and fourth polynomials adjust for non-Gaussian behavior.Future work may consider the Cornish-Fisher expansion for asymptotic estimates of very large quantiles otherwise difficult (or impossible) to do from data alone, therefore providing a useful alternative to EVT.
The statistical significance test adopts the FDR proposed by Benjamini and Hochberg (1995), therefore controlling against multiple testing.The bootstrapping methodology is robust under the sample size B = 1000.In general, even for sample sizes as small as B = 100 we can still obtain meaningful results, as shown for one time series in Figure F2.However, as in any other resampling test, a larger number of samples (i.e., B = 5000) would probably be preferred and here avoided because of long run time (same as in McKinnon et al., 2016).The methodology is linear but still relevant in the case of weak nonlinearities, with an example shown for the Beta distribution in Section 3.4.Strongly nonlinear trends in the mean of the distribution will not be detected by this linear framework, but such nonlinearities would result in significant changes in higher order moments.The underlying physical mechanisms leading to changes in sea level distributions can be nonlinear, such as for a weakening Atlantic meridional overturning circulation (AMOC) commonly found in climate model projections (Weijer et al., 2020).For example, in the GFDL-CM4 climate model simulation considered here, the AMOC strength weakened by roughly a factor of two after 100 years of enhanced CO2 forcing (see Figure 13b of Yin et al., 2020).Even so, statistical properties of the sea level distribution retained a close to linear behavior, thus allowing for the methodology developed in this paper to remain useful.
We applied the above statistical framework to coastal daily sea level measured by tide gauges.Nearly every record shows a significant change in the mean of the distribution for the period 1970-2017.In contrast, we detect no significant change in higher order moments for the same period, with this conclusion robust to sampling only tide gauges with more than 80 years of data.We conclude that changes in coastal sea level in the historical period exhibited just a shift in the mean, with no significant change in the shape of the probability distribution.Likewise, no changes in higher order statistical moments are found in the GFDL-CM4 historical simulation.Hence, changes in the simulated coastal sea level arise mainly by changes in the mean, which agrees with the tide gauges.We note that multidecadal oscillations could in principle impact statistical attribution in some locations when focusing on "short" datasets.However, the same conclusions are obtained for tide gauges with 47 years (period 1970-2017) and more than 80 years of data.Conclusions on observations are therefore robust on the chosen period.
We then considered a CM4 experiment with 1%/year CO 2 forcing, thus allowing us to quantify changes in sea level in a warming (modeled) world.In this experiment, we find (a) a large increase in significant changes in the mean of the distribution compared to the historical run and (b) the emergence of changes in higher order moments.
Interestingly, changes in the second and third moments are already present in the dynamic-sea-levelonly analysis.These changes are therefore driven solely by the evolution of ocean circulation, caused by changes in water column mass and local steric effects.Such changes get amplified when SLP fluctuations are included, accounting for the inverse barometer effect (Ponte, 2006).Shifts in kurtosis are for the most part negative and possibly consistent with results of Priestley and Catto (2022) where the authors observed robust decrease in cyclone numbers, independent of the season, in CMIP6 models projections.One striking difference with the dynamic-sea-level-only analysis is the emergence of large changes in skewness and kurtosis along the Mediterranean coast, possibly pointing to drops in the frequency of medicanes, as shown in González-Alemán et al. (2019).A decrease in frequency of medicanes has been shown to hold on average for different regional models, but some model runs show no significant changes (Romera et al., 2017).Larger ensembles and different scenarios are then needed to better quantify the impact of statistical changes in medicanes on coastal sea level.Finally, a large increase in skewness values in the North and Baltic seas is present in the dynamic-sea-level-only analysis and shows no (qualitative) differences with the inclusion of inverse barometer.This is consistent with an increase in frequency of intense westerly winds in that region as shown by Pinto et al. (2007) and Gaslikova et al. (2013).
We note that the GFDL-CM4 does not simulate category 4 and 5 tropical cyclones (see Yin et al., 2020).Hence, shifts in higher order moments of the distributions are likely underestimated in the simulation.Such changes in cyclones frequency should be further explored using different models and scenarios.Furthermore, such information should be taken into account in the context of extreme value attribution where extreme sea level is often modeled solely as a function of the distributional mean.
A limitation of this work is that we focused only on one climate model and we did not consider ensembles of simulations.First, the climate is a chaotic system and ensemble simulations are often needed to explore the system's variability.Second, climate models are affected by structural model errors and a collection of models is required for more confident climate projections (Smith, 2002;Frigg et al., 2014).The emergence of significant changes in higher order moments in the model analyzed here should be seen as a proof of concept rather than a quantitative prediction about the real world.However, this result clearly suggests that quantifying sea level changes solely as a function of the mean of the distribution can be too simplistic.Future work should focus on different models, ensemble runs, and scenarios in order to quantify possible uncertainties coming from different model structures and internal variability.
Results of this study could guide new analyses focused on regions experiencing changes in higher order moments.An example would be to make use of higher resolution regional models to further study dynamical processes in those locations.Additionally, future studies could focus on changes in high (i.e., 0.95) and small (i.e., 0.05) quantiles only in such locations across different models and scenarios.Finally, future work will explore changes in the probability distribution of relevant climate variables other than sea level, from temperature to precipitation, therefore contributing to a better understanding of climate variability and its linkages with natural and anthropogenic forcing.

Figure 2 .
Figure 2. Testing the methodology on time series t 1 ,s 1 ð Þ, t 2 ,s 2 ð Þ,… f g sampled from time-dependent Gaussian and Beta distributions P s,t ð Þ.(a) Time series sampled from a time-dependent Gaussian distribution.(b) Linear quantile slopes for the time series shown in panel (a).The slopes β 1 q pÀ Á are computed for p ∈ 0:05,0:95 ½ every dp = 0:05 and indicated as blue dots.The black dashed line indicates the projection onto polynomials as defined in Eq. (7).Green "check" marks indicate statistically significant (95% confidence level) changes in moments; red "crosses" indicate nonsignificant changes.(c) Time series sampled from a time-dependent Beta distribution.(d) Same as panel (b) but for the case of the Beta distribution.In both cases, the method identifies the statistical moments driving changes in the distributions.

Figure 3 .
Figure 3. Projection onto mean, variance, skewness, and kurtosis polynomials for the period 1970-2017.Statistically significant changes are marked with filled "circles," whereas insignificant changes are marked with "X"s.Statistical significance is computed using the FDR test with threshold ϕ = 0:05 .

Figure 4 .
Figure 4. Projection onto mean, variance, skewness, and kurtosis functions.All tide gauges considered have a record longer than 80 years and less than 20% of missing data.Statistically significant changes are marked with filled "circles," whereas insignificant changes are marked with "X"s.Statistical significance is computed using the FDR test with threshold ϕ = 0:05 for the mean, variance, skewness, and kurtosis slopes.The starting and ending date for each tide gauge is shown in Appendix B.

Table 1 .
Percentage of significant coefficients for the FDR test.

Table 2 .
Percentage of significant coefficients for GFDL-CM4 historical and 1pctCO2 experiments.