The correlation coefficient measures the linear relation between scalar X and scalar Y. How can the linear relation between vector X and vector Y be measured? Canonical Correlation Analysis (CCA) provides a way. CCA finds a linear combination of X, and a (separate) linear combination of Y, that maximizes the correlation. The resulting maximized correlation is called a canonical correlation. More generally, CCA decomposes two sets of variables into a sequence of component pairs, ordered such that the first pair has maximum correlation, the second has maximum correlation subject to being uncorrelated with the first, and so on. The entire decomposition can be derived from a Singular Value Decomposition of a suitable matrix. If the dimension of the X and Y vectors is too large, overfitting becomes a problem. In this case, CCA often is computed using a few principal components of X and Y. The criterion for selecting the number of principal components is not standard. The Mutual Information Criterion (MIC) introduced in Chapter 14 is used in this chapter.
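The SVD-based decomposition described above can be sketched in a few lines of NumPy. This is a minimal illustration, not code from the book; the synthetic data and variable names are assumptions. The canonical correlations emerge as the singular values of the cross-covariance matrix after both vectors are whitened:

```python
import numpy as np

rng = np.random.default_rng(0)
n = 500
# synthetic data: Y shares one linearly correlated component with X
X = rng.standard_normal((n, 3))
Y = 0.8 * X[:, [0]] + 0.6 * rng.standard_normal((n, 2))

# center both data sets
Xc = X - X.mean(axis=0)
Yc = Y - Y.mean(axis=0)

# sample covariance blocks
Sxx = Xc.T @ Xc / (n - 1)
Syy = Yc.T @ Yc / (n - 1)
Sxy = Xc.T @ Yc / (n - 1)

def inv_sqrt(S):
    """Inverse symmetric square root of a positive definite matrix."""
    w, V = np.linalg.eigh(S)
    return V @ np.diag(1.0 / np.sqrt(w)) @ V.T

# SVD of the whitened cross-covariance: singular values are the
# canonical correlations, sorted in decreasing order
M = inv_sqrt(Sxx) @ Sxy @ inv_sqrt(Syy)
U, s, Vt = np.linalg.svd(M)
print(s)  # first value large (shared component), second near zero
```

In practice, as the abstract notes, X and Y would first be truncated to a few principal components to avoid overfitting before forming these covariance matrices.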
The previous chapter discussed data assimilation for the case in which the variables have known Gaussian distributions. However, in atmospheric and oceanic data assimilation, the distributions are neither Gaussian nor known, and the large number of state variables creates numerical challenges. This chapter discusses a class of algorithms, called Ensemble Square Root Filters, for performing data assimilation with high-dimensional, nonlinear systems. The basic idea is to use a collection of forecasts (called an ensemble) to estimate the statistics of the background distribution. In addition, observational information is incorporated by adjusting individual ensemble members (i.e., forecasts) rather than computing an entire distribution. This chapter discusses three standard filters: the Ensemble Transform Kalman Filter (ETKF), the Ensemble Square Root Filter (EnSRF), and the Ensemble Adjustment Kalman Filter (EAKF). However, ensemble filters often experience filter divergence, in which the analysis no longer tracks the truth. This chapter discusses standard approaches to mitigating filter divergence, namely covariance inflation and covariance localization.
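The ensemble idea, including the multiplicative covariance inflation mentioned above, can be illustrated with a minimal analysis step. The sketch below uses a stochastic (perturbed-observation) update rather than one of the deterministic square root filters the chapter covers, since it requires the least machinery; all numbers and names are illustrative assumptions:

```python
import numpy as np

rng = np.random.default_rng(1)
m, n_ens = 3, 200                 # state dimension, ensemble size
H = np.array([[1.0, 0.0, 0.0]])   # observe the first state variable only
R = np.array([[0.5]])             # observation error variance
y_obs = np.array([2.0])           # the observation
infl = 1.05                       # multiplicative inflation factor

# background ensemble (columns are members)
Xb = rng.standard_normal((m, n_ens))
xb_mean = Xb.mean(axis=1, keepdims=True)
# inflate perturbations about the mean to counter filter divergence
Xb = xb_mean + infl * (Xb - xb_mean)

# background covariance estimated from the ensemble
dX = Xb - Xb.mean(axis=1, keepdims=True)
Pb = dX @ dX.T / (n_ens - 1)

# Kalman gain built from the ensemble covariance
K = Pb @ H.T @ np.linalg.inv(H @ Pb @ H.T + R)

# each member assimilates a randomly perturbed copy of the observation
Y = y_obs[:, None] + rng.normal(0.0, np.sqrt(R[0, 0]), size=(1, n_ens))
Xa = Xb + K @ (Y - H @ Xb)

print(Xa.mean(axis=1))  # analysis mean is pulled toward the observation
```

The deterministic filters (ETKF, EnSRF, EAKF) replace the perturbed-observation step with an explicit transform of the ensemble perturbations, but the use of the ensemble to estimate the background covariance is the same.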
Some variables can be modeled by a linear combination of other random variables, plus random noise. Such models are used to quantify the relation between variables, to make predictions, and to test hypotheses about the relation between variables. After identifying the variables to include in a model, the next step is to estimate the coefficients that multiply them, called the regression parameters. This chapter discusses the least squares method for estimating regression parameters. The least squares method estimates the parameters by minimizing the sum of squared differences between the fitted model and the data. This chapter also describes measures for the goodness of fit and an illuminating geometric interpretation of least squares fitting. The least squares method is illustrated on various routine calculations in weather and climate analysis (e.g., fitting a trend). Procedures for testing hypotheses about linear models are discussed in the next chapter.
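The trend-fitting example mentioned above can be sketched as follows. This is a minimal illustration under assumed synthetic data, not code from the book; it estimates an intercept and slope by least squares and reports the coefficient of determination as a goodness-of-fit measure:

```python
import numpy as np

rng = np.random.default_rng(2)
t = np.arange(50, dtype=float)               # time index (e.g., years)
y = 0.3 * t + 1.0 + rng.standard_normal(50)  # synthetic series with a trend

# design matrix: a column of ones (intercept) and the time index (trend)
A = np.column_stack([np.ones_like(t), t])
# least squares: minimize the sum of squared differences ||y - A b||^2
b, res, rank, sv = np.linalg.lstsq(A, y, rcond=None)

y_hat = A @ b
# goodness of fit: coefficient of determination R^2
r2 = 1.0 - np.sum((y - y_hat) ** 2) / np.sum((y - y.mean()) ** 2)
print(b, r2)  # b[1] estimates the trend; r2 near 1 means a close fit
```

Geometrically, `y_hat` is the orthogonal projection of the data vector onto the column space of the design matrix, which is the interpretation the chapter develops.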
Multivariate linear regression is a method for modeling linear relations between two random vectors, say X and Y. Common reasons for using multivariate regression include (1) predicting Y given X, (2) testing hypotheses about the relation between X and Y, and (3) projecting Y onto prescribed time series or spatial patterns. Special cases of multivariate regression models include Linear Inverse Models (LIMs) and Vector Autoregressive Models. Multivariate regression also is fundamental to other statistical techniques, including canonical correlation analysis, discriminant analysis, and predictable component analysis. This chapter introduces multivariate linear regression and discusses estimation, measures of association, hypothesis testing, and model selection. In climate studies, model selection often involves selecting Y as well as X. For instance, Y may be a set of principal components that need to be chosen, which is not a standard selection problem. This chapter introduces a criterion, called the Mutual Information Criterion (MIC), for selecting X and Y simultaneously.
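The estimation step can be sketched briefly. In the sketch below (synthetic data and names are assumptions, not from the book), the full coefficient matrix relating vector X to vector Y is obtained by solving the same normal equations as in univariate regression, for all response components at once:

```python
import numpy as np

rng = np.random.default_rng(3)
n, p, q = 300, 2, 3
X = rng.standard_normal((n, p))
B_true = np.array([[1.0,  0.0, 2.0],
                   [0.0, -1.0, 1.0]])
# Y = X B + noise: each column of Y is a linear combination of X's columns
Y = X @ B_true + 0.1 * rng.standard_normal((n, q))

# least squares estimate of the coefficient matrix:
# B_hat = (X'X)^{-1} X'Y, solving all q responses simultaneously
B_hat = np.linalg.solve(X.T @ X, X.T @ Y)
print(B_hat)  # close to B_true for this low-noise synthetic example
```

Each column of `B_hat` is just the ordinary least squares solution for the corresponding component of Y; the multivariate structure matters for the measures of association and hypothesis tests the chapter develops.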
The method of least squares will fit any model to a data set, but is the resulting model "good"? One criterion is that the model should fit the data significantly better than a simpler model with fewer predictors. After all, if the fit is not significantly better, then the model with fewer predictors is almost as good. For linear models, this approach is equivalent to testing whether selected regression parameters vanish. This chapter discusses procedures for testing such hypotheses. In interpreting such hypotheses, it is important to recognize that a regression parameter for a given predictor quantifies the expected rate of change of the predictand while holding the other predictors constant. Equivalently, the regression parameter quantifies the dependence between two variables after controlling for, or regressing out, the other predictors. These concepts are important for identifying a confounding variable: a third variable that influences two variables and thereby produces a correlation between them. This chapter also discusses how detection and attribution of climate change can be framed in a regression framework.
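The comparison of a full model against a simpler nested model can be sketched with the usual F statistic. This is a minimal illustration on assumed synthetic data (the variable names and setup are not from the book): the full model includes a predictor `x2` that in truth contributes nothing, and the test asks whether dropping it significantly worsens the fit:

```python
import numpy as np
from scipy.stats import f as f_dist

rng = np.random.default_rng(4)
n = 100
x1 = rng.standard_normal(n)
x2 = rng.standard_normal(n)
y = 2.0 * x1 + rng.standard_normal(n)   # y depends on x1 only

def rss(A, y):
    """Residual sum of squares from a least squares fit."""
    b, *_ = np.linalg.lstsq(A, y, rcond=None)
    return np.sum((y - A @ b) ** 2)

ones = np.ones(n)
full = np.column_stack([ones, x1, x2])   # full model
reduced = np.column_stack([ones, x1])    # x2's parameter set to zero

rss_full, rss_red = rss(full, y), rss(reduced, y)
k = 1                        # number of parameters being tested
dof = n - full.shape[1]      # residual degrees of freedom, full model
F = ((rss_red - rss_full) / k) / (rss_full / dof)
p_value = f_dist.sf(F, k, dof)
print(F, p_value)  # a large p-value: no evidence that x2 improves the fit
```

The reduced model's residual sum of squares can never be smaller than the full model's; the F test asks whether the improvement from the extra predictor exceeds what chance alone would produce.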
Large data sets are difficult to grasp. To make progress, we often seek a few quantities that capture as much of the information in the data as possible. In this chapter, we discuss a procedure called Principal Component Analysis (PCA), also called Empirical Orthogonal Function (EOF) analysis, which finds the components that minimize the sum of squared differences between the components and the data. The components are ordered such that the first approximates the data the best (in a least squares sense), the second approximates the data the best among all components orthogonal to the first, and so on. In typical climate applications, a principal component consists of two parts: (1) a fixed spatial structure, called an Empirical Orthogonal Function (EOF), and (2) its time-dependent amplitude, called a PC time series. The EOFs are orthogonal and the PC time series are uncorrelated. Principal components often are used as input to other analyses, such as linear regression, canonical correlation analysis, predictable components analysis, or discriminant analysis. The procedure for performing area-weighted PCA is discussed in detail in this chapter.
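The EOF/PC decomposition described above can be sketched directly from an SVD of the anomaly matrix. This is a minimal illustration on assumed synthetic data, not code from the book:

```python
import numpy as np

rng = np.random.default_rng(5)
n_time, n_space = 200, 10
data = rng.standard_normal((n_time, n_space))
data[:, 0] += 3.0 * rng.standard_normal(n_time)  # one high-variance direction

# anomalies: remove the time mean at each spatial point
# (for lat-lon grids, one would also multiply each column by
#  sqrt(cos(latitude)) to apply area weighting before the SVD)
anom = data - data.mean(axis=0)

# SVD: rows of Vt are the EOFs (spatial patterns);
# columns of U * s are the PC time series (time-dependent amplitudes)
U, s, Vt = np.linalg.svd(anom, full_matrices=False)
eofs = Vt
pcs = U * s
var_explained = s**2 / np.sum(s**2)
print(var_explained[:3])  # leading fraction of variance per component
```

Because U has orthonormal columns and Vt has orthonormal rows, the PC time series are mutually uncorrelated and the EOFs are orthogonal, matching the properties stated above.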
This chapter introduces the power spectrum. The power spectrum is the Fourier Transform of the autocovariance function, and the autocovariance function is the (inverse) Fourier Transform of the power spectrum. As such, the power spectrum and autocovariance function offer two complementary but mathematically equivalent descriptions of a stochastic process. The power spectrum quantifies how variance is distributed over frequencies and is useful for identifying periodic behavior in time series. The discrete Fourier transform of a time series can be summarized in a periodogram, which provides a starting point for estimating power spectra. Estimation of the power spectrum can be counterintuitive because the uncertainty in periodogram elements does not decrease with increasing sample size. To reduce uncertainty, periodogram estimates are averaged over a frequency interval called the bandwidth. Trends and discontinuities in time series can lead to similar low-frequency structure despite very different temporal characteristics. Spectral analysis provides a particularly insightful way to understand the behavior of linear filters.
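The periodogram and bandwidth averaging described above can be sketched as follows. This is a minimal illustration on assumed synthetic data (a sinusoid buried in white noise), not code from the book; the smoothing width is an arbitrary assumed choice:

```python
import numpy as np

rng = np.random.default_rng(6)
n = 1024
dt = 1.0
t = np.arange(n) * dt
# sinusoid at 0.1 cycles per time step buried in white noise
x = np.sin(2 * np.pi * 0.1 * t) + rng.standard_normal(n)

x = x - x.mean()
freqs = np.fft.rfftfreq(n, d=dt)
# periodogram: squared magnitude of the discrete Fourier transform
pgram = (np.abs(np.fft.rfft(x)) ** 2) / n

# raw periodogram values do not become less noisy with larger n, so
# average over a bandwidth of 9 adjacent frequencies (boxcar smoothing)
width = 9
kernel = np.ones(width) / width
smoothed = np.convolve(pgram, kernel, mode="same")

peak_freq = freqs[np.argmax(smoothed)]
print(peak_freq)  # close to the true frequency of 0.1
```

Averaging over the bandwidth trades frequency resolution for reduced variance in the spectral estimate, which is the central compromise the chapter examines.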
A comprehensive introduction to the most commonly used statistical methods relevant in atmospheric, oceanic and climate sciences. Each method is described step-by-step using plain language, and illustrated with concrete examples, with relevant statistical and scientific concepts explained as needed. Particular attention is paid to nuances and pitfalls, with sufficient detail to enable the reader to write relevant code. Topics covered include hypothesis testing, time series analysis, linear regression, data assimilation, extreme value analysis, Principal Component Analysis, Canonical Correlation Analysis, Predictable Component Analysis, and Covariance Discriminant Analysis. The specific statistical challenges that arise in climate applications are also discussed, including model selection problems associated with Canonical Correlation Analysis, Predictable Component Analysis, and Covariance Discriminant Analysis. Requiring no previous background in statistics, this is a highly accessible textbook and reference for students and early-career researchers in the climate sciences.