Search results for Statistical theory and methods

2 - Multiparameter Exponential Families
Bradley Efron, Stanford University, California
Book:

Exponential Families in Theory and Practice

Published online:

25 November 2022

Print publication:

15 December 2022, pp 48-87
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation

Contents
Bradley Efron, Stanford University, California
Book:

Exponential Families in Theory and Practice

Published online:

25 November 2022

Print publication:

15 December 2022, pp v-vi
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation

1 - One-parameter Exponential Families
Bradley Efron, Stanford University, California
Book:

Exponential Families in Theory and Practice

Published online:

25 November 2022

Print publication:

15 December 2022, pp 1-47
- Chapter
- - You have access
- PDF
- Export citation

Exponential Families in Theory and Practice

Bradley Efron
Published online:

25 November 2022

Print publication:

15 December 2022
- Book
- - Get access
    
    Buy a print copy
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
During the past half-century, exponential families have attained a position at the center of parametric statistical inference. Theoretical advances have been matched, and more than matched, in the world of applications, where logistic regression by itself has become the go-to methodology in medical statistics, computer-based prediction algorithms, and the social sciences. This book is based on a one-semester graduate course for first year Ph.D. and advanced master's students. After presenting the basic structure of univariate and multivariate exponential families, their application to generalized linear models including logistic and Poisson regression is described in detail, emphasizing geometrical ideas, computational practice, and the analogy with ordinary linear regression. Connections are made with a variety of current statistical methodologies: missing data, survival analysis and proportional hazards, false discovery rates, bootstrapping, and empirical Bayes analysis. The book connects exponential family theory with its applications in a way that doesn't require advanced mathematical preparation.

Part I - Elements of Probability Theory
Ery Arias-Castro, University of California, San Diego
Book:

Principles of Statistical Analysis

Published online:

22 July 2022

Print publication:

25 August 2022, pp 1-2
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation

21 - Regression Analysis
from Part III - Elements of Statistical Inference
Ery Arias-Castro, University of California, San Diego
Book:

Principles of Statistical Analysis

Published online:

22 July 2022

Print publication:

25 August 2022, pp 329-355
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
Summary

Beyond quantifying the amount of association between two variables, as was the goal in a previous chapter, regression analysis aims at describing that association and/or at predicting one of the variables based on the other ones. Examples of applications where this is needed abound in engineering and a broad range of industries. For example, in the insurance industry, when pricing a policy, the predictor variable encapsulates the available information about what is being insured, and the response variable is a measure of risk that the insurance company would take if underwriting the policy. In this context, a procedure is solely evaluated based on its performance at predicting that risk, and can otherwise be very complicated and have no simple interpretation. The chapter covers both local methods such as kernel regression (e.g., local averaging) and empirical risk minimization over a parametric model (e.g., linear models fitted by least squares). Cross-validation is introduced as a method for estimating the prediction power of a certain regression or classification metod.

3 - Distributions on the Real Line
from Part I - Elements of Probability Theory
Ery Arias-Castro, University of California, San Diego
Book:

Principles of Statistical Analysis

Published online:

22 July 2022

Print publication:

25 August 2022, pp 34-40
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
Summary

Measurements are often numerical in nature, which naturally leads to distributions on the real line. We start our discussion of such distributions in the present chapter, and in the process introduce the concept of random variable, which is really a device to facilitate the writing of probability statements and the derivation of the corresponding computations. We introduce objects such as the distribution function, survival function, and quantile function, any of which characterizes in the underlying distribution.

Dedication
Ery Arias-Castro, University of California, San Diego
Book:

Principles of Statistical Analysis

Published online:

22 July 2022

Print publication:

25 August 2022, pp vii-vii
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation

6 - Multivariate Distributions
from Part I - Elements of Probability Theory
Ery Arias-Castro, University of California, San Diego
Book:

Principles of Statistical Analysis

Published online:

22 July 2022

Print publication:

25 August 2022, pp 68-77
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
Summary

Some experiments lead to considering not one, but several measurements. As before, each measurement is represented by a random variable, and these are stacked into a random vector. For example, in the context of an experiment that consists in flipping a coin multiple times, we defined in a previous chapter as many random variables, each indicating the result of one coin flip. These are then concatenated to form a random vector, compactly describing the outcome of the entire experiment. Concepts such as conditional probability and independence are introduced.

17 - Multiple Numerical Samples
from Part III - Elements of Statistical Inference
Ery Arias-Castro, University of California, San Diego
Book:

Principles of Statistical Analysis

Published online:

22 July 2022

Print publication:

25 August 2022, pp 271-288
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
Summary

We consider an experiment that yields, as data, a sample of independent and identically distributed (real-valued) random variables with a common distribution on the real line. The estimation of the underlying mean and median is discussed at length, and bootstrap confidence intervals are constructed. Tests comparing the underlying distribution to a given distribution (e.g., the standard normal distribution) or a family of distribution (e.g., the normal family of distributions) are introduced. Censoring, which is very common in some clinical trials, is briefly discuss.

10 - Sampling and Simulation
from Part II - Practical Considerations
Ery Arias-Castro, University of California, San Diego
Book:

Principles of Statistical Analysis

Published online:

22 July 2022

Print publication:

25 August 2022, pp 127-137
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
Summary

In this chapter we introduce some tools for sampling from a distribution. We also explain how to use computer simulations to approximate probabilities and, more generally, expectations, which can allow one to circumvent complicated mathematical derivations. The methods that are introduced include Monte Carlo sampling/integration, rejection sampling, and Markov Chain Monte Carlo sampling.

7 - Expectation and Concentration
from Part I - Elements of Probability Theory
Ery Arias-Castro, University of California, San Diego
Book:

Principles of Statistical Analysis

Published online:

22 July 2022

Print publication:

25 August 2022, pp 78-99
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
Summary

An expectation is simply a weighted mean, and means are at the core of Probability Theory and Statistics. In Statistics, in particular, such expectations are used to define parameters of interest. It turns out that an expectation can be approximated by an empirical average based on a sample from the distribution of interest, and the accuracy of this approximation can be quantified via what is referred to as concentration inequalities.

Part II - Practical Considerations
Ery Arias-Castro, University of California, San Diego
Book:

Principles of Statistical Analysis

Published online:

22 July 2022

Print publication:

25 August 2022, pp 125-126
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation

Contents
Ery Arias-Castro, University of California, San Diego
Book:

Principles of Statistical Analysis

Published online:

22 July 2022

Print publication:

25 August 2022, pp ix-xiii
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation

8 - Convergence of Random Variables
from Part I - Elements of Probability Theory
Ery Arias-Castro, University of California, San Diego
Book:

Principles of Statistical Analysis

Published online:

22 July 2022

Print publication:

25 August 2022, pp 100-112
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
Summary

An empirical average will converge, in some sense, to the corresponding expectation. This famous result, called the Law of Large Numbers, can be anticipated based on the concentration inequalities introduced in the previous chapter, but some appropriate notions of convergence for random variables need to be defined in order to make a rigorous statement. Beyond mere convergence, the fluctuations of an empirical average around the associated expectation can be characterized by the Central Limit Theorem, and are known to be Gaussian in some asymptotic sense. The chapter also discusses the limit of extremes such as the maximum of a sample.

9 - Stochastic Processes
from Part I - Elements of Probability Theory
Ery Arias-Castro, University of California, San Diego
Book:

Principles of Statistical Analysis

Published online:

22 July 2022

Print publication:

25 August 2022, pp 113-124
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
Summary

Stochastic processes model experiments whose outcomes are collections of variables organized in some fashion. We focus here on Markov processes, which include random walks (think of the fortune of a person gambling on black/red at the roulette over time) and branching processes (think of the behavior of a population of an asexual species where each individual gives birth to a number of otherwise identical offsprings according to a given probability distribution) .

4 - Discrete Distributions
from Part I - Elements of Probability Theory
Ery Arias-Castro, University of California, San Diego
Book:

Principles of Statistical Analysis

Published online:

22 July 2022

Print publication:

25 August 2022, pp 41-53
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
Summary

In this chapter we consider distributions on the real line that have a discrete support. It is indeed common to count certain occurrences in an experiment, and the corresponding counts are invariably integer-valued. In fact, all the major distributions of this type are supported on the (non-negative) integers. We introduce the main ones here.

19 - Correlation Analysis
from Part III - Elements of Statistical Inference
Ery Arias-Castro, University of California, San Diego
Book:

Principles of Statistical Analysis

Published online:

22 July 2022

Print publication:

25 August 2022, pp 299-308
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
Summary

We consider an experiment resulting in two paired numerical variables. The general goal addressed in this chapter is that of quantifying the strength of association between these two variables. By association we mean dependence. Contrary to the previous chapter, here the two variables can be measurements of completely different kinds (e.g., height and weight). Several measures of association are introduced, and used to test for independence.

5 - Continuous Distributions
from Part I - Elements of Probability Theory
Ery Arias-Castro, University of California, San Diego
Book:

Principles of Statistical Analysis

Published online:

22 July 2022

Print publication:

25 August 2022, pp 54-67
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
Summary

In some areas of mathematics, physics, and elsewhere, continuous objects and structures are often motivated, or even defined, as limits of discrete objects. For example, in mathematics, the real numbers are defined as the limit of sequences of rational numbers, and in physics, the laws of thermodynamics arise as the number of particles in a system tends to infinity (the so-called thermodynamic or macroscopic limit). Taking certain discrete distributions (discussed in the previous chapter) to their continuous limits, which is done by letting their support size increase to infinity in a controlled manner, gives rise to continuous distributions on the real line. We introduce and discuss such distributions in this chapter, including the normal (aka Gaussian) family of distributions, and in the process cover probability densities.

18 - Multiple Paired Numerical Samples
from Part III - Elements of Statistical Inference
Ery Arias-Castro, University of California, San Diego
Book:

Principles of Statistical Analysis

Published online:

22 July 2022

Print publication:

25 August 2022, pp 289-298
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
Summary

We consider in this chapter experiments where the variables of interest are paired. Importantly, we assume that these variables are directly comparable (in contrast with the following two chapters). Crossover trials are important examples of such experiments. The main question of interest here is that of exchangeability, which reduces to testing for symmetry when there are only two variables.

Statistical theory and methods

Refine search

Refine search

Actions for selected content:

2348 results in Statistical theory and methods

2 - Multiparameter Exponential Families

Contents

1 - One-parameter Exponential Families

Exponential Families in Theory and Practice

Part I - Elements of Probability Theory

21 - Regression Analysis

Summary

3 - Distributions on the Real Line

Summary

Dedication

6 - Multivariate Distributions

Summary

17 - Multiple Numerical Samples

Summary

10 - Sampling and Simulation

Summary

7 - Expectation and Concentration

Summary

Part II - Practical Considerations

Contents

8 - Convergence of Random Variables

Summary

9 - Stochastic Processes

Summary

4 - Discrete Distributions

Summary

19 - Correlation Analysis

Summary

5 - Continuous Distributions

Summary

18 - Multiple Paired Numerical Samples

Summary

Statistical theory and methods

Refine search

Refine search

Actions for selected content:

Save Search

2348 results in Statistical theory and methods

Exponential Families in Theory and Practice

Summary

Summary

Summary

Summary

Summary

Summary

Summary

Summary

Summary

Summary

Summary

Summary