Dempster (1958, 1960) proposed a non-exact two-sample significance test for use when the dimension of the data exceeds the degrees of freedom, raising the question of what statisticians should do when traditional multivariate theory no longer applies because the dimension is too large. Later, Bai and Saranadasa (1996) found that even when the traditional approaches remain applicable, they are much less powerful than the non-exact test when the dimension is large. This raised a second question: how can classical multivariate statistical procedures be adapted and improved when the data dimension is large? These problems have attracted considerable attention since the middle of the first decade of this century, and efforts towards solving them have followed two directions. The first is to devise special statistical procedures for specific large-dimensional problems in which traditional multivariate procedures are inapplicable or perform poorly; the family of non-exact tests follows this approach. The second direction, following the work of Bai et al. (2009a), is to make systematic corrections to the classical multivariate statistical procedures so that the effect of large dimension is overcome. This goal is achieved by employing new and powerful asymptotic tools borrowed from the theory of random matrices, such as the central limit theorems in Bai and Silverstein (2004) and Zheng (2012).
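A minimal numerical sketch may make the dimensionality issue concrete. The code below (illustrative only; the statistic shown is a simplified Dempster-style form based on the squared distance of the sample means, not the exact test of Dempster (1958) or of Bai and Saranadasa (1996), and the sample sizes are arbitrary) shows that the pooled sample covariance matrix is singular whenever the dimension exceeds the degrees of freedom, so Hotelling's classical T² statistic cannot even be computed, while a non-exact statistic that avoids inverting the covariance matrix remains available:

```python
import numpy as np

rng = np.random.default_rng(0)
p, n1, n2 = 100, 20, 20  # dimension p exceeds the degrees of freedom n1 + n2 - 2

X = rng.standard_normal((n1, p))  # sample 1
Y = rng.standard_normal((n2, p))  # sample 2

# Hotelling's T^2 requires the inverse of the pooled covariance matrix S.
# S has rank at most n1 + n2 - 2 = 38 < p = 100, so it is singular here.
S = ((n1 - 1) * np.cov(X, rowvar=False)
     + (n2 - 1) * np.cov(Y, rowvar=False)) / (n1 + n2 - 2)
print(np.linalg.matrix_rank(S) < p)  # True: S is not invertible

# A Dempster-style non-exact statistic sidesteps the inversion by using
# the squared Euclidean distance between the sample mean vectors instead.
diff = X.mean(axis=0) - Y.mean(axis=0)
T_nonexact = (n1 * n2 / (n1 + n2)) * diff @ diff
```

The point of the sketch is only the rank deficiency: with p larger than the combined degrees of freedom, any procedure built on S⁻¹ is inapplicable, which is precisely the situation Dempster's proposal addresses.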
Recently, research along these two directions has become very active in response to an increasingly important need for analysis of massive and large-dimensional data. Indeed, such “big data” are nowadays routinely collected owing to rapid advances in computer-based or web-based commerce and data-collection technology.
To meet this need, this monograph collects existing results along the second of these directions. Chapters 2 and 3 present in detail the core fundamental results from random matrix theory on sample covariance matrices and random Fisher matrices. Chapters 4–12 collect large-dimensional statistical problems in which the classical large-sample methods fail and the new asymptotic methods, based on the fundamental results of the preceding chapters, provide a valuable remedy.
This 1996 book is a reliable account of the statistical framework for pattern recognition and machine learning. With unparalleled coverage and a wealth of case studies, this book gives valuable insight into both the theory and the enormously diverse applications (which can be found in remote sensing, astrophysics, engineering and medicine, for example). So that readers can develop their skills and understanding, many of the real data sets used in the book are available from the author's website: www.stats.ox.ac.uk/~ripley/PRbook/. For the same reason, many examples are included to illustrate real problems in pattern recognition. Unifying principles are highlighted, and the author gives an overview of the state of the subject, making the book valuable to experienced researchers in statistics, machine learning/artificial intelligence and engineering. The clear writing style means that the book is also a superb introduction for non-specialists.
The problem of inducing, learning or inferring grammars has been studied for decades, but only in recent years has grammatical inference emerged as an independent field with connections to many scientific disciplines, including bio-informatics, computational linguistics and pattern recognition. This book meets the need for a comprehensive and unified summary of the basic techniques and results, suitable for researchers working in these various areas. In Part I, the objects of use for grammatical inference are studied in detail: strings and their topology, automata and grammars, whether probabilistic or not. Part II carefully explores the main questions in the field: What does learning mean? How can we associate complexity theory with learning? In Part III the author describes a number of techniques and algorithms that allow us to learn from text, from an informant, or through interaction with the environment. These concern automata, grammars, rewriting systems, pattern languages or transducers.
Independent Component Analysis (ICA) has recently become an important tool for modelling and understanding empirical datasets. It is a method of separating out independent sources from linearly mixed data, and belongs to the class of general linear models. ICA provides a better decomposition than other well-known models such as principal component analysis. This self-contained book contains a structured series of edited papers by leading researchers in the field, including an extensive introduction to ICA. The major theoretical bases are reviewed from a modern perspective, current developments are surveyed and many case studies of applications are described in detail. The latter include biomedical examples, signal and image denoising and mobile communications. ICA is discussed in the framework of general linear models, but also in comparison with other paradigms such as neural network and graphical modelling methods. The book is ideal for researchers and graduate students in the field.