Search results for Statistical theory and methods

Part 1A - Single-level regression
Andrew Gelman, Columbia University, New York, Jennifer Hill, Columbia University, New York
Book:

Data Analysis Using Regression and Multilevel/Hierarchical Models

Published online:

05 September 2012

Print publication:

18 December 2006, pp 29-30
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
Summary

We start with an overview of classical linear regression and generalized linear models, focusing on practical issues of fitting, understanding, and graphical display. We also use this as an opportunity to introduce the statistical package R.

10 - Causal inference using more advanced models
from Part 1B - Working with regression inferences
Andrew Gelman, Columbia University, New York, Jennifer Hill, Columbia University, New York
Book:

Data Analysis Using Regression and Multilevel/Hierarchical Models

Published online:

05 September 2012

Print publication:

18 December 2006, pp 199-234
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
Summary

Chapter 9 discussed situations in which it is dangerous to use a standard linear regression of outcome on predictors and an indicator variable for estimating causal effects: when there is imbalance or lack of complete overlap or when ignorability is in doubt. This chapter discusses these issues in more detail and provides potential solutions for each.
Imbalance and lack of complete overlap
In a study comparing two treatments (which we typically label “treatment” and “control”), causal inferences are cleanest if the units receiving the treatment are comparable to those receiving the control. Until Section 10.5, we shall restrict ourselves to ignorable models, which means that we only need to consider observed pre-treatment predictors when considering comparability.
For ignorable models, we consider two sorts of departures from comparability—imbalance and lack of complete overlap. Imbalance occurs if the distributions of relevant pre-treatment variables differ for the treatment and control groups. Lack of complete overlap occurs if there are regions in the space of relevant pre-treatment variables where there are treated units but no controls, or controls but no treated units.
Imbalance and lack of complete overlap are issues for causal inference largely because they force us to rely more heavily on model specification and less on direct support from the data.

16 - Multilevel modeling in Bugs and R: the basics
from Part 2B - Fitting multilevel models
Andrew Gelman, Columbia University, New York, Jennifer Hill, Columbia University, New York
Book:

Data Analysis Using Regression and Multilevel/Hierarchical Models

Published online:

05 September 2012

Print publication:

18 December 2006, pp 345-374
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
Summary

In this chapter we introduce the fitting of multilevel models in Bugs as run from R. Following a brief introduction to Bayesian inference in Section 16.2, we fit a varying-intercept multilevel regression, walking through each step of the model. The computations in this chapter parallel Chapter 12 on basic multilevel models. Chapter 17 presents computations for the more advanced linear and generalized linear models of Chapters 12–15.
Why you should learn Bugs
As illustrated in the preceding chapters, we can quickly and easily fit many multilevel linear and generalized linear models using the lmer() function in R. Functions such as lmer(), which use point estimates of variance parameters, are useful but can run into problems. When the number of groups is small or the multilevel model is complicated (with many varying intercepts, slopes, and non-nested components), there just might not be enough information to estimate variance parameters precisely. At that point, we can get more reasonable inferences using a Bayesian approach that averages over the uncertainty in all the parameters of the model.
We recommend the following strategy for multilevel modeling:
Start by fitting classical regressions using the lm() and glm() functions in R. Display and understand these fits as discussed in Part 1 of this book.
Set up multilevel models—that is, allow intercepts and slopes to vary, using non-nested groupings if appropriate—and fit using lmer(), displaying as discussed in most of the examples of Part 2A.
[…]

Contents
Andrew Gelman, Columbia University, New York, Jennifer Hill, Columbia University, New York
Book:

Data Analysis Using Regression and Multilevel/Hierarchical Models

Published online:

05 September 2012

Print publication:

18 December 2006, pp ix-xvi
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation

18 - Likelihood and Bayesian inference and computation
from Part 2B - Fitting multilevel models
Andrew Gelman, Columbia University, New York, Jennifer Hill, Columbia University, New York
Book:

Data Analysis Using Regression and Multilevel/Hierarchical Models

Published online:

05 September 2012

Print publication:

18 December 2006, pp 387-414
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
Summary

Most of this book concerns the interpretation of regression models, with the understanding that they can be fit to data fairly automatically using R and Bugs. However, it can be useful to understand some of the theory behind the model fitting, partly to connect to the usual presentation of these models in statistics and econometrics.
This chapter outlines some of the basic ideas of likelihood and Bayesian inference and computation, focusing on their application to multilevel regression. One point of this material is to connect multilevel modeling to classical regression; another is to give enough insight into the computation to allow you to understand some of the practical computational tips presented in the next chapter.
Least squares and maximum likelihood estimation
We first present the algebra for classical regression inference, which is then generalized when moving to multilevel modeling. We present the formulas here without derivation; see the references listed at the end of the chapter for more.
Least squares
The classical linear regression model is yi = Xiβ + ∊i, where y and ∊ are (column) vectors of length n, X is a n × k matrix, and β is a vector of length k. The vector β of coefficients is estimated so as to minimize the errors ∊i.

Appendixes
Andrew Gelman, Columbia University, New York, Jennifer Hill, Columbia University, New York
Book:

Data Analysis Using Regression and Multilevel/Hierarchical Models

Published online:

05 September 2012

Print publication:

18 December 2006, pp 545-546
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation

13 - Multilevel linear models: varying slopes, non-nested models, and other complexities
from Part 2A - Multilevel regression
Andrew Gelman, Columbia University, New York, Jennifer Hill, Columbia University, New York
Book:

Data Analysis Using Regression and Multilevel/Hierarchical Models

Published online:

05 September 2012

Print publication:

18 December 2006, pp 279-300
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation

14 - Multilevel logistic regression
from Part 2A - Multilevel regression
Andrew Gelman, Columbia University, New York, Jennifer Hill, Columbia University, New York
Book:

Data Analysis Using Regression and Multilevel/Hierarchical Models

Published online:

05 September 2012

Print publication:

18 December 2006, pp 301-324
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
Summary

Multilevel modeling is applied to logistic regression and other generalized linear models in the same way as with linear regression: the coefficients are grouped into batches and a probability distribution is assigned to each batch. Or, equivalently (as discussed in Section 12.5), error terms are added to the model corresponding to different sources of variation in the data. We shall discuss logistic regression in this chapter and other generalized linear models in the next.
State-level opinions from national polls
Dozens of national opinion polls are conducted by media organizations before every election, and it is desirable to estimate opinions at the levels of individual states as well as for the entire country. These polls are generally based on national randomdigit dialing with corrections for nonresponse based on demographic factors such as sex, ethnicity, age, and education.
Here we describe a model developed for estimating state-level opinions from national polls, while simultaneously correcting for nonresponse, for any survey response of interest. The procedure has two steps: first fitting the model and then applying the model to estimate opinions by state:
We fit a regression model for the individual response y given demographics and state. This model thus estimates an average response θl for each cross-classification l of demographics and state. In our example, we have sex (male or female), ethnicity (African American or other), age (4 categories), education (4 categories), and 51 states (including the District of Columbia); thus l = 1, …, L = 3264 categories.
[…]

20 - Sample size and power calculations
from Part 3 - From data collection to model understanding to model checking
Andrew Gelman, Columbia University, New York, Jennifer Hill, Columbia University, New York
Book:

Data Analysis Using Regression and Multilevel/Hierarchical Models

Published online:

05 September 2012

Print publication:

18 December 2006, pp 437-456
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
Summary

Choices in the design of data collection
Multilevel modeling is typically motivated by features in existing data or the object of study—for example, voters classified by demography and geography, students in schools, multiple measurements on individuals, and so on. Consider all the examples in Part 2 of this book. In some settings, however, multilevel data structures arise by choice from the data collection process. We briefly discuss some of these options here.
Unit sampling or cluster sampling
In a sample survey, data are collected on a set of units in order to learn about a larger population. In unit sampling, the units are selected directly from the population. In cluster sampling, the population is divided into clusters: first a sample of clusters is selected, then data are collected from each of the sampled clusters.
In one-stage cluster sampling, complete information is collected within each sampled cluster. For example, a set of classrooms is selected at random from a larger population, and then all the students within each sampled classroom are interviewed. In two-stage cluster sampling, a sample is performed within each sampled cluster. For example, a set of classrooms is selected, and then a random sample of ten students within each classroom is selected and interviewed. More complicated sampling designs are possible along these lines, including adaptive designs, stratified cluster sampling, sampling with probability proportional to size, and various combinations and elaborations of these.

A - Six quick tips to improve your regression modeling
from Appendixes
Andrew Gelman, Columbia University, New York, Jennifer Hill, Columbia University, New York
Book:

Data Analysis Using Regression and Multilevel/Hierarchical Models

Published online:

05 September 2012

Print publication:

18 December 2006, pp 547-550
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
Summary

Fit many models
Think of a series of models, starting with the too-simple and continuing through to the hopelessly messy. Generally it's a good idea to start simple. Or start complex if you'd like, but prepare to quickly drop things out and move to the simpler model to help understand what's going on. Working with simple models is not a research goal—in the problems we work on, we usually find complicated models more believable—but rather a technique to help understand the fitting process.
A corollary of this principle is the need to be able to fit models relatively quickly. Realistically, you don't know what model you want to be fitting, so it's rarely a good idea to run the computer overnight fitting a single model. At least, wait until you've developed some understanding by fitting many models.
Do a little work to make your computations faster and more reliable
This sounds like computational advice but is really about statistics: if you can fit models faster, you can fit more models and better understand both data and model. But getting the model to run faster often has some startup cost, either in data preparation or in model complexity.
Data subsetting
Related to the “multiple model” approach are simple approximations that speed the computations. Computers are getting faster and faster—but models are getting more and more complicated! And so these general tricks might remain important.

Part 1B - Working with regression inferences
Andrew Gelman, Columbia University, New York, Jennifer Hill, Columbia University, New York
Book:

Data Analysis Using Regression and Multilevel/Hierarchical Models

Published online:

05 September 2012

Print publication:

18 December 2006, pp 135-136
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
Summary

We now discuss how to go beyond simply looking at regression coefficients, first by using simulation to summarize and propagate inferential uncertainty, and then by considering how regression can be used for causal inference.

5 - Logistic regression
from Part 1A - Single-level regression
Andrew Gelman, Columbia University, New York, Jennifer Hill, Columbia University, New York
Book:

Data Analysis Using Regression and Multilevel/Hierarchical Models

Published online:

05 September 2012

Print publication:

18 December 2006, pp 79-108
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
Summary

Logistic regression is the standard way to model binary outcomes (that is, data yi that take on the values 0 or 1). Section 5.1 introduces logistic regression in a simple example with one predictor, then for most of the rest of the chapter we work through an extended example with multiple predictors and interactions.
Logistic regression with a single predictor
Example: modeling political preference given income
Conservative parties generally receive more support among voters with higher incomes. We illustrate classical logistic regression with a simple analysis of this pattern from the National Election Study in 1992. For each respondent i in this poll, we label yi = 1 if he or she preferred George Bush (the Republican candidate for president) or 0 if he or she preferred Bill Clinton (the Democratic candidate), for now excluding respondents who preferred Ross Perot or other candidates, or had no opinion. We predict preferences given the respondent's income level, which is characterized on a five-point scale.
The data are shown as (jittered) dots in Figure 5.1, along with the fitted logistic regression line, a curve that is constrained to lie between 0 and 1. We interpret the line as the probability that y = 1 given x—in mathematical notation, Pr(y = 1|x).

15 - Multilevel generalized linear models
from Part 2A - Multilevel regression
Andrew Gelman, Columbia University, New York, Jennifer Hill, Columbia University, New York
Book:

Data Analysis Using Regression and Multilevel/Hierarchical Models

Published online:

05 September 2012

Print publication:

18 December 2006, pp 325-342
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
Summary

As with linear and logistic regressions, generalized linear models can be fit to multilevel structures by including coefficients for group indicators and then adding group-level models. We illustrate in this chapter with three examples from our recent applied research: an overdispersed Poisson model for police stops, a multinomial logistic model for storable voting, and an overdispersed Poisson model for social networks.
Overdispersed Poisson regression: police stops and ethnicity
We return to the New York City police example introduced in Sections 1.2 and 6.2, where we formulated the problem as an overdispersed Poisson regression, and here we generalize to a multilevel model. In order to compare ethnic groups while controlling for precinct-level variation, we perform multilevel analyses using the city's 75 precincts. Allowing precinct-level effects is consistent with theories of policing such as the “broken windows” model that emphasize local, neighborhood-level strategies. Because it is possible that the patterns are systematically different in neighborhoods with different ethnic compositions, we divide the precincts into three categories in terms of their black population: precincts that were less than 10% black, 10%–40% black, and more than 40% black. We also account for variation in stop rates between the precincts within each group. Each of the three categories represents roughly one-third of the precincts in the city, and we perform separate analyses for each set.
Overdispersion as a variance component
As discussed in Chapter 6, data that are fit by a generalized linear model are overdispersed if the data-level variance is higher than would be predicted by the model.

7 - Simulation of probability models and statistical inferences
from Part 1B - Working with regression inferences
Andrew Gelman, Columbia University, New York, Jennifer Hill, Columbia University, New York
Book:

Data Analysis Using Regression and Multilevel/Hierarchical Models

Published online:

05 September 2012

Print publication:

18 December 2006, pp 137-154
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
Summary

Whenever we represent inferences for a parameter using a point estimate and standard error, we are performing a data reduction. If the estimate is normally distributed, this summary discards no information because the normal distribution is completely defined by its mean and variance. But in other cases it can be useful to represent the uncertainty in the parameter estimation by a set of random simulations that represent possible values of the parameter vector (with more likely values being more likely to appear in the simulation). By simulation, then, we mean summarizing inferences by random numbers rather than by point estimates and standard errors.
Simulation of probability models
In this section we introduce simulation for two simple probability models. The rest of the chapter discusses how to use simulations to summarize and understand regressions and generalize linear models, and the next chapter applies simulation to model checking and validation. Simulation is important in itself and also prepares for multilevel models, which we fit using simulation-based inference, as described in Part 2B.
A simple example of discrete predictive simulation
How many girls in 400 births? The probability that a baby is a girl or boy is 48.8% or 51.2%, respectively. Suppose that 400 babies are born in a hospital in a given year. How many will be girls?

4 - Linear regression: before and after fitting the model
from Part 1A - Single-level regression
Andrew Gelman, Columbia University, New York, Jennifer Hill, Columbia University, New York
Book:

Data Analysis Using Regression and Multilevel/Hierarchical Models

Published online:

05 September 2012

Print publication:

18 December 2006, pp 53-78
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
Summary

It is not always appropriate to fit a classical linear regression model using data in their raw form. As we discuss in Sections 4.1 and 4.4, linear and logarithmic transformations can sometimes help in the interpretation of the model. Nonlinear transformations of the data are sometimes necessary to more closely satisfy additivity and linearity assumptions, which in turn should improve the fit and predictive power of the model. Section 4.5 presents some other univariate transformations that are occasionally useful. We have already discussed interactions in Section 3.3, and in Section 4.6 we consider other techniques for combining input variables.
Linear transformations
Linear transformations do not affect the fit of a classical regression model, and they do not affect predictions: the changes in the inputs and the coefficients cancel in forming the predicted value Xβ. However, well-chosen linear transformation can improve interpretability of coefficients and make a fitted model easier to understand. We saw in Chapter 3 how linear transformations can help with the interpretation of the intercept; this section provides examples involving the interpretation of the other coefficients in the model.
Scaling of predictors and regression coefficients. The regression coefficient βj represents the average difference in y comparing units that differ by 1 unit on the jth predictor and are otherwise identical. In some cases, though, a difference of 1 unit on the x-scale is not the most relevant comparison.

Contents
Michael J. Wichura, University of Chicago
Book:

The Coordinate-Free Approach to Linear Models

Published online:

23 November 2009

Print publication:

23 October 2006, pp vii-x
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation

Frontmatter
Michael J. Wichura, University of Chicago
Book:

The Coordinate-Free Approach to Linear Models

Published online:

23 November 2009

Print publication:

23 October 2006, pp i-vi
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation

7 - Analysis of Covariance
Michael J. Wichura, University of Chicago
Book:

The Coordinate-Free Approach to Linear Models

Published online:

23 November 2009

Print publication:

23 October 2006, pp 141-163
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation

Preface
Michael J. Wichura, University of Chicago
Book:

The Coordinate-Free Approach to Linear Models

Published online:

23 November 2009

Print publication:

23 October 2006, pp xi-xiv
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
Summary

When I was a graduate student in the mid-1960s, the mathematical theory underlying analysis of variance and regression became clear to me after I read a draft of William Kruskal's monograph on the so-called coordinate-free, or geometric, approach to these subjects. Alas, with Kruskal's demise, this excellent treatise will never be published.
From time to time during the 1970s, 80s, and early 90s, I had the good fortune to teach the coordinate-free approach to linear models, more precisely, to Model I analysis of variance and linear regression with nonrandom predictors. While doing so, I evolved my own set of lecture notes, presented here. With regard to inspiration and content, my debt to Kruskal is clear. However, my notes differ from Kruskal's in many ways. To mention just a few, my notes are intended for a one- rather than three-quarter course. The notes are aimed at statistics graduate students who are already familiar with the basic concepts of linear algebra, such as linear subspaces and linear transformations, and who have already had some exposure to the matricial formulation of the GLM, perhaps through a methodology course, and who are interested in the underlying theory. I have also included Tjur experimental designs and some of the highlights of the optimality theory for estimation and testing in linear models under the assumption of normality, feeling that the elegant setting provided by the coordinate-free approach is a natural one in which to place these jewels of mathematical statistics.

3 - Random Vectors
Michael J. Wichura, University of Chicago
Book:

The Coordinate-Free Approach to Linear Models

Published online:

23 November 2009

Print publication:

23 October 2006, pp 45-59
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation

Statistical theory and methods

Refine search

Refine search

Actions for selected content:

2348 results in Statistical theory and methods

Part 1A - Single-level regression

Summary

10 - Causal inference using more advanced models

Summary

16 - Multilevel modeling in Bugs and R: the basics

Summary

Contents

18 - Likelihood and Bayesian inference and computation

Summary

Appendixes

13 - Multilevel linear models: varying slopes, non-nested models, and other complexities

14 - Multilevel logistic regression

Summary

20 - Sample size and power calculations

Summary

A - Six quick tips to improve your regression modeling

Summary

Part 1B - Working with regression inferences

Summary

5 - Logistic regression

Summary

15 - Multilevel generalized linear models

Summary

7 - Simulation of probability models and statistical inferences

Summary

4 - Linear regression: before and after fitting the model

Summary

Contents

Frontmatter

7 - Analysis of Covariance

Preface

Summary

3 - Random Vectors

Statistical theory and methods

Refine search

Refine search

Actions for selected content:

Save Search

2348 results in Statistical theory and methods

Summary

Summary

Summary

Summary

Summary

Summary

Summary

Summary

Summary

Summary

Summary

Summary

Summary