Hostname: page-component-77c78cf97d-d2fvj Total loading time: 0 Render date: 2026-05-04T20:12:53.064Z Has data issue: false hasContentIssue false

Stacked Regression and Poststratification

Published online by Cambridge University Press:  23 December 2019

Joseph T. Ornstein*
Affiliation:
Brown School, Washington University in St. Louis, St. Louis, MO63130, USA. Email: jornstein@wustl.edu
Rights & Permissions [Opens in a new window]

Abstract

I develop a procedure for estimating local-area public opinion called stacked regression and poststratification (SRP), a generalization of classical multilevel regression and poststratification (MRP). This procedure employs a diverse ensemble of predictive models—including multilevel regression, LASSO, k-nearest neighbors, random forest, and gradient boosting—to improve the cross-validated fit of the first-stage predictions. In a Monte Carlo simulation, SRP significantly outperforms MRP when there are deep interactions in the data generating process, without requiring the researcher to specify a complex parametric model in advance. In an empirical application, I show that SRP produces superior local public opinion estimates on a broad range of issue areas, particularly when trained on large datasets.

Information

Type
Letter
Copyright
Copyright © The Author(s) 2019. Published by Cambridge University Press on behalf of the Society for Political Methodology.
Figure 0

Table 1. The SRP Procedure.

Figure 1

Figure 1. A representative simulation from the Monte Carlo analysis. Disaggregation, MRP, and SRP estimates are plotted against true subnational unit means. Parameter values: $\unicode[STIX]{x1D703}=5$, $\unicode[STIX]{x1D70C}=0.4$, $N=15\,000$, $M=200$, $n=5000$, $\unicode[STIX]{x1D70E}^{2}=5$.

Figure 2

Figure 2. Relative performance of disaggregation, MRP, and SRP estimates, varying $\unicode[STIX]{x1D703}$. Parameters Used: $\unicode[STIX]{x1D70C}=0.4$, $n=5000$, $M=200$, $N=15\,000$, $\unicode[STIX]{x1D70E}^{2}=5$. Points denote 10-run averages.

Figure 3

Figure 3. Plots in the left column compare SRP and MRP correlations, varying sample size. Plots in the right column compare Mean Absolute Error.

Figure 4

Figure 4. Mean performance of SRP and MRP across all simulations, varying sample size.

Supplementary material: File

Ornstein supplementary material

Ornstein supplementary material

Download Ornstein supplementary material(File)
File 1.3 MB