Hostname: page-component-89b8bd64d-ktprf Total loading time: 0 Render date: 2026-05-06T02:08:24.905Z Has data issue: false hasContentIssue false

Improving Subnational Opinion Estimation from Cluster-Sampled Polls

Published online by Cambridge University Press:  13 November 2024

Michael Auslen*
Affiliation:
Department of Government, University of Texas at Austin, Austin, TX, USA
Rights & Permissions [Opens in a new window]

Abstract

The development of multilevel regression and poststratification (MRP) has allowed scholars to more accurately estimate subnational public opinion using national polls. However, MRP generally recovers less accurate estimates from polls whose respondents are selected using cluster sampling – also called area-probability sampling. This is in part because cluster-sampled polls rely on a complex form of random sampling focused on national representativeness that may result in small or unrepresentative subsamples in subnational geographies. This has limited MRP’s usefulness in subnational opinion estimation in several contexts, including historical polls in the US, where cluster-sampling was common into the 1980s, and large academic studies in many countries today. In this paper, I propose two approaches to improve estimation from MRP with cluster-sampled polls. The first is pooling data from multiple surveys to produce a larger sample of clusters. The second is clustered MRP (CMRP), which extends MRP by modeling opinion using the geographic information included in a survey’s cluster-sampling procedure. Using simulations, I show that both methods improve upon traditional MRP, and I validate them using historical polls in the US

Information

Type
Original Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2024. Published by Cambridge University Press on behalf of the State Politics and Policy Section of the American Political Science Association
Figure 0

Figure 1. Opposition to Abortion Ban from Traditional MRP on 1980 Gallup Poll.Note: The lefthand panel plots opposition to a ban on abortions estimated using Traditional MRP. Darker states are more opposed to a ban. The righthand panel plots estimated opposition against state estimates of abortion liberalism from Brace et al. (2002). The blue curve is a least-squares regression of the relationship between the variables.

Figure 1

Table 1. Utah respondents in Gallup abortion poll

Figure 2

Figure 2. Results of Simulations: Traditional MRP with Cluster-Sampled Polls.Note: Each point reports results for a series of 100 simulations under a given set of conditions. Simulations vary the standard deviation parameter, $ \sigma $, for one variable’s effect on opinion. All other $ \sigma $ parameters are set to 0.1. Error bars cover results from 95% of simulations.

Figure 3

Figure 3. Pooling Presidential Election Polls for MRP.Note: Points represent the improvement observed from using a pooled sample over a single presidential election poll.

Figure 4

Figure 4. Results of Simulations: Testing CMRP.Note: Points report results for 100 simulations under a set of conditions. Simulations vary the standard deviation $ \sigma $ for one variable’s effect on opinion at a time. All models were performed on clustered samples. Percent Change in MAE and percent of states improving are relative measures, comparing CMRP to traditional MRP, and deep CMRP to deep MRP. Error bars cover 95% of simulations. Axis limits are constrained to preserve readability.

Figure 5

Figure 5. Validating CMRP with Presidential Election Polls.Note: Points represent the improvement observed from using a given CMRP method on presidential election polls, compared to an analogous traditional MRP method. Traditional MRP is used as a baseline for CMRP; Deep MRP is used as a baseline for Deep CMRP model. Detailed results are in Supplementary Appendix E.

Supplementary material: File

Auslen supplementary material

Auslen supplementary material
Download Auslen supplementary material(File)
File 1.7 MB