Hostname: page-component-89b8bd64d-mmrw7 Total loading time: 0 Render date: 2026-05-06T20:21:21.923Z Has data issue: false hasContentIssue false

Worth Weighting? How to Think About and Use Weights in Survey Experiments

Published online by Cambridge University Press:  25 May 2018

Luke W. Miratrix
Affiliation:
Graduate School of Education, Harvard University, Cambridge, MA 02138, USA
Jasjeet S. Sekhon*
Affiliation:
Department of Political Science and Statistics, UC Berkeley, Berkeley, CA 94720, USA. Email: sekhon@berkeley.edu
Alexander G. Theodoridis
Affiliation:
Department of Political Science, UC Merced, Merced, CA 95340, USA
Luis F. Campos
Affiliation:
Department of Statistics, Harvard University, Cambridge, MA 02138, USA
Rights & Permissions [Opens in a new window]

Abstract

The popularity of online surveys has increased the prominence of using sampling weights to enhance claims of representativeness. Yet, much uncertainty remains regarding how these weights should be employed in survey experiment analysis: should they be used? If so, which estimators are preferred? We offer practical advice, rooted in the Neyman–Rubin model, for researchers working with survey experimental data. We examine simple, efficient estimators, and give formulas for their biases and variances. We provide simulations that examine these estimators as well as real examples from experiments administered online through YouGov. We find that for examining the existence of population treatment effects using high-quality, broadly representative samples recruited by top online survey firms, sample quantities, which do not rely on weights, are often sufficient. We found that sample average treatment effect (SATE) estimates did not appear to differ substantially from their weighted counterparts, and they avoided the substantial loss of statistical power that accompanies weighting. When precise estimates of population average treatment effects (PATE) are essential, we analytically show poststratifying on survey weights and/or covariates highly correlated with outcomes to be a conservative choice. While we show substantial gains in simulations, we find limited evidence of them in practice.

Information

Type
Articles
Copyright
Copyright © The Author(s) 2018. Published by Cambridge University Press on behalf of the Society for Political Methodology. 
Figure 0

Table 1. Simulations A & B. Performance of different estimators as estimators for the PATE for (A) a heterogeneous treatment effect scenario with $\unicode[STIX]{x1D70F}=32.59$ and (B) a constant treatment effect of $\unicode[STIX]{x1D70F}=30$. For each estimator, we have, from left to right, its expected value, bias, standard error, root mean squared error, average bootstrap SE estimate, and coverage across 10,000 trials.

Figure 1

Figure 1. (a) Bias, (b) standard error, and (c) root mean squared error of estimates when selection probability is increasingly related to the potential outcomes (Simulation C). The horizontal axis, $\unicode[STIX]{x1D6FE}$, varies the strength of the relationship between outcome and weight. Gray are SATE-targeting, black PATE-targeting. Solid are oracle estimators using all potential outcomes of the sample, dashed are actual estimators. The thicker lines are averages over the 20 simulated populations in light gray dots.

Figure 2

Figure 2. Relative efficiency of $\hat{\unicode[STIX]{x1D70F}}_{hh}$ versus $\hat{\unicode[STIX]{x1D70F}}_{\text{SATE}}$ of the estimates for the 92 experiments.

Figure 3

Figure 3. Quantile–quantile plot of the relative standardized differences$\hat{\unicode[STIX]{x1D6FF}}$ of the estimates for the 92 experiments grouped into the larger surveys they are part of.

Supplementary material: PDF

Miratrix et al. supplementary material

Miratrix et al. supplementary material 1

Download Miratrix et al. supplementary material(PDF)
PDF 396.4 KB