Hostname: page-component-89b8bd64d-ktprf Total loading time: 0 Render date: 2026-05-07T05:28:00.904Z Has data issue: false hasContentIssue false

What to Observe When Assuming Selection on Observables

Published online by Cambridge University Press:  27 June 2025

Kevin M. Quinn*
Affiliation:
Department of Quantitative Theory & Methods and School Law, Emory University, Atlanta, GA, USA
Guoer Liu
Affiliation:
Department of Political Science, University of California, San Diego, CA, USA
Lee Epstein
Affiliation:
Department of Political Science, Washington University, St. Louis, MO, USA
Andrew D. Martin
Affiliation:
Department of Political Science and Department of Statistics and Data Science and School of Law, Washington University, St. Louis, MO, USA
*
Corresponding author: Kevin M. Quinn; Email: kevin.michael.quinn@emory.edu
Rights & Permissions [Opens in a new window]

Abstract

Political scientists regularly rely on a selection-on-observables assumption to identify causal effects of interest. Once a causal effect has been identified in this way, a wide variety of estimators can, in principle, be used to consistently estimate the effect of interest. While these estimators are all justified by appeals to the same causal identification assumptions, they often differ greatly in how they make use of the data at hand. For instance, methods based on regression rely on an explicit model of the outcome variable but do not explicitly model the treatment assignment process, whereas methods based on propensity scores explicitly model the treatment assignment process but do not explicitly model the outcome variable. Understanding the tradeoffs between estimation methods is complicated by these seemingly fundamental differences. In this paper we seek to rectify this problem. We do so by clarifying how most estimators of causal effects that are justified by an appeal to a selection-on-observables assumption are all special cases of a general weighting estimator. We then explain how this commonality provides for diagnostics that allow for meaningful comparisons across estimation methods—even when the methods are seemingly very different. We illustrate these ideas with two applied examples.

Information

Type
Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (https://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2025. Published by Cambridge University Press on behalf of The Society for Political Methodology
Figure 0

Table 1 Re-analysis of Black and Owens (2016) with multiple approaches to estimating the ATT.

Figure 1

Figure 1 Weighted mean covariate balance (assessed by TASMD) and KS test statistic for control units using multiple ATT estimation methods in the re-analysis of Black and Owens (2016).Note: In the TASMD plot, each symbol shows the standardized difference between the weighted mean of the control and treated data for each covariate and method. The gray vertical line marks where the TASMD value is 0.1. In the KS statistic plot, each symbol shows the maximum absolute difference in the ECDFs of the control and treated groups, using both raw and weighted control data. The gray vertical line marks KS statistics at 0. SC med is the JCS score of the median Supreme Court justice; Panel JCS is the ideological distance between a judge and the remaining panelists; JCS score is each judge’s JCS score; Ideo. Distance is the ideological distance between the judge and the president; Court reversal is whether the circuit court reversed the lower court; Circuit med is the JCS score of the median judge on the circuit; and Case pub. is whether the case was published.

Figure 2

Figure 2 Quantile-quantile plot for Ideo. Distance with raw and entropy-balancing-weighted data for contender judges.Note: The X-axis depicts weighted quantiles of ideological distance between the judge and the president for the control units, while the Y-axis shows depicts weighted quantiles of ideological distance for the treated group, which is the target distribution in this case. A point on the 45-degree line indicates equality of the quantiles represented by that point.

Figure 3

Table 2 Re-analysis of Eggers and Hainmueller (2009) with multiple estimation methods.

Figure 4

Figure 3 Weighted mean covariate balance (assessed by TASMD) and KS test statistics for control units using multiple ATT estimation methods in the re-analysis of Eggers and Hainmueller (2009).Note: In the TASMD plot, each symbol represents the standardized difference between the weighted mean of the control data and the mean of the target data (i.e., the sample mean of that covariate among all treated units in the sample). The gray vertical line marks where the TASMD value is 0.1. In KS statistic plot, each symbol shows the maximum absolute difference in the ECDFs of the control and treated groups, using both raw and weighted control data. The gray vertical line marks KS statistics at 0.

Figure 5

Figure 4 Quantile-quantile plot for Birth Year and Death Year with raw data and genetic matching weighted data for ATT estimation among conservative politicians.Note: The X-axes show the weighted birth and death year quantiles for the control units, while the Y-axes shows the corresponding quantiles for the treated group, which is the target distribution in this case. Points on the 45-degree line correspond to equality of the quantiles represented by that point.

Supplementary material: File

Quinn et al. supplementary material

Quinn et al. supplementary material
Download Quinn et al. supplementary material(File)
File 362.3 KB