Entropy Balancing for Causal Effects: A Multivariate Reweighting Method to Produce Balanced Samples in Observational Studies

  • Jens Hainmueller
Abstract

This paper proposes entropy balancing, a data preprocessing method to achieve covariate balance in observational studies with binary treatments. Entropy balancing relies on a maximum entropy reweighting scheme that calibrates unit weights so that the reweighted treatment and control group satisfy a potentially large set of prespecified balance conditions that incorporate information about known sample moments. Entropy balancing thereby exactly adjusts inequalities in representation with respect to the first, second, and possibly higher moments of the covariate distributions. These balance improvements can reduce model dependence for the subsequent estimation of treatment effects. The method assures that balance improves on all covariate moments included in the reweighting. It also obviates the need for continual balance checking and iterative searching over propensity score models that may stochastically balance the covariate moments. We demonstrate the use of entropy balancing with Monte Carlo simulations and empirical applications.
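The reweighting idea described in the abstract can be illustrated with a minimal sketch. This is not the paper's implementation. It assumes uniform base weights, weights normalized to sum to one, and balance conditions that match control-group first and second moments to the treated-group sample moments. The function name `entropy_balance`, the synthetic data, and the Newton solver with backtracking are all choices made here for illustration. The sketch minimizes the convex dual of the maximum entropy problem, whose solution gives weights proportional to `exp(lambda' c(x_i))`.

```python
import numpy as np


def entropy_balance(C_control, target, max_iter=200, tol=1e-12):
    """Return control weights (summing to 1) whose weighted moments match `target`.

    Hypothetical helper: solves the dual of a maximum entropy problem with
    uniform base weights using damped Newton steps.
    """
    Z = np.asarray(C_control, dtype=float) - np.asarray(target, dtype=float)
    k = Z.shape[1]
    lam = np.zeros(k)

    def weights(lam):
        eta = Z @ lam
        eta -= eta.max()              # stabilize the exponentials
        w = np.exp(eta)
        return w / w.sum()

    def dual(lam):
        eta = Z @ lam
        mx = eta.max()
        return mx + np.log(np.exp(eta - mx).sum())   # log-sum-exp objective

    for _ in range(max_iter):
        w = weights(lam)
        grad = Z.T @ w                                # weighted moment imbalance
        hess = (Z * w[:, None]).T @ Z - np.outer(grad, grad)
        step = np.linalg.solve(hess, grad)
        decrement = grad @ step                       # Newton decrement
        if decrement < tol:
            break
        # Backtracking line search (Armijo condition) on the dual objective.
        t, f0 = 1.0, dual(lam)
        while dual(lam - t * step) > f0 - 1e-4 * t * decrement and t > 1e-8:
            t *= 0.5
        lam = lam - t * step
    return weights(lam)


# Synthetic illustration (made-up data): two covariates, shifted in the treated group.
rng = np.random.default_rng(0)
X_c = rng.normal(0.0, 1.0, size=(500, 2))     # control units
X_t = rng.normal(0.5, 1.0, size=(200, 2))     # treated units

# Balance conditions: first and second moments of each covariate.
C_c = np.column_stack([X_c, X_c ** 2])
C_t = np.column_stack([X_t, X_t ** 2])
target = C_t.mean(axis=0)

w = entropy_balance(C_c, target)
imbalance = C_c.T @ w - target                 # should be numerically close to zero
```

In this sketch the balance conditions hold up to solver tolerance by construction, which is the sense in which no separate balance check on the included moments is needed afterwards. The solver only converges when the target moments can be reproduced by some positive reweighting of the control units.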

Political Analysis
  • ISSN: 1047-1987
  • EISSN: 1476-4989
  • URL: /core/journals/political-analysis