Skip to main content
×
Home
    • Aa
    • Aa

Causal Inference without Balance Checking: Coarsened Exact Matching

  • Stefano M. Iacus (a1), Gary King (a2) and Giuseppe Porro (a3)
Abstract

We discuss a method for improving causal inferences called “Coarsened Exact Matching” (CEM), and the new “Monotonic Imbalance Bounding” (MIB) class of matching methods from which CEM is derived. We summarize what is known about CEM and MIB, derive and illustrate several new desirable statistical properties of CEM, and then propose a variety of useful extensions. We show that CEM possesses a wide range of statistical properties not available in most other matching methods but is at the same time exceptionally easy to comprehend and use. We focus on the connection between theoretical properties and practical applications. We also make available easy-to-use open source software for R, Stata, and SPSS that implement all our suggestions.

Copyright
Corresponding author
e-mail: king@harvard.edu (corresponding author)
Footnotes
Hide All

Edited by Jonathan N. Katz

Authors' note: Open source R, Stata, and SPSS software to implement the methods described herein (called CEM) is available at http://gking.harvard.edu/cem; the CEM algorithm is also available via a standard interface offered in the R package MatchIt. Thanks to Erich Battistin, Nathaniel Beck, Matt Blackwell, Andy Eggers, Adam Glynn, Justin Grimmer, Jens Hainmueller, Ben Hansen, Kosuke Imai, Guido Imbens, Fabrizia Mealli, Walter Mebane, Clayton Nall, Enrico Rettore, Jamie Robins, Don Rubin, Jas Sekhon, Jeff Smith, Kevin Quinn, and Chris Winship for helpful comments. All information necessary to replicate the results in this paper appear in Iacus, King, and Porro (2011b).

Footnotes
Linked references
Hide All

This list contains references from the content that can be linked to their source. For a full set of references and notes please see the PDF or HTML where available.

Alberto Abadie , and Javier Gardeazabal . 2003. The economic costs of conflict: A case study of the Basque Country. American Economic Review 93: 113–32.

Alberto Abadie , and Javier Gardeazabal . 2003. The economic costs of conflict: A case study of the Basque Country. American Economic Review 93: 113–32.

Peter C. Austin , and Muhammad M. Mamdani 2006. A comparison of propensity score methods: A case-study estimating the effectiveness of post-AMI statin use. Statistics in Medicine 25: 2084–106.

Peter C. Austin , and Muhammad M. Mamdani 2006. A comparison of propensity score methods: A case-study estimating the effectiveness of post-AMI statin use. Statistics in Medicine 25: 2084–106.

Daniel Paul Carpenter . 2002. Groups, the media, agency waiting costs, and FDA drug approval. American Journal of Political Science 46: 490505.

Daniel Paul Carpenter . 2002. Groups, the media, agency waiting costs, and FDA drug approval. American Journal of Political Science 46: 490505.

Richard K. Crump , V. Joseph Hotz , Guido W. Imbens , and Oscar Mitnik . 2009. Dealing with limited overlap in estimation of average treatment effects. Biometrika 96: 187.

Richard K. Crump , V. Joseph Hotz , Guido W. Imbens , and Oscar Mitnik . 2009. Dealing with limited overlap in estimation of average treatment effects. Biometrika 96: 187.

Rajeev H. Dehejia , and Sadek Wahba . 1999. Causal effects in nonexperimental studies: Re-evaluating the evaluation of training programs. Journal of the American Statistical Association 94: 1053–62.

Rajeev H. Dehejia , and Sadek Wahba . 1999. Causal effects in nonexperimental studies: Re-evaluating the evaluation of training programs. Journal of the American Statistical Association 94: 1053–62.

Rajeev H. Dehejia , and Sadek Wahba . 2002. Propensity score matching methods for non-experimental causal studies. Review of Economics and Statistics 84: 151–61.

Rajeev H. Dehejia , and Sadek Wahba . 2002. Propensity score matching methods for non-experimental causal studies. Review of Economics and Statistics 84: 151–61.

Ben Hansen . 2008. The prognostic analogy of the propensity score. Biometrika 95: 481–88.

Ben Hansen . 2008. The prognostic analogy of the propensity score. Biometrika 95: 481–88.

James Heckman , H. Ichimura , and P. Todd 1997. Matching as an econometric evaluation estimator: Evidence from evaluating a job training program. Review of Economic Studies 64: 605–54.

James Heckman , H. Ichimura , and P. Todd 1997. Matching as an econometric evaluation estimator: Evidence from evaluating a job training program. Review of Economic Studies 64: 605–54.

Daniel Ho , Kosuke Imai , Gary King , and Elizabeth Stuart . 2007. Matching as nonparametric preprocessing for reducing model dependence in parametric causal inference. Political Analysis 15: 199236. http://gking.harvard.edu/files/abs/matchp-abs.shtml (accessed 2007).

Stefano M. Iacus , Gary King , and Giuseppe Porro . 2009. CEM: Coarsened Exact Matching Software. Journal of Statistical Software 30(9), http://gking.harvard.edu/cem.

Stefano M. Iacus , Gary King , and Giuseppe Porro . 2009. CEM: Coarsened Exact Matching Software. Journal of Statistical Software 30(9), http://gking.harvard.edu/cem.

Stefano M. Iacus , Gary King , and Giuseppe Porro . 2011. Multivariate matching methods that are Monotonic Imbalance Bounding. Journal of the American Statistical Association. http://gking.harvard.edu/files/abs/cem-math-abs.shtml.

Stefano M. Iacus , Gary King , and Giuseppe Porro . 2011. Multivariate matching methods that are Monotonic Imbalance Bounding. Journal of the American Statistical Association. http://gking.harvard.edu/files/abs/cem-math-abs.shtml.

Stefano M. Iacus , and Giuseppe Porro . 2007. Missing data imputation, matching and other applications of random recursive partitioning. Computational Statistics and Data Analysis 52: 773–89.

Stefano M. Iacus , and Giuseppe Porro . 2007. Missing data imputation, matching and other applications of random recursive partitioning. Computational Statistics and Data Analysis 52: 773–89.

Stefano M. Iacus , and Giuseppe Porro . 2008. Invariant and metric free proximities for data matching: An R package. Journal of Statistical Software 25(11): 122.

Stefano M. Iacus , and Giuseppe Porro . 2008. Invariant and metric free proximities for data matching: An R package. Journal of Statistical Software 25(11): 122.

Kosuke Imai , Gary King , and Clayton Nall . 2009. The essential role of pair matching in cluster-randomized experiments, with application to the Mexican universal health insurance evaluation. Statistical Science 24(1): 2953. http://gking.harvard.edu/files/abs/cluster-abs.shtml.

Kosuke Imai , Gary King , and Clayton Nall . 2009. The essential role of pair matching in cluster-randomized experiments, with application to the Mexican universal health insurance evaluation. Statistical Science 24(1): 2953. http://gking.harvard.edu/files/abs/cluster-abs.shtml.

Kosuke Imai , and D. A. van Dyk 2004. Causal inference with general treatment regimes: Generalizing the propensity score. Journal of the American Statistical Association 99: 854–66.

Kosuke Imai , and D. A. van Dyk 2004. Causal inference with general treatment regimes: Generalizing the propensity score. Journal of the American Statistical Association 99: 854–66.

Guido W. Imbens 2000. The role of the propensity score in estimating dose-response functions. Biometrika 87: 706–10.

Guido W. Imbens 2000. The role of the propensity score in estimating dose-response functions. Biometrika 87: 706–10.

Guido W. Imbens 2004. Nonparametric estimation of average treatment effects under exogeneity: A review. Review of Economics and Statistics 86: 429.

Guido W. Imbens 2004. Nonparametric estimation of average treatment effects under exogeneity: A review. Review of Economics and Statistics 86: 429.

Guido W. Imbens , and Joshua D. Angrist 1994. Identification and estimation of local average treatment effects. Econometrica 62: 467–75.

Guido W. Imbens , and Joshua D. Angrist 1994. Identification and estimation of local average treatment effects. Econometrica 62: 467–75.

Gary King , and Langche Zeng . 2006. The dangers of extreme counterfactuals. Political Analysis 14: 131–59. http://gking.harvard.edu/files/abs/counterft-abs.shtml.

Gary King , and Langche Zeng . 2006. The dangers of extreme counterfactuals. Political Analysis 14: 131–59. http://gking.harvard.edu/files/abs/counterft-abs.shtml.

Gary King , and Langche Zeng . 2007. When can history be our guide? The pitfalls of counterfactual inference. International Studies Quarterly 51: 183210. http://gking.harvard.edu/files/abs/counterf-abs.shtml.

Gary King , and Langche Zeng . 2007. When can history be our guide? The pitfalls of counterfactual inference. International Studies Quarterly 51: 183210. http://gking.harvard.edu/files/abs/counterf-abs.shtml.

Bo Lu , Elaine Zanuto , Robert Hornik , and Paul R. Rosenbaum 2001. Matching with doses in an observational study of a media campaign against drug abuse. Journal of the American Statistical Association 96: 1245–53.

Bo Lu , Elaine Zanuto , Robert Hornik , and Paul R. Rosenbaum 2001. Matching with doses in an observational study of a media campaign against drug abuse. Journal of the American Statistical Association 96: 1245–53.

Paul W. Mielke , and Kenneth J. Berry 2007. Permutation methods: A distance function approach. New York: Springer.

Paul W. Mielke , and Kenneth J. Berry 2007. Permutation methods: A distance function approach. New York: Springer.

Stephen L. Morgan , and Christopher Winship . 2007. Counterfactuals and causal inference: Methods and principles for social research. Cambridge: Cambridge University Press.

Stephen L. Morgan , and Christopher Winship . 2007. Counterfactuals and causal inference: Methods and principles for social research. Cambridge: Cambridge University Press.

Paul R. Rosenbaum , Richard N. Ross , and Jeffrey H. Silber 2007. Minimum distance matched sampling with fine balance in an observational study of treatment for ovarian cancer. Journal of the American Statistical Association 102: 7583.

Paul R. Rosenbaum , Richard N. Ross , and Jeffrey H. Silber 2007. Minimum distance matched sampling with fine balance in an observational study of treatment for ovarian cancer. Journal of the American Statistical Association 102: 7583.

Donald B. Rubin 1976. Inference and missing data. Biometrika 63: 581–92.

Donald B. Rubin 1976. Inference and missing data. Biometrika 63: 581–92.

Donald B. Rubin 1987. Multiple imputation for nonresponse in surveys. New York: John Wiley.

Donald B. Rubin 1987. Multiple imputation for nonresponse in surveys. New York: John Wiley.

Donald B. Rubin 2001. Using propensity scores to help design observational studies: Application to the tobacco litigation. Health Services & Outcomes Research Methodology 2: 169–88.

Donald B. Rubin 2001. Using propensity scores to help design observational studies: Application to the tobacco litigation. Health Services & Outcomes Research Methodology 2: 169–88.

Donald B. Rubin 2006. Matched sampling for causal effects. Cambridge, UK: Cambridge University Press.

Donald B. Rubin 2006. Matched sampling for causal effects. Cambridge, UK: Cambridge University Press.

David W. Scott 1992. Multivariate density estimation. Theory, practice and visualization. New York: John Wiley & Sons, Inc.

David W. Scott 1992. Multivariate density estimation. Theory, practice and visualization. New York: John Wiley & Sons, Inc.

Hideaki Shimazaki , and Shigeru Shinomoto . 2007. A method for selecting the bin size of a time histogram. Neural Computation 19: 1503–27.

Hideaki Shimazaki , and Shigeru Shinomoto . 2007. A method for selecting the bin size of a time histogram. Neural Computation 19: 1503–27.

Jeffrey A. Smith , and Petra E. Todd 2005. Does matching overcome LaLonde's critique of nonexperimental estimators? Journal of Econometrics 125: 305–53.

Jeffrey A. Smith , and Petra E. Todd 2005. Does matching overcome LaLonde's critique of nonexperimental estimators? Journal of Econometrics 125: 305–53.

Ebonya L. Washington 2008. Female socialization: How daughters affect their legislator fathers' voting on woman's issues. American Economic Review 98: 311–32.

Ebonya L. Washington 2008. Female socialization: How daughters affect their legislator fathers' voting on woman's issues. American Economic Review 98: 311–32.

Recommend this journal

Email your librarian or administrator to recommend adding this journal to your organisation's collection.

Political Analysis
  • ISSN: 1047-1987
  • EISSN: 1476-4989
  • URL: /core/journals/political-analysis
Please enter your name
Please enter a valid email address
Who would you like to send this to? *
×
MathJax

Metrics

Altmetric attention score

Full text views

Total number of HTML views: 0
Total number of PDF views: 53 *
Loading metrics...

Abstract views

Total abstract views: 267 *
Loading metrics...

* Views captured on Cambridge Core between 4th January 2017 - 19th August 2017. This data will be updated every 24 hours.