Hostname: page-component-77f85d65b8-8v9h9 Total loading time: 0 Render date: 2026-04-20T07:04:30.580Z Has data issue: false hasContentIssue false

How should we estimate inverse probability weights with possibly misspecified propensity score models?

Published online by Cambridge University Press:  15 August 2024

Hiroto Katsumata*
Affiliation:
Institute of Social Science, The University of Tokyo, Tokyo, Japan
Rights & Permissions [Opens in a new window]

Abstract

Inverse probability weighting is a common remedy for missing data issues, notably in causal inference. Despite its prevalence, practical application is prone to bias from propensity score model misspecification. Recently proposed methods try to rectify this by balancing some moments of covariates between the target and weighted groups. Yet, bias persists without knowledge of the true outcome model. Drawing inspiration from the quasi maximum likelihood estimation with misspecified statistical models, I propose an estimation method minimizing a distance between true and estimated weights with possibly misspecified models. This novel approach mitigates bias and controls mean squared error by minimizing their upper bounds. As an empirical application, it gives new insights into the study of foreign occupation and insurgency in France.

Information

Type
Original Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2024. Published by Cambridge University Press on behalf of EPS Academic Ltd.
Figure 0

Figure 1. Goals of the proposed method.

Figure 1

Figure 2. A numerical example: The true and estimated weights and propensity scores with a misspecified propensity score model. Notes: This figure compares the performance of the proposed estimator (triangles) and MLE estimator (circles) with a misspecified propensity score model in terms of inverse probability weights (left panel) and propensity scores (right panel). In the left (right) panel, the x-axis represents the true weights (propensity scores), and the y-axis represents the estimated ones. The shaded area in the left panel indicates that the difference between the true and estimated weights is small (less than two) whereas the shaded area in the right panel indicates the corresponding area where the difference in weights is small (not the difference in propensity scores).

Figure 2

Table 1. Comparison of the proposed and existing methods for the average outcome estimation

Figure 3

Figure 3. The performance of the proposed estimators with different levels of regularization Notes: The horizontal dotted line indicates the RMSE of the uniform weights.

Figure 4

Table 2. Simulation results: linear outcome models

Figure 5

Table 3. Simulation results: quadratic outcome models

Figure 6

Table 4. Simulation results: exponential outcome models

Figure 7

Figure 4. Simulation results: Bias and RMSE with misspecified propensity score models Notes: This figure compares the performance of the DBW DR estimator (x-axis) with the calibrated weighting DR (circles), CBPS DR (triangles), and entropy balancing DR (squares) estimators (y-axis) in terms of the absolute bias (left panel) and RMSE (right panel). Those above the diagonal line indicate that the DBW DR works better than each of these estimators and those below indicate the opposite.

Figure 8

Figure 5. Upper bounds of bias due to distributional imbalance for various methods, Notes: As the distance from the demarcation line increases, bias due to multivariate distributional imbalance becomes severe, but the proposed estimator mitigates it more effectively than the others.

Figure 9

Figure 6. Political devolution decreases resistance activities only near the demarcation line Notes: The left panel presents the ATE estimates by various estimators, diverging as the distance from the demarcation line increases. This divergence is consistent with the estimated upper bounds of bias shown in Figure 5. The right panel presents the ATE estimates and their standard errors estimated by the nDBW DR estimator, which demonstrates that the treatment effects decrease steeply toward zero as the distance from the demarcation line increases.

Figure 10

Figure 7. Distribution balance by various methods.

Supplementary material: File

Katsumata supplementary material

Katsumata supplementary material
Download Katsumata supplementary material(File)
File 703.4 KB