Hostname: page-component-5db58dd55d-jhf8m Total loading time: 0 Render date: 2026-06-02T21:31:03.980Z Has data issue: false hasContentIssue false

A Note on Listwise Deletion versus Multiple Imputation

Published online by Cambridge University Press:  03 August 2018

Thomas B. Pepinsky*
Affiliation:
Department of Government, Cornell University, USA. Email: pepinsky@cornell.edu
Rights & Permissions [Opens in a new window]

Abstract

This letter compares the performance of multiple imputation and listwise deletion using a simulation approach. The focus is on data that are “missing not at random” (MNAR), in which case both multiple imputation and listwise deletion are known to be biased. In these simulations, multiple imputation yields results that are frequently more biased, less efficient, and with worse coverage than listwise deletion when data are MNAR. This is the case even with very strong correlations between fully observed variables and variables with missing values, such that the data are very nearly “missing at random.” These results recommend caution when comparing the results from multiple imputation and listwise deletion, when the true data generating process is unknown.

Information

Type
Letter
Copyright
Copyright © The Author(s) 2018. Published by Cambridge University Press on behalf of the Society for Political Methodology. 
Figure 0

Table 1. Bias in listwise deletion and multiple imputation.

Figure 1

Figure 1. Simulation results.

Figure 2

Figure 2. Simulation results by missingness.

Figure 3

Figure 3. Simulation results by informativeness of the proxies.

Figure 4

Figure 4. Simulation results from a cluster randomized experiment.

Figure 5

Figure 5. Replication of Dettman, Pepinsky, and Pierskalla (2017).

Supplementary material: File

Pepinsky supplementary material

Pepinsky supplementary material 1

Download Pepinsky supplementary material(File)
File 264.9 KB