Hostname: page-component-6766d58669-rxg44 Total loading time: 0 Render date: 2026-05-18T11:43:29.622Z Has data issue: false hasContentIssue false

The shape of and solutions to the MTurk quality crisis

Published online by Cambridge University Press:  24 April 2020

Ryan Kennedy
Affiliation:
Department of Political Science, University of Houston, Houston, TX, USA
Scott Clifford*
Affiliation:
Department of Political Science, University of Houston, Houston, TX, USA
Tyler Burleigh
Affiliation:
Clover Health, Jersey City, NJ, USA
Philip D. Waggoner
Affiliation:
Computational Social Science, University of Chicago, Chicago, USA
Ryan Jewell
Affiliation:
Department of Political Science, University of Houston, Houston, TX, USA
Nicholas J. G. Winter
Affiliation:
Department of Politics, University of Virginia, Charlottesville, VA, USA
*
*Corresponding author. Email: sclifford@uh.edu
Rights & Permissions [Opens in a new window]

Abstract

Amazon's Mechanical Turk is widely used for data collection; however, data quality may be declining due to the use of virtual private servers to fraudulently gain access to studies. Unfortunately, we know little about the scale and consequence of this fraud, and tools for social scientists to detect and prevent this fraud are underdeveloped. We first analyze 38 studies and show that this fraud is not new, but has increased recently. We then show that these fraudulent respondents provide particularly low-quality data and can weaken treatment effects. Finally, we provide two solutions: an easy-to-use application for identifying fraud in the existing datasets and a method for blocking fraudulent respondents in Qualtrics surveys.

Information

Type
Original Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution, and reproduction in any medium, provided the original work is properly cited.
Copyright
Copyright © The European Political Science Association 2020
Figure 0

Figure 1. Audit of past studies.

Figure 1

Figure 2. Prevalence of low-quality data by respondent IP type in study 1.

Figure 2

Figure 3. Comparing treatment effects among valid and fraudulent respondents.

Figure 3

Figure 4. Prevalence of low-quality data by respondent IP type in study 2.

Figure 4

Table 1. Comparison between IP Hub and Know Your IP

Figure 5

Figure 5. Path diagram of screening protocol.

Supplementary material: PDF

Kennedy et al. supplementary material

Kennedy et al. supplementary material

Download Kennedy et al. supplementary material(PDF)
PDF 1.6 MB