Hostname: page-component-89b8bd64d-72crv Total loading time: 0 Render date: 2026-05-08T13:30:38.742Z Has data issue: false hasContentIssue false

The average laboratory samples a population of 7,300 Amazon Mechanical Turk workers

Published online by Cambridge University Press:  01 January 2023

Neil Stewart*
Affiliation:
Department of Psychology, University of Warwick, Coventry, CV4 7AL, UK
Christoph Ungemach
Affiliation:
Columbia University
Adam J. L. Harris
Affiliation:
University College London
Daniel M. Bartels
Affiliation:
University of Chicago
Ben R. Newell
Affiliation:
University of New South Wales
Gabriele Paolacci
Affiliation:
Rotterdam School of Management, Erasmus University Rotterdam
Jesse Chandler
Affiliation:
University of Michigan and Mathematica Policy Research
Rights & Permissions [Opens in a new window]

Abstract

Using capture-recapture analysis we estimate the effective size of the active Amazon Mechanical Turk (MTurk) population that a typical laboratory can access to be about 7,300 workers. We also estimate that the time taken for half of the workers to leave the MTurk pool and be replaced is about 7 months. Each laboratory has its own population pool which overlaps, often extensively, with the hundreds of other laboratories using MTurk. Our estimate is based on a sample of 114,460 completed sessions from 33,408 unique participants and 689 sessions across seven laboratories in the US, Europe, and Australia from January 2012 to March 2015.

Information

Type
Research Article
Creative Commons
Creative Common License - CCCreative Common License - BY
The authors license this article under the terms of the Creative Commons Attribution 4.0 License.
Copyright
Copyright © The Authors [2015] This is an Open Access article, distributed under the terms of the Creative Commons Attribution license (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution, and reproduction in any medium, provided the original work is properly cited.
Figure 0

Figure 1: The details of timing, HIT acceptance and location requirements, and pay and duration across the seven labs. The first row shows the timing of the experiments by laboratory. A dot, jittered vertically, represents a single HIT. The second and third rows show the differences between laboratories in HIT acceptance rates and location requirements for participation. The final row shows scatter plots of the median pay against duration for each experiment. Each circle is a batch and its area is proportional to the number of HITs. The dashed line is the $7.25 per hour US federal minimum wage, with batches under the line paying less. Note, scales differ over panels.

Figure 1

Figure 2: Open population analysis results. Error bars are the extent of 95% confidence intervals.

Figure 2

Figure 3: Open population analysis results for different hourly rates of pay. Column headings give the ranges of pay rates for the four quartiles in the distribution of hourly pay. Error bars are the extent of 95% confidence intervals.

Figure 3

Figure 4: Open population analysis results for different size batches. Column headings give the ranges of batch sizes for the four quartiles in the distribution of batch sizes. Error bars are the extent of 95% confidence intervals.

Figure 4

Figure 5: The distribution of the number of other batches completed within a laboratory.

Figure 5

Figure 6: The distribution of the number of other laboratories visited.

Figure 6

Figure 7: The joint distribution of worker and laboratory capture probabilities, together with marginal distributions.

Figure 7

Figure A1: Five fish are caught and tagged on the first day. Another four fish are caught on the second day. In this second catch, one quarter are tagged. Thus there are 20 fish in the pond.

Figure 8

Table 1: Frequencies of capture histories for Red-Back Voles.

Figure 9

Table 2: The X model matrices for the M0, Mt, Mh, and Mb Poisson regression.

Figure 10

Table 3: Frequencies of capture histories for eider ducks.

Figure 11

Table 4: The model matrix for the Poisson regression.

Figure 12

Table 5: Parameter values from the Poisson regression.

Figure 13

Table 6: Calculating the open population model parameters from Poisson regression coefficients.

Figure 14

Table 7: Jolly-Seber model parameters.

Supplementary material: File

Stewart et al. supplementary material

Stewart et al. supplementary material 1
Download Stewart et al. supplementary material(File)
File 7.2 MB
Supplementary material: File

Stewart et al. supplementary material

Stewart et al. supplementary material 2
Download Stewart et al. supplementary material(File)
File 3.4 KB