Hostname: page-component-6766d58669-bkrcr Total loading time: 0 Render date: 2026-05-24T20:51:31.248Z Has data issue: false hasContentIssue false

Sampling Matters: What Simulation Modeling and Microrefuse Sampling Practice Reveal about Archaeological Sampling, Training, and Design

Published online by Cambridge University Press:  06 April 2026

Isaac Imran Taber Ullah*
Affiliation:
Department of Anthropology, San Diego State University, San Diego, CA, USA
*
Rights & Permissions [Opens in a new window]

Abstract

Archaeological sampling is a critical yet inconsistently applied aspect of field methodology. Poorly designed strategies produce biased or irreproducible results, especially when recovery is labor-intensive and research expectations are high. This article addresses that challenge through the lens of spatial microrefuse analysis, using simulation modeling to evaluate current practices and improve sampling design, training, and planning. A review of 27 published microrefuse studies reveals wide variation in sampling strategies, unit sizes, and volumes, with little evidence of statistical justification. To explore the consequences of this variation, I introduce the Archaeological Sampling Experiment Laboratory (tASEL), an open-source simulation tool developed in NetLogo and archived in the CoMSES model library. tASEL allows archaeologists to construct artifact distributions and test random, systematic, or hybrid sampling frames with immediate visual and statistical feedback. I used tASEL to conduct 22,000 virtual sampling experiments across two artifact distributions: a diffuse random scatter and a highly clustered pattern. Results show that sampling performance varies significantly by distribution, sample size, and frame design. Random strategies produced the highest accuracy and lowest bias. I conclude by demonstrating how tASEL can be used in classroom and field contexts to improve sampling literacy and support more robust archaeological practice.

Resumen

Resumen

El muestreo arqueológico es un aspecto fundamental, aunque aplicado de manera inconsistente, de la metodología de campo. Las estrategias mal diseñadas producen resultados sesgados o irreproducibles, especialmente cuando la recuperación es laboriosa y las expectativas de investigación son altas. Este artículo aborda este problema desde la perspectiva del análisis espacial de microdesechos, utilizando modelos de simulación para evaluar las prácticas actuales y mejorar el diseño, la formación y la planificación del muestreo. Una revisión de 27 estudios publicados sobre microdesechos revela una gran variación en las estrategias de muestreo, los tamaños de las unidades y los volúmenes, con escasa evidencia de fundamentación estadística. Para explorar las consecuencias de esta variación, presento el Archaeological Sampling Experiment Laboratory (tASEL), una herramienta de simulación de código abierto desarrollada en NetLogo y archivada en la biblioteca de modelos CoMSES. tASEL permite a los arqueólogos construir distribuciones de artefactos y probar marcos de muestreo aleatorios, sistemáticos o híbridos con retroalimentación visual y estadística inmediata. Utilicé tASEL para realizar 22 000 experimentos virtuales de muestreo sobre dos distribuciones de artefactos: una dispersión aleatoria difusa y un patrón fuertemente agrupado. Los resultados muestran que el rendimiento del muestreo varía de forma significativa según la distribución, el tamaño de la muestra y el diseño del marco. Las estrategias aleatorias produjeron la mayor exactitud y el menor sesgo. Concluyo demostrando cómo tASEL puede emplearse en contextos de aula y de campo para mejorar la alfabetización en muestreo y apoyar una práctica arqueológica más robusta.

Information

Type
Article
Creative Commons
Creative Common License - CCCreative Common License - BYCreative Common License - NCCreative Common License - SA
This is an Open Access article, distributed under the terms of the Creative Commons Attribution-NonCommercial-ShareAlike licence (http://creativecommons.org/licenses/by-nc-sa/4.0), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the same Creative Commons licence is used to distribute the re-used or adapted article and the original article is properly cited. The written permission of Cambridge University Press or the rights holder(s) must be obtained prior to any commercial use.
Copyright
© The Author(s), 2026. Published by Cambridge University Press on behalf of Society for American Archaeology.
Figure 0

Table 1. Summary of Sampling Methods Reported in 27 Published Microrefuse Studies, 1982–2025.

Figure 1

Table 2. Summary of Sample Gathering Methods Used in the 27 Reviewed Microrefuse Studies.

Figure 2

Table 3. Summary of Sampling Strategies Reported in the 27 Reviewed Microrefuse Studies.

Figure 3

Table 4. Model Entities and Their State Variables in tASEL.

Figure 4

Figure 1. Summary of sampling methods from 27 published microrefuse studies (1982–2021), highlighting the variability and inconsistency in sampling practices. Shown are (A) sample unit areas, (B) sediment volumes collected per sample, and (C) the total number of samples processed.

Figure 5

Figure 2. Workflows supported by tASEL: (A) teaching and training workflow using Manual mode. Students define artifact distributions and apply sampling strategies interactively, observing results visually and statistically to understand bias, recovery, and estimation accuracy; (B) research and design workflow using Iterative mode. Practitioners define realistic artifact distributions and test candidate sampling strategies through automated batch simulations. Both workflows begin with shared stages (Initiate, Design, Simulate), facilitating smooth transitions between instructional and field planning contexts.

Figure 6

Figure 3. Simulated artifact distributions with different sampling scenarios illustrating the effects of sampling strategy and artifact distribution on microrefuse recovery. The left column shows a clustered artifact distribution scenario, and the right column shows a random artifact distribution scenario. Sampling scenarios depicted in each row are full (100%) sampling results (a, b), systematic 7 × 7 grid-based sampling (c, d), and random sampling of 50 units (e, f). Artifacts are shown as black points, and sampled units are represented by colored squares. Negative samples are shown in red and positive samples are shown green or cyan, where green represents a sample with only one artifact present, and cyan represents a sample unit with two or more artifacts present.

Figure 7

Table 5. Experimental Design of Simulation Scenarios Conducted with tASEL.

Figure 8

Figure 4. Density difference distributions for four sampling strategies applied to a diffuse random artifact distribution. Plots show the relative deviation between sampled and true artifact densities. Subplots illustrate (A) 10,000 iterations of random sampling with different sample sizes and locations (n = 1 to 2,500), (B) 10,000 iterations of systematic grid sampling with different sample sizes and origin offsets (n = 1 to 2,500), (C) 1,000 iterations of random sampling with fixed sample size but varying locations (n = 50), and (D) 1,000 iterations of systematic sampling with a 7 × 7 grid, but variable origin offset (n = 49 to 56). Vertical dashed lines indicate the theoretical point where the mean of the sampled distribution is the same as the real distribution; black curves represent fitted normal distributions for comparison.

Figure 9

Figure 5. Density difference distributions for four sampling strategies applied to a clustered artifact distribution. Plots show the relative deviation between sampled and true artifact densities. Subplots illustrate (A) 10,000 iterations of random sampling with different sample sizes and locations (n = 1 to 2,500), (B) 10,000 iterations of systematic grid sampling with different sample sizes and origin offsets (n = 1 to 2,500), (C) 1,000 iterations of random sampling with fixed sample size but varying locations (n = 50), and (D) 1,000 iterations of systematic sampling with a 7 × 7 grid, but variable origin offset (n = 49, 53, or 65). Vertical dashed lines indicate the theoretical point where the mean of the sampled distribution is the same as the real distribution; black curves represent fitted normal distributions for comparison.

Figure 10

Figure 6. Absolute mean density difference plotted across binned sample-size intervals for the four full-sweep experiments (EXP-01, EXP-02, EXP-05, EXP-06). Each curve shows the mean absolute density difference within each sample-size bin derived from 10,000 simulation iterations. Lines connect bin-centered values, and vertical gridlines mark bin boundaries. Missing points (most visibly in EXP-06) occur where no simulation runs produced sample sizes falling within those bins.

Figure 11

Table 6. Summary of Results from Simulation Experiments EXP-01 to EXP-08.

Supplementary material: File

Ullah supplementary material 1

Supplementary Material 1. Introductory Sampling Strategies Lab Assignment.
Download Ullah supplementary material 1(File)
File 14.5 KB
Supplementary material: File

Ullah supplementary material 2

Supplementary Text 2. Advanced Sampling Strategies Lab Assignment.
Download Ullah supplementary material 2(File)
File 24.4 KB
Supplementary material: File

Ullah supplementary material 3

Supplementary Text 3. tASEL Field Planning Workbook.
Download Ullah supplementary material 3(File)
File 26.4 KB