Hostname: page-component-89b8bd64d-ksp62 Total loading time: 0 Render date: 2026-05-07T12:34:03.659Z Has data issue: false hasContentIssue false

Using Simulated Training Data to Locate Archaeological Sites with Machine Learning

Published online by Cambridge University Press:  27 April 2026

Katherine Peck*
Affiliation:
Department of Anthropology, University of New Mexico, Albuquerque, NM, USA Cultural Resource Sciences Program, New Mexico Consortium, Los Alamos, NM, USA
Claudine Gravel-Miguel
Affiliation:
Department of Anthropology, University of New Mexico, Albuquerque, NM, USA Cultural Resource Sciences Program, New Mexico Consortium, Los Alamos, NM, USA
Grant Snitker
Affiliation:
Cultural Resource Sciences Program, New Mexico Consortium, Los Alamos, NM, USA School of Social Sciences, Utah State University, Logan, UT, USA
Matthew Helmer
Affiliation:
Department of Geography and Anthropology, Louisiana State University, Baton Rouge, LA, USA
*
Corresponding author: Katherine Peck; Email: kmpeck@unm.edu
Rights & Permissions [Opens in a new window]

Abstract

Archaeologists have demonstrated the value of deep learning models for detecting archaeological objects in lidar data. As landscape-level projects become the norm, archaeological data derived from deep learning predictions can be integrated into these initiatives through coupled natural-cultural landscapes planning. However, the paucity of archaeological training datasets limits the application of deep learning models to relatively common and well-documented object classes. Using procedurally generated training datasets may be one approach to overcome this bottleneck. To test the efficacy of procedural generation for developing deep learning training data, we trained models to detect a novel object class (hypothesized historic tar kilns) in the Kisatchie National Forest in Louisiana. We developed two procedural generation approaches to embed simulated archaeological objects in a lidar-derived DEM and used these datasets to train deep learning (Mask R-CNN) models. We then evaluated model predictions within lidar-derived visualizations and during field survey. Our trained models detected targets with high recall but low precision. Field investigation suggested that the objects were not tar kilns but a different historic feature class. This study suggests that models trained on simulated objects are a useful addition to lidar analysis tool kits and can be directly integrated into archaeological field investigation workflows.

Resumen

Resumen

La arqueología ha demostrado el valor de los modelos de aprendizaje profundo (“deep learning” en inglés) para detectar restos arqueológicos en datos lidar. A medida que los proyectos a nivel paisajístico se generalizan en la praxis arqueológica, los datos derivados de las predicciones del aprendizaje profundo pueden integrarse en estas iniciativas mediante la planificación conjunta de paisajes naturales y culturales. Sin embargo, la escasez de conjuntos de datos de entrenamiento específicamente arqueológico limita la aplicación de los modelos de aprendizaje profundo a clases de objetos relativamente comunes y/o bien documentados. El uso de conjuntos de datos de entrenamiento generados procedimentalmente puede ser un enfoque para superar este cuello de botella. Para probar la eficacia de la generación por procedimientos para desarrollar datos de entrenamiento de aprendizaje profundo, hemos entrenado modelos para detectar una nueva clase de objetos (hipotéticos hornos de brea de época histórica) en el Kisatchie National Forest, Louisiana. Desarrollamos dos enfoques de generación procedimental para incrustar objetos arqueológicos simulados en un DEM derivado de lidar y utilizamos estos conjuntos de datos para entrenar modelos de aprendizaje profundo (mask region-based convolutional neural network; Mask R-CNN). A continuación, evaluamos las predicciones del modelo dentro de las visualizaciones derivadas de lidar y durante el estudio de campo. Nuestros modelos entrenados detectaron objetivos con un alto nivel de recuperación, pero con baja precisión. La investigación de campo sugirió que los objetos no eran hornos de brea, sino una clase de características históricas diferente. Este estudio sugiere que los modelos entrenados con objetos simulados son un complemento útil a los kits de herramientas de análisis lidar y pueden integrarse directamente en los flujos de trabajo de investigación arqueológica de campo.

Information

Type
Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Open Practices
Open materials
Copyright
© The Author(s), 2026. Published by Cambridge University Press on behalf of Society for American Archaeology.
Figure 0

Figure 1. Idealized DL workflow with historic foundations as an example object.

Figure 1

Figure 2. Map of KNF ranger districts, Louisiana (Basemaps: Open Street Map, ESRI).

Figure 2

Figure 3. (a) Typical tar kiln cross section (based on Combes 1974); (b) archaeological tar kiln (Snitker et al. 2022).

Figure 3

Figure 4. (Top) Unknown archaeological objects in the KNF, Louisiana, surrounded by mima mounds; (bottom) tar kilns in the FMNF, South Carolina.

Figure 4

Figure 5. (Left) Training data for Methods 1 and 2 in the KNF; (right) training data for Method 3, FMNF (Basemap: Open Street Map).

Figure 5

Figure 6. (a) Simulated object placement exclusion model on a 256 × 256-pixel tile (black areas represent suitable object placement locations); (b) simulated object placement in a 256 × 256-pixel tile; objects too close together are dropped; (c) collection pit placement rules; (d) the script generates a tar kiln perimeter (II) around each placed point (I) as a circular buffer, modifies it with random noise (III), and then generates the tar kiln interior (IV) and collection pit (V).

Figure 6

Figure 7. (Top) Different iterations of procedurally generated targets on the same 256 × 256-pixel tile; (bottom) their associated generated annotation masks.

Figure 7

Table 1. Final Training Dataset Metrics for Methods 1 and 2.

Figure 8

Figure 8. (a) SLRM (top, left) from the FMNF and inverted SLRM (top, right) with shared scale and legend, featuring a “standard” southeastern tar kiln at center. Also shown is the elevation profile of a tar kiln in SLRM (bottom left) and SLRM after DEM inversion (bottom right); (b) SLRM (top) and elevation profile (bottom) showing unmodified possible tar kilns from the KNF.

Figure 9

Figure 9. Perpendicular profiles from a single target (left) and mima mound (right) before (top) and after (bottom) smoothing.

Figure 10

Figure 10. (Top) New possible targets located by all methods; (bottom) the two most common false-positive categories across each method. All images share the same scale.

Figure 11

Figure 11. Howitzer emplacement (illustration modified from Figure 38 in Corps of Engineers Field Fortifications manual FM 5-15; United States War Department 1944:79).

Figure 12

Table 2. Object-by-Object Metrics for All Models When Applied to the Testing Area.

Supplementary material: File

Peck et al. supplementary material 1

Supplementary Material 1. Glossary of Deep Learning Terms (text and table).
Download Peck et al. supplementary material 1(File)
File 20.2 KB
Supplementary material: File

Peck et al. supplementary material 2

Supplementary Material 2. Target Measurements (text and table).
Download Peck et al. supplementary material 2(File)
File 16.8 KB
Supplementary material: File

Peck et al. supplementary material 3

Supplementary Material 3. Method 1–3 Parameters (text).
Download Peck et al. supplementary material 3(File)
File 15.6 KB
Supplementary material: File

Peck et al. supplementary material 4

Supplementary Material 4. Systematic Review of Model Predictions (text, table, and figures).
Download Peck et al. supplementary material 4(File)
File 2.4 MB