Hostname: page-component-5db58dd55d-f6s65 Total loading time: 0 Render date: 2026-06-02T02:27:14.054Z Has data issue: false hasContentIssue false

When Computers Dream of Charcoal

Using Deep Learning, Open Tools, and Open Data to Identify Relict Charcoal Hearths in and around State Game Lands in Pennsylvania

Published online by Cambridge University Press:  31 August 2021

Benjamin P. Carter*
Affiliation:
Department of Sociology and Anthropology, Muhlenberg College, 2400 Chew Street, Allentown, PA 18045, USA
Jeff H. Blackadar
Affiliation:
Department of History, Carleton University, 400 Patterson Hall, 1125 Colonel By Drive, Ottowa, Ontario, K1S 5B6, Canada
Weston L. A. Conner
Affiliation:
Office of Admissions, Lehigh University, 27 Memorial Drive West, Bethlehem, PA 18015, USA
*
(bcarter@muhlenberg.edu, corresponding author)
Rights & Permissions [Opens in a new window]

Abstract

This research employs machine learning (Mask Region-Based Convolutional Neural Networks [Mask R-CNN]) and cluster analysis (Density-based spatial clustering of applications with noise [DBSCAN]) to identify more than 20,000 relict charcoal hearths (RCHs) organized in large “fields” within and around State Game Lands (SGLs) in Pennsylvania. This research has two important threads that we hope will advance the archaeological study of landscapes. The first is the significant historical impact of charcoal production, a poorly understood industry of the late eighteenth to early twentieth century, on the historic and present landscape of the United States. Although this research focuses on charcoal production in Pennsylvania, it has broad application for both identifying and contextualizing historical charcoal production throughout the world and for better understanding modern charcoal production. The second thread is the use of open data, open source, and open access tools to conduct this analysis, as well as the open publication of the resultant data. Not only does this research demonstrate the significance of open access tools and data but the open publication of our code as well as our data allow others to replicate our work, to tweak our code and protocols for their own work, and reuse our results.

Esta investigación emplea el aprendizaje automatizado (Redes Neuronales Convolucionales basadas en Regiones “Máscara” [Mask R-CNN; en sus siglas en inglés]) y el análisis de agrupamientos o clústers (Agrupamiento Espacial Basado en Densidad de Aplicaciones con Ruido [DBSCAN; en sus siglas en inglés]), para identificar más de 20,000 áreas de combustión de hornos de producción de carbón (RCHs; en sus siglas inglés), dispuestos en “campos” amplios dentro y alrededor de Campos de Caza Estatales (SGLs; en sus siglas inglés), en Pensilvania. Esta investigación tiene dos importantes desafíos que esperamos que desarrollará el estudio de los paisajes en arqueología. El primero es el impacto histórico significativo de la producción de carbón, una industria poco entendida de la época temprana del S. XVIII e inicios del S. XIX, del paisaje histórico y actual de Estados Unidos. No obstante, esta investigación se centra alrededor de la producción de carbón en Pensilvania, tiene una aplicación amplia para la identificación y contextualización de la producción de carbón histórica en todo el mundo y para lograr un mejor entendimiento de la producción moderna de carbón. El segundo desafío es el uso de las herramientas de datos libres, fuentes libres y accesos libres para llevar a cabo este análisis, así como la publicación libre del dato resultante. Esta investigación no solamente demuestra el significado de las herramientas y los datos libres, sino que además la publicación libre de nuestro código, así como nuestros datos, permitirá a otros replicar nuestro trabajo, refinar nuestro código y protocolos para su propia investigación, así como reusar nuestros resultados.

Information

Type
Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution, and reproduction in any medium, provided the original work is properly cited.
Open Practices
Open materials
Copyright
Copyright © The Author(s), 2021. Published by Cambridge University Press on behalf of Society for American Archaeology
Figure 0

FIGURE 1. (a) Two colliers removing charcoal from the meiler. The underlying flat, circular hearth can be seen excavated into the slope; Wayne National Forest, Ohio, May 1942 (National Archives, Record Group 95: Records of the Forest Service, 1870–2008, National Archives Identifier: 2129419; Local Identifier: 419985). (b) A simplified cross section of a meiler ready to be ignited (image by Jeff Blackadar).

Figure 1

FIGURE 2. Map indicating all Pennsylvania State Game Lands showing those included in training the model and the lidar tiles used for this study (image by Benjamin Carter and Weston Conner).

Figure 2

FIGURE 3. Comparison of relict charcoal hearths on level and sloped landscape in slope analysis, hillshade, and using the Profile Tool plugin for QGIS (image by Benjamin Carter).

Figure 3

FIGURE 4. Slope analysis of a portion of SGL 33 with manually recognized relict charcoal hearths (RCHs) in white numbered squares and Mask R-CNN–identified RCHs in light blue squares. RCHs with both blue and white were recognized by both methods. The number above the square is the confidence score (image by Benjamin Carter).

Figure 4

FIGURE 5. A close-up of six manually recognized RCHs shown in Figure 4 demonstrating similarities and variation (image courtesy of Jeff Blackadar).

Figure 5

TABLE 1. Manually Recognized True Positives at a Selected Set of SGLs by Confidence Bin.

Figure 6

FIGURE 6. Map showing predicted RCHs and their inclusion in the three DBSCAN cluster analyses compared to a manual review that identified true positives in and around SGL 43. Base map is the hillshade created using methods described herein. See Figure 7 for preserved land labels (image by Benjamin Carter).

Figure 7

FIGURE 7. Map showing true positives and false negatives for RCHs in and around SGL 43. Note that both are concentrated in preserved areas. Base map is the hillshade create using methods described herein (image by Benjamin Carter).

Figure 8

TABLE 2. DBSCAN Clusters of Mask R-CNN–Identified RCHs Compared to Manual Recognition at SGL 43.

Figure 9

FIGURE 8. Map showing a heat map of predicted RCHs included in all three DBSCAN analyses compared to tiles included in the sample. Darker red indicates lower density, and lighter red indicates higher density (image by Benjamin Carter).