Hostname: page-component-89b8bd64d-rbxfs Total loading time: 0 Render date: 2026-05-08T12:35:01.713Z Has data issue: false hasContentIssue false

Artificial Intelligence and the Interpretation of the Past

Published online by Cambridge University Press:  18 December 2025

Matthew Magnani*
Affiliation:
Department of Anthropology and the Climate Change Institute, University of Maine, Orono, ME, USA
Jon Clindaniel
Affiliation:
Chicago Center for Computational Social Science, University of Chicago, Chicago, IL, USA
*
Corresponding author: Matthew Magnani; Email: matthew.magnani@maine.edu
Rights & Permissions [Opens in a new window]

Abstract

Artificial intelligence is reshaping the contemporary world. Trickling deeper into archaeology and history, these technological changes will influence how the past is written about and visualized. Through the evaluation of text and images generated using AI, this article considers the systemic biases present in reconstructed archaeological scenes. We draw on advances in computer science, running large-scale, computational analyses to evaluate patterns in content. We present a case study examining Neanderthal behavior, juxtaposing published archaeological knowledge with images and text made using AI. Our study reveals a low correspondence between scientific literature and artificially intelligent material, which reflects dated knowledge and cultural anachronisms. Used to identify patterns in (mis)representations of the past, the methodology can be applied to understand the distance between scholarly knowledge and any domain of content generated using AI, across any archaeological time depth and beyond the discipline.

Resumen

Resumen

La inteligencia artificial está transformando el mundo contemporáneo. Al expandirse hacia la arqueología y la historia, estos cambios tecnológicos influirán en la manera en que se escribe y se visualiza el pasado. Mediante la creación de textos e imágenes, generados por inteligencia artificial, este artículo examina los sesgos sistémicos presentes en la reconstrucción de escenas arqueológicas. Desarrollamos enfoques cada vez más frecuentes en la arqueología que se basan en avances de las ciencias computacionales, realizando análisis computacionales a gran escala para evaluar patrones en textos e imágenes. Presentamos un estudio de caso sobre el comportamiento neandertal, en el que contrastamos el conocimiento arqueológico publicado con imágenes y textos generados por IA. Nuestro estudio revela una baja correspondencia entre la literatura científica y los contenidos producidos mediante inteligencia artificial, los cuales reflejan conocimientos desactualizados y anacronismos culturales. Esta metodología, empleada para identificar patrones en las (mal)representaciones del pasado, puede aplicarse para comprender la distancia entre el conocimiento académico y cualquier tipo de contenido generado con IA, en cualquier profundidad temporal arqueológica e incluso más allá de la disciplina.

Information

Type
Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2025. Published by Cambridge University Press on behalf of Society for American Archaeology.
Figure 0

Figure 1. Images closest to average embedding from the four different prompts; clockwise from the top with prompt revision, with prompt revision (expert), no prompt revision (expert), and no prompt revision.

Figure 1

Figure 2. Availability of “Neanderthal” article content type by year in the collected Constellate dataset.

Figure 2

Figure 3. Clusters of scholarly abstracts identified by HDBSCAN and projected into two dimensions by UMAP. Abstracts that could not be assigned to a cluster are denoted with the color gray. Note that the average embeddings for AI-generated content are all presented via larger circles and dark outlines (they are all identified as belonging to Cluster 0).

Figure 3

Figure 4. Subclusters of scholarly abstracts in cluster 0, as identified by HDBSCAN using a leaf-based cluster selection method and projected into two dimensions using UMAP. AI-generated text embeddings have been superimposed according to their predicted cluster membership. Note that the average embeddings for AI-generated content are all presented via larger circles, colored by their subcluster membership and outlined in black. Embeddings that could not be assigned to a cluster are denoted with the color gray.

Figure 4

Figure 5. Subclusters of scholarly abstracts in cluster 0, as identified by HDBSCAN using a leaf-based cluster selection method and projected into two dimensions using UMAP. AI-generated image embeddings have been superimposed according to their predicted cluster membership. Note that the average embeddings for AI-generated content are all presented via larger circles, colored by their subcluster membership and outlined in black. Embeddings that could not be assigned to a cluster are denoted with the color gray.

Figure 5

Table 1. AI-Generated Content Type by Closest Average Year and the Most Salient Terms in Scholarly Abstracts from the Era (±5 Years).

Figure 6

Figure 6. Boxplot of the year each scholarly article in an identified content cluster was published, with the average closest year to AI-generated images and text, based on semantic similarity (Table 1). We chose the most recent average year for each AI model from Table 1 to present in this figure.