Hostname: page-component-5db58dd55d-8mwbx Total loading time: 0 Render date: 2026-06-01T14:32:55.202Z Has data issue: false hasContentIssue false

A Standard for the Scholarly Citation of Archaeological Data as an Incentive to Data Sharing

Published online by Cambridge University Press:  05 April 2018

Ben Marwick*
Affiliation:
University of Washington, Seattle, WA 98115, USA
Suzanne E. Pilaar Birch
Affiliation:
University of Georgia, Athens, GA 30602, USA
*
(bmarwick@uw.edu, corresponding author)
Rights & Permissions [Opens in a new window]

Abstract

How do archaeologists share their research data, if at all? We review what data are, according to current influential definitions, and previous work on the benefits, costs, and norms of data sharing in the sciences broadly. To understand data sharing in archaeology, we present the results of three pilot studies: requests for data by e-mail, review of data availability in published articles, and analysis of archaeological datasets deposited in repositories. We find that archaeologists are often willing to share but that discipline-wide sharing is patchy and ad hoc. Legislation and mandates are effective at increasing data sharing, but editorial policies at journals lack adequate enforcement. Although most of the data available at repositories are licensed to enable flexible reuse, only a small proportion of the data are stored in structured formats for easy reuse. We present some suggestions for improving the state of date sharing in archaeology; among these is a standard for citing datasets to ensure that researchers making their data publicly available receive appropriate credit.

¿Cómo comparten los arqueólogos sus datos de investigación, si lo hacen? Revisamos qué datos son, de acuerdo con las actuales definiciones influyentes, y el trabajo previo sobre los beneficios, costos y estándares de intercambio de datos en las ciencias en general. Para comprender el intercambio de datos en arqueología, presentamos los resultados de tres estudios piloto: solicitudes de datos por correo electrónico; revisión de la disponibilidad de datos en artículos publicados y análisis de conjuntos de datos arqueológicos depositados en repositorios. Encontramos que los arqueólogos están a menudo dispuestos a compartir, pero el intercambio de toda la disciplina es desigual y ad hoc. La legislación y los mandatos son eficaces para aumentar el intercambio de datos, pero las políticas editoriales de las revistas carecen de aplicación adecuada. Aunque la mayoría de los datos disponibles en repositorios están autorizados para permitir la reutilización flexible, sólo una pequeña proporción de los datos se almacena en formatos estructurados para su fácil reutilización. Presentamos algunas sugerencias para mejorar el estado de la fecha de intercambio en arqueología, incluyendo un estándar para citar conjuntos de datos para asegurar que los investigadores que hacen sus datos disponibles públicamente reciben crédito apropiado.

Information

Type
SPECIAL SECTION: DIGITAL DATA REUSE IN ARCHAEOLOGY
Copyright
Copyright 2018 © Society for American Archaeology 
Figure 0

TABLE 1. Selection of Definitions of Research Data.

Figure 1

TABLE 2. Selection of Types of Data Encountered by Archaeologists.

Figure 2

TABLE 3. Summary of Public and Community Costs of Data Sharing.

Figure 3

TABLE 4. Summary of Public and Community Benefits of Data Sharing.

Figure 4

TABLE 5. Summary of Individual Benefits to Data Sharing.

Figure 5

TABLE 6. Summary of Stodden's “Reproducible Research Standard.”

Figure 6

TABLE 7. Summary of a Selection of Previous Studies of Data Sharing in Various Fields.

Figure 7

FIGURE 1. Summary of results of e-mail requests for data: (a) responses by year of journal article publication; (b) student status of first author; (c) country location of first author; (d) journal. AAS = Archaeological and Anthropological Sciences; AQ = American Antiquity; ARCHMTRY = Archaeometry; JAS = Journal of Archaeological Science; JFA = Journal of Field Archaeology; JHE = Journal of Human Evolution; LT = Lithic Technology; PLOSONE = PLOS ONE; SCI = Science.

Figure 8

FIGURE 2. Summary of relationships among file types, data types, and data locations: (a) file types and data locations; (b) data types and data locations.

Figure 9

FIGURE 3. Distributions of data centers, languages, and licenses (see text for details) among archaeology datasets known to DataCite. Horizontal axis is the number of digital object identifiers. DANS = Data Archiving and Networked Services; en = English; nl = Dutch; swe = Swedish; de = German; it = Italian.

Figure 10

FIGURE 4. Change over time in archaeology datasets known to DataCite. (a) Counts of datasets per year by the major data centers; red lines indicate major increases in the number of datasets in repositories. Note that the date on the horizontal axis indicates when the data were collected, not when they were deposited in the data center. (b) Temporal trends in the number of digital object identifiers (DOIs) at individual data centers. DANS = Data Archiving and Networked Services.

Figure 11

FIGURE 5. Summary of licenses used by data centers with archaeological data. ADS = Archaeology Data Service; DANS = Data Archiving and Networked Services.

Figure 12

FIGURE 6. Relationships between data centers and file formats. The 10 most frequently found file formats are shown, in the data centers with these files from the top 10 data centers by number of digital object identifiers. Open file formats for structured data, such as CSV, are not common. DANS = Data Archiving and Networked Services.

Figure 13

FIGURE 7. Citation of archaeological data digital object identifiers tracked by the Crossref Event Data service. Inset shows the distribution of these citations over time. DANS = Data Archiving and Networked Services, GBIF = Global Biodiversity Information Facility.

Figure 14

TABLE 8. Nine Key Things to Do before Sharing Your Data.