Hostname: page-component-89b8bd64d-4ws75 Total loading time: 0 Render date: 2026-05-13T21:16:17.390Z Has data issue: false hasContentIssue false

Statistical signals of copying are robust to time- and space-averaging

Published online by Cambridge University Press:  13 April 2023

Mason Youngblood*
Affiliation:
Minds and Traditions Group, Max Planck Institute for Geoanthropology, Jena, Germany
Helena Miton
Affiliation:
Minds and Traditions Group, Max Planck Institute for Geoanthropology, Jena, Germany Santa Fe Institute, Santa Fe, New Mexico, USA
Olivier Morin
Affiliation:
Minds and Traditions Group, Max Planck Institute for Geoanthropology, Jena, Germany Institut Jean Nicod, ENS, EHESS, PSL University, CNRS, Paris, France
*
*Corresponding author. E-mail: masonyoungblood@gmail.com

Abstract

Cattle brands (ownership marks left on animals) are subject to forces influencing other graphic codes: the copying of constituent parts, pressure for distinctiveness and pressure for complexity. The historical record of cattle brands in some US states is complete owing to legal registration, providing a unique opportunity to assess how sampling processes leading to time- and space-averaging influence our ability to make inferences from limited datasets in fields like archaeology. In this preregistered study, we used a dataset of ~81,000 Kansas cattle brands (1990–2016) to explore two aspects: (1) the relative influence of copying, pressure for distinctiveness and pressure for complexity on the creation and diffusion of brand components; and (2) the effects of time- and space-averaging on statistical signals. By conducting generative inference with an agent-based model, we found that the patterns in our data are consistent with copying and pressure for intermediate complexity. In addition, by comparing mixed and structured datasets, we found that these statistical signals of copying are robust to, and possibly boosted by, time- and space-averaging.

Information

Type
Research Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
Copyright © The Author(s), 2023. Published by Cambridge University Press
Figure 0

Figure 1. Examples of two cattle brands registered in the US state of Kansas in 1884 (https://www.kansasmemory.org/item/309140/). Both brands are composed of two components: on the left a stylised ‘N’ and ‘P’, and on the right a rotated ‘(‘ and ‘S’).

Figure 1

Figure 2. The posterior distributions for the two parameters of the ABM computed with random forest ABC, plotted against the priors (dotted).

Figure 2

Table 1. The median estimates, 95% quantile-based credible intervals, and root mean square log/logit-transformed errors for each parameter. λ controls the shape of the Poisson distribution from which the number of components in new brands is drawn, and C is the exponent to which the components frequencies are raised.

Figure 3

Table 2. The mean estimates and 95% credible intervals for each parameter in the best fitting models for the time-mixed (left) and space-mixed (right) subsets

Figure 4

Figure 3. The accuracy of the shuffling model's predictions when applied to a structured (left panel), time-mixed and space-mixed (right panel) version of Y-SE-2, the dataset with the highest sample size. Each point above the diagonal is a unique combination of components, and components on both axes are ranked by their commonness in the entire dataset. Green is a true positive (exists and S ≥ 1), blue is a true negative (does not exist and S < 1), orange is a false positive (does not exist and S ≥ 1), and yellow is a false negative (exists and S < 1).

Figure 5

Table 3. The mean estimates and 95% credible intervals for each parameter in the best fitting model.

Figure 6

Figure 4. (a) A basic simulation of cultural transmission over three timesteps, with a fully-connected population of 30 agents transmitting two variants (A and B) with a conformity bias (C = 2). A and B are equally represented in the first timestep. The bar plot in the top right shows the frequencies of A and B in t3, and the bar plot in the top left shows the frequencies of A and B in a time-mixed subsample of the same size from t1 to t3. The dashed lines show the expected frequencies under random copying. (b) The simulated component frequency distributions from the cattle brand ABM under several conditions (five iterations each). Orange and blue are the frequency distributions from 2016 when C = 0 (frequency information does not matter) and C = 10 (extreme conformity), respectively. Yellow and green are the frequency distributions from time-mixed subsamples of the same size collected across all years when C = 0 and C = 10, respectively. The black dashed line is the frequency distribution expected when C = 1 (random copying).

Supplementary material: PDF

Youngblood et al. supplementary material

Youngblood et al. supplementary material

Download Youngblood et al. supplementary material(PDF)
PDF 1.2 MB