Rescaling radiocarbon data: A method for addressing inter-site sampling heterogeneity in reconstructing population history

Jiyoung Park; Sejin Kim; Taechang Jo; Jangsuk Kim

doi:10.1017/RDC.2026.10203

Published online by Cambridge University Press: 24 March 2026

Jiyoung Park: Department of Archaeology and Art History, Seoul National University, Republic of Korea
Sejin Kim: Department of Archaeology and Art History, Seoul National University, Republic of Korea
Taechang Jo: Department of Mathematics, Inha University, Republic of Korea
Jangsuk Kim*: Department of Archaeology and Art History, Seoul National University, Republic of Korea
*Corresponding author: Jangsuk Kim; Email: jangsuk@snu.ac.kr

Abstract

Radiocarbon dates have become a cornerstone in archaeological reconstructions of past population dynamics. The increasing reliance on large-scale radiocarbon databases, usually aggregated from diverse sources, however, raises persistent concerns about sampling bias, especially heterogeneous sampling intensity across sites. In this paper, we introduce a rescaling method that adjusts the frequency of dates in radiocarbon datasets in proportion to dwelling counts at the settlement level, using weighting and bootstrap resampling. Through a series of simulations, we show that this approach consistently yields probability distributions that more closely reflect hypothetical population trends, particularly in contexts with high inter-settlement variability in sampling intensity. We apply our method to archaeological data from two areas in Korea, the Yeongsan and Geum River Basins, during the Proto–Three Kingdoms (1C BC–AD 3C) and Three Kingdoms Periods (AD 4–7C). Results demonstrate that rescaled datasets offer significantly different interpretations of population organization and reconfiguration than those derived from original data. This study highlights the importance of addressing sampling heterogeneity in local-scale demographic research and suggests that rescaling is a valuable complement to existing bias-correction strategies in archaeological studies of demography.

Keywords

bootstrap resampling; heterogeneous sampling intensity; Korea; legacy data; population; radiocarbon dates; weighting

Information

Type: Research Article
Information: Radiocarbon, First View, pp. 1–26

DOI: https://doi.org/10.1017/RDC.2026.10203
Creative Commons: This is an Open Access article, distributed under the terms of the Creative Commons Attribution-NonCommercial-NoDerivatives licence (https://creativecommons.org/licenses/by-nc-nd/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided that no alterations are made and the original article is properly cited. The written permission of Cambridge University Press or the rights holder(s) must be obtained prior to any commercial use and/or adaptation of the article.
Copyright: © The Author(s), 2026. Published by Cambridge University Press on behalf of University of Arizona

1. Introduction

Over the past two decades, the use of radiocarbon date frequencies to infer population history, referred to as the “dates-as-data” approach, has significantly reshaped archaeological studies. Prior to the adoption of this framework, demographic interpretations often rested on a priori assumptions drawn from population growth models such as logistic or Malthusian trajectories (e.g., Binford and Binford 1969; Boserup 1965; Cohen 1977; Hassan 1981; Redman 1978). Without empirical tools to reconstruct population histories, such studies tended to assume, rather than interrogate, demographic change over time. Although the core idea can be traced back to Rick (1987), the development and application of summed probability distributions (SPDs) catalyzed its integration into archaeology, prompting a broader rethinking of population dynamics. Conventional demographic models are increasingly being challenged, as studies employing this approach have documented asynchronous fluctuations, abrupt demographic collapses, and regionally divergent, complicated trajectories (Shennan and Edinborough 2007; Shennan et al. 2013). The approach has also encouraged a departure from earlier reliance on typological sequences and the assumed durations of pottery phases (Bevan and Crema 2021). Its extensive use has been attributed largely to the construction of large-scale radiocarbon databases across various regions, including AustArch (Williams et al. 2014), CARD (Gajewski et al. 2011), EUROEVOL (Manning and Timpson 2014), NERD (Palmisano et al. 2022), RADON (Hinz et al. 2012), SCAR (Gayo et al. 2015), p3k14c (Bird et al. 2022), and MesoRad, as well as databases from Korea (Hwang 2021; Kim and Seong 2022; Oh et al. 2017; Park et al. 2017; Seong and Kim 2022; Wright et al. 2020) and Japan (Kudo 2018; Kudo et al. 2023).

However, the dates-as-data approach and the application of SPDs have generated a range of conceptual and methodological challenges. Price et al. (2020) categorize these issues into two groups: the “bias problem” and the “summary problem.” The bias problem was particularly emphasized in early critiques (e.g., Brown 2015; Michczyńska and Pazdur 2004; Surovell et al. 2009; Williams 2012), which questioned the core assumption of this method: the presumed correspondence between the frequency of dates and demographic trends. While recognizing the potential of the approach in general, they pointed out that the frequency of radiocarbon dates is critically affected by past human behaviors, taphonomic processes, differential preservation, recovery biases, and uneven sampling strategies.

More recent discussions have tended to shift toward “summary problems” (e.g., Bamforth and Grund 2012; Bronk Ramsey 2017; Brown 2015; Carleton 2021; Carleton and Groucutt 2021; Contreras and Meadows 2014; Heaton 2022; Heaton et al. 2025; Kerr and McCormick 2014; Timpson et al. 2014, 2021). Despite their varied emphases, these studies generally center on two key issues: (1) the conceptual validity of SPDs as representations of population trends, since SPDs, the most widely employed tool for summarizing radiocarbon datasets, are only sums of the measurement uncertainties of discrete individual radiocarbon dates and therefore risk conflating statistical noise with empirical signals of genuine demographic change (Carleton 2021; Heaton 2022); and (2) the lack of well-defined inferential models and the inherent difficulty of interpreting SPD curves in a statistically formal way (Crema and Shoda 2021; Timpson et al. 2021). In response, numerous studies have sought to improve the inferential foundations of analyzing date frequencies or to develop alternative methods. As Crema (2022) reviewed, these efforts include embedding the approach within null hypothesis testing, parametric model fitting, and other formal statistical procedures (Brown 2017; Crema and Shoda 2021; DiNapoli et al. 2021; Price et al. 2020; Timpson et al. 2021). As radiocarbon datasets continue to grow in scale and computational tools improve, the refinement of these methods represents a welcome development.

Yet, over the last decade, the bias problem has received relatively little attention compared to the intense efforts devoted to the summary problem. This disparity warrants attention, particularly as contemporary demographic research relies ever more heavily on large-scale radiocarbon databases. Although these databases have greatly facilitated archaeological research on population history, significant questions remain regarding the representativeness of the data they contain, because the data are usually compiled from independently conducted investigations with varying objectives and methodologies. One of the most persistent challenges is heterogeneity in sampling intensity (e.g., Crema 2022), arising from disparities in research scope and interests, field methodologies, preservation conditions, and funding (e.g., Becerra-Valdivia et al. 2020; Davies et al. 2016; Downey et al. 2014; Gayo et al. 2015; Kerr and McCormick 2014; Rhode et al. 2014; Shennan et al. 2013; Surovell and Brantingham 2007; Timpson et al. 2014). Although binning strategies have been proposed to address some of these issues (e.g., Bevan et al. 2017; Codding et al. 2024; Crema and Bevan 2021; Shennan et al. 2013; Timpson et al. 2014, 2021), correcting for sampling biases embedded in legacy datasets remains an arduous task. Recent innovations in statistical modeling and hypothesis testing may offer limited inferential utility unless such biases are systematically accounted for.

This study addresses the issue of sampling heterogeneity across settlements/projects and proposes a method for adjusting inter-settlement variation—rescaling the frequency of radiocarbon dates per settlement using dwelling counts. Through simulations of hypothetical populations under diverse conditions, we generate random samples and apply rescaling via mathematical weighting and bootstrap resampling. We then compare the simulated populations, unadjusted samples, and rescaled datasets to assess how rescaling alters the resulting distributions and whether it offers a closer mathematical alignment with the known simulation inputs. Finally, we apply our method to a radiocarbon dataset from Korea to demonstrate how interpretations of demographic dynamics are significantly affected by whether or not rescaling is applied.

To summarize simulated data and visualize their distributions, this study employs SPDs while fully acknowledging their conceptual and methodological caveats (e.g., Bamforth and Grund 2012; Bronk Ramsey 2017; Brown 2015; Carleton 2021; Carleton and Groucutt 2021; Contreras and Meadows 2014; Heaton 2022; Heaton et al. 2025; Kerr and McCormick 2014; Timpson et al. 2014, 2021). Earlier studies sometimes treated SPD curves as direct reflections of demographic history and occasionally overinterpreted fluctuations without adequately accounting for uncertainties in the calibration process or biases in the underlying data. Today, few archaeologists approach SPDs in such a simplistic manner. In archaeological practice, SPDs remain among the most accessible and widely used tools for visually summarizing radiocarbon datasets to explore long-term population trends. We suggest that the value of SPDs lies not in offering exact reconstructions of prehistoric population dynamics, but in serving as visual and heuristic devices for generating hypotheses and guiding further archaeological inquiry. In this study, we do not use SPDs to draw archaeological conclusions; instead, we use them to compare various sample sets and evaluate divergences between hypothetical populations, random sample sets, and adjusted datasets.

2. Sampling biases and the scale of archaeological research of population

Archaeological studies of demographic dynamics using radiocarbon dates span a wide range of spatial and temporal scales, depending on the research question at hand: from highly localized inquiries to continental-scale reconstructions, and from timeframes of several centuries to tens of millennia. Large-scale studies have addressed long-term population fluctuations and continental processes such as the spread of farming (Aurenche et al. 2001; Cortell-Nicolau et al. 2025; Crema et al. 2022; Downey et al. 2014; Oh et al. 2017; Shennan 2008; Shennan et al. 2013; Timpson et al. 2014; Vander Linden and Silva 2020) as well as supraregional reorganizations of hunter-gatherer populations (Blockley et al. 2000; Bocquet-Appel et al. 2005; Codding et al. 2024; Crema et al. 2016; Freeman et al. 2021; French 2015; Jørgensen et al. 2022; Kim and Seong 2022; Kuzmin and Keates 2005; Schmidt et al. 2025; Schmidt and Zimmermann 2019; Seong and Kim 2022; Tallavaara et al. 2010). At finer spatial scales, the dates-as-data approach has also proven useful in analyzing local-scale demographic dynamics, including aggregation and dispersion or the emergence and relocation of centers (Birch 2012; Crema 2013; Manning et al. 2021; Park et al. 2017; Popescu et al. 2023; Ritchie et al. 2024; Ritchison 2020; Ritchison et al. 2025; Wright et al. 2020). Although sampling bias is a universal concern across all scales of research, the spatiotemporal scale and explanatory scope of a study inevitably influence how datasets are curated and how sampling biases, especially those related to heterogeneous sampling intensity, are addressed.

Among the most widely adopted strategies to control for sampling heterogeneity is the binning method. Introduced over a decade ago (Shennan et al. 2013), binning has become a standard technique in demographic reconstructions at multiple scales (e.g., Codding et al. 2024; Crema et al. 2016; Palmisano et al. 2017; Timpson et al. 2014, 2021). The conceptual underpinnings of this method have been critiqued (e.g., Heaton et al. 2025), and its limitations are widely acknowledged (Bevan and Crema 2021; Crema 2022; Crema and Kobayashi 2020; Timpson et al. 2014). Binning first groups radiocarbon dates into spatial or temporal bins of fixed width. Within each bin, the probability distributions of individual dates are summed, and the result is normalized by dividing by the number of dates per bin. These normalized bin-level SPDs are then aggregated to construct the final SPD curve (Crema 2022; Shennan et al. 2013). The equal weighting of bins reduces the impact of heavily dated loci or periods, thereby mitigating overrepresentation (Crema 2022). While especially suited to large-scale studies, binning has also been applied successfully in smaller-scale contexts (e.g., Crema et al. 2016; Crema and Kobayashi 2020; Oh and Conte 2022; Park and Kim 2024). However, when the objective is to reconstruct settlement-level dynamics, particularly at finer scales where variation in site size and the relationships among sites matter, the utility of binning becomes limited. By design, binning is less sensitive to variation in site size or occupation intensity, as it typically involves aggregating and averaging dates within a spatial or temporal bin, thereby equalizing the contribution of each bin regardless of the number of dates it contains.
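The binning procedure as summarized above can be sketched in a few lines. The following Python sketch is illustrative only: it assumes that each date has already been converted to a density vector on a shared time grid, and the function name `binned_spd` and the toy numbers are hypothetical rather than taken from any published implementation.

```python
def binned_spd(bins):
    """bins: list of bins; each bin is a list of per-date density vectors on one shared grid."""
    n_steps = len(bins[0][0])
    total = [0.0] * n_steps
    for bin_dates in bins:
        for t in range(n_steps):
            # sum the dates in this bin, then divide by the number of dates in the bin
            total[t] += sum(d[t] for d in bin_dates) / len(bin_dates)
    # aggregate the bin-level curves and normalize so the result sums to 1
    grand = sum(total)
    return [v / grand for v in total]

# Toy case (hypothetical): three dates in one bin, a single date in another.
heavily_dated = [[1.0, 0.0]] * 3
lightly_dated = [[0.0, 1.0]]
print(binned_spd([heavily_dated, lightly_dated]))  # [0.5, 0.5]
```

A plain normalized sum of the same four dates would give [0.75, 0.25]. Equal weighting of bins removes that 3:1 imbalance, but for the same reason a bin's contribution no longer depends on how large the underlying site is.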

In sedentary societies composed of multiple settlements, reconstructing spatiotemporal demographic dynamics requires first estimating the lifespans of individual settlements. Archaeologists have long wrestled with how to reliably estimate key parameters of settlement lifespans, such as the timing of initial occupation, the duration of habitation, and fluctuations in population size, and with how to integrate these parameters into broader models of regional settlement patterns. One of the most widely used proxies for estimating settlement-level population history has been dwelling count (Duff and Wilshusen 2000; Hassan 1978; Hill 1970; Kirch and Rallu 2007; Kolb 1985; Longacre 1975; Ortman and Coffey 2017; Parton and Clark 2022; Plog 1975; Schacht 1984). Before radiocarbon dating became standard practice, changes in dwelling count were often correlated with ceramic typologies. However, as discussed above and as others have shown (Bevan and Crema 2021; Crema and Kobayashi 2020; Kolář et al. 2016; Petrie and Lynam 2020; Plog and Hantman 1990), such approaches imposed discrete population models and introduced interpretive uncertainties due to inconsistencies in ceramic phase durations. While some studies attempted to overcome these problems by assuming uniform or normal distributions of dwellings over time (Ortman 2016; Plog 1974, 1975; Porčić and Nikolić 2016; Roberts et al. 2012; Schacht 1980, 1984), continuous settlement histories remain difficult to reconstruct using pottery chronologies alone.

With the growing availability of radiocarbon data, we propose that the frequency distribution of dated dwellings within a settlement provides an empirically grounded proxy for inferring the settlement’s lifespan. When all (or a sufficiently representative sample of) dwellings are dated, the temporal distribution of those dates can plausibly reflect the duration and demographic trajectory of the settlement’s occupation. If SPDs are employed to summarize these distributions despite their known limitations (Heaton et al. 2025; Timpson et al. 2021), then settlement-level SPDs may serve as a more informative proxy than conventional estimates based solely on ceramic phase durations. Within this framework, an overall SPD for a study area can be constructed as the normalized sum of individual settlement-level SPDs. Although cultural and behavioral variables (such as variation in dwelling longevity, the presence of multi-household structures, and degrees of residential mobility) introduce interpretive uncertainty, settlement-level SPDs can be an effective approximation for reconstructing local demographic histories and, in aggregate, broader spatiotemporal population patterns.

Nevertheless, as noted, the number of dated dwellings varies considerably across settlements, countries, and research projects. Without proper adjustment, such variation can distort interpretations by leading to over- or underestimation of settlement size or duration. To address this challenge, we propose a method for correcting inter-settlement variation in sampling intensity. Our approach leverages dwelling count data to rescale radiocarbon datasets, allowing for more proportionate representations of relative population sizes and occupational durations across settlements and presenting new insights into past population structure.

3. Methods

To control for heterogeneity in sampling intensity across settlements, we developed a rescaling approach applied at the settlement level. Our framework implements two closely related procedures: settlement-level weighting and bootstrap resampling. The underlying premise is that each settlement possesses a unique occupation span and demographic history, which can be probabilistically approximated if all dwellings are dated. When only a subset of dwellings is dated, rescaling the temporal distribution of those dates can offer a representative estimate of the full settlement lifespan. Applied systematically across all settlements within a study area, this approach mitigates distortions introduced by uneven sampling intensity.

To evaluate the performance of this rescaling framework in reducing bias, we conducted a series of simulations. Hypothetical populations were generated and subjected to random sampling. The random samples were then rescaled using weighting and bootstrap procedures proportional to settlement size. We compared the resulting SPDs of the sampled and rescaled datasets to those of the hypothetical populations, evaluating the extent to which rescaling improves representativeness.

3.1. Generating hypothetical populations and sampling

We begin by generating populations composed of $K$ dwellings, each of which has a radiocarbon date, distributed across $M$ sedentary settlements. Each settlement $S_i$ $(i = 1, \ldots, M)$ contains $k_i$ $(\ge 5)$ dwellings $D_i^{\,j}$ $(j = 1, \ldots, k_i)$, so that $\sum_{i=1}^{M} k_i = K$. In this simulation, $K = 1000$ and $M = 10$, such that each hypothetical population consists of 1,000 dwellings distributed across 10 settlements (Settlement 1, Settlement 2, …, Settlement 10), with each settlement containing at least five dwellings.

The simulated datasets, hereafter referred to as “hypothetical populations,” are generated under a range of conditions involving variations in settlement size and lifespan. To introduce variability, settlement sizes ($k_i$) were drawn from either normal or power-law distributions of dwelling count per settlement, both commonly found in empirical data (Duffy 2015; Fletcher 1986; Johnson 1980). Settlement lifespans were modeled using three distributions: uniform, normal, and skewed (beta distribution, $\alpha = 2$, $\beta = 5$). Each dwelling was assigned an uncalibrated radiocarbon date with a standard deviation of 40 years, in accordance with the given conditions.

We generate hypothetical populations for two timespans: 600 years (2200–1600 BP) and 1500 years (4500–3000 BP). These different intervals were selected to assess the effects of temporal density of dates (see below), and to avoid the calibration uncertainty associated with the “Hallstatt Plateau” (Pearson et al. 1983; Stäuble and Hiller 1997; Wijma et al. 1996), which ranges between circa 2800 and 2400 cal BP. Within each interval, settlement occupation spans (the begin- and end-dates) were randomly assigned: 50–300 years for the 600-year cases and 100–500 years for the 1500-year cases. Combining two settlement-size distributions, three lifespan distributions, and two timespans yielded 12 hypothetical population models used as null references for comparison (Figures 1 and 2 in Supplementary 1).
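The population-generating step described above can be sketched as follows. This is a minimal Python sketch under stated assumptions, not the authors' code: the way raw draws are converted into sizes that sum to $K$, the parameters of the normal and Pareto draws, the clipping of the normal lifespan shape, and all function names are hypothetical choices made for illustration.

```python
import random

def settlement_sizes(K=1000, M=10, min_size=5, kind="normal", rng=random):
    """Draw M settlement sizes summing to K, each at least min_size (assumed scheme)."""
    if kind == "normal":
        raw = [max(rng.gauss(1.0, 0.3), 0.05) for _ in range(M)]
    else:  # heavy-tailed stand-in for a power-law size distribution
        raw = [rng.paretovariate(1.5) for _ in range(M)]
    spare = K - M * min_size               # dwellings left after the guaranteed minimum
    total = sum(raw)
    sizes = [min_size + int(spare * r / total) for r in raw]
    for i in range(K - sum(sizes)):        # hand back dwellings lost to rounding down
        sizes[i % M] += 1
    return sizes

def dwelling_dates(k, start_bp, end_bp, shape="uniform", rng=random):
    """Draw k uncalibrated dates (BP) inside one settlement's occupation span."""
    span = start_bp - end_bp               # BP counts backwards, so start > end
    dates = []
    for _ in range(k):
        if shape == "uniform":
            u = rng.random()
        elif shape == "normal":
            u = min(max(rng.gauss(0.5, 0.15), 0.0), 1.0)   # clipped to the span
        else:                              # skewed: beta(alpha=2, beta=5)
            u = rng.betavariate(2, 5)
        dates.append(start_bp - u * span)
    return dates

def hypothetical_population(window=(2200, 1600), span_range=(50, 300),
                            size_kind="normal", shape="uniform", seed=1):
    """Return one list of dwelling dates per settlement for a single population model."""
    rng = random.Random(seed)
    population = []
    for k in settlement_sizes(kind=size_kind, rng=rng):
        length = rng.uniform(*span_range)                    # occupation span in years
        start = rng.uniform(window[1] + length, window[0])   # keep the span in the window
        population.append(dwelling_dates(k, start, start - length, shape, rng))
    return population
```

Calling `hypothetical_population(window=(4500, 3000), span_range=(100, 500))` would give the corresponding 1500-year case; the 40-year measurement error is not applied here and would be attached when each date is turned into a density.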

Figure 1. Boxplots showing RMSE distances from the hypothetical populations, $L(t)$, to random sample sets, $P(t)$: (a) 600 years and (b) 1500 years.

Figure 2. Examples of SPD comparisons: 600 years, maximum sample fraction 30%, 25-year rolling mean applied.

Let $d_i^{\,j}\left( t \right)$ denote the probability distribution of the calibrated radiocarbon date of dwelling $D_i^{\,j}$ . The aggregated probability distribution of the dates of each settlement can then be defined as

(1)

$${L_i}\left( t \right) = {1 \over K}\sum \limits_{j = 1}^{{k_i}} d_i^{\,j}\left( t \right)\;,$$

and the normalized SPD of the population as

(2)

$$L\left( t \right) = {1 \over K}\mathop \sum \limits_{i = 1}^M \mathop \sum \limits_{j = 1}^{{k_i}} d_i^{\,j}\left( t \right) = \mathop \sum \limits_{i = 1}^M {L_i}\left( t \right).$$

From each hypothetical population, we randomly sampled dwellings from each settlement. Each settlement had a minimum of five samples, and the maximum sampling fraction was set to 20%, 30%, or 40%. Sampling intensity per settlement was randomly determined within these bounds, simulating heterogeneous sampling intensity. Each combination of population model and sampling fraction was replicated 50 times.

Let $P\left( t \right)$ be the normalized SPD from the sampled dataset, where ${n_i}$ is the sample size for settlement $i$ and $N = \mathop \sum \nolimits_{i = 1}^M {n_i}$ is the total number of sampled dwellings. The overall SPD and settlement-level SPDs from the sampled data are:

(3)

$$P\left( t \right) = {1 \over N}\mathop \sum \limits_{i = 1}^M \left[ {\mathop \sum \limits_{j = 1}^{{n_i}} d_i^{\,j}\left( t \right)} \right] = \mathop \sum \limits_{i = 1}^M {P_i}\left( t \right),$$

and

(4)

$${P_i}\left( t \right) = {1 \over N}\mathop \sum \limits_{j = 1}^{{n_i}} d_i^{\,j}\left( t \right),$$

respectively.
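Equations (1) to (4) share one form, a sum of per-date densities divided by the total number of dates, and the sampling step only changes which dates enter the sum. The Python sketch below illustrates this under explicit assumptions: a Gaussian with a 40-year standard deviation stands in for a calibrated date density (no calibration curve is used), the per-settlement sample size is drawn uniformly between five and the maximum fraction, and every function name is hypothetical.

```python
import math
import random

def date_density(date_bp, grid, sd=40.0):
    """Gaussian stand-in for one date's calibrated density d(t), evaluated on grid."""
    norm = sd * math.sqrt(2 * math.pi)
    return [math.exp(-0.5 * ((t - date_bp) / sd) ** 2) / norm for t in grid]

def summed(dates, grid, divisor):
    """(1 / divisor) * sum_j d_j(t): the form shared by Eqs. (1)-(4)."""
    total = [0.0] * len(grid)
    for d in dates:
        for idx, v in enumerate(date_density(d, grid)):
            total[idx] += v
    return [v / divisor for v in total]

def overall_spd(settlements, grid):
    """Return (overall curve, settlement-level curves): L and L_i, or P and P_i."""
    total_n = sum(len(s) for s in settlements)       # K for populations, N for samples
    parts = [summed(s, grid, total_n) for s in settlements]
    return [sum(col) for col in zip(*parts)], parts

def draw_sample(population, max_fraction=0.3, min_n=5, rng=random):
    """Sample each settlement with its own randomly chosen intensity (assumed rule)."""
    sample = []
    for dates in population:
        upper = max(min_n, int(max_fraction * len(dates)))
        n_i = min(len(dates), rng.randint(min_n, upper))
        sample.append(rng.sample(dates, n_i))         # without replacement
    return sample
```

Passing a full population to `overall_spd` gives $L(t)$ and the $L_i(t)$; passing the output of `draw_sample` gives $P(t)$ and the $P_i(t)$. Because both are divided by the overall count, the settlement-level curves sum to the overall curve, as in Eqs. (2) and (3).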

3.2. Rescaling: Weighting and bootstrap resampling

To address the issue of uneven sampling intensity across settlements, we propose two rescaling approaches, settlement-level weighting and bootstrap resampling, designed to adjust for disparities between the number of dated dwellings and the total dwelling counts. Both methods aim to approximate the population history that would be inferred if all dwellings within each settlement were dated, thereby mitigating distortions arising from heterogeneous sampling intensities. In the sections that follow, we outline the implementation of each method and describe how they are applied to construct adjusted SPDs.

3.2.1. Weighting

Suppose we have ${n_i}$ sampled dates from ${k_i}$ dwellings for each settlement ${S_i}$ . The probability density functions $d_i^{\,j}\left( t \right)$ of the sampled dates can be aggregated and weighted at the settlement level by defining ${W_i}\left( t \right)$ as

(5)

$${W_i}\left( t \right) = {{{k_i}} \over K}\left[ {{1 \over {{n_i}}}\mathop \sum \limits_{j = 1}^{{n_i}} d_i^{\,j}\left( t \right)} \right] = \;{{{k_i}N} \over {{n_i}K}}{P_i}\left( t \right)$$

since

(6)

$${P_i}\left( t \right) = {1 \over N}\mathop \sum \limits_{j = 1}^{{n_i}} d_i^{\,j}\left( t \right) = {{{n_i}} \over N}\left[ {{1 \over {{n_i}}}\mathop \sum \limits_{j = 1}^{{n_i}} d_i^{\,j}\left( t \right)} \right]$$

Then, the weighted overall SPD $W\left( t \right)\;$ is $\mathop \sum \nolimits_{i = 1}^M {W_i}\left( t \right)$ .
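A small numerical case may help show what Eq. (5) does. The Python sketch below is illustrative: it assumes each sampled date is already a density vector on a shared grid, and the function names and toy values are hypothetical.

```python
def sample_spd(sample_densities):
    """Unweighted P(t): every sampled date counts equally (Eq. 3)."""
    N = sum(len(dens) for dens in sample_densities)
    n_steps = len(sample_densities[0][0])
    return [sum(d[t] for dens in sample_densities for d in dens) / N
            for t in range(n_steps)]

def weighted_spd(sample_densities, dwelling_counts):
    """Weighted W(t): settlement i's mean density is scaled by k_i / K (Eq. 5)."""
    K = sum(dwelling_counts)
    n_steps = len(sample_densities[0][0])
    W = [0.0] * n_steps
    for dens, k_i in zip(sample_densities, dwelling_counts):
        n_i = len(dens)
        for t in range(n_steps):
            mean_t = sum(d[t] for d in dens) / n_i     # (1 / n_i) * sum_j d_i^j(t)
            W[t] += (k_i / K) * mean_t
    return W

# Hypothetical toy case: a 90-dwelling settlement with 2 dates (early),
# and a 10-dwelling settlement with 8 dates (late).
large_site = [[1.0, 0.0, 0.0]] * 2
small_site = [[0.0, 0.0, 1.0]] * 8
print(sample_spd([large_site, small_site]))              # [0.2, 0.0, 0.8]
print(weighted_spd([large_site, small_site], [90, 10]))  # [0.9, 0.0, 0.1]
```

The unweighted curve follows where the dates were taken, whereas the weighted curve follows where the dwellings are. When $n_i / N = k_i / K$ for every settlement, the factor $k_i N / (n_i K)$ in Eq. (5) equals 1 and the two curves coincide.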

3.2.2. Bootstrap resampling

The second rescaling approach applies the bootstrap method originally proposed by Efron (1979). Bootstrapping is a non-parametric resampling technique that draws samples with replacement from an observed dataset to estimate statistical parameters such as means, variances, and confidence intervals (Aczel 1995). This technique is particularly well suited to small samples with unknown or complex distributions and has been widely applied in archaeological research, including examination of prehistoric population histories (e.g., Downey et al. 2014; Drennan et al. 2015; Eren et al. 2012; McLaughlin 2019; Price et al. 2016; Rick 1987; Robinson et al. 2019; but see Heaton et al. 2025 for a critique).

In our study, we employ a modified version of the bootstrap method. For each settlement, we resample with replacement, drawing a number of dates equal to the total count of dwellings in that settlement, and repeat this process $b$ times. For a settlement with ${k_i}$ dwellings, this yields $b{k_i}$ resampled dates, whose settlement-level aggregation ${R_i}\left( t \right)$ is

(7)

$${R_i}\left( t \right) = {{{k_i}} \over K}\left[ {{1 \over {b{k_i}}}\mathop \sum \limits_{j = 1}^{b{k_i}} d_{i}^{\,j}\left( t \right)} \right]$$

and the overall SPD of the resampled dataset $R\left( t \right)$ is $\mathop \sum \nolimits_{i = 1}^M {R_i}\left( t \right)$ .

Bootstrap resampling is repeated 30 times per settlement (i.e., $b = 30$ ). Unlike standard applications of bootstrapping, which aim to estimate population parameters such as the mean, standard deviation, and confidence interval, our goal is to enhance sample representativeness for SPD generation. We therefore pool the 30 resampled iterations and aggregate them to produce a rescaled probability density function proportional to the total dwelling count of each settlement. This results in a resampled dataset for each settlement that contains 30 times as many dates as the settlement has dwellings. The settlement-level probability density functions are then summed to construct an area-wide aggregated SPD reflecting overall demographic trends. For a hypothetical population of 1,000 dwellings across 10 settlements, the procedure generates 30,000 resampled dates per simulation. Finally, the results are normalized.
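The pooled resampling described above can be sketched as follows. This is a minimal illustration rather than the authors' code: the function names, the fixed random seed, the two invented settlements, and the normal curves standing in for calibrated dates are assumptions. Instead of storing every resampled date, the sketch tallies how often each original date is drawn, which gives the same aggregate as Equation 7.

```python
import math
import random
from collections import Counter

def gaussian_density(grid, mean, sd):
    """Toy stand-in for a calibrated date: a normal curve on the grid, scaled to sum to 1."""
    vals = [math.exp(-0.5 * ((t - mean) / sd) ** 2) for t in grid]
    total = sum(vals)
    return [v / total for v in vals]

def bootstrap_spd(settlements, n_bins, b=30, seed=0):
    """R(t) as in Equation 7: per settlement, draw k_i dates with replacement b times and pool."""
    rng = random.Random(seed)
    K = sum(s["k"] for s in settlements)
    R = [0.0] * n_bins
    n_draws = 0
    for s in settlements:
        size = b * s["k"]  # pooled resample size for this settlement
        # Tally how often each original date is drawn
        picks = Counter(rng.choices(range(len(s["dates"])), k=size))
        for j, count in picks.items():
            weight = (s["k"] / K) * (count / size)
            for idx, v in enumerate(s["dates"][j]):
                R[idx] += weight * v
        n_draws += size
    return R, n_draws

grid = list(range(1500, 2101))  # years cal BP, 1-year bins
settlements = [
    # hypothetical large but sparsely dated settlement: 400 dwellings, 2 dates
    {"k": 400, "dates": [gaussian_density(grid, 1950, 30),
                         gaussian_density(grid, 1900, 30)]},
    # hypothetical small but intensively dated settlement: 100 dwellings, 8 dates
    {"k": 100, "dates": [gaussian_density(grid, 1600 + 15 * j, 30) for j in range(8)]},
]
R, n_draws = bootstrap_spd(settlements, len(grid))

# Share of probability mass older than 1800 cal BP (almost entirely the large settlement)
early_R = sum(r for t, r in zip(grid, R) if t > 1800)
print(n_draws, round(early_R, 2))
```

With 500 dwellings in total and $b = 30$, the sketch draws 15,000 resampled dates, mirroring the 30,000 quoted in the text for 1,000 dwellings. Because each settlement's share is fixed at ${k_i}/K$, the result differs from the weighted SPD only through random variation in which dates are drawn within each settlement.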

3.3. Comparison and evaluation

We assess the degree to which the SPDs of sampled and rescaled datasets approximate the hypothetical populations. Specifically, we examine how rescaling affects correspondence with both the overall population-level SPDs, $L\left( t \right)$ , and the individual settlement lifespans, ${L_i}\left( t \right)$ , relative to random sample sets. We use root mean squared error (RMSE) as a quantitative measure of dissimilarity. It is the square root of the mean of squared differences between the corresponding function values across all time bins:

(8)

$$\left\| {f - g} \right\| = \sqrt {{1 \over n}\mathop \sum \limits_{i = 1}^n {{\left|\, {f\left( {{t_i}} \right) - g\left( {{t_i}} \right)} \right|}^2}} $$

where $n$ is the number of time bins, each of which spans 1 year in this study.

This metric provides a single scalar value reflecting the overall difference between two distributions. A lower RMSE score indicates a closer match to the reference population. It is particularly useful in our context because it is sensitive to both systematic bias (e.g., temporal shifts in peaks and dips) and stochastic noise (e.g., sampling variation), capturing overall fidelity of the sampled SPD to the target population curve (Scott 2015). In each simulation, we calculate RMSE distances from the hypothetical population distributions ( $L\left( t \right)$ and ${L_i}\left( t \right)$ ) to the randomly sampled datasets ( $P\left( t \right)$ and ${P_i}\left( t \right)$ ), and the rescaled datasets ( $W\left( t \right)$ , ${W_i}\left( t \right)$ , $R\left( t \right)$ , and ${R_i}\left( t \right)$ ), respectively. This allows us to evaluate whether rescaling reduces deviation from the true population more effectively than unadjusted random sampling under a variety of conditions. To ensure comparability, all SPDs are normalized prior to RMSE calculation.
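A minimal sketch of the comparison in Equation 8 is given below. The function names and the three five-bin curves are invented for illustration and are not taken from the simulations.

```python
import math

def normalize(spd):
    """Scale a curve so that its values sum to 1."""
    total = sum(spd)
    return [v / total for v in spd]

def rmse(f, g):
    """Root mean squared error between two curves on the same time bins (Equation 8)."""
    if len(f) != len(g):
        raise ValueError("curves must share the same time bins")
    n = len(f)
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(f, g)) / n)

target = normalize([1, 2, 4, 2, 1])    # toy hypothetical population curve, L(t)
sampled = normalize([3, 3, 2, 1, 1])   # toy stand-in for a random sample SPD, P(t)
rescaled = normalize([1, 2, 3, 3, 1])  # toy stand-in for a rescaled SPD, W(t) or R(t)

# A lower score means a closer match to the target curve
print(rmse(target, sampled), rmse(target, rescaled))
```

Normalizing before the comparison matters because RMSE is scale-dependent: two curves with the same shape but different totals would otherwise register as dissimilar.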

4. Results

4.1. Overall change in population size inferred from SPDs

Normalized SPDs derived from random sample sets, $P\left( t \right)$ , exhibit notable variation in shape across the 50 iterations and frequently deviate from the patterns observed in the hypothetical populations ( $L\left( t \right)$ ), irrespective of the underlying parameters (Figures 3 and 4 in Supplementary 1). Increasing the maximum sampling fraction per settlement enlarges the overall sample size: on average, 122 dates ( $s = 19$ ) for the 20% cap, 175 ( $s = 33$ ) for 30%, and 218 ( $s = 44$ ) for 40%. However, as indicated in Figure 1, a larger sample does not consistently reduce variance or improve representativeness. Rather, higher maximum sampling fractions tend to amplify inter-settlement heterogeneity, suggesting that larger sample sizes alone do not necessarily yield closer approximations when sampling intensity is uneven across settlements.

Figure 3.

Examples of SPD Comparisons: 1500 years, Maximum Sample Fraction 30%, 50-year rolling mean applied.

Figure 4.

Comparisons of RMSE distances from the hypothetical populations, ${\rm{L}}\left( {\rm{t}} \right)$ , to random sample sets, ${\rm{P}}\left( {\rm{t}} \right)$ , weighted sample sets, ${\rm{W}}\left( {\rm{t}} \right)$ , and bootstrap resampled datasets, ${\rm{R}}\left( {\rm{t}} \right)$ .

Rescaling the random sample sets in proportion to the number of dwellings per settlement, whether via weighting ( $W\left( t \right)$ ) or bootstrap resampling ( $R\left( t \right)$ ), which yield nearly identical results, consistently produces SPDs that better approximate the original hypothetical populations, $L\left( t \right)$ . Figures 2 and 3 show eight representative cases, four from each of the datasets constructed with 600- and 1500-year intervals, selected to illustrate the effect of rescaling. For instance, as shown in Figure 2a, the SPD derived from the random sample set suggests overall population growth with a minor dip around 1900 cal BP. In contrast, both the hypothetical population and the rescaled dataset indicate a marked population decline following 2000 cal BP and a steady recovery from 1850 cal BP. Similarly, in Figure 2c, the rescaled and hypothetical SPDs exhibit a consistent growth trend punctuated by a dip near 1900 cal BP, while the random sample departs noticeably from this trajectory. Figure 3a likewise demonstrates that, whereas the sampled data show a declining trend after 3800 cal BP, the rescaled data better capture the shape of the hypothetical SPD, including another peak around 3400 cal BP.

Figure 4 presents boxplots of RMSE scores across all experimental conditions. In nearly all cases, rescaled datasets show lower RMSE values and smaller variances, indicating not only improved fidelity to the target population but also reduced susceptibility to sampling noise. Table 1 summarizes the comparative performance of the rescaling approaches. Across 1,800 iterations, the rescaled datasets yielded lower RMSE scores in 1,704 cases (94.7%) for weighting and in 1,702 cases (94.6%) for resampling. Both two-sample t-tests and pairwise Wilcoxon signed-rank tests confirm the statistical significance of these differences (t-test, $p \lt 0.0001$ ; Wilcoxon signed-rank test, $p \lt 0.0001$ ). A closer look at Table 1 reveals that the efficacy of rescaling improves incrementally with higher maximum sampling fractions and longer temporal durations. In the 600-year simulations, rescaled datasets yield lower RMSE scores than random samples in 265 to 286 of 300 iterations for weighting and in 262 to 286 of 300 iterations for resampling. This becomes more pronounced in the 1500-year simulations, where the number of successful cases rises to between 292 and 295 for weighting and between 292 and 296 for resampling. For both durations, a higher maximum sampling fraction, which generally implies greater inter-settlement heterogeneity in sampling intensity, makes rescaling reduce RMSE more consistently.

Table 1.

Summary of RMSE: Case Count. (Detailed information is provided in Tables 1 and 2 in Supplementary 1)

4.2. Lifespans of individual settlements and inter-settlement relationships

The issue of heterogeneous sampling intensity presents a more serious challenge when interpreting the organization of settlements using radiocarbon datasets. Spatial analyses that aim to investigate demographic dynamics—such as inter-settlement relationships, aggregation and dispersion processes, and the emergence or relocation of central places—rely on reliable estimation of individual settlement size and duration.

Figure 5 presents selected cases from our simulation, illustrating aggregated probability distributions for ten individual settlements. In these cases, the probability distributions derived from random samples ( ${P_i}\left( t \right)$ ) over- or underrepresent the size and occupation span of settlements. Figures 5 and 6 together reveal how such distortions in the temporal representation of individual settlements can mislead interpretations of both spatial dynamics and broader population trends. By contrast, rescaling the random samples in proportion to the number of dwellings per settlement, ${W_i}\left( t \right)$ and ${R_i}\left( t \right)$ , yields probability distributions that more closely approximate the actual temporal profiles of the simulated settlements, ${L_i}\left( t \right)$ .

Figure 5.

Examples of Settlement SPDs: (a) Power-law & Normal distribution, (b) Power-law & Skewed distribution (600 years, Maximum Sample Fraction 30%, 25-year rolling mean applied). For details of settlements, see Table 2 in Supplementary 1.

Figure 6.

Examples of Settlement SPDs: (a) Power-law & Uniform distribution, (b) Power-law & Normal distribution (1500 years, Maximum Sample Fraction 30%, 50-year rolling mean applied). For details of settlements, see Table 2 in Supplementary 1.

In Figure 5a (600-year timespan), the random sample set underrepresents Settlement 1, spanning roughly from 2200 to 1800 cal BP, and overrepresents Settlement 2 (see Figures 1 and 2 in Supplementary 1 for detailed information about simulated settlements). Figure 5b illustrates the considerable underrepresentation of Settlement 1, along with overrepresentation of Settlements 2 and 4, when an SPD is produced using the random sample set. A comparable pattern is observed in the 1500-year timespan. In Figure 6a, Settlement 1 (ca. 3500–3000 cal BP) is significantly underrepresented in the random sample, while Settlement 4 is overrepresented. In Figure 6b, Settlement 1 (spanning ca. 5000–4500 cal BP), the largest among the ten, is again underrepresented. In all cases, the rescaled datasets more faithfully reconstruct the original lifespans and relative sizes of the settlements. Figure 7 confirms this trend. Random sample sets consistently yield higher RMSE scores and greater variance compared to rescaled datasets across all settlements. Severe misrepresentation is most likely when small samples are drawn from large settlements, or vice versa, particularly under a power-law distribution of settlement sizes. Settlement 1 ( $n = 363$ ) and Settlement 2 ( $n = 183$ ), the largest settlements (Table 1 in Supplementary 1), show higher mean RMSE scores and greater variances, indicating heightened susceptibility to sampling bias. In contrast, rescaling consistently lowers both the average RMSE distances and their variability across all settlements.

Figure 7.

Comparisons of RMSE distances from the hypothetical populations, ${{\rm{L}}_{\rm{i}}}\left( {\rm{t}} \right)$ , to random sample sets, ${{\rm{P}}_{\rm{i}}}\left( {\rm{t}} \right)$ , weighted sample sets, ${{\rm{W}}_{\rm{i}}}\left( {\rm{t}} \right)$ and bootstrapped datasets, ${{\rm{R}}_{\rm{i}}}\left( {\rm{t}} \right)$ by settlement: (a) 600 years (b) 1500 years (Maximum Sample Fraction 30%)

5. Case study: Demographic dynamics in the proto- and early historical periods of Korea

Our experiments reveal that rescaling consistently produces SPDs that better approximate those of hypothetical populations, providing different interpretations of population history. We apply our rescaling method to radiocarbon datasets from archaeological sites in Korea dating to the proto- and early historical periods. Beginning in 1C BC, Korea witnessed a significant increase in social complexity, driven primarily by the introduction of iron technology from China, which was applied to agricultural tools and weaponry. This period, termed the Proto–Three Kingdoms period (1C BC–AD 3C), saw rapid population growth and the emergence of numerous competing polities. This shift eventually led to the formation of three ancient states by late AD 3C, inaugurating the Three Kingdoms period (AD 4–7C). According to both Korean and contemporaneous Chinese historical sources, Baekje, one of the three kingdoms, originated as a small polity in present-day Seoul. Historical texts suggest that it developed into a centralized state by expanding southward through military conquest and/or political consolidation (Kim 1998 [1145]; Noh 1987). This expansion is archaeologically supported by the spread of Baekje-style pottery and tomb types across central-western and southwestern Korea (Kim 2007; Kwon 2001; Lee 2022; Park 2001, 2007). By AD 6C, peripheral areas were increasingly incorporated into Baekje’s domain, though the timing and character of these processes remain subjects of debate (for a recent overview, see Kim 2024). The southward advance of Baekje appears to have entailed reorganizations of settlement patterns in the peripheries, including the emergence and relocation of population concentrations (Park et al. 2017; Wright et al. 2020).

For this case study, we examine two areas: the upper-middle Yeongsan River Basin in southwestern Korea and the upper Geum River Basin in central-western Korea (Figure 8). Since the early 2000s, extensive archaeological surveys and excavations, largely prompted by government-led infrastructure development and river refurbishment projects, have uncovered a wide array of settlements in both areas, ranging from small hamlets to large villages with more than 1000 pit dwellings. While most excavations have been undertaken by cultural resource management (CRM) firms, the Bureau of Cultural Heritage of the South Korean government supervises all stages of investigation to ensure methodological consistency and reporting quality. By national regulation, excavation reports must be published and made publicly available within two years of completion. As of 2024, the number of published radiocarbon dates in South Korea stands at approximately 20,000, making it one of the most comprehensive and densely sampled radiocarbon databases globally (Hwang 2021; Kim and Seong 2022; Oh et al. 2017; Park et al. 2017; Seong and Kim 2022; Wright et al. 2020). Within this dataset, 3561 dates from dwellings, reported from 481 sites across South Korea, fall between 2100 and 1500 BP, the period of interest in this study.

Figure 8.

Study areas and sites: (a) Locations. (b) The Geum River Basin. (1. Bokryong-dong-1; 2. Bokryong-dong-2; 3. Bongmyeong-dong; 4. Daepyong-ri; 5. Juk-dong; 6. Naseong-ri; 7. Songjeol-dong; 8. Yonggye-dong; 9. Yongho-Hapgang-ri) (c) The Yeongsan River Basin. (1. Dongnim-dong; 2. Hanam-dong; 3. Heukseok-dong; 4. Oseon-dong; 5. Sanjeong-dong; 6. Seonam-dong; 7. Sinchang-dong; 8. Taemok-ri; 9. Yeonsan-dong; 10. Yongdu-dong; 11. Yongsan-dong)

We selected settlements with at least five dated dwellings from the study areas. Published radiocarbon dates were critically assessed and filtered to retain only those directly associated with dwellings attributable to the Proto- and Early Three Kingdoms periods. Anomalous or archaeologically questionable dates were excluded. In cases where multiple dates were obtained from a single dwelling, the results were statistically combined to avoid overrepresentation. We then applied our rescaling procedures to the filtered datasets for each area and produced SPD and spatiotemporal KDE analyses using the R package rcarbon (Crema and Bevan 2021). Details about the settlements selected for this study, including locations, number of dwellings, dates, and sample fractions, are provided in Supplementary 2.
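The text does not specify how multiple dates from one dwelling were combined. One widely used option, shown here purely as an assumption, is to pool the uncalibrated ages with an inverse-variance weighted mean and to check their mutual consistency with a chi-square-type statistic before calibrating the pooled age. The function name and the two example measurements are hypothetical.

```python
import math

def pool_dates(ages, errors):
    """Inverse-variance weighted mean of uncalibrated 14C ages.

    Returns the pooled age, its standard error, and a consistency statistic T,
    which can be compared against a chi-square distribution with len(ages) - 1
    degrees of freedom.
    """
    weights = [1.0 / e ** 2 for e in errors]
    total = sum(weights)
    mean = sum(w * a for w, a in zip(weights, ages)) / total
    sd = math.sqrt(1.0 / total)
    T = sum(w * (a - mean) ** 2 for w, a in zip(weights, ages))
    return mean, sd, T

# Two hypothetical measurements from the same dwelling (14C years BP, 1-sigma errors)
mean, sd, T = pool_dates([1850, 1800], [30, 40])
print(mean, sd, T)
```

Pooling in this way makes each dwelling contribute a single date, so that ${n_i}$ counts dated dwellings rather than individual measurements.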

5.1. The Yeongsan River Basin

In the Yeongsan River Basin, we analyze 142 filtered radiocarbon dates drawn from eleven settlements comprising a total of 3,287 dwellings. The proportion of dated dwellings varies widely between settlements, ranging from 1.34% to 76.92% (Table 2 in Supplementary 2). The overall SPDs of the original and rescaled datasets (Figure 9a) both depict a general trend of population growth from 2000 to 1750 cal BP, followed by a rapid increase peaking around 1700 cal BP and stabilization thereafter, with a minor and likely negligible dip near 1650 cal BP.

Figure 9.

Analytic result of the Yeongsan River Basin. (a) Overall SPD; (b) Probability Distributions of Settlements (Site numbers correspond to Figure 8); (c) KDE analyses over time.

However, when disaggregated to the level of individual settlements, the two datasets diverge remarkably (Figure 9b). The unrescaled data imply that, prior to 1700 cal BP, populations were relatively evenly distributed across small, similarly sized settlements, until the Yeonsan-dong settlement (No. 9) in the southern basin rapidly emerged as the dominant center. By contrast, the rescaled data indicate that Taemok-ri (No. 8) in the north was the largest and most populous settlement until around 1700 cal BP, after which Yeonsan-dong and Oseon-dong (No. 4) expanded abruptly, giving rise to three coexisting population centers.

These divergent reconstructions of the population history are also evident in the KDE analysis (Figure 9c). The unrescaled original dataset highlights the emergence of a new population concentration in the southern basin, with a southwestward shift beginning around 1800 cal BP. The rescaled dataset, however, suggests that the Taemok-ri settlement remained a major population center well after 1700 cal BP, coexisting with other concentrations in the middle basin.

5.2. The Geum River Basin

In the Geum River Basin, we analyze 195 filtered radiocarbon dates from nine settlements comprising a total of 1,442 dwellings. The overall sampling fraction was 13.6%, higher than that of the Yeongsan River Basin, but with considerable variation across individual settlements, ranging from 1.8% to 42.1% (Table 3 in Supplementary 2). Notably, 119 dates were reported from the Songjeol-dong settlement in the northern part of the basin.

At the aggregate level, the SPDs of the original and rescaled datasets both show general population growth until 1700 cal BP, followed by stabilization (Figure 10a). However, the growth trajectories and relative magnitudes differ between the datasets. The probability distributions of individual settlements diverge dramatically, yielding contrasting population histories (Figure 10b). The unrescaled original dataset portrays Songjeol-dong (No. 7) as a dominant population center from 1800 to 1650 cal BP. In contrast, the rescaled dataset indicates that Yonggye-dong (No. 8), located in the south, was the largest settlement from 1950 to 1800 cal BP, after which it dissolved rapidly as Songjeol-dong rose as the principal settlement. These differences are further underscored by the KDE analysis (Figure 10c). The unrescaled original data suggest that Songjeol-dong sustained the largest population throughout the period, whereas the rescaled data show high population density in the south (Yonggye-dong) prior to 1800 cal BP, followed by a sharp demographic shift toward the north (Songjeol-dong).

Figure 10.

Analytic result of the Geum River Basin. (a) Overall SPD; (b) Probability Distributions of Settlements (Site numbers correspond to Figure 8); (c) KDE analyses over time.

The disparities between the two reconstructions are largely attributed to differential sampling fractions (Supplementary 2). Yonggye-dong, which was occupied earlier, had a low sample fraction (eight dates from 448 dwellings; 1.8%), whereas the sample fraction of Songjeol-dong, the population of which increased substantially only after 1800 cal BP, is much higher (119 dates from 558 dwellings; 21.3%). These imbalances likely contribute to significant underrepresentation of the earlier-occupied Yonggye-dong (Figure 10b).
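The effect of these sample fractions can be made concrete with the counts quoted in the text. The sketch below assumes that Equation 5 is applied with the basin totals given above ($K = 1442$ dwellings and $N = 195$ dates); the variable names are illustrative. It compares each settlement's share of the unrescaled SPD, ${n_i}/N$, with its share after rescaling, ${k_i}/K$, and reports the implied per-date multiplier ${k_i}N/\left( {{n_i}K} \right)$.

```python
K = 1442  # total dwellings across the nine Geum River Basin settlements
N = 195   # total filtered radiocarbon dates in the basin

# (dated dwellings n_i, total dwellings k_i), as reported in the text
sites = {
    "Yonggye-dong": (8, 448),
    "Songjeol-dong": (119, 558),
}

results = {}
for name, (n_i, k_i) in sites.items():
    raw_share = n_i / N            # share of the unrescaled SPD
    rescaled_share = k_i / K       # share of the rescaled SPD
    multiplier = rescaled_share / raw_share  # equals k_i * N / (n_i * K)
    results[name] = (raw_share, rescaled_share, multiplier)
    print(f"{name}: raw {raw_share:.1%}, rescaled {rescaled_share:.1%}, "
          f"per-date multiplier {multiplier:.2f}")
```

Under these assumptions, Yonggye-dong rises from roughly 4% of the unrescaled SPD to roughly 31% after rescaling, while Songjeol-dong falls from roughly 61% to roughly 39%.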

To summarize, the case studies of the Yeongsan and Geum River Basins demonstrate that the use of unadjusted radiocarbon datasets can lead to significantly different reconstructions of local population histories than those produced through rescaling. Although it is not possible to determine whether the rescaled SPDs better reflect past demographic realities, they provide reconstructions more consistent with the established ceramic chronology of the region (Cho 2007; Kim 2000, 2007; Lee 2011; Yun 2014) than those based on the unrescaled original data.

6. Discussion

Recent methodological developments of the dates-as-data approach (e.g., Carleton 2021; Crema 2022; Heaton 2022; Heaton et al. 2025; Timpson et al. 2021) have enhanced its statistical rigor. These studies advocate embedding demographic inference within formal modeling frameworks, including null hypothesis testing and model fitting. Moving beyond the direct application of unidirectional population growth models or null-hypothesis testing, more recent research (e.g., Crema and Shoda 2021; DiNapoli et al. 2021; Price et al. 2020; Timpson et al. 2021) has attempted to model population fluctuations. These represent important methodological advancements; however, several key issues warrant further consideration.

First, current methodological studies predominantly focus on changes in total population size over time at a macro-scale. While understanding such broad-scale trends is essential, we argue that, from an archaeological perspective, this captures only one dimension of past demographic dynamics. Many fundamental research questions in archaeological demography concern population reorganization, including processes such as relocations (Kim and Seong 2022; Seong and Kim 2022), community aggregation and dispersion (Barrier 2017; Birch 2012; Feinman and Neitzel 2023; Gyucha 2019; Haggis 2013; Kohler et al. 2004; Whallon 2006), the emergence and decline of regional centers (Hill et al. 2004; Kohler and Varien 2012; Ortman et al. 2015; Ortman and Coffey 2017; Park et al. 2017; Smith 2023; Wright et al. 2020), and shifts in settlement hierarchies or heterarchies (Duffy 2015; Kowalewski 2014; Peterson and Drennan 2005). These population dynamics are critical for understanding sociopolitical integration, economic interaction, and land-use strategies; yet they may occur with little or no net change in overall population size.

More importantly, the reliability of population reconstructions is not guaranteed by the application of statistically rigorous summary methods alone. While techniques such as null hypothesis testing and model fitting are valuable for evaluating the structure and potential biases of SPDs, they do not automatically transform biased datasets into explanatory models. A critical issue concerns the representativeness of radiocarbon data and the sampling biases inherent in legacy datasets, which significantly affect interpretive validity. As interest in past population dynamics has grown and the dates-as-data approach has become increasingly widespread, particularly alongside the construction of radiocarbon databases, greater attention must be devoted to addressing these limitations.

Given these challenges, an important question arises: how can known biases, particularly those arising from uneven sampling intensity, be managed? To address this issue, we developed and tested a rescaling method aimed at mitigating sampling heterogeneity at the level of individual settlements. We compared SPDs generated from hypothetical populations, randomly sampled datasets (the archaeologist’s practical analogue), and rescaled datasets. SPDs were used not as direct measures of population change but as heuristic tools to demonstrate how significantly summary results can diverge between actual populations and sampled data, and to evaluate whether rescaling can recover the original demographic signals.

Our simulations indicate that rescaled datasets, adjusted using dwelling counts of individual settlements, consistently yield SPDs that more closely approximate the demographic trajectories of hypothetical populations than do unadjusted random samples. The simulation results proved robust across a variety of conditions, including different settlement size distributions (normal vs. power-law), varied settlement lifespans (uniform, normal, skewed), and differing temporal spans (600 vs. 1500 years). Although rescaling can amplify distortions, as correctly noted by McLaughlin (2019) and Heaton et al. (2025), especially when the original datasets are already highly biased, such instances were rare in our simulations: only 96 cases (5.3%) for weighting and 98 (5.4%) for resampling out of 1,800 total iterations (Table 1). In over 94% of cases, rescaling reduced RMSE scores between sample SPDs and reference populations, confirming its general effectiveness.

The benefits of rescaling become more pronounced as sampling heterogeneity across settlements increases. While increasing the maximum sample fraction per settlement raises total sample size, it does not necessarily improve demographic approximations unless inter-settlement sampling heterogeneity is addressed. Rescaling also performs particularly well when the temporal density of sampled radiocarbon dates is low: in our 1500-year simulations, it lowered RMSE in roughly 97–98% of iterations, compared to 87–95% in the 600-year simulations.
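The success rates quoted here follow from the case counts reported in Section 4.1 (out of 300 iterations per condition and 1,800 overall). The short sketch below simply reproduces that arithmetic; the dictionary layout and function name are illustrative.

```python
def pct(successes, total):
    """Percentage of iterations in which rescaling lowered RMSE."""
    return 100.0 * successes / total

# (lowest, highest) number of successful cases out of 300, from Section 4.1
ranges = {
    ("600-year", "weighting"): (265, 286),
    ("600-year", "resampling"): (262, 286),
    ("1500-year", "weighting"): (292, 295),
    ("1500-year", "resampling"): (292, 296),
}
rates = {key: (pct(low, 300), pct(high, 300)) for key, (low, high) in ranges.items()}
for key, (low, high) in rates.items():
    print(key, round(low, 1), round(high, 1))

# Overall success rates across all 1,800 iterations
overall = {"weighting": pct(1704, 1800), "resampling": pct(1702, 1800)}
print({k: round(v, 1) for k, v in overall.items()})
```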

These advantages are especially evident in reconstructing inter-settlement dynamics. As emphasized in prior studies (e.g., Bevan and Crema 2021; Birch-Chapman and Jenkins 2019; Brown et al. 2013; Crown 1991; Dewar 1991; Drennan et al. 2015; Petrie and Lynam 2020; Plog 1974, 1975; Prentiss et al. 2003; Schacht 1984; Shott 1992; Varien et al. 2007), estimating the duration and intensity of occupation at individual settlements constitutes a crucial first step toward understanding regional population organization, including the formation, relocation, and dissolution of population centers. Our simulations suggest that SPDs based on radiocarbon dates from dwellings, particularly when rescaled relative to dwelling counts, can serve as reasonable proxies for reconstructing settlement occupation histories. They further show that even when only five radiocarbon dates are available per settlement, rescaling significantly reduces both RMSE values and variance in SPD-based reconstructions. This effect is especially critical for large settlements: without rescaling, they may be severely underrepresented, leading to misinterpretations of broader demographic organization (e.g., Settlements 1 and 2 in power-law distributions; see Figures 5 and 6). We do not claim that rescaling inherently “improves” the quality of radiocarbon datasets in reconstructing demographic dynamics that are ultimately unknowable. Rather, we suggest that controlling for sampling heterogeneity through rescaling may generate alternative hypotheses and pose new research questions for future archaeological investigation.

Our case study further demonstrates how sampling heterogeneity can skew empirical interpretations of demographic dynamics. The unrescaled original and rescaled datasets yielded markedly different reconstructions of population reorganization during Baekje’s expansion in proto- and early historical Korea. While they do not produce significantly different overall population trajectories for the Yeongsan and Geum River Basins, the spatial and temporal configurations of population concentrations diverge substantially (see above). These discrepancies are largely attributable to low sample fractions in the largest settlements: only 21 of 1043 dwellings in Taemok-ri (2.01%) and 8 of 448 in Yonggye-dong (1.8%) were dated (Table 1 in Supplementary 2), resulting in substantial underestimation of the size of these population concentrations. The apparent stability in population size reflected in the overall SPD for the Yeongsan River Basin (Figure 9a), despite the underrepresentation of Taemok-ri, is likely due to the relatively high sampling fractions in smaller contemporaneous settlements (for instance, Sinchang-dong, where 10 of 13 dwellings were dated), which inadvertently compensated for the deficit. Although definitive validation remains challenging, the patterns from the rescaled dataset align more closely with interpretations based on Korean pottery chronologies (Cho 2007; Kim 2000, 2007; Lee 2011; Yun 2014). This highlights a critical point: even when overall SPDs appear consistent and robust, uncorrected inter-settlement sampling heterogeneity poses a significant risk of producing fundamentally incongruent archaeological interpretations of spatial reorganization.

Combining radiocarbon dates and dwelling counts to explore population history is not entirely new. Crema and Kobayashi (2020), for example, analyzed a large dataset of pit dwellings from the central Japanese Jomon, using Bayesian models of ceramic phases. In their framework, dwelling counts served as an independent demographic proxy modeled in parallel with radiocarbon-based inference. Our approach, by contrast, differs in both rationale and implementation. We do not treat dwelling counts as a separate proxy but use them as a corrective factor to address inter-settlement sampling heterogeneity, thereby mitigating biases potentially inherent in given datasets. Where sampling intensity is heterogeneous and variation in settlement size and lifespan is central to the research question, rescaling can be applied to datasets prior to formal modeling. Thus, rescaling and modeling are complementary.

Conceptually, our rescaling method shares objectives with binning strategies, though it differs in emphasis (Crema 2022; Shennan et al. 2013; Timpson et al. 2014). Binning seeks to equalize contributions of sites regardless of sample size, reducing overrepresentation of intensively dated loci or periods. It primarily addresses variation in site density (Crema 2022; Shennan et al. 2013; Timpson et al. 2014), proving effective in large-scale studies. In contrast, our method targets relative population size, occupation duration, and historical trajectory at the settlement level. It is particularly suited to finer-scale demographic studies of sedentary societies, where inter-site population dynamics are critical. Despite differences in focus, rescaling and binning can serve distinct yet synergistic roles, with their applicability shaped by the specific research question and analytical scale.

Nonetheless, we acknowledge that rescaling has limitations. Its effectiveness depends on the assumption that dwelling counts are meaningfully related to population size and occupation span, an assumption vulnerable to taphonomic biases, incomplete excavation, multi-household structures, or variability in dwelling use duration. Moreover, the method may be infeasible where dwelling count data are unavailable. Expanding the applicability of rescaling will require alternative proxies, such as settlement area (Downey et al. 2014; Drennan et al. 2015), total house floor area (Kuijt and Marciniak 2024; Naroll 1956), artifact quantity or density (Gallivan 2002; Kohler and Blinman 1987; Ortman and Cooper 2021), or integrated environmental and archaeological indicators, all of which pose interpretive challenges. While our method addresses inter-settlement or project-level sampling heterogeneity, it does not resolve broader structural biases. Differences in survey coverage, excavation strategy, reporting practices, and research agendas can introduce systematic and often invisible distortions (Crema et al. 2022). Future research may incorporate such biases into rescaling frameworks.

7. Final remarks

The continued construction and use of large radiocarbon databases have created significant opportunities for investigating population dynamics in archaeology. However, because these datasets usually comprise samples submitted independently by numerous researchers with diverse aims, research strategies, and focuses, they remain susceptible to various biases, including heterogeneous sampling intensity across projects and sites. Despite commendable and ongoing efforts to improve data hygiene (Bevan and Crema 2021; Bird et al. 2022; Fernández-López de Pablo et al. 2019; Palmisano et al. 2022), such heterogeneity remains a practical challenge when working with aggregated legacy datasets.

In this study, we have proposed a simple method for addressing this issue: rescaling radiocarbon data using settlement-level weighting and bootstrap resampling based on dwelling counts. Our approach aligns with multi-proxy frameworks (e.g., French 2015; Lawrence et al. 2021; Schmidt et al. 2021; Schmidt and Zimmermann 2019), which incorporate diverse datasets and methodologies to assess and mitigate interpretive bias. While our method alone does not eliminate all sources of error, we hope it contributes to improving the reliability and resolution of demographic inferences drawn from radiocarbon dates. By integrating rescaling into broader analytical workflows, archaeologists can produce more robust reconstructions of past population dynamics at both local and regional scales.

Supplementary material

To view supplementary material for this article, please visit https://doi.org/10.1017/RDC.2026.10203

Data accessibility

Data and reproducible code for all analyses discussed in the paper can be found at https://github.com/s3jin33/RescalingRadiocarbonData

Acknowledgments

This work was supported by the Research Grant from Seoul National University (100–20230150). We thank Matthew Conte and an anonymous reviewer for valuable comments on earlier versions of this paper.

Author contributions

Jiyoung Park: writing-original draft, analysis, data curation, investigation.

Sejin Kim: writing-original draft, analysis, investigation, visualization.

Taechang Jo: writing-original draft, methodology, validation.

Jangsuk Kim: conceptualization, writing-original draft, methodology, funding acquisition, supervision.

Declarations of interest

The authors have no conflicts of interest to declare.

References

Aczel, AD (1995) Improved radiocarbon age estimation using the bootstrap. Radiocarbon 37(3), 845–849. doi:10.1017/S0033822200014922.

Aurenche, O, Galet, P, Régagnon-Caroline, E and Évin, J (2001) Proto-Neolithic and Neolithic cultures in the Middle East – the birth of agriculture, livestock raising, and ceramics: A calibrated ¹⁴C chronology 12,500–5500 cal BC. Radiocarbon 43(3), 1191–1202. doi:10.1017/S0033822200038480.

Bamforth, DB and Grund, B (2012) Radiocarbon calibration curves, summed probability distributions, and early Paleoindian population trends in North America. Journal of Archaeological Science 39(6), 1768–1774. doi:10.1016/j.jas.2012.01.017.

Barrier, CR (2017) Town aggregation and abandonment during the era of urban transformations in the Cahokia region: Bayesian modeling of the Washausen mound-town. Journal of Archaeological Science: Reports 11, 523–535. doi:10.1016/j.jasrep.2016.12.027.

Becerra-Valdivia, L, Leal-Cervantes, R, Wood, R and Higham, T (2020) Challenges in sample processing within radiocarbon dating and their impact in ¹⁴C-dates-as-data studies. Journal of Archaeological Science 113, 105043. doi:10.1016/j.jas.2019.105043.

Bevan, A, Colledge, S, Fuller, D, Fyfe, R, Shennan, S and Stevens, C (2017) Holocene fluctuations in human population demonstrate repeated links to food production and climate. Proceedings of the National Academy of Sciences 114(49). doi:10.1073/pnas.1709190114.

Bevan, A and Crema, ER (2021) Modifiable reporting unit problems and time series of long-term human activity. Philosophical Transactions of the Royal Society B: Biological Sciences 376(1816), 20190726. doi:10.1098/rstb.2019.0726.

Binford, SR and Binford, LR (eds) (1969) New perspectives in archeology. Chicago: Aldine.

Birch, J (2012) Coalescent communities: Settlement aggregation and social integration in Iroquoian Ontario. American Antiquity 77(4), 646–670. doi:10.7183/0002-7316.77.4.646.

Birch-Chapman, S and Jenkins, E (2019) A Bayesian approach to calculating Pre-Pottery Neolithic structural contemporaneity for reconstructing population size. Journal of Archaeological Science 112, 105033. doi:10.1016/j.jas.2019.105033.

Bird, D, Miranda, L, Vander Linden, M, Robinson, E, Bocinsky, RK, Nicholson, C, Capriles, JM, Finley, JB, Gayo, EM, Gil, A, d’Alpoim Guedes, J, Hoggarth, JA, Kay, A, Loftus, E, Lombardo, U, Mackie, M, Palmisano, A, Solheim, S, Kelly, RL and Freeman, J (2022) p3k14c, a synthetic global database of archaeological radiocarbon dates. Scientific Data 9(1), 27. doi:10.1038/s41597-022-01118-7.

Blockley, SP, Donahue, RE and Pollard, AM (2000) Radiocarbon calibration and Late Glacial occupation in northwest Europe. Antiquity 74(283), 112–119. doi:10.1017/S0003598X00066199.

Bocquet-Appel, J-P, Demars, P-Y, Noiret, L and Dobrowsky, D (2005) Estimates of Upper Palaeolithic meta-population size in Europe from archaeological data. Journal of Archaeological Science 32(11), 1656–1668. doi:10.1016/j.jas.2005.05.006.

Boserup, E (1965) The Conditions of Agricultural Growth: The Economics of Agrarian Change under Population Pressure. London: Allen & Unwin.

Bronk Ramsey, C (2017) Methods for summarizing radiocarbon datasets. Radiocarbon 59(6), 1809–1833. doi:10.1017/RDC.2017.108.

Brown, GM, Reed, PF and Glowacki, DM (2013) Chacoan and Post-Chaco occupations in the Middle San Juan Region: Changes in settlement and population. KIVA 78(4), 417–448. doi:10.1179/0023194013Z.0000000008.

Brown, WA (2015) Through a filter, darkly: Population size estimation, systematic error, and random error in radiocarbon-supported demographic temporal frequency analysis. Journal of Archaeological Science 53, 133–147. doi:10.1016/j.jas.2014.10.013.

Brown, WA (2017) The past and future of growth rate estimation in demographic temporal frequency analysis: Biodemographic interpretability and the ascendance of dynamic growth models. Journal of Archaeological Science 80, 96–108. doi:10.1016/j.jas.2017.02.003.

Carleton, WC (2021) Evaluating Bayesian radiocarbon-dated event count (REC) models for the study of long-term human and environmental processes. Journal of Quaternary Science 36(1), 110–123. doi:10.1002/jqs.3256.

Carleton, WC and Groucutt, HS (2021) Sum things are not what they seem: Problems with point-wise interpretations and quantitative analyses of proxies based on aggregated radiocarbon dates. The Holocene 31(4), 630–643. doi:10.1177/0959683620981700.

Cho, S-G (2007) Relative Chronological Studies of Pottery of the Dwelling Sites during the Proto-Three Kingdoms Period to Baekje Period in Cheongju Region. Prehistory and Ancient History 26, 41–88. doi:10.23024/pah.2007.26.41. [in Korean]

Codding, BF, Roberts, H, Eckerle, W, Brewer, SC, Medina, ID, Vernon, KB and Spangler, JS (2024) Can we reliably detect adaptive responses of hunter-gatherers to past climate change? Examining the impact of Mid-Holocene drought on Archaic settlement in the Basin-Plateau Region of North America. Quaternary International 689–690, 5–15. doi:10.1016/j.quaint.2023.06.014.

Cohen, MN (1977) Population pressure and the origins of agriculture: An archaeological example from the coast of Peru. In Reed, CA (ed), Origins of Agriculture. De Gruyter Mouton, 135–178. doi:10.1515/9783110813487.135.

Contreras, DA and Meadows, J (2014) Summed radiocarbon calibrations as a population proxy: A critical evaluation using a realistic simulation approach. Journal of Archaeological Science 52, 591–608. doi:10.1016/j.jas.2014.05.030.

Cortell-Nicolau, A, Rivas, J, Crema, ER, Shennan, S, García-Puchol, O, Kolář, J, Staniuk, R and Timpson, A (2025) Demographic interactions between the last hunter-gatherers and the first farmers. Proceedings of the National Academy of Sciences 122(14), e2416221122. doi:10.1073/pnas.2416221122.

Crema, ER (2013) Cycles of change in Jomon settlement: A case study from eastern Tokyo Bay. Antiquity 87(338), 1169–1181. doi:10.1017/S0003598X00049930.

Crema, ER (2022) Statistical inference of prehistoric demography from frequency distributions of radiocarbon dates: A review and a guide for the perplexed. Journal of Archaeological Method and Theory 29(4), 1387–1418. doi:10.1007/s10816-022-09559-5.

Crema, ER and Bevan, A (2021) Inference from large sets of radiocarbon dates: Software and methods. Radiocarbon 63(1), 23–39. doi:10.1017/RDC.2020.95.

Crema, ER, Habu, J, Kobayashi, K and Madella, M (2016) Summed probability distribution of ¹⁴C dates suggests regional divergences in the population dynamics of the Jomon period in eastern Japan. PLOS ONE 11(4). doi:10.1371/journal.pone.0154809.

Crema, ER and Kobayashi, K (2020) A multi-proxy inference of Jōmon population dynamics using Bayesian phase models, residential data, and summed probability distribution of ¹⁴C dates. Journal of Archaeological Science 117, 105136. doi:10.1016/j.jas.2020.105136.

Crema, ER and Shoda, S (2021) A Bayesian approach for fitting and comparing demographic growth models of radiocarbon dates: A case study on the Jomon-Yayoi transition in Kyushu (Japan). PLOS ONE 16(5), e0251695. doi:10.1371/journal.pone.0251695.

Crema, ER, Stevens, CJ and Shoda, S (2022) Bayesian analyses of direct radiocarbon dates reveal geographic variations in the rate of rice farming dispersal in prehistoric Japan. Science Advances 8(38), eadc9171. doi:10.1126/sciadv.adc9171.

Crown, PL (1991) Evaluating the construction sequence and population of Pot Creek Pueblo, northern New Mexico. American Antiquity 56(2), 291–314. doi:10.2307/281420.

Davies, B, Holdaway, SJ and Fanning, PC (2016) Modelling the palimpsest: An exploratory agent-based model of surface archaeological deposit formation in a fluvial arid Australian landscape. The Holocene 26(3), 450–463. doi:10.1177/0959683615609754.

Dewar, RE (1991) Incorporating variation in occupation span into settlement-pattern analysis. American Antiquity 56(4), 604–620. doi:10.2307/281539.

DiNapoli, RJ, Crema, ER, Lipo, CP, Rieth, TM and Hunt, TL (2021) Approximate Bayesian computation of radiocarbon and paleoenvironmental record shows population resilience on Rapa Nui (Easter Island). Nature Communications 12(1), 3939. doi:10.1038/s41467-021-24252-z.

Downey, SS, Bocaege, E, Kerig, T, Edinborough, K and Shennan, S (2014) The Neolithic demographic transition in Europe: Correlation with juvenility index supports interpretation of the summed calibrated radiocarbon date probability distribution (SCDPD) as a valid demographic proxy. PLOS ONE 9(8), e105730. doi:10.1371/journal.pone.0105730.

Drennan, RD, Berrey, CA and Peterson, CE (2015) Regional settlement demography in archaeology. Clinton Corners, New York: Eliot Werner Publications.

Duff, AI and Wilshusen, RH (2000) Prehistoric Population Dynamics in the Northern San Juan Region, A.D. 950–1300. KIVA 66(1), 167–190. doi:10.1080/00231940.2000.11758426.

Duffy, PR (2015) Site size hierarchy in middle-range societies. Journal of Anthropological Archaeology 37, 85–99. doi:10.1016/j.jaa.2014.12.001.

Efron, B (1979) Bootstrap methods: Another look at the Jackknife. The Annals of Statistics 7(1), 1–26. doi:10.1214/aos/1176344552.

Eren, MI, Chao, A, Hwang, W-H and Colwell, RK (2012) Estimating the richness of a population when the maximum number of classes is fixed: A nonparametric solution to an archaeological problem. PLOS ONE 7(5), e34179. doi:10.1371/journal.pone.0034179.

Feinman, GM and Neitzel, JE (2023) The social dynamics of settling down. Journal of Anthropological Archaeology 69, 101468. doi:10.1016/j.jaa.2022.101468.

Fernández-López de Pablo, J, Gutiérrez-Roig, M, Gómez-Puche, M, McLaughlin, R, Silva, F and Lozano, S (2019) Palaeodemographic modelling supports a population bottleneck during the Pleistocene-Holocene transition in Iberia. Nature Communications 10(1), 1872. doi:10.1038/s41467-019-09833-3.

Fletcher, R (1986) Settlement archaeology: World-wide comparisons. World Archaeology 18(1), 59–83. doi:10.1080/00438243.1986.9979989.

Freeman, J, Hard, RJ, Mauldin, RP and Anderies, JM (2021) Radiocarbon data may support a Malthus-Boserup model of hunter-gatherer population expansion. Journal of Anthropological Archaeology 63, 101321. doi:10.1016/j.jaa.2021.101321.

French, JC (2015) The demography of the Upper Palaeolithic hunter–gatherers of Southwestern France: A multi-proxy approach using archaeological data. Journal of Anthropological Archaeology 39, 193–209. doi:10.1016/j.jaa.2015.04.005.

Gajewski, K, Munoz, S, Peros, M, Viau, A, Morlan, R and Betts, M (2011) The Canadian Archaeological Radiocarbon Database (CARD), archaeological ¹⁴C Dates in North America and their paleoenvironmental context. Radiocarbon 53(2), 371–394. doi:10.1017/S0033822200056630.

Gallivan, MD (2002) Measuring Sedentariness and Settlement Population: Accumulations Research in the Middle Atlantic Region. American Antiquity 67(3), 535–557. doi:10.2307/1593825.

Gayo, EM, Latorre, C and Santoro, CM (2015) Timing of occupation and regional settlement patterns revealed by time-series analyses of an archaeological radiocarbon database for the South-Central Andes (16°–25°S). Quaternary International 356, 4–14. doi:10.1016/j.quaint.2014.09.076.

Gyucha, A (2019) Coming Together: Comparative Approaches to Population Aggregation and Early Urbanization. Albany: State University of New York Press. doi:10.1353/book64169.

Haggis, DC (2013) Social organization and aggregated settlement structure in an archaic Greek City on Crete (ca. 600 BC). In Birch, J (ed), From Prehistoric Villages to Cities. New York: Routledge, 63–86.

Hassan, FA (1978) Demographic Archaeology. In Schiffer, MB (ed), Advances in Archaeological Method and Theory. San Diego: Academic Press, 49–103.

Hassan, FA (1981) Demographic Archaeology. New York: Academic Press. doi:10.1016/B978-0-12-624180-8.50010-X.

Heaton, TJ (2022) Non-parametric Calibration of Multiple Related Radiocarbon Determinations and their Calendar Age Summarisation. Journal of the Royal Statistical Society Series C: Applied Statistics 71(5), 1918–1956. doi:10.1111/rssc.12599.

Heaton, TJ, Al-assam, S and Bard, E (2025) A New Approach to Radiocarbon Summarisation: Rigorous Identification of Variations/Changepoints in the Occurrence Rate of Radiocarbon Samples Using a Poisson Process. arXiv:2501.15980v3. Available at https://arxiv.org/abs/2501.15980.

Hill, JB, Clark, JJ, Doelle, WH and Lyons, PD (2004) Prehistoric demography in the southwest: Migration, coalescence, and Hohokam population decline. American Antiquity 69(4), 689–716. doi:10.2307/4128444.

Hill, JN (1970) Broken K Pueblo: Prehistoric social organization in the American Southwest. Tucson: University of Arizona Press.

Hinz, M, Furholt, M, Müller, J, Rinne, C, Raetzel-Fabian, D, Sjögren, K-G and Wotzka, H-P (2012) RADON - Radiocarbon dates online 2012. Central European database of ¹⁴C dates for the Neolithic and the Early Bronze Age. Journal of Neolithic Archaeology. doi:10.12766/jna.2012.65.

Hwang, J (2021) Radiocarbon dating in Korean archaeology. Yongnam Archaeological Review (90), 5–30. doi:10.47417/yar.2021.90.5. [in Korean]

Johnson, GA (1980) Rank-size convexity and system integration: A view from archaeology. Economic Geography 56(3), 234–247. doi:10.2307/142715.

Jørgensen, EK, Pesonen, P and Tallavaara, M (2022) Climatic changes cause synchronous population dynamics and adaptive strategies among coastal hunter-gatherers in Holocene northern Europe. Quaternary Research 108, 107–122. doi:10.1017/qua.2019.86.

Kerr, TR and McCormick, F (2014) Statistics, sunspots and settlement: Influences on sum of probability curves. Journal of Archaeological Science 41, 493–501. doi:10.1016/j.jas.2013.09.002.

Kim, B (1998 [1145]) Samguksagi. Gwacheon. [in Korean]

Kim, J and Seong, C (2022) Final Pleistocene and early Holocene population dynamics and the emergence of pottery on the Korean Peninsula. Quaternary International 608–609, 203–214. doi:10.1016/j.quaint.2020.10.049.

Kim, J, Hwang, J, Kim, JK, Oh, Y, Ahn, S-M, Seong, C, Choi, J and Lee, C (2018) Assessment of Radiocarbon Dates from Korea: Re-measurement of Anomalous Dates. Journal of Korean Archaeological Society 107, 124–165. [in Korean]

Kim, K (2024) Research trends and issues related to the period of Baekje’s annexation of Mahan. The Journal of Korean Ancient History 116, 299–332. doi:10.37331/JKAH.2024.12.116.299. [in Korean]

Kim, SO (2000) A Study on the Chronology of the Mahan Settlements in the Honam Province. Journal of the Honam Archaeological Society 11, 29–77. [in Korean]

Kim, SO (2007) The development of settlements in the Geum River Basin from the Proto-Three Kingdoms to Three Kingdoms Period. Journal of Korean Archaeological Society 65, 4–45. [in Korean]

Kirch, PV and Rallu, J-L (2007) The Growth and Collapse of Pacific Island Societies: Archaeological and Demographic Perspectives. Honolulu: Hawai’i University Press.

Kohler, TA and Blinman, E (1987) Solving mixture problems in archaeology: Analysis of ceramic materials for dating and demographic reconstruction. Journal of Anthropological Archaeology 6(1), 1–28. doi:10.1016/0278-4165(87)90015-8.

Kohler, TA, VanBuskirk, S and Ruscavage-Barz, S (2004) Vessels and villages: Evidence for conformist transmission in early village aggregations on the Pajarito Plateau, New Mexico. Journal of Anthropological Archaeology 23(1), 100–118. doi:10.1016/j.jaa.2003.12.003.

Kohler, TA and Varien, MD (eds) (2012) Emergence and Collapse of Early Villages: Models of Central Mesa Verde Archaeology. Berkeley: University of California Press.

Kolář, J, Macek, M, Tkáč, P and Szabó, P (2016) Spatio-temporal modelling as a way to reconstruct patterns of past human activities. Archaeometry 58(3), 513–528. doi:10.1111/arcm.12182.

Kolb, CC (1985) Demographic estimates in archaeology: Contributions from ethnoarchaeology on Mesoamerican peasants. Current Anthropology 26(5), 581–599. doi:10.1086/203348.

Kowalewski, SA (2014) The work of making community. In Birch, J (ed), From Prehistoric Villages to Cities. New York: Routledge, 201–218.

Kudo, Y (2018) Approach for creating database of the radiocarbon dates published on the archaeological research reports in Japan. Bulletin of the National Museum of Japanese History 212, 251.

Kudo, Y, Sakamoto, M, Hakozaki, M, Stevens, CJ and Crema, ER (2023) An Archaeological Radiocarbon Database of Japan. Journal of Open Archaeology Data 11(11), 1–9. doi:10.5334/joad.115.

Kuijt, I and Marciniak, A (2024) How many people lived in the world’s earliest villages? Reconsidering community size and population pressure at Neolithic Çatalhöyük. Journal of Anthropological Archaeology 74, 101573. doi:10.1016/j.jaa.2024.101573.

Kuzmin, YV and Keates, SG (2005) Dates are not just data: Paleolithic settlement patterns in Siberia derived from radiocarbon records. American Antiquity 70(4), 773–789. doi:10.2307/40035874.

Kwon, O-Y (2001) The change of state formation from Paekcheguk to Paekche. History and Reality 40, 30–56. [in Korean]

Lawrence, D, Palmisano, A and de Gruchy, MW (2021) Collapse and continuity: A multi-proxy reconstruction of settlement organization and population trajectories in the northern Fertile Crescent during the 4.2 kya Rapid Climate Change event. PLOS ONE 16(1), e0244871. doi:10.1371/journal.pone.0244871.

Lee, H (2022) Agricultural Strategy during Baekje’s State Formation Period. Journal of Korean Archaeological Society 2022(2), 479–503. doi:10.47439/JKRAS.2022.2.479. [in Korean]

Lee, YC (2011) Settlement change of the upper region of Youngsan River and the transformation of being Baekje. BaekjeHakbo 6, 107–140. [in Korean]

Longacre, WA (1975) Population Dynamics at the Grasshopper Pueblo, Arizona. Memoirs of the Society for American Archaeology 30, 71–74. doi:10.1017/S0081130000003804.

Manning, K and Timpson, A (2014) The demographic response to Holocene climate change in the Sahara. Quaternary Science Reviews 101, 28–35. doi:10.1016/j.quascirev.2014.07.003.

Manning, SW, Lorentzen, B and Hart, JP (2021) Resolving Indigenous village occupations and social history across the long century of European permanent settlement in Northeastern North America: The Mohawk River Valley ∼1450–1635 CE. PLOS ONE 16(10), e0258555. doi:10.1371/journal.pone.0258555.

McLaughlin, TR (2019) On applications of space–time modelling with open-source ¹⁴C age calibration. Journal of Archaeological Method and Theory 26(2), 479–501. doi:10.1007/s10816-018-9381-3.

Michczyńska, DJ and Pazdur, A (2004) Shape analysis of cumulative probability density function of radiocarbon dates set in the study of climate change in the Late Glacial and Holocene. Radiocarbon 46(2), 733–744. doi:10.1017/S0033822200035773.

Naroll, R (1956) A preliminary index of social development. American Anthropologist 58(4), 687–715. doi:10.1525/aa.1956.58.4.02a00080.

Noh, CK (1987) A study on the state formation process of Baekje. The Review of Korean Studies 10, 95–121.

Oh, Y and Conte, M (2022) ¹⁴C dates as data? Looking at population change in the Bronze Age Central region through summed probability distributions of radiocarbon dates. Archaeology: Journal of the Jungbu Archaeological Society 21(1), 5–41. doi:10.46760/jbgogo.2022.21.1.5. [in Korean]

Oh, Y, Conte, M, Kang, S, Kim, J and Hwang, J (2017) Population fluctuation and the adoption of food production in prehistoric Korea: Using radiocarbon dates as a proxy for population change. Radiocarbon 59(6), 1761–1770. doi:10.1017/RDC.2017.122.

Ortman, SG (2016) Uniform probability density analysis and population history in the northern Rio Grande. Journal of Archaeological Method and Theory 23(1), 95–126. doi:10.1007/s10816-014-9227-6.

Ortman, SG, Cabaniss, AHF, Sturm, JO and Bettencourt, LMA (2015) Settlement scaling and increasing returns in an ancient society. Science Advances 1(1), e1400066. doi:10.1126/sciadv.1400066.

Ortman, SG and Coffey, GD (2017) Settlement scaling in middle-range societies. American Antiquity 82(4), 662–682. doi:10.1017/aaq.2017.42.

Ortman, SG and Cooper, Z (2021) Artifact density and population density in settlement pattern research. Journal of Archaeological Science: Reports 39, 103189. doi:10.1016/j.jasrep.2021.103189.

Palmisano, A, Bevan, A, Lawrence, D and Shennan, S (2022) The NERD dataset: Near East radiocarbon dates between 15,000 and 1,500 cal. yr. BP. Journal of Open Archaeology Data 10(2), 1–9. doi:10.5334/joad.90.

Palmisano, A, Bevan, A and Shennan, S (2017) Comparing archaeological proxies for long-term population patterns: An example from central Italy. Journal of Archaeological Science 87, 59–72. doi:10.1016/j.jas.2017.10.001.

Park, J and Kim, S (2024) Population Dynamics in the Mahan, Baekje, and Hanye Regions from the 1st to 6th Centuries AD. Journal of Korean Ancient Historical Society 124, 213–239. doi:10.18040/sgs.2024.124.213. [in Korean]

Park, J, Wright, DK and Kim, J (2017) Change in settlement distribution and the emergence of an early state: A spatial analysis of radiocarbon dates from Southwestern Korea. Radiocarbon 59(6), 1779–1791. doi:10.1017/RDC.2017.93.

Park, S (2001) The Development of Baekje. Seoul: Seogyeong Press. [in Korean]

Park, S (2007) Some pattern of Early Baekche’s local cemetery continuation in view of localization of periphery by political center. The Journal of Korean Ancient History 48, 155–186. [in Korean]

Parton, P and Clark, G (2022) Using lidar and Bayesian inference to reconstruct archaeological populations in the Kingdom of Tonga. Journal of Archaeological Science: Reports 45, 103610. doi:10.1016/j.jasrep.2022.103610.

Pearson, GW, Pilcher, JR and Baillie, MGL (1983) High-precision ¹⁴C measurement of Irish Oaks to show the natural ¹⁴C variations from 200 BC to 4000 BC. Radiocarbon 25(2), 179–186. doi:10.1017/S0033822200005464.

Peterson, CE and Drennan, RD (2005) Communities, settlements, sites, and surveys: Regional-scale analysis of prehistoric human interaction. American Antiquity 70(1), 5–30. doi:10.2307/40035266.

Petrie, CA and Lynam, F (2020) Revisiting settlement contemporaneity and exploring stability and instability: Case studies from the Indus Civilization. Journal of Field Archaeology 45(1), 1–15. doi:10.1080/00934690.2019.1664848.

Plog, FT (1974) The Study of Prehistoric Change. New York: Academic Press.

Plog, FT (1975) Demographic studies in southwestern prehistory. Memoirs of the Society for American Archaeology 30, 94–103. doi:10.1017/S0081130000003841.

Plog, S and Hantman, JL (1990) Chronology construction and the study of prehistoric culture change. Journal of Field Archaeology 17(4), 439–456. doi:10.2307/530005.

Popescu, GM, Covătaru, C, Opriș, I, Bălășescu, A, Carozza, L, Radu, V, Haită, C, Sava, T, Barton, CM and Lazăr, C (2023) Sine qua non: Inferring Kodjadermen-Gumelnița-Karanovo VI population dynamics from aggregated probability distributions of radiocarbon dates. Radiocarbon 65(2), 463–484. doi:10.1017/RDC.2023.6.

Porčić, M and Nikolić, M (2016) The Approximate Bayesian Computation approach to reconstructing population dynamics and size from settlement data: demography of the Mesolithic-Neolithic transition at Lepenski Vir. Archaeological and Anthropological Sciences 8(1), 169–186. doi:10.1007/s12520-014-0223-2.

Prentiss, WC, Lenert, M, Foor, TA, Goodale, NB and Schlegel, T (2003) Calibrated radiocarbon dating at Keatley Creek: The chronology of occupation at a complex hunter-gatherer village. American Antiquity 68(4), 719–735. doi:10.2307/3557069.

Price, M, Wolfhagen, J and Otárola-Castillo, E (2016) Confidence intervals in the analysis of mortality and survivorship curves in zooarchaeology. American Antiquity 81(1), 157–173. doi:10.7183/0002-7316.81.1.157.

Price, MH, Capriles, JM, Hoggarth, JA, Bocinsky, K, Ebert, CE and Jones, JH (2020) End-to-end Bayesian analysis of ¹⁴C dates reveals new insights into lowland Maya demography. bioRxiv:2020–07. doi:10.1101/2020.07.02.185256.

Redman, CL (1978) The Rise of Civilization: From Early Farmers to Urban Society in the Ancient Near East. San Francisco: W. H. Freeman.

Research Centre of Dolmens in Northeast Asia (2020) Gwangju Yeonsan-dong Sanjeong Site: District 4. Hwasun: Research Centre of Dolmens in Northeast Asia.

Rhode, D, Brantingham, PJ, Perreault, C and Madsen, DB (2014) Mind the gaps: Testing for hiatuses in regional radiocarbon date sequences. Journal of Archaeological Science 52, 567–577. doi:10.1016/j.jas.2014.02.022.

Rick, JW (1987) Dates as data: An examination of the Peruvian Preceramic radiocarbon record. American Antiquity 52(1), 55–73. doi:10.2307/281060.

Ritchie, PM, Ritchie, J, Blake, M, Simons, E and Lepofsky, D (2024) Settling the record: 3,000 years of continuity and growth in a Coast Salish settlement constellation. Journal of Anthropological Archaeology 73, 101570. doi:10.1016/j.jaa.2024.101570.

Ritchison, BT (2020) Using radiocarbon data to chronologically control population density estimates derived from systematically collected intra-settlement distributional data. Radiocarbon 62(6), 1577–1597. doi:10.1017/RDC.2020.107.CrossRef Google Scholar

Ritchison, BT, Doubles, CZ and Meyers, MS (2025) Mind the gap: Modeling Mississippian migration and frontier settlement in southwest Virginia, USA. Journal of Anthropological Archaeology 78, 101664. doi:10.1016/j.jaa.2025.101664.CrossRef Google Scholar

Roberts, JM, Mills, BJ, Clark, JJ, Haas, WR, Huntley, DL and Trowbridge, MA (2012) A method for chronological apportioning of ceramic assemblages. Journal of Archaeological Science 39(5), 1513–1520. doi:10.1016/j.jas.2011.12.022.

Robinson, E, Zahid, HJ, Codding, BF, Haas, R and Kelly, RL (2019) Spatiotemporal dynamics of prehistoric human population growth: radiocarbon “dates as data” and population ecology models. Journal of Archaeological Science 101, 63–71. doi:10.1016/j.jas.2018.11.006.

Schacht, RM (1980) Two models of population growth. American Anthropologist 82(4), 782–798. doi:10.1525/aa.1980.82.4.02a00040.

Schacht, RM (1984) The contemporaneity problem. American Antiquity 49(4), 678–695. doi:10.2307/279736.

Schmidt, I, Gehlen, B, Winkler, K, Arrizabalaga, A, Arts, N, Bicho, N, Crombé, P, Eriksen, BV, Grimm, SB, Kapustka, K, Langlais, M, Mevel, L, Naudinot, N, Nerudová, Z, Niekus, M, Peresani, M, Riede, F, Sauer, F, Schön, W, Sobkowiak-Tabaka, I, Vandendriessche, H, Weber, M-J, Zander, A, Zimmermann, A and Maier, A (2025) Large scale and regional demographic responses to climatic changes in Europe during the Final Palaeolithic. PLOS ONE 20(4), e0310942. doi:10.1371/journal.pone.0310942.

Schmidt, I, Hilpert, J, Kretschmer, I, Peters, R, Broich, M, Schiesberg, S, Vogels, O, Wendt, KP, Zimmermann, A and Maier, A (2021) Approaching prehistoric demography: proxies, scales and scope of the Cologne Protocol in European contexts. Philosophical Transactions of the Royal Society B: Biological Sciences 376(1816), 20190714. doi:10.1098/rstb.2019.0714.

Schmidt, I and Zimmermann, A (2019) Population dynamics and socio-spatial organization of the Aurignacian: Scalable quantitative demographic data for western and central Europe. PLOS ONE 14(2), e0211562. doi:10.1371/journal.pone.0211562.

Scott, DW (2015) Multivariate Density Estimation: Theory, Practice, and Visualization. 2nd edn. Hoboken, New Jersey: John Wiley & Sons. doi:10.1002/9781118575574.

Seong, C and Kim, J (2022) Moving in and moving out: Explaining final Pleistocene-Early Holocene hunter-gatherer population dynamics on the Korean Peninsula. Journal of Anthropological Archaeology 66, 101407. doi:10.1016/j.jaa.2022.101407.

Shennan, S (2008) Population processes and their consequences in Early Neolithic Central Europe. In Bocquet-Appel, J-P and Bar-Yosef, O (eds), The Neolithic Demographic Transition and Its Consequences. Dordrecht: Springer Netherlands, 315–329. doi:10.1007/978-1-4020-8539-0_12.

Shennan, S, Downey, SS, Timpson, A, Edinborough, K, Colledge, S, Kerig, T, Manning, K and Thomas, MG (2013) Regional population collapse followed initial agriculture booms in mid-Holocene Europe. Nature Communications 4(1), 2486. doi:10.1038/ncomms3486.

Shennan, S and Edinborough, K (2007) Prehistoric population history: From the Late Glacial to the Late Neolithic in Central and Northern Europe. Journal of Archaeological Science 34(8), 1339–1345. doi:10.1016/j.jas.2006.10.031.

Shott, MJ (1992) Radiocarbon dating as a probabilistic technique: The Childers site and Late Woodland occupation in the Ohio Valley. American Antiquity 57(2), 202–230. doi:10.2307/280728.

Smith, ME (2023) Urban Life in the Distant Past: The Prehistory of Energized Crowding. Cambridge: Cambridge University Press. doi:10.1017/9781009249027.

Stäuble, H and Hiller, A (1997) An extended prehistoric well field in the Opencast Mine area of Zwenkau, Germany. Radiocarbon 40(2), 721–733. doi:10.1017/S0033822200018671.

Surovell, TA and Brantingham, PJ (2007) A note on the use of temporal frequency distributions in studies of prehistoric demography. Journal of Archaeological Science 34(11), 1868–1877. doi:10.1016/j.jas.2007.01.003.

Surovell, TA, Finley, JB, Smith, GM, Brantingham, PJ and Kelly, R (2009) Correcting temporal frequency distributions for taphonomic bias. Journal of Archaeological Science 36(8), 1715–1724. doi:10.1016/j.jas.2009.03.029.

Tallavaara, M, Pesonen, P and Oinonen, M (2010) Prehistoric population history in eastern Fennoscandia. Journal of Archaeological Science 37(2), 251–260. doi:10.1016/j.jas.2009.09.035.

Timpson, A, Barberena, R, Thomas, MG, Méndez, C and Manning, K (2021) Directly modelling population dynamics in the South American Arid Diagonal using ¹⁴C dates. Philosophical Transactions of the Royal Society B: Biological Sciences 376(1816), 20190723. doi:10.1098/rstb.2019.0723.

Timpson, A, Colledge, S, Crema, E, Edinborough, K, Kerig, T, Manning, K, Thomas, MG and Shennan, S (2014) Reconstructing regional population fluctuations in the European Neolithic using radiocarbon dates: A new case-study using an improved method. Journal of Archaeological Science 52, 549–557. doi:10.1016/j.jas.2014.08.011.

Vander Linden, M and Silva, F (2020) Dispersals as demographic processes: testing and describing the spread of the Neolithic in the Balkans. Philosophical Transactions of the Royal Society B: Biological Sciences 376(1816), 20200231. doi:10.1098/rstb.2020.0231.

Varien, MD, Ortman, SG, Kohler, TA, Glowacki, DM and Johnson, CD (2007) Historical ecology in the Mesa Verde region: Results from the Village Ecodynamics Project. American Antiquity 72(2), 273–299. doi:10.2307/40035814.

Whallon, R (2006) Social networks and information: Non-“utilitarian” mobility among hunter-gatherers. Journal of Anthropological Archaeology 25(2), 259–270. doi:10.1016/j.jaa.2005.11.004.

Wijma, S, Aerts, AT, van der Plicht, J and Zondervan, A (1996) The Groningen AMS facility. Nuclear Instruments and Methods in Physics Research Section B: Beam Interactions with Materials and Atoms 113(1), 465–469. doi:10.1016/0168-583X(95)01420-9.

Williams, AN (2012) The use of summed radiocarbon probability distributions in archaeology: A review of methods. Journal of Archaeological Science 39(3), 578–589. doi:10.1016/j.jas.2011.07.014.

Williams, AN, Ulm, S, Smith, M and Reid, J (2014) AustArch: a database of ¹⁴C and non-¹⁴C ages from archaeological sites in Australia: Composition, compilation and review. Internet Archaeology 36, 1–12. doi:10.11141/ia.36.6.

Wright, DK, Kim, J, Park, J, Yang, J and Kim, J (2020) Spatial modeling of archaeological site locations based on summed probability distributions and hot-spot analyses: A case study from the Three Kingdoms Period, Korea. Journal of Archaeological Science 113, 105036. doi:10.1016/j.jas.2019.105036.

Yun, J (2014) Changes in settlements in following Baekje territorialization in Hoseo Region. Baekje Research Series 59, 1–44. [in Korean]

Figure 1. Boxplots showing RMSE distances from the hypothetical populations, $\mathrm{L}(\mathrm{t})$, to random sample sets, $\mathrm{P}(\mathrm{t})$: (a) 600 years and (b) 1500 years.

Figure 2. Examples of SPD Comparisons: 600 years, Maximum Sample Fraction 30%, 25-year rolling mean applied.

Figure 3. Examples of SPD Comparisons: 1500 years, Maximum Sample Fraction 30%, 50-year rolling mean applied.

Figure 4. Comparisons of RMSE distances from the hypothetical populations, $\mathrm{L}(\mathrm{t})$, to random sample sets, $\mathrm{P}(\mathrm{t})$, weighted sample sets, $\mathrm{W}(\mathrm{t})$, and bootstrap resampled datasets, $\mathrm{R}(\mathrm{t})$.

Table 1. Summary of RMSE: Case Count. (Detailed information is provided in Tables 1 and 2 in Supplementary 1.)

Figure 5. Examples of Settlement SPDs: (a) Power-law & Normal distribution, (b) Power-law & Skewed distribution (600 years, Maximum Sample Fraction 30%, 25-year rolling mean applied). For details of settlements, see Table 2 in Supplementary 1.

Figure 6. Examples of Settlement SPDs: (a) Power-law & Uniform distribution, (b) Power-law & Normal distribution (1500 years, Maximum Sample Fraction 30%, 50-year rolling mean applied). For details of settlements, see Table 2 in Supplementary 1.

Figure 7. Comparisons of RMSE distances from the hypothetical populations, $\mathrm{L}_{\mathrm{i}}(\mathrm{t})$, to random sample sets, $\mathrm{P}_{\mathrm{i}}(\mathrm{t})$, weighted sample sets, $\mathrm{W}_{\mathrm{i}}(\mathrm{t})$, and bootstrapped datasets, $\mathrm{R}_{\mathrm{i}}(\mathrm{t})$, by settlement: (a) 600 years; (b) 1500 years (Maximum Sample Fraction 30%).

Figure 8. Study areas and sites: (a) Locations. (b) The Geum River Basin. (1. Bokryong-dong-1; 2. Bokryong-dong-2; 3. Bongmyeong-dong; 4. Daepyong-ri; 5. Juk-dong; 6. Naseong-ri; 7. Songjeol-dong; 8. Yonggye-dong; 9. Yongho-Hapgang-ri) (c) The Yeongsan River Basin. (1. Dongnim-dong; 2. Hanam-dong; 3. Heukseok-dong; 4. Oseon-dong; 5. Sanjeong-dong; 6. Seonam-dong; 7. Sinchang-dong; 8. Taemok-ri; 9. Yeonsan-dong; 10. Yongdu-dong; 11. Yongsan-dong)

Figure 9. Analytic result of the Yeongsan River Basin. (a) Overall SPD; (b) Probability Distributions of Settlements (Site numbers correspond to Figure 8); (c) KDE analyses over time.

Figure 10. Analytic result of the Geum River Basin. (a) Overall SPD; (b) Probability Distributions of Settlements (Site numbers correspond to Figure 8); (c) KDE analyses over time.

Park et al. supplementary material 1 (File, 2.3 MB)

Park et al. supplementary material 2 (File, 115.5 KB)
