Hostname: page-component-77f85d65b8-8wtlm Total loading time: 0 Render date: 2026-03-30T08:11:36.466Z Has data issue: false hasContentIssue false

Rescaling radiocarbon data: A method for addressing inter-site sampling heterogeneity in reconstructing population history

Published online by Cambridge University Press:  24 March 2026

Jiyoung Park
Affiliation:
Department of Archaeology and Art History, Seoul National University, Republic of Korea
Sejin Kim
Affiliation:
Department of Archaeology and Art History, Seoul National University, Republic of Korea
Taechang Jo
Affiliation:
Department of Mathematics, Inha University, Republic of Korea
Jangsuk Kim*
Affiliation:
Department of Archaeology and Art History, Seoul National University, Republic of Korea
*
Corresponding author: Jangsuk Kim; Email: jangsuk@snu.ac.kr
Rights & Permissions [Opens in a new window]

Abstract

Radiocarbon dates have become a cornerstone in archaeological reconstructions of past population dynamics. The increasing reliance on large-scale radiocarbon databases, usually aggregated from diverse sources, however, raises persistent concerns about sampling bias, especially heterogeneous sampling intensity across sites. In this paper, we introduce a rescaling method that adjusts the frequency of dates in radiocarbon datasets in proportion to dwelling counts at the settlement level, using weighting and bootstrap resampling. Through a series of simulations, we show that this approach consistently yields probability distributions that more closely reflect hypothetical population trends, particularly in contexts with high inter-settlement variability in sampling intensity. We apply our method to archaeological data from two areas in Korea, the Yeongsan and Geum River Basins, during the Proto–Three Kingdoms (1C BC–AD 3C) and Three Kingdoms Periods (AD 4–7C). Results demonstrate that rescaled datasets offer significantly different interpretations of population organization and reconfiguration than those derived from original data. This study highlights the importance of addressing sampling heterogeneity in local-scale demographic research and suggests that rescaling is a valuable complement to existing bias-correction strategies in archaeological studies of demography.

Information

Type
Research Article
Creative Commons
Creative Common License - CCCreative Common License - BYCreative Common License - NCCreative Common License - ND
This is an Open Access article, distributed under the terms of the Creative Commons Attribution-NonCommercial-NoDerivatives licence (https://creativecommons.org/licenses/by-nc-nd/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided that no alterations are made and the original article is properly cited. The written permission of Cambridge University Press or the rights holder(s) must be obtained prior to any commercial use and/or adaptation of the article.
Copyright
© The Author(s), 2026. Published by Cambridge University Press on behalf of University of Arizona
Figure 0

Figure 1. Boxplots showing RMSE distances from the hypothetical populations, ${\rm{L}}\left( {\rm{t}} \right)$, to random sample sets, ${\rm{P}}\left( {\rm{t}} \right)$: (a) 600 years and (b) 1500 years.

Figure 1

Figure 2. Examples of SPD Comparisons: 600 years, Maximum Sample Fraction 30%, 25-year rolling mean applied.

Figure 2

Figure 3. Examples of SPD Comparisons: 1500 years, Maximum Sample Fraction 30%, 50-year rolling mean applied.

Figure 3

Figure 4. Comparisons of RMSE distances from the hypothetical populations, ${\rm{L}}\left( {\rm{t}} \right)$, to random sample sets, ${\rm{P}}\left( {\rm{t}} \right)$, weighted sample sets, ${\rm{W}}\left( {\rm{t}} \right)$, and bootstrap resampled datasets, ${\rm{R}}\left( {\rm{t}} \right)$.

Figure 4

Table 1. Summary of RMSE: Case Count. (Detailed information is provided in Tables 1 and 2 in Supplementary 1)

Figure 5

Figure 5. Examples of Settlement SPDs: (a) Power-law & Normal distribution, (b) Power-law & Skewed distribution (600 years, Maximum Sample Fraction 30%, 25-year rolling mean applied). For details of settlements, see Table 2 in Supplementary 1.

Figure 6

Figure 6. Examples of Settlement SPDs: (a) Power-law & Uniform distribution, (b) Power-law & Normal distribution (1500 years, Maximum Sample Fraction 30%, 50-year rolling mean applied). For details of settlements, see Table 2 in Supplementary 1.

Figure 7

Figure 7. Comparisons of RMSE distances from the hypothetical populations, ${{\rm{L}}_{\rm{i}}}\left( {\rm{t}} \right)$, to random sample sets, ${{\rm{P}}_{\rm{i}}}\left( {\rm{t}} \right)$, weighted sample sets, ${{\rm{W}}_{\rm{i}}}\left( {\rm{t}} \right)$ and bootstrapped datasets, ${{\rm{R}}_{\rm{i}}}\left( {\rm{t}} \right)$ by settlement: (a) 600 years (b) 1500 years (Maximum Sample Fraction 30%)

Figure 8

Figure 8. Study areas and sites: (a) Locations. (b) The Geum River Basin. (1. Bokryong-dong-1; 2. Bokryong-dong-2; 3. Bongmyeong-dong; 4. Daepyong-ri; 5. Juk-dong; 6. Naseong-ri; 7. Songjeol-dong; 8. Yonggye-dong; 9. Yongho-Hapgang-ri) (c) The Yeongsan River Basin. (1. Dongnim-dong; 2. Hanam-dong; 3. Heukseok-dong; 4. Oseon-dong; 5. Sanjeong-dong; 6. Seonam-dong; 7: Sinchang-dong; Taemok-ri; 9: Yeonsan-dong; 10. Yongdu-dong; 11. Yongsan-dong)

Figure 9

Figure 9. Analytic result of the Yeongsan River Basin. (a) Overall SPD; (b) Probability Distributions of Settlements (Site numbers correspond to Figure 8); (c) KDE analyses over time.

Figure 10

Figure 10. Analytic result of the Geum River Basin. (a) Overall SPD; (b) Probability Distributions of Settlements (Site numbers correspond to Figure 8); (c) KDE analyses over time.

Supplementary material: File

Park et al. supplementary material 1

Park et al. supplementary material
Download Park et al. supplementary material 1(File)
File 2.3 MB
Supplementary material: File

Park et al. supplementary material 2

Park et al. supplementary material
Download Park et al. supplementary material 2(File)
File 115.5 KB