Hostname: page-component-89b8bd64d-5bvrz Total loading time: 0 Render date: 2026-05-07T18:19:26.325Z Has data issue: false hasContentIssue false

Tracing contact and migration in pre-Bantu Southern Africa through lexical borrowing

Published online by Cambridge University Press:  23 July 2025

Anne-Maria Fehn*
Affiliation:
CIBIO, Centro de Investigação em Biodiversidade e Recursos Genéticos, InBIO Laboratório Associado, Campus de Vairão, Universidade do Porto, Porto, Portugal Biopolis Program in Genomics, Biodiversity and Land Planning, CIBIO, Campus de Vairão, Vairão, Portugal
Bonny E. Sands
Affiliation:
Department of English, Northern Arizona University, Flagstaff, AZ, USA
Admire Phiri
Affiliation:
CIBIO, Centro de Investigação em Biodiversidade e Recursos Genéticos, InBIO Laboratório Associado, Campus de Vairão, Universidade do Porto, Porto, Portugal Biopolis Program in Genomics, Biodiversity and Land Planning, CIBIO, Campus de Vairão, Vairão, Portugal Department of Linguistics, University of the Free State, Bloemfontein, South Africa
Maitseo Bolaane
Affiliation:
San Research Centre, University of Botswana, Gaborone, Botswana
Gaseitsiwe Masunga
Affiliation:
Okavango Research Institute, University of Botswana, Maun, Botswana
Ezequiel Fabiano
Affiliation:
CIBIO, Centro de Investigação em Biodiversidade e Recursos Genéticos, InBIO Laboratório Associado, Campus de Vairão, Universidade do Porto, Porto, Portugal Biopolis Program in Genomics, Biodiversity and Land Planning, CIBIO, Campus de Vairão, Vairão, Portugal University of Namibia, Windhoek, Namibia
Jorge Rocha
Affiliation:
CIBIO, Centro de Investigação em Biodiversidade e Recursos Genéticos, InBIO Laboratório Associado, Campus de Vairão, Universidade do Porto, Porto, Portugal Biopolis Program in Genomics, Biodiversity and Land Planning, CIBIO, Campus de Vairão, Vairão, Portugal Departamento de Biologia, Faculdade de Ciências, Universidade do Porto, Porto, Portugal
*
Corresponding author: Anne-Maria Fehn; Email: afehn@cibio.up.pt

Abstract

Lexical borrowing may provide valuable clues about the sociohistorical context of language contact. Here we explore patterns of vocabulary transfer between languages from three families (Kx’a, Tuu, Khoe-Kwadi) comprising the linguistic unit commonly referred to as Southern African Khoisan. In our data set, 20% of 1,706 roots are shared between at least two families. By applying a carefully chosen set of linguistic and extralinguistic criteria, we were able to trace the origin of 71% of shared roots, with the remaining 29% constituting good candidates for ancient contact or shared common ancestry of the forager families Kx’a and Tuu. More than half of the shared roots for which an origin could be determined trace back to Khoe-Kwadi and were borrowed into languages of other families within two major confluence zones with different sociohistorical profiles: (i) the Central Kalahari characterized by egalitarian interaction between languages of all three families and (ii) the southern and south-western Kalahari Basin fringes showing unilateral transfer from Khoe-Kwadi-speaking herders into resident forager groups. The findings of this study complement genetic and archaeological research on southern Africa and testify to the value of linguistics in the multidisciplinary inference of contact and migration scenarios.

Information

Type
Research Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution, and reproduction in any medium, provided the original work is properly cited.
Copyright
© The Author(s), 2025. Published by Cambridge University Press.
Figure 0

Figure 1. Map showing the approximate geographic origin of each data set used in the present study. Individual data sets are identified by language family and number as referenced in Supplementary Table S1. The legend groups doculects into language families and widely accepted subgroups (cf. Fehn & Rocha, 2023; Güldemann, 2014a; Heine & Honken, 2010; Heine & König, 2015; Vossen, 1997).

Figure 1

Figure 2. Language relationships as assessed with the GeneContent distance (Gray et al., 2010); both the heatmap based on linguistic similarity (a) and the NeighborNet (b) clearly identify the three families Kx’a (green), Tuu (red) and Khoe-Kwadi (blue), as well as their major subgroups. Lexical sharing between the three families is evident as reticulation in the centre of the network.

Figure 2

Figure 3. (a) Bayesian phylogenetic trees used to identify major language clusters within each family. (b) Proportion of roots with different sharing patterns in each language cluster. (c) Percentage of roots that were transferred from one’s own family to other families (top bars) or that originated in other families; that is, loanwords (bottom bars). All proportions refer to fractions of total roots per language cluster. The horizontal bars below the bar plots indicate the language family: Kx’a (green), Tuu (red) and Khoe-Kwadi (blue).

Figure 3

Figure 4. Borrowing profiles of Kx’a, Tuu and Khoe-Kwadi languages. The size of the pie chart corresponds to the percentage of borrowed vocabulary, as indicated for each figure. (a) shows the composition of the borrowed vocabulary in each language; (b) shows Kx’a borrowings in languages from the Tuu and Khoe-Kwadi families, broken down to the highest resolution level that could be obtained for each root (Proto-Kx’a, Proto-!Xun, Proto-ǂ’Amkoe); (c) shows Khoe-Kwadi borrowings in languages from the Kx’a and Tuu families, broken down to the highest resolution level that could be obtained for each root (Proto-Khoe-Kwadi, Proto-Khoe, Proto-Kalahari Khoe, Proto-Khoekhoe); (d) shows Tuu borrowings in languages from the Kx’a and Khoe-Kwadi families, broken down to the highest resolution level that could be obtained for each root (Proto-Tuu, Proto-Taa-Lower Nossob, Proto-Taa).

Figure 4

Figure 5. (a) Percentages of roots with unidentified origin in each language cluster, according to their sharing patterns. (b) Pie chart decomposing roots with unidentified origin shared between Kx’a and Tuu, and between Kx’a, Tuu and Khoe-Kwadi (n=70), according to whether they are shared: (i) between geographically unconnected units (inheritance or ancient contact); (ii) areally between neighbours (areal contact) or (iii) in an unresolvable setting (unresolved); (c) map showing the geographic distribution of the categories shown in (B) across our Kx’a and Tuu data set. The size of the pie chart corresponds to the percentage of shared roots with unidentified origins in each language cluster, according to the legend.

Figure 5

Figure 6. Hypothetical migration routes of Kwadi, Kalahari Khoe and Khoekhoe speakers, out of a core on the western Kalahari Basin fringe where Proto-Khoe-Kwadi and Proto-Khoe emerged (in blue). Historically attested Kx’a- and Tuu-speaking areas are indicated in green and red, respectively. The micro-contact zone of the Central Kalahari where languages of all three families exchange loanwords is encircled by a dashed line.

Supplementary material: File

Fehn et al. supplementary material 1

Fehn et al. supplementary material
Download Fehn et al. supplementary material 1(File)
File 9.3 MB
Supplementary material: File

Fehn et al. supplementary material 2

Fehn et al. supplementary material
Download Fehn et al. supplementary material 2(File)
File 641.2 KB