Hostname: page-component-77f85d65b8-jkvpf Total loading time: 0 Render date: 2026-03-27T21:54:45.320Z Has data issue: false hasContentIssue false

Permutation test applied to lexical reconstructions partially supports the Altaic linguistic macrofamily

Published online by Cambridge University Press:  01 June 2021

Alexei S. Kassian*
Affiliation:
School of Advanced Studies in the Humanities, The Russian Presidential Academy of National Economy and Public Administration, Moscow, Russia
George Starostin*
Affiliation:
Institute for Oriental and Classical Studies, National Research University Higher School of Economics, Moscow, Russia Santa Fe Institute, New Mexico, USA
Ilya M. Egorov
Affiliation:
School of Advanced Studies in the Humanities, The Russian Presidential Academy of National Economy and Public Administration, Moscow, Russia
Ekaterina S. Logunova
Affiliation:
Institute of Linguistics, Russian State University for the Humanities, Moscow, Russia
Anna V. Dybo
Affiliation:
Institute of Linguistics, Russian Academy of Sciences, Moscow, Russia
*
*Corresponding authors. E-mails: a.kassian@gmail.com, gstarst1@gmail.com.
*Corresponding authors. E-mails: a.kassian@gmail.com, gstarst1@gmail.com.

Abstract

In this paper, we present the results of our analysis of the 110-item basic wordlists for four reconstructed and one ancient languages, the linguistic ancestors of five language families which are hypothesized to constitute the Altaic (a.k.a. Transeurasian) macrofamily: Proto-Turkic, Proto-Mongolic, Proto-Tungusic, Middle Korean and Proto-Japonic wordlists. Protolanguage wordlists were reconstructed according to strict criteria of semantic reconstruction, based on accurate semantic glossing of forms in daughter languages. Each involved form was encoded into a bi-consonantal CC-shaped sequence using the consonant class method, after which a recently developed weighted permutation test was applied. In a typical situation, our algorithm makes a small number of type 1 errors (false positive), but the number of type 2 errors (false negative) can be substantial. Our main finding is that pairs between the Nuclear Altaic taxa – Turkic, Mongolic and Tungusic – as well as the Turkic-Japonic and Tungusic-Japonic pairs demonstrate significant p-values. In some cases, this can be attributed to either ancient contacts or genealogical relationships, but at least for the Turkic–Japonic pair, a contact scenario is unlikely owing to geographical remoteness.

Information

Type
Research Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution, and reproduction in any medium, provided the original work is properly cited.
Copyright
Copyright © The Author(s), 2021. Published by Cambridge University Press
Figure 0

Figure 1. Modern distribution of the five language families that constitute the Altaic (a.k.a. Transeurasian) macrofamily.

Figure 1

Table 1. Consonant classes

Figure 2

Table 2. Probabilities of phonetic matches between five Altaic wordlists (S. Starostin's Japonic *d-reconstruction) obtained by the weighted permutation test

Figure 3

Table 3. Probabilities of phonetic matches between five Altaic wordlists (‘conservative’ Japonic *y-reconstruction) obtained by the weighted permutation test

Supplementary material: PDF

Kassian et al. supplementary material

Kassian et al. supplementary material 1

Download Kassian et al. supplementary material(PDF)
PDF 865.7 KB
Supplementary material: File

Kassian et al. supplementary material

Kassian et al. supplementary material 2

Download Kassian et al. supplementary material(File)
File 19.3 KB
Supplementary material: File

Kassian et al. supplementary material

Kassian et al. supplementary material 3

Download Kassian et al. supplementary material(File)
File 193.2 KB