Hostname: page-component-89b8bd64d-4ws75 Total loading time: 0 Render date: 2026-05-07T05:28:43.538Z Has data issue: false hasContentIssue false

Investigating environmental effects on phonology using diachronic models

Published online by Cambridge University Press:  03 January 2024

Frederik Hartmann
Affiliation:
University of North Texas, Denton, Texas, USA
Seán G. Roberts*
Affiliation:
Cardiff University, Cardiff, UK
Paul Valdes
Affiliation:
University of Bristol, Bristol, UK
Rebecca Grollemund
Affiliation:
University of Missouri-Columbia, Columbia, Missouri, USA
*
Corresponding author: Seán G. Roberts; Email: RobertsS55@cardiff.ac.uk

Abstract

Previous work has proposed various mechanisms by which the environment may affect the emergence of linguistic features. For example, dry air may cause careful control of pitch to be more effortful, and so affect the emergence of linguistic distinctions that rely on pitch such as lexical tone or vowel inventories. Criticisms of these proposals point out that there are both historical and geographic confounds that need to be controlled for. We take a causal inference approach to this problem to design the most detailed test of the theory to date. We analyse languages from the Bantu language family, using a prior geographic–phylogenetic tree of relationships to establish where and when languages were spoken. This is combined with estimates of humidity for those times and places, taken from historical climate models. We then estimate the strength of causal relationships in a causal path model, controlling for various influences of inheritance and borrowing. We find no evidence to support the previous claims that humidity affects the emergence of lexical tone. This study shows how using causal inference approaches lets us test complex causal claims about the cultural evolution of language.

Information

Type
Research Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
Copyright © The Author(s), 2024. Published by Cambridge University Press
Figure 0

Figure 1. Scatter plots of all contemporary Bantu languages in the dataset by the four variables in question and humidity. Red figure: correlation coefficient of the two variables.

Figure 1

Figure 2. Histogram of the distribution of humidity predictor slopes for the simulated datasets under the null model.

Figure 2

Figure 3. Evolution of humidity (left) and tones (right) along the Bantu language family tree.

Figure 3

Figure 4. Two examples of clustering of decedents of two nodes in the phylomorpho space.

Figure 4

Figure 5. Bantu family tree with node importance ranging from blue (less important) to red (more important).

Figure 5

Figure 6. (a) Causal relationships between cultural variables (white circles) at different points in time (T0, T1, T2) and the environment (blue circles) at different geographic locations (L0, L1, L2) via inheritance (black and grey lines, e.g. from Z to Y), borrowing (red lines, e.g. from N1 to Y), and a target effect of the environment on the cultural variable (blue line, e.g. X to Y). (b) A simplification of the graph focused on node Y, with the environmental effects collapsed into one node.

Figure 6

Figure 7. Visualization of the interpolation of tones in neighbouring lineages at a given split time.

Figure 7

Table 1. Variables in best models

Figure 8

Figure 8. Posterior distribution of conditional effects of humidity on the four datasets. Blue, density of posterior samples; dashed curve, prior density; black line segment, 95% highest density interval; black dot, posterior mean.

Figure 9

Table 2. Savage–Dickey density ratios for the humidity effects across the models

Figure 10

Table A1. Posterior summary of the variables by model. The parameter names correspond to the parameters outlined in the model equations in Section 4.3

Figure 11

Table A2. loo comparison with expected log predictive density (ELPD) for analysis of variable tones.

Figure 12

Table A3. loo comparison with ELPD for analysis of variable vowelratio.

Figure 13

Table A4. loo comparison with ELPD for analysis of variable vowels.

Figure 14

Table A5. loo comparison with ELPD for analysis of variable vowels (ASJP).

Figure 15

Figure A1. An example of the nodes with distance 0.02 from a selected node, highlighted in blue.

Supplementary material: File

Hartmann et al. supplementary material 1

Hartmann et al. supplementary material
Download Hartmann et al. supplementary material 1(File)
File 39.7 KB
Supplementary material: File

Hartmann et al. supplementary material 2

Hartmann et al. supplementary material
Download Hartmann et al. supplementary material 2(File)
File 82.7 KB
Supplementary material: File

Hartmann et al. supplementary material 3

Hartmann et al. supplementary material
Download Hartmann et al. supplementary material 3(File)
File 20.9 KB
Supplementary material: File

Hartmann et al. supplementary material 4

Hartmann et al. supplementary material
Download Hartmann et al. supplementary material 4(File)
File 50 MB