Hostname: page-component-89b8bd64d-z2ts4 Total loading time: 0 Render date: 2026-05-12T13:38:33.975Z Has data issue: false hasContentIssue false

Classifying evolutionary forces in language change using neural networks

Published online by Cambridge University Press:  16 October 2020

Folgert Karsdorp
Affiliation:
Royal Netherlands Academy of Arts and Sciences, Meertens Institute, Amsterdam, The Netherlands
Enrique Manjavacas
Affiliation:
Department of Literature, University of Antwerp, Antwerp, Belgium
Lauren Fonteyn
Affiliation:
Leiden University Centre for Linguistics, Leiden University, Leiden, The Netherlands
Mike Kestemont
Affiliation:
Department of Literature, University of Antwerp, Antwerp, Belgium

Abstract

A fundamental problem in research into language and cultural change is the difficulty of distinguishing processes of stochastic drift (also known as neutral evolution) from processes that are subject to selection pressures. In this article, we describe a new technique based on deep neural networks, in which we reformulate the detection of evolutionary forces in cultural change as a binary classification task. Using residual networks for time series trained on artificially generated samples of cultural change, we demonstrate that this technique is able to efficiently, accurately and consistently learn which aspects of the time series are distinctive for drift and selection, respectively. We compare the model with a recently proposed statistical test, the Frequency Increment Test, and show that the neural time series classification system provides a possible solution to some of the key problems associated with this test.

Information

Type
Methods Paper
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution, and reproduction in any medium, provided the original work is properly cited.
Copyright
Copyright © The Author(s), 2020. Published by Cambridge University Press on behalf of Evolutionary Human Sciences
Figure 0

Figure 1. Time series generated with the Wright–Fisher model with increasing selection coefficients β. For each selection strength value β, we simulated 1000 time series of 200 generations (100 shown). Starting frequency values were set to 0.5. The top row displays the results for the FIT, and the bottom row shows the results of applying the time series classifier. Grey lines indicate correct classifications, yellow lines mark incorrect classifications and blue lines indicate inconclusive cases for which the normality assumption is not met. The accuracy scores for the FIT are computed by excluding non-normally distributed time series. The percentage of time series for which the test is inapplicable is given.

Figure 1

Figure 2. Interaction between selection coefficients and number of bins. For each unique combination of selection coefficient and number of bins, we simulate 1000 time series using the Wright–Fisher model, and compute their mean error rate. Simulations were run for 200 generations. The simulated time series were subsequently binned according to the specified number of bins. Subplot (A) displays the FIT results. Subplot (B) plots the same results but masking (a) all samples violating the normality assumption and (b) all samples with too few data points after adjusting for absorption events (white colour). The results for the time series classifier are shown in subplot (C). The colour bar at the far right of the plot functions as a legend of the error rate values.

Figure 2

Figure 3. The impact of binning on the false-positive rate of the FIT and the time series classifier. With β set to 0 (i.e. stochastic drift), we simulate 1000 time series for each number of bins using the Wright–Fisher model. Simulations were run for 200 generations. The plot shows the mean error-rate for each bin number (solid lines), as well as the 95% confidence interval (shaded area) which was computed using a bootstrap procedure.

Figure 3

Figure 4. Relative frequency of past tense variants over time. The y-axis displays the fraction of regular variants of a particular verb. The variants of five verbs (dreamt–dreamed, lit–lighted, snuck–sneaked, spillt–spilled, spoilt–spoiled) are highlighted.

Figure 4

Figure 5. Results of applying the FIT and the TSC to the verb time series. The upper panel displays the results for the FIT with a variable-width binning strategy, the middle the results for a fixed-width binning strategy and the bottom panel those for the TSC. Circles indicate that the normality assumption of the t-test is met (according to a Shapiro–Wilk test with a threshold of p ≥ 0.1), while squares indicate that it is not met (p < 0.1). The colouring of the circles or squares corresponds to the results of the FIT. Unfilled items correspond to a FIT p-values of ≥0.2. Blue items have a p-values of <0.2, and yellow items have a p-value <0.05. The bottom panel shows the results for the TSC. Items with a probability greater than 0.5 to be generated through selection are coloured yellow; the others are unfilled. The bottom rows underneath each panel display little pie charts, which provide information about the classifying consistency of the two systems across the different binning strategies. The colour black marks neutral, stochastic drift (with a FIT p-value ≥0.2 and a TSC probability value ≤0.5). Blue is reserved for the FIT results and corresponds to time series with a p-value smaller than 0.2. Yellow is used for time series with a corresponding FIT p-value of <0.05, as well as for series with a TSC probability value >0.5. Finally, white is used for time series that violate the normality assumption.

Figure 5

Table 1. Overview of potential problems with detecting evolutionary forces in language change (and cultural change in general). Unsolved problems are marked as ×; problems solved with the time series classification task are marked as ✓. Problems in need of more research are marked as .

Supplementary material: PDF

Karsdorp et al. Supplementary Materials

Karsdorp et al. Supplementary Materials

Download Karsdorp et al. Supplementary Materials(PDF)
PDF 868 KB