Hostname: page-component-77f85d65b8-jkvpf Total loading time: 0 Render date: 2026-03-29T16:31:45.424Z Has data issue: false hasContentIssue false

Getting a (big) data-based grip on ideological change. Evidence from Belgian Dutch

Published online by Cambridge University Press:  14 May 2020

Stefan Grondelaers*
Affiliation:
Centre for Language Studies, Radboud University Nijmegen, Nijmegen, The Netherlands
Dirk Speelman
Affiliation:
Quantitative Lexicology and Variational Linguistics Research Unit, Department of Linguistics, University of Leuven, Leuven, Belgium
Chloé Lybaert
Affiliation:
MULTIPLES - Research Centre for Multilingual Practices and Language Learning in Society, Department of Translation, Interpreting and Communication, Ghent University, Ghent, Belgium
Paul van Gent
Affiliation:
Centre for Language Studies, Radboud University Nijmegen, Nijmegen, The Netherlands
*
Author for correspondence: Stefan Grondelaers, Email: s.grondelaers@let.ru.nl
Rights & Permissions [Opens in a new window]

Abstract

In this paper we introduce a computationally enriched experimental tool designed to investigate language ideology (change). In a free response experiment, 211 respondents returned three adjectives in reaction to the labels for five regional varieties, one ethnic variety and two supra-regional varieties of Belgian Dutch, as well as the standard accent of Netherlandic Dutch. Valence information (pertaining to the positive/negative character of the responses) and big data–based distributional analysis (to detect semantic similarity between the responses) were used to cluster the response adjectives into 11 positive and 11 negative evaluative dimensions. Correspondence analysis was subsequently used to compute and visualize the associations between these evaluative dimensions and the investigated language labels, in order to generate “perceptual maps” of the Belgian language repertoire. Contrary to our expectations, these maps unveiled not only the dominant value system which drives standard usage, but also the competing ideology which frames the increasingly occurring non-standard forms. In addition, they revealed a much richer stratification than the “one variety good, all other varieties bad” dichotomy we had anticipated: while VRT-Dutch remains the superior (albeit increasingly virtual) standard for Belgian Dutch, the stigmatized colloquial variety Tussentaal is gradually being accepted as a practical lingua franca, and the Ghent-accent is boosted by modern prestige (dynamism) features. Even more crucially, separate perceptual maps for the older and younger respondents lay bare generational change: there is a growing conceptual proximity between VRT-Dutch and Tussentaal in the younger perceptions.

Information

Type
Articles
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution, and reproduction in any medium, provided the original work is properly cited.
Copyright
© The Author(s) 2020. Published by Cambridge University Press
Figure 0

Table 1. Frequency, unicity, and valence of top 10 adjectives by variety label

Figure 1

Table 2. Token/type ratio and weighted unicity by variety label

Figure 2

Table 3. Valence per variety by participant group

Figure 3

Table 4. Low valence (negative) and high valence (positive) clusters

Figure 4

Figure 1. Biplot of correspondence analysis on the full dataset, with 8 cases (variety labels) and 22 variables (11 positive & 11 negative adjective clusters).

Figure 5

Figure 2. Biplots of three-dimensional correspondence analysis on the full dataset, with 8 cases (variety labels) and 22 variables (11 positive & 11 negative adjective clusters), depicted from four angles.

Figure 6

Figure 3. Biplot of correspondence analysis on data produced by older respondents, with 8 cases (variety labels) and 22 variables (11 positive & 11 negative adjective clusters).

Figure 7

Figure 4. Biplot of correspondence analysis on data produced by younger respondents, with 8 cases (variety labels) and 22 variables (11 positive & 11 negative adjective clusters).

Figure 8

Figure 5. Clustering dendrogram based on association profiles for eight variety labels (older respondents).

Figure 9

Figure 6. Clustering dendrogram based on association profiles for eight variety labels (younger respondents).