Hostname: page-component-89b8bd64d-dvtzq Total loading time: 0 Render date: 2026-05-05T17:35:15.575Z Has data issue: false hasContentIssue false

Ranking Dutch intensifiers: a usage-based approach

Published online by Cambridge University Press:  28 February 2020

MICHAEL RICHTER
Affiliation:
Department of Computer Science, Natural Language Processing, Leipzig University
ROELAND VAN HOUT
Affiliation:
Centre for Language Studies, Radboud University Nijmegen
Rights & Permissions [Opens in a new window]

Abstract

The present approach estimates the strength of intensifiers in Dutch by computing their information values in a language corpus, that is, contextual information content (Cohen Priva, 2008; Piantadosi, Tily, & Gibson, 2011) and Shannon Information (Shannon & Weaver, 1948), to respectively explain the use value and the expressive value of intensifiers when they intensify a predicative adjective. Conflicting strength values help in understanding the high number of intensifiers commonly available in particular languages and the constant need for adding new ones. Our approach underlines the relevance of two measures of information content (IC) for ranking intensifiers: (i) IC in context: the more combinatorial or transitional options an intensifier has, the higher its contextual information content and consequently its use value; and (ii) IC in relation to all alternative intensifiers: the higher the surprisal value that the occurrence of an intensifier evokes, the higher its expressive value. We shall investigate the validity of these two measures by researching a large corpus of Dutch tweets and shall test whether the values of these two measures can predict the stacking order in sequences of intensifiers.

Information

Type
Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution, and reproduction in any medium, provided the original work is properly cited.
Copyright
© UK Cognitive Linguistics Association, 2020
Figure 0

TABLE 1. Transition probabilities of echt ‘really’

Figure 1

TABLE 2. Transition probabilities of erg ‘very’

Figure 2

Fig. 1. Scatterplot of variables ICTRANS and ICLOCAL. The exceptional position of echt ‘really’ and zo ‘so’ is clearly visible. The point cloud at the bottom right consists of intensifiers with identical ICLOCAL and ICTRANS values. This is shown by connecting lines with identical origin.

Figure 3

TABLE 3. ICTRANS values are predicted to decrease and ICLOCAL values are predicted to increase the closer an intensifier is to the adjective; ’yes’ means that the two values involved have the predicted order, ‘no’ means a violation

Figure 4

TABLE 4. Nine triplets violating decreasing ICTRANS values in triplets of intensifiers

Figure 5

TABLE 5. Six triplets violating decreasing ICLOCAL values in triplets of intensifiers