Hostname: page-component-6766d58669-mzsfj Total loading time: 0 Render date: 2026-05-19T22:20:17.445Z Has data issue: false hasContentIssue false

Understanding demonstrative reference in text: a new taxonomy based on a new corpus

Published online by Cambridge University Press:  28 March 2022

Alfons Maes*
Affiliation:
Department of Communication and Cognition (TiCC), Tilburg University, Tilburg, Netherlands
Emiel Krahmer
Affiliation:
Department of Communication and Cognition (TiCC), Tilburg University, Tilburg, Netherlands
David Peeters
Affiliation:
Department of Communication and Cognition (TiCC), Tilburg University, Tilburg, Netherlands
*
*Corresponding author. Email: maes@tilburguniversity.edu
Rights & Permissions [Opens in a new window]

Abstract

Endophoric demonstratives such as this and that are among the most frequently used words in written texts. Nevertheless, it remains unclear how exactly they should be subdivided and classified in terms of their different types of use. Here, we develop a new taxonomy of endophoric demonstratives based on a large-scale corpus including three written genres: news items, encyclopedic texts, and book reviews. The taxonomy enables analysts to reliably code endophoric demonstratives based on objectively applicable criteria, while at the same time making them aware of many subtle borderline cases. We consider the taxonomy as a theoretical foundation for future theoretical and empirical work into endophoric demonstratives, and as an analytical tool allowing researchers to unify and compare the results of studies on endophoric demonstratives coming from different genres and languages.

Information

Type
Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution, and reproduction in any medium, provided the original work is properly cited.
Copyright
© The Author(s), 2022. Published by Cambridge University Press
Figure 0

Table 1. Typical examples for each class in our taxonomyIn these and following examples, we have underlined the proposed antecedent or postcedent, presented the critical demonstrative in boldface, and added the source (News, Wikipedia, and Reviews) and the corpus ID number.

Figure 1

Fig. 1. Proposed taxonomy of endophoric demonstrative reference, and the number of demonstratives observed in our corpus per class, after data exclusion (see Section 2)

Figure 2

Table 2. Across the three genres (All), and per genre (News, Wiki, and Reviews), the overall number of words, the number of demonstratives in total (and per 1,000 words between brackets), and the number (and percentages between brackets) of text-based, situation-based, and excluded demonstratives

Figure 3

Table 3. Across the three genres (All), and per genre (News, Wiki, and Reviews) (i) the proportion of situation-based demonstratives for the variables demonstrative type: proximal (vs. distal) and number: singular (vs. plural), (ii) the proportion of text-based demonstratives for the variables demonstrative type: proximal (vs. distal), number: singular (vs. plural); form: pronouns, unmodified NPs, modified NPs, modified elliptic NPs; syntactic function: subject (vs. non-subject); sentence position: initial (vs. non-initial); and type of referent: abstract, concrete, human vs. named human; (iii) the proportion of anaphoric and cataphoric demonstratives for the variables antecedent type: nominal (vs. non-nominal); and referential distance: same sentence (vs. earlier sentence); (iv) the proportion of unmodified NP anaphora with nominal antecedent with either a lexically same (vs. different) head noun.