Hostname: page-component-89b8bd64d-ksp62 Total loading time: 0 Render date: 2026-05-06T23:05:17.930Z Has data issue: false hasContentIssue false

A multilingual corpus study of the competition between past and perfect in narrative discourse

Published online by Cambridge University Press:  06 October 2021

MARTIJN VAN DER KLIS
Affiliation:
Utrecht University, Trans 10, 3512 JK Utrecht, The Netherlands m.h.vanderklis@uu.nl
BERT LE BRUYN
Affiliation:
Utrecht University, Trans 10, 3512 JK Utrecht, The Netherlands b.s.w.lebruyn@uu.nl
HENRIËTTE DE SWART
Affiliation:
Utrecht University, Trans 10, 3512 JK Utrecht, The Netherlands h.deswart@uu.nl
Rights & Permissions [Opens in a new window]

Abstract

The western European present perfect is subject to substantial crosslinguistic variation. The literature, however, focuses on individual languages or on comparisons of a restricted number of languages. We piece together the puzzle and do so in a data-driven way by comparing the use of the present perfect through a parallel corpus based on the French novel L’Étranger and its translations in Italian, German, Dutch, European Spanish, British English, and Modern Greek. We introduce and showcase Translation Mining, a software suite combining a parallel corpus database with annotation and analysis tools. Translation Mining allows us to generate descriptive statistics of tense use across languages but also to visualize variation through its multidimensional scaling component and to link the variation we find to the underlying data through its integrated setup. We confirm that the present perfect competes with the past and we reveal the fine-grained scalar nature of the variation. To complete the puzzle, we ascertain the dimensions of variation, ranging from lexical and compositional semantics to dynamic semantics and pragmatics.1

Information

Type
Research Article
Creative Commons
Creative Common License - CCCreative Common License - BYCreative Common License - NCCreative Common License - SA
This is an Open Access article, distributed under the terms of the Creative Commons Attribution-NonCommercial-ShareAlike licence (https://creativecommons.org/licenses/by-nc-sa/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the same Creative Commons licence is included and the original work is properly cited. The written permission of Cambridge University Press must be obtained for commercial re-use.
Copyright
© The Author(s), 2021. Published by Cambridge University Press
Figure 0

Figure 1 (colour online) The TimeAlign annotation interface.

Figure 1

Table 1 Correspondences between language-specific verb forms and tense-aspect categories. The ‘other’ category lists forms that are not classified in one of the main tense-aspect categories, but do appear as forms in the current dataset.

Figure 2

Table 2 Inventory of the verb forms used in the translations of the 348 instances of the Passé Composé in L’Étranger.

Figure 3

Figure 2 (colour online) Raw MDS output based on the 7-tuples of our 348 datapoints.

Figure 4

Figure 3 (colour online) Temporal maps displaying perfect and past use per language.

Figure 5

Figure 4 Subset relation in the distribution of the perfect.

Figure 6

Figure 5 (colour online) The interactive Translation Mining interface.