Hostname: page-component-77c78cf97d-d2fvj Total loading time: 0 Render date: 2026-04-24T08:55:14.366Z Has data issue: false hasContentIssue false

Variation and change in the productivity of be going to V in the Corpus of Historical American English, 1810–2009

Published online by Cambridge University Press:  25 June 2025

Tanja Säily*
Affiliation:
Department of Languages, University of Helsinki , PO Box 24, 00014 Helsinki, Finland
Florent Perek
Affiliation:
Department of English Language and Linguistics, University of Birmingham , Edgbaston, Birmingham B15 2TT, UK
Jukka Suomela
Affiliation:
Department of Computer Science, Aalto University, PO Box 15400, 00076 Aalto, Finland
*
Corresponding author: Tanja Säily; Email: tanja.saily@helsinki.fi
Rights & Permissions [Opens in a new window]

Abstract

This article focuses on the poster child of grammaticalization, be going to V. First expressing ‘motion with intention’, in Early Modern English the construction came to signify ‘motionless intention’. The grammaticalization process continued in Late Modern English with subjectification, so that ‘intention’ was gradually replaced by ‘prediction’. We study the process from Late Modern to Present-Day English in the 200-million-word fiction section of the Corpus of Historical American English (COHA), 1810–2009, for which we have gender metadata on the authors. We focus on the productivity of the construction by comparing type frequencies, i.e., the number of different verbs following be going to. Our research questions are how the grammaticalization is reflected in the productivity of the construction, and whether the social factor of gender played a role in the process. We study the internal factors of mental verbs, inanimate subjects and passive voice; to this end, we use robust statistical methods to compare type frequencies and proportions of types over time. We also investigate the semantics of the verb types by drawing on techniques from distributional semantics. Our wider aim is to enrich the cognitively oriented theory of Construction Grammar with insights from historical sociolinguistics.

Information

Type
Research Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2025. Published by Cambridge University Press
Figure 0

Table 1. Size of the dataset by decade and author gender. Total word counts include authors of unknown gender

Figure 1

Figure 1. The user interface of types3 on macOS

Figure 2

Figure 2. Change in the type frequency of begoing to V by the number of running words in COHA novels

Figure 3

Figure 3. Gender variation and change in the type frequency of begoing to V by the number of running words in COHA novels

Figure 4

Figure 4. Change in the type frequency of begoing to V by the number of tokens of the construction in COHA novels

Figure 5

Figure 5. Gender variation and change in the type frequency of begoing to V by the number of tokens of the construction in COHA novels

Figure 6

Figure 6. Gender variation and change in the proportion of mental verb types in the begoing to V construction in COHA novels

Figure 7

Figure 7. Gender variation and change in the proportion of verb types used with the inanimate subject it in the begoing to V construction in COHA novels

Figure 8

Figure 8. Gender variation and change in the proportion of verb types used with passive voice in the begoing to V construction in COHA novels

Figure 9

Figure 9. Change in the proportion of verb types used with passive voice in the begoing to V construction in COHA novels

Figure 10

Table 2. Change in the semantic distribution of be going to V for each gender, as measured by Pearson’s ρ

Figure 11

Table 3. Comparison of the semantic distribution of be going to V between genders over time, as measured by Pearson’s ρ

Figure 12

Figure 10. Variation in type proportions in each class over time, for men (blue) vs women (red). Samples are taken from four periods: 1820–59, 1860–99, 1900–39, 1940–79

Figure 13

Table 4. Semantic clusters of verbs chosen for the analysis