Hostname: page-component-89b8bd64d-72crv Total loading time: 0 Render date: 2026-05-07T09:55:16.143Z Has data issue: false hasContentIssue false

Isomorphism-inspired theorising about optionality and variation: no empirical support from English grammar

Published online by Cambridge University Press:  02 May 2025

Ruiming Ma*
Affiliation:
Department of Linguistics, KU Leuven, Blijde-Inkomststraat 21, PO Box 3308, 3000, Leuven, Belgium
Thomas Van Hoey
Affiliation:
Department of Linguistics, KU Leuven, Blijde-Inkomststraat 21, PO Box 3308, 3000, Leuven, Belgium FWO, KU Leuven, Blijde-Inkomststraat 21, PO Box 3308, 3000, Leuven, Belgium
Benedikt Szmrecsanyi
Affiliation:
Department of Linguistics, KU Leuven, Blijde-Inkomststraat 21, PO Box 3308, 3000, Leuven, Belgium
*
Corresponding author: Ruiming Ma; Email: ruiming.ma@kuleuven.be
Rights & Permissions [Opens in a new window]

Abstract

Language variation (specifically: optionality between different ways of saying the same thing, as in check out the places vs check the places out) tends to be considered abnormal, suboptimal, short-lived, dysfunctional and needlessly complex, especially in functional or cognitive linguistic circles. In this contribution, we are assessing these assumptions: does grammatical optionality increase the relative complexity (or: difficulty) of language production? We use a corpus-based psycholinguistics research design with a variationist twist and analyse SWITCHBOARD, a corpus of conversational spoken American English. We ask if and how grammatical optionality correlates with two symptoms of production difficulty, namely filled pauses (um and uh) and unfilled pauses (speech planning time). Our dataset covers 108,487 conversational turns in SWITCHBOARD, 22 grammatical alternation types yielding 57,032 optionality contexts, 589,124 unfilled pauses and 43,801 filled pauses. Analysis shows that overall optionality contexts do not make speech production more dysfluent – regardless of how many language-internal probabilistic constraints are in operation, or how many variants there are to choose from. With that being said, we show how some alternations in the grammar of English are more prone to attract or repel production difficulties than others. All told, our results call into question old dogmas in theoretical linguistics, such as the Principle of Isomorphism or the Principle of No Synonymy.

Information

Type
Research Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2025. Published by Cambridge University Press
Figure 0

Table 1. Summary of speaker demographics in the SWITCHBOARD corpus

Figure 1

Figure 1. The number of silences vs filled pauses in 108,487 conversational turns investigated in SWITCHBOARD. The scatterplot was jittered on both axes (±0.5) for clearer visualisation. The linear regression line is significant and weakly positive $ \left(r=0.07,p<0.001\right) $.

Figure 2

Table 2. Average number of constraints and number of variants of the 22 grammatical alternations under analysis

Figure 3

Table 3. Distribution of number of variable contexts per turn among all speakers in the SWITCHBOARD corpus

Figure 4

Table 4. Mixed-effects linear regression testing the fixed effects of numbers of grammatical variable contexts per turn, turn duration, mean word length, speech rate, and the random intercept of speaker on the sum of scaled number of filled and unfilled pausesModel fit by maximum likelihood. AIC = -401,958.1, Marginal R2 = 0.506, Conditional R2 = 0.603. Variation inflation factors < 1.18. Nobservations = 108,487.

Figure 5

Table 5. The baseline model: mixed-effects linear regression testing the fixed effects of turn duration, mean word length, speech rate, and the random intercept of speaker and grammatical alternation type on the sum of scaled number of filled and unfilled pausesModel fit by maximum likelihood. AIC = -92,496.1, Marginal R2 = 0.479, Conditional R2 = 0.600. Variation inflation factors <1.08. Nobservations = 25,703.

Figure 6

Table 6. The enhanced model: mixed-effects linear regression testing the fixed effects of number of constraints and number of variants per turn, turn duration, mean word length, speech rate, and the random intercept of speaker and grammatical alternation type on the sum of scaled number of filled and unfilled pausesModel fit by maximum likelihood. AIC = -92493.4, Marginal R2 = 0.480, Conditional R2 = 0.601. Variation inflation factors <1.08. Nobservations = 25,703.

Figure 7

Figure 2. Estimates and confidence intervals (estimates ± standard error) for adjustments to intercept by grammatical alternation type in the Enhanced Baseline Model. Alternations whose estimates are located to the right of the dotted line (> 0) attract dysfluencies, estimates to the left of the dotted line dispel dysfluencies.