Hostname: page-component-6766d58669-7cz98 Total loading time: 0 Render date: 2026-05-20T15:36:09.179Z Has data issue: false hasContentIssue false

When to (not) split the infinitive: factors governing patterns of syntactic variation in Twitter-style Philippine English

Published online by Cambridge University Press:  14 February 2024

WILKINSON DANIEL WONG GONZALES*
Affiliation:
Department of English 3/F Fung King Hey Building The Chinese University of Hong Kong Shatin, New Territories Hong Kong SAR, People's Republic of China wdwonggonzales@cuhk.edu.hk
Rights & Permissions [Opens in a new window]

Abstract

The variability of adverbial placement in the modified infinitive construction (i.e. split infinitives vs. full infinitives with adverbial pre- and post-modification) has been widely discussed in the (American English) literature. Yet a convincing generalized explanation for the variation that simultaneously incorporates language-internal and language-external factors has yet to be found, particularly in English varieties that have not received as much scholarly attention as standardized varieties. This article investigates modified infinitive syntactic variation in Twitter-style Philippine English (PhE) using a 135-million-word Twitter corpus. It adopts a Bayesian approach in conducting a multiple multinomial regression analysis of the said variation, with the help of Deep-Learning-based demographic inference tools. Although the conditioning effects of some factors diverge from patterns discussed in prior work, the results generally show that language-internal (e.g. stress and rhythm, adverb type, adverb length) and language-external factors (i.e. time, age, sex, geography) jointly shape the choice to split the infinitive in this linguistic style of PhE.

Information

Type
Research Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
Copyright © The Author(s), 2024. Published by Cambridge University Press
Figure 0

Table 1. Stress and rhythm conditions for modified infinitive constructions (Calle-Martín & Miranda-García 2009: 356) S = stressed, U = unstressed, ? = claimed to be rare

Figure 1

Figure 1. Cities from which the TCOPE was sampled, grouped by the three sociopolitical regions of the Philippines. Figures were adopted from the corpus overview papers (Gonzales 2023b)

Figure 2

Figure 2. Graphical user interface of Twitter Corpus Suite (COPE)

Figure 3

Table 2. Bayesian model posterior draw estimates for predictors influencing likelihood to split the infinitive (reference levels in boldface)

Figure 4

Table 3. Proportion of factors that have a higher probability (pd > 0.79) of a non-zero effect on the likelihood of use of pre- or post-modified infinitive over the split infinitive construction

Figure 5

Figure 3. Distribution of modified infinitive variants by notable non-prosodic language-internal factors. Box plots and scatter plots indicate partial effects. On the y-axes of the box and scatter plots, negative values indicate likelihood to use a split infinitive, whereas positive values indicate likelihood to use pre-/post-infinitive modification.

Figure 6

Figure 4. Marginal effects of interaction between length of adverb relative to the verb (length in syllables) and likelihood of ambiguity (distance from sentence boundary)

Figure 7

Figure 5. Distribution of modified infinitive variants by prosodic language-internal factors (main effects)

Figure 8

Table 4. Observed effects of stress and rhythm conditions on modified infinitive variation in previous studies vs. this study

Figure 9

Figure 6. Distribution of modified infinitive variants – Conditions A, B and C given Condition D

Figure 10

Figure 7. Distribution of modified infinitive variants by language-external factors (main effects); Box plots and scatter plots indicate partial effects. On the y-axes of the box and scatter plots, negative values indicate likelihood to use a split infinitive, whereas positive values indicate likelihood to use pre-/post-infinitive modification.