Hostname: page-component-89b8bd64d-7zcd7 Total loading time: 0 Render date: 2026-05-10T08:42:55.362Z Has data issue: false hasContentIssue false

Disfluency doesn’t happen in isolation

Exploring how individual disfluency features co-occur in L2 speaking performances

Published online by Cambridge University Press:  19 May 2025

Xun Yan*
Affiliation:
Department of Linguistics, School of Literatures, Cultures and Linguistics, University of Illinois at Urbana-Champaign, Urbana, IL, USA Beckman Institute for Advanced Science and Technology, Urbana, IL, USA
Ping-Lin Chuang
Affiliation:
Duolingo Inc, Pittsburgh, PA, USA
Yulin Pan
Affiliation:
Department of Linguistics, School of Literatures, Cultures and Linguistics, University of Illinois at Urbana-Champaign, Urbana, IL, USA
Huiying Cai
Affiliation:
Department of Linguistics, School of Literatures, Cultures and Linguistics, University of Illinois at Urbana-Champaign, Urbana, IL, USA
Shelley Staples
Affiliation:
Department of English, University of Arizona, Tucson, AZ, USA
Mariana Centanin Bertho
Affiliation:
Department of Spanish and Portuguese, Yale University, New Haven, CT, USA
*
Corresponding author: Xun Yan; Email: xunyan@illinois.edu.
Rights & Permissions [Opens in a new window]

Abstract

The construct of second language (L2) utterance fluency is typically operationalized through various individual temporal features. However, in natural speech, fluency (or disfluency) is often characterized by the clustering of multiple temporal features, collectively revealing the speaker’s effort in speech production or disfluency recovery. In this study, we explore the co-occurrence patterns of disfluency features in L2 speech and their associations with speakers’ L2 oral proficiency. We initially segmented all speech samples into analysis of speech (AS)-units. Within each AS-unit, six individual fluency features were manually coded, standardized, and subsequently subjected to a hierarchical-based k-means cluster analysis to examine their co-occurrence patterns. The results revealed four distinct disfluency clusters. A subsequent qualitative analysis of disfluencies in each cluster revealed distinct distributional patterns, disfluency makeup, and communicative functions. Additionally, the proportions of different disfluency clusters were significantly influenced by speakers’ proficiency level, first language background, and their interaction. These findings carry implications for L2 speaking research in general, shedding light on the intricate nature of speech fluency and presenting an alternative approach to the operationalization of this multidimensional construct.

Information

Type
Research Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2025. Published by Cambridge University Press
Figure 0

Table 1. Corpus used for the study.

Figure 1

Figure 1. Screenshot of disfluency coding on ELAN.Note: In the Disfluency_Markin tier, [] = original or intended utterance; () = reparandum; {} = editing phase; and ^ = silent pause.

Figure 2

Table 2. Structure of disfluency.

Figure 3

Table 3. Descriptive statistics and Spearman correlations of individual disfluency features.

Figure 4

Figure 2. Dendrogram and scree plot for hierarchical cluster analysis.

Figure 5

Table 4. Cluster centroids of micro-disfluency features.

Figure 6

Figure 3. Cluster centroids of micro-disfluency features.Note: FP = filled pause; RF = reformulation; RP = repetition; SP = silent pause.

Figure 7

Table 5. MANOVA and ANOVA analysis of proficiency level and L1 on disfluency cluster proportions.

Figure 8

Figure 4. Correlations between proportion of disfluency clusters and IELTS scores.Note: * p < .05; ** p < .01; *** p < .001.

Figure 9

Figure 5. Correlations between proportion of disfluency clusters and IELTS scores by speaker L1.Note: * p < .05; ** p < .01; *** p < .001. KSA = Kingdom of Saudi Arabia.

Figure 10

Table 6. Example AS-units within each cluster.

Figure 11

Table 7. Summary of findings.