Hostname: page-component-5db58dd55d-688nx Total loading time: 0 Render date: 2026-06-02T18:27:10.176Z Has data issue: false hasContentIssue false

Temporal relationships between speech and hand gestures in the vicinity of potential turn boundaries in German and Swedish conversation

Published online by Cambridge University Press:  21 July 2025

Margaret Zellers*
Affiliation:
Institut für Skandinavistik, Frisistik, & Allgemeine Sprachwissenschaft, Kiel University, Kiel, Germany
Jan Gorisch
Affiliation:
Department of Pragmatics, Leibniz-Institute for the German Language, Mannheim, Germany
David House
Affiliation:
Division of Speech, Music and Hearing, KTH Royal Institute of Technology, Stockholm, Sweden
*
Corresponding author: Margaret Zellers; Email: mzellers@isfas.uni-kiel.de
Rights & Permissions [Opens in a new window]

Abstract

Both gesture and talk are basic building blocks of face-to-face conversation. In this study, we address the temporal dynamics of hand gesture phases relative to places and types of turn transition. We annotated gesture features and measured temporal aspects of gesture related to speech in two languages, German and Swedish. We found variation in the temporal relationships of gesture types and alignment of gesture phases that relate to the management of turn-taking in conversation. Specifically, the frequency of different gesture phases accompanying the offset of speech differed depending on whether the same speaker held the floor or whether a new speaker took up a turn. In addition, we found that differences in temporal alignment of gesture phases can distinguish between the type of turn transition that is upcoming up to a second before the place of transition is reached. Our results emphasize the importance of the interaction of the verbal and the gestural modality to maintain the smooth flow of conversation.

Information

Type
Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2025. Published by Cambridge University Press
Figure 0

Table 1. Metadata for the FOLK and Spontal files

Figure 1

Figure 1. Screenshot of the annotation environment in ELAN (data from FOLK).

Figure 2

Figure 2. Ongoing gesture phase at time of speech offset, German data left, Swedish data right. The y-axis shows the proportion of gesture phases at each transition type, rather than raw counts.

Figure 3

Table 2. Gesture phases according to transition types observed at the time of speech offset in completion points in German and Swedish

Figure 4

Figure 3. Distribution of gesture phases over time according to transition types; German data left, Swedish data right. The zero point is at the speech offset at a potential turn boundary (PTB). On the y-axis we count which gesture phase the current speaker is currently in. All counts across all phases sum up to the number of completion points in the data with the specific transition label. The counts in the slice at time point zero, accords with the numbers in Table 2.

Figure 5

Figure 4. Model estimates for strokes. The x-axis shows the time offset from the completion point. The y-axis shows the estimated probability of strokes on a logarithmic scale. The random factor (speaker) is considered in the plot. We used the R package effects (Fox et al., 2022) in plotting the estimates.

Figure 6

Table 3. Upper part: statistical evaluation for each logistic model: R2m = explained variation without random factor (speaker) and R2c corrected for the random factor. Lower part: p-values (log probabilities) for each factor and the interaction with time. The reference group (Intercept) is backchannels in German. Bold text shows predictors that achieve statistical significance.

Figure 7

Figure 5. Estimates for the models for gesture phases preparations, holds, retractions, and none.

Figure 8

Figure 6. Probability of strokes over time (zero = offset of speech) according to transition types. Stretches where the confidence intervals do not overlap can be considered as significantly different.

Figure 9

Figure 7. Probability of preparations over time (zero = offset of speech) according to transition types.

Figure 10

Figure 8. Probability of holds over time (0 = offset of speech) according to transition types.

Figure 11

Figure 9. Probability of none over time (zero = offset of speech) according to transition types.