Hostname: page-component-89b8bd64d-b5k59 Total loading time: 0 Render date: 2026-05-05T14:10:17.848Z Has data issue: false hasContentIssue false

Backchannel behavior is idiosyncratic

Published online by Cambridge University Press:  22 February 2024

Peter Blomsma*
Affiliation:
Tilburg University, Tilburg, The Netherlands
Julija Vaitonyté
Affiliation:
Tilburg University, Tilburg, The Netherlands
Gabriel Skantze
Affiliation:
KTH Royal Institute of Technology, Stockholm, Sweden
Marc Swerts
Affiliation:
Tilburg University, Tilburg, The Netherlands
*
Corresponding author: Peter Blomsma; Email: peter.blomsma@gmail.com
Rights & Permissions [Opens in a new window]

Abstract

In spoken conversations, speakers and their addressees constantly seek and provide different forms of audiovisual feedback, also known as backchannels, which include nodding, vocalizations and facial expressions. It has previously been shown that addressees backchannel at specific points during an interaction, namely after a speaker provided a cue to elicit feedback from the addressee. However, addressees may differ in the frequency and type of feedback that they provide, and likewise, speakers may vary the type of cues they generate to signal the backchannel opportunity points (BOPs). Research on the extent to which backchanneling is idiosyncratic is scant. In this article, we quantify and analyze the variability in feedback behavior of 14 addressees who all interacted with the same speaker stimulus. We conducted this research by means of a previously developed experimental paradigm that generates spontaneous interactions in a controlled manner. Our results show that (1) backchanneling behavior varies between listeners (some addressees are more active than others) and (2) backchanneling behavior varies between BOPs (some points trigger more responses than others). We discuss the relevance of these results for models of human–human and human–machine interactions.

Information

Type
Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2024. Published by Cambridge University Press
Figure 0

Figure 1. Visual impression of the o-cam experiment. First, the participant is prepared (A–C); after that, 11 rounds are played: In each round, the participant is shown four figures (D), followed by a description of one of those figures (E) after which the participant indicates which figure is described (F).

Figure 1

Figure 2. Illustration of a part of the speaker stimulus, with at each point in time the number of judges that indicated the presence of a BOP. If three judges or more indicated a BOP at a certain point, then this point is considered as a genuine BOP.

Figure 2

Table 1. Pitch properties of backchannel-inviting cues, compared to those of non-cues

Figure 3

Table 2. Averages of different channels (and standard deviations) over backchannel-inviting cues and non-backchannel-inviting cues

Figure 4

Table 3. Averages of different speaker channels (and standard deviations) of backchannel-inviting cues that precede LBRs vs cues that precede continuers

Figure 5

Table 4. Pitch properties of backchannel-inviting cues that precede LBRs vs cues that precede continuers

Figure 6

Table 5. Averages of different channels (and standard deviations) over BOPs and non-BOPs

Figure 7

Figure 3. Head movement frequency during BOPs and outside of BOPs.

Figure 8

Figure 4. Head movement amplitude during BOPs and outside of BOPs.

Figure 9

Figure 5. Number of times addressees vocalized BOPs versus the number of vocalized non-BOPs.

Figure 10

Table 6. Differences in feedback behavior between addressees

Figure 11

Figure 6. Values for head movement (frequency and amplitude) and vocalizations (sound) for each addressee. Frequency and amplitude are scaled, such that 1 represents the maximum value and 0 represents the lowest value.

Figure 12

Figure 7. Values for head movement (frequency and amplitude) and vocalizations (sound) for each BOP. Frequency and amplitude are scaled, such that 1 represents the maximum value and 0 represents the lowest value.

Figure 13

Figure 8. Behavior for each addressee per BOP. Only the first 15 BOPs due to visualization restrictions.

Figure 14

Table 7. Differences in feedback behavior within addressees

Figure 15

Table 8. Averages of head movement and vocalizations over continuer BOPs and last backchannel of round (LBR)