Hostname: page-component-89b8bd64d-z2ts4 Total loading time: 0 Render date: 2026-05-06T05:12:41.172Z Has data issue: false hasContentIssue false

Introduction to the special issue ‘Multimodal prosody: prosody in the spoken and visual domains’

Published online by Cambridge University Press:  04 March 2026

Núria Esteve-Gibert*
Affiliation:
Faculty of Psychology and Education Sciences, Universitat Oberta de Catalunya, Spain
Pilar Prieto
Affiliation:
Institució Catalana de Recerca i Estudis Avançats , Spain Translation and Language Sciences, Pompeu Fabra University , Spain
Stefanie Shattuck-Hufnagel
Affiliation:
Research Laboratory of Electronics (RLE), Massachusetts Institute of Technology , USA
*
Corresponding author: Núria Esteve-Gibert; Email: nesteveg@uoc.edu
Rights & Permissions [Opens in a new window]

Abstract

The objective of this special issue is to present innovative research demonstrating that prosody needs to be reconceptualized as an inherently multimodal phenomenon, manifested across the spoken and/or visual domains. The studies included are organized into three core themes. Theme 1 addresses the temporal alignment of spoken and visual aspects of prosody, and how this is shaped by linguistic factors, speaker-specific traits (such as neurodiversity) and language learning patterns. Theme 2 deals with the coordination of spoken and visual aspects of prosody in conveying pragmatic intent, focusing on aspects such as negation, emotion and epistemic stance. Theme 3 explores how visual signals, including head movements and manual signals, fulfil essential prosodic roles across diverse sign language typologies. Taken together, the empirical evidence presented here shows that prosody is also embodied and that our bodily movements can manifest prosodic characteristics. On the one hand, they show the need to comprehensively re-evaluate our understanding of how speakers, listeners and learners engage with the prosodic dimension of language. On the other hand, they reveal that non-referential gestures are deeply meaningful and prosodically structured. Ultimately, visual cues are presented as indispensable for building accurate models of the human language capacity.

Information

Type
Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2026. Published by Cambridge University Press