Skip to main content Accessibility help
×
Hostname: page-component-5db58dd55d-jhf8m Total loading time: 0 Render date: 2026-05-25T11:29:41.075Z Has data issue: false hasContentIssue false

Section 1 - The Physiology of Rhythm

Published online by Cambridge University Press:  23 April 2026

Lars Meyer
Affiliation:
Max Planck Institute for Human Cognitive and Brain Sciences
Antje Strauss
Affiliation:
University of Konstanz

Information

Figure 0

Figure 1.1(A) Mechanism of inspiration and expiration phases;Figure 1.1(A) long description.

Figure 1

Figure 1.1(B) top: Centers of the brain involved in breathing control; (B) bottom: Example of rib cage breathing movement at rest (tidal breathing) and speech breathing (read speech) monitored using inductive plethysmography.Figure 1.1(B) long description.

Figures 1.1A and B adapted from openstax.org under the Creative Commons Attribution 4.0 International License. Legends have been slightly modified for readability reasons and for consistency with the text.
Figure 2

Figure 2.1 Jaw displacement values for different emphasis conditions.Jaw displacement values (mm) for each syllable in the utterance Pam said bat that fat cat at that mat, spoken in five utterance conditions by a single speaker. The emphasized words, from top to bottom, are BAT, THAT, FAT, CAT, and no emphasis.Figure 2.1 long description.

Erickson et al. (2015)
Figure 3

Figure 2.2 Samples of default and intent nuclear stress.Jaw tracings for AE speakers for the utterance I saw five bright highlights in the sky tonight. For the top figure, gray arrows from left to right point to foot stress on five, phrase stress on the first syllable of the compound word highlights, and utterance (nuclear) stress on sky. For the bottom figure, nuclear stress is instead on high(lights), phrasal stress is on sky, and foot stress is on five, as indicated by the white arrows.

Figure 4

Figure 2.3 Schematized steady stades and fast transit intervals.A schematized figure on steady states (thick solid lines) and fast transit intervals (solid lines with arrows) of a syllable, as divided by de/acceleration peaks. The areas marked CVC are the duration of the acoustic segments. The steady states of the crucial segmental articulators (here, vertical positions of the tongue tip or lower lip) form the CVC segments. The syllable articulator (the jaw) displays shorter steady states than both C and V segments. Note that the steady state of the tongue body (a low vowel) does not align with the V segment. Still unresolved questions are marked with dotted lines. The schematized figure is based on findings of Svensson Lundmark (2023) and Svensson Lundmark and Erickson (2024).

Reproduced with permission from M. Svensson Lundmark and D. Erickson, JSLHR, https://doi.org/10.1044/2024_JSLHR-23-00092, accepted; licensed under a Creative Commons Attribution (CC BY) license.
Figure 5

Figure 2.4 Jaw displacement patterns for five speakers.Bar graph displays of average amount of jaw displacement (mm) shown on y-axis with content word on the x-axis (Pam, chance, chat, nap), for each of the five speakers (S1, S2, S3, S4, and S5). Jaw displacement ranges from 15 to 30 mm for S1, 10 to 30 mm for S2, 21 to 31 mm for S3, 5 to 20 mm for S4, and 45 to 53 mm for S5.Figure 2.4 long description.

Figure 6

Figure 2.5 Correlation of vowel duration and jaw displacement.Correlation plot on acoustic vowel duration (s) and jaw displacement (mm) for each speaker. Jaw displacement is mean-centered for reasons of clarity. The four content words are represented by different symbols.Figure 2.5 long description.

Figure 7

Figure 2.6 Probability of perceived prominence and boundaries.(Top) jaw displacement for the utterance; (bottom) probability of nuclear prominence (dash lines with circle symbols) and phrase boundaries for each word produced by (a) S1, (b) S2, (c) S3, (d) S4, and (e) S5.Figure 2.6 long description.

Figure 8

Figure 2.7 Jaw displacement patterns for French, Japanese, and Mandarin.Jaw displacement patterns from top to bottom for French (Natacha didn’t attach her cat, Pasha, who escaped), Japanese (That’s why Mana’s hair is smooth), and Mandarin (Mother curses the horse). The term AP refers to accent phrase, and the final AP is referred to as an IP (intonational phrase).Figure 2.7 long description.

Figure 9

Figure 2.8 Jaw displacement for L2 Japanese and French speakers, and an L1 English speaker.Jaw displacement patterns for L1 Japanese (top row), L1 French (middle row) and L1 English (bottom row) speakers for the utterance I saw five bright highlights in the sky tonight.

Figure 10

Figure 3.1 Spectral profiles of language-relevant brain areas.Normalized power spectra as found in Keitel and Gross (2016) for 12 brain areas according to the automated anatomical labelling (AAL) atlas (Tzourio-Mazoyer et al., 2002; Bohland et al., 2009). Upper rows show the left hemisphere, lower rows the right hemisphere (see schematic area projections for area locations). Shaded error bars illustrate the standard error of the mean across participants. Power peaks are labelled according to their peak frequency (e.g., delta, theta, alpha, beta).Figure 3.1 long description.

Schematic area projections are used with the kind permission from Jason W. Bohland.
Figure 11

Figure 4.1A: During head rotations, endolymph fluid lags behind angular rotation of the head due to inertia, and its displacement bends the cilia of the hair cells in one of the three canals, anterior, lateral, and posterior.

Figure 12

Figure 4.1B: Within the macula of the utricle and saccule, which encode horizontal and vertical accelerations of the head, respectively, the otolith crystals move in response to gravity and head motion, bending stereocilia of the hair cells and triggering an afferent response.

Figure 13

Figure 4.1C: The signal transduced by the otoliths is called gravito-inertial acceleration, which is the vector sum of forces due to gravity and inertial motion of the head. These have the same physical consequences on the otolith organs, such that tilting the head backward and accelerating forward with the head upright results in the same signal to the brain via cranial nerve VIII.Figure 4.1C: long description.

Created with biorender.com.
Figure 14

Figure 4.2 Force transfer and multimodal prosody.Graphical depiction of audio and visual components of naturalistic speech recorded using a motion capture setup. Top: the vertical acceleration for four different mocap markers. Below: estimate of the speaker’s vertical GRF. The lowpass envelope is superimposed. Bottom: raw speech signal.The figures above show the speaker at each time frame associated with the time points demarcated by the vertical lines across the four plots (see x-axis). The stressed syllable in “ground” manifests closely in time with the contour of the GRF estimate – a function of the peak in downward acceleration of the speaker’s head, trunk, and arm movements.Figure 4.2 long description.

Data from the Trinity Speech-Gesture database (Ferstl and McDonnell, 2018). Time derivatives of the body movement and the vertical GRF estimated from raw motion capture data (Burger and Toiviainen, 2013).
Figure 15

Figure 6.1 Syllabic rhythm in the speech perception–action cycle.Speakers (left) produce an utterance with a syllabic rhythm through opening and closing movements of the mouth (mandibular cycle) and associated fluctuations in sound energy. Maximized oral aperture during vowel production generates energy peaks that define the rhythm’s rate over successive inter-peak intervals. In speakers (through feedback, left) and listeners (right), this rhythm maps onto neural oscillations in the delta-to-theta frequency range (about 2–8 Hz). This direct mapping and/or the repeated use of interval-based temporal processing mechanisms allows for factoring “time” into behavior and into intrapersonal and interpersonal adaptation. Adaptation entails timely and predictive activation to optimally allocate neural and cognitive resources in production and perception. At the highest level of the underlying processing hierarchy, the PFC provides the temporal integration and coordination capacities that bridge the temporally separate elements of the utterance into one behavioral gestalt for monitoring, planning, and comprehension.Figure 6.1 long description.

Figure 16

Figure 7.1 Explanation of f0 declination and final lenghthening in two examples.A) Oscillogram of human language. The spoken text is: “Always there had been war between the giants and the gods.” The duration of the “s” of “giants” and “gods” is annotated, and the final “s” is much longer, demonstrating final lengthening. B) The spectrogram for the same human sentence is shown with a solid line indicating the f0. A clear f0 decline is visible. C) An oscillogram of a budgerigar twittering. D) The corresponding spectrogram of the twittering. The f0 is shown as an extra solid line, and an f0 decline is visible.Figure 7.1 long description.

Save book to Kindle

To save this book to your Kindle, first ensure no-reply@cambridge.org is added to your Approved Personal Document E-mail List under your Personal Document Settings on the Manage Your Content and Devices page of your Amazon account. Then enter the ‘name’ part of your Kindle email address below. Find out more about saving to your Kindle.

Note you can select to save to either the @free.kindle.com or @kindle.com variations. ‘@free.kindle.com’ emails are free but can only be saved to your device when it is connected to wi-fi. ‘@kindle.com’ emails can be delivered even when you are not connected to wi-fi, but note that service fees apply.

Find out more about the Kindle Personal Document Service.

  • The Physiology of Rhythm
  • Edited by Lars Meyer, Max Planck Institute for Human Cognitive and Brain Sciences, Antje Strauss, University of Konstanz
  • Book: Rhythms of Speech and Language
  • Online publication: 23 April 2026
  • Chapter DOI: https://doi.org/10.1017/9781009295888.002
Available formats
×

Save book to Dropbox

To save content items to your account, please confirm that you agree to abide by our usage policies. If this is the first time you use this feature, you will be asked to authorise Cambridge Core to connect with your account. Find out more about saving content to Dropbox.

  • The Physiology of Rhythm
  • Edited by Lars Meyer, Max Planck Institute for Human Cognitive and Brain Sciences, Antje Strauss, University of Konstanz
  • Book: Rhythms of Speech and Language
  • Online publication: 23 April 2026
  • Chapter DOI: https://doi.org/10.1017/9781009295888.002
Available formats
×

Save book to Google Drive

To save content items to your account, please confirm that you agree to abide by our usage policies. If this is the first time you use this feature, you will be asked to authorise Cambridge Core to connect with your account. Find out more about saving content to Google Drive.

  • The Physiology of Rhythm
  • Edited by Lars Meyer, Max Planck Institute for Human Cognitive and Brain Sciences, Antje Strauss, University of Konstanz
  • Book: Rhythms of Speech and Language
  • Online publication: 23 April 2026
  • Chapter DOI: https://doi.org/10.1017/9781009295888.002
Available formats
×