38.1 Introduction
Young infants are sensitive to the rhythmic patterns and regularities in their auditory input, which is primarily composed of the speech and singing that they hear from their caregivers (Hilton et al., Reference Hilton, Moser and Bertolo2022). This sensitivity is considered to play a fundamental role in the process of early language acquisition in the first years of life (Gervain, Reference Gervain2018; Gervain and Mehler, Reference Gervain and Mehler2010; Jusczyk, Reference Jusczyk1997; Nazzi and Ramus, Reference Nazzi and Ramus2003), as well as to support the development of more advanced language processing abilities in childhood, such as phonological awareness and reading. Infants begin learning the rhythmic patterns of their native language or languages in utero, when they perceive limited segmental but intact suprasegmental information from ambient speech (Moon and Fifer, Reference Moon and Fifer2000). This equips them with the ability to differentiate their native language(s) from rhythmically different nonnative languages already at birth (DeCasper and Fifer, Reference DeCasper and Fifer1980; Nazzi et al., Reference Nazzi, Bertoncini and Mehler1998). This initial language-specific rhythmic sensitivity becomes further fine-tuned and extended over infants’ first year of life. This is observed in the emerging abilities to discriminate between rhythmically similar languages (Bosch and Sebastián-Gallés, Reference Bosch and Sebastián-Gallés1997; Molnar et al., Reference Molnar, Gervain and Carreiras2014) and to detect native prosodic patterns at the utterance, lexical, and sub-lexical levels (Friederici et al., Reference Friederici, Friedrich and Christophe2007; Höhle et al., Reference Höhle, Bijeljac-Babic, Herold, Weissenborn and Nazzi2009; Weber et al., Reference Weber, Hahne, Friedrich and Friederici2004).
These early prosodic abilities, in turn, support the processing and segmentation of incoming continuous speech (e.g., Jusczyk et al., Reference Jusczyk, Cutler and Redanz1993; Hallé and de Boysson-Bardies, Reference Hallé and de Boysson-Bardies1996; Mattys et al., Reference Mattys, Jusczyk, Luce and Morgan1999; Morgan and Saffran, Reference Morgan and Saffran1995; and refer to Chapter 39 for a detailed discussion of infants’ early perception of native language rhythm).
In this chapter, we discuss how infants’ environments are optimized to support their challenging task of identifying and learning the prosodic and rhythmic patterns of their native language(s). We review findings showing that this optimization is not manifested as an enhancement of language-specific rhythmic cues in infant-directed speech. Instead, we discuss evidence for the presence of language-general rhythmic regularities in natural caregiver–infant interactions, in which caregivers communicate with infants not only through speech but also through music, touch, and movement. We show that infants are exposed to multiple rhythmic input modalities, which together constitute optimized intersensory rhythmic input that infants experience from their first months of gestation and after birth. This optimized multisensory stimulation fosters infants’ early ability to extract rhythmic regularities from their environment, supporting subsequent language acquisition.
38.2 Infant-Directed Speech: Rhythmic Information in Infants’ Early Language Input
Infants’ early linguistic environment consists primarily of infant-directed (ID) speech, which refers to the speech style or register that adults spontaneously use in interactions with young infants (Golinkoff et al., Reference Golinkoff, Can, Soderstrom and Hirsh-Pasek2015; Soderstrom, Reference Soderstrom2007). Compared to speech used in interactions among adults or adult-directed (AD) speech, ID speech is characterized by affective, acoustic, and linguistic properties proposed to serve specific functions such as regulating infants’ emotional states (Papoušek et al., Reference Papoušek, Bornstein, Nuzzo, Papoušek and Symmes1990), capturing and maintaining infants’ attention during communicative interactions (Fernald and Simon, Reference Fernald and Simon1984), and facilitating early language development (Kuhl, Reference Kuhl2000). Evidence for ID speech has been documented across languages and language families, and adult listeners are highly accurate at differentiating it from AD speech even in unfamiliar languages, suggesting universality in the acoustic properties of ID speech (Hilton et al., Reference Hilton, Moser and Bertolo2022). Even though ID speech is produced by adults for an infant audience, infants are not passive perceivers of this speech register. From their first months of life, infants prefer listening to ID compared to AD speech (Byers-Heinlein et al., Reference Byers-Heinlein, Tsui and Bergmann2021; Cooper and Aslin, Reference Cooper and Aslin1990), and their active feedback shapes the acoustic properties of the ID speech produced by their caregivers (Lam and Kitamura, Reference Lam and Kitamura2012; Murray and Trevarthen, Reference Murray and Trevarthen1986; Smith and Trainor, Reference Smith and Trainor2008). 
Caregivers are also sensitive to their infants’ developmental needs and evolving listening preferences (Kitamura and Burnham, Reference Kitamura and Burnham1998, Reference Kitamura and Burnham2003; Kitamura and Lam, Reference Kitamura and Lam2009), and adjust their speech accordingly. For instance, ID speech to newborns is highly soothing and comforting, while ID speech to six–nine-month-olds acquires more directive and attention-grabbing qualities (Kitamura and Burnham, Reference Kitamura and Burnham2003), reflecting infants’ readiness to actively engage with objects and people in their environment and attend to linguistic information in their speech input (Newman and Hussain, Reference Newman and Hussain2006). Similarly, as infants approach their second birthday, ID speech becomes faster and exhibits a reduction in mean pitch and pitch range (Cox et al., Reference Cox, Bergmann and Fowler2023), progressively approximating the less prosodically exaggerated child-directed and AD speech registers.
Several acoustic and prosodic adaptations in ID relative to AD speech have been proposed to directly support early language processing and subsequent language acquisition. For instance, phonetic exaggeration, typically indexed by the expansion of the acoustic distance between the three corner vowels /i/, /u/, and /a/ (Burnham et al., Reference Burnham, Kitamura and Vollmer-Conna2002; Kuhl et al., Reference Kuhl, Andruski and Chistovich1997), has been proposed to directly foster early speech perception abilities by providing infants with exposure to clear speech and more distinct phonetic categories (Kuhl, Reference Kuhl2000). Indeed, infants show more efficient vowel processing and word recognition when presented with ID speech containing acoustically exaggerated vowels (Lovcevic et al., Reference Lovcevic, Burnham and Kalashnikova2022; Peter et al., Reference Peter, Kalashnikova, Santos and Burnham2016; Song et al., Reference Song, Demuth and Morgan2010), and the extent to which individual mothers exaggerate vowels in ID speech correlates with their infants’ concurrent speech perception abilities and future vocabulary size (Hartman et al., Reference Hartman, Ratner and Newman2017; Kalashnikova and Burnham, Reference Kalashnikova and Burnham2018; Kalashnikova and Carreiras, Reference Kalashnikova and Carreiras2022; Liu et al., Reference Liu, Kuhl and Tsao2003; but see Martin et al., Reference Martin, Schatz and Versteegh2015; McMurray et al., Reference McMurray, Kovack-Lesh, Goodwin and McEchron2013, for findings contradicting this claim). Prosodic characteristics of ID speech including expanded pitch range and slow speech rate have also been linked to enhanced speech processing. 
In experimental paradigms, infants are more successful at segmenting continuous ID speech characterized by expanded pitch range (Floccia et al., Reference Floccia, Keren-Portnoy and DePaolis2016; Schreiner and Mani, Reference Schreiner and Mani2017; Thiessen et al., Reference Thiessen, Hill and Saffran2005) and recognizing words produced in slow ID speech (Song et al., Reference Song, Demuth and Morgan2010).
Based on this evidence linking the characteristics of caregivers’ ID speech to language outcomes, several studies have hypothesized that specific acoustic cues to the rhythmic patterns of infants’ native language would also be exaggerated compared to AD speech, and that this would support the development of native prosodic competence and speech segmentation abilities. Traditionally, the world’s languages have been classified as belonging to one of three rhythmic classes defined by the size of their prosodic units, which include the metrical foot in stress-timed languages (e.g., English), the syllable in syllable-timed languages (e.g., Spanish), and the mora in mora-timed languages (e.g., Japanese) (Abercrombie, Reference Abercrombie1967). More recently, this classification has been debated (e.g., Turk and Shattuck-Hufnagel, Reference Turk and Shattuck-Hufnagel2013), but there is evidence that a language’s rhythmic structure can be defined by several durational measures that represent the proportion and variability of duration with which consonantal and vocalic intervals occur in that language (e.g., Ramus et al., Reference Ramus, Nespor and Mehler1999, Reference Ramus, Hauser, Miller, Morris and Mehler2000; and refer to Chapter 11 for a discussion of other measures of speech rhythm). A recent meta-analysis shows that infants rely on these durational cues in language differentiation tasks, whereby smaller differences in vocalic variability and larger differences in consonantal variability between languages lead to better discrimination performance from birth to 12 months (Gasparini et al., Reference Gasparini, Langus, Tsuji and Boll-Avetisyan2021).
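The durational measures referred to here (%V, the proportion of utterance duration that is vocalic, and ΔV/ΔC, the variability of vocalic and consonantal interval durations) can be computed directly from a segmented utterance. The sketch below is an illustrative implementation only; the function name and the toy interval durations are invented for demonstration, not drawn from any of the cited studies.

```python
from statistics import pstdev

def rhythm_metrics(intervals):
    """Ramus et al. (1999)-style durational rhythm metrics.

    `intervals`: sequence of (label, duration_s) pairs, with label 'V'
    for a vocalic interval and 'C' for a consonantal interval.
    Returns (%V, deltaV, deltaC): the proportion of utterance duration
    that is vocalic, and the standard deviations of vocalic and
    consonantal interval durations.
    """
    v = [d for lab, d in intervals if lab == 'V']
    c = [d for lab, d in intervals if lab == 'C']
    percent_v = 100.0 * sum(v) / (sum(v) + sum(c))
    return percent_v, pstdev(v), pstdev(c)

# Hypothetical segmentation of a short utterance (durations in seconds).
toy = [('C', 0.08), ('V', 0.12), ('C', 0.10), ('V', 0.15),
       ('C', 0.06), ('V', 0.11)]
percent_v, delta_v, delta_c = rhythm_metrics(toy)
```

On such a segmentation, stress-timed languages tend toward lower %V and higher ΔC than syllable-timed ones, which is the kind of cross-language contrast the discrimination findings above build on.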
These language-specific durational cues may be difficult to extract for a novice language-learner, so they would be expected to be exaggerated in the slower and prosodically exaggerated ID speech. To test this, Payne et al. (Reference Payne, Post, Astruc, Prieto and Vanrell2009) compared durational rhythmic cues in English, Spanish, and Catalan ID speech to two-, four-, and six-year-olds (a register also referred to as child-directed speech, reflecting the recipient children’s ages). These three languages differ significantly in their rhythmic structure (English is stress-timed, Spanish is syllable-timed, and Catalan falls in between). Results showed that even though ID speech differed acoustically from AD speech in each language, instead of enhancing cross-linguistic differences, the ID adaptations resulted in greater rhythmic similarities. Across languages, ID speech was more vocalic and had reduced variability in consonantal segments. Lee et al. (Reference Lee, Kitamura, Burnham and McAngus Todd2014) assessed the same cues in English ID speech to infants from 0 to 12 months but failed to replicate the ID versus AD speech differences reported by Payne et al. However, using a modelling approach that assessed the sonorant structure of ID and AD speech (which measures the regularity with which syllables occur in speech based on prominence cues such as intensity, pitch, spectral balance, and duration; Lee and Todd, Reference Lee and Todd2004), this study also reported that stressed and unstressed syllables were differentiated to a lesser extent in ID compared to AD speech. Finally, Tajima et al. (Reference Tajima, Tanaka, Martin and Mazuka2013) investigated the durational cues of ID and AD speech in Japanese, a mora-timed language. Their analyses also failed to reveal any ID adjustments that would support the identification of mora segments in continuous speech. 
As can be seen, there are some inconsistencies across these studies, which are likely due to cross-dialectal and cross-linguistic differences and assessments of speech directed to infants and children spanning a wide age range. Despite this issue, this evidence suggests that ID speech does not enhance, and may even distort, durational cues that signal language-specific rhythm structure, potentially in favor of the prosodic adjustments that serve the affective and attention-grabbing functions of this speech register (Kempe et al., Reference Kempe, Brooks and Gillis2005).
Even if caregivers do not exaggerate language-specific rhythmic properties in ID speech, the findings reviewed above indicate that the segmental and suprasegmental properties of this register result in a temporal structure that is significantly different from AD speech (Nencheva and Lew-Williams, Reference Nencheva and Lew-Williams2022). Specifically, ID speech exhibits greater rhythmic regularities, resulting in a more isochronous signal. At first glance, this may appear to contradict the notion that ID speech is optimized for promoting the acquisition of language-specific rhythmic competence, but it may be the case that rhythmic regularization leads to more efficient neural encoding and processing of speech by young infants. This argument is based on extensive evidence from neurophysiological studies for entrainment between endogenous oscillatory neural activity and incoming sensory information (Ding et al., Reference Ding, Patel and Chen2017; and see Chapters 3, 5, 35, 36, and 37).
In the case of speech, specifically, linguistic information across multiple timescales is conveyed by the energy fluctuations of the amplitude envelope (i.e., the lexical and phrasal rate 1–4 Hz, the syllabic rate 5–8 Hz, and the phonological rate 30–50 Hz), which are tracked by corresponding frequency bands of neural oscillations in the auditory cortex (delta, theta, and gamma bands, respectively) (Ding et al., Reference Ding, Melloni, Zhang, Tian and Poeppel2016; Ding and Simon, Reference Ding and Simon2014; Ghitza, Reference Ghitza2012; Peelle and Davis, Reference Peelle and Davis2012; Poeppel and Assaneo, Reference Poeppel and Assaneo2020; and refer to Chapter 35 for a detailed discussion of cortical tracking of speech). Speech processing is primarily governed by efficient entrainment to the slowly occurring information in the speech envelope (delta and theta oscillation bands) (Ghitza, Reference Ghitza2012; Gross et al., Reference Gross, Hoogenboom and Thut2013; Luo and Poeppel, Reference Luo and Poeppel2007), which significantly correlates with speech intelligibility and successful comprehension (Arnal and Giraud, Reference Arnal and Giraud2012; Vanthornhout et al., Reference Vanthornhout, Decruy, Wouters, Simon and Francart2018). Both top-down processes and bottom-up information contribute to the efficiency of cortical entrainment in individual listeners (Di Liberto et al., Reference Di Liberto, Lalor and Millman2018; Doelling et al., Reference Doelling, Arnal, Ghitza and Poeppel2014; Peelle et al., Reference Peelle, Gross and Davis2013). Entrainment is more efficient when listeners have higher proficiency in the language in which the stimuli are presented (Lizarazu et al., Reference Lizarazu, Carreiras and Molinaro2023), and when they actively attend to the stimulus string (Golumbic et al., Reference Zion Golumbic, Ding and Bickel2013; Obleser and Kayser, Reference Obleser and Kayser2019). 
The bottom-up cues that modulate entrainment include speech rate and rhythmic regularity (Aubanel et al., Reference Aubanel, Davis and Kim2016; Lizarazu et al., Reference Lizarazu, Lallier and Molinaro2019). Thus, it is plausible that young infants who have limited access to top-down information during speech processing particularly benefit from exposure to ID speech, which is precisely a slow, attention-grabbing, and more isochronous speech signal.
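As an illustration of the envelope-band analysis described above, the sketch below extracts a broadband amplitude envelope from a signal and isolates its delta-range (1–4 Hz) component. All parameters and the synthetic amplitude-modulated "stimulus" are assumptions made for demonstration, not the materials of any cited study.

```python
import numpy as np
from scipy.signal import hilbert, butter, filtfilt

def band_envelope_phase(signal, fs, lo, hi):
    """Return the instantaneous phase of the lo-hi Hz band of the
    amplitude envelope of `signal`, sampled at `fs` Hz."""
    env = np.abs(hilbert(signal))                      # broadband amplitude envelope
    b, a = butter(2, [lo / (fs / 2), hi / (fs / 2)], btype='band')
    band = filtfilt(b, a, env - env.mean())            # one envelope band, zero-phase filtered
    return np.angle(hilbert(band))                     # instantaneous phase over time

# Synthetic stimulus: a 30 Hz carrier amplitude-modulated at 2 Hz,
# i.e. a delta-rate fluctuation like the phrasal rhythm of speech.
fs = 100
t = np.arange(0, 10, 1 / fs)
stimulus = (1 + 0.8 * np.sin(2 * np.pi * 2 * t)) * np.sin(2 * np.pi * 30 * t)
delta_phase = band_envelope_phase(stimulus, fs, lo=1, hi=4)
```

Neural tracking analyses then compare such stimulus envelope phases with the phase of cortical activity filtered into the matching band.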
Several studies support this proposal. Falk and Kello (Reference Falk and Kello2017) analyzed the temporal structure of the amplitude envelope of German ID speech and singing to six-month-old infants. They found that acoustic energy across frequencies corresponding to different units in the linguistic hierarchy were clustered to a greater extent in ID than AD registers. That is, the hierarchical temporal structure of ID registers was more regular compared to AD counterparts. Leong et al. (Reference Leong, Kalashnikova, Burnham and Goswami2014) assessed naturally produced English ID and AD speech to infants from seven to 11 months, focusing specifically on low-frequency information in the amplitude envelope. Their findings revealed greater delta–theta phase synchronization in ID speech compared to greater theta–gamma synchronization in AD speech. These results indicate greater rhythmic regularity in ID compared to AD speech, in particular at the rates at which stressed and unstressed syllables occur. Pérez-Navarro et al. (Reference Pérez-Navarro, Lallier, Clark, Flanagan and Goswami2022) recently replicated this result for Spanish child-directed speech to four-year-olds, suggesting that these adjustments extend across rhythm classes (i.e., in stress-timed English and syllable-timed Spanish) and may be preserved as children develop more advanced language abilities over time. Critically, the benefits of these rhythmic adjustments in ID and child-directed speech are reflected in direct measures of infant speech processing: Infants show more efficient neural entrainment to ID compared to AD speech (Kalashnikova et al., Reference Kalashnikova and Burnham2018; Menn et al., Reference Menn, Michel, Meyer, Hoehl and Männel2022a). 
Thus, the language-general rhythmic regularities in ID speech play an important role in facilitating early processing and encoding of speech, which may in turn promote infants’ subsequent learning of the specific rhythmic patterns of their native language and the extraction of meaningful linguistic units from continuous speech.
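The band-to-band phase relations discussed in this section, such as the delta–theta synchronization reported by Leong et al., are commonly quantified with an n:m phase-locking index. The sketch below shows the general form of such an index on synthetic phase series; the 2 Hz and 4 Hz test oscillations and the 1:2 locking ratio are illustrative assumptions, not the parameters of the cited studies.

```python
import numpy as np

def nm_phase_locking(phase_slow, phase_fast, n=1, m=2):
    """n:m phase-locking index between two phase time series (radians).
    Returns a value in [0, 1]: 1 indicates a perfectly constant n:m
    phase relation, 0 indicates no consistent phase relation."""
    return np.abs(np.mean(np.exp(1j * (n * phase_fast - m * phase_slow))))

fs = 200
t = np.arange(0, 10, 1 / fs)
slow = np.angle(np.exp(1j * 2 * np.pi * 2 * t))    # 2 Hz ("delta"-rate) phase
fast = np.angle(np.exp(1j * 2 * np.pi * 4 * t))    # 4 Hz ("theta"-rate) phase
locked = nm_phase_locking(slow, fast, n=1, m=2)    # fast runs at exactly 2x slow
rng = np.random.default_rng(0)
unlocked = nm_phase_locking(slow, rng.uniform(-np.pi, np.pi, t.size))
```

Here `locked` is near 1 because the two series hold a fixed 1:2 relation, while `unlocked` is near 0 for the random phase series; higher values of such an index in ID than AD speech are what indicates its greater rhythmic regularity.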
38.3 ID Singing: A Multimodal Source of Rhythmic Information
These temporal adjustments bring ID speech closer to the rhythmically regular and melodic structure of music (Daikoku and Goswami, Reference Daikoku and Goswami2022). Music is another highly prominent auditory signal available to infants before and after birth (Kisilevsky et al., Reference Kisilevsky, Hains, Jacquet, Granier-Deferre and Lecanuet2004), and provides them with valuable rhythmic information (Papadimitriou et al., Reference Papadimitriou, Smyth, Politimou, Franco and Stewart2021). After birth, the most common source of music for most infants is their caregivers’ singing (Trehub et al., Reference Trehub, Unyk and Kamenetsky1997). Caregivers across the world’s cultures have been documented to sing to their infants (Trehub and Russo, Reference Trehub, Russo, Russo, Ilari and Cohen2020), and their ID singing differs from AD singing in a number of acoustic and prosodic properties, including higher pitch, increased amplitude, and greater frequency variation. ID singing has been proposed to serve primarily an affective function by regulating infants’ arousal and emotional states (Cirelli et al., Reference Cirelli, Trehub and Trainor2018; Salimpoor et al., Reference Salimpoor, Benovoy, Longo, Cooperstock and Zatorre2009; Trainor, Reference Trainor2006) as well as promoting social bonding in caregiver–infant interactions (Cirelli and Trehub, Reference Cirelli and Trehub2019), which is crucial for healthy socio-emotional development (Poćwierz-Marciniak and Harciarek, Reference Poćwierz-Marciniak and Harciarek2021). As is the case for ID speech, caregivers dynamically adjust the characteristics of ID singing according to their infants’ emotional and developmental needs and their own intention to capture and maintain their infants’ attention or modify the infants’ emotional state (Delavenne et al., Reference Delavenne, Gratier and Devouche2013; Rock et al., Reference Rock, Trainor and Addison1999).
Lullabies, for instance, have a slower tempo, simpler melodies, and a reduced pitch range, and they aim to calm and soothe the infant. Play songs, on the other hand, have a faster tempo, more complex and varied melodies, and a wider pitch range, and their aim is to engage and animate the infant (Trainor and Trehub, Reference Trainor and Trehub1998). Infants, in turn, show their enjoyment of ID singing by relaxing, displaying less distress, and exhibiting more positive emotions (Shannon, Reference Shannon2006; Trehub et al., Reference Trehub, Ghazban and Corbeil2015). Interestingly, infants’ responses are also molded by the type of songs that they hear. Infants direct attention internally, focusing on self-regulation, during lullabies, but direct their attention externally toward the caregiver through mutual gaze during rhythmic play songs (Rock et al., Reference Rock, Trainor and Addison1999).
Critically, the exaggerated acoustic features of ID singing delineate its hierarchical beat structure, helping infants process phrase boundaries, rhythm, and grouping structures (Longhi, Reference Longhi2009). This means that not only the musical but also the linguistic information encountered in this register can be processed more effectively, supporting language acquisition. Several recent studies have assessed neural entrainment in young infants in response to pre-recorded sung ID speech and live, naturally produced ID singing and found efficient tracking at the delta and theta frequencies, relevant for speech encoding (Attaheri et al., Reference Attaheri, Choisdealbha and Di Liberto2022; Menn et al., Reference Menn, Michel, Meyer, Hoehl and Männel2022a; Nguyen et al., Reference Nguyen, Reisner and Lueger2023). Nguyen et al. (Reference Nguyen, Reisner and Lueger2023) measured neural entrainment of seven-month-old infants in response to lullabies and play songs, which display different acoustic and prosodic properties, as discussed above. Infants tracked lullabies more efficiently, which was primarily related to the slow tempo and beat clarity in this song type. However, infants were more likely to produce rhythmic movements when listening to play songs, which was related to the perceived loudness of these stimuli. Thus, it appears that these different types of ID songs not only elicit different attentional responses from infants (Rock et al., Reference Rock, Trainor and Addison1999) but may also have differential functions in facilitating infants’ encoding of the linguistic information in these songs (Franco et al., Reference Franco, Suttora, Spinelli, Kozar and Fasolo2022). 
Evidence is starting to emerge with several studies showing moderate but significant correlations between individual infants’ efficiency of neural entrainment to ID songs measured in the first year of life and language outcomes, in particular, vocabulary size in the second year (Attaheri et al., Reference Attaheri, Choisdealbha and Di Liberto2022; Menn et al., Reference Menn, Ward and Braukmann2022b; Nguyen et al., Reference Nguyen, Reisner and Lueger2023). Thus, the regular prosodic and rhythmic structure of ID singing, similarly to ID speech, may play a significant role in fostering infants’ speech processing and supporting later acquisition of the specific rhythmic properties of their native language and subsequent language acquisition.
ID speech and singing convey auditory-only information that facilitates infants’ auditory processing. However, solely focusing on the auditory modality misrepresents the true multimodal nature of caregiver–infant communication and fails to capture the facilitative role that ID adjustments in other sensory modalities may play in supporting language development, to which we now turn.
Infants are typically exposed to ID speech and singing in the context of active face-to-face interactions with their caregivers, in which the auditory input is supported by rich multimodal information that can also foster the process of encoding and segmenting incoming speech input. For instance, infants attend to the speaker’s face when listening to ID speech and singing, which leads to enhanced neural entrainment and greater interpersonal synchronization with their caregivers (Lense et al., Reference Lense, Shultz, Astésano and Jones2022; Tan et al., Reference Tan, Kalashnikova, Di Liberto, Crosse and Burnham2022). Caregivers also employ touch and gesture to emphasize relevant linguistic information when interacting with their young infants (Abu-Zhaya et al., Reference Abu-Zhaya, Seidl and Cristia2017; O’Neill et al., Reference O’Neill, Bard, Linnell and Fluck2005). Furthermore, and particularly relevant to infants’ perception of rhythmic information in the input, ID singing (but not ID speech; Zentner and Eerola, Reference Zentner and Eerola2010) elicits active rhythmic movement from the infants themselves. Therefore, in addition to perceiving rhythmic information in the auditory modality, infants have the opportunity to experience it in their own passive and active body movements.
Similarly to adults, infants are enticed to move when they hear music (Gordon, Reference Gordon2003). Several studies show that infants in their first year of life are more likely to produce rhythmic movements in response to ID singing and musical sequences compared to other stimuli, including ID speech (e.g., Ilari, Reference Ilari2015; Nguyen et al., Reference Nguyen, Reisner and Lueger2023; Zentner and Eerola, Reference Zentner and Eerola2010). Most importantly, there is also evidence to suggest that infants’ movements are coordinated with the temporal structure of the input, signalling a connection between infants’ rhythmic perception and rhythmic movement production (de l’Etoile et al., Reference de l’Etoile, Bennett and Zopluoglu2020; Longhi, Reference Longhi2009; Zentner and Eerola, Reference Zentner and Eerola2010). For instance, a recent study by de l’Etoile et al. (Reference de l’Etoile, Bennett and Zopluoglu2020) assessed whether infants’ movements related to rhythmic regularities in the stimuli, and whether infants were sensitive to rhythmic training. Six–10-month-old infants’ movements were recorded in response to silence, irregular auditory stimuli, and rhythmically regular song recordings. Infants’ movements were more regularly timed in response to the rhythmically regular songs. While infants’ ability to precisely time their rhythmic movements to the tempo of musical stimuli that they hear seems to emerge much later in preschool years (Drake et al., Reference Drake, Jones and Baruch2000; Eerola et al., Reference Eerola, Luck and Toiviainen2006), they show early sensitivity to the rhythmic structure of music, which is observed in the greater quantity and higher regularity of their rhythmic movements.
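The notion of "more regularly timed" movements can be operationalized in several ways; one simple, hypothetical index is the coefficient of variation (CV) of the intervals between successive movement onsets, where a lower CV means more regular timing. The onset times below are invented for illustration and do not reproduce the cited data.

```python
from statistics import mean, pstdev

def timing_regularity(onsets_s):
    """Coefficient of variation of inter-onset intervals.
    Lower values indicate more regularly timed movements."""
    intervals = [b - a for a, b in zip(onsets_s, onsets_s[1:])]
    return pstdev(intervals) / mean(intervals)

# Hypothetical movement onsets (seconds) for one infant in each condition.
to_regular_song = [0.0, 0.52, 1.01, 1.55, 2.04, 2.58]
to_irregular_sound = [0.0, 0.31, 1.22, 1.49, 2.47, 2.71]
cv_song = timing_regularity(to_regular_song)
cv_irregular = timing_regularity(to_irregular_sound)
```

In this toy case the near-isochronous onsets produced during the regular song yield a much lower CV than the erratic onsets, the kind of contrast reported between rhythmically regular and irregular stimuli.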
38.4 Early Vestibular, Tactile, and Somatosensory Experiences with Rhythm
Movement plays a significant role in rhythm processing, so hearing is evidently not the only sensory modality involved in perceiving the rhythm of music and speech. Most importantly, infants start perceiving rhythm through vestibular, tactile, and somatosensory (VTS) experiences (Provasi et al., Reference Provasi, Anderson and Barbu-Roth2014) well before they start moving in response to music and ID singing. Specifically, the first experiences with rhythm occur already in utero, encompassing multiple sources of rhythmic stimulation produced by the mother’s body movements (Lecanuet and Schaal, Reference Lecanuet and Schaal2002). As a consequence, vestibular and somatosensory input has been found to be central to rhythm perception after birth and during early development (Phillips-Silver and Trainor, Reference Phillips-Silver and Trainor2005; Tichko et al., Reference Tichko, Kim and Large2022; Trainor et al., Reference Trainor, Gao, Lei, Lehtovaara and Harris2009; and refer to Chapter 4 for a sensorimotor account of multimodal prosody, including the role of the vestibular system). The VTS system is the first to develop in the human fetus: Cutaneous and trigeminal somatosensory receptors mature at around four to seven weeks of gestation (Humphrey, Reference Humphrey1964). While the early emergence of somatosensory anatomy and function is shortly followed by vestibular and chemosensory functioning, auditory and visual functioning are first observed much later in prenatal development (Bremner and Spence, Reference Bremner and Spence2017). From the third trimester of gestation, fetuses perceive rhythmic signals coming from the intrauterine and extrauterine environment through bone conduction (Sohmer et al., Reference Sohmer, Perez, Sichel, Priner and Freeman2001).
Specifically, the maternal voice is transmitted directly to the amniotic fluid via body tissues and bones, with pitch and the first overtones being fully conducted through the spine and the pelvic arch (Petitjean, Reference Petitjean1989). External low frequencies (corresponding to the first four octaves of the piano) are perceived in utero, with bone conduction preserving the rhythmic patterns of both speech and music (Granier-Deferre et al., Reference Granier-Deferre and Busnel2011; Mampe et al., Reference Mampe, Friederici, Christophe and Wermke2009).
VTS mechanisms, crucial for early rhythmic experiences, also provide the context in which fetal “hearing” occurs. Maternal heartbeat, breathing, and walking all produce movement together with sound (Kisilevsky et al., Reference Kisilevsky, Killen, Muir and Low1991), so it is probably the cross-modal temporal synchrony generated by the coupling of VTS and auditory signals that is crucial in shaping rhythm processing in utero (Provasi et al., Reference Provasi, Anderson and Barbu-Roth2014). Consistent with this, premature infants who receive VTS stimulation adapt their breathing to the rate and acceleration of VTS stimuli (Zimmerman and Barlow, Reference Zimmerman and Barlow2012). The very early exposure to the isochronous pulse of maternal heartbeat and breathing patterns may tune the developing neural and physiological systems that support rhythm perception after birth toward familiar patterns (Teie, Reference Teie2016). Memory of these experiences is maintained after birth (Ullal-Gupta et al., Reference Ullal-Gupta, Vanden Bosch der Nederlanden, Tichko, Lahav and Hannon2013), with newborns preferring to listen to music and nursery rhymes heard prenatally compared to unfamiliar tracks (DeCasper and Spence, Reference DeCasper and Spence1986; Hepper, Reference Hepper1991). That is, VTS stimulation combines with auditory input to constitute the prenatal experience of rhythm. The effects of these early multimodal experiences are seen in infants’ preferences for regular, binary rhythms after birth and throughout development (DeCasper and Sigafoos, Reference DeCasper and Sigafoos1983; Doheny et al., Reference Doheny, Hurwitz, Insoft, Ringer and Lahav2012; Lahav et al., Reference Lahav, Saltzman and Schlaug2007; Tichko et al., Reference Tichko, Kim and Large2022).
Passive experience with movements that generate VTS stimulation continues after birth, significantly influencing the perception of auditory rhythm in infants (Phillips-Silver and Trainor, Reference Phillips-Silver and Trainor2005). Across cultures, caregivers use VTS rhythms to calm their newborns, for instance by moving them back and forth or by rocking them while singing or walking (Provasi et al., Reference Provasi, Anderson and Barbu-Roth2014). A recent study demonstrated that the cadence of parent movements and walking is related to the spontaneous motor tempo of young infants (Rocha et al., Reference Rocha, Southgate and Mareschal2021). Furthermore, studies with adult and infant participants showed that body movement can bias the auditory encoding of ambiguous rhythms (Phillips-Silver and Trainor, Reference Phillips-Silver and Trainor2005, Reference Phillips-Silver and Trainor2007). Phillips-Silver and Trainor (Reference Phillips-Silver and Trainor2005) exposed infants to a rhythmically ambiguous auditory pattern while they were bounced in time with a duple or a triple meter. After this exposure, infants preferred the meter pattern to which they were bounced. Watching someone else moving in time with a meter, however, was not sufficient to elicit similar effects in infants or adults, demonstrating that the active movement of the body is crucial in rhythm encoding and processing (Phillips-Silver and Trainor, Reference Phillips-Silver and Trainor2007, Reference Phillips-Silver and Trainor2008). These findings confirm that active and passive motion generating VTS experience not only shapes the early development of structural and functional mechanisms underlying rhythm processing but also influences rhythm processing right after birth and throughout development.
VTS experiences continue to mediate the interaction between rhythm ability and language development throughout infancy and early childhood. VTS and auditory rhythmic abilities have been proposed as a potential scaffold for early interpersonal synchrony between mothers and infants (Trehub, Reference Trehub2003), and VTS–auditory coupling plays a role in sustaining early language acquisition. For instance, synchronous tactile cues were shown to help four- and five-month-olds to find words in continuous speech, promoting word learning (Abu-Zhaya et al., Reference Abu-Zhaya, Seidl and Cristia2017; Seidl and Cristià, Reference Seidl and Cristià2008). Moreover, audio-tactile stimulation resulted in enhanced event-related potential (ERP) responses and higher beta-band activity (at 15−16 Hz) recorded using electroencephalography (EEG), compared to auditory-only stimulation in eight-month-old infants (Tanaka et al., Reference Tanaka, Kanakogi, Kawasaki and Myowa2018). Consistent with this, parents tend to spontaneously synchronize the location and timing of their touches on the infant’s body with word rhythm in ID speech (Custode and Tamis-LeMonda, Reference Custode and Tamis-LeMonda2020; Lew-Williams et al., Reference Lew-Williams, Ferguson, Abu-Zhaya and Seidl2019; Tincoff et al., Reference Tincoff, Seidl, Buckley, Wojcik and Cristia2019). Later in development, preschoolers who can entrain their motion to an external beat display more faithful neural encoding of temporal modulations in speech and achieve higher scores on tests of early language skills (Woodruff Carr et al., Reference Woodruff Carr, White-Schwoch, Tierney, Strait and Kraus2014).
On the other hand, deficits in rhythmic, VTS, and sensorimotor abilities are shared across atypically developing populations, including individuals with developmental dyslexia (Beker et al., Reference Beker, Foxe and Molholm2021; Goswami, Reference Goswami2002; Power et al., Reference Power, Mead, Barnes and Goswami2013), attention deficit disorder (Carrer, Reference Carrer2015; Puyjarinet et al., Reference Puyjarinet, Bégel, Lopez, Dellacherie and Dalla Bella2017), autism (Fitzpatrick et al., Reference Fitzpatrick, Romero and Amaral2017; Franich et al., Reference Franich, Wong, Yu and To2021, see also Chapter 47 on rhythmic-prosodic synchrony in speakers with and without autism), and developmental coordination disorder (Chang et al., Reference Chang, Li and Chan2021; Puyjarinet et al., Reference Puyjarinet, Bégel, Lopez, Dellacherie and Dalla Bella2017; Trainor et al., Reference Trainor, Chang, Cairney and Li2018). For instance, individuals with dyslexia who exhibit weaker performance in rhythm perception and production tasks also tend to show weaker phonological awareness (Flaugnacco et al., Reference Flaugnacco, Lopez and Terribili2014; Forgeard et al., Reference Forgeard, Schlaug and Norton2008; Goswami et al., Reference Goswami, Gerson and Astruc2010; Huss et al., Reference Huss, Verney, Fosker, Mead and Goswami2011; Kalashnikova et al., Reference Kalashnikova, Burnham and Goswami2021; Lee et al., Reference Lee, Sie, Chen and Cheng2015; Thomson and Goswami, Reference Thomson and Goswami2008) and poorer reading skills (Dellatolas et al., Reference Dellatolas, Watier, Le Normand, Lubart and Chevrie-Muller2009; Flaugnacco et al., Reference Flaugnacco, Lopez and Terribili2015; Goswami et al., Reference Goswami, Gerson and Astruc2010, Reference Goswami, Huss, Mead, Fosker and Verney2013; Muneaux et al., Reference Muneaux, Ziegler, Truc, Thomson and Goswami2004; Thomson and Goswami, Reference Thomson and Goswami2008).
Furthermore, individuals with dyslexia also demonstrate impaired processing of auditory rise times, which has been linked to inefficient entrainment between the neural oscillatory activity and the speech signal (Huss et al., Reference Huss, Verney, Fosker, Mead and Goswami2011; Leong et al., Reference Leong, Hämäläinen, Soltész and Goswami2011; Chapter 35).
38.5 Directions for Future Research
This chapter highlights the need for research on early rhythm development to focus on understanding how infants’ rhythmic experiences intertwine across modalities, with VTS, auditory, visual, and sensorimotor inputs being simultaneously perceived and processed in an integrated manner. Specifically, it is vital to understand how these different input modalities interact and are integrated, as well as their combined effects on infants’ language development. Recent methodological and technological advances offer the opportunity to investigate this multimodal integration in increasingly ecologically valid paradigms (see Chapter 36). This can be achieved by using infant-friendly neurophysiological and neuroimaging techniques (e.g., EEG, functional near infrared spectroscopy) combined with online behavioral measures of infants’ gaze or motion (e.g., Nguyen et al., Reference Nguyen, Reisner and Lueger2023; Rocha et al., Reference Rocha, Attaheri and Choisdealbha2024), which provide temporally precise indices of infants’ rhythm perception and production. These measures can now be successfully assessed in the context of live caregiver–infant interactions (e.g., Haresign et al., Reference Haresign, Phillips and Whitehorn2022; Nguyen et al., Reference Nguyen, Abney, Salamander, Bertenthal and Hoehl2021; Piazza et al., Reference Piazza, Hasenfratz, Hasson and Lew-Williams2020; Wass et al., Reference Wass, Whitehorn, Haresign, Phillips and Leong2020), which also capture the interpersonal synchrony during rhythm production and the dynamic changes in caregivers’ behaviors produced in response to the infants’ communicative cues.
This research will also have significant practical applications, such as the development of rhythm-based intervention programs for infants with developmental or neurological disorders, given that rhythm training has been shown to improve language and reading-related abilities (Bonacina et al., Reference Bonacina, Cancer, Lanzi, Lorusso and Antonietti2015; Flaugnacco et al., Reference Flaugnacco, Lopez and Terribili2015; Habib et al., Reference Habib, Lardy and Desiles2016; Overy, Reference Overy2000; Thomson et al., Reference Thomson, Leong and Goswami2013). Evidence for the effectiveness of rhythmic training outside the auditory and visual modalities, which remains highly limited to date, will also provide key information for optimizing the early rhythmic experiences of infants affected by sensory deprivation early in life (e.g., Hidalgo et al., Reference Hidalgo, Falk and Schön2017; Holland et al., Reference Holland, Bouwer, Dalgelish and Hurtig2010; Karam et al., Reference Karam, Russo, Branje, Price and Fels2008; Özcan et al., Reference Özcan, Caligiore, Sperati, Moretta and Baldassarre2016; Petitto et al., Reference Petitto, Holowka, Sergio and Ostry2001; Russo, Reference Russo2023).
38.6 Conclusion
This chapter has reviewed the growing evidence that ID communication is rich in rhythmic information, supports the development of infants’ early rhythm perception and production abilities, and plays a fundamental role in the process of early language development. This evidence demonstrates that ID speech, infants’ primary source of linguistic information, exhibits greater rhythmic regularity than AD speech, which facilitates infants’ encoding and processing of this register. This rhythmic regularity is also a characteristic of ID singing, a prominent signal used in natural caregiver–infant interactions. Most importantly, these auditory signals are not the only sources of regular rhythmic information available to infants. Young infants experience rhythm passively and actively across multiple modalities, including auditory, visual, tactile, and sensorimotor. The integration of rhythmic information from these different sources sustains the encoding of auditory rhythmic stimuli from the earliest stages of development and supports infants in the task of extracting rhythmic information specific to the language or languages spoken in their environment.
38.7 Acknowledgements
This work was supported by the Basque Government through the BERC 2022–2025 program and funded by the Spanish State Research Agency through BCBL Severo Ochoa excellence accreditation CEX2020–001010/AEI/10.13039/501100011033. Marina Kalashnikova’s work was supported by the Spanish State Research Agency through the Ramon y Cajal research fellowship, RYC2018–024284-I. Laura Fernández-Merino’s work was supported by a Predoctoral Grant from the Spanish Ministry of Science, Innovation and Universities and the European Social Fund, PRE2019–087623. Sofia Russo’s work was supported by a Postdoctoral Grant from the Department of Developmental Psychology and Socialization, Università degli Studi di Padova Rif. 2022ASSDPSS14.
Summary
Sensitivity to their native language’s rhythmic patterns allows infants to segment continuous speech. ID speech contains rhythmic cues, but it is not the only mode of ID communication to do so. Early development of rhythmic skills is supported by infants’ experience with rhythm across auditory, visual, tactile, and sensorimotor modalities.
Implications
Infants’ rhythm experience is not restricted to the auditory domain. Future research should investigate how infants’ rhythmic experiences intertwine and become integrated across sensory modalities involved in ID communication. This understanding is vital for defining the environmental factors that facilitate the development of early rhythmic abilities and language acquisition.
Gains
Infants experience optimized intersensory rhythmic input. This multisensory stimulation plays an important role in facilitating efficient entrainment between infants’ oscillatory neural activity and the input signal as well as in promoting interpersonal synchronization within the caregiver–infant dyad, both critical for successful language acquisition and healthy socio-cognitive development.