37.1 Introduction
To learn about their world, infants have to make sense of the ‘great blooming, buzzing confusion’ of their environment (James, 1890). They have to learn that the streams of sound their caregivers emit are communicative and meaningful. Acquiring language is a key developmental milestone that children reach in their early years. Even before their first birthday, infants learn many things about speech and language. For example, in their first year of life, infants discover which sounds are meaningful in their native language (Kuhl, 2004; Werker and Tees, 1984), they learn to segment words from the continuous speech stream (Jusczyk, 1999), and they start to link these word forms to meaning (Johnson, 2016). Infants’ brains are ‘language-ready’ (Hagoort, 2017), but their brains are also still rapidly developing in interaction with their environment (Westermann, 2016). The environment in which language learning must occur is usually noisy, with many possible referents, cluttered visual information (Yu et al., 2021), and auditory background noise. We now know that children are active learners (Bazhydai et al., 2020; Begus et al., 2016; Kidd et al., 2012; Stahl and Feigenson, 2015): they selectively attend to important information. Being able to select the relevant information for language learning enables language growth (D’souza et al., 2017). It is essential to know which neural processes help children in this attentional selection for language learning, and how environmental cues and neural maturation influence these processes. This insight will help us understand individual differences in language development and give clues for providing an optimal learning situation in both typical and atypical development.
The current chapter will showcase the potential importance of neural tracking, that is, the alignment between neural activity and rhythmic speech patterns, for attentional selection during the development of speech processing. We will review recent research on neural speech tracking in infants and its relation to later language development. Finally, we will discuss how electrophysiological maturation across infancy may change neural tracking and influence the trajectory of both typical and atypical language development.
37.1.1 Using Rhythm for First-Language Acquisition
One important cue that infants use for language learning is rhythm (Gleitman and Wanner, 1982). Newborns can already distinguish different languages based on their rhythmic characteristics (Nazzi et al., 1998; Ramus and Mehler, 1999; Ramus et al., 1999). Seven- to eight-month-olds use rhythmic properties to segment words from a continuous speech stream (Johnson and Jusczyk, 2001; Jusczyk et al., 1999). This has been proposed to be an important bootstrapping mechanism for language learning (Gervain et al., 2020; Höhle, 2009).
Our hypothesis is that the oscillatory properties of the human brain are particularly suited to pick up rhythmic properties of language. In the current chapter, we specify how the neural tracking of rhythmic speech properties might help children to selectively attend to important information in their input, thus paving the way for language learning.
37.1.2 Proposal: The Importance of Neural Tracking for Temporal Attention and the Impact of Maturation
We here propose that rhythmic neural tracking of speech (Giraud and Poeppel, 2012), that is, the synchronisation between neural oscillations and speech rhythm, is central to active language learning. Neural oscillations provide temporal windows of alternating reduced and enhanced excitability (Buzsáki and Watson, 2022; Fries, 2015; VanRullen, 2016), enabling more effective processing at high-excitability states (Lakatos et al., 2013; VanRullen, 2016). Neural synchronisation to external stimuli has been proposed to allow for sensory selection (Schroeder and Lakatos, 2009). During speech processing, neural tracking of speech acoustics might help to group information into analysable units such as words and phrases (Ding and Simon, 2014; Goswami, 2018; Keitel and Gross, 2016; see also Chapters 3, 5, and 35). Specifically, we propose that neural tracking assists language development by guiding infants’ attention towards informative units in speech, helping infants learn to segment the continuous speech signal into such units and from there bootstrap language learning (Gervain et al., 2020; Höhle, 2009). It is important to realise that the infant brain is still rapidly developing, with electrophysiological brain activity speeding up over the course of infancy (Anderson and Perone, 2018; Cellier et al., 2021; Menn et al., 2023a). We hypothesise that this electrophysiological maturation gives rise to different processing constraints and opportunities at different points in development, with optimal analysis time windows shifting as the child matures.
37.2 Neural Tracking of Speech
In adults, it is now well established that rhythmic properties of speech set up a predictive context (Rothermich and Kotz, 2013) that is crucial for speech decoding (Gagnepain et al., 2012; Rimmele et al., 2018; Zion Golumbic et al., 2012). Rhythm in speech is most obvious in the amplitude envelope modulation of the speech waveform (see Figure 37.1; Giraud and Poeppel, 2012; Goswami, 2012), with clear peaks from 2 to 10 Hz across languages, corresponding to the syllable rate (see Poeppel and Assaneo, 2020, for a recent review). At a higher frequency (~30–50 Hz), modulations are associated with phonemic features, and at a lower temporal modulation rate (<4 Hz) with prosodic stress and lexical and phrasal structure, for example through the intonation contour (Giraud and Poeppel, 2012; Rosen et al., 1992).
Figure 37.1 Illustration of neural tracking of speech.
Electrophysiological activity in the delta and theta range is assumed to synchronise to amplitude modulations in speech (see also Chapter 35). The line above the speech signal displays the amplitude envelope. Note that the delta and theta bands are lower in infants than the canonical frequency bands in adults (Anderson and Perone, 2018; Cellier et al., 2021), and that speech rates in infant-directed speech are typically slower than in adult-directed speech, with ~3–6 Hz as the typical infant-directed syllable rate (Cox et al., 2023; Raneri et al., 2020).

Figure 37.1 long description: delta (<4 Hz) and theta (4–8 Hz) waveforms are shown above the speech waveform of the utterance ‘Mommy’s speaking slowly’, with the prosodic and syllabic rates marked by broken lines.
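To make the envelope notion concrete, the following minimal sketch extracts the amplitude envelope of a recording via the Hilbert transform and locates the dominant modulation rate; for typical infant-directed speech one would expect a peak near the 3–6 Hz syllable rate mentioned above. The synthetic ‘recording’, cut-off, and search band are illustrative assumptions, not parameters from the studies reviewed here.

```python
import numpy as np
from scipy.signal import butter, filtfilt, hilbert, welch

def amplitude_envelope(audio, fs, lp_cutoff=20.0):
    """Broadband amplitude envelope: magnitude of the analytic signal,
    low-pass filtered to retain only slow modulations (< lp_cutoff Hz)."""
    env = np.abs(hilbert(audio))
    b, a = butter(4, lp_cutoff / (fs / 2.0), btype="low")
    return filtfilt(b, a, env)

def modulation_spectrum(env, fs):
    """Power spectrum of the envelope; its peaks mark the dominant
    modulation rates (e.g., the syllable rate) of the recording."""
    return welch(env, fs=fs, nperseg=int(4 * fs))  # 4-s windows -> 0.25 Hz resolution

# Toy stand-in for a speech recording: a noise carrier whose amplitude
# is modulated at a 4 Hz 'syllable rate' (replace with a real waveform).
fs = 16000
t = np.arange(int(30 * fs)) / fs
rng = np.random.default_rng(0)
audio = (1 + np.sin(2 * np.pi * 4.0 * t)) * rng.standard_normal(t.size)
freqs, power = modulation_spectrum(amplitude_envelope(audio, fs), fs)
in_band = (freqs >= 1.0) & (freqs <= 10.0)
print("Peak modulation rate: %.2f Hz" % freqs[in_band][np.argmax(power[in_band])])
```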
By now, it is well established that electrophysiological brain activity tracks the temporal modulations in speech (Figure 37.1; Gross et al., 2013; Luo and Poeppel, 2007; see Poeppel and Assaneo, 2020, for a review). In the brain, alternating periods of excitation and inhibition result in rhythmic fluctuations of neuronal activity. The speed of fluctuation depends on internal neuronal frequency properties and differs between neuronal populations (Buzsáki, 2006; Buzsáki and Watson, 2022; Hutcheon and Yarom, 2000). Rhythmic fluctuations across larger groups of neurons can be measured as neural oscillations at different frequencies on the scalp using electroencephalography (EEG). Neural oscillations provide windows of alternating reduced and enhanced excitability, creating temporal windows for analysing and grouping information (Buzsáki and Watson, 2022; Fries, 2015; VanRullen, 2016). At rest, oscillations in the auditory cortex are hierarchically organised into the delta (<4 Hz), theta (4–8 Hz), and gamma (>30 Hz) range (Giraud et al., 2007; Keitel and Gross, 2016), and thus closely match the frequencies of stress patterns, syllables, and phonemes. This has led to the proposal that rhythmic properties of speech entrain neuronal firing (Giraud and Poeppel, 2012; Gross et al., 2013; Lalor and Foxe, 2010; Luo and Poeppel, 2007; Peelle and Davis, 2012), causing neurons to align both the frequency and the phase of their firing patterns to the input (Regan, 1977; Zaehle et al., 2010). This rhythmic neural tracking enables the formation of temporal predictions about salient events in the input, ensuring that the brain is most excitable at times when the speech signal carries the most information (Lakatos et al., 2013; Large and Jones, 1999; Rimmele et al., 2018; Schroeder and Lakatos, 2009). This helps in grouping information into analysable units such as words, syllables, and phrases (Ding and Simon, 2014; Goswami, 2018; Keitel et al., 2018) and facilitates speech processing (Cason and Schön, 2012; Doelling et al., 2014; Henry and Obleser, 2012; Keitel et al., 2018; Peelle et al., 2013; see Meyer, 2018, for a review) by assisting the segmentation and identification of linguistic units from speech.
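In practice, tracking in a given frequency band is often quantified as the coherence or phase alignment between the EEG and the speech envelope. The sketch below computes magnitude-squared coherence on synthetic data, assuming EEG and envelope have already been resampled to a common rate; the 100 Hz rate, segment length, and toy signals are assumptions for illustration, not the analysis pipeline of any study cited here.

```python
import numpy as np
from scipy.signal import coherence

def speech_brain_coherence(eeg, envelope, fs, fmin, fmax):
    """Magnitude-squared coherence between one EEG channel and the speech
    amplitude envelope, averaged over the band [fmin, fmax] Hz.
    Both inputs must share the sampling rate fs and have equal length."""
    freqs, coh = coherence(eeg, envelope, fs=fs, nperseg=int(10 * fs))
    band = (freqs >= fmin) & (freqs <= fmax)
    return coh[band].mean()

fs = 100.0                                   # common post-resampling rate (illustrative)
rng = np.random.default_rng(0)
envelope = rng.standard_normal(int(120 * fs))        # stand-in speech envelope
eeg = envelope + rng.standard_normal(envelope.size)  # toy 'tracking' EEG: envelope + noise
print("Delta-band coherence: %.3f" % speech_brain_coherence(eeg, envelope, fs, 1.0, 4.0))
```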
Natural speech is not perfectly rhythmic, and bottom-up cues alone might be insufficient to explain the synchronisation between neural activity and the speech envelope (Meyer et al., 2020). Indeed, speech tracking has been found to be subject to cross-modal influences as well as to top-down modulation by linguistic knowledge and attention. The influence of linguistic knowledge on neural tracking has, for example, been shown by Ding et al. (2016), while other studies have confirmed that neural tracking is modulated by semantic content (Broderick et al., 2019; Kaufeld et al., 2020).
In addition to linguistic knowledge, visual information also affects neural tracking of speech (Crosse et al., 2015; Power et al., 2012a; Zion Golumbic et al., 2013). Rhythmic movements of the mouth, lips, and jaw often occur in synchrony with the auditory signal, even slightly preceding it (Chandrasekaran et al., 2009). This makes facial cues important for following or even predicting the rhythm of speech and thus likely aiding speech tracking (Bourguignon et al., 2020; Park et al., 2016, 2018; Zoefel, 2021). Indeed, visual information from mouth movements aids in synchronising neural oscillations in both adults (Bauer et al., 2020; Biau et al., 2021; Bourguignon et al., 2020; Peelle and Sommers, 2015; Thézé et al., 2020; Zoefel, 2021) and children (Power et al., 2012b; but see Çetinçelik et al., 2023, 2024).
Finally, neural tracking is also modulated by attention. Speech is rarely heard under ideal acoustic conditions, so listeners must selectively attend to the relevant speech stream and filter out irrelevant noise. Neural entrainment has been proposed to be a core mechanism for attentional selection, maximising temporal attention onto the behaviourally important parts of the signal (Lakatos et al., 2008; Obleser and Kayser, 2019; Zion Golumbic et al., 2012). Indeed, when presented with multiple talkers simultaneously, rhythmic neural tracking helps listeners attend to one of multiple speech streams, and synchronisation reflects the attended speaker (O’Sullivan et al., 2015; Power et al., 2012b; Zion Golumbic et al., 2012).
It is good to realise that the speech–brain synchronisation measured in most studies need not arise from an alignment of ongoing endogenous neural oscillations, that is, from underlying oscillatory activity whose phase is shifted by the rhythmic input (Figure 37.1). Instead, the synchronisation may reflect a series of auditory responses evoked by acoustic extrema in the speech signal, which are superimposed on neural activity and thus appear at the same frequency as the speech rhythm (see, for example, Keitel et al., 2021). Recent evidence regarding the involvement of genuine oscillations in speech tracking shows that, at least in some cases, rhythmic responses persist even after stimulation has ended (van Bree et al., 2021; Zoefel et al., 2018). This suggests an involvement of oscillatory entrainment in speech tracking, likely in combination with evoked responses (Doelling et al., 2019).
It is important to keep the distinction between evoked and oscillatory accounts in mind when interpreting findings from neural tracking. However, oscillations have been argued to reflect basic operating mechanisms of the brain, which are employed by specialised cognitive processes (Friederici and Singer, 2015; Fries, 2015). During speech processing, the brain needs to flexibly adapt its operating frequencies to the speech characteristics. Even evoked responses will therefore necessarily, at least to a certain degree, occur within the frequency ranges that the brain is able to process and communicate in. As we will argue below, maturation of the underlying oscillatory circuits during infancy constrains the information that infants can process and thus affects neural tracking – even if the underlying mechanism were evoked rather than entrained.
37.3 Neural Speech Tracking in Infants
In recent years, there has been increasing evidence that electrophysiological activity in the infant brain already tracks the rhythm of speech (Attaheri et al., 2022; Menn et al., 2022a; Ortiz Barajas et al., 2021). In particular, newborns have been shown to track the syllable rate (3–6 Hz) of simple repeated sentences in both their native and a non-native language (Ortiz Barajas et al., 2021). This study did not test the tracking of other rhythms, leaving it unclear whether newborns already track the slow prosodic (stress) rate and the fast phoneme rate in speech. The youngest age at which tracking of prosodic stress has been shown is four months: four-month-olds were found to track sung nursery rhymes at the delta and theta rates (Attaheri et al., 2022). However, infants’ early focus on prosody (Nazzi et al., 2000) makes it likely that they track prosodic stress even earlier. The youngest infants tested for phoneme-rate tracking are the 10-month-olds of Menn et al. (2022b), who showed significant tracking of the phoneme rate of spoken nursery rhymes. More research is needed to investigate the onset of neural tracking of speech at the prosodic stress rate and the phoneme rate.
At least by seven months of age, infants do not require perfectly rhythmic speech for neural tracking but can also track natural speech, such as cartoons (Jessen et al., 2019), maternal speech in natural interactions (Menn et al., 2022a), and live maternal singing (Nguyen et al., 2023). Given that natural speech can at most be considered quasi-rhythmic (Jadoul et al., 2016; Turk and Shattuck-Hufnagel, 2013), robust synchronisation of neural activity to speech likely requires continuous updating through top-down modulation. As in adults, there is some evidence that infants’ neural tracking of speech is modulated by visual information, linguistic knowledge, and attention.
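Methodologically, tracking of such naturalistic, quasi-rhythmic stimuli is often quantified with temporal response functions (TRFs): regularised regressions mapping the speech envelope onto the EEG, as in several of the infant studies cited in this section. Below is a minimal ridge-regression sketch on synthetic data; the lag range, regularisation strength, and toy signals are illustrative assumptions rather than settings from any cited study.

```python
import numpy as np

def lagged_design(stim, lags):
    """Design matrix whose columns contain the stimulus shifted by each
    lag (in samples): X[t, j] = stim[t - lags[j]]."""
    X = np.zeros((stim.size, len(lags)))
    for j, lag in enumerate(lags):
        if lag >= 0:
            X[lag:, j] = stim[:stim.size - lag]
        else:
            X[:lag, j] = stim[-lag:]
    return X

def fit_trf(stim, eeg, fs, tmin=-0.1, tmax=0.4, lam=100.0):
    """Ridge regression from envelope to EEG; returns lags (s) and TRF weights."""
    lags = np.arange(int(tmin * fs), int(tmax * fs) + 1)
    X = lagged_design(stim, lags)
    w = np.linalg.solve(X.T @ X + lam * np.eye(len(lags)), X.T @ eeg)
    return lags / fs, w

fs = 100.0
rng = np.random.default_rng(1)
env = rng.standard_normal(int(60 * fs))                       # toy envelope
eeg = np.convolve(env, np.hanning(20), mode="same") \
      + rng.standard_normal(env.size)                         # toy smeared response
times, trf = fit_trf(env, eeg, fs)
print("TRF peak at %.0f ms" % (1000 * times[np.argmax(np.abs(trf))]))
```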
Tan et al. (2022) compared neural tracking in visual-only, auditory-only, and audiovisual speech, finding an audiovisual speech benefit for five-month-old infants and adults, but not for four-year-olds. Another study did not find a benefit of visual cues in 10-month-olds (in ideal listening conditions: slow infant-directed speech (IDS) without background noise), showing equally robust neural tracking of audiovisual speech when visual cues were present versus when they were blocked (Çetinçelik et al., 2024). Possibly, infant brains particularly rely on audiovisual information prior to the onset of linguistic knowledge. At later ages, the audiovisual speech benefit is largest in relatively noisy and challenging conditions (Ross et al., 2006; Sumby and Pollack, 1954).
Evidence for an influence of linguistic knowledge on neural tracking of speech acoustics in infants currently comes only from artificial-language studies showing that statistical learning modulates tracking of artificial speech. Choi et al. (2020) presented six-month-old infants with trisyllabic pseudowords concatenated into continuous syllable strings, presented at a fixed syllable rate while the infants’ EEG was recorded. While the infants initially showed synchronisation to the syllable rate only, they transitioned to neural tracking of both the syllable rate and the rate of the trisyllabic pseudowords by the end of the experiment. This progression to tracking of the pseudoword rate indicates a top-down influence of newly acquired knowledge on neural tracking of the artificial speech stream, though studies on naturalistic speech are currently still lacking.
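The logic of such fixed-rate designs can be illustrated with frequency tagging: when syllables are presented at, say, 4 Hz, trisyllabic pseudowords recur at 4/3 Hz, and learning is inferred from the emergence of a response at the word rate. Below is a toy sketch of one common measure, inter-trial phase coherence; the rates, trial counts, and simulated signals are invented for illustration and are not the parameters of Choi et al. (2020).

```python
import numpy as np

def itc_at_frequency(trials, fs, freq):
    """Inter-trial phase coherence at one frequency: length of the mean
    unit phase vector across trials (0 = random, 1 = perfectly locked)."""
    n = trials.shape[1]
    fft_bin = int(round(freq * n / fs))                 # FFT bin closest to target freq
    phases = np.angle(np.fft.rfft(trials, axis=1)[:, fft_bin])
    return np.abs(np.mean(np.exp(1j * phases)))

fs, dur, n_trials = 250.0, 6.0, 60                      # illustrative parameters
syll_rate, word_rate = 4.0, 4.0 / 3.0                   # syllable vs pseudoword rate
t = np.arange(int(dur * fs)) / fs
rng = np.random.default_rng(2)
# Toy EEG: a syllable-rate response in all trials; a word-rate response is
# added only in the 'late' trials, mimicking emerging word segmentation.
early = np.stack([np.sin(2 * np.pi * syll_rate * t) + rng.standard_normal(t.size)
                  for _ in range(n_trials)])
late = early + np.sin(2 * np.pi * word_rate * t)
for name, data in [("early", early), ("late", late)]:
    print("%s word-rate ITC: %.2f" % (name, itc_at_frequency(data, fs, word_rate)))
```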
In addition to visual information and linguistic knowledge, infants’ neural tracking also likely benefits from attentional selection. Kalashnikova et al. (2018) observed stronger tracking of IDS compared to adult-directed speech (ADS) in seven-month-old infants. The authors attribute this IDS tracking benefit to infants’ increased attention to IDS (Cooper and Aslin, 1990; Frank et al., 2020). It should be noted, though, that the studies by Tan et al. (2022) and Çetinçelik et al. (2024) observed no relationship between attention (to visual cues) and neural tracking. It is also possible that the IDS tracking benefit is based on increased amplitude modulations at the rate of prosodic stress in IDS relative to ADS (Leong et al., 2017; Menn et al., 2022a; Räsänen et al., 2018).
37.4 Infants’ Neural Tracking and Their Later Language Development
Multiple studies suggest that infants’ rhythmic neural tracking of speech relates to language abilities. Snijders (2020) demonstrated that 7.5-month-olds’ neural tracking of spoken nursery rhymes at the rhythm of stressed syllables (1.5–2 Hz) relates to their word segmentation abilities at nine months. Expanding on this finding, neural tracking at the stressed-syllable rate at 10 months has been found to predict vocabulary development at two years (Menn et al., 2022b) and at 18 months (Çetinçelik et al., 2024). The predictive effect of the tracking of slow rhythms (0.5–4 Hz) in speech for vocabulary development was replicated by Attaheri et al. (2024) using spoken nursery rhymes.
Interestingly, some studies provide evidence for a relationship between neural tracking at the syllable rate, rather than the stressed-syllable rate, and vocabulary acquisition. Both Hahn and Snijders (2023) and Çetinçelik et al. (2023) found a positive relationship between 10-month-olds’ neural tracking of speech at the syllable rate and vocabulary growth until 18 months. Note that the syllable rates in these studies were relatively low, as they were based on the syllable rate of the actual IDS stimuli used in the experiments – resulting in a syllable rate within the canonical delta frequency range (2.5–3.5 Hz both in Hahn and Snijders, 2023, and in the studies of Çetinçelik et al.).
Taken together, recent studies provide evidence that rhythmic neural tracking of speech predicts word segmentation and later vocabulary, but further studies are needed to establish whether tracking of specific frequency ranges related to stimulus characteristics is especially relevant for language acquisition or whether there is a more general role for neural tracking in the delta frequency range.
37.5 Possible Mechanisms
As presented above, speech tracking at specific frequency ranges might be related to later language development. One interpretation would be that infants who preferentially track in a specific frequency range are somehow at an advantage for language development. This may be a result of their individual ‘electrophysiological profile’, that is, the location and power distribution of prominent peaks (and potentially also non-rhythmic, aperiodic activity) in the infant’s electrophysiological spectrum (Ostlund et al., 2022). In particular, individual differences in electrophysiological maturation between infants will lead to differences in the spectral characteristics of their brain rhythms, allowing them to process information at different frequencies. Infants whose electrophysiological profile leads them to preferentially track at the stressed-syllable rate may benefit in their use of rhythmic cues for word segmentation. Another interpretation would be that tracking is flexible, with infants adapting the frequency they track depending on which parts of the input signal they currently attend to. In this interpretation, tracking at specific frequency ranges for specific stimuli might be beneficial for language development. Neural speech tracking would then reflect infants’ attention to specific parts of the speech signal (e.g., stressed syllables), simultaneously acting as a core mechanism for maximising temporal attention on these parts (Lakatos et al., 2008; Obleser and Kayser, 2019; Zion Golumbic et al., 2012). We would like to argue for a combination of the two interpretations: neural speech tracking maximises the uptake of relevant information from the noisy multimodal environment, while being constrained by the maturation of the underlying oscillatory circuits (see Haegens and Zion Golumbic, 2017; Meyer et al., 2020; Rimmele et al., 2018, for related accounts of adult speech processing). Successful speech processing and language learning require neural activity to adapt flexibly to the quasi-rhythmic input, which can only occur within the limits of the developing neural system (Menn et al., 2023a).
37.5.1 Maturational Constraints
To understand the mechanistic role of neural tracking in language development, and how this changes with age, we need to take brain maturation into account. The system’s constraints change with maturation, which will impact what language processing is possible. The infant brain is not fully developed at birth, and maturational aspects of the brain are reflected in its electrophysiology (Hill et al., 2022; Schaworonkow and Voytek, 2021; Vanhatalo and Kaila, 2006). In infancy, slow neural oscillations are predominant, and there is a general speed-up of electrophysiological rhythms across early childhood (Anderson and Perone, 2018; see Figure 37.2). The individual alpha peak frequency (iAPF) is one of the most robust markers of cerebral maturation (Rodríguez-Martínez et al., 2017; Valdés-Sosa et al., 1990). In the developing brain, the dominant alpha rhythm at posterior sites gradually shifts from 3–6 Hz in infants to 8–12 Hz in adulthood (Cellier et al., 2021; Gable et al., 2022; Marshall et al., 2002; Schaworonkow and Voytek, 2021; Stroganova et al., 1999). The gradual increase of the iAPF is possibly a product of increased myelination (i.e., the formation of white matter tracts in the brain, especially between thalamus and cortex; Freschl et al., 2022; Segalowitz et al., 2010). A faster iAPF has been related to increases in the speed of information processing (Klimesch et al., 1996; Surwillo, 1961) and attentional performance (Tröndle et al., 2022). In particular, the iAPF has been hypothesised to reflect the size of the temporal integration window (Bastiaansen et al., 2020; Cecere et al., 2015; VanRullen, 2016; White, 1963; but see Buergers and Noppeney, 2022; London et al., 2022; Ruzzoli et al., 2019), that is, the time needed to separate two events (either cross-modal or within a modality). This means that a faster iAPF makes it easier to segregate information occurring in quick temporal succession, indicating that the acceleration in iAPF across infancy and childhood will allow children to dissociate information at smaller temporal intervals as they mature.
In line with this, infants initially have very long temporal integration windows (Hochmann and Kouider, 2022; Tsurumi et al., 2021): while adults can differentiate tones separated by as little as 20 ms (Giraud, 2020; Joliot et al., 1994), 7.5-month-old infants need a difference of ~150 ms in tone onsets to process two tones as separate (Benasich and Tallal, 2002). The iAPF might thus reflect processing constraints that change with development, determining the limits of neural temporal processing. Notably, alpha activity is not classically associated with neural tracking during speech processing (see Meyer, 2018, for a review). However, the acceleration of alpha and the corresponding shortening of temporal integration windows have been associated with better audiovisual integration abilities (Ronconi et al., 2023; Zhou et al., 2022). Given the importance of visual cues for infants’ language acquisition (Çetinçelik et al., 2021; Hollich et al., 2005), and potentially also for the neural tracking of audiovisual speech (Power et al., 2012b), maturation of the peak alpha frequency may be especially relevant to the development of audiovisual speech processing.
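To make the iAPF measure concrete: it is typically read off as the frequency of the dominant spectral peak over posterior channels, ideally after accounting for the aperiodic (1/f-like) background of the spectrum (cf. Ostlund et al., 2022). The sketch below uses a crude linear 1/f fit in log-log space as a simplifying assumption; dedicated tools fit the periodic and aperiodic components more carefully, and the toy signal stands in for real posterior EEG.

```python
import numpy as np
from scipy.signal import welch

def individual_alpha_peak(eeg, fs, fmin=2.0, fmax=15.0):
    """Estimate the iAPF as the largest spectral peak above a linear 1/f
    fit in log-log space. A wide search range is used because the infant
    'alpha' rhythm sits at 3-6 Hz and reaches 8-12 Hz only in adulthood."""
    freqs, pxx = welch(eeg, fs=fs, nperseg=int(2 * fs))
    sel = (freqs >= fmin) & (freqs <= fmax)
    logf, logp = np.log10(freqs[sel]), np.log10(pxx[sel])
    slope, intercept = np.polyfit(logf, logp, 1)     # crude aperiodic fit
    residual = logp - (slope * logf + intercept)     # periodic part only
    return freqs[sel][np.argmax(residual)]

fs = 250.0
rng = np.random.default_rng(3)
t = np.arange(int(60 * fs)) / fs
# Toy 'posterior EEG': 1/f-like (Brownian) noise plus a 5 Hz infant alpha rhythm.
eeg = np.cumsum(rng.standard_normal(t.size)) * 0.02 + np.sin(2 * np.pi * 5.0 * t)
print("Estimated iAPF: %.2f Hz" % individual_alpha_peak(eeg, fs))
```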
Figure 37.2 Overview of EEG maturation in infancy.
Electrophysiological activity during speech processing in infancy (modelled on Menn et al., 2023b). Slow electrophysiological activity (<5–10 Hz) is initially prevalent, and there is a general acceleration in the frequencies of electrophysiological activity across early childhood (A). High-frequency activity starts to emerge around six months of age (B).

Figure 37.2 long description: panel (A) plots frequency against age in months, with darker shading indicating higher activity levels (colour scale roughly −20 to 30 dB); panel (B) plots power spectra (roughly 2–48 Hz) at 1, 6, 12, and 18 months of age, each declining with frequency.
In addition to the increase in iAPF, fast oscillatory activity (i.e., gamma-band rhythms) only gradually emerges in the infant brain, based on continuous changes in the excitation–inhibition balance across development. In the adult brain, there is a balance between excitatory and inhibitory activity, and neural excitation is followed by roughly proportional inhibition (Shu et al., 2003). The excitation–inhibition balance matures with development, giving rise to windows of plasticity in which it is optimal for neural plasticity and learning, thus enabling a sensitive period during childhood (Werker and Hensch, 2015). The excitation–inhibition balance is also crucial for the emergence of neural oscillations (Buzsáki, 2006; Buzsáki and Watson, 2022; Poil et al., 2012), which arise from alternating periods of excitation and inhibition. Slow electrophysiological activity in the delta and theta range is already present in the auditory language areas prenatally (Arichi et al., 2017; Chipaux et al., 2013; Moghimi et al., 2020; Routier et al., 2017; Vecchierini et al., 2007). In contrast, faster oscillatory rhythms (i.e., gamma-range activity), which require rapid interaction between excitatory neurons and inhibitory interneurons, only gradually emerge towards the second half of the first year (Le Van Quyen et al., 2006; Pivik et al., 2019), potentially because the migration of inhibitory interneurons continues until after birth (Xu et al., 2011). It has recently been proposed that this trajectory of electrophysiological development from slow to fast affects infants’ processing of temporal information in speech (Menn et al., 2023a; see Figure 37.2). The developing brain might initially be well suited for picking up especially the low-frequency rhythmic regularities in the environment (such as prosodic stress and syllable rhythms) but struggle with information at shorter timescales, such as individual phonemes. Indeed, it has been shown that infants initially struggle to segment individual speech sounds from fluent speech. Bijeljac-Babic et al. (1993) tested newborns’ ability to discriminate short speech sequences: while newborns discriminated bisyllabic from trisyllabic sequences, they showed no evidence of discriminating bisyllabic utterances that differ only in the number of phonemes within a syllable. This indicates that newborns’ speech processing initially focuses on larger units of speech, as they are not yet able to process the fast pace of the phoneme rhythm. Young infants’ inability to process phoneme-rate information in fluent speech may seem at odds with countless studies demonstrating their remarkable ability to discriminate between unfamiliar phonemes (Kuhl, 2007; see Werker et al., 2012, for a comprehensive review).
However, these studies typically present the to-be-distinguished phonemes individually, with long inter-stimulus intervals, which may suit infants’ long temporal integration windows. In natural acquisition, infants learn phonemes from fluent speech, and they show the first signs of native phoneme acquisition around 6–12 months of age, coinciding with the emergence of high-frequency electrophysiological activity. This activity would potentially allow them to segment phonemes from fluent speech. It is therefore likely that the emergence of high-frequency electrophysiological activity constrains phonological acquisition towards the second half of the first year. Studies investigating young infants’ phoneme recognition in fluent speech are currently scarce (but see Menn et al., 2023b).
Taken together, there is strong evidence for a maturation of electrophysiological processing speed across infancy and early childhood, as indexed by both the acceleration of the iAPF and the emergence of high-frequency activity. This maturation in electrophysiological processing abilities may provide infants with novel possibilities for processing speech as they age (see Elman, 1996, for a similar idea on chronotopic constraints). We hypothesise that developmental constraints on speech processing may guide infants’ attention to specific parts of the speech signal, namely those at timescales the infant is equipped to process given its electrophysiological capabilities. This will be reflected in their neural tracking. As a result, different input rhythms are important at different points in development: initially slow prosodic rhythms, and later also the faster phonological rhythms (see also Menn et al., 2023a).
37.6 Implications for Language Acquisition Research
Children are active learners, selectively attending to important information. We argue that this is reflected in their neural tracking of speech, with neural tracking reflecting their temporal attention. Neural tracking maximises the uptake of relevant information from the noisy multimodal environment while being constrained by the maturation of the underlying oscillatory circuits. While it has been shown that infants track speech from an early age, it is currently still unclear which factors affect infants’ neural tracking. More research is needed to establish the modulation of infants’ neural tracking by neural maturation, as well as by cross-modal influences, linguistic knowledge, and attention. We expect neural tracking to change with development (due to both maturational constraints and developing linguistic knowledge), but also with, for instance, task demands and motivational or attentional state.
Our proposal has several consequences for language acquisition research. It is important to take individual differences in electrophysiological profile into account: maturational differences, stable individual differences (neurodiversity; see Section 37.6.1 below), and changes in attentional or motivational state. First of all, we hypothesise changes in tracking based on neural development. Electrophysiological maturation constrains infants’ possibilities for speech processing, initially only allowing them to focus on the slow prosodic and syllabic rhythms of speech. In line with electrophysiological acceleration, we hypothesise that neural tracking will transition to faster speech rhythms across the first year of life, as electrophysiological maturation allows infants to process speech information at these faster timescales. We hope that greater awareness of, and knowledge about, maturational constraints in language acquisition will further our understanding of speech processing, including interpretations of phonemic processing in continuous speech and of how larger units and chunks might be most effectively processed (Menn et al., 2023a, 2023b).
Secondly, neural tracking will be influenced by linguistic and cognitive development. As linguistic knowledge is built up, different rhythms will become more important for infants’ speech processing, affecting neural tracking. Furthermore, linguistic knowledge will serve as a top-down influence on tracking (Choi et al., 2020). In addition, cognitive processes such as working memory and executive functioning are still developing and may affect neural tracking. When assessing developmental differences, it is important to distinguish general brain maturation effects from effects due to the development of cognitive processing and representations (with brain maturation and cognitive development, of course, also influencing each other).
Thirdly, besides the developmental effects due to electrophysiological maturation and cognitive development, we expect that infants’ focus on different timescales in speech is also affected by task demands and infant state. Above, we already discussed possible influences of multimodal input, but other characteristics of the input might also affect neural tracking. For instance, while we assume that infants initially employ information in the slow prosodic stress rhythm for higher-level linguistic abstraction, there may be situations in which this information is not informative. This could, for instance, be the case in studies with artificially rhythmic stimulus materials. We would then expect infants to shift their attention to rates providing more informative cues, which would be reflected in an increase of neural tracking at the attended rate and a decrease of tracking at the normally expected prosodic rate. Also in natural speech, different language and stimulus characteristics can determine whether it is important for the child to track specific rhythms, for example depending on whether the stressed-syllable rhythm gives cues for segmenting words from continuous speech in the particular language or stimulus set. Cross-linguistic differences in rhythmic cues and their informativeness for linguistic inference may also lead to different results in studies investigating different languages. Additionally, neural tracking may shift depending on the infant’s current state. It may be easier to process bottom-up cues provided by acoustic amplitude modulations than to exert top-down influences on tracking, which may require more effort (Song and Iverson, 2018). Infants may therefore resort to bottom-up tracking at the rate of strong acoustic modulations in cases of low ‘motivation’ (e.g., if the infant is tired).
Thus, it is important to take stimulus characteristics into account. Researchers often use generic frequency bands to establish neural tracking, whereas it is crucial to report and use the stimulus-specific frequency characteristics of the speech input (see Keitel et al., 2018). Only then can we discover how stimulus characteristics interact with underlying oscillatory possibilities in establishing neural tracking, and the mechanisms through which neural tracking might be related to successful language acquisition. Besides stimulus-specific speech regularities, stimulus variability also needs to be taken into account. We expect different properties of neural tracking when stimuli are repeated over and over again (such as in Ortiz Barajas et al., 2021), compared to when there is more variability (in both content and prosody) in natural speech.
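Operationally, this means deriving the analysis band from the stimulus itself rather than from canonical adult bands. The short sketch below, a hypothetical helper rather than a published procedure, centres the band on the stimulus's own dominant modulation rate (as obtained from a modulation spectrum like the one computed earlier); the search range and half-width are arbitrary choices.

```python
import numpy as np

def stimulus_band(freqs, pxx, search=(1.0, 8.0), halfwidth=0.5):
    """Centre the analysis band on the stimulus's dominant modulation rate
    instead of a canonical adult band. freqs/pxx come from the envelope
    modulation spectrum; halfwidth (Hz) is an arbitrary design choice."""
    sel = (freqs >= search[0]) & (freqs <= search[1])
    peak = freqs[sel][np.argmax(pxx[sel])]
    return peak - halfwidth, peak + halfwidth

freqs = np.linspace(0.0, 10.0, 201)
pxx = np.exp(-((freqs - 2.8) ** 2))     # toy spectrum peaking at 2.8 Hz
low, high = stimulus_band(freqs, pxx)
# For an IDS recording peaking at 2.8 Hz this yields 2.3-3.3 Hz:
# inside canonical delta, but derived from the stimulus itself.
print("Analysis band: %.1f-%.1f Hz" % (low, high))
```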
37.6.1 Implications for Atypical Language Acquisition: Autism
Besides maturational differences, individual differences in neural make-up (‘neurodiversity’) are also expected to be important for neural tracking and its relation to language acquisition. Neurodevelopmental conditions often give rise to variation in attentional selection capabilities, which can enable language growth or result in language delay (D’souza et al., 2017; Grice et al., 2023). We argue that differences in neural constraints will result in different possibilities for neural tracking, which may in turn reflect how attentional processes influence language acquisition, possibly through differences in the mechanistic constraints that are in place. Here, we will work out how neural constraints and their maturation might relate to variability in language development in autism (see Chapter 47 for an overview of behavioural entrainment in autism).
One current hypothesis about the biological mechanisms of autism states that the balance of neural excitation and inhibition (E/I balance) is altered in autistic individuals (Bruining et al., 2020; Dickinson et al., 2016; Rubenstein and Merzenich, 2003; Snijders et al., 2013) and that this E/I imbalance may lead to differences in neural oscillations. Indeed, autistic children show differential development of EEG oscillations and non-oscillatory electrophysiological activity (Tierney et al., 2012), and these electrophysiological differences relate to language development (Romeo et al., 2021; Shuffrey et al., 2022; Wilkinson et al., 2020). A recent study has shown that the development of E/I imbalances across childhood and adolescence is associated with individual differences in listening comprehension in both autistic and non-autistic children (Plueckebaum et al., 2023). Additionally, there are some indications that the maturation of the iAPF is atypical in autistic children (Edgar et al., 2019; Green et al., 2022; but see Carter Leno et al., 2021; Lefebvre et al., 2018). Individual differences in iAPF development in autistic individuals have been related to atypicalities in temporal audiovisual integration. In particular, there are indications that autistic individuals may show a widened temporal-binding window for integration and consequently decreased sensitivity to asynchrony in audiovisual speech (Zhou et al., 2022). This suggests that autistic individuals employ visual information to a much lesser degree for generating auditory predictions. Indeed, Ronconi et al. (2023) showed that audiovisual integration in autistic children is primarily driven by auditory processing, which phase-resets visual activity. Whether reduced reliance on visual information in autism impacts neural tracking of speech is an open issue.
In a recent study assessing infants with a family history of autism, we did not identify differences in the neural tracking of sung audiovisual nursery rhymes relative to infants without an autism family history (Menn et al., 2022b), although the relation between stronger stressed-syllable tracking at 10 months and a larger later vocabulary was stronger for infants with an autism family history. In contrast, in a small sample of adults, reduced neural tracking of auditory-only speech has been identified in autistic versus non-autistic individuals (Jochaut et al., 2015). The differences between these studies might reflect developmental differences, or might be due to the different stimuli (song versus speech, audio versus audiovisual). In future work, it is important to establish how the neural tracking of speech relates to neural development, and how that might result in variability in language acquisition in other E/I-atypical populations as well. Research investigating the development of the E/I balance in infancy is only just emerging, but recent studies have reported early imbalances also for infants with genetic risks for ADHD (Begum-Ali et al., 2022; Carter Leno et al., 2021), and their relationship to language development poses an exciting avenue for future research.
37.7 Conclusion
In this chapter we reviewed current evidence on infants’ neural tracking of speech. Neural oscillations in the infant brain synchronise with the rhythm of speech, tracking it at different frequencies, and this tracking predicts word segmentation and later language abilities. We presented the hypothesis that rhythmic neural speech tracking reflects infants’ attention to specific parts of the speech signal (e.g., stressed syllables), and simultaneously acts as a core mechanism for maximising temporal attention onto those parts. Constraints on speech tracking might be influenced by neural maturation, and we set out how this might be reflected in both typical and atypical language development.
Summary
This chapter reviews research on infants’ neural tracking of speech, and how this process predicts later language abilities. We hypothesise that neural speech tracking reflects infants’ temporal attention to specific parts of the speech signal. Neural maturation in typical and atypical development might influence constraints on neural tracking.
Implications
Future research on neural tracking of speech should take maturational constraints into account, as well as individual differences therein. Temporal stimulus characteristics should always be described specifically, so that we can understand the interaction between environmental input, brain and language development, and infant state.
Gains
Understanding underlying neural mechanisms in the developing brain and their interaction with the environment is crucial for understanding individual differences in speech perception and language development.


