Phonological Variation in Child-Directed Speech is Modulated by Lexical Frequency

Eon-Suk KO; Jongho JUN

doi:10.1017/S0305000923000466

Published online by Cambridge University Press: 22 September 2023

Eon-Suk KO*: Department of English Language and Literature, Chosun University, Gwangju, Korea
Jongho JUN*: Department of Linguistics, Seoul National University, Seoul, Korea
* Corresponding authors: Eon-Suk Ko and Jongho Jun; Emails: eonsukko@chosun.ac.kr; jongho@snu.ac.kr.

Abstract

We investigate whether child-directed speech (CDS) contains a higher proportion of canonical pronunciations compared to adult-directed speech (ADS), focusing on Korean noun stem-final obstruent variation. In a word-teaching task, we observed that mothers use a higher rate of canonical pronunciation when addressing infants than when addressing adults. In a follow-up experiment, adults exhibited a higher rate of canonical pronunciation for high- than low-frequency words. Additional analyses conducted with only the high-frequency monosyllabic words from the two experiments found no evidence for simplified phonology in CDS when lexical frequency was controlled for. Our findings suggest that the higher rate of canonical forms in CDS, with respect to Korean morphophonological rules, is mediated by the frequency of word usage. Thus, the didactic function of CDS phonology appears to be a byproduct of mothers using familiar words with children. These results highlight the importance of considering word usage in investigating the nature of CDS.

Type: Article
Information: Journal of Child Language, Volume 51, Issue 2, March 2024, pp. 288-313

DOI: https://doi.org/10.1017/S0305000923000466
Copyright: © The Author(s), 2023. Published by Cambridge University Press

Introduction

How does a child internalize the grammar of adult language, and what role does language input play in this process? Although children’s early production grammar differs from adult phonology (Do, 2013; Hayes, 2004), their production patterns eventually conform to adult grammar. How does a child learn the different realizations of the same morpheme in various phonological contexts? To answer this question, we attend to the role of speech addressed to infants, or child-directed speech (CDS), in children’s acquisition of phonological alternation. Specifically, we investigate patterns of phonological variation in Korean noun stem-final obstruents /t^h/, /c^h/, and /p^h/ before a vowel-initial suffix. Our analysis of production data from two experiments suggests that phonological variation is reduced in CDS, which might facilitate children’s learning of morphemes and phonological rules. Crucially, however, we argue that this didactic function of CDS is a serendipitous outcome mediated by lexical frequency.

Our unconscious phonological knowledge largely consists of three components: the system for contrasts (e.g., [l]ake vs. [r]ake), the set of legal structures (e.g., [blɪk] vs. *[bnɪk]), and patterns of alternation, i.e., varying realization of a single morpheme in different phonological contexts (e.g., can[z] ~ cap[s]) (Hayes, 2004). Previous research on the role of CDS in children’s learning of phonology has often focused on the acquisition of phonological contrasts. The approaches were mostly phonetic, at the sub-phonemic level, acoustically investigating phenomena such as vowel formants (e.g., Bernstein Ratner, 1982, 1984a; Hartman et al., 2017; Kuhl et al., 1997; Liu et al., 2007), voice-onset time [VOT] (Baran et al., 1977; McMurray et al., 2013; Moslin, 1979; Sundberg & Lacerda, 1999), and allophonic variations of a phoneme such as /t/ (Dilley et al., 2019; Fritsche et al., 2021). The current study, however, focuses on infants’ acquisition of phonological alternation, an issue which is at a segmental or phonological level. Our research specifically centers on stem-final obstruent variation in Korean, which refers to the variation of obstruents at the end of noun stems. We investigate whether this variability is decreased in CDS, which has the potential to benefit children’s acquisition of phonological grammar and vocabulary development. A reduction in variability, if found in our investigation, could indicate the reduced application of phonological rules by mothers. But it could also be a secondary effect of other factors, such as the greater use of high frequency word types in CDS compared to ADS (Jones et al., 2023).

The impact of the frequency of words used in CDS on children’s learning of phonological alternations is an area of investigation that has largely been overlooked, although there is research relating lexical frequency to children’s production patterns (e.g., Zamuner, 2004) and language outcome (e.g., Cychosz et al., 2021). Research on infants’ acquisition of phonological categories often takes an input-output perspective, testing for evidence of phonetic enhancements (e.g., Fritsche et al., 2021; McMurray et al., 2013) or simplified phonology (e.g., Bernstein Ratner, 1984b; Buckler et al., 2018; Dilley et al., 2014) in the input that might facilitate children’s acquisition of phonology. However, studies have rarely investigated the role of input frequency in infants’ learning of morphophonological rules, which is the focus of our research. Our theoretical approach in phonology incorporates insights from the usage-based or frequency effects approach (Bybee, 2001; Coetzee, 2002; B. S. Phillips, 2001) in the design of the experiment and analyses.

The goal of the present study is to test the tutorial function hypothesis of CDS by analyzing perceptually distinct morphophonological alternations in Korean noun stem-final variation. As described in section 1.2, the phenomenon involves an optional morphophonemic process that is phonetically unnatural as in [k’oc^h-i] ~ [k’os-i] for the underlying /k’oc^h-i/ ‘flower-Nom’. This is in contrast to the optional coarticulatory assimilation process investigated in earlier research, such as the alternation in ca[t] box ~ ca[p] box (Buckler et al., 2018; Dilley et al., 2014). The non-assimilatory alternation in the final obstruents of Korean noun stems provides an excellent opportunity to test the tutorial function hypothesis of CDS, as the canonical realizations are relatively distinct in perception from their non-canonical counterparts. The distinctive nature of the Korean morpho-phonological alternation will make it easier for infants to identify the underlying representation of the morpheme when the rule is not applied, compared to phonetically natural rules like English regressive or anticipatory assimilation (e.g., cat box [kæp bɑks]). Hence, if the tutorial function hypothesis of CDS (i.e., CDS provides enhanced evidence for canonical forms) is true, we expect to find that the canonical variant of Korean noun stem-final obstruents occurs with higher frequency in Korean mothers’ CDS than in ADS.

The rest of the introduction provides background information in developmental psychology and theoretical phonology relevant to our study. We first discuss the role of CDS in language acquisition, and outline the phonology of Korean noun stem-final obstruent variation. We additionally summarize the notion of lexical frequency effects as proposed in the framework of usage-based phonology (Bybee, 2001; B. S. Phillips, 1984).

Debates on the tutorial vs affective function of child-directed speech

When adults, or even children, talk to a young child, they adapt their speech and use a special register called CDS. It is characterized as being higher and more variable in pitch (e.g., Fernald & Simon, 1984; Fernald & Kuhl, 1987; Katz et al., 1996), clearer in pronunciation (e.g., Bernstein Ratner, 1984a, 1984b; Burnham et al., 2002; Kuhl et al., 1997; Liu et al., 2007) and simpler in syntax (e.g., J. R. Phillips, 1973 – cf. Newport et al., 1977) than adult-directed speech (ADS). A characteristic of CDS that has received less attention is the simplicity of the vocabulary utilized by caregivers. Studies looking into the lexical composition of CDS report a greater number of high frequency word types in CDS than ADS (Goodman et al., 2008; Jones et al., 2023), and an increase in the proportion of rare word types with child age (Rowe, 2012). Additionally, a high proportion of basic-level words (Anglin, 1977) in young children could also be indicative of the greater use of high frequency words in CDS. The summary characteristics of CDS discussed here primarily draw from research conducted on Western populations. However, it is important to note that ethnographic studies on less investigated languages often report distinct characteristics and functions of CDS (e.g., Casillas et al., 2020, and references therein). The extent to which CDS might directly facilitate language acquisition has been a topic of controversy. Some suggest that CDS serves as a linguistic model by providing the child listener with language input efficient for learning, which we will refer to as the tutorial function hypothesis for CDS. For example, acoustic studies of CDS suggest that there is less overlap in the acoustic cues to different phonological categories than in ADS, which might facilitate infants’ learning of phonological contrasts (Burnham et al., 2002; Fernald, 2000; Kuhl et al., 1997; Liu et al., 2007; Werker et al., 2007). However, others are skeptical about the didactic role of CDS, with emphasis instead being placed on its function to modulate attention and communicate emotion (Baran et al., 1977; Buckler et al., 2018; Cristia & Seidl, 2014; Singh et al., 2002), which we will refer to as the affective function hypothesis for CDS.

As mentioned earlier, previous research on the function of maternal speech has mostly focused on testing hypotheses about enhanced phonetic cues for phonological contrasts. Our focus, however, is on the phonological level. The two differing views on the role of CDS in language acquisition lead to different predictions about the degree of phonological rule application in CDS. Under the tutorial function hypothesis of CDS, mothers might reduce the phonological variation in CDS, providing children with enhanced evidence for underlying or canonical morphemic forms. Alternatively, under the affective function hypothesis, mothers would use as many variants in CDS as in ADS, since mothers adapt their speech mainly to modulate the arousal or attention level of their infants and the enhanced evidence for canonical forms would be irrelevant to this function. The child would then have to learn lexical forms and the phonological rules behind the variation simultaneously instead of being introduced to the underlying forms first, followed by relevant phonological rules.

Previous research examining the phonological and phonetic implementation of consonants in CDS reported mixed results about reduced phonological variability in CDS. In one of the earliest systematic investigations of phonological rule application in CDS, Bernstein Ratner (1984b) found that phonological processes such as dental deletion (want it → [wɒn it]), ð deletion (throw them → [θroʊ ɛm]), and the ts/s alternation (that’s nice → [ðæs naɪs]) applied much less frequently in CDS than ADS, though the opposite pattern was found for palatalization (did you → [dɪdʒu]). More recently, Dilley et al. (2014) found a modest tendency for CDS to have a greater rate of canonical pronunciations in assimilatory contexts (e.g., gree[n] boats ~ gree[m] boats) than ADS. The data, however, were based only on four types of word pairs embedded in a story and read in the laboratory. In addition, at the phonetic level, Fritsche et al. (2021) reported that CDS contains a significantly higher proportion of canonical pronunciation for /t/ than ADS, which they interpret as a clear and enhanced signal of the phonemic category. Their study, however, had only 8 participants covering various developmental stages. These results are compatible with the tutorial function hypothesis, though each study has limitations for generalizability.

There are also reports contradicting the predictions made by the tutorial function hypothesis of CDS. Shockey and Bond (1980) conducted a study on the same phonological rules analyzed in Bernstein Ratner (1984b) and observed a higher frequency of rule application in CDS compared to ADS. The two studies, however, are hard to compare because they are based on different age groups and dialects of English. More recently, Buckler et al. (2018) tested the tutorial function hypothesis by investigating the place assimilation (e.g., gree[n] book ~ gree[m] book) phenomenon. They found that CDS contains as many assimilated, thus non-canonical, word forms in place assimilation contexts as ADS. Their findings suggest that mothers do not reduce the variability in their speech to their children, and that children learn canonical lexical forms and processes inducing phonological variation simultaneously.

It is interesting that both Dilley et al. (2014) and Buckler et al. (2018) investigated regressive assimilation phenomena but yielded conflicting results. The discrepancy could be attributed to their different research methods, such as the type of data elicited and the coding used. Putting aside the details of their methods, however, it is worth noting that English place assimilation in connected speech is generally considered a phonetically natural process, motivated by the articulatory mechanism to adjust the place of articulation across adjacent segments for more efficient articulation. In place assimilation contexts, such as the consonant cluster /nb/, the unaltered canonical realization ([nb]) is perceptually very similar to its altered non-canonical counterpart ([mb]) (Browman & Goldstein, 1989; Ohala, 1990; and others – cf. Jun, 1996). As it would be very difficult for a child (or an adult) to perceive unassimilated canonical realizations of words in place assimilation contexts, caregivers may not attempt to provide enhanced evidence for the canonical forms, as their adaptation of speech may not have a significant impact on infants’ learning due to the inherent perceptual challenge.

Stem-final obstruent variation in Korean

Standard Korean has a three-way laryngeal contrast between lenis, aspirated and tense (or glottalized) obstruents, as shown below.

(1) Three-way laryngeal contrasts among Korean obstruents

Korean noun stems ending with any of these obstruents exhibit alternations within the inflectional paradigm, as shown in (2). In isolation forms, they may only end in (unreleased) lenis stops due to coda neutralization, in which all coronal obstruents neutralize to [t] (e.g., /pat^h/ > [pat] in (2)), all labial stops neutralize to [p] and all velar stops neutralize to [k]. Before high front vocoids /i, j/, stem-final coronal stops such as /t/ and /t^h/ are realized as palato-alveolar affricates such as [c] and [c^h], respectively, due to coronal palatalization (e.g., /pat^h-i/ > [pac^hi] in (2)). Application of these rules yields a paradigm of alternations for noun-stems that end in obstruents. For instance, the final consonant of the noun stem /pat^h/ ‘field’ appears as [t^h] before an accusative case marker –ɨl or a locative case marker -e, [t] in the isolation form, and [c^h] before a nominative case marker –i (see the surface form in (2)). In a standard rule-based analysis of these stem-final alternations, the output form before vowel-initial suffixes (except for [i]-initial ones) is posited as the underlying form (which is reflected in standard Korean orthography). The aforementioned phonological rules apply to the underlying form, resulting in the production of unreleased stops and palatalized consonants in the isolation and nominative forms, respectively. This standard analysis is illustrated by the derivation of some allomorphic forms of the stem /pat^h/ ‘field’ in (2). Readers can observe [t^h]~[t] and [t^h]~ [c^h] alternations by comparing the unaltered stem forms in the accusative and locative cases with the corresponding isolation and nominative forms, respectively.

(2) Standard rule-based analysis of the alternations
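
The rule-based derivation described above can be sketched in a few lines of Python. This is only an illustrative toy under stated assumptions: segments are written as ASCII strings (with ' standing in for the tense diacritic), the segment sets are limited to obstruents mentioned in the text rather than the full inventory, and the function names `neutralize_coda` and `derive` are hypothetical.

```python
# Toy derivation of surface forms from an underlying noun stem.
# Assumption: the segment sets below list only obstruents named in the text.
CORONAL_OBSTRUENTS = {"t", "t^h", "c", "c^h", "s"}
LABIAL_STOPS = {"p", "p^h"}
VELAR_STOPS = {"k", "k^h", "k'"}

# Coronal palatalization before high front vocoids /i, j/.
PALATALIZATION = {"t": "c", "t^h": "c^h"}


def neutralize_coda(final):
    """Coda neutralization: isolation forms end in a lenis stop."""
    if final in CORONAL_OBSTRUENTS:
        return "t"
    if final in LABIAL_STOPS:
        return "p"
    if final in VELAR_STOPS:
        return "k"
    return final


def derive(stem_body, final, suffix=""):
    """Return the rule-derived surface form of stem_body + final (+ suffix)."""
    if not suffix:
        # No suffix: the stem-final consonant is in coda position.
        return stem_body + neutralize_coda(final)
    if suffix[0] in ("i", "j"):
        # Palatalization context: alveolar stops become affricates.
        final = PALATALIZATION.get(final, final)
    # Before other vowel-initial suffixes the underlying consonant surfaces.
    return stem_body + final + suffix


# /pat^h/ 'field' in isolation, accusative, locative, and nominative.
for suffix in ("", "ɨl", "e", "i"):
    print(suffix or "(isolation)", derive("pa", "t^h", suffix))
```

The sketch covers only the standard (canonical) derivation; the innovative variants discussed next are outside its scope.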

Korean is currently undergoing an extensive historical change, which involves the emergence of innovative forms through analogy to frequently occurring forms. Pre-vocalic allomorphs of the noun stems, which used to vary between [t^h] and [c^h] as in [pat^h-ɨl], [pat^h-e] and [pac^h-i] in (2), are now observed to exhibit innovative variants which vary according to the place of articulation of the obstruents (Jun, 2010, and references therein). Stem-final coronal obstruents take on different forms including [s, c^h, t^h, c, t]. For example, /toc^h-e/ ‘sail-locative’ exhibits the alternation among [tose] ~ [toc^he] ~ [tot^he] ~ [toce] ~ [tote]. [s] is, in general, the most frequent or preferred variant, while [c] and [t] are the least frequent or preferred. [c^h] and [t^h] fall in between. In contrast, non-coronal aspirated and tense stops, /p^h, k^h, k’/, alternate with their homorganic lenis counterparts, [p, k]. For example, /ip^h-e/ ‘leaf-locative’ exhibits [ip^he] ~ [ipe], and /puək^h-e/ ‘kitchen-locative’ shows [puək^he] ~ [puəke].

Accordingly, Korean noun stems followed by vowel-initial suffixes can be realized with a variety of different output forms, including the canonical forms (underlying and palatalized forms) as well as innovative variants (see (3)). Note that the phonological constraint inducing coronal palatalization is dominant in Korean in the sense that alveolar stops such as [t^h] are never allowed to occur in palatalization contexts (e.g., before a nominative case marker /-i/). Consequently, not only the unaltered underlying forms, but also the palatalized forms may be considered as canonical realizations of Korean nouns in the sense that these are the historically correct normative forms.¹ On the other hand, those which are neither underlying nor palatalized forms can be classified as non-canonical realizations of Korean nouns. The table in (3) shows examples of canonical and non-canonical forms of some Korean noun-stem final consonants.

(3) Canonical and non-canonical realizations of Korean noun-stem final consonants (infrequent or non-existent alternations are given in parentheses; and gray indicates “not applicable.”)

It is worth noting that the final obstruents of Korean nouns, as well as their innovative variants, encompass sibilants such as [s, c^h] and aspirated stops such as [p^h, k^h, t^h]. Sibilants are perceptually prominent due to their loud and high-pitched noise (Johnson, 2006), and aspirated consonants are similarly salient. Therefore, in general, canonical and non-canonical pronunciations of Korean nouns are perceptually quite distinct from each other.

To sum up, a Korean noun such as /pat^h-e/ can be pronounced as the canonical [pat^he] or innovative [pase], which are currently in free variation due to the ongoing historical change. If infants are more frequently exposed to the innovative [pase], it will be a challenge for them to learn the underlying form of the noun stem /pat^h/. If CDS serves a didactic function, therefore, we would expect a higher proportion of canonical variants such as [pat^he] in CDS than ADS.
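
The canonical versus non-canonical distinction drawn in this section can be made concrete with a small sketch. It assumes that each token is coded as a triple of underlying stem-final consonant, surface consonant, and suffix, and it treats a realization as canonical when the surface consonant is either the unaltered underlying form or, before /i, j/, its palatalized counterpart. The names `is_canonical`, `canonical_rate`, and `example_tokens` are hypothetical, and the example tokens are invented for illustration rather than taken from the data.

```python
# Palatalized counterparts of alveolar stops (before /i, j/ only).
PALATALIZED = {"t": "c", "t^h": "c^h"}


def is_canonical(underlying, surface, suffix):
    """True if the surface consonant is the underlying or palatalized form."""
    if surface == underlying:
        return True
    in_palatalizing_context = suffix[:1] in ("i", "j")
    return in_palatalizing_context and surface == PALATALIZED.get(underlying)


def canonical_rate(tokens):
    """Proportion of canonical realizations among coded tokens."""
    if not tokens:
        raise ValueError("no tokens to score")
    hits = sum(is_canonical(u, s, suffix) for u, s, suffix in tokens)
    return hits / len(tokens)


# Invented tokens: (underlying consonant, surface consonant, suffix).
example_tokens = [
    ("t^h", "t^h", "e"),  # [pat^he]: unaltered, canonical
    ("t^h", "s", "e"),    # [pase]: innovative, non-canonical
    ("t^h", "c^h", "i"),  # [pac^hi]: palatalized, canonical
    ("c^h", "s", "i"),    # [k'osi]: innovative, non-canonical
]
print(canonical_rate(example_tokens))
```

Under this coding, [c^h] for underlying /t^h/ counts as canonical only in a palatalization context; before the locative -e it would be scored as an innovative variant.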

Effects of lexical frequency in phonology

For the past three decades or so, there has been increasing attention in phonology to variable phonological phenomena (Anttila, 1997; Bybee, 2001; Coetzee, 2002; Gahl, 2008). One of the best-known phenomena affected by the frequency of word-usage is the optional t/d-deletion in English (e.g., west [wɛst] ~ [wɛs]). The deletion of word final t/d is more likely to occur in words with higher lexical frequency (e.g., just) than words with lower usage frequency (e.g., bust) (Bybee, 2002; Patrick, 1991). An additional factor that influences the probability of the t/d-deletion is the morphological status of t/d. As pointed out in Guy (1991), deletion is more likely to apply when the target segment is part of a monomorpheme (e.g., mist), less likely when it is part of the irregular past tense morpheme (e.g., kept), and least likely when it is the regular past tense suffix (e.g., missed).

Considering such an observation, the usage-based model of phonology (Bybee, 2001; B. S. Phillips, 1984, 2001) proposes two major effects of frequency: (1) high frequency words will change at a faster rate than low frequency items if the change is the result of a phonetic process (e.g., /t/ more likely to drop in the highly frequent just than bust), and (2) high frequency words are more resistant to change, if the change is a grammatical or an analogical change based on the analysis of other forms (e.g., highly frequent English irregular verbs such as made or sang resistant to regularization). These somewhat contradictory effects of frequency can be explained in that the first type of effect is on articulatorily-motivated changes where any frequently repeated motor activities become more efficient (Bybee, 2001) and that frequently used words are more predictable, so speakers can afford to be less clear (Bell et al., 2009). The latter type, on the other hand, involves analogical change where high-frequency words with stronger mental representation resist changes, whereas low-frequency words are more vulnerable to the pressure to change (Kapatsinski, 2021).

The phenomenon of our focus, Korean stem-final obstruent variation, is a morphophonological rule involving an analysis of grammatical morphemes. The usage-based approach predicts that there will be resistance to change, and thus a higher likelihood of canonical forms, in high frequency words. Since CDS tends to use a greater number of high-frequency words than ADS (Jones et al., 2023; also see the section Additional analysis: Teasing apart register and frequency effects), the usage-based model of phonology predicts a higher proportion of canonical forms in CDS.

Overview of the current research

Our main goal is to test the tutorial function hypothesis of CDS based on Korean stem-final obstruent variation, a phonetically unnatural morphophonological rule. We aim to investigate whether CDS adapts the application of phonology to help facilitate children’s discovery of the underlying forms. To address the question, we conducted two experiments employing Korean noun stems ending in /t^h/, /c^h/, and /p^h/. In Experiment 1, we focused on the effect of register and compared the proportion of canonical forms in CDS and ADS in a word teaching task. The results showed a significantly higher rate of canonical pronunciation in CDS than ADS, suggesting that morphophonemic alternations in CDS provide a greater opportunity for children to discover the underlying representation of a morpheme. Due to the nature of the task, however, the target words used in CDS were easy words, whereas those used in ADS were rare items. To tease apart the confounded effects of register and lexical frequency, we conducted Experiment 2, in which we compared the pronunciation of high- and low-frequency words in ADS and found a significantly higher rate of canonical pronunciation in high-frequency words. Additional analyses, comparing items in Experiments 1 and 2 after controlling for word frequency and length, did not find a significantly higher rate of canonical pronunciations in CDS. Putting these results together, we propose a mediation model, in which the effect of register on phonological variation found in Experiment 1 is completely mediated by the frequency effect found in Experiment 2. The collection of the data in this study was carried out in accordance with the ethical standards of the Institutional Review Board of Chosun University (Approval No. 2-1041055-AB-N-01-2018-51).

Experiment 1: Canonicality of CDS and ADS in word teaching

The purpose of this experiment was to test the hypothesis that CDS is pronounced with less phonological variation than ADS in target contexts for a morphophonological rule. If true, the higher rate of canonical forms in CDS could serve to provide enhanced evidence for infants to discover the underlying representation of morphemes and learn phonological alternations in their language.

Methods

Participants

Twenty-two Korean mothers of 11-month-old infants (M = 0;11.17, SD = 90 days, range = 0;8.21 to 0;17.09, 15 boys & 7 girls) participated in the study with their children. One additional participant’s data were discarded due to a technical failure. Ten dyads participated in the study on-line during the COVID-19 pandemic.

Procedure

For the 12 dyads who participated in the study in person, the experiment was conducted in a quiet greeting room of the child language lab at Chosun University. The participating mothers sat on a sofa with their child on their lap, and taught target words embedded in a custom-made picture book (available at https://osf.io/5crwh/ along with all data in this paper) to their own children and to another adult. The order of the register in the word teaching task was counter-balanced. Elicitation of CDS was done on the sofa while the mother interacted with her child without any intervention by the experimenter. For all ADS teaching sessions, a male research assistant served as the adult addressee while a female research assistant kept the infant occupied with toys. The recording of mothers’ production was made on a small clip-on digital recorder (SONY ICD-TX650) attached to the mothers’ clothing close to the mouth and was saved in a linear PCM format (48 kHz, 16 bit).

For the 10 dyads who participated in the experiment during the pandemic, the experiment was conducted and recorded via Zoom software. Before the scheduled experiment, we sent out a headset microphone (Microsoft LifeChat LX-3000) and two picture books to each family. A female experimenter administered the experiment online with the help of a male research assistant for eliciting the ADS in the same manner as the off-line experiment. Audio was extracted from the video recording² with a sampling rate of 44 kHz and a 16-bit depth.

Stimuli

The two picture books, custom-made for each speech register, contained three target words for each register. The target words for CDS were /sup^h/ ‘woods,’ /k’oc^h/ ‘flower,’ and /pat^h/ ‘field,’ all of which are common words that frequently appear in picture books for Korean children, though unlikely to be firmly stored in the lexicon of the pre-verbal infants. For ADS, words that are not likely to be known by ordinary adults were chosen as the target words to teach (e.g., /sʌp^h/ ‘brushwood,’ /koc^h/ ‘lynx,’ /sat^h/ ‘reed mat’). The decision to use different words for each register was made to maintain the ecological validity of the task. Each target word was embedded in three sentences, in the beginning, middle, and final position of a sentence, respectively. The total number of occurrences for the target items was as follows: /sup^h/ 153, /k’oc^h/ 204, /pat^h/ 194 for CDS, and /sʌp^h/ 105, /koc^h/ 151, /sat^h/ 142 for ADS. Incidental occurrences of non-target items were excluded from the analysis in this section, but are included in the Additional Analyses section.

Participating mothers first read the three sentences containing each target word, then explained the target word to their children or the adult again in their own words. The story book contained nine additional nonce word targets used for another study. For each register, the recording session, containing both the reading and the spontaneous speech, lasted about 10 minutes.

Coding

The recordings were transcribed in the CHAT format of CHILDES (MacWhinney, 2000) by two research assistants based on the criteria in Ko et al. (2020). Utterances containing the target words were extracted using the kwal command of the CHILDES’ CLAN program and sent to Praat (Boersma & Weenink, 2020) for coding phonological alternations. Two research assistants, who were knowledgeable about the phonological phenomenon but naïve about the purpose of this research, coded the data based on aural inspection of the target words. Since the phenomenon was categorical, spectral inspection of sub-segmental properties was not necessary for making judgments. The research assistants identified the orthographic form of the target word, which is also the underlying form in Korean, and transcribed the actual pronunciation. They then annotated the underlying and surface representation of the target consonant in the word. Although the phenomenon was robust and did not involve any particular difficulty in identifying the alternation, we analyzed the agreement between the two coders using a set of samples from Experiment 2 and found high agreement (Cohen’s κ = 0.97). Details of the agreement statistics are reported in the Experiment 2 section.
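
For readers unfamiliar with the agreement statistic, the following is a minimal sketch of how Cohen’s κ is computed for two coders’ categorical labels. The function name `cohens_kappa` and the two label lists are hypothetical; the labels are invented for illustration and are not the coders’ actual annotations.

```python
from collections import Counter


def cohens_kappa(labels_a, labels_b):
    """Cohen's kappa for two equal-length sequences of categorical labels."""
    if len(labels_a) != len(labels_b) or not labels_a:
        raise ValueError("label sequences must be non-empty and equal in length")
    n = len(labels_a)
    # Observed agreement: share of tokens given the same label by both coders.
    observed = sum(a == b for a, b in zip(labels_a, labels_b)) / n
    # Chance agreement: product of the coders' marginal label proportions.
    freq_a = Counter(labels_a)
    freq_b = Counter(labels_b)
    expected = sum(freq_a[label] * freq_b[label] for label in freq_a) / (n * n)
    if expected == 1.0:
        return 1.0  # both coders used a single identical label throughout
    return (observed - expected) / (1 - expected)


# Invented surface-consonant codes from two hypothetical coders.
coder_a = ["s", "t^h", "s", "c^h", "s", "t^h", "s", "s", "c^h", "t^h"]
coder_b = ["s", "t^h", "s", "c^h", "s", "t^h", "s", "s", "c^h", "s"]
print(round(cohens_kappa(coder_a, coder_b), 3))
```

Because κ corrects raw agreement for agreement expected by chance, it is lower than the simple proportion of matching labels whenever the coders disagree at all.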

Results

The mean proportion of canonical forms was higher in CDS (unaltered = 0.72, unaltered + palatalized = 0.79) than in ADS (unaltered = 0.38, unaltered + palatalized = 0.41). A breakdown of these numbers for each coda segment is summarized in Table 1, and shown for each coda segment and participant in Figure 1.

Table 1. Rate of Canonical Form Realization in Different Registers

Note. Numbers in parentheses = number of canonical realizations / total number of realizations. ADS: adult-directed speech; CDS: child-directed speech.

Figure 1. Rate of canonical pronunciation (unaltered + palatalized) for noun-stem final codas in ADS and CDS averaged for each participant and coda consonant.

As shown, the rates of canonical forms are higher in CDS (M = 0.79 or 436/551) than in ADS (M = 0.41 or 164/398), regardless of the type of stem-final coda obstruent. To test this difference statistically, we constructed a mixed effects logistic regression model, described below. The dataset consists of 949 productions from the six items (the number of occurrences is shown in parentheses): /sup^h/ (153), /k’oc^h/ (204), /pat^h/ (194), /sʌp^h/ (105), /koc^h/ (151) and /sat^h/ (142). A mixed effects logistic regression model was fitted to the data using the glmer function from the lmerTest package (Kuznetsova et al., 2017) in R (R Core Team, 2022). The binary dependent variable was either “canonical” (underlying or palatalized) or “not” (reference level). In addition to register (ADS (reference), CDS), our main variable of interest, we added to the model coda (c^h (reference), p^h, t^h), vowel (ɨ (reference), e, i), and style (reading (reference), spontaneous). All fixed effect factors were dummy coded, and the random effect structure of the model included random intercepts for both subjects and items, and by-subject random slopes for register and coda.³ The resulting fixed effects are shown in Table 2.
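The token counts and overall rates reported above can be cross-checked with a few lines of arithmetic. The sketch below uses only figures stated in the text; the variable names are ours.

```python
# Per-item token counts as reported in the text.
cds_counts = {"sup^h": 153, "k'oc^h": 204, "pat^h": 194}
ads_counts = {"sʌp^h": 105, "koc^h": 151, "sat^h": 142}

cds_total = sum(cds_counts.values())   # denominator of the CDS rate
ads_total = sum(ads_counts.values())   # denominator of the ADS rate
dataset_size = cds_total + ads_total   # all productions entering the model

# Canonical (unaltered + palatalized) realizations as reported in the text.
cds_canonical, ads_canonical = 436, 164

cds_rate = cds_canonical / cds_total
ads_rate = ads_canonical / ads_total

print(cds_total, ads_total, dataset_size)
print(round(cds_rate, 2), round(ads_rate, 2))
```

The per-register sums match the denominators 551 and 398, the overall total matches the 949 productions, and the rounded rates match the reported means of 0.79 and 0.41.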

Table 2. Mixed Effects Logistic Regression Testing the Proportion of Codas Being Realized in Their Canonical Output Form in CDS and ADS of Experiment 1

Note. CDS: child-directed speech; ADS: adult-directed speech.

*p < .05, **p < .01, *** p < .001.
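For readers who prefer an explicit formula, the model described in the text can be written out as follows. This is a sketch in our own notation, not the authors' specification: the symbols β, u, and w, the indices i (subject) and j (item), and the indicator notation are assumptions introduced for illustration. With dummy coding, each indicator equals 1 for the named level and 0 for the reference level (ADS, c^h, ɨ, reading).

```latex
\begin{aligned}
\operatorname{logit} P(y_{ij} = \text{canonical})
  ={}& \beta_0
   + \beta_1\,\mathbb{1}[\text{register} = \text{CDS}]
   + \beta_2\,\mathbb{1}[\text{coda} = p^h]
   + \beta_3\,\mathbb{1}[\text{coda} = t^h] \\
  &+ \beta_4\,\mathbb{1}[\text{vowel} = e]
   + \beta_5\,\mathbb{1}[\text{vowel} = i]
   + \beta_6\,\mathbb{1}[\text{style} = \text{spontaneous}] \\
  &+ u_{0i}
   + u_{1i}\,\mathbb{1}[\text{register} = \text{CDS}]
   + u_{2i}\,\mathbb{1}[\text{coda} = p^h]
   + u_{3i}\,\mathbb{1}[\text{coda} = t^h]
   + w_{0j}
\end{aligned}
```

Here u_{0i} and w_{0j} are the random intercepts for subject i and item j, and u_{1i}, u_{2i}, u_{3i} are the by-subject random slopes for register and coda. A positive β_1 corresponds to higher log-odds of a canonical realization in CDS than in ADS, with the other factors held at their reference levels.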

The coefficient of register suggests that the rate of canonical form realization is higher in CDS than in ADS, supporting the tutorial function hypothesis of CDS. In addition, there were main effects of vowel (whereby the rate of canonical form realization is higher before /e/ and /i/ than before /ɨ/) and style (whereby the rate of canonical form realization is lower in spontaneous speech than in read speech). We additionally constructed a model that incorporated the experiment mode (online vs offline) as a fixed factor to investigate potential impacts of different data collection modes on the mothers’ pronunciation. However, we found no significant effect of the experiment mode (p = 0.99). Detailed results of this model can be accessed in the supplementary material available on the OSF repository.

Discussion

The results of our first experiment showed that mothers provide information on the underlying representation of coda consonants significantly more often in CDS. Our findings, therefore, seem to support the tutorial function hypothesis of CDS.

However, the study design may contain a confound arising from our focus on maintaining the ecological validity of the task. The words that mothers taught to their infants were common, high-frequency words, while the words they taught to adults were rare, low-frequency words. This difference is reflected in the token frequency of these words in the Sejong Corpus, one of the most representative corpora of Korean, as shown in (4).

(4) The usage frequency in the Sejong Corpus (B.-M. Kang & Kim, 2009) of the target noun stems used in the current word-teaching task

It is, therefore, possible that the register effect is confounded with the frequency of word usage. To investigate the effect of lexical frequency on the canonicality of pronunciation, we conducted Experiment 2, in which we compared high- and low-frequency words elicited from adult participants without involving infants.

Experiment 2: Canonicality in low- and high-frequency words in ADS

The goal of this experiment was to investigate the effects of frequency on the realization of noun stem-final consonants. Specifically, we test the hypothesis that high-frequency words are resistant to change and thus show a greater proportion of canonical over innovative forms in production than low-frequency words. The participants for this study were adults only, with whom we conducted a series of experiments designed to elucidate the effect of lexical frequency on phonological variation. In this paper, the notion of frequency refers to stem token frequency, including the forms with all types of suffixation.
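To make the notion of stem token frequency concrete, the following minimal sketch counts every token of a stem regardless of the suffix attached to it. The token list, the suffix strings, and the function name `stem_token_frequency` are hypothetical and serve only as an illustration; they are not drawn from the Sejong Corpus, and the sketch assumes the tokens have already been segmented into stem and suffix.

```python
from collections import Counter

# Hypothetical, already-segmented tokens as (stem, suffix) pairs.
# These are illustrative only and NOT corpus data.
tokens = [
    ("k'oc^h", "i"),
    ("k'oc^h", "ɨl"),
    ("k'oc^h", "e"),
    ("pat^h", "e"),
    ("pat^h", "i"),
    ("koc^h", "ɨl"),
]


def stem_token_frequency(segmented_tokens):
    """Count tokens per stem, pooling over all suffixed forms of that stem."""
    return Counter(stem for stem, _suffix in segmented_tokens)


freq = stem_token_frequency(tokens)
for stem, count in freq.most_common():
    print(stem, count)
```

Pooling over suffixes means that a stem's frequency reflects how often the stem itself is used, not how often any one inflected form occurs.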