The ubiquity of frequency effects in first language acquisition*

BEN AMBRIDGE; EVAN KIDD; CAROLINE F. ROWLAND; ANNA L. THEAKSTON

doi:10.1017/S030500091400049X

The ubiquity of frequency effects in first language acquisition*

Published online by Cambridge University Press: 03 February 2015

BEN AMBRIDGE ,

EVAN KIDD ,

CAROLINE F. ROWLAND and

ANNA L. THEAKSTON

Show author details

BEN AMBRIDGE*: Affiliation:
University of LiverpoolESRC International Centre for Language and Communicative Development (LuCiD)
EVAN KIDD: Affiliation:
Australian National UniversityARC Centre of Excellence for the Dynamics of Language ESRC International Centre for Language and Communicative Development (LuCiD)
CAROLINE F. ROWLAND: Affiliation:
University of LiverpoolESRC International Centre for Language and Communicative Development (LuCiD)
ANNA L. THEAKSTON: Affiliation:
University of ManchesterESRC International Centre for Language and Communicative Development (LuCiD)
*: Address for correspondence: Ben Ambridge, Department of Psychological Sciences, Institute of Psychology Health and Society, University of Liverpool, Eleanor Rathbone Building, Bedford St South, Liverpool, L69 7ZA. Email: Ben.Ambridge@Liverpool.ac.uk

Article contents

Abstract
Footnotes
References

Rights & Permissions

Abstract

This review article presents evidence for the claim that frequency effects are pervasive in children's first language acquisition, and hence constitute a phenomenon that any successful account must explain. The article is organized around four key domains of research: children's acquisition of single words, inflectional morphology, simple syntactic constructions, and more advanced constructions. In presenting this evidence, we develop five theses. (i) There exist different types of frequency effect, from effects at the level of concrete lexical strings to effects at the level of abstract cues to thematic-role assignment, as well as effects of both token and type, and absolute and relative, frequency. High-frequency forms are (ii) early acquired and (iii) prevent errors in contexts where they are the target, but also (iv) cause errors in contexts in which a competing lower-frequency form is the target. (v) Frequency effects interact with other factors (e.g. serial position, utterance length), and the patterning of these interactions is generally informative with regard to the nature of the learning mechanism. We conclude by arguing that any successful account of language acquisition, from whatever theoretical standpoint, must be frequency sensitive to the extent that it can explain the effects documented in this review, and outline some types of account that do and do not meet this criterion.

Information

Type: Review Article
Information: Journal of Child Language , Volume 42 , Issue 2 , March 2015 , pp. 239 - 273

DOI: https://doi.org/10.1017/S030500091400049X [Opens in a new window]
Creative Commons: This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/3.0/), which permits unrestricted re-use, distribution, and reproduction in any medium, provided the original work is properly cited.
Copyright: Copyright © Cambridge University Press 2015

INTRODUCTION

Frequency effects are ubiquitous in virtually every domain of human cognition and behaviour, from the perception of facial attractiveness (Grammer & Thornhill, Reference Grammer and Thornhill1994) and the processing of musical structure (Temperley, Reference Temperley2007) to language change (Bybee, Reference Bybee2010) and adult sentence processing (Ellis, Reference Ellis2002). Our goal in this target article is to argue that frequency effects are ubiquitous also in children's first language acquisition, and to summarize the different types of frequency effect that are observed across all of its subdomains. We argue, very simply, that frequency effects constitute a phenomenon for which any successful theory must account. Such a theory might be a generativist/nativist account, under which children have innate knowledge of abstract categories, but are sensitive to the frequency with which exemplars of these categories are present in the input (e.g. see Yang, Reference Yang2004, for a review). It could equally be a constructivist/usage-based account, under which children build up abstract constructions on the basis of the input, with the aid of little or no innate linguistic knowledge (e.g. Tomasello, Reference Tomasello2003). Regardless of whatever other theoretical assumptions are made, any successful account of language acquisition will need to incorporate frequency-sensitive learning mechanisms.

It is important, at the outset, to clarify our claim. We do not argue that sensitivity to input frequency must be the defining feature, or even the most important feature, of a successful account of acquisition (i.e. we do not argue for a frequency-driven or frequency-based mechanism). It is not difficult to think of factors that are more important than input frequency in at least some scenarios. For example, if we consider the straightforward token frequency of lexical items, there is every reason to believe that children will make more effort to store low-frequency input strings that can be used to obtain desired objects (e.g. cake) than higher-frequency strings that cannot (e.g. the). We argue, instead, for a learning mechanism that is minimally frequency sensitive, under which input frequency need not be the chief determinant of acquisition in all cases.

It is also important to make clear that a frequency-sensitive learning mechanism need not (and most probably does not) entail a mechanism that “computes and matches the frequency of various elements in the input” or acquires “knowledge of frequency” (Bohnacker, Reference Bohnacker, Gülzow and Gagarina2007, pp. 54–55; see Ambridge, Reference Ambridge, Gülzow and Gagarina2010, for discussion). Frequency in this sense (i.e. token frequency) need not be represented per se, but may be instantiated in the strength of representations or neural connections in exactly the same way that explicit and implicit memory for stimuli of all types is boosted by repetition. Similarly, type frequency information may be represented only indirectly, instantiated in the similarity structure of stored exemplars.

Thus far, our claim is relatively uncontroversial: few would disagree that at least some domains of language acquisition show frequency effects at some level (though see Roeper, Reference Roeper, Gülzow and Gagarina2007). But our claim is much broader: we propose that frequency effects are ubiquitous in every domain of child language acquisition and that any apparent null finding simply reflects a failure to conceptualize frequency appropriately, to find a sufficiently sensitive dependent measure, or to hold constant other relevant factors.

We illustrate this claim with evidence from four core domains: the acquisition of single words, inflectional morphology, simple syntactic constructions, and more advanced constructions. Within these sections, our overarching claim takes the form of five inter-related theses:

1. Levels and Kinds Thesis. Frequency effects exist at all levels and are of many different kinds. They are observed not only at the level of concrete lexical strings (perhaps the prototypical frequency effect), but also at the level of abstract categories (e.g. particular orderings of SUBJECT and OBJECT) and cues (e.g. animacy, givenness). There are token frequency effects (e.g. at the level of the word, the more often you hear a word, the more likely you are to learn it) and type frequency effects (e.g. at the level of inflectional morphology, the more verbs you hear with a particular inflectional ending, the more likely you are to learn that ending). There are effects of absolute frequency (e.g. high-frequency words will be learned earlier than low-frequency words) and relative frequency (e.g. of two competing forms, the most frequent will be dominant).
2. Age of Acquisition (AoA) Thesis. All other things being equal, frequent forms will be acquired before less-frequent forms. As we will see in more detail, since all other things are rarely – if ever – equal, this claim does not entail a one-to-one relationship between frequency and age of acquisition (and neither is the definition of ‘acquisition’ straightforward).
3. Prevent Error Thesis. High-frequency forms prevent (or at least reduce) errors in contexts in which they are the target. For example, we will see that third person singular verb forms – almost always the most frequent in the input – are invariably produced correctly in third person singular contexts.
4. Cause Error Thesis. Conversely, high-frequency forms also cause error in contexts in which a competing, related lower-frequency form is the target. For example, we will see that high-frequency third person singular verb forms are often used inappropriately in third person plural contexts.
5. Interaction Thesis. Finally, we propose that frequency effects will interact with other effects. One example is utterance position: high-frequency verbs are generally learned before lower-frequency verbs (a main effect of verb frequency), and this effect is boosted for verbs that occur frequently in utterance-final position (an interaction of verb frequency by utterance position). The downside of these interactions is that they can make frequency effects difficult to detect. The upside is that these interactions are generally informative with regard to the other factors that we need to build into the learning mechanism (e.g. sensitivity to utterance position or temporal ordering).

The remainder of this article synthesizes the considerable empirical support that exists for each of our theses across four domains: single words, inflectional morphology, simple syntactic constructions, and more advanced constructions. This strategy inevitably entails a degree of repetition and overlap, for which we make no apology. The point is that the frequency effects captured by these five theses do not rely on cherry-picking particular domains or debates, but are ubiquitous across first language acquisition.

At this point, we should also clarify that whenever we refer to frequency in this article, we mean input frequency. It is likely that children also show effects of output frequency (e.g. better performance with strings that they produce more often). However, we do not discuss such effects, as, other than in the domain of phonology (e.g. DePaolis, Vihman & Keren-Portnoy, Reference DePaolis, Vihman and Keren-Portnoy2011), few studies have attempted to dissociate effects of input and output frequency. Indeed, this will often prove to be rather difficult, given that the frequency distributions of utterances produced by children and their caregivers are generally extremely similar.

SINGLE WORDS

This section presents evidence for perhaps our two most straightforward theses; that – all else being equal – frequent forms are (a) acquired earlier than less frequent ones (AoA Thesis) and (b) associated with lower rates of error, and higher rates of correct use (Prevent Error Thesis). The findings discussed also constitute evidence for our Interaction Thesis.

In the adult psycholinguistics literature, frequency effects at the single-word level have been almost universally accepted for over a hundred years (e.g. Ebbinghaus, Reference Ebbinghaus1913 [1885]; though for one dissenting view, see Roeper, Reference Roeper, Gülzow and Gagarina2007, p. 26). Higher-frequency words are (i) remembered more easily in both recall and recognition tasks (e.g. Hulme, Roodenrys, Schweickert, Brown, Martin & Stuart, Reference Hulme, Roodenrys, Schweickert, Brown, Martin and Stuart1997), (ii) more easily identified, including when subject to audio degradation (Howes, Reference Howes1957; Savin, Reference Savin1963; Luce, Reference Luce1986), (iii) mispronounced less often (Dell, Reference Dell1990), (iv) judged more quickly and accurately in lexical decision tasks (Forster, Reference Forster, Wales and Walker1976; Balota, Cortese, Sergent-Marshall, Spieler & Yap, Reference Balota, Cortese, Sergent-Marshall, Spieler and Yap2004; Brysbaert & New, Reference Brysbaert and New2009), and (v) correctly judged as high-frequency in subjective frequency-estimation tasks (Balota, Pilotti & Cortese, Reference Balota, Pilotti and Cortese2001).

Similar frequency effects are apparent in children's acquisition (our AoA Thesis). As a rule, children learn frequent words before infrequent ones: American English-speaking children's most common first words in production are (in order) Daddy, Mommy, bye, hi, uh-oh, dog, no, ball, baby, and book (Fenson et al., Reference Fenson, Dale, Resnick, Bates, Thal, Hartung and Reilly1994), not, for example, coffee and computer (words that children certainly hear, just less frequently).

However, there is an important caveat to be made here, one that has sometimes been misunderstood. Our claim is not that frequency is the only predictor, but that frequent words are learned before infrequent ones, all other things being equal. Thus, we do not predict that there will be a one-to-one relationship between frequency and age of acquisition (which is just as well, since children's first word is rarely the). There are many other factors that influence acquisition: a word is more likely to be early learned if it is, inter alia, relevant to the child's communicative goals (Ninio, Reference Ninio2006), associated with an easily identifiable referent (Gentner, Reference Gentner and Kuczaj1982), imageable (Bird, Franklin & Howard, Reference Bird, Franklin and Howard2001), aligned with prosodic boundaries (Christophe & Dupoux, Reference Christophe and Dupoux1996), easy to segment from the continuous speech stream (Monaghan & Christiansen, Reference Monaghan and Christiansen2010), easy to say (Vihman & Vihman, Reference Vihman, Vihman, Arnon and Clark2011), and attested in a wide range of contexts (Naigles & Hoff-Ginsberg, Reference Naigles and Hoff-Ginsberg1998; Küntay & Slobin, Reference Küntay and Slobin2002). Our prediction, thus, is that, in a regression analysis, input frequency will make a significant unique contribution to the variance of the outcome measure (in this case, age of acquisition), even when all of these other factors are included in the model. Although few, if any, studies have controlled for all of these factors, this prediction is, in general, very well supported. For example, independent effects of input frequency on age of acquisition have been found looking across verbs (Naigles & Hoff-Ginsberg, Reference Naigles and Hoff-Ginsberg1998; Smiley & Huttenlocher, Reference Smiley, Huttenlocher, Tomasello and Merriman1995; Theakston, Lieven, Pine & Rowland, Reference Theakston, Lieven, Pine and Rowland2004), adjectives (Blackwell, Reference Blackwell2005), and nouns and function words (Goodman, Dale & Li, Reference Goodman, Dale and Li2008).

Turning now to our Prevent Error Thesis, the domain of single-word acquisition provides ample evidence that high-frequency forms are associated with lower rates of error, and higher rates of correct production and comprehension, than lower-frequency forms. The most direct evidence comes from studies in which word frequency is manipulated experimentally, which allow researchers to control out confounding factors using counterbalancing procedures. For example, Schwartz and Terrel (Reference Schwartz and Terrell1983) taught one- to three-year-old children either four novel nouns or four novel verbs. Each individual word+object/action pair was presented with high frequency (a total of 20 presentations) for half of the children and low frequency (10 presentations) for the remainder. Thus their finding that the high-frequency words were correctly recalled significantly more often than low-frequency words (a finding that held for both nouns and verbs) cannot realistically be attributed to any factor other than input frequency (for similar studies with L2 learners and children with SLI, see Rice, Oetting, Marquis, Bode & Pae, Reference Rice, Oetting, Marquis, Bode and Pae1994; Wang & Koda, Reference Wang and Koda2005; McGregor, Sheng & Ball, Reference McGregor, Sheng and Ball2007; Joe, Reference Joe2010; Eckerth & Tavakoli, Reference Eckerth and Tavakoli2012).

At the same time, while it is useful to be able to control factors such as imageability, prosody, and utterance position experimentally, our Interaction Thesis holds that interactions between frequency and one or more of these other effects are informative with regard to the nature of the language learning mechanism. A detailed analysis of all of these potential interactions is beyond the scope of the present article. However, two findings are relevant as an illustration of the informative nature of interactions between frequency and a second factor, here utterance position and utterance length. In their study of verb acquisition, Naigles and Hoff-Ginsberg (Reference Naigles and Hoff-Ginsberg1998) found that, in addition to overall input frequency, input frequency in utterance-final position was a significant predictor of age of acquisition. Relatedly, Brent and Siskind (Reference Brent and Siskind2001) found that age of acquisition was best predicted not by a word's overall input frequency, but by the frequency with which it appeared as the sole constituent of an utterance.

Consequently, interactions with other factors are not merely a source of noise that must be eliminated in order to observe frequency effects or that can be appealed to in order to explain away null findings. Rather, these interactions can constrain our theories, by informing us about the nature of the learning mechanism, For example, the finding of an interaction between frequency and utterance position (e.g. Naigles & Hoff-Ginsberg, Reference Naigles and Hoff-Ginsberg1998) suggests that we need to posit a learning mechanism that is sensitive to temporal order, rather than, for example, a mechanism that processes entire input sequences one batch at a time. Thus, our Interaction Thesis allows us to make general predictions about the learning mechanism that can be tested in other domains (e.g. morphosyntax; e.g. Freudenthal, Pine, Aguado-Orea & Gobet, Reference Freudenthal, Pine, Aguado-Orea and Gobet2007), and perhaps even non-linguistic domains such as memory for musical notes or sequences (e.g. Berz, Reference Berz1995).

INFLECTED FORMS

In this section we consider children's acquisition of morphologically inflected forms (mainly verbs, but also nouns), and the evidence that this domain provides for three of our theses. The first is that high-frequency forms (in this case surface strings) are associated with lower rates of error, and higher rates of correct use (Prevent Error Thesis). The second is that high-frequency forms can cause errors when used in inappropriate contexts, which – in this domain – essentially means inappropriate person/number contexts (Cause Error Thesis). The third is that there are different types of frequency effect (Levels & Kinds Thesis); the specific kinds of error contrasted here being (a) relative versus absolute and (b) type versus token frequency effects.

Many early investigations concluded that no effect of input frequency could be observed in the domain of the acquisition of inflectional morphology. For example, looking across fourteen different morphemes, Brown (Reference Brown1973) found no correlation between input frequency and age of acquisition, whether looking at individual child–caregiver dyads or across the whole group (see also Newport, Gleitman & Gleitman, Reference Newport, Gleitman, Gleitman, Snow and Ferguson1977; Gleitman & Wanner, Reference Gleitman, Wanner, Bornstein and Lamb1984; De Villiers, Reference De Villiers1985; though see Moerk, Reference Moerk1980, for a reanalysis of Brown's data that did yield frequency effects, and Moerk, Reference Moerk1981, and Pinker, Reference Pinker1981, for further discussion).

The problem with this study, however, is the use of age of ‘acquisition’ (which usually entails first production) in naturalistic speech as the dependent measure. This measure is problematic because children are motivated to talk about certain topics at the expense of others, and thus have little occasion to produce certain inflected forms, even if they know them well. For example, despite their high frequency in the input, children rarely produce second person singular forms. Raw production data simply cannot tell us whether children (a) have failed to learn these forms despite their high frequency or (b) have learned these forms, but find little use for them (e.g. young children are not interested in talking about what their listener is doing).

One solution is to use as our dependent measure not the age at which a particular form is first produced or the raw frequency of these forms in the child's speech but the proportion of correct versus incorrect uses in obligatory contexts. Because this is a proportional measure, it controls for the confound that, for example, first person singular contexts far outnumber third person singular contexts in children's speech. Thus, a better way of examining frequency effects is to test the prediction that the higher the frequency of the individual word form (i.e. the inflected, realized form, as opposed to the lemma), the higher the rate (i.e. proportion) of correct use, and the lower the rate of errors; whether errors of commission or omission (our Prevent Error Thesis).

When this prediction is tested, clear effects of input frequency are found, in both naturalistic (e.g. Theakston, Lieven, Pine & Rowland, Reference Theakston, Lieven, Pine and Rowland2005; Theakston & Lieven, Reference Theakston and Lieven2005, Reference Theakston and Lieven2008; Theakston & Rowland, Reference Theakston and Rowland2009) and experimental studies (e.g. Leonard, Caselli & Devescovi, Reference Leonard, Caselli and Devescovi2002; Dabrowska & Szczerbinski, Reference Dabrowska and Szczerbinski2006; Räsänen, Ambridge & Pine, Reference Räsänen, Ambridge and Pine2014). For example, Dabrowska and Szczerbinkski (Reference Dabrowska and Szczerbinski2006) found a correlation between the input frequency of genitive, dative, and accusative Polish noun case-marking inflections, and children's correct performance with novel noun inflection. These frequency effects are not merely an artefact caused by children's memory or processing difficulties. In adult studies of production latency, differences are found between more and less frequent forms of the same lemma (e.g. playing vs. plays; Jescheniak & Levelt, Reference Jescheniak and Levelt1994). Though, again, it is important to bear in mind that – consistent with our Interaction Thesis – frequency interacts with other factors, including serial position (e.g. Freudenthal et al., Reference Freudenthal, Pine, Aguado-Orea and Gobet2007; Gagarina, Reference Gagarina, Gülzow and Gagarina2007; Freudenthal, Pine & Gobet, Reference Freudenthal, Pine and Gobet2010;) and the form most recently produced by an interlocutor (e.g. Krajewski, Theakston, Lieven & Tomasello, Reference Krajewski, Theakston, Lieven and Tomasello2011).

A number of findings from this domain illustrate another of our theses: high-frequency forms not only prevent errors in contexts where they are the target, but Cause Error where a lower-frequency form is the target. For example, in a naturalistic study of child Spanish, Aguado-Orea (Reference Aguado-Orea2004) found high error rates for third person plural target forms (which are very rare in the input), almost all of which involved the substitution of much more frequent third person singular forms (see also Räsänen, Ambridge & Pine, unpublished observations, for Finnish). Similar findings were reported by Dabrowska (Reference Dabrowska2008) for case-marking errors, Theakston and Rowland (Reference Rowland and Theakston2009) for auxiliary is-for-are errors, and Cameron-Faulkner and Kidd (Reference Cameron-Faulkner and Kidd2007) for are-for-am errors (e.g. *I are playing).

Turning now to our Levels and Kinds Thesis, the domain of inflectional morphology also provides a useful illustration of the difference between the effects of token and type frequency. Token frequency is simply the number of times that a particular string (e.g. Mummy) occurs in the child's input. Type frequency is the number of different items that follow a particular morphosyntactic pattern. Precisely what is meant by the term ‘following a particular pattern’ varies from domain to domain, but a reasonably straightforward case occurs in the English past tense system (e.g. Bybee & Slobin, Reference Bybee and Slobin1982; Bybee & Moder, Reference Bybee and Moder1983). For example, the ow→ew pattern has a high type frequency because many verbs form their past tense in this way (e.g. blow/blew, know/knew, grow/grew, throw/threw), whilst the pattern exemplified by make/made has a very low type frequency (probably a type frequency of 1).

There is some evidence to suggest that patterns with high type frequency are more productive (i.e. more open to newcomers), though it is often difficult, when considering morphological systems, to separate the effect of type frequency from phonological heterogeneity (Janda, Reference Janda1990; Forrester & Plunkett, Reference Forrester, Plunkett, Ramand and Eiselt1994; Bybee, Reference Bybee1995; Hare, Elman & Daughterty, Reference Hare, Elman and Daughtery1995; Plunkett & Nakisa, Reference Plunkett and Nakisa1997; Bowerman & Choi, Reference Bowerman, Choi, Bowerman and Levinson2001; Dąbrowska & Szczerbinski, Reference Dabrowska and Szczerbinski2006; Nicoladis, Palmer & Marentette, Reference Nicoladis, Palmer and Marentette2007; Barðdal, Reference Barðdal2008; Suttle & Goldberg, Reference Suttle and Goldberg2011; Kirjavainen, Nikolaev & Kidd, Reference Kirjavainen, Nikolaev and Kidd2012; Ambridge & Lieven, Reference Ambridge, Lieven, MacWhinney and O'Grady2014). However, there is also evidence to suggest that inflected forms with very high token frequency (e.g. said) constitute unanalyzed frozen phrases, and so do not contribute to analogical generalization at all (e.g. the existence of say→said does not lead children to produce errors such as play→*pled or obey→*obed); see Baayen and Lieber (Reference Baayen and Lieber1991), Bybee (Reference Bybee1995), and Wang and Derwing (Reference Wang, Derwing, Chen and Tang1994).

The domain of inflectional morphology, in particular, English verb past tense and noun plural marking, also illustrates a further contrast within our Levels and Kinds Thesis – absolute vs. relative frequency. With regard to absolute frequency, this domain illustrates the common finding that the more frequent the irregular form (in absolute terms), the more likely children are to produce this form, as opposed to an error (also relevant to our Prevent Error Thesis). For example, the high-frequency irregulars blew and feet are less likely to be over-regularized (e.g. *blowed, *foots) than the low-frequency irregulars drank and shelves (e.g. *drinked and *shelfs) (Marchman, Reference Marchman1997; Marchman, Wulfeck & Weismer, Reference Marchman, Wulfeck and Weismer1999; Maslen, Theakston, Lieven & Tomasello, Reference Maslen, Theakston, Lieven and Tomasello2004).

With regard to relative frequency, errors are particularly common when the target form is infrequent relative to a high-frequency competitor form (e.g. a ‘zero-marked’ form, as in Yesterday I wanted/*want an ice-cream). For example, focusing on zero-marking errors in the domain of noun plural marking, Matthews and Theakston (Reference Matthews and Theakston2006) found that children often produced *two mouse, because the target (mice) is less frequent in the input than the competitor (mouse), but rarely produced *two foot, because the target (feet) is more common in the input than the competitor (foot).

The implication of our Levels and Kinds Thesis is that we need an account that incorporates different types of frequency effect: both absolute frequency (e.g. to explain why Mummy is learned before coffee or why feet resists overgeneralization better than does shelves) and relative frequency (e.g. to explain why children substitute low-frequency third person plural verb forms with erroneous high-frequency third person singular forms of the same verb, or mice with mouse, but not feet with foot). This does not necessarily entail positing that children must ‘decide’ whether to pay attention to absolute or relative frequency in a particular domain (which is just as well, since such a position would be untenable). Children are clearly sensitive to both relative and absolute frequency; the challenge is to posit a learning mechanism that yields effects at both of these levels.

One example is the learning model of Rescorla and Wagner (Reference Rescorla, Wagner, Black and Prokasy1972). In this model, the assumption is that a meaning or entity (e.g. MUMMY) has only a certain amount of associative strength to give out. If this entity is paired with one label (e.g. Mummy), this associative strength does not need to be shared: every pairing of MUMMY and Mummy strengthens the association between the two. If an entity (e.g. MOUSE) is paired with two labels (e.g. Mouse, Mice), its associative strength is shared between the two: every pairing of MOUSE and Mouse strengthens the link between MOUSE and Mouse at the expense of the link between MOUSE and Mice, and vice versa (Ramscar, Dye & McCauley, Reference Ramscar, Dye and McCauley2013; see Legate & Yang, Reference Legate and Yang2007, for a version of this account in the domain of Optional Infinitive errors). Regardless of the merits or otherwise of an associative account of word learning, the point is simply that a learning mechanism can yield effects of both absolute and relative frequency, without it somehow having to ‘decide’ which to use in each domain.

The moral here is that a sophisticated consideration of different possible types of frequency effect (Levels and Kinds Thesis) allows us to constrain theory building in a way that simplistic correlations between the input and output frequency of particular strings cannot. The need to account for effects of both absolute and relative frequency forces us to posit particular types of acquisition model that we may not otherwise have considered; specifically those that build in some form of competition between words with similar meanings and/or surface forms (MacWhinney, 2004). Thus a ‘frequency effect’ can never be an explanation or answer in its own right. Rather, it poses a question: What type of learning mechanism is needed to yield the particular types of frequency effect observed?

MULTIWORD STRINGS AND SIMPLE SYNTACTIC CONSTRUCTIONS

This section discusses frequency effects at the levels of multiword strings and grammatical (i.e. sentence-level) constructions. This domain is useful in particular for illustrating our claim that there exist many different types of frequency effect (Levels and Kinds Thesis), as well as providing evidence for our Prevent Error, Cause Error, and AoA Theses.

Multiword strings

The first type of frequency effect is one that we have discussed already: frequently occurring strings prevent or reduce errors (Prevent Error). This is true not only of single words (including inflected forms) but also of multiword strings. Bannard and Matthews (Reference Bannard and Matthews2008) found that children are better able to repeat four-word sequences found frequently in child-directed speech (CDS) than less-frequent four-word sequences, even when the frequency of the individual items and bigrams was carefully controlled (e.g. comparing a cup of tea with a cup of milk). Similar findings were observed by Matthews and Bannard (Reference Matthews and Bannard2010), Arnon and Snider (Reference Arnon and Snider2010), and Arnon and Clark (Reference Arnon and Clark2011; see also Conklin & Schmitt, Reference Conklin and Schmitt2012, for an overview of such effects in adults). In a different context, a number of studies (Mintz, Reference Mintz2003; Chemla, Mintz, Bernal, and Christophe, Reference Chemla, Mintz, Bernal and Christophe2009; Weisleder & Waxman, 2010; but see Erkelens, Reference Erkelens2009; Stumper, Bannard, Lieven & Tomasello, Reference Stumper, Bannard, Lieven and Tomasello2011) have demonstrated that children are also sensitive to frequent frames: “ordered pairs of words that frequently co-occur with exactly one word position intervening (occupied by any word)” (Mintz, Reference Mintz2003, p. 93).

The second type of frequency effect is also one that we have encountered previously: high-frequency strings not only prevent error when used correctly, but seem to cause errors when used incorrectly (Cause Error Thesis). For example, in a study of early negation, Cameron-Faulkner, Lieven, and Theakston (Reference Cameron-Faulkner, Lieven and Theakston2007) reported that early verbal negation was largely ungrammatical (e.g. no move, no drop it), and therefore reflected creative use on the part of the child (multiword utterances containing the negator no were very rare in the caregiver's speech). However, they argued that these early errors were in fact frequency driven – the child was using the most frequent, functionally generic, and salient single word negator in the input overall (no), which he creatively combined with verbs, resulting in a no+VERB frame. Later in development this made way for a shift towards the use of not+VERB (e.g. not going there, not open the lid), which they argued was due to the high frequency of not in multiword utterances in the input, although not necessarily in combination with verbs. Finally, the child shifted towards the use of auxiliary forms (e.g. Don't sit down here, I can't talk), but this shift was function-dependent (e.g. prohibition, inability) and was closely tied to the frequency of particular AUX+neg forms (e.g. don't, can't) to express particular functions in the input.

These complex effects encompassing frequency of both surface forms and communicative functions pose a challenge for researchers. We currently lack a good understanding of whether and how frequency effects change over the course of development, as a consequence of children's increasing semantic and pragmatic knowledge. Computational models provide one means of investigating how far it is possible to get with relatively simple surface-form learning, provided that the model is sensitive to frequency in an appropriate way (e.g. Freudenthal et al., Reference Freudenthal, Pine, Aguado-Orea and Gobet2007). Incorporating semantic and/or pragmatic coding into these kinds of model (e.g. Chang, Dell & Bock, Reference Chang, Dell and Bock2006) would allow researchers to determine what additional benefit this kind of frequency information provides to the learning mechanism, and how closely the corresponding output matches children's language at different stages in development.

Simple syntactic constructions

In the domain of simple grammatical constructions, we see effects of frequency at a variety of levels and of different kinds; frequency of (a) individual verbs, (b) verb+argument/construction combinations, and (c) abstract cues to word order (Levels and Kinds Thesis). For example, with regard to verb+argument combinations, the order in which children acquire verbs within the transitive and intransitive constructions is predicted by both the overall frequency of the verbs and the frequency of those verbs in those same constructions in the input (Ninio, Reference Ninio1999; Theakston, Lieven, Pine & Rowland, Reference Theakston, Lieven, Pine and Rowland2004), consistent with our AoA Thesis. Focusing on arguments, children's use of grammatical objects with verbs that can occur both transitively and intransitively mirrors the relative use of the two constructions with those same verbs in the input (Theakston, Lieven, Pine & Rowland, Reference Theakston, Lieven, Pine and Rowland2001). Similar findings are observed in so-called weird-word order studies (e.g. Akhtar, Reference Akhtar1999; Abbot-Smith, Lieven & Tomasello, Reference Abbot-Smith, Lieven and Tomasello2001; Matthews, Lieven, Theakston & Tomasello, Reference Matthews, Lieven, Theakston and Tomasello2005, Reference Matthews, Lieven, Theakston and Tomasello2007), in which children follow an experimenter's ungrammatical word order for low-frequency and novel verbs (e.g. Fox bear rammed, Elmo the car gopping), but correct the use of a high-frequency verb to the word order in which it has frequently been attested in the input (e.g. Fox pushed bear). Indeed, a number of grammaticality judgment studies have demonstrated that sensitivity to the frequency of particular verb+argument structure combinations continues into older childhood and adulthood (MacDonald, Reference MacDonald1994, Reference MacDonald and MacWhinney1999; Seidenberg, Reference Seidenberg1997; Ellis, Reference Ellis2002; Stefanowitsch & Gries, Reference Stefanowitsch and Gries2003; Theakston, Reference Theakston2004; Stefanowitsch, Reference Stefanowitsch2008; Wonnacott, Newport & Tanenhaus, Reference Wonnacott, Newport and Tanenhaus2008; Ambridge, Pine & Rowland, Reference Ambridge, Pine and Rowland2012; Ambridge, Pine, Rowland & Chang, Reference Ambridge, Pine, Rowland and Chang2012), with high-frequency combinations again protecting children from error (Prevent Errors).

Continuing our illustration of the Levels and Kinds Thesis, there is evidence that children are sensitive not only to the frequency of particular verb+arugment and verb+construction combinations, but also to the frequency of more abstract cues to word order (possibly at different developmental stages). In particular, investigations of children's developing sensitivity to cues such as word order, case marking, and animacy, in their interpretation of the simple transitive NVN construction, typically show that young children are better able to interpret sentences in which multiple cues indicate the same sentence interpretation than those in which only a single cue operates in isolation or cues conflict. This finding, which has been replicated across a number of languages, reflects the higher frequency of sentences with multiple supporting cues in the input (Bates & MacWhinney, Reference Bates, MacWhinney, Wanner and Gleitman1982; Slobin & Bever, 1982; Dittmar, Abbot-Smith, Lieven & Tomasello, Reference Dittmar, Abbot-Smith, Lieven and Tomasello2008; Goksun, Küntay & Naigles, Reference Göksun, Küntay and Naigles2008; Scott & Fisher, Reference Scott and Fisher2009; Chan, Lieven & Tomasello, Reference Chan, Lieven and Tomasello2009; Ibbotson, Theakston, Lieven & Tomasello, Reference Ibbotson, Theakston, Lieven and Tomasello2011; Candan, Küntay, Yeh, Cheung, Wagner & Naigles, 2012; Matsuo, Kita, Shinya, Wood & Naigles, Reference Matsuo, Kita, Shinya, Wood and Naigles2012; though see Lidz, Gleitman & Gleitman, Reference Lidz, Gleitman and Gleitman2004, for counter-arguments, and Goldberg, Reference Goldberg2004, for a critique of their approach). Later in development, however, children start to grasp the significance of individual, often rather infrequent, cues (e.g. the need to prioritise case marking over word order in German, reflecting a shift from the influence of highly frequent SVO word order, to less-frequent but highly reliable case marking; Dittmar et al., Reference Dittmar, Abbot-Smith, Lieven and Tomasello2008).

Further illustrating our Levels and Kinds Thesis, the domain of the acquisition of simple constructions exhibits a particularly interesting and well-studied interaction between type and token frequency. Several studies (Goldberg, Casenhiser & Sethuraman, Reference Goldberg, Casenhiser and Sethuraman2004; Casenhiser & Goldberg, Reference Casenhiser and Goldberg2005; Goldberg, Casenhiser & White, Reference Goldberg, Casenhiser and White2007) have found that children show an advantage for learning the meanings of ‘skewed’ constructions where one or two types constitute the lion's share of all constructional tokens, as compared to ‘balanced’ constructions where the tokens are divided more evenly amongst the types. The picture has been complicated by the fact that some studies have found no advantage for either type of distribution (Year & Gordon, Reference Year and Gordon2009), or even an advantage for a more balanced distribution (Siebenborn, Krajewski & Lieven, unpublished observations; see Johnson & Goldberg, unpublished observations, for discussion: online <http://www.princeton.edu/~adele/Princeton_Construction_Site/Publications_files/SkewedInput.pdf>). Whatever the overall pattern, for our present purposes, the important point is that – again – we see a case where careful examination of the different types of frequency effect observed constrains theory development by forcing us to build models that can yield these complex effects; effects that would have been missed entirely by an approach that focused solely on the relationship between the input and output frequency of particular tokens.

Although we have focused in this domain on our Levels and Kinds Thesis, this is not to say that our other theses are not supported here. Work on the development of simple grammatical constructions also illustrates our Cause Error Thesis. Theakston (Reference Theakston2012) found that, when producing simple transitive sentences with a discourse-new subject, children as old as five years often produced an underinformative pronoun subject (e.g. He rather than The cat). That is, children seemed to overgeneralize a particularly frequent transitive sentence subject, He (or perhaps even its ‘givenness’ property) into an inappropriate context (one in which the subject is discourse-new). With regard to the Prevent Error Thesis, Rowland and Noble (Reference Rowland and Noble2010) found that children showed better comprehension of dative sentences containing novel verbs when the recipient was a proper noun (e.g. I'm blicking Teddy the frog) than a definite determiner phrase (e.g. I'm blicking the Teddy the frog). Although other factors are no doubt relevant too (e.g. consecutive determiner+noun sequences are confusing), one relevant factor seems to be that 94% of datives in child-directed speech are of the former type. Thus frequency is preventing errors here; but frequency not of individual lexical items or categories, but of cues to thematic role assignment (e.g. ‘being a proper noun’ is a frequently heard cue to recipienthood).

In summary, whilst input frequency effects are straightforwardly (and hence uncontroversially) observed at the levels of individual words or surface strings, effects at the level of sentence constructions are much more evasive. We have argued, however, that frequency effects – token and type, AoA, and preventing and causing error – are no less ubiquitous in this domain than any other. The reason that they often elude discovery is that they tend to be rather abstract: what is relevant is often the frequency not of surface strings but of pairings between concrete lexical items and abstract constructions, of abstract cues to subjecthood, of type:token ratios within a given construction, and so on. Indeed, even when we might be tempted simply to count the number of occurrences of a particular word (e.g. go), the appropriate frequency measure – and the one that yields correlations between children's speech and their input (Theakston, Lieven, Pine & Rowland, Reference Theakston, Lieven, Pine and Rowland2002) – is the frequency of each of its different senses. In short, as the saying goes, not everything that can be (easily) counted counts, and vice versa.

Consequently, if we are to make progress in our understanding of children's acquisition of sentence-level constructions, we need to move away from models based only on surface form and towards models that include roles for abstract factors such as verb meaning, animacy, participant roles, construction-level semantics, and so on (e.g. St John & McClelland, Reference St John and McClelland1990; Gordon & Dell, Reference Gordon and Dell2003; Chang et al., Reference Chang, Dell and Bock2006; Chang, Reference Chang2009; Mayberry, Crocker & Knoeferle, Reference Mayberry, Crocker and Knoeferle2009; see McCauley & Christiansen, Reference McCauley and Christiansen2014, for a review). Of course, if, as we have claimed, abstract frequency effects are important at the level of simple constructions, they are likely to be even more important when considering the more advanced constructions to which we now turn.

MORE ADVANCED CONSTRUCTIONS

Both frequency effects in general, and our five theses in particular, scale up to more advanced constructions. Here we consider three construction types that have received considerable attention in the acquisition literature: questions (focusing mainly on wh-questions, which have tended to attract more research attention than yes/no questions), relative clauses, and passives.

Questions

Most agree that the very first questions that English-speaking children produce are rote-learned, frequently heard, probably unanalyzed strings, such as what's+that (often pronounced as whassat?). Many would also agree with Klima and Bellugi (Reference Klima, Bellugi, Lyons and Wales1966) that these very early questions include partially analyzed high-frequency formulae such as What-X-(doing)? and Where-X-(going)? (see also Fletcher, Reference Fletcher1985). However, the role of frequency beyond these earliest formulaic utterances is more controversial. Here we argue that there is ample evidence that children's early question acquisition is moulded by input frequency well into development. We suggest that studies of question acquisition support three of our theses: (i) that frequent items are acquired before infrequent ones, all else being equal (AoA); (ii) that high-frequency question types can Prevent Errors; and (iii) under some circumstances, an over-reliance on high-frequency forms can Cause Errors).

First, studying the order in which children start to produce wh-words demonstrates that a word's frequency affects how easily and early it is acquired (AoA). Wh-questions in particular provide a good test bed for investigating the effect of frequency on the acquisition of lexical items because they contain a built-in control for many of the other variables that we know interact with (and can mask the effect of) frequency. For example, in English, wh-words always appear in the same position – at the beginning on the clause – so controlling for the effect of sentence position on an item's salience is not necessary. Similarly, all wh-words are roughly equivalent in ease of production since all are one-syllable words which start with one of two phonemes (/w/ for what, where, why, when, and which and /h/ for how and who).

A number of studies have observed a correlation between order of acquisition and input frequency in a range of languages. For example, Rowland, Pine, Lieven, and Theakston (Reference Rowland, Pine, Lieven and Theakston2003) reported that the order in which the twelve Manchester corpus children began to produce English wh-words correlated with the frequency of the wh-words in their input, even when syntactic and semantic complexity were taken into account. Wode (Reference Wode1976), Forner (Reference Forner, Eckman and Hastings1979), Savic (Reference Savic1975), and Clancy (Reference Clancy1989) have reported similar findings for German, Serbo-Croatian, and Korean (see also Tyack & Ingram, Reference Tyack and Ingram1977; Bloom, Merkin & Wootten, Reference Bloom, Merkin and Wootten1982, for English; Okubo, Reference Okubo1967, for Japanese). Once again, input frequency is not the only relevant factor here, since it only accounted for only 13–36% of the variance in the order of wh-word acquisition (Rowland et al., Reference Rowland, Pine, Lieven and Theakston2003), as predicted by our Interaction Thesis, but it is a significant factor nonetheless.

Research into children's questions (both wh- and yes/no) also demonstrates how highly frequent sequences can help protect children from making syntactic errors when constructing sentences (Prevent Error). Although word order errors are rare in children's early productions, English-learning children make a surprising number of these errors in their early question formation. These errors include subject–auxiliary inversion errors in which the tense- and agreement-marked auxiliary occurs post-, instead of pre-subject (e.g. *What he can do?) and double-marking errors in which tense+agreement is marked twice (*What did he didn't want; *What is he isn't eating?; *Does she doesn't want a drink?). These errors pattern systematically, and therefore cannot be dismissed as momentary lapses or slips of the tongue. For example, they are generally more common with some wh-words (e.g. why) and auxiliaries (e.g. DO and the modal auxiliaries), and with negative questions (e.g. Why does she doesn't like it?; Can she can't see him?; Ambridge, Rowland, Theakston & Tomasello, Reference Ambridge, Rowland, Theakston and Tomasello2006; Rowland, Reference Rowland2007; Ambridge & Rowland, Reference Ambridge and Rowland2009; Rowland & Theakston, Reference Rowland and Theakston2009).

The many different theoretical accounts of these errors that have been proposed need not concern us here (e.g. Stromswold, Reference Stromswold1990; De Villiers, Reference De Villiers, Maxwell and Plunkett1991; Valian, Lasser & Mandelbaum, Reference Valian, Lasser and Mandelbaum1992; Santelmann, Berk, Austin, Somashekar & Lust, Reference Santelmann, Berk, Austin, Somashekar and Lust2002). The important point is that whatever other factors may affect rates of error (e.g. polarity and auxiliary type, as discussed above), questions are more susceptible to error when certain wh-words are combined with certain auxiliaries. For example, Rowland and Pine (Reference Rowland and Pine2000) reported that one child, Adam, produced Where shall questions correctly but made errors with What shall. Similarly, he produced errors with How can but not with How do. These findings suggest that, whatever other rules or abstractions young children are using, they are making at least some use of high-frequency lexical frames learned from the input (e.g. How do + X; Rowland & Pine, Reference Rowland and Pine2000: Rowland, Reference Rowland2007; Ambridge & Rowland, Reference Ambridge and Rowland2009). The relevant questions are thus protected from error, since the word order of the question is specified directly in the frames.

If this is the case, then one would expect to see higher error rates for lower-frequency question types for which the child has no frame available, and must therefore be generated using other strategies (e.g. generalizing from existing knowledge). Rowland (Reference Rowland2007; see also Dabrowska & Lieven, Reference Dąbrowska and Lieven2005; Ambridge & Rowland, Reference Ambridge and Rowland2009) directly tested the prediction that question types that had occurred with high frequency in the input would be picked up as frames by children and so would be protected from error. In an analysis of the yes/no and wh-questions produced by ten English-learning children aged two to five years, she reported significantly lower rates of error in question types that were highly frequent in the children's input than in low-frequency question types. Importantly, the analyses ruled out alternative explanations, such as the identity of the wh-word or auxiliary, or the input frequency of the individual words.

The domain of question acquisition also exhibits evidence for our Cause Error Thesis. An over-reliance on frequent frames can not only protect from error, but, in some cases, cause errors, when children use these frames inappropriately, for example by combining a wh-word+auxiliary frame (e.g. Why can), with an inappropriate declarative phrase (she can't drink the milk) to yield a doubling error (Why can she can't drink it the milk?; Dabrowska and Lieven, Reference Dąbrowska and Lieven2005, found that 20% of their potentially frame-derived questions were errors). Ambridge and Rowland (Reference Ambridge and Rowland2009) tested this prediction in an elicitation experiment with English-learning three- to four-year-olds. They reported that doubling errors were more likely to be produced by children who had already learnt the relevant wh+auxiliary frame (Why can), and speculated that doubling errors occurred when children combined these frames with a declarative fragment (Why can + she can't drink the milk), suggesting that stored high-frequency strings can sometimes cause, as well as protect from, error.

Once again, this is a domain in which frequency interacts with other factors such as cognitive complexity (Interaction Thesis). For example, both Rowland (Reference Rowland and Pine2007) and Ambridge and Rowland (Reference Ambridge and Rowland2009) reported that certain question types (e.g. Why don't, and, indeed, most negative questions) attracted higher rates of error than would be expected solely on the basis of input frequency. Again, the conclusion that other factors are also at play does not obviate the need for a frequency-sensitive learning mechanism and, indeed, constrains theory development by highlighting the need for a mechanism that explains the interaction of frequency with other relevant factors.

Finally, it is important to note that an explanation of the frequency effects outlined in this section need not necessarily incorporate the assumption of item-based frames. For example, under Westergaard's (Reference Westergaard2009) approach, children are learning and applying grammatical movement rules (as in the generativist theories mentioned above), but these are framed in terms of language-specific micro-cues that specify in detail when and where different grammatical rules apply. Cues for which there is a lot of evidence in the input (i.e. high-frequency cues) will inevitably be learned first. Thus, as we argued in the ‘Introduction’, a frequency-sensitive account will not necessarily be a constructivist one; a point to which we return in the final section.

Relative clauses

Throughout this article we have emphasized the existence of different types of frequency effect (Levels and Kinds Thesis), from those involving concrete strings to those involving abstract cues and constructions. In this section, we present evidence that frequency effects of the more abstract type are observed for children's acquisition of relative clauses. Thus, frequent forms, when appropriately defined, are associated with earlier acquisition (AoA) and lower error rates (Prevent Error).

At first glance, the bulk of past research on relative clauses (RCs) appears to present a clear counter-argument to the claim that frequency significantly influences acquisition. Most of this research has focused on the acquisition of subject (1) and object (2) RCs.

(1) The girl that chased the boy
(2) The boy that the girl chased

Let us first concentrate on the language for which we have the most data: English. Naturalistic and experimental studies suggest that children acquire subject RCs before object RCs (e.g. Diessel & Tomasello, Reference Diessel and Tomasello2000; Kidd & Bavin, Reference Kidd and Bavin2002). Additionally, a host of adult sentence processing studies have consistently reported a subject advantage for RC processing (e.g. Gibson, Reference Gibson1998). These results, especially the experimental data, are consistent, and replicate across typologically similar languages. This pattern is problematic for any argument that frequency influences syntactic acquisition, since, in English, object RCs are more frequent than subject RCs in child-directed speech (Diessel, Reference Diessel2004) and in spoken language in general (Roland et al., Reference Roland, Dick and Elman2007). We argue in this section that, far from constituting evidence against a frequency-sensitive learning mechanism, the case of RCs reveals the multiplicity of levels in which frequency exerts an influence on acquisition (Levels and Kinds Thesis).

Subject and object RCs differ substantially in their functional-distributional properties. Fox and Thompson (Reference Fox and Thompson1990) first identified a number of dimensions on which the two structures differ. One prominent dimension is the animacy of the head noun: subject RCs are significantly more likely than object RCs to contain an animate head noun, whereas the opposite is the case for inanimate heads. Second, object RCs typically contain discourse-old RC subjects. Finally, both Roland et al. (Reference Roland, Dick and Elman2007) and Fox and Thompson (Reference Fox and Thompson2007) have shown that object RCs in spoken English rarely contain a relative pronoun. As such, although most experimental studies tested object RCs like (2), which contain two animate NPs and an overt relative pronoun, the types of object RCs that are most frequent in spoken discourse more closely resemble (3).

(3) The film I saw last night

The distributional tendencies of object RCs are attributable to two functional properties of language (Du Bois, Reference Du Bois1987): (i) objects are typically inanimate, whereas subjects tend to be animate (typically human); and (ii) subjects tend to be discourse-old. These are statistical properties of language. The likelihood of overt relativizer (that, which) use is also subject to frequency constraints: Fox and Thompson (Reference Fox and Thompson2007) identified several variables that predict the use/non-use of the relativizer, one being whether or not the RC subject was expressed as a pronoun (leading to non-use). Although these distributional facts are often ignored in studies of RC acquisition, they exert significant influences on children's acquisition.

Studies of naturalistic speech show that children quickly converge on these frequency patterns. Diessel (Reference Diessel, Givón and Shibatani2009) reported on the distributional properties of subject and non-subject (predominantly object) RCs in Adam's (Brown, Reference Brown1973) and Abe's (Kuczaj, Reference Kuczaj1976) speech from the CHILDES corpus (MacWhinney, Reference MacWhinney2000). Non-subject RCs overwhelmingly contained inanimate head nouns (91·7%) and pronominal RC subjects (88·1%) (see also Kidd, Brandt, Lieven & Tomasello, Reference Kidd, Brandt, Lieven and Tomasello2007). These numbers closely resembled the frequency of different NP-types in simple transitive clauses in the children's speech, where 86·9% of all subjects were first or second person pronouns. Therefore, despite the fact that non-subject RCs do not follow canonical word order, they do mark syntactic roles canonically (i.e. subject = animate, given, object = inanimate) and in a manner that matches the distributional properties of simple transitive sentences. Crucially, these frequency estimates from corpora predict children's correct production and comprehension of RCs in controlled experimental contexts. For instance, Kidd et al. (Reference Kidd, Brandt, Lieven and Tomasello2007) and Brandt, Kidd, Lieven, and Tomasello (Reference Brandt, Kidd, Lieven and Tomasello2009) showed that the typical subject–object asymmetry is neutralized and in some instances reversed when three- to four-year-old English- and German-speaking children were tested on highly frequent object RC types (i.e. those with an inanimate head noun and a pronominal RC subject) (see also Arnon, Reference Arnon2010).

Thus, as we saw in ‘Simple syntactic constructions’, children's acquisition of RCs is influenced by frequency, but at the level of abstract cues (e.g. animacy, givenness) and lexical items (i.e. pronouns) that are frequently associated with particular sentence positions. These distributional frequencies predict earlier acquisition (AoA), as well as lower error rates, and hence higher rates of correct performance, in both comprehension and production (Prevent Error).

Potentially problematic for this conclusion is the finding that subject RCs are actually the first type of RC to emerge in children's speech (Diessel & Tomasello, Reference Diessel and Tomasello2000). A closer inspection, however, reveals that the vast majority of these early RCs are so-called ‘presentational amalgam’ constructions, as in (4) and (5).

(4) Here's a mouse go sleep
(5) That is a train go go

Lambrecht (Reference Lambrecht, Axmaker, Jaissen and Singmaster1988) described the presentational amalgam construction as a type of truncated RC, where the predicate nominal of the copular clause serves as the subject of the clause-final VP. Their status as true RCs in child language is equivocal: they are monoclausal and lack the obligatory relative pronoun. As such, they closely resemble canonical SV(O) clauses, leading to the possibility that children use their knowledge of frequent structural patterns to break into the syntax of RCs, after which their relative use of subject and object RCs closely approximates adult usage (see Fitz, Chang & Christiansen, Reference Fitz, Chang, Christiansen and Kidd2011, for a connectionist model that uses word-order patterns learned from canonical SVO sentences to acquire the structure of relative clauses). Thus, again, we find that there are many different types of frequency effect (Levels and Kinds Thesis), and that, provided we define ‘form’ at the appropriate level, more frequent forms are associated with earlier acquisition (AoA Thesis).

One final emerging piece of evidence regarding the role of frequency in RC acquisition comes from languages other than English. Several researchers have suggested that the traditional subject–object asymmetry observed in experimental studies of English (and other typologically similar languages) derives from the fact that subject RCs follow canonical word order, whereas object RCs do not (e.g. Bever, Reference Bever and Hayes1970; MacDonald & Christiansen, Reference MacDonald and Christiansen2002). This account makes the following prediction: object RCs should be acquired first and should be easier to understand in languages where their word order follows canonical word order. Chinese languages such as Mandarin and Cantonese follow this pattern. Although there are many more studies to conduct on these languages, there is some evidence in support of this prediction (Yip & Matthews, Reference Yip and Matthews2007; Chan, Matthews & Yip, Reference Chan, Matthews, Yip and Kidd2011; Chen & Shirai, Reference Chen and Shirai2014; though see Hsu, Hermon & Zukowski, Reference Hsu, Hermon and Zukowski2009). Thus, again, we see an effect of frequency, but at a very abstract level: the frequency of particular orderings of SUBJECT and OBJECT roles in the language as a whole; an effect far removed from a view under which the acquisition mechanism is sensitive only to the frequency of particular surface strings.

Whilst the evidence for frequency effects in this domain is clear, what remains unclear is how these effects are represented and implemented on-line. For instance, there is some evidence to suggest that many object RCs are produced using prefabricated chunks (e.g. the one pro VERB; see Fox & Thompson, Reference Fox and Thompson2007; Reali & Christiansen, Reference Reali and Christiansen2005), but the processing advantage shown for object RCs that have less prototypical features (e.g. the pen that I bought) raises the possibility that the constraints of animacy and RC subject might be implemented incrementally on-line (see Kidd et al., Reference Kidd, Brandt, Lieven and Tomasello2007). Given the importance of the wider question of the locus of frequency effects observed in first language acquisition, this is clearly an issue that requires further investigation.

Passives

Research on passives illustrates that frequency effects can be found not only within a given language, but also cross-linguistically (Levels and Kinds Thesis): across languages, a negative correlation is often observed between the relative frequency of a particular construction in the language and the age at which it is typically acquired by its speakers (AoA Thesis). Passives are highly dispreferred in languages like English, German, and Hebrew, and thus occur infrequently. Our most comprehensive naturalistic data come from English: in a large corpus study, Xiao, McEnery, and Qian (Reference Xiao, McEnery and Qian2006) reported that the percentage of all passive types (full and truncated, using either be or get) in spoken British English is 0·16%. Using the Brown (Reference Brown1973) corpus (i.e. American English), Gordon and Chafetz (Reference Gordon and Chafetz1991) reported that full passives occur in only ·005% of all sentences in CDS, whereas truncated passives occur 0·1% of the time. Not surprisingly, passives are also rare in the spontaneous speech of English-speaking children (Pinker, Lebeaux & Frost, Reference Pinker, Lebeaux and Frost1987; Israel, Johnson & Brooks, Reference Israel, Johnson and Brooks2000), a finding that is similar to reports on German (Mills, Reference Mills and Slobin1985) and Hebrew (Berman, Reference Berman and Slobin1985).

The learnability problem posed by infrequent and more advanced structures is well- worn territory in child language research, and the passive has been central to this debate. One way to evaluate how frequency matters is to compare languages such as English and German, in which the passive is infrequent, to languages where the passive occurs with much higher frequency. Indeed, there are several cases in the literature where higher passive frequency results in earlier acquisition (AoA Thesis). For instance, in Sesotho the passive is estimated to be ten times more frequent than it is in English (Kline & Demuth, Reference Kline and Demuth2010), which appears to result in comparatively earlier acquisition (Demuth, Reference Demuth1989; Demuth, Moloi & Machobane, Reference Demuth, Moloi and Machobane2010). Similar effects have been reported for Inuktitut (Allen & Crago, Reference Allen and Crago1996), Bahasa Indonesia (Gil, Reference Gil, Gagarina and Gülzow2006), and Ki'che’ Maya (Pye & Quixtan Poz, Reference Pye and Quixtan Poz1988). In every case the high frequency of passive use appears to stem from particular typological properties of the languages, which, in comparison to European languages, make the passive a less marked structure (Interaction Thesis).

Training studies in English complement the cross-linguistic work. In an early study, Whitehurst, Ironsmith, and Goldfein (Reference Whitehurst, Ironsmith and Goldfein1974) showed that modelling passives to four- to five-year-olds increased their production and comprehension, a finding corroborated by Vasilyeva, Huttenclocher, and Waterfall (Reference Vasilyeva, Huttenlocher and Waterfall2006) (for a training study of rare subject RCs in Turkish, see Sarilar, Matthews & Küntay, Reference Sarilar, Matthews and Küntay2013). The Whitehurst et al., study predates the structural priming literature (e.g. Bock, Reference Bock1986; Pickering & Ferreira, Reference Pickering and Ferreira2008), but nowadays would be interpreted as a priming effect. The passive is the most studied structure in priming studies conducted with developmental populations, showing a consistent priming effect (e.g. Savage, Lieven, Theakston & Tomasello, Reference Savage, Lieven, Theakston and Tomasello2003; Huttenlocher, Vasilyeva & Shimpi, Reference Huttenlocher, Vasilyeva and Shimpi2004; Messenger, Branigan & McLean, Reference Messenger, Branigan and McLean2011; Kidd, Reference Kidd2012).

The robust nature of the priming effect for the English passive has been explained with reference to the structure's low frequency – the so called inverse frequency effect, which describes the tendency for low-frequency structures to yield higher priming effects. Several explanations for this inverse frequency effect have been proposed, but the one that most naturally extends to acquisition is the argument that structural priming effects reflect implicit learning of structure (Chang et al., Reference Chang, Dell and Bock2006): children have a greater tendency to produce low-frequency forms after being primed because priming leads to larger representational change in comparison to more entrenched structures (e.g. the active transitive). Importantly, the account predicts that children will respond to low-frequency forms such as the passive differently across development: representational change in young children following exposure will be greater than in older children (effectively, younger children have more to learn). This leads to a prediction (or even perhaps a caution): we should not expect frequency effects to be uniform across developmental stages and, indeed, individual children (Levels and Kinds Thesis).

Finally, the acquisition of the passive has been shown to be either supported or hindered by its similarity or dissimilarity to other structural patterns. Abbot-Smith and Behrens (Reference Abbot-Smith and Behrens2006) showed that a German-speaking child acquired the stative sein-passive before the eventive werden-passive, even though the two forms are roughly equal in frequency in the input. However, the two passives overlap with other structures that serve to either support (in the case of the sein-passive) or hinder acquisition (in the case of the werden-passive). The acquisition of the sein-passive is facilitated by the previously learned morphologically and functionally similar present perfect, whereas the werden-passive cannot build on a previously acquired construction and competes in function with high-frequency modal verb constructions. Thus we have another instance where frequency at multiple levels interacts with other properties of language, in this case structural overlap, to determine acquisition (Interaction Thesis).

To conclude this section, there is ample evidence to suggest that frequency effects are observed not only for lexical strings and simple structures, but also for more advanced structures including questions, relative clauses, and passives. Because, in many cases, these frequency effects occur at the level of abstract categories, patterns, or cues, they are often more difficult to detect than frequency effects at the single-word or even construction level. When the data are analyzed at the appropriate level of abstraction, however, we see exactly the same types of frequency effect that are observed for other domains. One pressing challenge for future research in this domain is to better determine how frequency effects interact with other features of language, such as typology (e.g. see papers in Kidd, Reference Kidd2011).

THEORETICAL IMPLICATIONS

The present article reviewed frequency effects in four core domains: the acquisition of single words, inflectional morphology, simple syntactic constructions, and more advanced constructions. We argued that frequency effects are ubiquitous across all of these domains, and, indeed, across language acquisition in general. In summarizing this evidence, we argued that there exist different types of frequency effect; for example, effects at the levels of lexical strings and abstract sentence constructions, as well as effects of both type and token frequency and of relative and absolute frequency (Levels and Kinds Thesis). We presented evidence that high-frequency forms are associated with earlier acquisition (AoA Thesis) and lower rates of error (Prevent Error Thesis), but also that they can cause error when used inappropriately (Cause Error Thesis). Finally we argued that frequency effects interact with other effects, such as utterance position, and that such interactions can be informative with regard to the nature of the language acquisition mechanism (Interaction Thesis).

Whether or not we have succeeded in convincing the reader of all of these individual claims, we hope to have marshalled sufficient evidence to convince all but the most hardened classicist (in the sense of Newmeyer, Reference Newmeyer2003) of the ubiquity of frequency effects across all domains of child language acquisition, and that frequency effects therefore constitute a phenomenon for which any successful theory must be able to account.

As we noted in the ‘Introduction’, this might be either a generativist/nativist account that assumes knowledge of innate syntactic categories, principles, and parameters (e.g. Yang, Reference Yang2004; Westergaard, Reference Westergaard2009) or a constructivist/usage-based account that does not (e.g. Tomasello, Reference Tomasello2003). In principle, both classes of account could, given certain assumptions, explain the patterns of frequency effects outlined here. This is not to say, however, that all current theories can explain frequency effects, and that, by making reference to accounts that are incompatible with such effects, we are setting up a straw man. We have already mentioned in passing one account that explicitly denies any meaningful effect of frequency (Roeper, Reference Roeper, Gülzow and Gagarina2007). Much more common are proposals that do not explicitly rule out frequency effects (or, indeed, discuss them at all), but that posit learning procedures that not only (a) yield no frequency effects in their current form, but also (b) could yield no frequency effects without abandoning the core learning mechanism assumed.

An example is the triggering approach to setting word order parameters. Under such accounts (e.g. Sakas and Fodor, Reference Sakas and Fodor2012), children acquire the word order of their language (e.g. SVO for English), not by abstracting across input utterances, but by setting syntactic parameters (e.g. setting the specifier–head and head–complement parameters to the settings that yield SV and VO, respectively). Because the account includes no role for input-based learning, it does not explain the finding that word order is better learned for more frequent verbs (Matthews et al., Reference Matthews, Lieven, Theakston and Tomasello2005, Reference Matthews, Lieven, Theakston and Tomasello2007). Neither can the account straightforwardly be modified to yield such effects. It would be necessary to add the assumption that children learn word order by abstracting across input strings, which entirely obviates the need for the parameter-setting mechanism. The whole point of the account is to explain how children could use triggers to acquire word order rapidly, without having to build this knowledge gradually on the basis of the input. Thus there exist at least some accounts with which the type of frequency effects discussed in the present article are incompatible in principle.

However, while some individual accounts are incompatible with frequency effects, this is not true for whole families of accounts. Both constructivist and generativist accounts (including some parameter-setting accounts) can incorporate frequency-sensitive learning mechanisms. That said, we feel that it would be remiss of us to end this review sitting on the fence, and that we owe it to readers who have persisted this far to nail our colours to the theoretical mast. It will come as no surprise to anyone who has read any of our previous papers that these colours are those of the constructivist camp. But this is not a matter of research tradition, terminology, or simple preference; on our view, the constructivist account offers a more parsimonious account of frequency effects.

Let us illustrate this claim by returning to one of the domains that we have discussed here – inflectional morphology – and, specifically, to a phenomenon to which we have already alluded briefly. The phenomenon is that children sometimes produce agreement-/tense-less verb forms in contexts in which an inflected (here third person singular -s) form is required (e.g. *Dolly eat it). Importantly, both sides agree that this phenomenon is related to the input. For example, English and Dutch children hear these agreement-/tense-less verb forms frequently (e.g. in sentences such as Let Dolly eat it and Dolly can eat it), and so produce these errors at high rates. Italian and Spanish children hear these forms much less frequently, and so produce these errors rarely. Thus both generativist and constructivist researchers agree that this phenomenon can be explained only by positing some kind of frequency-sensitive learning mechanism.

Under a generativist account (e.g. Legate & Yang, Reference Legate and Yang2007), children use the input to set an innately given TENSE parameter to either a positive (the language requires tense/agreement marking) or negative setting (it does not). Because this parameter is set probabilistically on the basis of the input – i.e. in a way that is frequency sensitive – this account can explain why English and Dutch children, who hear these ‘bare’ forms frequently, produce more errors that Italian and Spanish children, who do not.

Under the constructivist account (e.g. Freudenthal et al., Reference Freudenthal, Pine, Aguado-Orea and Gobet2007; Räsänen et al., Reference Räsänen, Ambridge and Pine2014) children make these errors because they are learning from the input individual lexical forms and multiword strings (e.g. play, plays, Let Dolly play, etc.), which they sometimes use inappropriately (e.g. producing Let Dolly play, in a context where Dolly plays would be appropriate). This proposal not only offers a closer fit to the quantitative cross-linguistic pattern, but also explains why – within a given language – some verbs display higher error rates than others (Freudenthal et al., Reference Freudenthal, Pine and Gobet2010). For example, in English, the verbs that children frequently hear in ‘bare’ versus third person singular -s form, particularly in utterance-final position, are exactly those verbs that children frequently produce in bare form in third singular contexts (Theakston, Lieven & Tomasello, Reference Theakston, Lieven and Tomasello2003; Kirjavainen, Theakston & Lieven, Reference Kirjavainen, Theakston and Lieven2009; Freudenthal et al., Reference Freudenthal, Pine and Gobet2010; Räsänen et al., Reference Räsänen, Ambridge and Pine2014).

Now, as we argued above, there is no reason in principle why the generativist account could not be adapted to accommodate these lexical-level frequency findings. One could quite easily propose that, in addition to using input forms to set the TENSE parameter (Legate & Yang, Reference Legate and Yang2007), children additionally store input strings and, on a non-negligible proportion of occasions, produce utterances by retrieving these stored strings directly. Why then, do we favour the constructivist alternative? The reason is that the constructivist account yields these lexical input frequency effects naturally, using the core learning mechanism assumed by the account (i.e. the storage and reuse of strings from the input). In contrast, the generativist account yields these effects by discarding the core mechanism assumed by that account (at least, on a sufficiently large proportion of occasions for the effects to be detectable) and adding ancillary hypotheses that have no independent theoretical motivation within the account; that serve no purpose other than to explain otherwise recalcitrant findings.

An analogous situation applies in every domain that we have investigated. For example, children could acquire word order by setting innate complement–head and specifier–head parameters that spell out (amongst other things) the target order of the innate categories of SUBJECT, VERB, and OBJECT in the language being learned. But in order to explain the finding that children and adults have detailed knowledge of the frequency with which particular verbs have appeared in this construction, the generativist account would have to add the assumption that – in addition to setting this parameter – children record verb+construction collocation frequencies. Again, whilst for the generativist account this assumption is merely an ancillary hypothesis with no independent theoretical motivation, the phenomenon falls naturally and inevitably out of the constructivist account: if children learn the SUBJECT VERB OBJECT construction by abstracting across particular instances of that construction in the input, then the frequency with which each verb has appeared in this construction is immanent in the generalization. We would be the first to admit that there are many important language acquisition phenomena for which current constructivist accounts do not offer a satisfactory explanation; but, on our view, constructivist accounts, which have frequency sensitivity built into their very fabric, provide the most parsimonious explanation of the multiplicity of frequency effects discussed here.

To summarize, the current article has presented evidence of pervasive frequency effects across children's language acquisition. Frequency effects are observed across a variety of different domains, levels (e.g. lexical vs. abstract; type vs. token, absolute vs. relative), and outcome measures (e.g. age of acquisition, rates of error/correct use, types of error), and therefore constitute a phenomenon that demands explanation under any theoretical account. Although we have advocated a constructivist account, this is not to say that alternative approaches are incompatible with frequency effects in principle. The challenge for such accounts is to incorporate motivated mechanisms that yield frequency effects whilst preserving the core mechanistic assumptions of the account.

In conclusion, whilst – as we have tried to stress throughout – frequency isn't everything, frequency certainly isn't nothing. On the contrary, frequency effects constitute a phenomenon that any successful account of child language acquisition must explain.

Footnotes

[*]

The order of authorship is alphabetical.

References

REFERENCES

Abbot-Smith, K. & Behrens, H. (2006). How do known constructions influence the acquisition of other constructions? The German passive and future constructions. Cognitive Science 30, 995–1026.CrossRef Google Scholar PubMed

Abbot-Smith, K., Lieven, E. & Tomasello, M. (2001). What preschool children do and do not do with ungrammatical word orders. Cognitive Development 16, 679–92.CrossRef Google Scholar

Aguado-Orea, J. (2004). The acquisition of morpho-syntax in Spanish: implications for current theories of development. Unpublished PhD thesis, University of Nottingham.Google Scholar

Akhtar, N. (1999). Acquiring basic word order: evidence for data-driven learning of syntactic structure. Journal of Child Language 26, 339–56.CrossRef Google Scholar PubMed

Allen, S. E. & Crago, M. B. (1996). Early passive acquisition in Inuktitut. Journal of Child Language 23, 129–56.CrossRef Google Scholar PubMed

Ambridge, B. (2010). Review of Frequency effects in language acquisition: defining the limits of frequency as an explanatory concept, by Gülzow, I. & Gagarina, N. (eds). Journal of Child Language, 37, 453–60.Google Scholar

Ambridge, B. & Lieven, E. V. M. (2014). A constructivist account of child language acquisition. In MacWhinney, B. & O'Grady, W. (eds), The handbook of language emergence. Oxford: Wiley Blackwell.Google Scholar

Ambridge, B., Pine, J. M. & Rowland, C. F. (2012). Semantics versus statistics in the retreat from locative overgeneralization errors. Cognition 123, 260–79.CrossRef Google Scholar PubMed

Ambridge, B., Pine, J. M., Rowland, C. F. & Chang, F. (2012). The roles of verb semantics, entrenchment and morphophonology in the retreat from dative argument structure overgeneralization errors. Language 88, 45–81.CrossRef Google Scholar

Ambridge, B. & Rowland, C. F. (2009). Predicting children's errors with negative questions: testing a schema-combination account. Cognitive Linguistics 20, 225–66.CrossRef Google Scholar

Ambridge, B., Rowland, C. F., Theakston, A. L. & Tomasello, M. (2006). Comparing different accounts of inversion errors in children's non-subject wh-questions: ‘What experimental data can tell us?’ Journal of Child Language 33, 519–57.CrossRef Google Scholar PubMed

Arnon, I. (2010). Re-thinking child difficulty: the effect of NP-type on children's processing of relative clauses in Hebrew. Journal of Child Language 37, 27–57.CrossRef Google Scholar

Arnon, I. & Clark, E. V. (2011). Why ‘brush your teeth’ is better than ‘teeth’: children's word production is facilitated in familiar sentence-frames. Language Learning and Development 7, 107–29.Google Scholar

Arnon, I. & Snider, N. (2010). More than words: frequency effects for multi-word phrases. Journal of Memory and Language 62, 67–82.CrossRef Google Scholar

Baayen, R. & Lieber, R. (1991). Productivity and English derivation: a corpus-based study. Linguistics 29, 801–43.Google Scholar

Balota, D. A., Cortese, M. J., Sergent-Marshall, S. D., Spieler, D. H. & Yap, M. J. (2004). Visual word recognition of single-syllable words. Journal of Experimental Psychology: General 133, 283–316.CrossRef Google Scholar PubMed

Balota, D. A., Pilotti, M. & Cortese, M. J. (2001). Subjective frequency estimates for 2,938 monosyllabic words. Memory & Cognition 29, 639–47.CrossRef Google Scholar

Bannard, C. & Matthews, D. (2008). Stored word sequences in language learning: the effect of familiarity on children's repetition of four-word combinations. Psychological Science 19, 241–8.CrossRef Google Scholar PubMed

Barðdal, J. (2008). Productivity: evidence from case and argument structure in Icelandic. Amsterdam: John Benjamins.Google Scholar

Bates, E. & MacWhinney, B. (1982). Functionalist approaches to grammar. In Wanner, E. & Gleitman, L. (eds), Language acquisition: the state of the art, 173–218. New York: Cambridge University Press.Google Scholar

Berman, R. (1985). The acquisition of Hebrew. In Slobin, D. I. (ed.), The crosslinguistic study of language acquisition, Vol. 1, 255–371. Hillsdale NJ: Erlbaum.Google Scholar

Berz, W. L. (1995). Working memory in music: a theoretical model. Music Perception 12, 353–64.CrossRef Google Scholar

Bever, T. G. (1970). The cognitive basis for linguistic structure. In Hayes, J. R. (ed.), Cognition and the development of language, 279–362. New York: Wiley.Google Scholar

Bird, H., Franklin, S. & Howard, D. (2001). Age of acquisition and imageability ratings for a large set of words, including verbs and function words. Behavior Research Methods, Instruments & Computers 33, 73–9.Google Scholar

Blackwell, A. A. (2005). Acquiring the English adjective lexicon: relationships with input properties and adjectival semantic typology. Journal of Child Language 32, 535–62.CrossRef Google Scholar PubMed

Bloom, L., Merkin, S. & Wootten, J. (1982). Wh-questions: linguistic factors that contribute to the sequence of acquisition. Child Development 53, 1084–92.Google Scholar

Bock, J. K. (1986). Syntactic persistence in language production. Cognitive Psychology 18, 355–87.CrossRef Google Scholar

Bohnacker, U. (2007). The role of input frequency in article acquisition in early child Swedish. In Gülzow, I. & Gagarina, N. (eds), Frequency effects in language acquisition: defining the limits of frequency as an explanatory concept, 51–82. Berlin & New York: Mouton de Gruyter.Google Scholar

Bowerman, M. & Choi, S. (2001). Shaping meanings for language: universal and language-specific in the acquisition of spatial semantic categories. In Bowerman, Melissa & Levinson, Stephen C. (eds), Language acquisition and conceptual development, 475–511. Cambridge: Cambridge University Press.Google Scholar

Brandt, S., Kidd, E., Lieven, E. & Tomasello, M. (2009). The discourse bases of relativization: an investigation of young German- and English-speaking children's comprehension of relative clauses. Cognitive Linguistics 20, 539–70.CrossRef Google Scholar

Brent, M. R. & Siskind, J. M. (2001). The role of exposure to isolated words in early vocabulary development. Cognition 81, B33–44.Google Scholar

Brown, R. (1973). A first language: the early stages. Cambridge, MA: Harvard University Press.CrossRef Google Scholar

Brysbaert, M. & New, B. (2009). Moving beyond Kučera and Francis: a critical evaluation of current word frequency norms and the introduction of a new and improved word frequency measure for American English. Behavior Research Methods 41, 977–90.CrossRef Google Scholar

Bybee, J. L (1995). Regular morphology and the lexicon. Language and Cognitive Processes 10, 425–55.CrossRef Google Scholar

Bybee, J. L. (2010). Language, usage and cognition. Cambridge: Cambridge University Press.Google Scholar

Bybee, J. L. & Moder, C. L. (1983). Morphological classes as natural categories. Language 59, 251–70.Google Scholar

Bybee, J. L. & Slobin, D. I. (1982). Rules and schemas in the development and use of the English past tense. Language 58, 265–89.CrossRef Google Scholar

Cameron-Faulkner, T. & Kidd, E. (2007). I'm are what I'm are: the acquisition of first-person singular present BE. Cognitive Linguistics 18, 1–22.CrossRef Google Scholar

Cameron-Faulkner, T., Lieven, E. & Theakston, A. (2007). What part of no do children not understand? A usage-based account of multiword negation. Journal of Child Language 34, 251–82.CrossRef Google Scholar

Candan, A., Küntay, A. C., Yeh, Y. C., Cheung, H., Wagner, L. & Naigles, L. R. (2012). Language and age effects in children's processing of word order. Cognitive Development 27(3), 205–221.CrossRef Google Scholar

Casenhiser, D. & Goldberg, A. E. (2005). Fast mapping between a phrasal form and meaning. Developmental Science 8(6), 500–8.Google Scholar

Chan, A., Lieven, E. & Tomasello, M. (2009). Children's understanding of the agent–patient relations in the transitive construction: cross-linguistic comparisons between Cantonese, German, and English. Cognitive Linguistics 20, 267–300.CrossRef Google Scholar

Chan, A., Matthews, S. & Yip, V. (2011). The acquisition of relative clauses in Cantonese and Mandarin. In Kidd, E. (ed.), The acquisition of relative clauses: processing, typology, and function, 197–225. Amsterdam: John Benjamins.CrossRef Google Scholar

Chang, F. (2009). Learning to order words: a connectionist model of heavy NP shift and accessibility effects in Japanese and English. Journal of Memory and Language 61, 374–97.Google Scholar

Chang, F., Dell, G. S. & Bock, K. (2006). Becoming syntactic. Psychological Review 113, 234–72.Google Scholar

Chemla, E., Mintz, T. H., Bernal, S. & Christophe, A. (2009). Categorizing words using ‘frequent frames’: what cross-linguistic analyses reveal about distributional acquisition strategies. Developmental Science 12, 396–406.Google Scholar

Chen, J. & Shirai, Y. (2014). The acquisition of relative clauses in Mandarin Chinese. Journal of Child Language online: <doi:10.1017/S0305000914000300>.Google Scholar PubMed

Christophe, A. & Dupoux, E. (1996). Bootstrapping lexical acquisition: the role of prosodic structure. Linguistic Review 13, 383–412.CrossRef Google Scholar

Clancy, P. (1989). Form and function in the acquisition of Korean wh-questions. Journal of Child Language 16, 323–47.Google Scholar

Conklin, K. & Schmitt, N. (2012). The processing of formulaic language. Annual Review of Applied Linguistics 32, 45–61.Google Scholar

Dabrowska, E. (2008). The later development of an early-emerging system: the curious case of the Polish genitive. Linguistics 46, 629–50.CrossRef Google Scholar

Dąbrowska, E. & Lieven, E. (2005). Towards a lexically specific grammar of children's question constructions. Cognitive Linguistics 16, 437–74.CrossRef Google Scholar

Dabrowska, E. & Szczerbinski, M. (2006). Polish children's productivity with case marking: the role of regularity, type frequency, and phonological diversity. Journal of Child Language 33, 559–97.Google Scholar

De Villiers, J. G. (1985). Learning how to use verbs: lexical coding and the influence of the input. Journal of Child Language 12, 587–95.Google Scholar

De Villiers, J. G. (1991). Why questions? In Maxwell, T. & Plunkett, B. (eds), Papers in the acquisition of ‘wh’, 155–71. Amhurst, MA: University of Massachusetts.Google Scholar

Dell, G. S. (1990). Effects of frequency and vocabulary type on phonological speech errors. Language and Cognitive Processes 4, 313–49.CrossRef Google Scholar

Demuth, K. (1989). Maturation and the acquisition of Sesotho passive. Language 65, 56–80.CrossRef Google Scholar

Demuth, K., Moloi, F. & Machobane, M. (2010). Three year-olds’ comprehension, production and generalization of the Sesotho passives. Cognition 115, 238–51.Google Scholar

DePaolis, R. A., Vihman, M. M. & Keren-Portnoy, T. (2011). Do production patterns influence the processing of speech in prelinguistic infants? Infant Behavior and Development 34, 590–601.Google Scholar

Diessel, H. (2004). The acquisition of complex sentences. Cambridge: Cambridge University Press.Google Scholar

Diessel, H. (2009). On the role of frequency and similarity in the acquisition of subject and non-subject relative clauses. In Givón, T. & Shibatani, M. (eds), Syntactic complexity: diachrony, acquisition, neurocognition, evolution, 251–76. Amsterdam: John Benjamins.CrossRef Google Scholar

Diessel, H. & Tomasello, M. (2000). The development of relative clauses in spontaneous child speech. Cognitive Linguistics 11, 131–51.Google Scholar

Dittmar, M., Abbot-Smith, K., Lieven, E. & Tomasello, M. (2008). German children's comprehension of word order and case marking in causative sentences. Child Development 79, 1152–67.CrossRef Google Scholar PubMed

Du Bois, J. W. (1987) The discourse basis of ergativity. Language 63, 805–55.Google Scholar

Ebbinghaus, H. (1913 [1885]). Memory: a contribution to experimental psychology. New York: Teachers College, Columbia University.Google Scholar

Eckerth, J. & Tavakoli, P. (2012). The effects of word exposure frequency and elaboration of word processing on incidental L2 vocabulary acquisition through reading. Language Teaching Research 16, 227–52.CrossRef Google Scholar

Ellis, N. C. (2002). Frequency effects in language processing. Studies in Second Language Acquisition 24, 143–88.Google Scholar

Erkelens, M. A. (2009). Learning to categorize verbs and nouns: studies on Dutch. Utrecht: LOT Dissertations.Google Scholar

Fenson, L., Dale, P., Resnick, S., Bates, E., Thal, D., Hartung, J. & Reilly, J. (1994). Variability in early communication development. Monographs of the Society for Research in Child Development 59.Google Scholar

Fitz, H., Chang, F. & Christiansen, M. H. (2011). A connectionist account of the acquisition and processing of relative clauses. In Kidd, E. (ed.), The acquisition of relative clauses, 39–60. Amsterdam: John Benjamins.Google Scholar

Fletcher, P. (1985). A child's learning of English. Oxford: Blackwell.Google Scholar

Forner, M. (1979). The mother as LAD: interaction between order and frequency of parental input and child production. In Eckman, F. R. & Hastings, A. J. (eds), Studies in first and second language acquisition, 17–44. Rowley, MA: Newbury.Google Scholar

Forrester, N. & Plunkett, K. (1994). Learning the Arabic plural: the case of minority default mappings in connectionist networks. In Ramand, A. & Eiselt, K. (eds), Proceedings of the Sixteenth Annual Conference of the Cognitive Science Society, 319–23. Hillsdale, NJ: Erlbaum.Google Scholar

Forster, K. (1976). Accessing the mental lexicon. In Wales, R. J. & Walker, E. (eds), New approaches to language mechanisms, 257–87. Amsterdam: North Holland.Google Scholar

Fox, B. & Thompson, S. (1990). A discourse explanation of the grammar of relative clauses in English conversation. Language 66, 856–70.CrossRef Google Scholar

Fox, B. & Thompson, S. (2007). Relative clauses in English conversation: relativizers, frequency, and the notion of construction. Studies in Language 31, 293–326.CrossRef Google Scholar

Freudenthal, D., Pine, J. M., Aguado-Orea, J. & Gobet, F. (2007). Modeling the developmental patterning of finiteness marking in English, Dutch, German, and Spanish using MOSAIC. Cognitive Science 31, 311–41.Google Scholar

Freudenthal, D., Pine, J. M. & Gobet, F. (2010). Explaining quantitative variation in the rate of Optional Infinitive errors across languages: a comparison of MOSAIC and the Variational Learning Model. Journal of Child Language 37, 643–69.CrossRef Google Scholar PubMed

Gagarina, N. (2007). What happens when adults often use infinitives. In Gülzow, I. & Gagarina, N. (eds), Frequency effects in language acquisition: defining the limits of frequency as an explanatory concept, 205–36. Berlin & New York: Mouton de Gruyter.Google Scholar

Gentner, D. (1982). Why nouns are learned before verbs: linguistic relativity versus natural partitioning. In Kuczaj, S. A. (ed.), Language development, vol. 2: language, thought, and culture, 301–34. Hillsdale, NJ: Erlbaum.Google Scholar

Gibson, E. (1998). Linguistic complexity: locality of syntactic dependencies. Cognition 68, 1–76.Google Scholar

Gil, D. (2006) The acquisition of voice morphology in Jakarta Indonesian. In Gagarina, N. & Gülzow, I. (eds), The acquisition of verbs and their grammar: the effect of particular languages, 201–27. Dordrecht: Springer.Google Scholar

Gleitman, L. & Wanner, E. (1984). Current issues in language learning. In Bornstein, M. & Lamb, M. (eds), Perceptual, cognitive, and linguistic development, Vol. 2 of Developmental psychology: an advanced textbook, 297–356. Hillsdale, NJ: Erlbaum.Google Scholar

Göksun, T., Küntay, A. C. & Naigles, L. R. (2008). Turkish children use morphosyntactic bootstrapping in interpreting verb meaning. Journal of Child Language 35, 291–323.CrossRef Google Scholar PubMed

Goldberg, A. E. (2004). But do we need universal grammar? Comment on Lidz et al. (2003). Cognition 94, 77–84.CrossRef Google Scholar PubMed

Goldberg, A. E., Casenhiser, D. & Sethuraman, N (2004). Learning argument structure generalizations. Cognitive Linguistics 14, 289–316.Google Scholar

Goldberg, A. E., Casenhiser, D. & White, T. (2007). Constructions as categories of language. New Ideas in Psychology 25, 70–86.Google Scholar

Goodman, J. C., Dale, P. S. & Li, P. (2008). Does frequency count? Parental input and the acquisition of vocabulary. Journal of Child Language 35, 515–31.Google Scholar

Gordon, J. K. & Dell, G. S. (2003). Learning to divide the labor: an account of deficits in light and heavy verb production. Cognitive Science 27, 1–40.Google Scholar

Gordon, P. & Chafetz, J. (1991). Verb-based vs. class-based accounts of actionality effects in children's comprehension of the passive. Cognition 36, 227–54.CrossRef Google Scholar

Grammer, K. & Thornhill, R. (1994). Human (Homo sapiens) facial attractiveness and sexual selection: the role of symmetry and averageness. Journal of Comparative Psychology 108, 233–242.Google Scholar

Hare, M., Elman, J. L. & Daughtery, K. G. (1995). Default generalisation in connectionist networks. Language and Cognitive Processes 10, 601–30.CrossRef Google Scholar

Howes, D. (1957). On the relation between the intelligibility and frequency of occurrence of English words. Journal of the Acoustical Society of America 29, 296–305.CrossRef Google Scholar

Hsu, C. C. N., Hermon, G. & Zukowski, A. (2009). Young children's production of head-final relative clauses: elicited production data from Chinese children. Journal of East Asian Linguistics 18, 323–60.Google Scholar

Hulme, C., Roodenrys, S., Schweickert, R., Brown, G. D., Martin, S. & Stuart, G. (1997). Word-frequency effects on short-term memory tasks: evidence for a redintegration process in immediate serial recall. Journal of Experimental Psychology: Learning, Memory, and Cognition 23, 1217–32.Google Scholar PubMed

Huttenlocher, J., Vasilyeva, M. & Shimpi, P. (2004). Syntactic priming in young children. Journal of Memory and Language 50, 182–95.CrossRef Google Scholar

Ibbotson, P., Theakston, A., Lieven, E. & Tomasello, M. (2011). The role of pronoun frames in early comprehension of transitive constructions in English. Language Learning and Development 7, 24–39.CrossRef Google Scholar

Israel, M., Johnson, C. & Brooks, P. J. (2000). From states to events: the acquisition of English passive participles. Cognitive Linguistics 11, 103–29.CrossRef Google Scholar

Janda, R. D. (1990). Frequency, markedness and morphological change: on predicting the spread of noun-plural -s in Modern High German and West Germanic. Proceedings of the Eastern States Conference on Linguistics (ESCOL) 7, 136–53.Google Scholar

Jescheniak, J. D. & Levelt, W. J. M. (1994). Word frequency effects in speech production: retrieval of syntactic information and of phonological form. Journal of Experimental Psychology: Learning, Memory, and Cognition 20, 824–43.Google Scholar

Joe, A. (2010). The quality and frequency of encounters with vocabulary in an English for Academic Purposes programme. Reading in a Foreign Language 22, 117–38.Google Scholar

Kidd, E. (ed.) (2011). The acquisition of relative clauses: processing, typology, and function. Amsterdam: John Benjamins.Google Scholar

Kidd, E. (2012). Individual differences in syntactic priming in language acquisition. Applied Psycholinguistics, 33, 393–418.Google Scholar

Kidd, E. & Bavin, E. L. (2002). English-speaking children's comprehension of relative clauses: evidence for general-cognitive and language-specific constraints on development. Journal of Psycholinguistic Research 31, 599–617.Google Scholar

Kidd, E., Brandt, S., Lieven, E. & Tomasello, M. (2007). Object relatives made easy: a crosslinguistic comparison of the constraints influencing young children's processing of relative clauses. Language and Cognitive Processes 22, 860–97.Google Scholar

Kirjavainen, M., Nikolaev, A. & Kidd, E. (2012). The effect of frequency and phonological neighbourhood density on the acquisition of past tense verbs by Finnish children. Cognitive Linguistics 23, 273–315.Google Scholar

Kirjavainen, M., Theakston, A. & Lieven, E. (2009). Can input explain children's me-for-I errors? Journal of Child Language 36, 1091–114.Google Scholar

Klima, E. & Bellugi, U. (1966). Syntactic regularities in the speech of children. In Lyons, J. & Wales, J. R. (eds), Psycholinguistic papers: the proceedings of the Edinburgh conference, 183–208. Edinburgh: Edinburgh University Press.Google Scholar

Kline, M. & Demuth, K. (2010). Factors facilitating implicit learning: the case of the Sesotho passive. Language Acquisition 17, 220–34.Google Scholar

Krajewski, G., Theakston, A. L., Lieven, E. V. M. & Tomasello, M. (2011). How Polish children switch from one case to another when using novel nouns: challenges for models of inflectional morphology. Language and Cognitive Processes 26, 830–61.Google Scholar

Kuczaj, S. (1976). -ing, -s, and -ed: a study on the acquisition of certain verb inflections. Unpublished PhD dissertation, University of Minnesota.Google Scholar

Küntay, A. & Slobin, D. I. (2002). Putting interaction back into child language: examples from Turkish. Psychology of Language and Communication 6, 5–14.Google Scholar

Lambrecht, K. (1988). ‘There was a farmer had a dog’: syntactic amalgams revisited. In Axmaker, S., Jaissen, A. & Singmaster, H. (eds), Proceedings of the Fourteenth Annual Meeting of the Berkeley Linguistics Society, 319–39. Berkeley: Berkeley Linguistics Society.Google Scholar

Legate, J. A. & Yang, C. (2007). Morphosyntactic learning and the development of tense. Language Acquisition 14, 315–44.CrossRef Google Scholar

Leonard, L. B., Caselli, M. C. & Devescovi, A. (2002). Italian children's use of verb and noun morphology during the preschool years. First Language 3, 287–304.Google Scholar

Lidz, J., Gleitman, H. & Gleitman, L. (2003). Understanding how input matters: verb learning and the footprint of universal grammar. Cognition 87, 151–78.Google Scholar

Luce, P. A. (1986). A computational analysis of uniqueness points in auditory word recognition. Perception and Psychophysics 39, 155–8.CrossRef Google Scholar PubMed

MacDonald, M. C. (1994). Probabilistic constraints and syntactic ambiguity resolution. Language and Cognitive Processes 9, 157–201.CrossRef Google Scholar

MacDonald, M. C. (1999). Distributional information in language comprehension, production, and acquisition: three puzzles and a moral. In MacWhinney, B. (ed.), The emergence of language, 177–96. Mahwah, NJ: Erlbaum.Google Scholar

MacDonald, M. C. & Christiansen, M. H. (2002). Reassessing working memory: comment on Just and Carpenter (1992) and Waters and Caplan (1996). Psychological Review 109, 35–54.CrossRef Google Scholar PubMed

MacWhinney, B. (2000). The CHILDES project: tools for analyzing talk, 3rd ed.Mahwah, NJ: Lawrence Erlbaum Associates.Google Scholar

MacWhinney, B. (2004). A multiple process solution to the logical problem of language acquisition. Journal of Child Language 31(4), 883–914.CrossRef Google Scholar

Marchman, V. A. (1997). Children's productivity in the English past tense: the role of frequency, phonology, and neighborhood structure. Cognitive Science 21, 283–303.Google Scholar

Marchman, V. A., Wulfeck, B. & Weismer, S. E. (1999). Morphological productivity in children with normal language and SLI: a study of the English past tense. Journal of Speech, Language, and Hearing Research 42, 206–19.CrossRef Google Scholar PubMed

Maslen, R. J. C., Theakston, A. L., Lieven, E. V. M. & Tomasello, M. (2004). A dense corpus study of past tense and plural overregularization in English. Journal of Speech, Language, and Hearing Research 47, 1319–33.Google Scholar

Matsuo, A., Kita, S., Shinya, Y., Wood, G. C. & Naigles, L. (2012). Japanese two-year-olds use morphosyntax to learn novel verb meanings. Journal of Child Language 39(3), 637–63.Google Scholar

Matthews, D. & Bannard, C. (2010). Children's production of unfamiliar word sequences is predicted by positional variability and latent classes in a large sample of child-directed speech. Cognitive Science 34, 465–88.Google Scholar

Matthews, D., Lieven, E., Theakston, A. & Tomasello, M. (2005). The role of frequency in the acquisition of English word order. Cognitive Development 20, 121–36.CrossRef Google Scholar

Matthews, D., Lieven, E., Theakston, A. & Tomasello, M. (2007). French children's use and correction of weird word orders: a constructivist account. Journal of Child Language 34, 381–409.CrossRef Google Scholar

Matthews, D. & Theakston, A. L. (2006). Errors of omission in English-speaking children's production of plurals and the past tense: the effects of frequency, phonology, and competition. Cognitive Science 30, 1027–52.Google Scholar

Mayberry, M. R., Crocker, M. W. & Knoeferle, P. (2009). Learning to attend: a connectionist model of situated language comprehension. Cognitive Science 33, 449–96.Google Scholar

McCauley, S. M. & Christiansen, M. H. (2014). Prospects for usage-based computational models of grammatical development: argument structure and semantics roles. Wiley Interdisciplinary Reviews: Cognitive Science, 5(4), 489–499.Google Scholar

McGregor, K. K., Sheng, L. & Ball, T. (2007). Complexities of expressive word learning over time. Language, Speech, and Hearing Services in Schools, 38, 353–64.CrossRef Google Scholar PubMed

Messenger, K., Branigan, H. P. & McLean, J. F. (2011). Evidence for (shared) abstract structure underlying children's short and full passives. Cognition 121, 268–74.Google Scholar

Mills, A. (1985). The acquisition of German. In Slobin, D. I. (ed.), The crosslinguistic study of language acquisition, Vol. 1, 141–254. Hillsdale, NJ: Erlbaum.Google Scholar

Mintz, T. (2003). Frequent frames as a cue for grammatical categories in child directed speech. Cognition 90, 91–117.Google Scholar

Moerk, E. L. (1980). Relationships between parental input frequencies and children's language acquisition: a reanalysis of Brown's data. Journal of Child Language 7, 105–18.CrossRef Google Scholar PubMed

Moerk, E. L. (1981). To attend or not to attend to unwelcome reanalyses? A reply to Pinker. Journal of Child Language 8, 627–32.Google Scholar

Monaghan, P. & Christiansen, M. H. (2010). Words in puddles of sound: modelling psycholinguistic effects in speech segmentation. Journal of Child Language 37, 545–64.Google Scholar

Naigles, L. R. & Hoff-Ginsberg, E. (1998). Why are some verbs learned before other verbs? Effects of input frequency and structure on children's early verb use. Journal of Child Language 25, 95–120.Google Scholar

Newmeyer, F. J. (2003). Grammar is grammar and usage is usage. Language 79, 682–707.CrossRef Google Scholar

Newport, E. L., Gleitman, H. & Gleitman, L. A. (1977). ‘Mother, I'd rather do it myself’: some effects and non-effects of maternal speech style. In Snow, C. & Ferguson, C. (eds), Talking to children: language input and acquisition, 109–49. Cambridge: Cambridge University Press.Google Scholar

Nicoladis, E., Palmer, A. & Marentette, P. (2007). The role of type and token frequency in using past tense morphemes correctly. Developmental Science 10, 237–54.CrossRef Google Scholar PubMed

Ninio, A. (1999). Pathbreaking verbs in syntactic development and the question of prototypical transitivity. Journal of Child Language 26, 619–53.Google Scholar

Ninio, A. (2006). Language and the learning curve: a new theory of syntactic development. Oxford: Oxford University Press.Google Scholar

Okubo, A. (1967). Yooji gengo no hattatsu [Children's language development]. Tokyo: Tokyodoo.Google Scholar

Pickering, M. J. & Ferreira, V. S. (2008). Structural priming: a critical review. Psychological Bulletin 134, 427–59.CrossRef Google Scholar PubMed

Pinker, S. (1981). On the acquisition of grammatical morphemes. Journal of Child Language 8, 477–84.Google Scholar

Pinker, S., Lebeaux, D. & Frost, L. A. (1987). Productivity and constraints in the acquisition of the passive. Cognition 26, 195–267.Google Scholar

Plunkett, K. & Nakisa, R. C. (1997). A connectionist model of the Arabic plural system. Language and Cognitive Processes 12, 807–36.Google Scholar

Pye, C. & Quixtan Poz, P. (1988). Precocious passives (and antipassives) in Quiché Mayan. Papers and Reports on Child Language Development 27, 71–80.Google Scholar

Ramscar, M., Dye, M. & McCauley, S. (2013). Error and expectation in language learning: the curious absence of ‘mouses’ in adult speech. Language 89, 760–93.Google Scholar

Räsänen, S. H. M., Ambridge, B. & Pine, J. M. (2014) Infinitives or bare stems? Are English-speaking children defaulting to the highest-frequency form? Journal of Child Language 41(4), 756–79.Google Scholar

Reali, F. & Christiansen, M. H. (2005). Uncovering the richness of the stimulus: structure dependence and indirect statistical evidence. Cognitive Science 29, 1007–28.Google Scholar

Rescorla, R. A. & Wagner, A. R. (1972). A theory of Pavlovian conditioning: variations in the effectiveness of reinforcement and nonreinforcement. In Black, A. H. & Prokasy, W. F. (eds), Classical conditioning II: current research and theory, 64–99. New York: Appleton-Century-Crofts.Google Scholar

Rice, M. L., Oetting, J. B., Marquis, J., Bode, J. & Pae, S. Y. (1994). Frequency of input effects on word comprehension of children with Specific Language Impairment. Journal of Speech and Hearing Research 37, 106–22.CrossRef Google Scholar PubMed

Roeper, T. (2007). What frequency can do and what it can't. In Gülzow, I. & Gagarina, N. (eds), Frequency effects in language acquisition: defining the limits of frequency as an explanatory concept, 23–50. Berlin & New York: Mouton de Gruyter.Google Scholar

Roland, D., Dick, F. & Elman, J. (2007). Frequency of basic English grammatical structures: a corpus analysis. Journal of Memory and Language 57, 348–79.Google Scholar

Rowland, C. F. (2007). Explaining errors in children's questions. Cognition 104, 106–34.Google Scholar

Rowland, C. F. & Noble, C. L. (2010). The role of syntactic structure in children's sentence comprehension: evidence from the dative. Language Learning and Development 7, 55–75.Google Scholar

Rowland, C. F. & Pine, J. M. (2000). Subject–auxiliary inversion errors and wh-question acquisition: ‘What children do know?’ Journal of Child Language 27, 157–81.Google Scholar

Rowland, C. F., Pine, J. M., Lieven, E. V. & Theakston, A. L. (2003). Determinants of acquisition order in wh-questions: re-evaluating the role of caregiver speech. Journal of Child Language 30, 609–36.CrossRef Google Scholar PubMed

Rowland, C. F. & Theakston, A. L. (2009). The acquisition of auxiliary syntax: a longitudinal elicitation study, Part 2: the modals and auxiliary DO. Journal of Speech, Language, and Hearing Research 52, 1471–92.Google Scholar

Sakas, W. G. & Fodor, J. D. (2012). Disambiguating syntactic triggers. Language Acquisition 19, 83–143.CrossRef Google Scholar

Sarilar, A., Matthews, D. & Küntay, A. C. (2013). Hearing relative clauses boosts relative clause usage (and referential clarity) in young Turkish language learners. Applied Psycholinguistics online: <doi:10.1017/S0142716413000192>.Google Scholar

Santelmann, L., Berk, S., Austin, J., Somashekar, S. & Lust, B. (2002). Continuity and development in the acquisition of inversion in yes/no questions: dissociating movement and inflection. Journal of Child Language 29, 813–42.Google Scholar

Savage, C., Lieven, E. V. M., Theakston, A. & Tomasello, M. (2003). Testing the abstractness of children's linguistic representations: lexical and structural priming of syntactic constructions in young children. Developmental Science 6, 557–67.Google Scholar

Savic, S. (1975). Aspects of adult–child communication: the problem of question acquisition. Journal of Child Language 2, 251–60.Google Scholar

Savin, H. B. (1963). Word-frequency effects and errors in the perception of speech. Journal of the Acoustical Society of America 35, 200–6.CrossRef Google Scholar

Schwartz, R. G. & Terrell, B. Y. (1983). The role of input frequency in lexical acquisition. Journal of Child Language 10, 57–64.Google Scholar

Scott, R. M. & Fisher, C. (2009). Two-year-olds use distributional cues to interpret transitivity-alternating verbs. Language and Cognitive Processes 24, 777–803.Google Scholar

Seidenberg, M. S. (1997). Language acquisition and use: learning and applying probabilistic constraints. Science 275, 1599–604.Google Scholar

Slobin, D. I. & Bever, T. G. (1982). Children use canonical sentence schemas: a crosslinguistic study of word order and inflections. Cognition 12(3), 229–265.Google Scholar

Smiley, P. & Huttenlocher, J. (1995). Conceptual development and the child's early words for events, objects and persons. In Tomasello, M. & Merriman, W. (eds), Beyond names for things: the acquisition of verbs, 21–62. Hillsdale, NJ: Erlbaum.Google Scholar

St John, M. F. & McClelland, J. L. (1990). Learning and applying contextual constraints in sentence comprehension. Artificial Intelligence 46, 217–57.Google Scholar

Stefanowitsch, A. (2008). Negative evidence and preemption: a constructional approach to ungrammaticality. Cognitive Linguistics 19, 513–31.Google Scholar

Stefanowitsch, A. & Gries, S. T. (2003). Collostructions: investigating the interaction of words and constructions. International Journal of Corpus Linguistics 8, 209–43.Google Scholar

Stromswold, K. (1990). Learnability and the acquisition of auxiliaries. Unpublished PhD dissertation, MIT.Google Scholar

Stumper, B., Bannard, C., Lieven, E. V. M. & Tomasello, M. (2011). ‘Frequent frames’ in German child-directed speech: a limited cue to grammatical categories. Cognitive Science 35, 1190–205.Google Scholar

Suttle, L. & Goldberg, A. E. (2011). The partial productivity of constructions as induction. Linguistics 49, 1237–69.Google Scholar

Temperley, D. (2007). Music and probability. Cambridge, MA: MIT Press.Google Scholar

Theakston, A. L. (2004). The role of entrenchment in children's and adults’ performance on grammaticality judgement tasks. Cognitive Development 19, 15–34.Google Scholar

Theakston, A. L. (2012). ‘The spotty cow tickled the pig with a curly tail’: How do sentence position, preferred argument structure, and referential complexity affect children's and adults’ choice of referring expression? Applied Psycholinguistics 33, 691–724.CrossRef Google Scholar

Theakston, A. L. & Lieven, E. V. M. (2005). The acquisition of auxiliaries BE and HAVE: an elicitation study. Journal of Child Language 32, 587–616.Google Scholar

Theakston, A. L. & Lieven, E. V. M. (2008). The influence of discourse context on children's provision of auxiliary BE. Journal of Child Language 35(1), 129–58.Google Scholar

Theakston, A. L., Lieven, E. V. M., Pine, J. M. & Rowland, C. F. (2001). The role of performance limitations in the acquisition of verb-argument structure: an alternative account. Journal of Child Language 28, 127–52.Google Scholar

Theakston, A. L., Lieven, E. V. M., Pine, J. M. & Rowland, C. F. (2002). ‘Going’, ‘going’, ‘gone’: the acquisition of the verb ‘go’. Journal of Child Language 29, 783–811.Google Scholar

Theakston, A. L., Lieven, E. V. M., Pine, J. M. & Rowland, C. F. (2004). Semantic generality, input frequency and the acquisition of syntax. Journal of Child Language 31, 61–99.Google Scholar

Theakston, A. L., Lieven, E. V. M., Pine, J. M. & Rowland, C. F. (2005). The acquisition of auxiliary syntax: BE and HAVE. Cognitive Linguistics 16, 247–77.Google Scholar

Theakston, A. L., Lieven, E. V. M. & Tomasello, M. (2003). The role of the input in the acquisition of third person singular verbs in English. Journal of Speech, Language, and Hearing Research 46, 863–77.Google Scholar

Theakston, A. L. & Rowland, C. F. (2009). The acquisition of auxiliary syntax: a longitudinal elicitation study. Part 1: auxiliary BE. Journal of Speech, Language, and Hearing Research 52, 1449–70.Google Scholar

Tomasello, M. (2003). Constructing a language: a usage-based theory of language acquisition. Cambridge, MA: Harvard University Press.Google Scholar

Tyack, D. & Ingram, D. (1977). Children's production and comprehension of questions. Journal of Child Language 4, 211–28.CrossRef Google Scholar

Valian, V., Lasser, I. & Mandelbaum, D. (1992). Children's early questions. Paper presented at the 17th Annual Boston University Conference on Language Development, Boston, MA.Google Scholar

Vasilyeva, M., Huttenlocher, J. & Waterfall, H. (2006). Effects of language intervention on syntactic skill levels in preschoolers. Developmental Psychology 42, 164–74.Google Scholar

Vihman, M. M. & Vihman, V.-A. (2011). From first words to segments: a case study in phonological development. In Arnon, I. & Clark, E. V. (eds), Experience, variation, and generalization: learning a first language, 109–33. Amsterdam: John Benjamins.CrossRef Google Scholar

Wang, H. S. & Derwing, B. L. (1994). Some vowel schemas in three English morphological classes: experimental evidence. In Chen, M. Y. & Tang, O. C. L. (eds), In honor of Professor William S.-Y. Wang: interdisciplinary studies on language and language change, 561–75. Taipei: Pyramid Press.Google Scholar

Wang, M. & Koda, K. (2005). Commonalities and differences in word identification skills among learners of English as a second language. Language Learning 55, 71–98.Google Scholar

Weisleder, A. & Waxman, S. R. (2010). What's in the input? Frequent frames in child-directed speech offer distributional cues to grammatical categories in Spanish and English. Journal of Child Language 37(5), 1089–1108.Google Scholar

Westergaard, M. (2009). Usage-based vs. rule-based learning: the acquisition of word order in wh-questions in English and Norwegian. Journal of Child Language 36(5), 1023–74.Google Scholar

Whitehurst, G., Ironsmith, M. & Goldfein, M. (1974). Selective imitation of the passive construction through modeling. Journal of Experimental Child Psychology 17, 288–302.CrossRef Google Scholar

Wode, H. (1976). Some stages in the acquisition of questions by monolingual children. Word 27, 261–310.Google Scholar

Wonnacott, E., Newport, E. L. & Tanenhaus, M. K. (2008). Acquiring and processing verb argument structure: distributional learning in a miniature language. Cognitive Psychology 56, 165–209.Google Scholar

Xiao, R., McEnery, T. & Qian, Y. (2006). Passive constructions in English and Chinese: a corpus-based contrastive study. Languages in Contrast 6, 109–49.CrossRef Google Scholar

Yang, C. (2004). Universal Grammar, statistics or both? Trends in Cognitive Sciences 8, 451–6.Google Scholar

Year, J. & Gordon, P. (2009). Korean speakers’ acquisition of the English ditransitive construction: the role of verb prototype, input distribution, and frequency. Modern Language Journal 93, 399–417.Google Scholar

Yip, V. & Matthews, S. (2007). Relative clauses in Cantonese–English bilingual children. Studies in Second Language Acquisition 29, 277–300.Google Scholar

Article contents

The ubiquity of frequency effects in first language acquisition*

Abstract

Information

INTRODUCTION

SINGLE WORDS

INFLECTED FORMS

MULTIWORD STRINGS AND SIMPLE SYNTACTIC CONSTRUCTIONS

Multiword strings

Simple syntactic constructions

MORE ADVANCED CONSTRUCTIONS

Questions

Relative clauses

Passives

THEORETICAL IMPLICATIONS

Footnotes

References

REFERENCES

Save article to Kindle

Save article to Dropbox

Save article to Google Drive

Reply to: Submit a response

Your details

You have entered the maximum number of contributors

Conflicting interests