Sampling children's spontaneous speech: how much is enough?

MICHAEL TOMASELLO; DANIEL STAHL

doi:10.1017/S0305000903005944

Abstract

There has been relatively little discussion in the field of child language acquisition about how best to sample from children's spontaneous speech, particularly with regard to quantitative issues. Here we provide quantitative information designed to help researchers decide how best to sample children's speech for particular research questions (and/or how confident to be in existing analyses). We report theoretical analyses in which the major parameters are: (1) the frequency with which a phenomenon occurs in the real world, and (2) the temporal density with which a researcher samples the child's speech. We look at the influence of these two parameters when spontaneous speech samples are used to estimate such things as: (a) the percentage of the real phenomenon actually captured, (b) the probability of capturing at least one target in any given sample, (c) the confidence we can have in estimating the frequency of occurrence of a target from a given sample, and (d) the estimated age of emergence of a target structure. We also report two empirical analyses of relatively infrequent child language phenomena, in which we sample in different ways from a relatively dense corpus (two children aged 2;0 to 3;0) and compare the results obtained. Implications of these results for various issues in the study of child language acquisition are discussed.
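As a rough illustration of how the two parameters interact for quantities (a) and (b), the sketch below makes assumptions that are not stated in the abstract: that targets occur independently at a constant rate (a Poisson model) and that the child is awake and talking for about 70 hours per week. The function names and the 70-hour figure are hypothetical and chosen only for this example.

```python
import math

# Assumption for illustration only: roughly 10 waking hours per day.
WAKING_HOURS_PER_WEEK = 70.0


def capture_fraction(sample_hours_per_week, waking_hours_per_week=WAKING_HOURS_PER_WEEK):
    """Share of the child's speech that the recordings cover."""
    return sample_hours_per_week / waking_hours_per_week


def expected_captured(targets_per_week, fraction):
    """Expected number of targets that fall inside the sampled time."""
    return targets_per_week * fraction


def prob_at_least_one(targets_per_week, fraction):
    """Probability that a week's sample holds at least one target.

    Assumes a Poisson process, so P(zero targets) = exp(-expected count).
    """
    return 1.0 - math.exp(-expected_captured(targets_per_week, fraction))


if __name__ == "__main__":
    # Compare a sparse and a denser sampling regime for a rare target.
    for hours in (1.0, 5.0):
        frac = capture_fraction(hours)
        p = prob_at_least_one(targets_per_week=70.0, fraction=frac)
        print(f"{hours:.0f} h/week: coverage={frac:.3f}, P(at least one)={p:.3f}")
```

Under these assumptions, a target produced 70 times a week is expected to appear only once in a one-hour weekly sample, so a noticeable share of such samples would contain no instance at all; real speech is likely burstier than a Poisson process, which would change the numbers.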

Footnotes

For their helpful comments we would like to thank the following people: Elena Lieven, Julian Pine, Gina Conti-Ramsden, Anna Theakston, Heike Behrens, and Caroline Rowland.

