Hostname: page-component-5f7774ffb-625c7 Total loading time: 0 Render date: 2026-02-20T09:55:55.372Z Has data issue: false hasContentIssue false

A cross-language perspective on speech information rate

Published online by Cambridge University Press:  19 February 2026

François Pellegrino*
Affiliation:
Laboratoire Dynamique du Langage, Université de Lyon and CNRS
Christophe Coupe*
Affiliation:
Laboratoire Dynamique du Langage, Université de Lyon and CNRS
Egidio Marsico*
Affiliation:
Laboratoire Dynamique du Langage, Université de Lyon and CNRS
*
DDL - ISH 14 Avenue Berthelot 69363 Lyon Cedex 7 France, [Francois.Pellegrino@univ-lyon2.fr]
DDL - ISH 14 Avenue Berthelot 69363 Lyon Cedex 7 France, [Francois.Pellegrino@univ-lyon2.fr]
DDL - ISH 14 Avenue Berthelot 69363 Lyon Cedex 7 France, [Francois.Pellegrino@univ-lyon2.fr]

Abstract

This article is a crosslinguistic investigation of the hypothesis that the average information rate conveyed during speech communication results from a trade-off between average information density and speech rate. The study, based on seven languages, shows a negative correlation between density and rate, indicating the existence of several encoding strategies. However, these strategies do not necessarily lead to a constant information rate. These results are further investigated in relation to the notion of syllabic complexity.

Information

Type
Research Article
Copyright
Copyright © 2011 Linguistic Society of America

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

Article purchase

Temporarily unavailable

References

Ahissar, Ehud, Nagarajan, Srikantan, Ahissar, Merav, Protopapas, Athanassios, Mahncke, Henry; and Merzenich, Michael M.. 2001. Speech comprehension is correlated with temporal response patterns recorded from auditory cortex. Proceedings of the National Academy of Sciences 96.23.13367–72.Google Scholar
Aylett, Matthew P., and Turk, Alice. 2004. The smooth signal redundancy hypothesis: A functional explanation for relationships between redundancy, prosodic prominence, and duration in spontaneous speech. Language and Speech 47.1.3156.CrossRefGoogle ScholarPubMed
Aylett, Matthew P., and Turk, Alice. 2006. Language redundancy predicts syllabic duration and the spectral characteristics of vocalic syllable nuclei. Journal of the Acoustical Society of America 119. 3048–58.CrossRefGoogle ScholarPubMed
Baayen, R. Harald, Davidson, Douglas J.; and Bates, D. M.. 2008. Mixed-effects modeling with crossed random effects for subjects and items. Journal of Memory and Language 59. 390412.CrossRefGoogle Scholar
Baayen, R. Harald, Piepenbrock, Richard; and Rijn, H. van. 1993. The CELEX lexical database. Philadelphia: Linguistic Data Consortium, University of Pennsylvania.Google Scholar
Baddeley, Alan D. 2000. The episodic buffer: A new component of working memory? Trends in Cognitive Science 4. 417–23.CrossRefGoogle ScholarPubMed
Baddeley, Alan D. 2003. Working memory: Looking back and looking forward. Nature Reviews: Neuroscience 4. 829–39.CrossRefGoogle ScholarPubMed
Baddeley, Alan D., and Hitch, Graham J.. 1974. Working memory. The psychology of learning and motivation: Advances in research and theory, vol. 8, ed. by Bower, Gordon H., 4789. New York: Academic Press.Google Scholar
Bavelier, Daphne, Newport, Elissa L., Hall, Matthew L., Supalla, Ted; and Boutla, Mrim. 2006. Persistent difference in short-term memory span between sign and speech. Psychological Science 17.12.1090–92.CrossRefGoogle ScholarPubMed
Bell, Alan, Brenier, Jason M., Gregory, Michelle, Girand, Cynthia; and Jurafsky, Dan. 2009. Predictability effects on durations of content and function words in conversational English. Journal of Memory and Language 60.1.92111.CrossRefGoogle Scholar
Boutla, Mrim, Supalla, Ted, Newport, Elissa L.; and Bavelier, Daphne. 2004. Short-term memory span: Insights from sign language. Nature Neuroscience 7.9.9971002.CrossRefGoogle ScholarPubMed
Brown, Bruce L., Giles, H.; and Thakerar, J. N.. 1985. Speaker evaluation as a function of speech rate, accent and context. Language and Communication 5.3.207–20.CrossRefGoogle Scholar
Bybee, Joan L. 2006. Frequency of use and the organization of language. New York: Oxford University Press.Google Scholar
Campione, Estelle, and Véronis, Jean. 1998. A multilingual prosodic database. Paper presented at the 5th International Conference on Spoken Language Processing, Sydney, Australia.CrossRefGoogle Scholar
Cherry, E. Colin, Halle, Morris; and Jakobson, Roman. 1953. Toward the logical description of languages in their phonemic aspect. Language 29.1.3446.CrossRefGoogle Scholar
Cholin, Joana, Levelt, Willem J. M.; and Schiller, Niels O.. 2006. Effects of syllable frequency in speech production. Cognition 99. 205–35.CrossRefGoogle ScholarPubMed
Crystal, David. 1987. The Cambridge encyclopedia of language. Cambridge: Cambridge University Press.Google Scholar
Cysouw, Michael. 2005. Quantitative method in typology. Quantitative linguistics: An international handbook, ed. by Altmann, Gabriel, Köhler, Reinhard, and Piotrowski, Rajmund G., 554–78. Berlin: Mouton de Gruyter.Google Scholar
Dahl, Östen. 2004. The growth and maintenance of linguistic complexity. Amsterdam: John Benjamins.CrossRefGoogle Scholar
Evans, Nicholas, and Levinson, Stephen C.. 2009. The myth of language universals: Language diversity and its importance for cognitive science. Behavioral and Brain Sciences 32. 429–48.CrossRefGoogle ScholarPubMed
Fenk-Oczlon, Gertraud, and Fenk, August. 1999. Cognition, quantitative linguistics, and systemic typology. Linguistic Typology 3.2.151–77.CrossRefGoogle Scholar
Fenk-Oczlon, Gertraud, and Fenk, August. 2005. Crosslinguistic correlations between size of syllables, number of cases, and adposition order. Sprache und Natürlichkeit: Gedenkband für Willi Mayerthaler, ed. by Fenk-Oczlon, Gertraud and Winkler, Christian, 7586. Tübingen: Gunther Narr.Google Scholar
Ferguson, Charles A. 1978. Historical backgrounds of universals research. Universals of human language, vol. 1: Method and theory, ed. by Greenberg, Joseph H., 733. Stanford, CA: Stanford University Press.Google Scholar
Ferrer i Cancho, Ramon. 2005. Decoding least effort and scaling in signal frequency distributions. Physica A: Statistical Mechanics and its Applications 345. 275–84.CrossRefGoogle Scholar
Ferrer i Cancho, Ramon, and Solé., Richard V. 2003. Least effort and the origins of scaling in human language. Proceedings of the National Academy of Sciences 100.3.788–91.Google ScholarPubMed
Frank, Austin F., and Jaeger, T. Florian. 2008. Speaking rationally: Uniform information density as an optimal strategy for language production. Proceedings of the 30th annual conference of the Cognitive Science Society (CogSci08), 939–44. Austin, TX: Cognitive Science Society.Google Scholar
Genzel, Dmitriy, and Charniak, Eugene. 2003. Variation of entropy and parse trees of sentences as a function of the sentence number. Proceedings of the 2003 conference on Empirical Methods in Natural Language Processing, ed. by Collins, Michael and Steedman, Mark, 6572. East Stroudsburg, PA: Association for Computational Linguistics. Online: http://acl.ldc.upenn.edu/acl2003/emnlp/index.htm.CrossRefGoogle Scholar
Goldsmith, John A. 2000. On information theory, entropy, and phonology in the 20th century. Folia Linguistica 34. 12.85–100.CrossRefGoogle Scholar
Goldsmith, John A. 2002. Probabilistic models of grammar: Phonology as information minimization. Phonological Studies 5. 2146.Google Scholar
Greenberg, Joseph H. 1969. Language universals: A research frontier. Science 166. 473–78.CrossRefGoogle ScholarPubMed
Greenberg, Steven. 1999. Speaking in a shorthand—A syllable-centric perspective for understanding pronunciation variation. Speech Communication 29. 159–76.CrossRefGoogle Scholar
Greenberg, Steven, and Fosler-Lussier, Eric. 2000. The uninvited guest: Information's role in guiding the production of spontaneous speech. Proceedings of the CREST Workshop on Models of Speech Production: Motor Planning and Articulatory Modeling, Kloster Seeon, Germany.Google Scholar
Harris, John. 2005. Vowel reduction as information loss. Headhood, elements, specification and contrastivity, ed. by Carr, Philip, Durand, Jacques, and Ewen, Colin J., 119–32. Amsterdam: John Benjamins.Google Scholar
Hawkins, John A. 2004. Efficiency and complexity in grammars. Oxford: Oxford University Press.CrossRefGoogle Scholar
Hawkins, John A. 2009. An efficiency theory of complexity and related phenomena. In Sampson et al., 252–68.Google Scholar
Hay, Jennifer, Pierrehumbert, Janet; and Beckman, Mary E.. 2001. Speech perception, well-formedness, and the statistics of the lexicon. Phonetic interpretation: Papers in laboratory phonology 6, ed. by Local, John, Ogden, Richard, and Temple, Rosalind, 5874. Cambridge: Cambridge University Press.Google Scholar
Hockett, Charles F. 1953. Review of Shannon & Weaver 1949. Language 29.1.6993.CrossRefGoogle Scholar
Hockett, Charles F. 1966. The problem of universals in language. Universals of language, 2nd edn., ed. by Greenberg, Joseph H., 129. Cambridge, MA: MIT Press.Google Scholar
Hume, Elizabeth. 2006. Language specific and universal markedness: An information-theoretic approach. Paper presented at the annual meeting of the Linguistic Society of America, Colloquium on Information Theory and Phonology, Albuquerque, NM.Google Scholar
Jacewicz, Ewa, Fox, Robert A., O'Neill, Caitlin and Salmons, Joseph. 2009. Articulation rate across dialect, age, and gender. Language Variation and Change 21.2.233–56.CrossRefGoogle ScholarPubMed
Jakobson, Roman, and Halle, Morris. 1956. Fundamentals of language. The Hague: Mouton.Google Scholar
Johns-Lewis, Catherine. 1986. Prosodic differentiation of discourse modes. Intonation in discourse, ed. by Johns-Lewis, Catherine, 199220. San Diego: College-Hill Press.Google Scholar
Johnson, Keith. 2004. Massive reduction in conversational American English. Spontaneous speech: Data and analysis (Proceedings of the 1st Session of the 10th International Symposium), ed. by Yoneyama, Kiyoko and Maekawa, K., 2954. Tokyo: The National International Institute for Japanese Language.Google Scholar
Joos, Martin. 1936. Review of Zipf 1935. Language 12.3.196210.CrossRefGoogle Scholar
Juola, Patrick. 1998. Measuring linguistic complexity: The morphological tier. Journal of Quantitative Linguistics 5.3.206–13.CrossRefGoogle Scholar
Juola, Patrick. 2008. Assessing linguistic complexity. Language complexity: Typology, contact, change, ed. by Miestamo, Matti, Sinnemäki, Kaius, and Karlsson, Fred, 89108. Amsterdam: John Benjamins.CrossRefGoogle Scholar
Karlgren, Hans. 1961. Speech rate and information theory. Proceedings of the 4th International Congress of Phonetic Sciences (ICPhS), Helsinki, 671–77.Google Scholar
Keller, Franck. 2004. The entropy rate principle as a predictor of processing effort: An evaluation against eye-tracking data. Proceedings of the Conference on Empirical Methods in Natural Language Processing, Barcelona, 317–24.Google Scholar
King, Robert D. 1967. Functional load and sound change. Language 43.4.831–52.CrossRefGoogle Scholar
Komatsu, Masahiko, Arai, Takayuki; and Sugarawa, Tsutomu. 2004. Perceptual discrimination of prosodic types. Paper presented at Speech Prosody 2004, Nara, Japan.CrossRefGoogle Scholar
Kowal, Sabine, Wiese, Richard; and O'Connell, Daniel C.. 1983. The use of time in storytelling. Language and Speech 26.4.377–92.CrossRefGoogle Scholar
Kuperman, Victor, Ernestus, Mirjam; and Baayen, R. Harald. 2008. Frequency distributions of uniphones, diphones, and triphones in spontaneous speech. Journal of the Acoustical Society of America 124.6.3897–908.CrossRefGoogle ScholarPubMed
Kuperman, Victor, Pluymaekers, Mark, Ernestus, Mirjam; and Baayen, R. Harald. 2007. Morphological predictability and acoustic duration of interfixes in Dutch compounds. Journal of the Acoustical Society of America 121.4.2261–71.CrossRefGoogle ScholarPubMed
Ladefoged, Peter. 1975. A course in phonetics. Chicago: University of Chicago Press.Google Scholar
Ladefoged, Peter. 2007. Articulatory features for describing lexical distinctions. Language 83.1.161–80.CrossRefGoogle Scholar
Levelt, Willem J. M. 2001. Spoken word production: A theory of lexical access. Proceedings of the National Academy of Sciences 98.23.13464–71.CrossRefGoogle ScholarPubMed
Levy, Roger, and Jaeger, T. Florian. 2007. Speakers optimize information density through syntactic reduction. Advances in neural information processing systems 19, ed. by Schölkopf, Bernhard, Platt, John, and Hofmann, Thomas, 849–56. Cambridge, MA: MIT Press.Google Scholar
Lindblom, Björn. 1990. Explaining phonetic variation: A sketch of the H&H theory. Speech production and speech modelling, ed. by Hardcastle, William J. and Marchal, Alain, 403–39. Dordrecht: Kluwer.Google Scholar
Locke, John L. 2008. Cost and complexity: Selection for speech and language. Journal of Theoretical Biology 251.4.640–52.CrossRefGoogle ScholarPubMed
Maddieson, Ian. 1984. Patterns of sounds. Cambridge: Cambridge University Press.CrossRefGoogle Scholar
Maddieson, Ian. 2006. Correlating phonological complexity: Data and validation. Linguistic Typology 10.1.106–23.Google Scholar
Maddieson, Ian. 2009. Calculating phonological complexity. Approaches to phonological complexity, ed. by Pellegrino, François, Marsico, Egidio, Chitoran, Ioana, and Coupé, Christophe, 85110. Berlin: Mouton de Gruyter.Google Scholar
Marsico, Egidio, Maddieson, Ian, Coupé, Christophe; and Pellegrino, François. 2004. Investigating the ‘hidden’ structure of phonological systems. Berkeley Linguistics Society 30. 256–67.Google Scholar
Martinet, André. 1933. Remarques sur le système phonologique du français. Bulletin de la Société de linguistique de Paris 33. 191202.Google Scholar
Martinet, André. 1955. Économie des changements phonétiques: Traité de phonologie diachronique. Berne: Francke.Google Scholar
Martinet, André. 1962. Afunctional view of language. Oxford: Clarendon.Google Scholar
Mueller, Shane T., Seymour, Travis L., Kieras, David E.; and Meyer, David E.. 2003. Theoretical implications of articulatory duration, phonological similarity and phonological complexity in verbal working memory. Journal of Experimental Psychology: Learning, Memory, and Cognition 29.6.1353–80.Google ScholarPubMed
Nettle, Daniel. 1995. Segmental inventory size, word length, and communicative efficiency. Linguistics 33. 359–67.Google Scholar
New, Boris, Pallier, Christophe, Brysbaert, Marc; and Ferrand, Ludovic. 2004. Lexique 2: A new French lexical database. Behavior Research Methods, Instruments, & Computers 36.3.516–24.CrossRefGoogle ScholarPubMed
Ohala, John J. 2008. The emergent syllable. The syllable in speech production, ed. by Davis, Barbara L. and Zadjó, Krisztina, 179–86. New York: Lawrence Erlbaum.Google Scholar
Oudeyer, Pierre-Yves. 2006. Self-organization in the evolution of speech, trans. by Hurford, James R.. (Studies in the evolution of language.) Oxford: Oxford University Press.CrossRefGoogle Scholar
Pellegrino, François, Coupé, Christophe; and Marsico, Egidio. 2007. An information theory-based approach to the balance of complexity between phonetics, phonology and morphosyntax. Paper presented at the annual meeting of the Linguistic Society of America, Anaheim, CA.Google Scholar
Peng, Gang. 2005. Temporal and tonal aspects of Chinese syllables: A corpus-based comparative study of Mandarin and Cantonese. Journal of Chinese Linguistics 34.1.134–54.Google Scholar
Pennebaker, James W., Mehl, Matthias R.; and Niederhoffer, Kate G.. 2003. Psychological aspects of natural language use: Our words, our selves. Annual Review of Psychology 54. 547–77.CrossRefGoogle ScholarPubMed
Piantadosi, Steven T., Tily, Harry J.; and Gibson, Edward. 2009. The communicative lexicon hypothesis. Proceedings of the 31st annual meeting of the Cognitive Science Society (CogSci09), 2582–87. Austin, TX: Cognitive Science Society.Google Scholar
Pitt, Mark A., Johnson, Keith, Hume, Elizabeth, Kiesling, Scott; and Raymond, William. 2005. The Buckeye corpus of conversational speech: Labeling conventions and a test of transcriber reliability. Speech Communication 45. 9095.CrossRefGoogle Scholar
Plank, Franz. 1998. The co-variation of phonology with morphology and syntax: A hopeful history. Linguistic Typology 2.2.195230.CrossRefGoogle Scholar
Plotkin, Joshua B., and Nowak, Martin A.. 2000. Language evolution and information theory. Journal of Theoretical Biology 205.1.147–60.CrossRefGoogle ScholarPubMed
Pone, Massimiliano. 2005. Studio e realizzazione di una tastiera software pseudo-sillabica per utenti disabili. Genova: Università degli Studi di Genova thesis.Google Scholar
Port, Robert F., and Leary, Adam P.. 2005. Against formal phonology. Language 81.4.927–64.CrossRefGoogle Scholar
R Development Core Team. 2010. R: A language and environment for statistical computing. Vienna: R Foundation for Statistical Computing. Online: http://www.R-project.org.Google Scholar
Roach, Peter. 1998. Some languages are spoken more quickly than others. Language myths, ed. by Bauer, Laurie and Trudgill, Peter, 150–58. London: Penguin.Google Scholar
Sampson, Geoffrey, Gil, David; and Trudgill, Peter (eds.) 2009. Language complexity as an evolving variable. (Studies in the evolution of language 13.) Oxford: Oxford University Press.CrossRefGoogle Scholar
Schiller, Niels O. 2008. Syllables in psycholinguistic theory: Now you see them, now you don't. The syllable in speech production, ed. by Davis, Barbara L. and Zadjó, Krisztina, 155–76. New York: Lawrence Erlbaum.Google Scholar
Schweickert, Richard, and Boruff, Brian. 1986. Short-term memory capacity: Magic number or magic spell? Journal of Experimental Psychology: Learning, Memory, and Cognition 12. 419–25.Google ScholarPubMed
Segui, Juan, and Ferrand, Ludovic. 2002. The role of the syllable in speech perception and production. Phonetics, phonology and cognition, ed. by Durand, Jacques and Laks, Bernard, 151–67. Oxford: Oxford University Press.Google Scholar
Service, Elisabet. 1998. The effect of word length on immediate serial recall depends on phonological complexity, not articulatory duration. Quarterly Journal of Experimental Psychology 51A.283304.CrossRefGoogle Scholar
Shannon, Claude E., and Weaver, Warren. 1949. The mathematical theory of communication. Urbana: University of Illinois Press.Google Scholar
Shosted, Ryan K. 2006. Correlating complexity: A typological approach. Linguistic Typology 10.1.140.CrossRefGoogle Scholar
Surendran, Dinoj, and Levow, Gina-Anne. 2004. The functional load of tone in Mandarin is as high as that of vowels. Proceedings of Speech Prosody 2004, Nara, Japan, 99102.Google Scholar
Tamaoka, Katsuo, and Makioka, Shogo. 2004. Frequency of occurrence for units of phonemes, morae, and syllables appearing in a lexical corpus of a Japanese newspaper. Behavior Research Methods, Instruments, & Computers 36.3.531–47.CrossRefGoogle Scholar
Trubetzkoy, Nicholai S. 1939. Grundzüge der Phonologie. Travaux du cercle linguistique de Prague 7. [Translated into French as Principes de phonologie, by J. Cantineau. Paris: Klincksieck, 1949.]Google Scholar
Trudgill, Peter. 2004. Linguistic and social typology: The Austronesian migrations and phoneme inventories. Linguistic Typology 8.3.305–20.CrossRefGoogle Scholar
Twaddell, W. Freeman. 1935. On defining the phoneme. Language 11.1.562.CrossRefGoogle Scholar
van Son, Rob J. J. H., and Pols, Louis C. W.. 2003. Information structure and efficiency in speech production. Proceedings of Eurospeech 2003, Geneva, Switzerland, 769–72.Google Scholar
Verhoeven, Jo, Pauw, Guy De; and Kloots, Hanne. 2004. Speech rate in a pluricentric language: A comparison between Dutch in Belgium and the Netherlands. Language and Speech 47.3.297308.CrossRefGoogle Scholar
Wells, John C. 1982. Accents of English, Vol. 1. Cambridge: Cambridge University Press.CrossRefGoogle Scholar
Zipf, George K. 1935. The psycho-biology of language: An introduction to dynamic philology. Cambridge, MA: MIT Press.Google Scholar
Zipf, George K. 1937. Statistical methods and dynamic philology. Language 13.1.6070.CrossRefGoogle Scholar