Hostname: page-component-89b8bd64d-x2lbr Total loading time: 0 Render date: 2026-05-09T04:51:37.214Z Has data issue: false hasContentIssue false

Universality and language-specific experience in the perception of lexical tone and pitch

Published online by Cambridge University Press:  21 November 2014

DENIS BURNHAM*
Affiliation:
University of Western Sydney
BENJAWAN KASISOPA
Affiliation:
University of Western Sydney
AMANDA REID
Affiliation:
University of Western Sydney
SUDAPORN LUKSANEEYANAWIN
Affiliation:
Chulalongkorn University
FRANCISCO LACERDA
Affiliation:
Stockholm University
VIRGINIA ATTINA
Affiliation:
University of Western Sydney
NAN XU RATTANASONE
Affiliation:
Macquarie University
IRIS-CORINNA SCHWARZ
Affiliation:
Stockholm University
DIANE WEBSTER
Affiliation:
University of Western Sydney
*
ADDRESS FOR CORRESPONDENCE Denis Burnham, MARCS Institute, University of Western Sydney, Bankstown Campus Locked Bag 1797, Penrith New South Wales 2751, Australia. E-mail: denis.burnham@uws.edu.au
Rights & Permissions [Opens in a new window]

Abstract

Two experiments focus on Thai tone perception by native speakers of tone languages (Thai, Cantonese, and Mandarin), a pitch–accent (Swedish), and a nontonal (English) language. In Experiment 1, there was better auditory-only and auditory–visual discrimination by tone and pitch–accent language speakers than by nontone language speakers. Conversely and counterintuitively, there was better visual-only discrimination by nontone language speakers than tone and pitch–accent language speakers. Nevertheless, visual augmentation of auditory tone perception in noise was evident for all five language groups. In Experiment 2, involving discrimination in three fundamental frequency equivalent auditory contexts, tone and pitch–accent language participants showed equivalent discrimination for normal Thai speech, filtered speech, and violin sounds. In contrast, nontone language listeners had significantly better discrimination for violin sounds than filtered speech and in turn speech. Together the results show that tone perception is determined by both auditory and visual information, by acoustic and linguistic contexts, and by universal and experiential factors.

Information

Type
Articles
Creative Commons
Creative Common License - CCCreative Common License - BYCreative Common License - NCCreative Common License - SA
The online version of this article is published within an Open Access environment subject to the conditions of the Creative Commons Attribution licence http://creativecommons.org/licenses/by/3.0/.
Copyright
Copyright © Cambridge University Press 2014
Figure 0

Figure 1. (a) Fundamental frequency (F0) distribution of Thai tones, based on five Thai female productions of “ma” (described by Chao values as follows: Mid-33, Low-21, Falling-241, High-45, and Rising-315). (b) F0 of Mandarin tones, based on four Mandarin female productions of “ma” (described by Chao values as follows: High-55, Rising-35, Dipping-214, and Falling-51). (c) F0 distribution of Cantonese tones, based on two Cantonese female productions of “si” (described by Chao values as follows: High-55, Rising-25, Mid-33, Falling-21, Low-Rising-23, and Low-22). (d) F0 distribution of Swedish pitch accents (across two syllables) based on three Swedish female productions for two-syllable words. Pitch Accent 1 shows the single falling F0 pattern and Pitch Accent 2 shows the double peak in F0.

Figure 1

Figure 2. Mean d′ scores (bars are standard errors) for each language group, shown separately for auditory–visual/auditory-only (AV/AO) (a) clear and (b) noise and visual-only (VO) (c) clear and (d) noise, averaged over individual tone contrasts. Note the different scales used for AV/AO and VO figures, and standard errors are comparable across conditions.

Figure 2

Table 1. Discriminability of each tone pair for auditory only (AO) scores in clear, visual only (VO) in clear, and auditory–visual (AV-AO) augmentation in noise

Figure 3

Figure 3. The F (1, 175) values for single factor language ANOVAs conducted for all 10 tone contrasts in auditory only (AO) in clear, visual only (VO) in clear, and auditory–visual/AO (AV-AO) augmentation in noise. Blank cells indicate p > .05, no shading indicates p < .05, light shading indicates p < .01, and dark shading indicates p < .001. T, Thai; M, Mandarin; C, Cantonese; E, English; S, Swedish.

Figure 4

Figure 4. Mean d′ scores (by tone pair) for each language group, shown separately for auditory-only (AO) noise, auditory–visual (AV) noise, visual-only (VO) noise, and AV clear. Note the different scales used for AV/AO and VO figures.

Figure 5

Figure 5. Fundamental frequency distribution of speech, filtered speech, and violin stimuli on each Thai tone, shown with normalized pitch.

Figure 6

Figure 6. Thai, Cantonese, Swedish, and English speakers’ mean d′ scores for speech, filtered speech, and violin tone stimuli (bars indicate standard errors).