Hostname: page-component-89b8bd64d-4ws75 Total loading time: 0 Render date: 2026-05-09T11:34:22.590Z Has data issue: false hasContentIssue false

The flexibility and representational nature of phonological prediction in listening comprehension: evidence from the visual world paradigm

Published online by Cambridge University Press:  02 October 2023

Zitong Zhao
Affiliation:
CAS Key Laboratory of Behavioral Science, Institute of Psychology, Chinese Academy of Sciences, Beijing, China Department of Psychology, University of Chinese Academy of Sciences, Beijing, China
Jinfeng Ding
Affiliation:
CAS Key Laboratory of Behavioral Science, Institute of Psychology, Chinese Academy of Sciences, Beijing, China Department of Psychology, University of Chinese Academy of Sciences, Beijing, China
Jiayu Wang
Affiliation:
CAS Key Laboratory of Behavioral Science, Institute of Psychology, Chinese Academy of Sciences, Beijing, China Department of Psychology, University of Chinese Academy of Sciences, Beijing, China
Yiya Chen*
Affiliation:
Leiden University Centre for Linguistics, Leiden, Netherlands Leiden Institute for Brain and Cognition, Leiden, Netherlands
Xiaoqing Li*
Affiliation:
CAS Key Laboratory of Behavioral Science, Institute of Psychology, Chinese Academy of Sciences, Beijing, China Department of Psychology, University of Chinese Academy of Sciences, Beijing, China Jiangsu Collaborative Innovation Center for Language Ability, Jiangsu Normal University, Xuzhou, China
*
Corresponding authors: Yiya Chen, Xiaoqing Li; Emails: yiya.chen@hum.leidenuniv.nl; lixq@psych.ac.cn
Corresponding authors: Yiya Chen, Xiaoqing Li; Emails: yiya.chen@hum.leidenuniv.nl; lixq@psych.ac.cn
Rights & Permissions [Opens in a new window]

Abstract

Using the visual world paradigm with printed words, this study investigated the flexibility and representational nature of phonological prediction in real-time speech processing. Native speakers of Mandarin Chinese listened to spoken sentences containing highly predictable target words and viewed a visual array with a critical word and a distractor word on the screen. The critical word was manipulated in four ways: a highly predictable target word, a homophone competitor, a tonal competitor, or an unrelated word. Participants showed a preference for fixating on the homophone competitors before hearing the highly predictable target word. The predicted phonological information waned shortly but was re-activated later around the acoustic onset of the target word. Importantly, this homophone bias was observed only when participants were completing a ‘pronunciation judgement’ task, but not when they were completing a ‘word judgement’ task. No effect was found for the tonal competitors. The task modulation effect, combined with the temporal pattern of phonological pre-activation, indicates that phonological prediction can be flexibly generated by top-down mechanisms. The lack of tonal competitor effect suggests that phonological features such as lexical tone are not independently predicted for anticipatory speech processing.

Information

Type
Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2023. Published by Cambridge University Press
Figure 0

Table 1. Visual arrays of an example sentence

Figure 1

Table 2. Characteristics of the critical words and distractor words in Experiment 1 (mean (SD))

Figure 2

Figure 1. Non-linear smooths for the unrelated word (red), target word (green), homophone competitor (blue), and tonal competitor (purple) from −1,300 ms before the spoken target word to 1,000 ms after it in Experiment 1a. The shaded area shows 95% confidence intervals. The raw fixation proportions within the window latency from −1,600 ms to 1,000 ms can be found in the shared dataset links, which are the same for Experiment 1b and Experiment 2.

Figure 3

Figure 2. Log-ratio difference plot for target versus unrelated (top) and homophone versus unrelated (bottom) from −1,300 ms before the spoken target word to 1,000 ms after it in Experiment 1a. The shaded area shows 95% confidence intervals. The red lines at the bottom indicate time bins in which the difference between conditions was significant.

Figure 4

Figure 3. Non-linear smooths for the unrelated word (red), target word (green), homophone competitor (blue), and tonal competitor (purple) from −1,300 ms before the spoken target word to 1,000 ms after it in Experiment 1b. The shaded area shows 95% confidence intervals.

Figure 5

Figure 4. Log-ratio difference plot for target versus unrelated (top) and homophone versus unrelated (bottom) from −1,300 ms before the spoken target word to 1,000 ms after it in Experiment 1b. The shaded area shows 95% confidence intervals. The red lines at the bottom indicate time bins in which the difference between the conditions was significant.

Figure 6

Table 3. Characteristics of the critical words and distractor words in Experiment 2 (mean (SD))

Figure 7

Figure 5. Non-linear smooths for the unrelated word (red) and homophone competitor (blue) from −1,300 ms before the spoken target word to 1,000 ms after it in the ‘word judgement’ (top) and ‘pronunciation judgement’ (bottom) tasks of Experiment 2. The shaded area shows 95% confidence intervals.

Figure 8

Figure 6. Log-ratio difference plot for homophone versus unrelated of model GAMM_main from −1,300 ms before the spoken target word to 1,000 ms after it in the ‘word judgement’ (top) and ‘pronunciation judgement’ (bottom) tasks of Experiment 2. The shaded area shows 95% confidence intervals. The red lines at the bottom indicate time bins in which the difference between the conditions was significant.

Supplementary material: File

Zhao et al. supplementary material

Zhao et al. supplementary material

Download Zhao et al. supplementary material(File)
File 1.8 MB