Hostname: page-component-89b8bd64d-n8gtw Total loading time: 0 Render date: 2026-05-08T01:23:45.343Z Has data issue: false hasContentIssue false

Reversal rewards drive language switching during observational learning: Evidence from a dual-brain EEG study

Published online by Cambridge University Press:  04 February 2025

Junjun Huang
Affiliation:
Research Center of Brain and Cognitive Neuroscience, Liaoning Normal University, Dalian, China Key Laboratory of Brain and Cognitive Neuroscience, Dalian, China
Mengjie Lv
Affiliation:
Research Center of Brain and Cognitive Neuroscience, Liaoning Normal University, Dalian, China Key Laboratory of Brain and Cognitive Neuroscience, Dalian, China
Yingyi Xiang
Affiliation:
Research Center of Brain and Cognitive Neuroscience, Liaoning Normal University, Dalian, China Key Laboratory of Brain and Cognitive Neuroscience, Dalian, China
Shuang Liu
Affiliation:
Research Center of Brain and Cognitive Neuroscience, Liaoning Normal University, Dalian, China Key Laboratory of Brain and Cognitive Neuroscience, Dalian, China
Yujing Shen
Affiliation:
Research Center of Brain and Cognitive Neuroscience, Liaoning Normal University, Dalian, China Key Laboratory of Brain and Cognitive Neuroscience, Dalian, China
John W. Schwieter
Affiliation:
Language Acquisition, Multilingualism, and Cognition Laboratory / Bilingualism Matters, Wilfrid Laurier University, Waterloo, Canada Department of Linguistics and Languages, McMaster University, Hamilton, Canada
Huanhuan Liu*
Affiliation:
Research Center of Brain and Cognitive Neuroscience, Liaoning Normal University, Dalian, China Key Laboratory of Brain and Cognitive Neuroscience, Dalian, China
*
Corresponding author: Huanhuan Liu; Email: abcde69503@126.com
Rights & Permissions [Opens in a new window]

Abstract

Research on the cognitive neural mechanisms of language control often overlooks the role of rewards. To investigate how reversal rewards affect bilingual language switching during observational learning, we conducted a dual-brain electroencephalography (EEG) study. Participants, classified as direct learners or observers, performed a voluntary language-switching task under dynamic reward conditions. Our results demonstrated that both direct learners and observers exhibited high correct acquisition rates for the switch and non-switch behaviors in both pre- and post-reversal phases. Notably, direct learners and observers showed reduced switch costs in the post-reversal phase, highlighting enhanced language control efficiency. EEG analyses revealed that direct learners exhibited late positive component (LPC) switch costs in both pre- and post-reversal phases, while observers showed LPC switch costs only in the post-reversal phase. These findings support the Adaptive Control Hypothesis by highlighting the adaptability of language control mechanisms in response to dynamic reward environments during direct and observational learning.

Information

Type
Research Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2025. Published by Cambridge University Press
Figure 0

Figure 1. Illustration of the experimental design.Notes. (a, b) denotes the settings for direct learners and observers regarding rewards and language-switching behaviors, respectively. (c) shows the flowchart of the experiment.

Figure 1

Figure 2. Correct acquisition rates of direct learners (a) and observers (b) in the learning phase and sequence.Notes. White circles and lines represent the mean and median of participants’ responses. Boxplots illustrate the interquartile range (25% and 75%). Colored asterisks (**) indicate significant differences (p < .05) between the two conditions.

Figure 2

Figure 3. RTs of direct learners (a) and observers (b) in the learning phase and sequence, and switching cost comparisons (c).Notes. White circles and lines represent the mean and median of participants’ responses. Boxplots illustrate the interquartile range (25% and 75%). Colored asterisks (**) indicate significant differences (p < .05) between the two conditions.

Figure 3

Figure 4 ERP waveforms, topographic maps and bar graphs of direct learners (a, c) and observers (b, d) in response to reversal rewards.Notes. Colored asterisks (**) indicate significant differences between the two conditions (p < .05). Bar graphs show the mean voltages of the P300 across sites for direct learners and observers in the first reward post−reversal and second reward post−reversal conditions. Error bars show the standard error of means. 1st = first reward post−reversal; 2nd = second reward post−reversal.

Figure 4

Figure 5. ERP waveforms, topographic maps, and bar graphs illustrate the impact of reversal rewards on language switching for direct learners (a, c) and observers (b, d).Notes. The N2 time window is defined as 250–350 ms. The LPC time window is defined as 370–600 ms. Colored asterisks (**) indicate significant differences between two conditions (p < .05). Bar graphs show mean voltages of the LPC across sites for direct learners and observers in the learning phase (post-reversal and pre-reversal) × sequence type (switch, non-switch) conditions. Error bars show the standard error of means. Post-Swi = Post-reversal Switch; Pre-Swi = Pre-reversal Switch; Post-NSwi = Post-reversal Non-switch; Pre-NSwi = Pre-reversal Non-switch.

Supplementary material: File

Huang et al. supplementary material 1

Huang et al. supplementary material
Download Huang et al. supplementary material 1(File)
File 16.8 KB
Supplementary material: File

Huang et al. supplementary material 2

Huang et al. supplementary material
Download Huang et al. supplementary material 2(File)
File 99.4 KB