Hostname: page-component-89b8bd64d-z2ts4 Total loading time: 0 Render date: 2026-05-08T16:58:39.879Z Has data issue: false hasContentIssue false

Multimodal training on L2 Japanese pitch accent: learning outcomes, neural correlates and subjective assessments

Published online by Cambridge University Press:  18 September 2024

Yukari Hirata*
Affiliation:
Center for Language and Brain, Colgate University, Hamilton, NY, USA
Erica Friedman
Affiliation:
Department of Speech, Language & Hearing Sciences, Boston University, Boston, MA, USA
Caroline Kaicher
Affiliation:
Department of Psychology, Stanford University, Stanford, CA, USA
Spencer D. Kelly
Affiliation:
Center for Language and Brain, Colgate University, Hamilton, NY, USA
*
Corresponding author: Yukari Hirata; Email: yhirata@colgate.edu
Rights & Permissions [Opens in a new window]

Abstract

Japanese pitch accent is phonemic, making it crucial for second-language learners to acquire. Building on theories of multimodal learning, the present study explored how auditory, visual and gestural training of Japanese pitch accent affected behavioral, neural and meta-cognitive aspects of pitch perception across two experiments. Experiment 1 used a between-subjects pre/posttest design to train native English speakers to perceive Japanese pitch accents in one of the following three conditions: (1) baseline (audio + flat notation), (2) pitch height notation (audio + notation mimicking pitch height) and (3) pitch height notation + a left-hand gesture (L-gesture) (to engage the contralateral right hemisphere specialized for suprasegmental pitch processing). Our results indicated that (2) pitch height notation training was most robust in its benefits, as participants in this condition improved on trained and novel words alike. Experiment 2 used a within-subjects design to extend Experiment 1 in three ways: adding a right-hand gesture (R-gesture) condition (to engage more segmental language areas in the left hemisphere), introducing a neural correlate of cognitive load (measured by EEG alpha and theta power) and performing a metacognitive subjective assessment of learning (e.g., ‘Which training did you find the most helpful?’). The results showed that although there were no differences among our four training conditions on learning outcomes or EEG power, participants made the most positive subjective evaluations about pitch height notation and R-gesture training. Together, the results suggest that there may be a ‘just right’ amount of multimodal instruction to boost learning and increase engagement during foreign language pitch instruction.

Information

Type
Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2024. Published by Cambridge University Press
Figure 0

Figure 1. Pretest stimuli. This visual slide (stimulus number 35) is an example of the pretest stimuli presented along with the audio of the whole sentence ‘mazu nokogiri janai’. The box shows the target word. The four pitch patterns (a)–(d) written in red at the bottom are the response alternatives for participants to choose from for the target word they had heard. The Ls and Hs represent lows and highs of pitch accent, respectively. Note that the Ls and Hs were used in the baseline flat notation training, and the spatial arrangement of those Ls and Hs captures the visual–spatial representation used in notation training and L-gesture training.

Figure 1

Figure 2. Training stimuli for each condition. (1) First slide for all participants, (2a) baseline flat notation training, (2b) notation training and (2c) L-gesture training. (The hand images were not displayed to the participant – they are used here to demonstrate the contour of the gesture produced by participants.)

Figure 2

Figure 3. Proportion correct test scores of the three groups in Experiment 1. Only the notation group improved for both trained and untrained items.

Figure 3

Table 1. Pitch identification accuracy for words corresponding to the four training conditions in Experiment 2

Figure 4

Table 2. Means and standard deviations of participants’ responses to the subjective assessment survey in Experiment 2