Hostname: page-component-89b8bd64d-4ws75 Total loading time: 0 Render date: 2026-05-10T17:39:38.593Z Has data issue: false hasContentIssue false

Voice processing ability predicts second-language phoneme learning in early bilingual adults

Published online by Cambridge University Press:  17 January 2025

Gaël Cordero*
Affiliation:
Department of Psychology, Faculty of Medicine and Health Sciences, Universitat Internacional de Catalunya, Barcelona, Spain
Jazmin R. Paredes-Paredes
Affiliation:
Department of Psychology, Faculty of Medicine and Health Sciences, Universitat Internacional de Catalunya, Barcelona, Spain
Manuel Perea
Affiliation:
Department of Methodology and ERI-Lectura, Universitat de València, Valencia, Spain Nebrija Research Center in Cognition, Universidad Antonio de Nebrija, Madrid, Spain
Nuria Sebastian-Galles
Affiliation:
Department of Information and Communication Technologies, Center for Brain and Cognition, Universitat Pompeu Fabra, Barcelona, Spain
Begoña Díaz
Affiliation:
Department of Psychology, Faculty of Medicine and Health Sciences, Universitat Internacional de Catalunya, Barcelona, Spain
*
Corresponding author: Gaël Cordero; Email: gcordero@uic.es
Rights & Permissions [Opens in a new window]

Abstract

Individuals differ greatly in their ability to learn the sounds of second languages, even when learning starts early in life. Recent research has suggested that the ability to identify the idiosyncratic acoustic variations introduced into the speech stream by the speaker might be relevant for second-language (L2) phoneme learning. However, only a positive correlation between voice recognition and phoneme learning has been shown. In the present study, we investigated whether voice processing ability predicts L2 phoneme learning. We employed a battery of behavioral cognitive ability measures to assess voice processing ability and L2 phoneme learning in 57 early bilingual adults. Confirmatory factor analyses (CFAs) and structural equation modeling (SEM) revealed that voice processing ability predicts L2 phoneme learning. Our findings align with theories of speech perception that attribute a fundamental role to the analysis of voice cues and suggest that the accurate identification of speaker-specific variation is also relevant for phoneme learning.

Information

Type
Research Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2025. Published by Cambridge University Press
Figure 0

Figure 1. Accuracy scores of the indicators of voice processing ability (Spanish voice recognition, Chinese voice recognition, and voice discrimination) and L2 phoneme learning (categorization and lexical decision). Note that different accuracy transformed scores are depicted and direct visual comparison between the tasks is discouraged.

Figure 1

Figure 2. RT for the indicators for voice processing ability (Spanish voice recognition, Chinese voice recognition, and voice discrimination) and L2 phoneme learning (categorization and lexical decision).

Figure 2

Table 1. Descriptive statistics of the accuracy scores and reaction times of the indicators

Figure 3

Table 2. Covariance matrix of the accuracy score data

Figure 4

Table 3. Covariance matrix of the reaction time data (all indicators presented in ms).

Figure 5

Figure 3. Accuracy (3A) and RT (3B) CFAs with two correlated latent variables. Paths connecting the latent variables (circles) are the correlations between these constructs. The values between the latent variables and the manifest variables (squares) represent the standardized loadings of each task onto the latent variable. All loadings were significant at the p < .001 except for voice discrimination (VDT) to voice processing ability (p = .191) in the accuracy CFA. **p < .01, ***p < .001.

Figure 6

Table 4. Goodness-of-fit indices’ results of the CFAs

Figure 7

Figure 4. Accuracy (4A) and RT (4B) CFAs with a single latent variable. We present 4B for informational purposes only, since the RT data are misrepresented by this model (see Table 4). The values between the latent variable (circle) and the manifest variables (squares) represent the standardized loadings of each task onto the latent variable. All loadings were significant except for voice discrimination (VDT) in the accuracy CFA (p = .136). ** p < .01, *** p < .001.

Figure 8

Figure 5. Accuracy (5A) and RT (5B) SEMs showing the effect from the latent variable voice processing ability over L2 phoneme learning. The values between the latent variables (circles) and their respective manifest variables (squares) represent the standardized loadings of each task onto the corresponding latent variable. All loadings were significant, except for VDT to voice processing ability in 5A, which approached significance (p = .066). A dashed line represents non-significant results. * p < .05, ***p < .001.

Figure 9

Table 5. Goodness-of-fit indices’ results of the accuracy and RT SEMs

Figure 10

Table A1. Descriptive statistics of the proportion of accurate responses delivered to all experimental tasks

Figure 11

Table A2. Proportion of hits and false alarms for the VDT and LDT

Figure 12

Table A3. Descriptive statistics and between-group comparisons as a function of sex for the indicators for voice processing ability and L2 phoneme learning

Supplementary material: File

Cordero et al. supplementary material

Cordero et al. supplementary material
Download Cordero et al. supplementary material(File)
File 30.9 KB