Hostname: page-component-89b8bd64d-r6c6k Total loading time: 0 Render date: 2026-05-11T19:56:40.010Z Has data issue: false hasContentIssue false

The effectiveness of automatic speech recognition in ESL/EFL pronunciation: A meta-analysis

Published online by Cambridge University Press:  13 April 2023

Thuy Thi-Nhu Ngo
Affiliation:
National Taiwan Normal University, Taiwan (ntngo81@gmail.com)
Howard Hao-Jan Chen*
Affiliation:
National Taiwan Normal University, Taiwan (hjchen@ntnu.edu.tw)
Kyle Kuo-Wei Lai
Affiliation:
National Taiwan Normal University, Taiwan (laisanity8@gmail.com)
*
*All correspondence regarding this publication should be addressed to Howard Hao-Jan Chen (Email: hjchen@ntnu.edu.tw)
Rights & Permissions [Opens in a new window]

Abstract

This meta-analytic study explores the overall effectiveness of automatic speech recognition (ASR) on ESL/EFL student pronunciation performance. Data with 15 studies representing 38 effect sizes found from 2008 to 2021 were meta-analyzed. The findings of the meta-analysis indicated that ASR has a medium overall effect size (g = 0.69). Results from moderator analyses suggest that (1) ASR with explicit corrective feedback is largely effective, while ASR with indirect feedback (e.g. ASR dictation) is moderately effective; (2) ASR has a large effect on segmental pronunciation but a small effect on suprasegmental pronunciation; (3) medium to long treatment duration of ASR results in higher learning outcomes, but short duration offers no differential effect compared to a non-ASR condition; (4) practicing pronunciation with peers in an ASR condition produces a large effect, but the effect is small when practicing alone; (5) ASR is largely effective for adult (i.e. 18 years old and above) and intermediate English learners. Overall, ASR is a beneficial application and is recommended for assisting L2 student pronunciation development.

Information

Type
Research Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2023. Published by Cambridge University Press on behalf of EUROCALL, the European Association for Computer-Assisted Language Learning
Figure 0

Table 1. Overall effect size and the heterogeneity test

Figure 1

Table 2. Moderator analyses in treatment data

Figure 2

Table 3. Moderator analyses in population data

Figure 3

Table 4. Moderator analyses in publication data

Supplementary material: File

Thi-Nhu Ngo et al. supplementary material

Thi-Nhu Ngo et al. supplementary material

Download Thi-Nhu Ngo et al. supplementary material(File)
File 25.6 KB