Hostname: page-component-cb9f654ff-qc88w Total loading time: 0 Render date: 2025-08-10T04:02:41.276Z Has data issue: false hasContentIssue false

Automatic speech recognition and pronunciation learning

Published online by Cambridge University Press:  06 August 2025

Shannon McCrocklin
Affiliation:
Linguistics, Southern Illinois University, Carbondale, IL, USA
John Levis*
Affiliation:
English, Iowa State University, Ames, IA, USA
*
Corresponding author: John Levis; Email: jlevis@iastate.edu

Abstract

Image of the first page of this content. For PDF version, please use the ‘Save PDF’ preceeding this image.'

Information

Type
Research Timeline
Copyright
© The Author(s), 2025. Published by Cambridge University Press.

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

Article purchase

Temporarily unavailable

References

Çalık, Ş. S., Küçükmanisa, A., & Kilimci, Z. H. (2024). A novel framework for mispronunciation detection of Arabic phonemes using audio-oriented transformer models. Applied Acoustics, 215, 109711. https://doi.org/10.1016/j.apacoust.2023.109711CrossRefGoogle Scholar
Cho, K. (2022). Deep learning. In Mitkov, R. (Ed.), The Oxford handbook of computational linguistics (pp. 359414). Oxford University Press.Google Scholar
Davis, K. H., Biddulph, R., & Balashek, S. (1952). Automatic recognition of spoken digits. The Journal of the Acoustical Society of America, 24(6), 637642. https://doi.org/10.1121/1.1906946CrossRefGoogle Scholar
Derwing, T. M., & Munro, M. J. (2015). Pronunciation fundamentals: Evidence-based perspectives for L2 teaching and research. John BenjaminsCrossRefGoogle Scholar
Guskaroska, A. (2020). ASR-dictation on smartphones for vowel pronunciation practice. Journal of Contemporary Philology, 3(2), 4561. https://doi.org/10.37834/jcp2020045gGoogle Scholar
Hincks, R. (2015). Technology and learning pronunciation. In Reed, M. & Levis, J. (Eds.), The handbook of English pronunciation (pp. 505519). John Wiley & Sons.CrossRefGoogle Scholar
Huensch, A. (2019). Pronunciation in foreign language classrooms: Instructors’ training, classroom practices, and beliefs. Language Teaching Research, 23(6), 745764. https://doi.org/10.1177/136216881876718CrossRefGoogle Scholar
Jenkins, J. (2000). The phonology of English as an international language. Oxford University Press.Google Scholar
Khaustova, V., Pyshkin, E., Khaustov, V., Blake, J., & Bogach, N. (2023, November). CAPTuring accents: An approach to personalize pronunciation training for learners with different L1 backgrounds. In International conference on speech and computer (pp. 5970). Springer Nature Switzerland. https://doi.org/10.1007/978-3-031-48312-7_5CrossRefGoogle Scholar
Lamel, L., & Gauvain, J.-L. (2022). Speech recognition. In Mitkov, R. (Ed.), The Oxford handbook of computational linguistics (pp. 770788). Oxford University Press.Google Scholar
Levis, J. (2020). Revisiting the intelligibility and nativeness principles. Journal of Second Language Pronunciation, 6(3), 310328. https://doi.org/10.1075/jslp.20050.levCrossRefGoogle Scholar
Levis, J., & Suvorov, R. (2020). Automatic speech recognition. In Chapelle, C. (Ed.), The encyclopedia of applied linguistics. Wiley. https://doi.org/10.1002/9781405198431.wbeal0066.pub2Google Scholar
McCrocklin, S. (2019). Learners’ feedback regarding ASR-based dictation practice for pronunciation learning. CALICO Journal, 36(2), 119137. https://doi.org/10.1558/cj.34738Google Scholar
Munro, M. J., & Derwing, T. M. (1995). Foreign accent, comprehensibility, and intelligibility in the speech of second language learners. Language Learning, 45(1), 7397. https://doi.org/10.1111/j.1467-1770.1995.tb00963.xCrossRefGoogle Scholar
Munro, M. J., & Derwing, T. M. (2020). Foreign accent, comprehensibility and intelligibility, redux. Journal of Second Language Pronunciation, 6(3), 283309. https://doi.org/10.1075/jslp.20038.munCrossRefGoogle Scholar
Murphy, J. M., & Baker, A. A. (2015). History of ESL pronunciation teaching. In Reed, M. & Levis, J. (Eds.), The handbook of English pronunciation (pp. 3665). Wiley-Blackwell.10.1002/9781118346952.ch3CrossRefGoogle Scholar
Rabiner, L., & Juang, B. H. (2008). Historical perspective of the field of ASR/ NLU. In Benesty, J., Sondhi, M. M., & Huang, Y. A. (Eds.), Springer handbook of speech processing (pp. 521538). Springer.CrossRefGoogle Scholar