Abstract
This research explores the performance of the Whisper ASR system on a range of native and non-native English accents. The findings indicate better performance on North American than on British and Irish English accents, and on native than on non-native accents. The analysis also reveals links between speaker traits (sex, L1 typology, and L2 proficiency) and word error rate. An unsupervised K-means analysis identified ten distinct clusters in the data, providing insight into the relationship between speaker characteristics and ASR performance. Additionally, the study found that Whisper performed better on read speech than on conversational speech. The implications of these findings are discussed.


