Hostname: page-component-77f85d65b8-5ngxj Total loading time: 0 Render date: 2026-03-27T21:13:12.544Z Has data issue: false hasContentIssue false

Automated analysis of common errors in L2 learner production: Prototype web application development

Published online by Cambridge University Press:  19 June 2025

Atsushi Mizumoto*
Affiliation:
Faculty of Foreign Language Studies, Kansai University , Osaka, Japan
Rights & Permissions [Opens in a new window]

Abstract

This research report presents the development and validation of Auto Error Analyzer, a prototype web application designed to automate the calculation of accuracy and its related metrics for measuring second language (L2) production. Building on recent advancements in natural language processing (NLP) and artificial intelligence (AI), Auto Error Analyzer introduces an automated accuracy measurement component, bridging a gap in existing assessment tools, which traditionally require human judgment for accuracy evaluation. By utilizing a state-of-the-art generative AI model (Llama 3.3) for error detection, Auto Error Analyzer analyzes L2 texts efficiently and cost-effectively, producing accuracy metrics (e.g., errors per 100 words). Validation results demonstrate high agreement between the tool’s error counts and human rater judgments (r = .94), with microaverage precision and recall in error detection being high as well (.96 and .94 respectively, F1 = .95), and its T-unit and clause counts matched outputs from established tools like L2SCA. Developed under open science principles to ensure transparency and replicability, the tool aims to support researchers and educators while emphasizing the complementary role of human expertise in language assessment. The possibilities of Auto Error Analyzer for efficient and scalable error analysis, as well as its limitations in detecting context-dependent and first-language (L1)-influenced errors, are also discussed.

Information

Type
Research Report
Creative Commons
Creative Common License - CCCreative Common License - BYCreative Common License - NCCreative Common License - SA
This is an Open Access article, distributed under the terms of the Creative Commons Attribution-NonCommercial-ShareAlike licence (http://creativecommons.org/licenses/by-nc-sa/4.0), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the same Creative Commons licence is used to distribute the re-used or adapted article and the original article is properly cited. The written permission of Cambridge University Press must be obtained prior to any commercial use.
Copyright
© The Author(s), 2025. Published by Cambridge University Press
Figure 0

Figure 1. Auto Error Analyzer landing page.

Figure 1

Table 1. Indices calculated in the Auto Error Analyzer

Figure 2

Table 2. Error types in Auto Error Analyzer

Figure 3

Figure 2. Sample of error analysis output in Auto Error Analyzer.

Figure 4

Figure 3. Error distribution plot in Auto Error Analyzer.

Figure 5

Figure 4. Correlations of T-units and clauses calculated with web-based L2SCA and Auto Error Analyzer.

Figure 6

Figure 5. Correlation of errors per 100 words: Human raters and Auto Error Analyzer.