Hostname: page-component-89b8bd64d-5bvrz Total loading time: 0 Render date: 2026-05-14T06:48:33.319Z Has data issue: false hasContentIssue false

Univariate normality checking practices in L2 research: An AI-assisted systematic review

Published online by Cambridge University Press:  25 February 2026

Vahid Aryadoust*
Affiliation:
National Institute of Education, Nanyang Technological University, Singapore
Yichen Jia
Affiliation:
National Institute of Education, Nanyang Technological University, Singapore
*
Corresponding author: Vahid Aryadoust; Email: vahid.aryadoust@nie.edu.sg
Rights & Permissions [Opens in a new window]

Abstract

Regardless of whether one is analyzing quantitative data from research involving generative artificial intelligence (GenAI) or more classical methods, testing for normality remains a necessary step in statistical analysis. Although over 60 methods have been proposed for assessing univariate normality, previous systematic reviews show that normality testing remains underreported in L2 research. This paper addresses this gap by first reviewing the concept of normality and its role in parametric statistical inference. We then examine 12 normality assessment methods including five graphical and seven analytical methods selected based on their prominence in statistical literature and availability in commonly used software. Each method is explained in terms of its underlying mechanism and sensitivity to specific forms of nonnormality, such as skewness, tail heaviness, and multimodality. In the second part of the study, we review 237 empirical articles published between 2020 and 2025 in ten selected L2-focused Q1 journals, using AI-assisted annotation. Our findings reveal inconsistencies in how graphical tests are reported, a tendency to rely on tests such as the Kolmogorov-Smirnov without explicit attention to sample size constraints, and limited justification provided for critical values of skewness and kurtosis. These results indicate some divergence between recommended statistical practices and the procedures for normality testing reported in the L2 publications examined. The paper concludes with actionable recommendations for selecting and interpreting normality tests in L2 research contexts.

Information

Type
State of the Scholarship
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2026. Published by Cambridge University Press
Figure 0

Table 1. Overview of 61 normality tests categorized by methodological type and primary references presented by Hernandez (2021, pp. 3–11)

Figure 1

Figure 1. A representation of a Boxplot.

Figure 2

Figure 2. P-P plot (A) and Q-Q plot (B) with 20 unexpected values.

Figure 3

Table 2. An overview of graphical and analytical methods for assessing normality and their availability in common statistical packages (as of November 2025)

Figure 4

Figure 3. Classification of analytical normality tests by methodological approach.

Figure 5

Figure 4. PRISMA flow diagram for the review.

Figure 6

Figure 5. Prompt for ChatGPT-4o to code manuscripts.

Figure 7

Table 3. Normality checking reported for each study

Figure 8

Figure 6. Distribution of sample size in L2 studies using the Shapiro-Wilk (SW) test.

Figure 9

Figure 7. Normality reporting practices in using skewness.

Figure 10

Table 4. Practical recommendations for normality assessment in L2 research by sample size, method, and rationale.

Supplementary material: File

Aryadoust and Jia supplementary material

Aryadoust and Jia supplementary material
Download Aryadoust and Jia supplementary material(File)
File 24.8 KB