Hostname: page-component-6766d58669-7fx5l Total loading time: 0 Render date: 2026-05-15T13:55:37.238Z Has data issue: false hasContentIssue false

Proof of Reliability Convergence to 1 at Rate of Spearman–Brown Formula for Random Test Forms and Irrespective of Item Pool Dimensionality

Published online by Cambridge University Press:  01 January 2025

Jules L. Ellis*
Affiliation:
Open University of The Netherlands Radboud University Nijmegen
Klaas Sijtsma
Affiliation:
Tilburg University
*
Correspondence should be made to Jules L. Ellis, Faculty of Psychology, Open University of The Netherlands, Heerlen, The Netherlands. Email: jules.ellis@ou.nl
Rights & Permissions [Opens in a new window]

Abstract

It is shown that the psychometric test reliability, based on any true-score model with randomly sampled items and uncorrelated errors, converges to 1 as the test length goes to infinity, with probability 1, assuming some general regularity conditions. The asymptotic rate of convergence is given by the Spearman–Brown formula, and for this it is not needed that the items are parallel, or latent unidimensional, or even finite dimensional. Simulations with the 2-parameter logistic item response theory model reveal that the reliability of short multidimensional tests can be positively biased, meaning that applying the Spearman–Brown formula in these cases would lead to overprediction of the reliability that results from lengthening a test. However, test constructors of short tests generally aim for short tests that measure just one attribute, so that the bias problem may have little practical relevance. For short unidimensional tests under the 2-parameter logistic model reliability is almost unbiased, meaning that application of the Spearman–Brown formula in these cases of greater practical utility leads to predictions that are approximately unbiased.

Information

Type
Original Research
Creative Commons
Creative Common License - CCCreative Common License - BY
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
Copyright
Copyright © 2024 The Author(s)
Figure 0

Figure 1 Mean reliabilities in Simulation Study 1 for unidimensional cases with amax=2\documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$a_{\textrm{max}}=2$$\end{document}. Note. Mean reliability and mean rescaled reliability as a function of test length, in 18 cases of unidimensional models. The cases are represented by different colors. Each point is based on 1000 random test versions.

Figure 1

Figure 2 Mean reliabilities in Simulation Study 1 for two-dimensional cases with amax=2\documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$a_{\textrm{max}}=2$$\end{document}. Note. Mean reliability and mean rescaled reliability as a function of test length, in 18 cases of two-dimensional models. The cases are represented by different colors. Each point is based on 1000 random test versions.

Figure 2

Figure 3 Mean reliabilities in Simulation Study 1 for five-dimensional cases with amax=2\documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$a_{\textrm{max}}=2$$\end{document}. Note. Mean reliability and mean rescaled reliability as a function of test length, in 18 cases of five-dimensional models. The cases are represented by different colors. Each point is based on 1000 random test versions.

Figure 3

Figure 4 Mean reliabilities in Simulation Study 1 for five-dimensional cases with amax=5\documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$a_{\textrm{max}}=5$$\end{document}. Note. Mean reliability and mean rescaled reliability as a function of test length, in 18 cases of five-dimensional models. The cases are represented by different colors. Each point is based on 1000 random test versions.

Figure 4

Figure 5 Boxplots of the minimum and maximum errors in prediction of mean reliabilities. Note. Each boxplot is based on 18 minima and maxima, corresponding to 18 cases with amax=2\documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$a_{\textrm{max}}=2$$\end{document}. Each minimum or maximum is based on 36 predictions of a mean reliability of one test length from a mean reliability of another test length, for 9 different test lengths.

Figure 5

Table 1 Maximum Errors of Simulation Study 2.

Figure 6

Figure 6 Boxplots of the standard deviations of the reliabilities as a function of the distribution type of the parameters and the length of the test versions. Note. Only unidimensional items with amax=2\documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$a_{\textrm{max}}=2$$\end{document} were used in this plot. Each boxplot is based on 54 (Beta) or 15 (Binary 1 and Binary 2) or 100 (Irregular) standard deviations, and each standard deviation is based on 1000 reliabilities of random test forms drawn from the same item pool.