Hostname: page-component-77c78cf97d-7dld4 Total loading time: 0 Render date: 2026-04-25T05:48:40.065Z Has data issue: false hasContentIssue false

The numeric understanding measures: Developing and validating adaptive and nonadaptive numeracy scales

Published online by Cambridge University Press:  28 June 2023

Michael C. Silverstein*
Affiliation:
Department of Psychology, University of Oregon, Eugene, OR, USA Center for Science Communication Research, School of Journalism and Communication, University of Oregon, Eugene, OR, USA
Pär Bjälkebring
Affiliation:
Center for Science Communication Research, School of Journalism and Communication, University of Oregon, Eugene, OR, USA Department of Psychology, University of Gothenburg, Gothenburg, Sweden
Brittany Shoots-Reinhard
Affiliation:
Center for Science Communication Research, School of Journalism and Communication, University of Oregon, Eugene, OR, USA Department of Psychology, The Ohio State University, Columbus, OH, USA
Ellen Peters
Affiliation:
Department of Psychology, University of Oregon, Eugene, OR, USA Center for Science Communication Research, School of Journalism and Communication, University of Oregon, Eugene, OR, USA
*
Corresponding author: Michael C. Silverstein; Email: msilver2@uoregon.edu
Rights & Permissions [Opens in a new window]

Abstract

Numeracy—the ability to understand and use numeric information—is linked to good decision-making. Several problems exist with current numeracy measures, however. Depending on the participant sample, some existing measures are too easy or too hard; also, established measures often contain items well-known to participants. The current article aimed to develop new numeric understanding measures (NUMs) including a 1-item (1-NUM), 4-item (4-NUM), and 4-item adaptive measure (A-NUM). In a calibration study, 2 participant samples (n = 226 and 264 from Amazon’s Mechanical Turk [MTurk]) each responded to half of 84 novel numeracy items. We calibrated items using 2-parameter logistic item response theory (IRT) models. Based on item parameters, we developed the 3 new numeracy measures. In a subsequent validation study, 600 MTurk participants completed the new numeracy measures, the adaptive Berlin Numeracy Test, and the Weller Rasch-Based Numeracy Test, in randomized order. To establish predictive and convergent validities, participants also completed judgment and decision tasks, Raven’s progressive matrices, a vocabulary test, and demographics. Confirmatory factor analyses suggested that the 1-NUM, 4-NUM, and A-NUM load onto the same factor as existing measures. The NUM scales also showed similar association patterns to subjective numeracy and cognitive ability measures as established measures. Finally, they effectively predicted classic numeracy effects. In fact, based on power analyses, the A-NUM and 4-NUM appeared to confer more power to detect effects than existing measures. Thus, using IRT, we developed 3 brief numeracy measures, using novel items and without sacrificing construct scope. The measures can be downloaded as Qualtrics files (https://osf.io/pcegz/).

Information

Type
Empirical Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (https://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2023. Published by Cambridge University Press on behalf of the Society for Judgment and Decision Making and European Association for Decision Making
Figure 0

Figure 1 The structure of the adaptive numeric understanding measure with item parameters included (from Study 1; N’s = 224 and 264).

Figure 1

Table 1 Items that compose the adaptive numeric understanding measure in Figure 1’s order

Figure 2

Table 2 The items that compose the 4-item numeracy measure in order of difficulty (easiest to hardest) with item parameters (from Study 1, N’s = 224 and 264)

Figure 3

Figure 2 The results of the confirmatory factor analysis of the A-NUM, 4-NUM, BNT, and Weller numeracy measures. The values on the lines from numeracy to the measures represent the loading of each measure onto the latent factor. The values below the measures represent the error variance of the measures (i.e., the variance in scores unexplained by the latent factor). Note: p-values are <.001 for all paths.

Figure 4

Table 3 Means, standard deviations, and correlations for all constructs used to test convergent validity

Figure 5

Table 4 Effect sizes of focal numeracy effects in predictive validity tasks without and with adjusting for assessed Raven’s progressive matrices and vocabulary

Figure 6

Table 5 Summary of measure metrics

Supplementary material: File

Silverstein et al. supplementary material

Silverstein et al. supplementary material

Download Silverstein et al. supplementary material(File)
File 277 KB