Hostname: page-component-6766d58669-bp2c4 Total loading time: 0 Render date: 2026-05-21T08:44:40.680Z Has data issue: false hasContentIssue false

A Genetic Algorithm for Automated Assembly of Linear Multidimensional Forced-Choice Questionnaires

Published online by Cambridge University Press:  24 April 2026

Jianbin Fu*
Affiliation:
Educational Testing Service , USA
Patrick C. Kyllonen
Affiliation:
Educational Testing Service , USA
*
Corresponding author: Jianbin Fu; Email: jfu@ets.org
Rights & Permissions [Opens in a new window]

Abstract

This article proposes a genetic algorithm, the array histogram-based sampling algorithm (AHBSA), to assemble linear multidimensional forced-choice questionnaires (MFCQs), which are increasingly popular for measuring personality, with forced-choice items including any number of statements (block size). The algorithm also works for traditional multidimensional Likert and cognitive test forms, where items can be seen as having a block size of one. Real and simulated statement pools are used to evaluate AHBSA’s performance in terms of test reliability and running speed in assembling Likert forms and MFCQs with two (pair) and three (triplet) statements. Compared to mixed integer programming (MIP) and random assembly, AHBSA achieves as high or higher reliabilities under all conditions: in Likert forms, AHBSA reaches optimal solutions as MIP does, and in the MFCQs, AHBSA gains notable increases in reliabilities over MIP when the forms have no constraint on item direction (i.e., positively versus negatively keyed statements). AHBSA takes more time to converge than MIP. The findings and limitations are discussed, and suggestions for future work are provided.

Information

Type
Theory and Methods
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (https://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© Educational Testing Service, 2026. Published by Cambridge University Press on behalf of Psychometric Society
Figure 0

Algorithm 1 Schematic description of sampling process of one form in the initial generation.

Figure 1

Algorithm 2 Schematic description of sampling process of one form in a noninitial generation.

Figure 2

Algorithm 3 Schematic description of the whole loop of AHBSA.

Figure 3

Table 1 Correlation matrix of five domains

Figure 4

Table 2 Comparisons of theoretical reliabilities and running times on real forms

Figure 5

Table 3 Comparisons of quality of latent trait score estimates and running times on simulated forms

Figure 6

Table 4 Effect sizes (generalized ${\eta}^2$) for ANOVAs on true reliabilities in the simulation study

Figure 7

Figure 1 Average reliability over time for the best candidates in AHBSA and comparison to mean reliabilities in MIP and random in the 90-statement and NonDir condition.Note: AHBSA, array histogram-based sampling algorithm; MIP, mixed integer programming; NonDir, statement pool including 50% negative statements and no constraint on item direction. The mean theoretical reliability and running time were calculated using the best candidates by generation in up to 30 runs in AHBSA.

Figure 8

Table A1 Comparisons of average theoretical reliabilities on simulated forms