Hostname: page-component-76c49bb84f-bg8zk Total loading time: 0 Render date: 2025-07-08T11:00:35.068Z Has data issue: false hasContentIssue false

Multidimensional Item Response Theory in the Style of Collaborative Filtering

Published online by Cambridge University Press:  01 January 2025

Yoav Bergner
Affiliation:
New York University
Peter Halpin
Affiliation:
University of North Carolina-Chapel Hill
Jill-Jênn Vie*
Affiliation:
Inria
*
Correspondence should be made to Jill-Jênn Vie, Inria, UMR 9189 CRIStAL, 40 avenue Halley, 59650 Villeneuved’Ascq, France. Email: jill-jenn.vie@inria.fr

Abstract

This paper presents a machine learning approach to multidimensional item response theory (MIRT), a class of latent factor models that can be used to model and predict student performance from observed assessment data. Inspired by collaborative filtering, we define a general class of models that includes many MIRT models. We discuss the use of penalized joint maximum likelihood to estimate individual models and cross-validation to select the best performing model. This model evaluation process can be optimized using batching techniques, such that even sparse large-scale data can be analyzed efficiently. We illustrate our approach with simulated and real data, including an example from a massive open online course. The high-dimensional model fit to this large and sparse dataset does not lend itself well to traditional methods of factor interpretation. By analogy to recommender-system applications, we propose an alternative “validation” of the factor model, using auxiliary information about the popularity of items consulted during an open-book examination in the course.

Information

Type
Application Reviews and Case Studies
Copyright
Copyright © 2021 The Psychometric Society

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

Article purchase

Temporarily unavailable

References

Alldredge, J.,& Gilb, N.. 1976. Ridge regression: An annotated bibliography. International Statistical Review, 44 (3), 355360. Google Scholar
Andersen, E.. 1970. Asymptotic properties of conditional maximum-likelihood estimators. Journal of the Royal Statistical Society Series B, 32 (2), 283301. CrossRefGoogle Scholar
Bartholomew, D. J., Knott, M., & Moutsaki, I.. 2011. Latent variable models and factor analysis, 3 London Arnold CrossRefGoogle Scholar
Bennett, J ., & Lanning, S ., (2007.). The Netflix prize. In Proceedings of KDD cup and workshop (Vol. 2007, p. 35).CrossRefGoogle Scholar
Bergner, Y.. 2017. Measurement and its uses in learning analytics. Handbook of Learning Analytics, 35 3548. CrossRefGoogle Scholar
Bergner, Y ., Colvin, K ., & Pritchard, D .E., (2015.). Estimation of ability from homework items when there are missing and/or multiple attempts. In Proceedings of the fifth international conference on learning analytics and knowledge—LAK ’15.CrossRefGoogle Scholar
Bergner, Y ., Droschler, S ., & Kortemeyer, G ., (2012.). Model-based collaborative filtering analysis of student response data: Machine-learning item response theory. Educational Data Mining.Google Scholar
Billsus, D ., & Pazzani, M .J., (1998). Learning collaborative information filters. In Icml (Vol. 98, pp. 46–54).Google Scholar
Birnbaum, A .,. (1968.). Some latent trait models and their use in inferring an examinee’s ability. In Statistical theories of mental test scores.Google Scholar
Bock, R. D.,& Aitkin, M.. 1981. Marginal maximum likelihood estimation of item parameters: Application of an EM algorithm. Psychometrika, 46 (4), 443459. CrossRefGoogle Scholar
Bradley, A. P.. 1997. The use of the area under the ROC curve in the evaluation of machine learning algorithms. Pattern Recognition, 30 (7), 11451159. CrossRefGoogle Scholar
Browne, M. W.. 2001. An overview of analytic rotation in exploratory factor analysis. Multivariate Behavioral Research, 36 (1), 111150. CrossRefGoogle Scholar
Cai, L.. 2010. High-dimensional exploratory item factor analysis by a Metropolis–Hastings Robbins–Monro algorithm. Psychometrika, 75 3357. CrossRefGoogle Scholar
Cen, H ., Koedinger, K ., & Junker, B .,. (2006.). Learning factors analysis–a general method for cognitive model evaluation and improvement. In International conference on intelligent tutoring systems (pp. 164–175).CrossRefGoogle Scholar
Chalmers, R. P.. 2012. mirt: A multidimensional item response theory package for the R environment. Journal of Statistical Software, 48 (6), 129. CrossRefGoogle Scholar
Chen, Y., Li, X.,& Zhang, S.. 2019. Joint maximum likelihood estimation for high-dimensional exploratory item factor analysis. Psychometrika, 84 (1), 124146. CrossRefGoogle ScholarPubMed
Chen, Y ., Li, X ., & Zhang, S ., (2019b.). Structured latent factor analysis for large-scale data: Identifiability, estimability, and their implications arXiv:1712.08966[stat].Google Scholar
Cho, S .J.,& Rabe-Hesketh, S.. 2011. Alternating imputation posterior estimation of models with crossed random effects. Computational Statistics & Data Analysis, 55 (1), 1225. CrossRefGoogle Scholar
Chrysafiadi, K.,& Virvou, M.. 2013. Student modeling approaches: A literature review for the last decade. Expert Systems with Applications, 40 (11), 47154729. CrossRefGoogle Scholar
Desmarais, M ., C., & Pu, X ., (2005.). A Bayesian student model without hidden nodes and its comparison with item response theory. International Journal of Artificial Intelligence in Education.Google Scholar
Doan, T .,-N., & Sahebi, S ., (2019.). Rank-based tensor factorization for predicting student performance. In Proceedings of the 12th international conference on educational data mining (pp. 288–293).Google Scholar
Fan, J.,& Li, R.. 2001. Variable selection via nonconcave penalized likelihood and its oracle properties. Journal of the American Statistical Association, 96 (456), 13481360. CrossRefGoogle Scholar
Goodman, L .A.,& Kruskal, W. H.. 1954. Measures of association for cross classifications. Journal of the American Statistical Association, 49 (268), 732764. Google Scholar
Haberman, S., 1977. Maximum likelihood estimates in exponential response models The Annals of Statistics 5 (5) 815841 CrossRefGoogle Scholar
Hastie, T., Tibshirani, R.,& Friedman, J.. 2009. The elements of statistical learning. 2 New York Springer CrossRefGoogle Scholar
Hestenes, D., Wells, M.,& Swackhamer, G., 1992. Force concept inventory. The Physics Teacher, 30 (3), 141158. CrossRefGoogle Scholar
Hirose, K.,& Yamamoto, M.. 2015. Sparse estimation via nonconcave penalized likelihood in factor analysis model. Statistics and Computing, 25 (5), 863875. CrossRefGoogle Scholar
Holland, P.. 1990. On the sampling foundations of item response theory models. Psychometrika, 55 (4), 577601. CrossRefGoogle Scholar
Hu, B., Zhou, Y., Wang, J., Li, L., Shen, L., Yu, W., He, H.,& Zhang, N.. 2009. Application of Item response theory to collaborative filtering. Advances in neural networks—ISNN 2009, Berlin Springer 766773. CrossRefGoogle Scholar
Jin, S., Moustaki, I.,& Yang-Wallentin, F.. 2018. Approximated penalized maximum likelihood for exploratory factor analysis: An orthogonal case. Psychometrika, 83 (3), 628649. CrossRefGoogle Scholar
Kingma, D ., P. & Ba, J ., (2015.). Adam: A method for stochastic optimization. In 3rd international conference on learning representations, ICLR 2015, San Diego, CA, USA, May 7–9, 2015, conference track proceedings.arXiv:1412.6980.Google Scholar
Koren, Y ., & Bell, R ., (2015.). Advances in collaborative filtering. In Recommender systems handbook (pp. 77–118). Springer.CrossRefGoogle Scholar
Lan, A ., Waters, A ., Studer, C ., & Baraniuk, R ., (2013.). Sparse factor analysis for learning and content analytics. arXiv preprint.arXiv:1303.5685.CrossRefGoogle Scholar
Lord, F. M.. 1980. Applications of item response theory to practical testing problems, New York Routledge. Google Scholar
Lord, FM., Novick, M.,& Birnbaum, A.. Statistical theories of mental test scores 1968. Boston Addison-Wesly Publishing Google Scholar
Martin, B., Mitrovic, T., Mathan, S.,& Koedinger, KR.. 2010. Evaluating and improving adaptive educational systems with learning curves, User Modeling and User-Adapted Interaction: The Journal of Personalization Research. 21 249283. CrossRefGoogle Scholar
Palmer, H ., (2004.). Conditional maximum likelihood estimation. In The SAGE encyclopedia of social science research methods (pp. 168–169). Sage Publications.Google Scholar
Pan, J., Ip, E. H.,& Dubé, L.. 2017. An alternative to post hoc model modification in confirmatory factor analysis: The Bayesian Lasso, Psychological Methods. 22 (4), 687704. CrossRefGoogle ScholarPubMed
Pan, J., Ip, E. H., & Dube, L..(2019). Multilevel heterogeneous factor analysis and application to ecological momentary assessment. Psychometrika.Google Scholar
Pelánek, R.. 2016. Applications of the Elo rating system in adaptive educational systems, Computers and Education. 98 169179 CrossRefGoogle Scholar
Prechelt, L ., (1998). Early stopping-but when? In Neural networks: Tricks of the trade (pp. 55–69). Springer.Google Scholar
Reckase, M.. 2009. Multidimensional item response theory, New York Springer. CrossRefGoogle Scholar
Reye, J.. 2004. Student modelling based on belief networks. International Journal of Artificial Intelligence in Education, 14 133. Google Scholar
Sahebi, S ., Lin, Y .-R., & Brusilovsky, P ., (2016.). Tensor factorization for student modeling and performance prediction in unstructured domain. In Proceedings of the 9th international conference on educational data mining (pp. 502–506).Google Scholar
Seaton, D. T., Bergner, Y., Chuang, I., Mitros, P.,& Pritchard, D. E.. 2014. Who does what in a massive open online course?, Communications of the ACM. 57 (4), 5865. CrossRefGoogle Scholar
Shi, J ., Xu, Y ., & Baraniuk, R ., (2014).Sparse bilinear logistic regression. arXiv preprint 1–27.arXiv:1404.4104.Google Scholar
Stewart, J., Zabriskie, C., Devore, S.,& Stewart, G.. 2018. Multidimensional item response theory and the Force Concept Inventory, Physical Review Physics Education Research. 14 (1), 10137–. CrossRefGoogle Scholar
Su, X ., & Khoshgoftaar, T . M., (2009.). A survey of collaborative filtering techniques. Advances in Artificial Intelligence, 2009 (Section 3), 1–19.CrossRefGoogle Scholar
Sun, J., Chen, Y., Liu, J., Ying, Z.,& Xin, T.. 2016. Latent variable selection for multidimensional item response theory models via L1 regularization, Psychometrika. 81 (4), 921939. CrossRefGoogle Scholar
Tibshirani, R., 1996. Regression shrinkage and selection via the Lasso, Journal of the Royal Statistical Society Series B. 58 (1), 267288. CrossRefGoogle Scholar
Trendafilov, N. T.,& Adachi, K.. 2015. Sparse versus simple structure loadings, Psychometrika. 80 (3), 776790. CrossRefGoogle ScholarPubMed
Trendafilov, N. T., Fontanella, S.,& Adachi, K.. 2017. Sparse exploratory factor analysis, Psychometrika. 82 (3) 778794 CrossRefGoogle Scholar
Yao, Y., Rosasco, L., Caponnetto, A., 2007. On early stopping in gradient descent learning, Constructive Approximation. 26 (2), 289315. CrossRefGoogle Scholar
Zhou, Y ., Wilkinson, D ., Schreiber, R ., & Pan, R ., (2008.). Large-scale parallel collaborative filtering for the Netflix prize. In International conference on algorithmic applications in management (pp. 337–348).CrossRefGoogle Scholar
Zhu, Y., Shen, X.,& Ye, C.. 2016. Personalized prediction and sparsity pursuit in latent factor models. Journal of the American Statistical Association, 111 (513), 241252. CrossRefGoogle Scholar