Multidimensional Item Response Theory in the Style of Collaborative Filtering

Yoav Bergner; Peter Halpin; Jill-Jênn Vie

doi:10.1007/s11336-021-09788-9

Multidimensional Item Response Theory in the Style of Collaborative Filtering

Published online by Cambridge University Press: 01 January 2025

Yoav Bergner ,

Peter Halpin and

Jill-Jênn Vie

Show author details

Yoav Bergner: Affiliation:
New York University
Peter Halpin: Affiliation:
University of North Carolina-Chapel Hill
Jill-Jênn Vie*: Affiliation:
Inria
*: Correspondence should be made to Jill-Jênn Vie, Inria, UMR 9189 CRIStAL, 40 avenue Halley, 59650 Villeneuved’Ascq, France. Email: jill-jenn.vie@inria.fr

Article contents

Abstract
References

Get access

Rights & Permissions

Abstract

This paper presents a machine learning approach to multidimensional item response theory (MIRT), a class of latent factor models that can be used to model and predict student performance from observed assessment data. Inspired by collaborative filtering, we define a general class of models that includes many MIRT models. We discuss the use of penalized joint maximum likelihood to estimate individual models and cross-validation to select the best performing model. This model evaluation process can be optimized using batching techniques, such that even sparse large-scale data can be analyzed efficiently. We illustrate our approach with simulated and real data, including an example from a massive open online course. The high-dimensional model fit to this large and sparse dataset does not lend itself well to traditional methods of factor interpretation. By analogy to recommender-system applications, we propose an alternative “validation” of the factor model, using auxiliary information about the popularity of items consulted during an open-book examination in the course.

Keywords

item response theory multidimensionality machine learning collaborative filtering joint maximum likelihood

Information

Type: Application Reviews and Case Studies
Information: Psychometrika , Volume 87 , Issue 1: Special Issue on Network Psychometrics in Action , March 2022 , pp. 266 - 288

DOI: https://doi.org/10.1007/s11336-021-09788-9 [Opens in a new window]
Copyright: Copyright © 2021 The Psychometric Society

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

Article purchase

Temporarily unavailable

References

Alldredge, J.,& Gilb, N.. 1976. Ridge regression: An annotated bibliography. International Statistical Review, 44 (3), 355– 360. Google Scholar

Andersen, E.. 1970. Asymptotic properties of conditional maximum-likelihood estimators. Journal of the Royal Statistical Society Series B, 32 (2), 283– 301. CrossRef Google Scholar

Bartholomew, D. J., Knott, M., & Moutsaki, I.. 2011. Latent variable models and factor analysis, 3 London Arnold CrossRef Google Scholar

Bennett, J ., & Lanning, S ., (2007.). The Netflix prize. In Proceedings of KDD cup and workshop (Vol. 2007, p. 35).CrossRef Google Scholar

Bergner, Y.. 2017. Measurement and its uses in learning analytics. Handbook of Learning Analytics, 35 35– 48. CrossRef Google Scholar

Bergner, Y ., Colvin, K ., & Pritchard, D .E., (2015.). Estimation of ability from homework items when there are missing and/or multiple attempts. In Proceedings of the fifth international conference on learning analytics and knowledge—LAK ’15.CrossRef Google Scholar

Bergner, Y ., Droschler, S ., & Kortemeyer, G ., (2012.). Model-based collaborative filtering analysis of student response data: Machine-learning item response theory. Educational Data Mining.Google Scholar

Billsus, D ., & Pazzani, M .J., (1998). Learning collaborative information filters. In Icml (Vol. 98, pp. 46–54).Google Scholar

Birnbaum, A .,. (1968.). Some latent trait models and their use in inferring an examinee’s ability. In Statistical theories of mental test scores.Google Scholar

Bock, R. D.,& Aitkin, M.. 1981. Marginal maximum likelihood estimation of item parameters: Application of an EM algorithm. Psychometrika, 46 (4), 443– 459. CrossRef Google Scholar

Bradley, A. P.. 1997. The use of the area under the ROC curve in the evaluation of machine learning algorithms. Pattern Recognition, 30 (7), 1145– 1159. CrossRef Google Scholar

Browne, M. W.. 2001. An overview of analytic rotation in exploratory factor analysis. Multivariate Behavioral Research, 36 (1), 111– 150. CrossRef Google Scholar

Cai, L.. 2010. High-dimensional exploratory item factor analysis by a Metropolis–Hastings Robbins–Monro algorithm. Psychometrika, 75 33– 57. CrossRef Google Scholar

Cen, H ., Koedinger, K ., & Junker, B .,. (2006.). Learning factors analysis–a general method for cognitive model evaluation and improvement. In International conference on intelligent tutoring systems (pp. 164–175).CrossRef Google Scholar

Chalmers, R. P.. 2012. mirt: A multidimensional item response theory package for the R environment. Journal of Statistical Software, 48 (6), 1– 29. CrossRef Google Scholar

Chen, Y., Li, X.,& Zhang, S.. 2019. Joint maximum likelihood estimation for high-dimensional exploratory item factor analysis. Psychometrika, 84 (1), 124– 146. CrossRef Google Scholar PubMed

Chen, Y ., Li, X ., & Zhang, S ., (2019b.). Structured latent factor analysis for large-scale data: Identifiability, estimability, and their implications arXiv:1712.08966[stat].Google Scholar

Cho, S .J.,& Rabe-Hesketh, S.. 2011. Alternating imputation posterior estimation of models with crossed random effects. Computational Statistics & Data Analysis, 55 (1), 12– 25. CrossRef Google Scholar

Chrysafiadi, K.,& Virvou, M.. 2013. Student modeling approaches: A literature review for the last decade. Expert Systems with Applications, 40 (11), 4715– 4729. CrossRef Google Scholar

Desmarais, M ., C., & Pu, X ., (2005.). A Bayesian student model without hidden nodes and its comparison with item response theory. International Journal of Artificial Intelligence in Education.Google Scholar

Doan, T .,-N., & Sahebi, S ., (2019.). Rank-based tensor factorization for predicting student performance. In Proceedings of the 12th international conference on educational data mining (pp. 288–293).Google Scholar

Fan, J.,& Li, R.. 2001. Variable selection via nonconcave penalized likelihood and its oracle properties. Journal of the American Statistical Association, 96 (456), 1348– 1360. CrossRef Google Scholar

Goodman, L .A.,& Kruskal, W. H.. 1954. Measures of association for cross classifications. Journal of the American Statistical Association, 49 (268), 732– 764. Google Scholar

Haberman, S., 1977. Maximum likelihood estimates in exponential response models The Annals of Statistics 5 (5) 815– 841 CrossRef Google Scholar

Hastie, T., Tibshirani, R.,& Friedman, J.. 2009. The elements of statistical learning. 2 New York Springer CrossRef Google Scholar

Hestenes, D., Wells, M.,& Swackhamer, G., 1992. Force concept inventory. The Physics Teacher, 30 (3), 141– 158. CrossRef Google Scholar

Hirose, K.,& Yamamoto, M.. 2015. Sparse estimation via nonconcave penalized likelihood in factor analysis model. Statistics and Computing, 25 (5), 863– 875. CrossRef Google Scholar

Holland, P.. 1990. On the sampling foundations of item response theory models. Psychometrika, 55 (4), 577– 601. CrossRef Google Scholar

Hu, B., Zhou, Y., Wang, J., Li, L., Shen, L., Yu, W., He, H.,& Zhang, N.. 2009. Application of Item response theory to collaborative filtering. Advances in neural networks—ISNN 2009, Berlin Springer 766– 773. CrossRef Google Scholar

Jin, S., Moustaki, I.,& Yang-Wallentin, F.. 2018. Approximated penalized maximum likelihood for exploratory factor analysis: An orthogonal case. Psychometrika, 83 (3), 628– 649. CrossRef Google Scholar

Kingma, D ., P. & Ba, J ., (2015.). Adam: A method for stochastic optimization. In 3rd international conference on learning representations, ICLR 2015, San Diego, CA, USA, May 7–9, 2015, conference track proceedings.arXiv:1412.6980.Google Scholar

Koren, Y ., & Bell, R ., (2015.). Advances in collaborative filtering. In Recommender systems handbook (pp. 77–118). Springer.CrossRef Google Scholar

Lan, A ., Waters, A ., Studer, C ., & Baraniuk, R ., (2013.). Sparse factor analysis for learning and content analytics. arXiv preprint.arXiv:1303.5685.CrossRef Google Scholar

Lord, F. M.. 1980. Applications of item response theory to practical testing problems, New York Routledge. Google Scholar

Lord, FM., Novick, M.,& Birnbaum, A.. Statistical theories of mental test scores 1968. Boston Addison-Wesly Publishing Google Scholar

Martin, B., Mitrovic, T., Mathan, S.,& Koedinger, KR.. 2010. Evaluating and improving adaptive educational systems with learning curves, User Modeling and User-Adapted Interaction: The Journal of Personalization Research. 21 249– 283. CrossRef Google Scholar

Palmer, H ., (2004.). Conditional maximum likelihood estimation. In The SAGE encyclopedia of social science research methods (pp. 168–169). Sage Publications.Google Scholar

Pan, J., Ip, E. H.,& Dubé, L.. 2017. An alternative to post hoc model modification in confirmatory factor analysis: The Bayesian Lasso, Psychological Methods. 22 (4), 687– 704. CrossRef Google Scholar PubMed

Pan, J., Ip, E. H., & Dube, L..(2019). Multilevel heterogeneous factor analysis and application to ecological momentary assessment. Psychometrika.Google Scholar

Pelánek, R.. 2016. Applications of the Elo rating system in adaptive educational systems, Computers and Education. 98 169– 179 CrossRef Google Scholar

Prechelt, L ., (1998). Early stopping-but when? In Neural networks: Tricks of the trade (pp. 55–69). Springer.Google Scholar

Reckase, M.. 2009. Multidimensional item response theory, New York Springer. CrossRef Google Scholar

Reye, J.. 2004. Student modelling based on belief networks. International Journal of Artificial Intelligence in Education, 14 1– 33. Google Scholar

Sahebi, S ., Lin, Y .-R., & Brusilovsky, P ., (2016.). Tensor factorization for student modeling and performance prediction in unstructured domain. In Proceedings of the 9th international conference on educational data mining (pp. 502–506).Google Scholar

Seaton, D. T., Bergner, Y., Chuang, I., Mitros, P.,& Pritchard, D. E.. 2014. Who does what in a massive open online course?, Communications of the ACM. 57 (4), 58– 65. CrossRef Google Scholar

Shi, J ., Xu, Y ., & Baraniuk, R ., (2014).Sparse bilinear logistic regression. arXiv preprint 1–27.arXiv:1404.4104.Google Scholar

Stewart, J., Zabriskie, C., Devore, S.,& Stewart, G.. 2018. Multidimensional item response theory and the Force Concept Inventory, Physical Review Physics Education Research. 14 (1), 10137–. CrossRef Google Scholar

Su, X ., & Khoshgoftaar, T . M., (2009.). A survey of collaborative filtering techniques. Advances in Artificial Intelligence, 2009 (Section 3), 1–19.CrossRef Google Scholar

Sun, J., Chen, Y., Liu, J., Ying, Z.,& Xin, T.. 2016. Latent variable selection for multidimensional item response theory models via L1 regularization, Psychometrika. 81 (4), 921– 939. CrossRef Google Scholar

Tibshirani, R., 1996. Regression shrinkage and selection via the Lasso, Journal of the Royal Statistical Society Series B. 58 (1), 267– 288. CrossRef Google Scholar

Trendafilov, N. T.,& Adachi, K.. 2015. Sparse versus simple structure loadings, Psychometrika. 80 (3), 776– 790. CrossRef Google Scholar PubMed

Trendafilov, N. T., Fontanella, S.,& Adachi, K.. 2017. Sparse exploratory factor analysis, Psychometrika. 82 (3) 778– 794 CrossRef Google Scholar

Yao, Y., Rosasco, L., Caponnetto, A., 2007. On early stopping in gradient descent learning, Constructive Approximation. 26 (2), 289– 315. CrossRef Google Scholar

Zhou, Y ., Wilkinson, D ., Schreiber, R ., & Pan, R ., (2008.). Large-scale parallel collaborative filtering for the Netflix prize. In International conference on algorithmic applications in management (pp. 337–348).CrossRef Google Scholar

Zhu, Y., Shen, X.,& Ye, C.. 2016. Personalized prediction and sparsity pursuit in latent factor models. Journal of the American Statistical Association, 111 (513), 241– 252. CrossRef Google Scholar

Article contents

Multidimensional Item Response Theory in the Style of Collaborative Filtering

Abstract

Keywords

Information

Access options

Article purchase

Temporarily unavailable

References

Save article to Kindle

Save article to Dropbox

Save article to Google Drive

Reply to: Submit a response

Your details

You have entered the maximum number of contributors

Conflicting interests