Survey of Computerized Adaptive Testing: A Machine Learning Perspective
Qi Liu, Yan Zhuang, Haoyang Bi, Zhenya Huang, Weizhe Huang, Jiatong Li, Junhao Yu, Zirui Liu, Zirui Hu, Yuting Hong, Zachary A. Pardos, Haiping Ma, Mengxiao Zhu, Shijin Wang, Enhong Chen
TL;DR
This survey analyzes how machine learning enriches Computerized Adaptive Testing (CAT) by examining four life-cycle components: Cognitive Diagnosis Models for proficiency estimation, question selection algorithms, question bank construction, and test control. It contrasts traditional statistical methods (e.g., Fisher and KL information) with modern data-driven approaches (reinforcement learning, meta-learning, active learning) and discusses model-agnostic strategies, robustness, fairness, and search efficiency. The paper highlights practical evaluation via simulations and real datasets, and it points to future directions including multi-dimensional assessment, MST, generative AI integration, explainability, and AI-system evaluation. By bridging psychometrics and ML, it advocates an interdisciplinary framework for scalable, fair, and explainable CAT systems with open-source tooling (EduCAT).
Abstract
Computerized Adaptive Testing (CAT) provides an efficient and tailored method for assessing the proficiency of examinees, by dynamically adjusting test questions based on their performance. Widely adopted across diverse fields like education, healthcare, sports, and sociology, CAT has revolutionized testing practices. While traditional methods rely on psychometrics and statistics, the increasing complexity of large-scale testing has spurred the integration of machine learning techniques. This paper aims to provide a machine learning-focused survey on CAT, presenting a fresh perspective on this adaptive testing method. By examining the test question selection algorithm at the heart of CAT's adaptivity, we shed light on its functionality. Furthermore, we delve into cognitive diagnosis models, question bank construction, and test control within CAT, exploring how machine learning can optimize these components. Through an analysis of current methods, strengths, limitations, and challenges, we strive to develop robust, fair, and efficient CAT systems. By bridging psychometric-driven CAT research with machine learning, this survey advocates for a more inclusive and interdisciplinary approach to the future of adaptive testing.
