ConceptKT: A Benchmark for Concept-Level Deficiency Prediction in Knowledge Tracing

Yu-Chen Kang; Yu-Chien Tang; An-Zi Yen

ConceptKT: A Benchmark for Concept-Level Deficiency Prediction in Knowledge Tracing

Yu-Chen Kang, Yu-Chien Tang, An-Zi Yen

Abstract

Knowledge Tracing (KT) is a critical technique for modeling student knowledge to support personalized learning. However, most KT systems focus on binary correctness prediction and cannot diagnose the underlying conceptual misunderstandings that lead to errors. Such fine-grained diagnostic feedback is essential for designing targeted instruction and effective remediation. In this work, we introduce the task of concept-level deficiency prediction, which extends traditional KT by identifying the specific concepts a student is likely to struggle with on future problems. We present ConceptKT, a dataset annotated with labels that capture both the concepts required to solve each question and the missing concepts underlying incorrect responses. We investigate in-context learning approaches to KT and evaluate the diagnostic capabilities of various Large Language Models (LLMs) and Large Reasoning Models (LRMs). Different strategies for selecting informative historical records are explored. Experimental results demonstrate that selecting response histories based on conceptual alignment and semantic similarity leads to improved performance on both correctness prediction and concept-level deficiency identification.

ConceptKT: A Benchmark for Concept-Level Deficiency Prediction in Knowledge Tracing

Abstract

Paper Structure (21 sections, 3 figures, 6 tables)

This paper contains 21 sections, 3 figures, 6 tables.

Introduction
Related Work
LLM-Enhanced and Open-Ended Knowledge Tracing
Mathematics Education Datasets
Dataset Construction and Analysis
From MathEDU to ConceptKT
Data Annotation
Dataset Statistics and Analysis
Concept Statistics
Student Concept Mastery Analysis
Methodology
Task Formulation
Response Selection Strategies
Experiments
Experimental Setup
...and 6 more sections

Figures (3)

Figure 1: Distribution of Questions Across 13 Categories.
Figure 2: Concept-level error rates across students, where darker shades indicate higher error proportions and "×" marks denote concepts that were not covered in the student's problem-solving history.
Figure 3: Overview of Knowledge Tracing.

ConceptKT: A Benchmark for Concept-Level Deficiency Prediction in Knowledge Tracing

Abstract

ConceptKT: A Benchmark for Concept-Level Deficiency Prediction in Knowledge Tracing

Authors

Abstract

Table of Contents

Figures (3)