Tracing Mathematical Proficiency Through Problem-Solving Processes
Jungyang Park, Suho Kang, Jaewoo Park, Jaehong Kim, Jaewoo Shin, Seonjoon Park, Youngjae Yu
TL;DR
This work addresses explainability gaps in knowledge tracing by incorporating students' problem-solving processes into KT (KT-PSP). It introduces KT-PSP-25, a math dataset with OCR-transcribed PSPs, and StatusKT, a three-stage LLM pipeline that derives mathematical proficiency indicators and MP ratios to augment KT models. Across multiple baselines, StatusKT improves prediction accuracy (ACC and AUC) and provides interpretable proficiency signals, demonstrating the value of process-level reasoning for pedagogy. The approach highlights the importance of integrating reasoning traces into student models to achieve more accurate, interpretable, and practically useful KT systems.
Abstract
Knowledge Tracing (KT) aims to model student's knowledge state and predict future performance to enable personalized learning in Intelligent Tutoring Systems. However, traditional KT methods face fundamental limitations in explainability, as they rely solely on the response correctness, neglecting the rich information embedded in students' problem-solving processes. To address this gap, we propose Knowledge Tracing Leveraging Problem-Solving Process (KT-PSP), which incorporates students' problem-solving processes to capture the multidimensional aspects of mathematical proficiency. We also introduce KT-PSP-25, a new dataset specifically designed for the KT-PSP. Building on this, we present StatusKT, a KT framework that employs a teacher-student-teacher three-stage LLM pipeline to extract students' MP as intermediate signals. In this pipeline, the teacher LLM first extracts problem-specific proficiency indicators, then a student LLM generates responses based on the student's solution process, and a teacher LLM evaluates these responses to determine mastery of each indicator. The experimental results on KT-PSP-25 demonstrate that StatusKT improves the prediction performance of existing KT methods. Moreover, StatusKT provides interpretable explanations for its predictions by explicitly modeling students' mathematical proficiency.
