AdvKT: An Adversarial Multi-Step Training Framework for Knowledge Tracing
Lingyue Fu, Ting Long, Jianghao Lin, Wei Xia, Xinyi Dai, Ruiming Tang, Yasheng Wang, Weinan Zhang, Yong Yu
TL;DR
AdvKT addresses the mismatch between common single-step KT training and the multi-step inference used in real-world ITS simulations, which causes error accumulation and data sparsity problems. It introduces an adversarial multi-step framework with a generator that predicts multi-step responses and a discriminator that guides data augmentation to produce realistic, diverse sequences, thereby aligning training with inference. The method combines KT-specific embeddings, GRU-based state modeling with attention, and specialized BCE/advantage losses plus autoregressive objectives, along with richly augmented and synthetic data to improve robustness. Experiments on four public KT datasets show AdvKT achieving state-of-the-art results in multi-step settings and notable improvements in data-sparse scenarios, demonstrating practical potential for student simulators and intelligent tutoring systems.
Abstract
Knowledge Tracing (KT) monitors students' knowledge states and simulates their responses to question sequences. Existing KT models typically follow a single-step training paradigm, which leads to discrepancies with the multi-step inference process required in real-world simulations, resulting in significant error accumulation. This accumulation of error, coupled with the issue of data sparsity, can substantially degrade the performance of recommendation models in the intelligent tutoring systems. To address these challenges, we propose a novel Adversarial Multi-Step Training Framework for Knowledge Tracing (AdvKT), which, for the first time, focuses on the multi-step KT task. More specifically, AdvKT leverages adversarial learning paradigm involving a generator and a discriminator. The generator mimics high-reward responses, effectively reducing error accumulation across multiple steps, while the discriminator provides feedback to generate synthetic data. Additionally, we design specialized data augmentation techniques to enrich the training data with realistic variations, ensuring that the model generalizes well even in scenarios with sparse data. Experiments conducted on four real-world datasets demonstrate the superiority of AdvKT over existing KT models, showcasing its ability to address both error accumulation and data sparsity issues effectively.
