Language Bottleneck Models for Qualitative Knowledge State Modeling
Antonin Berthon, Mihaela van der Schaar
TL;DR
The paper tackles insufficient interpretability in traditional cognitive diagnosis and knowledge tracing by introducing Language Bottleneck Models (LBMs) that compress a student’s interaction history into a textual knowledge-state summary $\tilde{\mathcal{S}}$, which then drives predictions via a decoder. By casting knowledge state modeling as an inverse problem over text, LBMs yield human-readable explanations that can surface nuanced misconceptions while maintaining competitive predictive accuracy and improved sample efficiency. The encoder is trained with reinforcement learning using a decoder-centered reward, while the decoder undergoes supervised fine-tuning; steering mechanisms and ablation studies demonstrate that the textual bottleneck supports targeted pedagogy and robust performance across synthetic and real-world datasets. The approach promises enhanced interpretability and actionable diagnostics in education, with broader applicability to domains where compact textual state representations can forecast future behavior.
Abstract
Accurately assessing student knowledge is central to education. Cognitive Diagnosis (CD) models estimate student proficiency at a fixed point in time, while Knowledge Tracing (KT) methods model evolving knowledge states to predict future performance. However, existing approaches either provide quantitative concept mastery estimates with limited expressivity (CD, probabilistic KT) or prioritize predictive accuracy at the cost of interpretability (deep learning KT). We propose Language Bottleneck Models (LBMs), where an encoder LLM produces textual knowledge state summaries, which a decoder LLM uses to predict future performance. This produces interpretable summaries that can express nuanced insights, such as misconceptions, that CD and KT models cannot capture. Extensive validation across synthetic and real-world datasets shows LBMs reveal qualitative insights beyond what CD and KT models can capture, while achieving competitive accuracy with improved sample efficiency. We demonstrate that the encoder and decoder can be fine-tuned with reinforcement learning and supervised fine-tuning respectively to improve both summary quality and predictive performance.
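The encoder-summary-decoder pipeline described above can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration: the names `Interaction`, `toy_encoder`, `toy_decoder`, and `decoder_reward` are hypothetical, the rule-based functions merely stand in for the encoder and decoder LLMs, and the reward shown (decoder accuracy on held-out interactions) is only one plausible reading of a "decoder-centered reward", not the paper's exact definition.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Interaction:
    """One student response (hypothetical schema, not from the paper)."""
    skill: str
    correct: bool


def toy_encoder(history: List[Interaction]) -> str:
    """Stand-in for the encoder LLM: compress a history into a text summary."""
    by_skill = {}
    for item in history:
        by_skill.setdefault(item.skill, []).append(item.correct)
    sentences = []
    for skill, outcomes in sorted(by_skill.items()):
        rate = sum(outcomes) / len(outcomes)
        status = "has mastered" if rate >= 0.5 else "struggles with"
        sentences.append(f"The student {status} {skill}.")
    return " ".join(sentences)


def toy_decoder(summary: str, skill: str) -> bool:
    """Stand-in for the decoder LLM: predict correctness from the summary alone.

    The decoder never sees the raw history, so the text is the bottleneck.
    """
    return f"has mastered {skill}" in summary


def decoder_reward(summary: str, held_out: List[Interaction]) -> float:
    """Assumed reward: fraction of held-out answers the decoder predicts correctly."""
    hits = sum(toy_decoder(summary, item.skill) == item.correct for item in held_out)
    return hits / len(held_out)


# Toy usage: two skills, one mastered and one not.
history = [
    Interaction("fractions", True),
    Interaction("fractions", True),
    Interaction("decimals", False),
    Interaction("decimals", False),
]
held_out = [Interaction("fractions", True), Interaction("decimals", False)]

summary = toy_encoder(history)
reward = decoder_reward(summary, held_out)
```

In this sketch the summary is the only channel between the history and the prediction, so a summary that omits or misstates a skill directly lowers the reward. That is the property a reinforcement-learning signal for the encoder would exploit, under the assumptions stated above.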
