Does Interpretability of Knowledge Tracing Models Support Teacher Decision Making?
Adia Khalid, Alina Deriyeva, Benjamin Paassen
TL;DR
The paper examines whether interpretable KT models ($p_{t,j}$, $\theta_{t,k}$) enhance teacher decision making. Through an Elo-grounded simulation and a human-teacher study comparing BKT, PFA, and DKT, it shows that interpretable models yield faster simulated mastery and higher usability/trust, while human teachers did not consistently leverage interpretability to reduce the number of tasks to mastery. The results imply that interpretability alone does not straightforwardly improve pedagogical decisions; teachers may rely on information beyond model outputs or may misinterpret graphs, underscoring the need for better interfaces and integrative explanations. Overall, the work highlights opportunities to design KT models and interfaces that preserve interpretability while actually guiding teaching more effectively in real classrooms.
Abstract
Knowledge tracing (KT) models are a crucial basis for pedagogical decision-making, namely which task to select next for a learner and when to stop teaching a particular skill. Given the high stakes of pedagogical decisions, KT models are typically required to be interpretable, in the sense that they should implement an explicit model of human learning and provide explicit estimates of learners' abilities. However, to our knowledge, no study to date has investigated whether the interpretability of KT models actually helps human teachers to make teaching decisions. We address this gap. First, we perform a simulation study to show that, indeed, decisions based on interpretable KT models achieve mastery faster compared to decisions based on a non-interpretable model. Second, we repeat the study but ask $N=12$ human teachers to make the teaching decisions based on the information provided by KT models. As expected, teachers rate interpretable KT models higher in terms of usability and trustworthiness. However, the number of tasks needed until mastery hardly differs between KT models. This suggests that the relationship between model interpretability and teacher decisions is not straightforward: teachers do not solely rely on KT models to make decisions and further research is needed to investigate how learners and teachers actually understand and use KT models.
