Accurate Predictions in Education with Discrete Variational Inference
Tom Quilter, Anastasia Ilick, Karen Poon, Richard Turner
TL;DR
The paper tackles the challenge of predicting student success on unseen questions to enable scalable AI tutoring and reduce inequality. It builds on item response theory by evaluating an ability–difficulty model and extended class-interaction variants, and introduces a discrete variational inference framework to handle uncertainty in binary responses, validated on a large open GCSE mathematics dataset. The key findings show that a single overall ability parameter often dominates predictive accuracy, topic-level skill differences offer limited additional predictive power, and discrete variational inference yields notable gains in data-sparse settings, with class-level information providing modest improvements. The work provides practical guidance for adaptive learning systems and contributes a large, openly available benchmark for future research in educational data mining and probabilistic modelling in education.
Abstract
One of the largest drivers of social inequality is unequal access to personal tutoring, with wealthier individuals able to afford it, while the majority cannot. Affordable, effective AI tutors offer a scalable solution. We focus on adaptive learning, predicting whether a student will answer a question correctly, a key component of any effective tutoring system. Yet many platforms struggle to achieve high prediction accuracy, especially in data-sparse settings. To address this, we release the largest open dataset of professionally marked formal mathematics exam responses to date. We introduce a probabilistic modelling framework rooted in Item Response Theory (IRT) that achieves over 80 percent accuracy, setting a new benchmark for mathematics prediction accuracy of formal exam papers. Extending this, our collaborative filtering models incorporate topic-level skill profiles, but reveal a surprising and educationally significant finding, a single latent ability parameter alone is needed to achieve the maximum predictive accuracy. Our main contribution though is deriving and implementing a novel discrete variational inference framework, achieving our highest prediction accuracy in low-data settings and outperforming all classical IRT and matrix factorisation baselines.
