Mastery Guided Non-parametric Clustering to Scale-up Strategy Prediction
Anup Shakya, Vasile Rus, Deepak Venugopal
TL;DR
The paper tackles the challenge of scaling strategy prediction in Adaptive Instructional Systems by exploiting symmetry in student mastery to cluster data non-parametrically. It introduces a mastery-based embedding, MVec, built on a Node2Vec-style relational graph of students, problems, and knowledge components, and uses a coarse-to-fine DP-Means clustering with a symmetry penalty to form strategy-invariant partitions. A one-to-many LSTM is trained on samples drawn from converged clusters to predict KC sequences, while an attention-based model estimates KC mastery to drive the embeddings. Experiments on large MATHia datasets BA08 and CL19 show that Attention Sampling achieves high accuracy with a small fraction of training data and maintains fairness across mastery levels, demonstrating scalable and robust strategy prediction for AIS deployment.
Abstract
Predicting the strategy (sequence of concepts) that a student is likely to use in problem-solving helps Adaptive Instructional Systems (AISs) better adapt themselves to different types of learners based on their learning abilities. This can lead to a more dynamic, engaging, and personalized experience for students. To scale up training a prediction model (such as LSTMs) over large-scale education datasets, we develop a non-parametric approach to cluster symmetric instances in the data. Specifically, we learn a representation based on Node2Vec that encodes symmetries over mastery or skill level since, to solve a problem, it is natural that a student's strategy is likely to involve concepts in which they have gained mastery. Using this representation, we use DP-Means to group symmetric instances through a coarse-to-fine refinement of the clusters. We apply our model to learn strategies for Math learning from large-scale datasets from MATHia, a leading AIS for middle-school math learning. Our results illustrate that our approach can consistently achieve high accuracy using a small sample that is representative of the full dataset. Further, we show that this approach helps us learn strategies with high accuracy for students at different skill levels, i.e., leveraging symmetries improves fairness in the prediction model.
