Ken Utilization Layer: Hebbian Replay Within a Student's Ken for Adaptive Exercise Recommendation
Grey Kuling, Marinka Zitnik
TL;DR
This work addresses the challenge of real-time, few-shot exercise recommendation under sparse supervision by introducing KUL-Rec, a memory-augmented architecture grounded in Complementary Learning Systems. It combines a fast Hebbian memory (Notebook) with a slow-learning Student network and a Loss-aligned Internal Target to enable continual personalization without storing raw data or relying on cohort training, and it supports both tabular and open-ended inputs through a unified embedding space. The approach is evaluated via bidirectional, rank-sensitive metrics on ten tabular KT datasets and a live 13-week short-answer deployment, demonstrating superior ranking performance, reduced memory and latency, and meaningful improvements in learner experience. The findings suggest that Hebbian replay with bounded consolidation offers a scalable, interpretable path to adaptive educational technology across modalities, while highlighting embedding robustness as a critical consideration for deployment.
Abstract
Adaptive exercise recommendation (ER) aims to choose the next activity that matches a learner's evolving Zone of Proximal Development (ZPD). We present KUL-Rec, a biologically inspired ER system that couples a fast Hebbian memory with slow replay-based consolidation to enable continual, few-shot personalization from sparse interactions. The model operates in an embedding space, allowing a single architecture to handle both tabular knowledge-tracing logs and open-ended short-answer text. We align evaluation with tutoring needs using bidirectional ranking and rank-sensitive metrics (nDCG, Recall@K). Across ten public datasets, KUL-Rec improves macro nDCG (0.316 vs. 0.265 for the strongest baseline) and Recall@10 (0.305 vs. 0.211), while achieving low inference latency and an $\approx99$\% reduction in peak GPU memory relative to a competitive graph-based model. In a 13-week graduate course, KUL-Rec personalized weekly short-answer quizzes generated by a retrieval-augmented pipeline and the personalized quizzes were associated with lower perceived difficulty and higher helpfulness (p < .05). An embedding robustness audit highlights that encoder choice affects semantic alignment, motivating routine audits when deploying open-response assessment. Together, these results indicate that Hebbian replay with bounded consolidation offers a practical path to real-time, interpretable ER that scales across data modalities and classroom settings.
