Personalized Negative Reservoir for Incremental Learning in Recommender Systems
Antonios Valkanas, Yuening Wang, Yingxue Zhang, Mark Coates
TL;DR
This work addresses the challenge of incremental learning for graph-based recommender systems under implicit feedback, where naive fine-tuning leads to catastrophic forgetting and traditional negative samplers neglect time-evolving user interests. It introduces GraphSANE, a personalized negative reservoir that biases hard negative sampling toward item categories where a user shows declining interest, guided by a Dirichlet prior on per-user category distributions and end-to-end differentiable clustering of items into latent clusters. The method combines a hard-negative reservoir with standard knowledge-distillation-based incremental learning and demonstrates state-of-the-art improvements across six large datasets when integrated with three SOTA incremental models, with notable gains for users exhibiting high interest shift. The approach offers a scalable, plug-in enhancement to existing pipelines, providing faster convergence and more accurate recommendations in dynamic, production-scale settings.
Abstract
Recommender systems have become an integral part of online platforms. Every day the volume of training data is expanding and the number of user interactions is constantly increasing. The exploration of larger and more expressive models has become a necessary pursuit to improve user experience. However, this progression carries with it an increased computational burden. In commercial settings, once a recommendation system model has been trained and deployed it typically needs to be updated frequently as new client data arrive. Cumulatively, the mounting volume of data is guaranteed to eventually make full batch retraining of the model from scratch computationally infeasible. Naively fine-tuning solely on the new data runs into the well-documented problem of catastrophic forgetting. Despite the fact that negative sampling is a crucial part of training with implicit feedback, no specialized technique exists that is tailored to the incremental learning framework. In this work, we propose a personalized negative reservoir strategy, which is used to obtain negative samples for the standard triplet loss of graph-based recommendation systems. Our technique balances alleviation of forgetting with plasticity by encouraging the model to remember stable user preferences and selectively forget when user interests change. We derive the mathematical formulation of a negative sampler to populate and update the reservoir. We integrate our design in three SOTA and commonly used incremental recommendation models. We show that these concrete realizations of our negative reservoir framework achieve state-of-the-art results for standard benchmarks using multiple top-k evaluation metrics.
