BPR: Bayesian Personalized Ranking from Implicit Feedback
Steffen Rendle, Christoph Freudenthaler, Zeno Gantner, Lars Schmidt-Thieme
TL;DR
This paper tackles personalized ranking from implicit feedback by deriving a Bayesian posterior objective (BPR-Opt) that directly optimizes pairwise item preferences for each user. It introduces LearnBPR, a bootstrap-SGD algorithm, and demonstrates how to apply it to matrix factorization and adaptive kNN models, yielding superior AUC-based ranking performance over traditional MF and kNN learning methods. The work also situates BPR within the broader landscape of WR-MF and MMMF, showing that pairwise, probabilistic ranking optimization better captures the goal of recommendation than item-score regression. The proposed approach offers a scalable, principled framework for optimizing rankings in implicit-feedback settings with strong practical impact for recommender systems.
Abstract
Item recommendation is the task of predicting a personalized ranking on a set of items (e.g. websites, movies, products). In this paper, we investigate the most common scenario with implicit feedback (e.g. clicks, purchases). There are many methods for item recommendation from implicit feedback like matrix factorization (MF) or adaptive knearest-neighbor (kNN). Even though these methods are designed for the item prediction task of personalized ranking, none of them is directly optimized for ranking. In this paper we present a generic optimization criterion BPR-Opt for personalized ranking that is the maximum posterior estimator derived from a Bayesian analysis of the problem. We also provide a generic learning algorithm for optimizing models with respect to BPR-Opt. The learning method is based on stochastic gradient descent with bootstrap sampling. We show how to apply our method to two state-of-the-art recommender models: matrix factorization and adaptive kNN. Our experiments indicate that for the task of personalized ranking our optimization method outperforms the standard learning techniques for MF and kNN. The results show the importance of optimizing models for the right criterion.
