ImplicitSLIM and How it Improves Embedding-based Collaborative Filtering
Ilya Shenbin, Sergey Nikolenko
TL;DR
ImplicitSLIM addresses the challenge of improving embeddings for sparse, high-dimensional implicit-feedback data in collaborative filtering by deriving embeddings from SLIM-like item-item interactions using a memory-efficient, unsupervised approach.The method fuses ideas from EASE and Locally Linear Embeddings, introducing a closed-form second-step update via an unconstrained objective and leveraging Woodbury identities to avoid large inversions, with an auxiliary matrix A set to the embedding matrix Q.It serves as a versatile tool for initialization and regularization across MF, PLRec, VAEs, and graph-based CF models, and demonstrates significant performance and convergence advantages, including state-of-the-art results when combined with RecVAE and H+Vamp(Gated) on MovieLens-20M and Netflix Prize.Overall, ImplicitSLIM provides a generic, scalable mechanism to enrich and stabilize embeddings in a wide range of embedding-based recommender systems, with measurable gains in accuracy and efficiency.
Abstract
We present ImplicitSLIM, a novel unsupervised learning approach for sparse high-dimensional data, with applications to collaborative filtering. Sparse linear methods (SLIM) and their variations show outstanding performance, but they are memory-intensive and hard to scale. ImplicitSLIM improves embedding-based models by extracting embeddings from SLIM-like models in a computationally cheap and memory-efficient way, without explicit learning of heavy SLIM-like models. We show that ImplicitSLIM improves performance and speeds up convergence for both state of the art and classical collaborative filtering methods. The source code for ImplicitSLIM, related models, and applications is available at https://github.com/ilya-shenbin/ImplicitSLIM.
