Pfeed: Generating near real-time personalized feeds using precomputed embedding similarities

Binyam Gebre; Karoliina Ranta; Stef van den Elzen; Ernst Kuiper; Thijs Baars; Tom Heskes

Pfeed: Generating near real-time personalized feeds using precomputed embedding similarities

Binyam Gebre, Karoliina Ranta, Stef van den Elzen, Ernst Kuiper, Thijs Baars, Tom Heskes

TL;DR

This paper proposes a method that dynamically updates customer profiles and composes a feed every two minutes, employing precomputed embeddings and their respective similarities, and enhanced customer engagement and experience.

Abstract

In personalized recommender systems, embeddings are often used to encode customer actions and items, and retrieval is then performed in the embedding space using approximate nearest neighbor search. However, this approach can lead to two challenges: 1) user embeddings can restrict the diversity of interests captured and 2) the need to keep them up-to-date requires an expensive, real-time infrastructure. In this paper, we propose a method that overcomes these challenges in a practical, industrial setting. The method dynamically updates customer profiles and composes a feed every two minutes, employing precomputed embeddings and their respective similarities. We tested and deployed this method to personalise promotional items at Bol, one of the largest e-commerce platforms of the Netherlands and Belgium. The method enhanced customer engagement and experience, leading to a significant 4.9% uplift in conversions.

Pfeed: Generating near real-time personalized feeds using precomputed embedding similarities

TL;DR

Abstract

Paper Structure (47 sections, 2 equations, 5 figures, 8 tables)

This paper contains 47 sections, 2 equations, 5 figures, 8 tables.

Introduction
Four Challenges in Personalized Feed Systems
Customer representation challenge
Item representation challenge
Candidate retrieval challenge
Ranking challenge
Our Contributions
Single vector representation bottleneck
High infrastructure and maintenance costs
Related Work
Customer Representation Challenge
Item Representation Challenge
Candidate Retrieval Challenge
Training challenge
Inference challenge
...and 32 more sections

Figures (5)

Figure 1: User-to-item framework: Single vectors from the user encoder limit representation and interpretability. Keeping them fresh demands high-maintenance infrastructure.
Figure 2: Query-to-item framework: Query embeddings and their similarities are precomputed. Users are represented by a dynamic set of queries that can be updated as needed.
Figure 3: The major steps involved in generating near real-time personalized recommendations
Figure 4: The SIMO (Single Input Multi Ouput) embedding model generates three embeddings per item in one model run using three special tokens: [Q_V], [Q_B], and [TGT].
Figure 5: Inputs to dual SIMO encoders: the query encoder takes in the metadata of the query item and generates three embeddings and the target encoder takes in the metadata of the target item and generates three embeddings. During training, the loss is determined by the target embedding derived from the target item $t_i$ encoder and pairing it with a query embedding from the query item $q_i$ encoder, selected by the relation $r_i$ indicator.

Pfeed: Generating near real-time personalized feeds using precomputed embedding similarities

TL;DR

Abstract

Pfeed: Generating near real-time personalized feeds using precomputed embedding similarities

Authors

TL;DR

Abstract

Table of Contents

Figures (5)