Generalized User Representations for Transfer Learning

Ghazal Fazelnia; Sanket Gupta; Claire Keum; Mark Koh; Ian Anderson; Mounia Lalmas

Generalized User Representations for Transfer Learning

Ghazal Fazelnia, Sanket Gupta, Claire Keum, Mark Koh, Ian Anderson, Mounia Lalmas

TL;DR

A novel framework for user representation in large-scale recommender systems, aiming at effectively representing diverse user taste in a generalized manner, using a two-stage methodology combining representation learning and transfer learning.

Abstract

We present a novel framework for user representation in large-scale recommender systems, aiming at effectively representing diverse user taste in a generalized manner. Our approach employs a two-stage methodology combining representation learning and transfer learning. The representation learning model uses an autoencoder that compresses various user features into a representation space. In the second stage, downstream task-specific models leverage user representations via transfer learning instead of curating user features individually. We further augment this methodology on the representation's input features to increase flexibility and enable reaction to user events, including new user experiences, in Near-Real Time. Additionally, we propose a novel solution to manage deployment of this framework in production models, allowing downstream models to work independently. We validate the performance of our framework through rigorous offline and online experiments within a large-scale system, showcasing its remarkable efficacy across multiple evaluation tasks. Finally, we show how the proposed framework can significantly reduce infrastructure costs compared to alternative approaches.

Generalized User Representations for Transfer Learning

TL;DR

Abstract

Paper Structure (40 sections, 2 equations, 8 figures, 4 tables, 1 algorithm)

This paper contains 40 sections, 2 equations, 8 figures, 4 tables, 1 algorithm.

Introduction
Related Works
Approach
Notations
Architecture
Modality Encoders
Input Features
Autoencoder for Core User Representation
Model Implementation
Training Dataset
Input features
Model Details and Retrain Cadence
Transfer Learning Methodology
Use Cases
Classification models
...and 25 more sections

Figures (8)

Figure 1: User Representation Model Architecture. Catalog interactions are embedded via pre-trained modality encoders to serve as input to an autoencoder model. The latent representation, $z_u$, serves as the User Representation for disparate downstream recommendation tasks.
Figure 2: Embedding outputs of music modality encoders are aggregated over multiple time horizons.
Figure 3: Music modality encoders as well as podcast modality encoders can feed into the user representation.
Figure 4: Downstream model tasks before (left) and after (right) incorporating transfer learning with a user representation.
Figure 5: Near-Real Time Inference of User Representations. Cold-start user activations and user interactions trigger inferences, allowing for smooth updates to reflect the latest information.
...and 3 more figures

Generalized User Representations for Transfer Learning

TL;DR

Abstract

Generalized User Representations for Transfer Learning

Authors

TL;DR

Abstract

Table of Contents

Figures (8)