Table of Contents
Fetching ...

Continuous Input Embedding Size Search For Recommender Systems

Yunke Qu, Tong Chen, Xiangyu Zhao, Lizhen Cui, Kai Zheng, Hongzhi Yin

TL;DR

Latent factor recommender systems rely on large embedding tables, which are memory-intensive when embedding sizes are uniform across users and items. This work introduces CIESS, a reinforcement learning framework that searches embedding sizes in a continuous interval for each user/item, using masked embeddings and a random-walk enhanced TD3 policy to navigate a vast action space efficiently. CIESS is model-agnostic and demonstrates state-of-the-art performance under strict memory budgets on two real-world datasets when paired with three popular backbones, while enabling selective retraining over a set of top masks. The approach offers practical impact by enabling highly memory-efficient yet accurate recommendations, with potential extensions to streaming and AutoML pipelines for recommender systems.

Abstract

Latent factor models are the most popular backbones for today's recommender systems owing to their prominent performance. Latent factor models represent users and items as real-valued embedding vectors for pairwise similarity computation, and all embeddings are traditionally restricted to a uniform size that is relatively large (e.g., 256-dimensional). With the exponentially expanding user base and item catalog in contemporary e-commerce, this design is admittedly becoming memory-inefficient. To facilitate lightweight recommendation, reinforcement learning (RL) has recently opened up opportunities for identifying varying embedding sizes for different users/items. However, challenged by search efficiency and learning an optimal RL policy, existing RL-based methods are restricted to highly discrete, predefined embedding size choices. This leads to a largely overlooked potential of introducing finer granularity into embedding sizes to obtain better recommendation effectiveness under a given memory budget. In this paper, we propose continuous input embedding size search (CIESS), a novel RL-based method that operates on a continuous search space with arbitrary embedding sizes to choose from. In CIESS, we further present an innovative random walk-based exploration strategy to allow the RL policy to efficiently explore more candidate embedding sizes and converge to a better decision. CIESS is also model-agnostic and hence generalizable to a variety of latent factor RSs, whilst experiments on two real-world datasets have shown state-of-the-art performance of CIESS under different memory budgets when paired with three popular recommendation models.

Continuous Input Embedding Size Search For Recommender Systems

TL;DR

Latent factor recommender systems rely on large embedding tables, which are memory-intensive when embedding sizes are uniform across users and items. This work introduces CIESS, a reinforcement learning framework that searches embedding sizes in a continuous interval for each user/item, using masked embeddings and a random-walk enhanced TD3 policy to navigate a vast action space efficiently. CIESS is model-agnostic and demonstrates state-of-the-art performance under strict memory budgets on two real-world datasets when paired with three popular backbones, while enabling selective retraining over a set of top masks. The approach offers practical impact by enabling highly memory-efficient yet accurate recommendations, with potential extensions to streaming and AutoML pipelines for recommender systems.

Abstract

Latent factor models are the most popular backbones for today's recommender systems owing to their prominent performance. Latent factor models represent users and items as real-valued embedding vectors for pairwise similarity computation, and all embeddings are traditionally restricted to a uniform size that is relatively large (e.g., 256-dimensional). With the exponentially expanding user base and item catalog in contemporary e-commerce, this design is admittedly becoming memory-inefficient. To facilitate lightweight recommendation, reinforcement learning (RL) has recently opened up opportunities for identifying varying embedding sizes for different users/items. However, challenged by search efficiency and learning an optimal RL policy, existing RL-based methods are restricted to highly discrete, predefined embedding size choices. This leads to a largely overlooked potential of introducing finer granularity into embedding sizes to obtain better recommendation effectiveness under a given memory budget. In this paper, we propose continuous input embedding size search (CIESS), a novel RL-based method that operates on a continuous search space with arbitrary embedding sizes to choose from. In CIESS, we further present an innovative random walk-based exploration strategy to allow the RL policy to efficiently explore more candidate embedding sizes and converge to a better decision. CIESS is also model-agnostic and hence generalizable to a variety of latent factor RSs, whilst experiments on two real-world datasets have shown state-of-the-art performance of CIESS under different memory budgets when paired with three popular recommendation models.
Paper Structure (23 sections, 16 equations, 5 figures, 2 tables, 1 algorithm)

This paper contains 23 sections, 16 equations, 5 figures, 2 tables, 1 algorithm.

Figures (5)

  • Figure 1: An overarching view of CIESS.
  • Figure 2: The random walk-based embedding size exploration, where raw actions are $\hat{d}^i_n$ predicted by the actor.
  • Figure 3: The average reward score and action (embedding size) of CIESS in each training episode on MovieLens-1M (a) and Yelp2018 (b). LightGCN is used as the base recommender.
  • Figure 4: Sensitivity analysis w.r.t. $\lambda$. LightGCN is used as the base recommender.
  • Figure 5: Sensitivity analysis w.r.t. $M$. LightGCN is used as the base recommender. $c\in\{90\%,95\%\}$ is reached only in later episodes on MovieLens-1M.