Table of Contents
Fetching ...

Handling Large-scale Cardinality in building recommendation systems

Dhruva Dixith Kurra, Bo Ling, Chun Zh, Seyedshahin Ashrafzadeh

TL;DR

The paper tackles the scalability of high-cardinality UUID features in the retrieval phase of large-scale recommenders by introducing a Bag-of-Words (BoW) proxy for $eater extunderscore uuid$ and a layer-sharing scheme between the two-tower embedding components. These methods substantially reduce embedding size (approximately 25x) and accelerate training, while boosting recall—evidenced by Recall@500 improvements and faster convergence (e.g., ~2000 steps to threshold vs ~20000 for the baseline BoW). The approach leverages historical user behavior via store proxies ($store extunderscore uuids$) and enables information exchange across towers to capture nuanced user-item relations. Supported by offline and online Uber Eats experiments, the work demonstrates practical gains in retrieval stage efficiency and performance, with statistical significance ($p$-value < 5%), suggesting broad applicability to high-cardinality features in large-scale recommender systems.

Abstract

Effective recommendation systems rely on capturing user preferences, often requiring incorporating numerous features such as universally unique identifiers (UUIDs) of entities. However, the exceptionally high cardinality of UUIDs poses a significant challenge in terms of model degradation and increased model size due to sparsity. This paper presents two innovative techniques to address the challenge of high cardinality in recommendation systems. Specifically, we propose a bag-of-words approach, combined with layer sharing, to substantially decrease the model size while improving performance. Our techniques were evaluated through offline and online experiments on Uber use cases, resulting in promising results demonstrating our approach's effectiveness in optimizing recommendation systems and enhancing their overall performance.

Handling Large-scale Cardinality in building recommendation systems

TL;DR

The paper tackles the scalability of high-cardinality UUID features in the retrieval phase of large-scale recommenders by introducing a Bag-of-Words (BoW) proxy for and a layer-sharing scheme between the two-tower embedding components. These methods substantially reduce embedding size (approximately 25x) and accelerate training, while boosting recall—evidenced by Recall@500 improvements and faster convergence (e.g., ~2000 steps to threshold vs ~20000 for the baseline BoW). The approach leverages historical user behavior via store proxies () and enables information exchange across towers to capture nuanced user-item relations. Supported by offline and online Uber Eats experiments, the work demonstrates practical gains in retrieval stage efficiency and performance, with statistical significance (-value < 5%), suggesting broad applicability to high-cardinality features in large-scale recommender systems.

Abstract

Effective recommendation systems rely on capturing user preferences, often requiring incorporating numerous features such as universally unique identifiers (UUIDs) of entities. However, the exceptionally high cardinality of UUIDs poses a significant challenge in terms of model degradation and increased model size due to sparsity. This paper presents two innovative techniques to address the challenge of high cardinality in recommendation systems. Specifically, we propose a bag-of-words approach, combined with layer sharing, to substantially decrease the model size while improving performance. Our techniques were evaluated through offline and online experiments on Uber use cases, resulting in promising results demonstrating our approach's effectiveness in optimizing recommendation systems and enhancing their overall performance.
Paper Structure (10 sections, 1 equation, 5 figures, 1 table)

This paper contains 10 sections, 1 equation, 5 figures, 1 table.

Figures (5)

  • Figure 1: Uber Eats recommendation on mobile app and Website.
  • Figure 2: Simplified BoW with layer sharing architecture
  • Figure 3: Bag of Words: Using a userś order history with specific stores as a proxy for the user, effectively reducing the dimensionality from user_id (several hundreds of million scale) to store_id (million scale)
  • Figure 4: Comparison of Validation Metrics between Baseline Models and BoW Models at lower hit rates{5, 20}. The BoW models outperform the baseline models by a significant margin, as indicated by the hit rate@{5, 20} values
  • Figure 5: BoW model with layer sharing converges faster (2000 steps) to hit rate@20 threshold of 0.35 compared to simple BoW model (20k steps), indicating improved training speed and performance.