Sparse Meets Dense: Unified Generative Recommendations with Cascaded Sparse-Dense Representations

Yuhao Yang; Zhi Ji; Zhaopeng Li; Yi Li; Zhonglin Mo; Yue Ding; Kai Chen; Zijian Zhang; Jie Li; Shuanglong Li; Lin Liu

Sparse Meets Dense: Unified Generative Recommendations with Cascaded Sparse-Dense Representations

Yuhao Yang, Zhi Ji, Zhaopeng Li, Yi Li, Zhonglin Mo, Yue Ding, Kai Chen, Zijian Zhang, Jie Li, Shuanglong Li, Lin Liu

TL;DR

COBRA tackles the mismatch between generative and dense retrieval in recommender systems by cascading sparse semantic IDs with learnable dense vectors. It alternates between generating sparse IDs and refined dense representations within a Transformer-based architecture and trains end-to-end with a dual objective, enabling dynamic representation refinement. A coarse-to-fine generation process, augmented by BeamFusion, yields high-precision and diverse recommendations, demonstrated through extensive public benchmarks, industrial-scale offline evaluations, and online A/B tests on a platform with hundreds of millions of users. The reported gains in recall, NDCG, and online metrics establish COBRA as a scalable, practical approach for unified generative and dense retrieval in large-scale recommendation systems.

Abstract

Generative models have recently gained attention in recommendation systems by directly predicting item identifiers from user interaction sequences. However, existing methods suffer from significant information loss due to the separation of stages such as quantization and sequence modeling, hindering their ability to achieve the modeling precision and accuracy of sequential dense retrieval techniques. Integrating generative and dense retrieval methods remains a critical challenge. To address this, we introduce the Cascaded Organized Bi-Represented generAtive retrieval (COBRA) framework, which innovatively integrates sparse semantic IDs and dense vectors through a cascading process. Our method alternates between generating these representations by first generating sparse IDs, which serve as conditions to aid in the generation of dense vectors. End-to-end training enables dynamic refinement of dense representations, capturing both semantic insights and collaborative signals from user-item interactions. During inference, COBRA employs a coarse-to-fine strategy, starting with sparse ID generation and refining them into dense vectors via the generative model. We further propose BeamFusion, an innovative approach combining beam search with nearest neighbor scores to enhance inference flexibility and recommendation diversity. Extensive experiments on public datasets and offline tests validate our method's robustness. Online A/B tests on a real-world advertising platform with over 200 million daily users demonstrate substantial improvements in key metrics, highlighting COBRA's practical advantages.

Sparse Meets Dense: Unified Generative Recommendations with Cascaded Sparse-Dense Representations

TL;DR

Abstract

Sparse Meets Dense: Unified Generative Recommendations with Cascaded Sparse-Dense Representations

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (6)