UNGER: Generative Recommendation with A Unified Code via Semantic and Collaborative Integration

Longtao Xiao; Haozhao Wang; Cheng Wang; Linfei Ji; Yifan Wang; Jieming Zhu; Zhenhua Dong; Rui Zhang; Ruixuan Li

TL;DR

The paper introduces UNGER, a generative recommender that fuses semantic and collaborative knowledge into a single unified code (Unicodes) to enable efficient autoregressive item generation. It tackles the semantic-dominance issue with a modality-adaptation layer, cross-modality alignment, and intra-modality distillation across two stages: Stage I learns integrated embeddings and discretizes them into unicodes; Stage II decodes user histories into unicode sequences with a distillation signal to recover information lost in quantization. Empirical results on three benchmarks show UNGER achieving state-of-the-art performance while reducing storage and improving inference speed compared with dual-code methods, and analyses reveal favorable scaling properties and robust hyper-parameter behavior. The approach offers a practical, extensible framework for unified multimodal representations in generative recommendation, with interpretable discrete codes that capture cross-modal concepts and user intent.

Abstract

With the rise of generative paradigms, generative recommendation has garnered increasing attention. Its core component is the item code, generally derived by quantizing collaborative or semantic representations so that the codes can serve as identifiers of candidate items in the context. However, existing methods typically construct separate codes for each modality, which raises computational and storage costs and hinders the integration of their complementary strengths. Given this limitation, we seek to integrate the two modalities into a unified code that fully exploits their complementary nature. The integration remains challenging, however: an integrated embedding obtained by simple concatenation underutilizes collaborative knowledge and therefore yields limited effectiveness. To address this, we propose a novel method, named UNGER, which integrates semantic and collaborative knowledge into a unified code for generative recommendation. Specifically, we adaptively learn an integrated embedding through the joint optimization of cross-modality knowledge alignment and next-item prediction tasks. Subsequently, to mitigate the information loss caused by the quantization process, we introduce an intra-modality knowledge distillation task that uses the integrated embeddings as supervision signals to compensate. Extensive experiments on three widely used benchmarks demonstrate the superiority of our approach over existing methods.
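To make the notions of an item code and of quantization loss concrete, the following is a minimal sketch and not the authors' implementation. It assumes a residual quantizer with fixed, hand-picked codebooks; the abstract does not state which quantizer UNGER uses, and the names `nearest_code`, `quantize`, and `reconstruct` are hypothetical. The leftover residual illustrates the kind of information that a distillation signal from the integrated embedding is meant to compensate for.

```python
import math

def nearest_code(vector, codebook):
    """Return the index of the codebook entry closest to `vector` (squared L2)."""
    best_index, best_dist = -1, math.inf
    for index, centroid in enumerate(codebook):
        dist = sum((v - c) ** 2 for v, c in zip(vector, centroid))
        if dist < best_dist:
            best_index, best_dist = index, dist
    return best_index

def quantize(embedding, codebooks):
    """Assumed residual quantization: one code index per level.

    Returns the discrete code (a tuple of indices) and the residual
    that the code fails to capture, i.e. the quantization loss.
    """
    residual = list(embedding)
    code = []
    for codebook in codebooks:
        index = nearest_code(residual, codebook)
        code.append(index)
        # Subtract the chosen centroid; the next level encodes what is left.
        residual = [r - c for r, c in zip(residual, codebook[index])]
    return tuple(code), residual

def reconstruct(code, codebooks):
    """Sum the selected centroids to approximate the original embedding."""
    approx = [0.0] * len(codebooks[0][0])
    for index, codebook in zip(code, codebooks):
        approx = [a + c for a, c in zip(approx, codebook[index])]
    return approx

# Hypothetical two-level codebooks over 2-dimensional embeddings.
codebooks = [
    [[0.0, 0.0], [1.0, 1.0]],
    [[0.0, 0.0], [0.25, -0.25]],
]
code, residual = quantize([1.2, 0.8], codebooks)
print(code, reconstruct(code, codebooks), residual)
```

In this toy setup the item is identified by the tuple of indices alone, so a generative model only has to emit a short sequence of discrete tokens per item; the nonzero residual is what the discrete code cannot express.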
