Towards Scalable Semantic Representation for Recommendation

Taolin Zhang; Junwei Pan; Jinpeng Wang; Yaohua Zha; Tao Dai; Bin Chen; Ruisheng Luo; Xiaoxiang Deng; Yuan Wang; Ming Yue; Jie Jiang; Shu-Tao Xia

TL;DR

This work tackles the challenge of transferring rich semantic information from high-dimensional LLM embeddings to low-dimensional recommendation ID spaces. It introduces Mixture-of-Codes (MoC), a two-stage framework that uses multiple parallel codebooks to quantize LLM embeddings and a downstream fusion network to implicitly combine the resulting Semantic IDs for recommendation tasks. Empirical results across three Amazon domains and multiple CTR models show that MoC outperforms single-code and hierarchical baselines in terms of discriminability and dimension robustness, with clear scaling advantages as the representation size increases. The approach enables scalable, robust semantic representations for recommendations, offering improvements in both predictive performance and information preservation when expanding semantic representation capacity.

Abstract

With recent advances in large language models (LLMs), a growing body of research has developed Semantic IDs based on LLMs to enhance the performance of recommendation systems. However, the dimension of these embeddings needs to match that of the ID embeddings in recommendation, which is usually much smaller than the original LLM embedding dimension. Such dimension compression inevitably degrades the discriminability and dimension robustness of the LLM embeddings, which motivates us to scale up the semantic representation. In this paper, we propose Mixture-of-Codes, which first constructs multiple independent codebooks for the LLM representation in the indexing stage, and then uses the resulting semantic representation together with a fusion module in the downstream recommendation stage. Extensive analysis and experiments demonstrate that our method scales better in both discriminability and dimension robustness, leading to the best scale-up performance in recommendation.
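
The two stages described above can be illustrated with a minimal sketch. It is not the authors' implementation: the codebooks here are random stand-ins for learned ones, the fusion step is a plain linear map over concatenated ID embeddings, and every name and size (`nearest_code`, `semantic_ids`, `fuse`, `LLM_DIM`, `ID_DIM`, and so on) is hypothetical. The abstract does not specify how the codebooks are trained or how the fusion module is built.

```python
import random


def nearest_code(vector, codebook):
    """Return the index of the codeword closest to `vector` (squared Euclidean)."""
    def sq_dist(code):
        return sum((v - c) ** 2 for v, c in zip(vector, code))
    return min(range(len(codebook)), key=lambda i: sq_dist(codebook[i]))


def semantic_ids(vector, codebooks):
    """Indexing stage: one ID per codebook, each codebook queried independently."""
    return [nearest_code(vector, cb) for cb in codebooks]


def fuse(ids, id_tables, weights):
    """Downstream stage (assumed form): look up a low-dimensional embedding
    per ID, concatenate them, and apply a linear map."""
    concat = [x for table, i in zip(id_tables, ids) for x in table[i]]
    return [sum(w * x for w, x in zip(row, concat)) for row in weights]


def random_matrix(rng, rows, cols):
    """Helper: rows x cols matrix of standard-normal samples."""
    return [[rng.gauss(0.0, 1.0) for _ in range(cols)] for _ in range(rows)]


# Hypothetical sizes: a 16-d "LLM" embedding, 4-d ID embeddings, 3 codebooks of 8 codes.
rng = random.Random(0)
LLM_DIM, ID_DIM, N_CODEBOOKS, CODEBOOK_SIZE = 16, 4, 3, 8

codebooks = [random_matrix(rng, CODEBOOK_SIZE, LLM_DIM) for _ in range(N_CODEBOOKS)]
id_tables = [random_matrix(rng, CODEBOOK_SIZE, ID_DIM) for _ in range(N_CODEBOOKS)]
fusion_weights = random_matrix(rng, ID_DIM, N_CODEBOOKS * ID_DIM)

item_embedding = [rng.gauss(0.0, 1.0) for _ in range(LLM_DIM)]
ids = semantic_ids(item_embedding, codebooks)      # one Semantic ID per codebook
item_repr = fuse(ids, id_tables, fusion_weights)   # ID_DIM-sized representation
```

In this sketch, adding a codebook grows the semantic representation by one more ID, while the fused output stays at the recommender's ID embedding size.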
