Adapting Large Language Models by Integrating Collaborative Semantics for Recommendation
Bowen Zheng, Yupeng Hou, Hongyu Lu, Yu Chen, Wayne Xin Zhao, Ming Chen, Ji-Rong Wen
TL;DR
This work tackles the gap between language semantics in LLMs and collaborative semantics in recommender systems by introducing LC-Rec, which learns discrete, tree-structured item indices via a Residual-Quantized VAE with uniform semantic mapping. It then trains LLMs with a suite of alignment tasks—sequential item prediction, explicit index-language alignment, and implicit recommendation-oriented alignment—to fuse language and collaborative information for end-to-end item generation over the full item set. Empirical results across three real-world Amazon subsets show LC-Rec achieving state-of-the-art full-ranking performance, with significant gains over strong baselines and insightful ablations highlighting the value of the indexing and tuning strategies. The approach enables autoregressive, candidate-free recommendation while maintaining semantic coherence and offering practical efficiency improvements, suggesting strong potential for scalable, language-enhanced recommender systems and future multi-turn chat extensions.
Abstract
Recently, large language models (LLMs) have shown great potential in recommender systems, either improving existing recommendation models or serving as the backbone. However, there exists a large semantic gap between LLMs and recommender systems, since items to be recommended are often indexed by discrete identifiers (item ID) out of the LLM's vocabulary. In essence, LLMs capture language semantics while recommender systems imply collaborative semantics, making it difficult to sufficiently leverage the model capacity of LLMs for recommendation. To address this challenge, in this paper, we propose a new LLM-based recommendation model called LC-Rec, which can better integrate language and collaborative semantics for recommender systems. Our approach can directly generate items from the entire item set for recommendation, without relying on candidate items. Specifically, we make two major contributions in our approach. For item indexing, we design a learning-based vector quantization method with uniform semantic mapping, which can assign meaningful and non-conflicting IDs (called item indices) for items. For alignment tuning, we propose a series of specially designed tuning tasks to enhance the integration of collaborative semantics in LLMs. Our fine-tuning tasks enforce LLMs to deeply integrate language and collaborative semantics (characterized by the learned item indices), so as to achieve an effective adaptation to recommender systems. Extensive experiments demonstrate the effectiveness of our method, showing that our approach can outperform a number of competitive baselines including traditional recommenders and existing LLM-based recommenders. Our code is available at https://github.com/RUCAIBox/LC-Rec/.
