Unified Generative Search and Recommendation
Teng Shi, Jun Xu, Xiao Zhang, Xiaoxue Zang, Kai Zheng, Yang Song, Enyun Yu
TL;DR
GenSAR addresses the mutual interference between search and recommendation by introducing a unified generative retrieval framework that encodes items with dual semantic and collaborative identifiers. It uses residual quantization with shared and task-specific codebooks, coupled with behavior-aware interactions and a suite of task-specific training objectives (NRIP, NSQP, NSIP) and identifier-language alignment to guide the model. Empirical results on Amazon and a commercial dataset show GenSAR achieving state-of-the-art performance on both tasks, outperforming traditional joint models and other generative baselines, while maintaining efficiency with a compact backbone (T5-small). This approach demonstrates how balancing semantic and collaborative signals within an LLM-based generation framework can mitigate the search-recommendation trade-off and improve overall user experience in real platforms.
Abstract
Modern commercial platforms typically offer both search and recommendation functionalities to serve diverse user needs, making joint modeling of these tasks an appealing direction. While prior work has shown that integrating search and recommendation can be mutually beneficial, it also reveals a performance trade-off: enhancements in one task often come at the expense of the other. This challenge arises from their distinct information requirements: search emphasizes semantic relevance between queries and items, whereas recommendation depends more on collaborative signals among users and items. Effectively addressing this trade-off requires tackling two key problems: (1) integrating both semantic and collaborative signals into item representations, and (2) guiding the model to distinguish and adapt to the unique demands of search and recommendation. The emergence of generative retrieval with Large Language Models (LLMs) presents new possibilities. This paradigm encodes items as identifiers and frames both search and recommendation as sequential generation tasks, offering the flexibility to leverage multiple identifiers and task-specific prompts. In light of this, we introduce GenSAR, a unified generative framework for balanced search and recommendation. Our approach designs dual-purpose identifiers and tailored training strategies to incorporate complementary signals and align with task-specific objectives. Experiments on both public and commercial datasets demonstrate that GenSAR effectively reduces the trade-off and achieves state-of-the-art performance on both tasks.
