Unifying Generative and Dense Retrieval for Sequential Recommendation
Liu Yang, Fabian Paischer, Kaveh Hassani, Jiacheng Li, Shuai Shao, Zhang Gabriel Li, Yun He, Xue Feng, Nima Noorshams, Sem Park, Bo Long, Robert D Nowak, Xiaoli Gao, Hamid Eghbalzadeh
TL;DR
The paper addresses the trade-offs between generative retrieval and sequential dense retrieval for sequential recommendations, revealing a performance gap especially for cold-start items and proposing LIGER, a hybrid model that fuses dense retrieval with semantic-ID-based generative generation. LIGER inputs semantic IDs and item text representations, optimizing both a cosine-alignment objective and next-token prediction for semantic IDs, and augments generative candidates with cold-start items for re-ranking. Across four small-scale benchmarks, LIGER narrows the gap to dense retrieval and enhances cold-start recommendations, while highlighting the practical storage and inference efficiency benefits of the hybrid approach. The findings offer a pathway toward robust, scalable hybrid retrievers for real-world recommendation systems, with clear directions for improving cold-start generation and evaluating at larger scales.
Abstract
Sequential dense retrieval models utilize advanced sequence learning techniques to compute item and user representations, which are then used to rank relevant items for a user through inner product computation between the user and all item representations. However, this approach requires storing a unique representation for each item, resulting in significant memory requirements as the number of items grow. In contrast, the recently proposed generative retrieval paradigm offers a promising alternative by directly predicting item indices using a generative model trained on semantic IDs that encapsulate items' semantic information. Despite its potential for large-scale applications, a comprehensive comparison between generative retrieval and sequential dense retrieval under fair conditions is still lacking, leaving open questions regarding performance, and computation trade-offs. To address this, we compare these two approaches under controlled conditions on academic benchmarks and propose LIGER (LeveragIng dense retrieval for GEnerative Retrieval), a hybrid model that combines the strengths of these two widely used methods. LIGER integrates sequential dense retrieval into generative retrieval, mitigating performance differences and enhancing cold-start item recommendation in the datasets evaluated. This hybrid approach provides insights into the trade-offs between these approaches and demonstrates improvements in efficiency and effectiveness for recommendation systems in small-scale benchmarks.
