Preference Discerning with LLM-Enhanced Generative Retrieval
Fabian Paischer, Liu Yang, Linfeng Liu, Shuai Shao, Kaveh Hassani, Jiacheng Li, Ricky Chen, Zhang Gabriel Li, Xiaoli Gao, Wei Shao, Xue Feng, Nima Noorshams, Sem Park, Bo Long, Hamid Eghbalzadeh
TL;DR
The paper introduces preference discerning, a paradigm that conditionally steers sequential recommendations using user preferences expressed in natural language. It decomposes the process into preference approximation via LLMs and preference conditioning within a multimodal generative retrieval model called Mender, which fuses semantic IDs with language-based context. A holistic benchmark across five steerability axes demonstrates that current generative retrievers struggle with dynamic adaptation, while Mender achieves state-of-the-art performance on several axes, including recommendation and fine-grained steering, especially when paired with larger language encoders. The work provides insights into how textual preferences can guide recommendations without retraining, highlights challenges in sentiment following, and outlines future avenues for scalable, language-aware, and neutral steering in recommender systems.
Abstract
In sequential recommendation, models recommend items based on user's interaction history. To this end, current models usually incorporate information such as item descriptions and user intent or preferences. User preferences are usually not explicitly given in open-source datasets, and thus need to be approximated, for example via large language models (LLMs). Current approaches leverage approximated user preferences only during training and rely solely on the past interaction history for recommendations, limiting their ability to dynamically adapt to changing preferences, potentially reinforcing echo chambers. To address this issue, we propose a new paradigm, namely preference discerning, which explicitly conditions a generative recommendation model on user preferences in natural language within its context. To evaluate preference discerning, we introduce a novel benchmark that provides a holistic evaluation across various scenarios, including preference steering and sentiment following. Upon evaluating current state-of-the-art methods on our benchmark, we discover that their ability to dynamically adapt to evolving user preferences is limited. To address this, we propose a new method named Mender ($\textbf{M}$ultimodal Prefer$\textbf{en}$ce $\textbf{D}$iscern$\textbf{er}$), which achieves state-of-the-art performance in our benchmark. Our results show that Mender effectively adapts its recommendation guided by human preferences, even if not observed during training, paving the way toward more flexible recommendation models.
