Reindex-Then-Adapt: Improving Large Language Models for Conversational Recommendation
Zhankui He, Zhouhang Xie, Harald Steck, Dawen Liang, Rahul Jha, Nathan Kallus, Julian McAuley
TL;DR
This work tackles distribution misalignment in conversational recommender systems using large language models by introducing Reindex-Then-Adapt (RTA). RTA first reindexes multi-token item titles into single-token embeddings, enabling efficient logit computation, and then adapts the resulting distributions through bias-term adjustments or gating with traditional RecSys models to align with target data distributions. By treating LLMs as Differentiable Search Indexes and leveraging both L2I/L2R content knowledge and conventional recommendations, the approach achieves substantial accuracy gains across INSPIRED, ReDIAL, and Reddit-V1.5 datasets with favorable efficiency and flexibility characteristics. The results show that simple, dataset-aware adaptation strategies can meaningfully improve CRS performance, demonstrating practical impact in aligning LLM-driven recommendations with dynamic platform distributions and enabling controllability and fairness in recommendations.
Abstract
Large language models (LLMs) are revolutionizing conversational recommender systems by adeptly indexing item content, understanding complex conversational contexts, and generating relevant item titles. However, controlling the distribution of recommended items remains a challenge. This leads to suboptimal performance due to the failure to capture rapidly changing data distributions, such as item popularity, on targeted conversational recommendation platforms. In conversational recommendation, LLMs recommend items by generating the titles (as multiple tokens) autoregressively, making it difficult to obtain and control the recommendations over all items. Thus, we propose a Reindex-Then-Adapt (RTA) framework, which converts multi-token item titles into single tokens within LLMs, and then adjusts the probability distributions over these single-token item titles accordingly. The RTA framework marries the benefits of both LLMs and traditional recommender systems (RecSys): understanding complex queries as LLMs do; while efficiently controlling the recommended item distributions in conversational recommendations as traditional RecSys do. Our framework demonstrates improved accuracy metrics across three different conversational recommendation datasets and two adaptation settings
