Keyword-driven Retrieval-Augmented Large Language Models for Cold-start User Recommendations
Hai-Dang Kieu, Minh Duc Nguyen, Thanh-Son Nguyen, Dung D. Le
TL;DR
The paper tackles cold-start user recommendations by combining keyword-based profiles with a retrieval-augmented LLM re-ranking framework called KALM4REC. It first retrieves candidate items using a keyword-driven heterogeneous graph (MPG) with TF-IRF edge weights, then re-ranks the candidates with LLM prompts that encode user and item keywords and leverage few-shot examples. Experiments on Yelp and TripAdvisor show that MPG-based retrieval combined with LLM re-ranking improves recall and precision, while using keywords reduces token usage and maintains accuracy. The findings demonstrate the practical viability of in-context prompting and keyword-focused representations for enhancing recommender systems in data-sparse scenarios and across domains.
Abstract
Recent advancements in Large Language Models (LLMs) have shown significant potential in enhancing recommender systems. However, addressing the cold-start recommendation problem, where users lack historical data, remains a considerable challenge. In this paper, we introduce KALM4Rec (Keyword-driven Retrieval-Augmented Large Language Models for Cold-start User Recommendations), a novel framework specifically designed to tackle this problem by requiring only a few input keywords from users in a practical scenario of cold-start user restaurant recommendations. KALM4Rec operates in two main stages: candidates retrieval and LLM-based candidates re-ranking. In the first stage, keyword-driven retrieval models are used to identify potential candidates, addressing LLMs' limitations in processing extensive tokens and reducing the risk of generating misleading information. In the second stage, we employ LLMs with various prompting strategies, including zero-shot and few-shot techniques, to re-rank these candidates by integrating multiple examples directly into the LLM prompts. Our evaluation, using a Yelp restaurant dataset with user reviews from three English-speaking cities, shows that our proposed framework significantly improves recommendation quality. Specifically, the integration of in-context instructions with LLMs for re-ranking markedly enhances the performance of the cold-start user recommender system.
