PALR: Personalization Aware LLMs for Recommendation
Fan Yang, Zheng Chen, Ziyan Jiang, Eunah Cho, Xiaojiang Huang, Yanbin Lu
TL;DR
PALR introduces a personalization-aware framework that couples user history with retrieval-grounded LLM reasoning to generate recommendations. By fine-tuning a 7B LLaMA model on instruction-based tasks (Recommend and Recommend_Retrieval) and using a three-part natural language prompt plus a retrieval module, PALR achieves state-of-the-art performance on sequential recommendation benchmarks. The approach demonstrates the importance of explicit retrieval grounding and prompt design for LLM-based ranking, with strong inductive generalization from limited fine-tuning. The work suggests promising avenues for integrating LLMs into practical recommender systems with potential for explainable and conversational recommendations, while highlighting latency considerations for real-world deployment.
Abstract
Large language models (LLMs) have recently received significant attention for their exceptional capabilities. Despite extensive efforts in developing general-purpose LLMs that can be utilized in various natural language processing (NLP) tasks, there has been less research exploring their potential in recommender systems. In this paper, we propose a novel framework, named PALR, which aiming to combine user history behaviors (such as clicks, purchases, ratings, etc.) with LLMs to generate user preferred items. Specifically, we first use user/item interactions as guidance for candidate retrieval. Then we adopt a LLM-based ranking model to generate recommended items. Unlike existing approaches that typically adopt general-purpose LLMs for zero/few-shot recommendation testing or training on small-sized language models (with less than 1 billion parameters), which cannot fully elicit LLMs' reasoning abilities and leverage rich item side parametric knowledge, we fine-tune a 7 billion parameters LLM for the ranking purpose. This model takes retrieval candidates in natural language format as input, with instruction which explicitly asking to select results from input candidates during inference. Our experimental results demonstrate that our solution outperforms state-of-the-art models on various sequential recommendation tasks.
