PersonaRAG: Enhancing Retrieval-Augmented Generation Systems with User-Centric Agents
Saber Zerhoudi, Michael Granitzer
TL;DR
PersonaRAG introduces a user-centric, multi-agent extension to retrieval-augmented generation that continuously adapts both retrieval and generation using live user data. By deploying specialized agents (profile, contextual retrieval, live session, document ranking, and feedback) and a Global Message Pool, the framework integrates user context and feedback into each step of the QA process. Experimental results show PersonaRAG improves accuracy across multiple QA datasets and generalizes to other LLM architectures, while maintaining personalization advantages over traditional RAG. The work highlights practical benefits for personalized information retrieval and identifies trade-offs in latency and cost, pointing to future efficiency and expansion of agent capabilities.
Abstract
Large Language Models (LLMs) struggle with generating reliable outputs due to outdated knowledge and hallucinations. Retrieval-Augmented Generation (RAG) models address this by enhancing LLMs with external knowledge, but often fail to personalize the retrieval process. This paper introduces PersonaRAG, a novel framework incorporating user-centric agents to adapt retrieval and generation based on real-time user data and interactions. Evaluated across various question answering datasets, PersonaRAG demonstrates superiority over baseline models, providing tailored answers to user needs. The results suggest promising directions for user-adapted information retrieval systems.
