K-PERM: Personalized Response Generation Using Dynamic Knowledge Retrieval and Persona-Adaptive Queries
Kanak Raj, Kaushik Roy, Vamshi Bonagiri, Priyanshul Govil, Krishnaprasad Thirunarayanan, Manas Gaur
TL;DR
K-PERM addresses the challenge of personalized response generation by jointly leveraging dynamic knowledge retrieval and persona-adaptive queries. It selects a subset of user personas via a dedicated Persona Selector and grounds responses with retrieved knowledge through a Knowledge Retriever, then trains a reward-modulated generator to balance coherence and factual alignment. Evaluations on FoCus show state-of-the-art performance with notable gains for downstream LLMs like GPT-3.5 when augmented by K-PERM, underscoring the practical impact of retrieval-augmented personalization. The framework is model-agnostic and extensible to other domains, offering a scalable path to more contextually aware and persona-consistent conversational agents.
Abstract
Personalizing conversational agents can enhance the quality of conversations and increase user engagement. However, they often lack external knowledge to appropriately tend to a user's persona. This is particularly crucial for practical applications like mental health support, nutrition planning, culturally sensitive conversations, or reducing toxic behavior in conversational agents. To enhance the relevance and comprehensiveness of personalized responses, we propose using a two-step approach that involves (1) selectively integrating user personas and (2) contextualizing the response with supplementing information from a background knowledge source. We develop K-PERM (Knowledge-guided PErsonalization with Reward Modulation), a dynamic conversational agent that combines these elements. K-PERM achieves state-of-the-art performance on the popular FoCus dataset, containing real-world personalized conversations concerning global landmarks. We show that using responses from K-PERM can improve performance in state-of-the-art LLMs (GPT 3.5) by 10.5%, highlighting the impact of K-PERM for personalizing chatbots.
