Towards Proactive Personalization through Profile Customization for Individual Users in Dialogues
Xiaotian Zhang, Yuan Wang, Ruizhe Chen, Zeya Wang, Runchen Hou, Zuozhu Liu
TL;DR
The paper addresses the challenge of long-term, user-specific personalization in dialog systems by introducing PersonalAgent, a memory-enabled agent that incrementally infers and stores user preferences as a unified profile across sessions. It formalizes turn-level preference inference as a multi-turn MDP and trains the agent with Group Relative Policy Optimization, using a policy-based judge to provide robust feedback. The approach is validated on ALOE, PrefEval, and a new ALOE-Unseen dataset for cold-start scenarios, showing superior accuracy, proactive querying, and strong cross-session consistency, with human annotations corroborating the evaluation signals. The work emphasizes memory-based personalization as a pathway to more natural, inclusive, and adaptive conversational agents and outlines directions for broader, longer-horizon evaluations.
Abstract
The deployment of Large Language Models (LLMs) in interactive systems necessitates a deep alignment with the nuanced and dynamic preferences of individual users. Current alignment techniques predominantly address universal human values or static, single-turn preferences, thereby failing to address the critical needs of long-term personalization and the initial user cold-start problem. To bridge this gap, we propose PersonalAgent, a novel user-centric lifelong agent designed to continuously infer and adapt to user preferences. PersonalAgent constructs and dynamically refines a unified user profile by decomposing dialogues into single-turn interactions, framing preference inference as a sequential decision-making task. Experiments show that PersonalAgent achieves superior performance over strong prompt-based and policy optimization baselines, not only in idealized but also in noisy conversational contexts, while preserving cross-session preference consistency. Furthermore, human evaluation confirms that PersonalAgent excels at capturing user preferences naturally and coherently. Our findings underscore the importance of lifelong personalization for developing more inclusive and adaptive conversational agents. Our code is available here.
