In Prospect and Retrospect: Reflective Memory Management for Long-term Personalized Dialogue Agents
Zhen Tan, Jun Yan, I-Hung Hsu, Rujun Han, Zifeng Wang, Long T. Le, Yiwen Song, Yanfei Chen, Hamid Palangi, George Lee, Anand Iyer, Tianlong Chen, Huan Liu, Chen-Yu Lee, Tomas Pfister
TL;DR
This work tackles the challenge of long-term personalization in dialogue agents by introducing Reflective Memory Management (RMM), which combines Prospective Reflection for topic-based memory organization with Retrospective Reflection for online retrieval refinement via LLM attribution. By organizing memories into coherent topics and continually refining retrieval through a lightweight RL-trained reranker guided by LLM-derived rewards, RMM improves both memory relevance and response quality across MSC and LongMemEval benchmarks. Key contributions include a detailed framework for topic-based memory extraction, a differentiable reranker with Gumbel-based sampling, and attribution-based reward signals that enable online adaptation without extensive labeled data. The results demonstrate consistent improvements over strong baselines, with analysis on granularity, offline pretraining, and different LLMs, highlighting RMM’s potential for robust, long-term personalization in real-world dialogue systems.
Abstract
Large Language Models (LLMs) have made significant progress in open-ended dialogue, yet their inability to retain and retrieve relevant information from long-term interactions limits their effectiveness in applications requiring sustained personalization. External memory mechanisms have been proposed to address this limitation, enabling LLMs to maintain conversational continuity. However, existing approaches struggle with two key challenges. First, rigid memory granularity fails to capture the natural semantic structure of conversations, leading to fragmented and incomplete representations. Second, fixed retrieval mechanisms cannot adapt to diverse dialogue contexts and user interaction patterns. In this work, we propose Reflective Memory Management (RMM), a novel mechanism for long-term dialogue agents, integrating forward- and backward-looking reflections: (1) Prospective Reflection, which dynamically summarizes interactions across granularities-utterances, turns, and sessions-into a personalized memory bank for effective future retrieval, and (2) Retrospective Reflection, which iteratively refines the retrieval in an online reinforcement learning (RL) manner based on LLMs' cited evidence. Experiments show that RMM demonstrates consistent improvement across various metrics and benchmarks. For example, RMM shows more than 10% accuracy improvement over the baseline without memory management on the LongMemEval dataset.
