Ever-Evolving Memory by Blending and Refining the Past
Seo Hyun Kim, Keummin Ka, Yohan Jo, Seung-won Hwang, Dongha Lee, Jinyoung Yeo
TL;DR
The paper addresses the challenge of endowing chatbots with robust, long-term memory. It introduces CREEM, a Contextualized Refinement based Ever-Evolving Memory framework that blends past memories with current dialogue and refines the memory stream to forget outdated or conflicting information, treating memory construction and response generation as intertwined. CREEM employs contextual search to broaden memory retrieval, a blending step to generate new insights from past and present information, and a refining step to maintain consistency, with insights infused into response generation. Extensive experiments on MSC and CC datasets demonstrate that CREEM achieves superior memory quality across integration, consistency, and sophistication, and yields more coherent, memory-informed responses compared to strong baselines, without requiring extensive fine-tuning. The work highlights the practical impact of memory-informed dialogue and provides a foundation for future improvements in memory retrievers and larger-scale long-term conversational datasets.
Abstract
For a human-like chatbot, constructing a long-term memory is crucial. However, current large language models often lack this capability, leading to instances of missing important user information or redundantly asking for the same information, thereby diminishing conversation quality. To effectively construct memory, it is crucial to seamlessly connect past and present information, while also possessing the ability to forget obstructive information. To address these challenges, we propose CREEM, a novel memory system for long-term conversation. Improving upon existing approaches that construct memory based solely on current sessions, CREEM blends past memories during memory formation. Additionally, we introduce a refining process to handle redundant or outdated information. Unlike traditional paradigms, we view responding and memory construction as inseparable tasks. The blending process, which creates new memories, also serves as a reasoning step for response generation by informing the connection between past and present. Through evaluation, we demonstrate that CREEM enhances both memory and response qualities in multi-session personalized dialogues.
