MemWeaver: A Hierarchical Memory from Textual Interactive Behaviors for Personalized Generation
Shuo Yu, Mingyue Cheng, Daoyu Wang, Qi Liu, Zirui Liu, Ze Guo, Xiaoyu Tao
TL;DR
This work addresses deep personalization from rich textual histories by introducing MemWeaver, a hierarchical memory framework that combines a behavioral memory of concrete actions with a cognitive memory of long-term preferences to guide generation. A context-aware memory graph enables semantic- and time-aware retrieval via a memory-guided generation process, formalized with $ \hat{y} = \arg\max_{y'} p(y' \mid q, \mathcal{M}^{\mathrm{behavior}}_u, \mathcal{M}^{\mathrm{cog}}_u; \theta)$ and a behavioral-edge score $ S(u\rightarrow v)=\left( \frac{e_q \cdot e_v}{\|e_q\|\|e_v\|} \right)^{\alpha} R(v) C(u,v)$. An incremental updating strategy ensures scalable maintenance of memory as new data arrive. Experiments on the LaMP benchmark show MemWeaver achieves state-of-the-art results across twelve metrics, with ablations confirming the complementary roles of behavioral and cognitive memories and the importance of semantic and temporal connections. The approach offers a practical, scalable path to deeply personalized generation in real-world systems.
Abstract
The primary form of user-internet engagement is shifting from leveraging implicit feedback signals, such as browsing and clicks, to harnessing the rich explicit feedback provided by textual interactive behaviors. This shift unlocks a rich source of user textual history, presenting a profound opportunity for a deeper form of personalization. However, prevailing approaches offer only a shallow form of personalization, as they treat user history as a flat list of texts for retrieval and fail to model the rich temporal and semantic structures reflecting dynamic nature of user interests. In this work, we propose \textbf{MemWeaver}, a framework that weaves the user's entire textual history into a hierarchical memory to power deeply personalized generation. The core innovation of our memory lies in its ability to capture both the temporal evolution of interests and the semantic relationships between different activities. To achieve this, MemWeaver builds two complementary memory components that both integrate temporal and semantic information, but at different levels of abstraction: behavioral memory, which captures specific user actions, and cognitive memory, which represents long-term preferences. This dual-component memory serves as a unified representation of the user, allowing large language models (LLMs) to reason over both concrete behaviors and abstracted traits. Experiments on the Language Model Personalization (LaMP) benchmark validate the efficacy of MemWeaver. Our code is available\footnote{https://github.com/fishsure/MemWeaver}.
