Hierarchical Memory for High-Efficiency Long-Term Reasoning in LLM Agents

Haoran Sun; Shaoning Zeng

TL;DR

Problem: Long-term reasoning in LLM agents is hampered by inefficient memory organization and retrieval.

Approach: The authors propose H-MEM, a four-layer hierarchical memory with position-index routing to enable structured, layer-wise memory retrieval and grounding.

Contributions: Four-layer storage (Domain, Category, Memory Trace, Episode), memory update via user feedback, top-down retrieval with FAISS, and extensive evaluation on LoCoMo showing consistent improvements over baselines and strong efficiency.

Significance: The method improves long-term dialogue reasoning and scalability, with potential extensions to multimodal memory.

Abstract

Long-term memory is one of the key factors influencing the reasoning capabilities of Large Language Model Agents (LLM Agents). Incorporating a memory mechanism that effectively integrates past interactions can significantly enhance the decision-making and contextual coherence of LLM Agents. While recent works have made progress in memory storage and retrieval, such as encoding memory into dense vectors for similarity-based search or organizing knowledge as graphs, these approaches often fall short in structured memory organization and efficient retrieval. To address these limitations, we propose a Hierarchical Memory (H-MEM) architecture for LLM Agents that organizes and updates memory in a multi-level fashion based on the degree of semantic abstraction. Each memory vector carries a positional index encoding that points to its semantically related sub-memories in the next layer. During the reasoning phase, an index-based routing mechanism enables efficient, layer-by-layer retrieval without performing exhaustive similarity computations. We evaluate our method on five task settings from the LoCoMo dataset. Experimental results show that our approach consistently outperforms five baseline methods, demonstrating its effectiveness in long-term dialogue scenarios.
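The index-based routing described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the names `MemoryNode`, `retrieve`, and `k` are hypothetical, the vectors are hand-made toy values rather than real embeddings, and similarity is computed with plain NumPy cosine scores instead of FAISS. The point it shows is that each node stores the positions of its sub-memories in the next layer, so each layer only scores the children of the nodes selected above it.

```python
from dataclasses import dataclass, field

import numpy as np

# Layer names follow the TL;DR; everything else here is an illustrative assumption.
LAYER_NAMES = ["domain", "category", "memory_trace", "episode"]


@dataclass
class MemoryNode:
    text: str
    vector: np.ndarray
    # Positional index: positions of related sub-memories in the NEXT layer.
    children: list = field(default_factory=list)


def cosine(a, b):
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b) + 1e-12))


def retrieve(layers, query, k=1):
    """Top-down retrieval: score only nodes reachable from the previous layer."""
    if not layers:
        return []
    candidates = list(range(len(layers[0])))  # the top layer is scored in full
    for depth, layer in enumerate(layers):
        ranked = sorted(
            candidates, key=lambda i: cosine(layer[i].vector, query), reverse=True
        )[:k]
        if depth == len(layers) - 1:
            return [layer[i].text for i in ranked]
        # Route through the stored indices instead of searching the whole next layer.
        candidates = [c for i in ranked for c in layer[i].children]
    return []


def node(text, vec, children=()):
    return MemoryNode(text, np.array(vec, dtype=float), list(children))


# Toy four-layer memory (assumed data).
layers = [
    [node("food", [1, 0, 0], [0, 1]), node("travel", [0, 1, 0], [2])],
    [
        node("cooking", [0.9, 0.1, 0.0], [0]),
        node("restaurants", [0.8, 0.0, 0.2], [1]),
        node("flights", [0.0, 0.9, 0.1], [2]),
    ],
    [
        node("pasta recipes", [0.9, 0.1, 0.1], [0]),
        node("favorite sushi places", [0.7, 0.0, 0.5], [1]),
        node("trip to Tokyo", [0.1, 0.9, 0.3], [2]),
    ],
    [
        node("User made carbonara on Sunday", [0.95, 0.05, 0.1]),
        node("User liked the sushi bar downtown", [0.7, 0.0, 0.6]),
        node("User booked a flight to Tokyo in May", [0.05, 0.95, 0.3]),
    ],
]

sushi_query = np.array([0.7, 0.0, 0.6])
print(retrieve(layers, sushi_query, k=1))
```

With `k=1`, the sushi-like query is routed food → restaurants → favorite sushi places and never scores the travel branch. A larger `k` widens the search at each layer at the cost of more comparisons.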
