EcphoryRAG: Re-Imagining Knowledge-Graph RAG via Human Associative Memory
Zirui Liao
TL;DR
EcphoryRAG introduces a cue-driven, memory-inspired RAG framework that casts retrieval as an ecphoric process over an entity-centric memory graph. It offline-builds Engrams and an associative knowledge graph with multi-granularity indices, then online performs cue extraction, multi-hop associative search guided by weighted embeddings, and final re-ranking anchored to the original query to produce grounded, multi-step reasoning. Across 2WikiMultiHopQA, HotpotQA, and MuSiQue, EcphoryRAG achieves state-of-the-art EM and F1 scores while substantially reducing offline indexing costs, demonstrating both high reasoning quality and practical efficiency. By modeling human memory principles, the approach offers a scalable, adaptable pathway for structured RAG that can enable continual learning and goal-oriented retrieval in AI systems.
Abstract
Cognitive neuroscience research indicates that humans leverage cues to activate entity-centered memory traces (engrams) for complex, multi-hop recollection. Inspired by this mechanism, we introduce EcphoryRAG, an entity-centric knowledge graph RAG framework. During indexing, EcphoryRAG extracts and stores only core entities with corresponding metadata, a lightweight approach that reduces token consumption by up to 94\% compared to other structured RAG systems. For retrieval, the system first extracts cue entities from queries, then performs a scalable multi-hop associative search across the knowledge graph. Crucially, EcphoryRAG dynamically infers implicit relations between entities to populate context, enabling deep reasoning without exhaustive pre-enumeration of relationships. Extensive evaluations on the 2WikiMultiHop, HotpotQA, and MuSiQue benchmarks demonstrate that EcphoryRAG sets a new state-of-the-art, improving the average Exact Match (EM) score from 0.392 to 0.474 over strong KG-RAG methods like HippoRAG. These results validate the efficacy of the entity-cue-multi-hop retrieval paradigm for complex question answering.
