HyperbolicRAG: Enhancing Retrieval-Augmented Generation with Hyperbolic Representations
Linxiao Cao, Ruitao Wang, Jindong Li, Zhipeng Zhou, Menglin Yang
TL;DR
HyperbolicRAG addresses the inability of Euclidean graph-based RAG to capture hierarchical structure by embedding passages, entities, and facts in a hyperbolic space. It introduces depth-aware representations, a bidirectional containment alignment, and a dual-space retrieval framework that fuses Euclidean and hyperbolic signals for robust, hierarchy-aware retrieval. Across multiple QA benchmarks, HyperbolicRAG improves retrieval recall and end-to-end QA metrics, particularly on multi-hop tasks, demonstrating the practical value of incorporating hyperbolic geometry into retrieval-augmented generation. The method reduces hubness, enhances evidence coherence, and provides a model-agnostic approach that broadens the applicability of hierarchical representations in RAG systems.
Abstract
Retrieval-augmented generation (RAG) enables large language models (LLMs) to access external knowledge, helping mitigate hallucinations and enhance domain-specific expertise. Graph-based RAG enhances structural reasoning by introducing explicit relational organization that enables information propagation across semantically connected text units. However, these methods typically rely on Euclidean embeddings that capture semantic similarity but lack a geometric notion of hierarchical depth, limiting their ability to represent abstraction relationships inherent in complex knowledge graphs. To capture both fine-grained semantics and global hierarchy, we propose HyperbolicRAG, a retrieval framework that integrates hyperbolic geometry into graph-based RAG. HyperbolicRAG introduces three key designs: (1) a depth-aware representation learner that embeds nodes within a shared Poincare manifold to align semantic similarity with hierarchical containment, (2) an unsupervised contrastive regularization that enforces geometric consistency across abstraction levels, and (3) a mutual-ranking fusion mechanism that jointly exploits retrieval signals from Euclidean and hyperbolic spaces, emphasizing cross-space agreement during inference. Extensive experiments across multiple QA benchmarks demonstrate that HyperbolicRAG outperforms competitive baselines, including both standard RAG and graph-augmented baselines.
