HypRAG: Hyperbolic Dense Retrieval for Retrieval Augmented Generation

Hiren Madhu; Ngoc Bui; Ali Maatouk; Leandros Tassiulas; Smita Krishnaswamy; Menglin Yang; Sukanta Ganguly; Kiran Srinivasan; Rex Ying

HypRAG: Hyperbolic Dense Retrieval for Retrieval Augmented Generation

Hiren Madhu, Ngoc Bui, Ali Maatouk, Leandros Tassiulas, Smita Krishnaswamy, Menglin Yang, Sukanta Ganguly, Kiran Srinivasan, Rex Ying

TL;DR

Dense retrieval for retrieval-augmented generation often relies on Euclidean embeddings that struggle to preserve language hierarchies, increasing hallucination risk. The authors introduce hyperbolic dense retrieval with two variants, HyTE-FH and HyTE-H, in the Lorentz model ($K<0$), plus a geometry-aware Outward Einstein Midpoint pooling to preserve hierarchical depth during aggregation. Empirical results on MTEB and RAGBench show up to 29% gains in context and answer relevance and a clear radial-depth organization (≈20.2% radii increase from Level 1 to Level 5) that Euclidean baselines do not exhibit, highlighting the impact of geometry as a design choice. The approach achieves these improvements with smaller, more efficient models and provides a principled pooling mechanism, suggesting that hyperbolic inductive bias is crucial for faithful RAG systems and scalable retrieval architectures.

Abstract

Embedding geometry plays a fundamental role in retrieval quality, yet dense retrievers for retrieval-augmented generation (RAG) remain largely confined to Euclidean space. However, natural language exhibits hierarchical structure from broad topics to specific entities that Euclidean embeddings fail to preserve, causing semantically distant documents to appear spuriously similar and increasing hallucination risk. To address these limitations, we introduce hyperbolic dense retrieval, developing two model variants in the Lorentz model of hyperbolic space: HyTE-FH, a fully hyperbolic transformer, and HyTE-H, a hybrid architecture projecting pre-trained Euclidean embeddings into hyperbolic space. To prevent representational collapse during sequence aggregation, we introduce the Outward Einstein Midpoint, a geometry-aware pooling operator that provably preserves hierarchical structure. On MTEB, HyTE-FH outperforms equivalent Euclidean baselines, while on RAGBench, HyTE-H achieves up to 29% gains over Euclidean baselines in context relevance and answer relevance using substantially smaller models than current state-of-the-art retrievers. Our analysis also reveals that hyperbolic representations encode document specificity through norm-based separation, with over 20% radial increase from general to specific concepts, a property absent in Euclidean embeddings, underscoring the critical role of geometric inductive bias in faithful RAG systems.

HypRAG: Hyperbolic Dense Retrieval for Retrieval Augmented Generation

TL;DR

), plus a geometry-aware Outward Einstein Midpoint pooling to preserve hierarchical depth during aggregation. Empirical results on MTEB and RAGBench show up to 29% gains in context and answer relevance and a clear radial-depth organization (≈20.2% radii increase from Level 1 to Level 5) that Euclidean baselines do not exhibit, highlighting the impact of geometry as a design choice. The approach achieves these improvements with smaller, more efficient models and provides a principled pooling mechanism, suggesting that hyperbolic inductive bias is crucial for faithful RAG systems and scalable retrieval architectures.

Abstract

Paper Structure (29 sections, 9 theorems, 42 equations, 4 figures, 8 tables)

This paper contains 29 sections, 9 theorems, 42 equations, 4 figures, 8 tables.

Introduction
Related Works
Hyperbolic Space Preliminaries
Lorentz Model of Hyperbolic Space
Hyperbolic Transformer Components
Method
Architecture
Failure of Naïve Hyperbolic Pooling
Outward Einstein Midpoint Pooling
Training Methodology
Experiments and Results
Experimental Setup
Results
Conclusion
Proofs
...and 14 more sections

Key Result

Proposition 4.3

Let $\{{{\mathbf{x}}}_i\}_{i=1}^n \subset \mathbb{H}^d_K$ with $n \geq 2$. Define the Euclidean mean $\bar{{{\mathbf{x}}}} = \frac{1}{n}\sum_{i=1}^n {{\mathbf{x}}}_i$ and its projection onto the hyperboloid ${{\mathbf{m}}}^{\mathrm{Euc}} = \Pi_K(\bar{{{\mathbf{x}}}})$. Then, we have with equality if and only if all ${{\mathbf{x}}}_i$ are identical.

Figures (4)

Figure 1: Hierarchies in Text. (A) Documents naturally organize into branching hierarchies where general topics spawn increasingly specific subtopics. Euclidean spaces distort such hierarchies due to crowding effects, while hyperbolic geometry preserves hierarchical relationships through exponential volume growth. (B) Ricci curvature analysis of document embeddings from strong baselines reveals predominantly negative curvature, indicating tree-like semantic structure.
Figure 2: HyTE Architecture. A) HyTE-FH Encoder Block, B) HyTE-FH architecture, C) HyTE-H Architecture.
Figure 3: Outward Einstein Midpoint. Size of token shows its contribution towards aggregation.
Figure 4: Empirical validation of hierarchical encoding.Left: Euclidean models show flat or decreasing norms. Middle: HyTE-H demonstrate increasing norms with fine-tuning enhancing this trend. Right: HyTE-FH achieves +20.2% total increase from L1 to L5. Bottom: Normalized comparison and percent change summary highlighting the contrasting behaviors of different geometric approaches.

Theorems & Definitions (17)

Definition 4.1: Lorentz Projection
Definition 4.2: Radial Depth
Proposition 4.3: Euclidean Mean Contracts
Definition 4.4: Outward Bias
Proposition 4.5: Implicit Radial Weighting is Insufficient
Theorem 4.6: OEM Pre-Projection Bound
Theorem 4.7: OEM Outward Bias
Lemma 1.1: Lorentzian Inner Product Bound
proof
Proposition 1.2: Euclidean Mean Contracts
...and 7 more

HypRAG: Hyperbolic Dense Retrieval for Retrieval Augmented Generation

TL;DR

Abstract

HypRAG: Hyperbolic Dense Retrieval for Retrieval Augmented Generation

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (4)

Theorems & Definitions (17)