HypRAG: Hyperbolic Dense Retrieval for Retrieval Augmented Generation
Hiren Madhu, Ngoc Bui, Ali Maatouk, Leandros Tassiulas, Smita Krishnaswamy, Menglin Yang, Sukanta Ganguly, Kiran Srinivasan, Rex Ying
TL;DR
Dense retrieval for retrieval-augmented generation often relies on Euclidean embeddings that struggle to preserve language hierarchies, increasing hallucination risk. The authors introduce hyperbolic dense retrieval with two variants, HyTE-FH and HyTE-H, in the Lorentz model ($K<0$), plus a geometry-aware Outward Einstein Midpoint pooling to preserve hierarchical depth during aggregation. Empirical results on MTEB and RAGBench show up to 29% gains in context and answer relevance and a clear radial-depth organization (≈20.2% radii increase from Level 1 to Level 5) that Euclidean baselines do not exhibit, highlighting the impact of geometry as a design choice. The approach achieves these improvements with smaller, more efficient models and provides a principled pooling mechanism, suggesting that hyperbolic inductive bias is crucial for faithful RAG systems and scalable retrieval architectures.
Abstract
Embedding geometry plays a fundamental role in retrieval quality, yet dense retrievers for retrieval-augmented generation (RAG) remain largely confined to Euclidean space. However, natural language exhibits hierarchical structure from broad topics to specific entities that Euclidean embeddings fail to preserve, causing semantically distant documents to appear spuriously similar and increasing hallucination risk. To address these limitations, we introduce hyperbolic dense retrieval, developing two model variants in the Lorentz model of hyperbolic space: HyTE-FH, a fully hyperbolic transformer, and HyTE-H, a hybrid architecture projecting pre-trained Euclidean embeddings into hyperbolic space. To prevent representational collapse during sequence aggregation, we introduce the Outward Einstein Midpoint, a geometry-aware pooling operator that provably preserves hierarchical structure. On MTEB, HyTE-FH outperforms equivalent Euclidean baselines, while on RAGBench, HyTE-H achieves up to 29% gains over Euclidean baselines in context relevance and answer relevance using substantially smaller models than current state-of-the-art retrievers. Our analysis also reveals that hyperbolic representations encode document specificity through norm-based separation, with over 20% radial increase from general to specific concepts, a property absent in Euclidean embeddings, underscoring the critical role of geometric inductive bias in faithful RAG systems.
