HyperCore: The Core Framework for Building Hyperbolic Foundation Models with Comprehensive Modules
Neil He, Menglin Yang, Rex Ying
TL;DR
This work tackles the lack of a unified, open-source framework for building hyperbolic foundation models. It introduces HyperCore, a PyTorch-based collection of comprehensive modules that support both Lorentz and Poincaré hyperbolic spaces and facilitate end-to-end construction. The authors demonstrate the framework by building fully hyperbolic architectures—LViT, L-CLIP, and HypGraphRAG—and conducting extensive experiments across vision, language, and graph modalities. Results indicate that hyperbolic variants can outperform their Euclidean counterparts in several tasks, while HyperCore substantially lowers development effort and encourages broader exploration of curvature and geometry in foundation models.
Abstract
Hyperbolic neural networks have emerged as a powerful tool for modeling hierarchical data across diverse modalities. Recent studies show that token distributions in foundation models exhibit scale-free properties, suggesting that hyperbolic space is a more suitable ambient space than Euclidean space for many pre-training and downstream tasks. However, existing tools lack essential components for building hyperbolic foundation models, making it difficult to leverage recent advancements. We introduce HyperCore, a comprehensive open-source framework that provides core modules for constructing hyperbolic foundation models across multiple modalities. HyperCore's modules can be effortlessly combined to develop novel hyperbolic foundation models, eliminating the need to extensively modify Euclidean modules from scratch and possible redundant research efforts. To demonstrate its versatility, we build and test the first fully hyperbolic vision transformers (LViT) with a fine-tuning pipeline, the first fully hyperbolic multimodal CLIP model (L-CLIP), and a hybrid Graph RAG with a hyperbolic graph encoder. Our experiments demonstrate that LViT outperforms its Euclidean counterpart. Additionally, we benchmark and reproduce experiments across hyperbolic GNNs, CNNs, Transformers, and vision Transformers to highlight HyperCore's advantages.
