Position: Beyond Euclidean -- Foundation Models Should Embrace Non-Euclidean Geometries

Neil He; Jiahong Liu; Buze Zhang; Ngoc Bui; Ali Maatouk; Menglin Yang; Irwin King; Melanie Weber; Rex Ying

TL;DR

This work argues that Euclidean geometry is insufficient for scaling next-generation foundation models on real-world data that exhibit non-Euclidean structures such as hierarchies and cycles. It advocates curvature-aware, non-Euclidean foundations (hyperbolic, spherical, and mixed-curvature geometries) along three development paths: fine-tuning existing Euclidean models, pretraining new non-Euclidean models, and building hybrid architectures that blend geometries. The authors provide theoretical insights (distortion-dimension trade-offs and Markov convexity) and empirical evidence that non-Euclidean embeddings achieve lower distortion and better capture hierarchical and multi-modal structures, potentially improving representational efficiency and transfer. If adopted, curvature-aware foundation models could improve scalability and adaptability while mitigating issues such as hallucinations, enabling more efficient cross-modal learning and more faithful representations of complex data geometry.

Abstract

In the era of foundation models and Large Language Models (LLMs), Euclidean space has been the de facto geometric setting for machine learning architectures. However, recent literature has demonstrated that this choice comes with fundamental limitations. At large scale, real-world data often exhibit inherently non-Euclidean structures, such as multi-way relationships, hierarchies, symmetries, and non-isotropic scaling, across domains such as language, vision, and the natural sciences. These structures are difficult to capture effectively within the constraints of Euclidean spaces. This position paper argues that moving beyond Euclidean geometry is not merely an optional enhancement but a necessity to maintain the scaling law for the next generation of foundation models. By adopting non-Euclidean geometries, foundation models could leverage the aforementioned structures more efficiently. Task-aware adaptability that dynamically reconfigures embeddings to match the geometry of downstream applications could further enhance efficiency and expressivity. Our position is supported by a series of theoretical and empirical investigations of prevalent foundation models. Finally, we outline a roadmap for integrating non-Euclidean geometries into foundation models, including strategies for building geometric foundation models via fine-tuning, training from scratch, and hybrid approaches.
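The claim that hierarchies are hard to capture in Euclidean space can be made concrete with a small sketch. In a tree, the distance between two leaves on different branches equals the length of the path through their common ancestor. The code below compares how closely that property holds in the Poincaré ball, one standard model of hyperbolic space, versus the Euclidean plane. The function names, the choice of the Poincaré model with curvature -1, and the sample points are illustrative assumptions, not taken from the paper.

```python
import math


def poincare_distance(u, v):
    """Geodesic distance between two points strictly inside the unit
    Poincare ball (curvature -1)."""
    sq_u = sum(x * x for x in u)
    sq_v = sum(x * x for x in v)
    if sq_u >= 1.0 or sq_v >= 1.0:
        raise ValueError("points must lie strictly inside the unit ball")
    sq_diff = sum((a - b) ** 2 for a, b in zip(u, v))
    return math.acosh(1.0 + 2.0 * sq_diff / ((1.0 - sq_u) * (1.0 - sq_v)))


def euclidean_distance(u, v):
    """Ordinary straight-line distance."""
    return math.dist(u, v)


def detour_ratio(dist, root, a, b):
    """d(a, b) divided by the length of the path a -> root -> b.
    In a tree with a and b on different branches of root, this is exactly 1."""
    return dist(a, b) / (dist(a, root) + dist(root, b))


# Hypothetical embedding: a root at the origin and two leaves on
# different branches, placed close to the boundary of the ball.
root = (0.0, 0.0)
leaf_a = (0.99, 0.0)
leaf_b = (0.0, 0.99)

hyperbolic_ratio = detour_ratio(poincare_distance, root, leaf_a, leaf_b)
euclidean_ratio = detour_ratio(euclidean_distance, root, leaf_a, leaf_b)

print(f"hyperbolic detour ratio: {hyperbolic_ratio:.3f}")
print(f"Euclidean detour ratio:  {euclidean_ratio:.3f}")
```

In this toy setup the hyperbolic ratio is much closer to 1 than the Euclidean one, which stays at about 0.707 for perpendicular leaves regardless of how far out they sit. This only illustrates the intuition behind low-distortion tree embeddings; it is not a reproduction of the paper's distortion analysis.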
