Achieving Hyperbolic-Like Expressiveness with Arbitrary Euclidean Regions: A New Approach to Hierarchical Embeddings
Hui Yang, Jiaoyan Chen
TL;DR
The paper addresses the challenge of embedding hierarchical data in low dimensions using hyperbolic geometry by introducing RegD, a Euclidean framework that embeds arbitrary regions (e.g., boxes, balls). RegD combines two dissimilarities, depth $d_{\text{dep}}$ and boundary $d_{\text{bd}}$, to mimic hyperbolic expressiveness and support richer semantic relations such as ontology reasoning. The authors establish theoretical connections showing $d_{\text{dep}}$ recovers hyperbolic distance as a special case and demonstrate strong empirical gains on DAGs and ontologies over state-of-the-art baselines, while maintaining simplicity and flexibility. This approach offers a versatile tool for hierarchical and semantic embedding tasks, with reproducibility resources and potential for seamless integration with hyperbolic neural networks and broader region types.
Abstract
Hierarchical data is common in many domains like life sciences and e-commerce, and its embeddings often play a critical role. While hyperbolic embeddings offer a theoretically grounded approach to representing hierarchies in low-dimensional spaces, current methods often rely on specific geometric constructs as embedding candidates. This reliance limits their generalizability and makes it difficult to integrate with techniques that model semantic relationships beyond pure hierarchies, such as ontology embeddings. In this paper, we present RegD, a flexible Euclidean framework that supports the use of arbitrary geometric regions -- such as boxes and balls -- as embedding representations. Although RegD operates entirely in Euclidean space, we formally prove that it achieves hyperbolic-like expressiveness by incorporating a depth-based dissimilarity between regions, enabling it to emulate key properties of hyperbolic geometry, including exponential growth. Our empirical evaluation on diverse real-world datasets shows consistent performance gains over state-of-the-art methods and demonstrates RegD's potential for broader applications such as the ontology embedding task that goes beyond hierarchy.
