Low-distortion and GPU-compatible Tree Embeddings in Hyperbolic Space
Max van Spengler, Pascal Mettes
TL;DR
Addressing the need for faithful, GPU-friendly hyperbolic embeddings of tree-structured data, the paper proposes HS-DTE, a highly separated Delaunay-based constructive embedding that optimizes hyperspherical separation, and HypFPE, a floating point expansion framework for high-precision computations on GPUs. HS-DTE uses a minimal angle maximization objective to place children on hyperspheres with improved separation, enabling lower distortion across dimensions; HypFPE provides ulp-nonoverlapping expansions and new routines to compute hyperbolic distances via acosh and atanh with enhanced precision. Empirically, HS-DTE outperforms existing constructive and optimization-based methods on complete m-ary and phylogenetic trees, and HypFPE further boosts accuracy when combined with HS-DTE or other constructive methods, while preserving GPU compatibility. The work also delivers theoretical error guarantees for FPE-based distance computations and releases software libraries for arbitrary-dimensional hyperbolic embeddings and GPU-optimized precision arithmetic, broadening the practical impact for hyperbolic neural networks.
Abstract
Embedding tree-like data, from hierarchies to ontologies and taxonomies, forms a well-studied problem for representing knowledge across many domains. Hyperbolic geometry provides a natural solution for embedding trees, with vastly superior performance over Euclidean embeddings. Recent literature has shown that hyperbolic tree embeddings can even be placed on top of neural networks for hierarchical knowledge integration in deep learning settings. For all applications, a faithful embedding of trees is needed, with combinatorial constructions emerging as the most effective direction. This paper identifies and solves two key limitations of existing works. First, the combinatorial construction hinges on finding highly separated points on a hypersphere, a notoriously difficult problem. Current approaches achieve poor separation, degrading the quality of the corresponding hyperbolic embedding. We propose highly separated Delaunay tree embeddings (HS-DTE), which integrates angular separation in a generalized formulation of Delaunay embeddings, leading to lower embedding distortion. Second, low-distortion requires additional precision. The current approach for increasing precision is to use multiple precision arithmetic, which renders the embeddings useless on GPUs in deep learning settings. We reformulate the combinatorial construction using floating point expansion arithmetic, leading to superior embedding quality while retaining utility on accelerated hardware.
