A Geometry-Aware Algorithm to Learn Hierarchical Embeddings in Hyperbolic Space
Zhangyu Wang, Lantian Xu, Zhifeng Kong, Weilong Wang, Xuyu Peng, Enyang Zheng
TL;DR
The paper tackles learning hierarchical representations in hyperbolic space, where geometry-mismatch with Euclidean methods impedes optimization. It defines three illness categories that describe misordered relationships and proposes a geometry-aware framework combining a dilation mapping with transitive closure regularization to mitigate them, with a theoretical analysis of the dilation mechanism in the Poincaré ball $\mathcal{B}^d$ and optimization on the Riemannian manifold using $d(\cdot,\cdot)$. The authors formulate local capacity and provide bounds on the $r$-packing number $\mathcal{A}(d,\theta_r)$ and validate on synthetic and real-tree datasets, showing improved MAP and MR over baselines. The work advances practical, scalable hyperbolic embeddings for tree-like data by explicitly leveraging hyperbolic geometry through dilation and transitive-closure-inspired regularization, offering guidance for future geometry-aware representation learning.
Abstract
Hyperbolic embeddings are a class of representation learning methods that offer competitive performances when data can be abstracted as a tree-like graph. However, in practice, learning hyperbolic embeddings of hierarchical data is difficult due to the different geometry between hyperbolic space and the Euclidean space. To address such difficulties, we first categorize three kinds of illness that harm the performance of the embeddings. Then, we develop a geometry-aware algorithm using a dilation operation and a transitive closure regularization to tackle these illnesses. We empirically validate these techniques and present a theoretical analysis of the mechanism behind the dilation operation. Experiments on synthetic and real-world datasets reveal superior performances of our algorithm.
