Learning Structured Representations with Hyperbolic Embeddings
Aditya Sinha, Siqi Zeng, Makoto Yamada, Han Zhao
TL;DR
This work addresses the distortion that Euclidean spaces introduce when embedding hierarchical label structures into learned representations. It introduces HypStructure, a hyperbolic regularization framework comprising HypCPCC and HypCenter, which embeds the label hierarchy explicitly in hyperbolic space and is compatible with standard task losses. Empirical results across CIFAR10/100 and ImageNet100 show reduced hierarchy distortion, improved CPCC and classification performance, and enhanced OOD detection, with a formal eigenspectrum analysis linking hyperbolic geometry to these gains. The approach highlights the advantage of hyperbolic geometry for hierarchy-informed learning and offers practical implications for efficient, low-dimensional representations and reliable OOD detection.
Abstract
Most real-world datasets consist of a natural hierarchy between classes or an inherent label structure that is either already available or can be constructed cheaply. However, most existing representation learning methods ignore this hierarchy, treating labels as permutation invariant. Recent work [Zeng et al., 2022] proposes using this structured information explicitly, but the use of Euclidean distance may distort the underlying semantic context [Chen et al., 2013]. In this work, motivated by the advantage of hyperbolic spaces in modeling hierarchical relationships, we propose a novel approach HypStructure: a Hyperbolic Structured regularization approach to accurately embed the label hierarchy into the learned representations. HypStructure is a simple-yet-effective regularizer that consists of a hyperbolic tree-based representation loss along with a centering loss, and can be combined with any standard task loss to learn hierarchy-informed features. Extensive experiments on several large-scale vision benchmarks demonstrate the efficacy of HypStructure in reducing distortion and boosting generalization performance especially under low dimensional scenarios. For a better understanding of structured representation, we perform eigenvalue analysis that links the representation geometry to improved Out-of-Distribution (OOD) detection performance seen empirically. The code is available at \url{https://github.com/uiuctml/HypStructure}.
