Shedding Light on Problems with Hyperbolic Graph Learning
Isay Katsman, Anna Gilbert
TL;DR
The paper challenges the claimed advantages of hyperbolic graph learning by showing that a carefully trained Euclidean baseline can match or surpass hyperbolic models on standard tasks, even for datasets labeled as highly hyperbolic by $\delta$-hyperbolicity. It identifies three core problems—poor Euclidean baselines, tasks that poorly discriminate geometry, and coarse geometry metrics—responsible for inflated hyperbolic performance. The authors propose concrete remedies, including a well-tuned Euclidean baseline, synthetic Tree benchmarks with controlled graph-feature interactions, and richer dataset geometry characterizations. They introduce a parametric family Tree$(b,\ell,\gamma,\delta,\mathcal{D})$ and a concrete Tree1111 dataset to stress-test geometry-dependent learning. Overall, the work urges rigorous baselines and geometry-aware benchmarking to accurately assess when hyperbolic representations offer real benefits for graph tasks.
Abstract
Recent papers in the graph machine learning literature have introduced a number of approaches for hyperbolic representation learning. The asserted benefits are improved performance on a variety of graph tasks, node classification and link prediction included. Claims have also been made about the geometric suitability of particular hierarchical graph datasets to representation in hyperbolic space. Despite these claims, our work makes a surprising discovery: when simple Euclidean models with comparable numbers of parameters are properly trained in the same environment, in most cases, they perform as well, if not better, than all introduced hyperbolic graph representation learning models, even on graph datasets previously claimed to be the most hyperbolic as measured by Gromov $δ$-hyperbolicity (i.e., perfect trees). This observation gives rise to a simple question: how can this be? We answer this question by taking a careful look at the field of hyperbolic graph representation learning as it stands today, and find that a number of results do not diligently present baselines, make faulty modelling assumptions when constructing algorithms, and use misleading metrics to quantify geometry of graph datasets. We take a closer look at each of these three problems, elucidate the issues, perform an analysis of methods, and introduce a parametric family of benchmark datasets to ascertain the applicability of (hyperbolic) graph neural networks.
