Understanding Hyperbolic Metric Learning through Hard Negative Sampling
Yun Yue, Fangzhou Lin, Guanyi Mou, Ziming Zhang
TL;DR
The paper investigates why hyperbolic metric learning improves contrastive image representations and under what conditions it outperforms Euclidean embeddings. It analyzes how geometry shapes hard negative sampling by deriving geometry-dependent triplet weights $p(x^-)$ and demonstrates that hyperbolic distance modulates relative negatives, leading to complementary behavior across geometries. The authors propose a simple mixed-geometry fusion that combines Euclidean and hyperbolic information, showing improved Recall@K on CUB, Cars, and SOP with Vision Transformers. This work provides practical guidance for leveraging hyperbolic geometry in metric learning and highlights a viable ensemble approach to exploit diverse hard negatives, with code released for reproducibility.
Abstract
In recent years, there has been a growing trend of incorporating hyperbolic geometry methods into computer vision. While these methods have achieved state-of-the-art performance on various metric learning tasks using hyperbolic distance measurements, the underlying theoretical analysis supporting this superior performance remains under-exploited. In this study, we investigate the effects of integrating hyperbolic space into metric learning, particularly when training with contrastive loss. We identify a need for a comprehensive comparison between Euclidean and hyperbolic spaces regarding the temperature effect in the contrastive loss within the existing literature. To address this gap, we conduct an extensive investigation to benchmark the results of Vision Transformers (ViTs) using a hybrid objective function that combines loss from Euclidean and hyperbolic spaces. Additionally, we provide a theoretical analysis of the observed performance improvement. We also reveal that hyperbolic metric learning is highly related to hard negative sampling, providing insights for future work. This work will provide valuable data points and experience in understanding hyperbolic image embeddings. To shed more light on problem-solving and encourage further investigation into our approach, our code is available online (https://github.com/YunYunY/HypMix).
