Fully Hyperbolic Convolutional Neural Networks for Computer Vision
Ahmad Bdeir, Kristian Schwethelm, Niels Landwehr
TL;DR
This paper tackles the limitation of existing hyperbolic neural networks in vision by proposing HCNN, a fully hyperbolic CNN built in the Lorentz model to learn hyperbolic representations across all layers. It introduces Lorentz-specific components—Lorentz convolution, Lorentz batch normalization, and Lorentz multinomial logistic regression—along with Lorentz-compatible residuals and activations, enabling end-to-end hyperbolic encoders. Empirical results on image classification and generation show that HCNNs, especially with the Lorentz model, achieve higher accuracy, improved robustness, and effective low-dimensional embeddings compared to Euclidean and Poincaré baselines. The work demonstrates the practical viability and stability advantages of fully hyperbolic vision models and provides a foundation for future scalable hyperbolic architectures in computer vision, with code publicly available.
Abstract
Real-world visual data exhibit intrinsic hierarchical structures that can be represented effectively in hyperbolic spaces. Hyperbolic neural networks (HNNs) are a promising approach for learning feature representations in such spaces. However, current HNNs in computer vision rely on Euclidean backbones and only project features to the hyperbolic space in the task heads, limiting their ability to fully leverage the benefits of hyperbolic geometry. To address this, we present HCNN, a fully hyperbolic convolutional neural network (CNN) designed for computer vision tasks. Based on the Lorentz model, we generalize fundamental components of CNNs and propose novel formulations of the convolutional layer, batch normalization, and multinomial logistic regression. {Experiments on standard vision tasks demonstrate the promising performance of our HCNN framework in both hybrid and fully hyperbolic settings.} Overall, we believe our contributions provide a foundation for developing more powerful HNNs that can better represent complex structures found in image data. Our code is publicly available at https://github.com/kschwethelm/HyperbolicCV.
