Siegel Neural Networks
Xuan Son Nguyen, Aymeric Histace, Nistor Grozavu
TL;DR
This work develops discriminative neural networks on Siegel spaces, a class of Riemannian symmetric spaces (RSS) that extend hyperbolic and SPD geometries. It introduces MLR layers based on the quotient structure $ ext{SH}_m \\cong Sp_{2m}/SpO_{2m}$ and a vector-valued distance $d_{\\Delta}(\cdot,\cdot)$, along with two FC-layer families (AFC and DFC) leveraging the real symplectic group action on $\\text{SH}_m$. A closed-form point-to-hyperplane distance on Siegel spaces is derived, with extensions to product spaces to enable compact, invariant decision boundaries for multiclass problems. Empirically, the approach achieves state-of-the-art performance on radar clutter classification and node classification, underscoring the practical value of non-Euclidean representations for discriminative learning on RSS.
Abstract
Riemannian symmetric spaces (RSS) such as hyperbolic spaces and symmetric positive definite (SPD) manifolds have become popular spaces for representation learning. In this paper, we propose a novel approach for building discriminative neural networks on Siegel spaces, a family of RSS that is largely unexplored in machine learning tasks. For classification applications, one focus of recent works is the construction of multiclass logistic regression (MLR) and fully-connected (FC) layers for hyperbolic and SPD neural networks. Here we show how to build such layers for Siegel neural networks. Our approach relies on the quotient structure of those spaces and the notation of vector-valued distance on RSS. We demonstrate the relevance of our approach on two applications, i.e., radar clutter classification and node classification. Our results successfully demonstrate state-of-the-art performance across all datasets.
