O$n$ Learning Deep O($n$)-Equivariant Hyperspheres
Pavlo Melnyk, Michael Felsberg, Mårten Wadenbäck, Andreas Robinson, Cuong Le
TL;DR
This work addresses learning deep features that are equivariant to orthogonal transformations in arbitrary dimensions by introducing Deep Equivariant Hyperspheres (DEH), which combine regular $n$-simplexes with $n$-dimensional spherical decision surfaces. The authors derive a simplex-based simplex change-of-basis $M_n$, construct $n$D equivariant spheres, and cascade them to build deep, point-based representations; they also propose an invariant Gram-based operator $oldsymbol{ riangle}= extbf{Y} extbf{Y}^ op$ to capture higher-order relations. Theoretical results establish $O(n)$-equivariance of the neuron and practical techniques for normalization, bias, and higher-order interactions, complemented by empirical validation on $ ext{O}(3)$ and $ ext{O}(5)$ tasks where DEH outperforms several baselines while offering favorable speed/performance trade-offs. The approach generalizes to any dimension, enabling scalable, geometry-aware learning for 3D/4D data with potential applications in molecular design and related domains; code is released at the provided repository.
Abstract
In this paper, we utilize hyperspheres and regular $n$-simplexes and propose an approach to learning deep features equivariant under the transformations of $n$D reflections and rotations, encompassed by the powerful group of O$(n)$. Namely, we propose O$(n)$-equivariant neurons with spherical decision surfaces that generalize to any dimension $n$, which we call Deep Equivariant Hyperspheres. We demonstrate how to combine them in a network that directly operates on the basis of the input points and propose an invariant operator based on the relation between two points and a sphere, which as we show, turns out to be a Gram matrix. Using synthetic and real-world data in $n$D, we experimentally verify our theoretical contributions and find that our approach is superior to the competing methods for O$(n)$-equivariant benchmark datasets (classification and regression), demonstrating a favorable speed/performance trade-off. The code is available at https://github.com/pavlo-melnyk/equivariant-hyperspheres.
