Learning with Exact Invariances in Polynomial Time
Ashkan Soleymani, Behrooz Tahmasebi, Stefanie Jegelka, Patrick Jaillet
TL;DR
The paper tackles learning under exact symmetries in kernel regression, a setting where naive approaches like data augmentation or group averaging are computationally infeasible for large groups. By leveraging the Laplace–Beltrami spectrum and the commutativity of group actions with the Laplacian, it reformulates invariance constraints into linear conditions across spectral eigenspaces, enabling a polynomial-time procedure. The proposed Spectral Averaging (Spec-Avg) method reduces the constraint set to a generator-based subset, truncates the spectral expansion, and solves closed-form projections in each eigenspace to enforce exact invariance, achieving the same minimax-risk rate as non-invariant kernel regression. This yields a statistically optimal and computationally efficient approach for learning with exact invariances on manifolds, with potential extensions to broader oracle models and kernel-trick adaptations.
Abstract
We study the statistical-computational trade-offs for learning with exact invariances (or symmetries) using kernel regression. Traditional methods, such as data augmentation, group averaging, canonicalization, and frame-averaging, either fail to provide a polynomial-time solution or are not applicable in the kernel setting. However, with oracle access to the geometric properties of the input space, we propose a polynomial-time algorithm that learns a classifier with \emph{exact} invariances. Moreover, our approach achieves the same excess population risk (or generalization error) as the original kernel regression problem. To the best of our knowledge, this is the first polynomial-time algorithm to achieve exact (not approximate) invariances in this context. Our proof leverages tools from differential geometry, spectral theory, and optimization. A key result in our development is a new reformulation of the problem of learning under invariances as optimizing an infinite number of linearly constrained convex quadratic programs, which may be of independent interest.
