Data Augmentation and Regularization for Learning Group Equivariance
Oskar Nordenfors, Axel Flinth
TL;DR
This work investigates learning group equivariance by combining data augmentation with a regularization penalty that suppresses non-equivalent components of neural network parameters. Building on a prior framework, it shows that augmenting training data with symmetry transformations and adding a term $\frac{\gamma}{2}\|\Pi_{\mathcal{E}^{\perp}}A\|^2$ makes the equivariant subspace $\mathcal{E}$ an attractor of the training dynamics for large $\gamma$. The key contributions include a formal analysis of augmented versus equivariant dynamics and a concrete, small-scale SGD experiment confirming attractor behavior, suggesting practical routes to achieve equivariance without hard architectural constraints. The findings have potential practical impact for leveraging known symmetries in diverse architectures by combining augmentation with regularization to obtain provably equivariant behavior in training.
Abstract
In many machine learning tasks, known symmetries can be used as an inductive bias to improve model performance. In this paper, we consider learning group equivariance through training with data augmentation. We summarize results from a previous paper of our own, and extend the results to show that equivariance of the trained model can be achieved through training on augmented data in tandem with regularization.
