Emergent Equivariance in Deep Ensembles
Jan E. Gerken, Pan Kessel
TL;DR
The paper tackles how to enforce symmetry in neural networks by leveraging deep ensembles and data augmentation. Using neural tangent kernel theory in the infinite-width limit, it proves that with full augmentation the ensemble mean becomes equivariant to symmetries for all inputs and training times, even off-manifold, while individual members need not be. It provides finite-width and finite-ensemble bounds, analyzes continuous versus discrete groups, and validates the theory across Ising models, FashionMNIST, and histological data, showing practical invariance gains and competitive performance relative to manifestly equivariant methods. The findings suggest a simple, scalable way to achieve emergent equivariance without bespoke architectures, with implications for robustness, uncertainty estimation, and symmetry-compliant learning in scientific and visual domains.
Abstract
We show that deep ensembles become equivariant for all inputs and at all training times by simply using data augmentation. Crucially, equivariance holds off-manifold and for any architecture in the infinite width limit. The equivariance is emergent in the sense that predictions of individual ensemble members are not equivariant but their collective prediction is. Neural tangent kernel theory is used to derive this result and we verify our theoretical insights using detailed numerical experiments.
