Diffeomorphism-Equivariant Neural Networks
Josephine Elisabeth Oettinger, Zakhar Shumaylov, Johannes Bostelmann, Jan Lellmann, Carola-Bibiane Schönlieb
TL;DR
This work tackles the challenge of extending neural networks to be diffeomorphism-equivariant under the infinite-dimensional group of diffeomorphisms $\mathcal{D}(\mathcal{X})$. It introduces DiffeoNN, which uses energy-based canonicalisation with stationary velocity fields (SVFs) to map inputs to a canonical form before applying a pretrained inner model and then reversing the transformation to restore equivariance, all without retraining. The canonicalisation energy combines a VAE loss and an adversarial discriminator with a deformation regulariser, and gradient-based optimisation with a SIREN parameterisation and the Scaling-and-Squaring method ensures practical, approximate invariance. Empirical results across synthetic segmentation, lung segmentation, and MNIST genus classification show improved robustness to unseen deformations and competitive performance relative to data-augmented baselines, demonstrating data efficiency and modularity. These findings point to promising theoretical questions about the stability, limits, and generalisation guarantees of energy-based canonicalisation for infinite-dimensional symmetry groups.
Abstract
Incorporating group symmetries via equivariance into neural networks has emerged as a robust approach for overcoming the efficiency and data demands of modern deep learning. While most existing approaches, such as group convolutions and averaging-based methods, focus on compact, finite, or low-dimensional groups with linear actions, this work explores how equivariance can be extended to infinite-dimensional groups. We propose a strategy designed to induce diffeomorphism equivariance in pre-trained neural networks via energy-based canonicalisation. Formulating equivariance as an optimisation problem allows us to access the rich toolbox of already established differentiable image registration methods. Empirical results on segmentation and classification tasks confirm that our approach achieves approximate equivariance and generalises to unseen transformations without relying on extensive data augmentation or retraining.
