Generalizing Convolutional Neural Networks for Equivariance to Lie Groups on Arbitrary Continuous Data
Marc Finzi, Samuel Stanton, Pavel Izmailov, Andrew Gordon Wilson
TL;DR
The paper tackles the challenge of generalizing convolutional networks to be equivariant to continuous Lie group transformations on arbitrary continuous data. It introduces LieConv, a convolutional layer whose kernel is defined on a Lie group via exp/log maps, with lifting of inputs, locality constraints, and Monte Carlo discretization to perform group convolutions without relying on regular grids, and it supports multiple orbits. The approach is demonstrated on image data (RotMNIST), molecular data (QM9), and dynamical systems, achieving competitive or state-of-the-art results and, in Hamiltonian settings, exact conservation of linear and angular momentum when appropriate symmetries are enforced. The work offers a practical, extensible framework for rapid prototyping of equivariant models across diverse modalities, with open-source code to enable broad adoption.
Abstract
The translation equivariance of convolutional layers enables convolutional neural networks to generalize well on image problems. While translation equivariance provides a powerful inductive bias for images, we often additionally desire equivariance to other transformations, such as rotations, especially for non-image data. We propose a general method to construct a convolutional layer that is equivariant to transformations from any specified Lie group with a surjective exponential map. Incorporating equivariance to a new group requires implementing only the group exponential and logarithm maps, enabling rapid prototyping. Showcasing the simplicity and generality of our method, we apply the same model architecture to images, ball-and-stick molecular data, and Hamiltonian dynamical systems. For Hamiltonian systems, the equivariance of our models is especially impactful, leading to exact conservation of linear and angular momentum.
