A Generative Model of Symmetry Transformations
James Urquhart Allingham, Bruno Kacper Mlodozeniec, Shreyas Padhy, Javier Antorán, David Krueger, Richard E. Turner, Eric Nalisnick, José Miguel Hernández-Lobato
TL;DR
The paper introduces the Symmetry-aware Generative Model (SGM), a two-stage generative framework that separates an invariant prototype $\\hat{{\mathbf{x}}}$ from an equivariant transformation latent $\\bm{\upeta}$ so that ${\mathbf{x}} = {\\mathcal{T}}_{\\bm{\upeta}}(\\hat{{\mathbf{x}}})$. By modeling a group of transformations and learning $p(\\bm{\upeta}|\\hat{{\mathbf{x}}})$ alongside a self-supervised invariant mapping $f_{\\omega}$, the method captures the distribution of naturally occurring symmetries without requiring a distribution over prototypes. The approach yields interpretable symmetry representations, enables natural data augmentation, and improves marginal log-likelihoods and data efficiency when integrated with VAEs, as demonstrated on datasets with affine and color transformations (e.g., dSprites, MNIST, GalaxyMNIST). However, it requires specifying a superset of possible symmetries and exhibits limitations in boundary-content datasets, motivating future work to relax symmetry sets and handle boundary effects. Overall, SGM offers a principled, group-theoretic avenue for uncovering and leveraging data symmetries in generative modeling and beyond.
Abstract
Correctly capturing the symmetry transformations of data can lead to efficient models with strong generalization capabilities, though methods incorporating symmetries often require prior knowledge. While recent advancements have been made in learning those symmetries directly from the dataset, most of this work has focused on the discriminative setting. In this paper, we take inspiration from group theoretic ideas to construct a generative model that explicitly aims to capture the data's approximate symmetries. This results in a model that, given a prespecified but broad set of possible symmetries, learns to what extent, if at all, those symmetries are actually present. Our model can be seen as a generative process for data augmentation. We provide a simple algorithm for learning our generative model and empirically demonstrate its ability to capture symmetries under affine and color transformations, in an interpretable way. Combining our symmetry model with standard generative models results in higher marginal test-log-likelihoods and improved data efficiency.
