Symmetry-Aware Generative Modeling through Learned Canonicalization
Kusha Sareen, Daniel Levy, Arnab Kumar Mondal, Sékou-Oumar Kaba, Tara Akhound-Sadegh, Siamak Ravanbakhsh
TL;DR
The paper argues that modeling a symmetric density does not require learning the full invariant distribution $p_X$. It introduces a learned canonicalization function $h$ that assigns each input a group element, so that the orbit representative map $c(x) = h(x)^{-1}x$ sends every input to a canonical pose; the distribution over orbits $p_{X/G}$ is then modeled with a non-equivariant diffusion generator. Empirical results on RotMNIST and QM9 show that canon+GDM yields higher-quality samples and significantly faster inference than fully equivariant baselines, while a frozen-canonicalizer baseline still produces stable molecules. The approach is symmetry-aware and architecture-agnostic: it preserves inductive biases while enabling flexible, efficient generative modeling in symmetry-rich domains, with potential applications in AI for science.
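As a short worked check of why $c$ picks one representative per orbit: assume $h$ is $G$-equivariant in the sense $h(gx) = g\,h(x)$ for all $g \in G$. The abstract describes the canonicalization network as group-equivariant, but this exact form of the condition is an assumption made here. Under it, $c$ is invariant along each orbit:

$$
c(gx) = h(gx)^{-1}(gx) = \bigl(g\,h(x)\bigr)^{-1} g x = h(x)^{-1} g^{-1} g\, x = h(x)^{-1} x = c(x).
$$

So all points on the same orbit share one canonical pose, which is the sample the non-equivariant generator is trained on.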
Abstract
Generative modeling of symmetric densities has a range of applications in AI for science, from drug discovery to physics simulations. The existing generative modeling paradigm for invariant densities combines an invariant prior with an equivariant generative process. However, we observe that this technique is not necessary and has several drawbacks resulting from the limitations of equivariant networks. Instead, we propose to model a learned slice of the density so that only one representative element per orbit is learned. To accomplish this, we learn a group-equivariant canonicalization network that maps training samples to a canonical pose and train a non-equivariant generative model over these canonicalized samples. We implement this idea in the context of diffusion models. Our preliminary experimental results on molecular modeling are promising, demonstrating improved sample quality and faster inference time.
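The canonicalization step can be illustrated with a minimal sketch, under assumptions that are not taken from the paper: the group is planar rotations $SO(2)$ acting on 2D point clouds, and the learned network is replaced by a hand-crafted equivariant rule. The names `rotation`, `canonicalizer_angle`, and `canonicalize` are hypothetical and chosen only for this illustration.

```python
import numpy as np

def rotation(theta):
    """2x2 matrix for a counter-clockwise rotation by theta radians."""
    c, s = np.cos(theta), np.sin(theta)
    return np.array([[c, -s], [s, c]])

def canonicalizer_angle(x):
    """Hand-crafted stand-in for the learned canonicalizer h (an assumption).

    x has shape (n, 2), one point per row. The reference vector
    v(x) = sum_i ||x_i||^2 x_i rotates together with the cloud, so the
    angle of v is an equivariant choice of group element.
    """
    weights = np.sum(x**2, axis=1, keepdims=True)  # rotation-invariant weights
    v = np.sum(weights * x, axis=0)                # rotates with the point cloud
    return np.arctan2(v[1], v[0])

def canonicalize(x):
    """Orbit representative c(x) = h(x)^{-1} x for points stored as rows."""
    theta = canonicalizer_angle(x)
    return x @ rotation(-theta).T  # apply the inverse rotation to every point

# Two clouds on the same orbit should map to the same canonical pose.
rng = np.random.default_rng(0)
x = rng.normal(size=(5, 2))
x_rotated = x @ rotation(1.234).T
assert np.allclose(canonicalize(x), canonicalize(x_rotated))
```

In this sketch a non-equivariant generative model would be trained on the outputs of `canonicalize`; in the paper the canonicalizer is a learned network rather than a fixed rule like the one above.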
