Rethinking Diffusion Models with Symmetries through Canonicalization with Applications to Molecular Graph Generation
Cai Zhou, Zijie Chen, Zian Li, Jike Wang, Kaiyi Jiang, Pan Li, Rose Yu, Muhan Zhang, Stephen Bates, Tommi Jaakkola
TL;DR
This work reframes diffusion and flow-based generative modeling for symmetric data by canonicalizing inputs to a slice that breaks symmetry, training with non-equivariant backbones, and restoring invariance via Haar randomization at generation. The authors establish a factorization of invariant measures through the canonical slice, prove universality and enhanced expressivity of canonical parameterizations, and show that canonicalization reduces both score mixture and conditional flow variance, speeding up training. For molecular graph generation under $S_N \times SE(3)$, they implement a geometric-spectra–driven canonicalization (via the Fiedler vector) and a novel Canon architecture (CanonFlow) that ingests canonical rank as an extra state, achieving state-of-the-art results on QM9 and GEOM-DRUG with efficient few-step generation. The combination of canonical training, aligned priors, and near-Monge couplings provides a practical, scalable approach that improves generation quality while preserving invariance, offering a principled alternative to fully equivariant models. Overall, canonical diffusion with orbit-space thinking yields faster convergence, better sample quality, and strong empirical gains in 3D molecular generation.
Abstract
Many generative tasks in chemistry and science involve distributions invariant to group symmetries (e.g., permutation and rotation). A common strategy enforces invariance and equivariance through architectural constraints such as equivariant denoisers and invariant priors. In this paper, we challenge this tradition through the alternative canonicalization perspective: first map each sample to an orbit representative with a canonical pose or order, train an unconstrained (non-equivariant) diffusion or flow model on the canonical slice, and finally recover the invariant distribution by sampling a random symmetry transform at generation time. Building on a formal quotient-space perspective, our work provides a comprehensive theory of canonical diffusion by proving: (i) the correctness, universality and superior expressivity of canonical generative models over invariant targets; (ii) canonicalization accelerates training by removing diffusion score complexity induced by group mixtures and reducing conditional variance in flow matching. We then show that aligned priors and optimal transport act complementarily with canonicalization and further improves training efficiency. We instantiate the framework for molecular graph generation under $S_n \times SE(3)$ symmetries. By leveraging geometric spectra-based canonicalization and mild positional encodings, canonical diffusion significantly outperforms equivariant baselines in 3D molecule generation tasks, with similar or even less computation. Moreover, with a novel architecture Canon, CanonFlow achieves state-of-the-art performance on the challenging GEOM-DRUG dataset, and the advantage remains large in few-step generation.
