SymDiff: Equivariant Diffusion via Stochastic Symmetrisation
Leo Zhang, Kianoosh Ashouritaklimi, Yee Whye Teh, Rob Cornish
TL;DR
SymDiff introduces stochastic symmetrisation to diffusion models to achieve $E(3)$-equivariance without relying on intrinsically equivariant neural networks. By upgrading an $\mathcal{H}$-equivariant kernel with a learnable $\gamma_\theta$ via a symmetrisation operator, the method yields a $(\mathcal{H}\times\mathcal{G})$-equivariant reverse process with a tractable training objective. The authors instantiate this for $N$-body systems on the CoM-free space, using scalable architectures like Diffusion Transformers and Set Transformers, and demonstrate strong gains on QM9 and GEOM-Drugs while reducing computational demands. They also show data augmentation is a special case of SymDiff, and provide extensions to score and flow models. Overall, SymDiff broadens the toolkit for equivariant generative modelling by combining stochastic symmetrisation with off-the-shelf components, enabling flexible, efficient, and accurate $N$-body molecule generation with practical impact in computational chemistry and related domains.
Abstract
We propose SymDiff, a method for constructing equivariant diffusion models using the framework of stochastic symmetrisation. SymDiff resembles a learned data augmentation that is deployed at sampling time, and is lightweight, computationally efficient, and easy to implement on top of arbitrary off-the-shelf models. In contrast to previous work, SymDiff typically does not require any neural network components that are intrinsically equivariant, avoiding the need for complex parameterisations or the use of higher-order geometric features. Instead, our method can leverage highly scalable modern architectures as drop-in replacements for these more constrained alternatives. We show that this additional flexibility yields significant empirical benefit for $\mathrm{E}(3)$-equivariant molecular generation. To the best of our knowledge, this is the first application of symmetrisation to generative modelling, suggesting its potential in this domain more generally.
