Improving Equivariant Networks with Probabilistic Symmetry Breaking

Hannah Lawrence; Vasco Portilheiro; Yan Zhang; Sékou-Oumar Kaba

Improving Equivariant Networks with Probabilistic Symmetry Breaking

Hannah Lawrence, Vasco Portilheiro, Yan Zhang, Sékou-Oumar Kaba

TL;DR

This work tackles the limitation of equivariant networks that enforce self-symmetry by reframing prediction as sampling from equivariant conditional distributions ${\mathbb{P}(Y|X)}$ and introducing randomized canonicalization to enable symmetry breaking. The authors prove a representation theorem using an inversion kernel and propose SymPE, a symmetry-breaking positional encoding that injects a sampled group element into inputs in an equivariant manner. They also show that equivariant noise injection sits within the same representational class, derive generalization benefits for symmetry-breaking, and connect to existing relaxed-equivariance frameworks. Empirically, SymPE improves performance across graph autoencoding, diffusion-based graph generation, and Ising-model ground-state prediction, demonstrating robust symmetry-breaking while preserving the inductive bias of symmetry. Overall, the approach provides a principled, end-to-end framework for expanding the expressive power of equivariant networks without discarding their symmetry-driven generalization advantages.

Abstract

Equivariance encodes known symmetries into neural networks, often enhancing generalization. However, equivariant networks cannot break symmetries: the output of an equivariant network must, by definition, have at least the same self-symmetries as the input. This poses an important problem, both (1) for prediction tasks on domains where self-symmetries are common, and (2) for generative models, which must break symmetries in order to reconstruct from highly symmetric latent spaces. This fundamental limitation can be addressed by considering equivariant conditional distributions, instead of equivariant functions. We present novel theoretical results that establish necessary and sufficient conditions for representing such distributions. Concretely, this representation provides a practical framework for breaking symmetries in any equivariant network via randomized canonicalization. Our method, SymPE (Symmetry-breaking Positional Encodings), admits a simple interpretation in terms of positional encodings. This approach expands the representational power of equivariant networks while retaining the inductive bias of symmetry, which we justify through generalization bounds. Experimental results demonstrate that SymPE significantly improves performance of group-equivariant and graph neural networks across diffusion models for graphs, graph autoencoders, and lattice spin system modeling.

Improving Equivariant Networks with Probabilistic Symmetry Breaking

TL;DR

This work tackles the limitation of equivariant networks that enforce self-symmetry by reframing prediction as sampling from equivariant conditional distributions

and introducing randomized canonicalization to enable symmetry breaking. The authors prove a representation theorem using an inversion kernel and propose SymPE, a symmetry-breaking positional encoding that injects a sampled group element into inputs in an equivariant manner. They also show that equivariant noise injection sits within the same representational class, derive generalization benefits for symmetry-breaking, and connect to existing relaxed-equivariance frameworks. Empirically, SymPE improves performance across graph autoencoding, diffusion-based graph generation, and Ising-model ground-state prediction, demonstrating robust symmetry-breaking while preserving the inductive bias of symmetry. Overall, the approach provides a principled, end-to-end framework for expanding the expressive power of equivariant networks without discarding their symmetry-driven generalization advantages.

Improving Equivariant Networks with Probabilistic Symmetry Breaking

TL;DR

Abstract

Improving Equivariant Networks with Probabilistic Symmetry Breaking

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (7)

Theorems & Definitions (26)