Breaking Symmetry Bottlenecks in GNN Readouts
Mouad Talhi, Arne Wolf, Anthea Monod
TL;DR
This work identifies a fundamental expressivity bottleneck in GNNs at the readout stage: any linear permutation-invariant readout collapses node embeddings to the trivial symmetry component via the Reynolds projection, discarding non-trivial symmetry-aware information regardless of encoder power. To overcome this, the authors propose projector-based invariant readouts that decompose node representations into symmetry-aware channels using equivariant projectors and summarize each channel with nonlinear invariant statistics, thereby preserving permutation invariance while retaining information invisible to averaging. Grounded in representation theory, they establish a factorization result for invariant readouts and demonstrate a practical, graph-dependent two-stage readout that avoids the averaging bottleneck. Empirically, swapping only the readout enables fixed encoders to distinguish WL-hard graph pairs and improves performance on symmetry-heavy benchmarks and several downstream tasks, underscoring the readout as a decisive factor in GNN expressivity.
Abstract
Graph neural networks (GNNs) are widely used for learning on structured data, yet their ability to distinguish non-isomorphic graphs is fundamentally limited. These limitations are usually attributed to message passing; in this work we show that an independent bottleneck arises at the readout stage. Using finite-dimensional representation theory, we prove that all linear permutation-invariant readouts, including sum and mean pooling, factor through the Reynolds (group-averaging) operator and therefore project node embeddings onto the fixed subspace of the permutation action, erasing all non-trivial symmetry-aware components regardless of encoder expressivity. This yields both a new expressivity barrier and an interpretable characterization of what global pooling preserves or destroys. To overcome this collapse, we introduce projector-based invariant readouts that decompose node representations into symmetry-aware channels and summarize them with nonlinear invariant statistics, preserving permutation invariance while retaining information provably invisible to averaging. Empirically, swapping only the readout enables fixed encoders to separate WL-hard graph pairs and improves performance across multiple benchmarks, demonstrating that readout design is a decisive and under-appreciated factor in GNN expressivity.
