Permutation Equivariant Neural Networks for Symmetric Tensors
Edward Pearce-Crump
TL;DR
This work addresses learning linear $S_n$-equivariant maps between symmetric power spaces $S^k(\mathbb{R}^n)$ and $S^l(\mathbb{R}^n)$. It provides two exact characterizations of all such maps through an orbit basis and a diagram basis, and introduces map label notation to enable memory-efficient implementation by avoiding explicit storage of large weight matrices. The authors show that these permutation-equivariant linear functions are highly data-efficient on synthetic tasks and can generalize across tensor sizes, with substantial computational speedups when using the diagram-based map-label approach. Collectively, the results enable scalable, exact, and transferable learning with symmetric tensors under permutation symmetry, with potential applications across physics, chemistry, and graph-structured data.
Abstract
Incorporating permutation equivariance into neural networks has proven to be useful in ensuring that models respect symmetries that exist in data. Symmetric tensors, which naturally appear in statistics, machine learning, and graph theory, are essential for many applications in physics, chemistry, and materials science, amongst others. However, existing research on permutation equivariant models has not explored symmetric tensors as inputs, and most prior work on learning from these tensors has focused on equivariance to Euclidean groups. In this paper, we present two different characterisations of all linear permutation equivariant functions between symmetric power spaces of $\mathbb{R}^n$. We show on two tasks that these functions are highly data efficient compared to standard MLPs and have potential to generalise well to symmetric tensors of different sizes.
