MatrixNet: Learning over symmetry groups using learned group representations
Lucas Laird, Circe Hsu, Asilata Bapat, Robin Walters
TL;DR
MatrixNet introduces learned matrix representations for group elements so that neural models can operate directly on symmetry groups. Each generator is encoded as an invertible matrix, and a group element written as a word in the generators is encoded as the product of the corresponding matrices. Group structure is enforced through the design of the matrix blocks and an auxiliary relation loss, with variants that use linear, nonlinear, and block-diagonal designs. Empirical results show strong performance on order prediction in finite groups and on braid-group prediction tasks, including generalization to longer words and to unseen group elements. The work advances learning over algebraic structures, offering improved data efficiency and potential interpretability through connections to irreducible subspaces and categorical braid actions.
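To make the idea of composing generator matrices and penalizing violated relations concrete, here is a minimal sketch in plain Python. It is an illustration under stated assumptions, not the authors' implementation: the function names `word_to_matrix` and `relation_loss` are hypothetical, the fixed permutation matrices stand in for parameters that a model would learn, and the squared Frobenius distance to the identity is one plausible form of a relation penalty. The example uses the symmetric group S3 with generators `a`, `b` and relations a^2 = e, b^2 = e, (ab)^3 = e.

```python
from functools import reduce


def matmul(a, b):
    """Multiply two square matrices given as lists of rows."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def identity(n):
    """Return the n x n identity matrix."""
    return [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]


def word_to_matrix(word, gens):
    """Map a word (string of generator names) to the product of its generator matrices.

    Hypothetical helper: `gens` maps each generator name to a square matrix.
    The empty word maps to the identity.
    """
    n = len(next(iter(gens.values())))
    return reduce(matmul, (gens[g] for g in word), identity(n))


def relation_loss(relations, gens):
    """Sum of squared Frobenius distances between each relation word's matrix and the identity.

    Assumed form of the penalty: it is zero exactly when every relation word
    is mapped to the identity matrix.
    """
    total = 0.0
    for rel in relations:
        m = word_to_matrix(rel, gens)
        n = len(m)
        eye = identity(n)
        total += sum((m[i][j] - eye[i][j]) ** 2
                     for i in range(n) for j in range(n))
    return total


# Stand-ins for learned generator matrices: two transpositions generating S3.
gens = {
    "a": [[0.0, 1.0, 0.0], [1.0, 0.0, 0.0], [0.0, 0.0, 1.0]],
    "b": [[1.0, 0.0, 0.0], [0.0, 0.0, 1.0], [0.0, 1.0, 0.0]],
}
relations = ["aa", "bb", "ababab"]  # a^2 = e, b^2 = e, (ab)^3 = e

# A perturbed copy of the generators that no longer satisfies the relations.
perturbed = {
    "a": [[0.1, 1.0, 0.0], [1.0, 0.0, 0.0], [0.0, 0.0, 1.0]],
    "b": gens["b"],
}

exact_loss = relation_loss(relations, gens)
perturbed_loss = relation_loss(relations, perturbed)
```

Because a word's matrix is a product of generator matrices, any two words that are equal in the group get the same matrix whenever the relation penalty is zero, regardless of word length. In this sketch `exact_loss` vanishes for the permutation matrices, while `perturbed_loss` is strictly positive.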
Abstract
Group theory has been used in machine learning to provide a theoretically grounded approach for incorporating known symmetry transformations in tasks from robotics to protein modeling. In these applications, equivariant neural networks use known symmetry groups with predefined representations to learn over geometric input data. We propose MatrixNet, a neural network architecture that learns matrix representations of group element inputs instead of using predefined representations. MatrixNet achieves higher sample efficiency and better generalization than several standard baselines in prediction tasks over several finite groups and the Artin braid group. We also show that MatrixNet respects group relations, allowing generalization to group elements of greater word length than those in the training set.
