SEAL - A Symmetry EncourAging Loss for High Energy Physics
Pradyun Hebbar, Thandikire Madula, Vinicius Mikuni, Benjamin Nachman, Nadav Outmezguine, Inbar Savoray
TL;DR
SEAL introduces soft symmetry constraints to encourage Lorentz invariance in neural networks without altering architecture. It defines two penalties, the group-level $\Gamma_G$ based on random Lorentz boosts and the infinitesimal $\Gamma_{\delta}$ based on infinitesimal generators, and integrates them into the training loss with a weight $\lambda$. In toy and jet-tagging experiments, SEAL improves invariance and extrapolation to unseen kinematic regions, while maintaining or improving performance, illustrating a flexible, data-efficient alternative to strictly equivariant networks. The approach is broadly applicable to other transformation groups and can complement symmetry-aware modeling in high-energy physics and beyond.
Abstract
Physical symmetries provide a strong inductive bias for constructing functions to analyze data. In particular, this bias may improve robustness, data efficiency, and interpretability of machine learning models. However, building machine learning models that explicitly respect symmetries can be difficult due to the dedicated components required. Moreover, real-world experiments may not exactly respect fundamental symmetries at the level of finite granularities and energy thresholds. In this work, we explore an alternative approach to create symmetry-aware machine learning models. We introduce soft constraints that allow the model to decide the importance of added symmetries during the learning process instead of enforcing exact symmetries. We investigate two complementary approaches, one that penalizes the model based on specific transformations of the inputs and one inspired by group theory and infinitesimal transformations of the inputs. Using top quark jet tagging and Lorentz equivariance as examples, we observe that the addition of the soft constraints leads to more robust performance while requiring negligible changes to current state-of-the-art models.
