Economical Jet Taggers -- Equivariant, Slim, and Quantized
Antoine Petitjean, Tilman Plehn, Jonas Spinner, Ullrich Köthe
TL;DR
The paper addresses the challenge of resource-intensive jet tagging at the LHC by designing slim, quantized, Lorentz-equivariant transformers. It introduces L-GATr-slim, a scalar–vector latent architecture, and benchmarks it against LLoCa-Transformer across jet tagging, amplitude regression, and event generation, showing competitive performance with significantly reduced compute. The study demonstrates substantial resource savings—up to orders of magnitude in training efficiency and energy cost—while maintaining accuracy, aided by quantization-aware training via PARQ and STE. The results indicate strong potential for trigger-level jet tagging and online deployment, supported by public code and clear directions for further hardware-oriented optimizations.
Abstract
Modern machine learning is transforming jet tagging at the LHC, but the leading transformer architectures are large, not particularly fast, and training-intensive. We present a slim version of the L-GATr tagger, reduce the number of parameters of jet-tagging transformers, and quantize them. We compare different quantization methods for standard and Lorentz-equivariant transformers and estimate their gains in resource efficiency. We find a six-fold reduction in energy cost for an moderate performance decrease, down to 1000-parameter taggers. This might be a step towards trigger-level jet tagging with small and quantized versions of the leading equivariant transformer architectures.
