Versatile Energy-Based Probabilistic Models for High Energy Physics
Taoli Cheng, Aaron Courville
TL;DR
This work introduces a versatile energy-based probabilistic framework for LHC events that uses a transformer-based energy function to model higher-order inter-particle interactions and to support implicit generation via short-run Langevin dynamics. It demonstrates three capabilities: parameterized event generation, model-independent anomaly detection free from spurious correlations, and a hybrid classifier-EBM (EBM-CLF) that unifies jet classification with generation. The approach offers a data-driven path toward fast, reliable physics event simulation and robust new-physics searches, with evidence of realistic jet generation and improved OOD tagging. Overall, the framework provides a multitasking, physics-informed alternative to traditional Monte Carlo methods for high-energy physics analyses.
Abstract
As a classical generative modeling approach, energy-based models have the natural advantage of flexibility in the form of the energy function. Recently, energy-based models have achieved great success in modeling high-dimensional data in computer vision and natural language processing. In line with these advancements, we build a multi-purpose energy-based probabilistic model for High Energy Physics events at the Large Hadron Collider. This framework builds on a powerful generative model and describes higher-order inter-particle interactions. It suits different encoding architectures and builds on implicit generation. As for applicative aspects, it can serve as a powerful parameterized event generator for physics simulation, a generic anomalous signal detector free from spurious correlations, and an augmented event classifier for particle identification.
