Energy-Energy Flow Networks
Arianna Garcia Caffaro, Ian Moult, Chase Shimmin
TL;DR
The paper addresses how neural network jet taggers can be misled by non-perturbative QCD effects and proposes EnFN and PnFN architectures to directly exploit higher-point jet correlations while enforcing infrared and collinear safety. It introduces perturbative regularization via jet re-clustering and develops RMS-based robustness metrics plus Pareto visualizations to quantify resilience to hadronization. Through a Z-boson tagging study, EnFN/E2FN demonstrate competitive performance with strong robustness, while ParticleNet remains high-performing but more sensitive to non-perturbative systematics. The work provides a framework for balancing performance and theoretical reliability in ML-driven jet searches, with implications for improved theory-experiment synergy in high energy physics.
Abstract
Jet substructure provides one of the most exciting new approaches for searching for physics in and beyond the Standard Model at the Large Hadron Collider. Modern jet substructure searches are often performed with Neural Network (NN) taggers which study the jets' radiation distributions in great detail, far beyond what is theoretically described by parton shower generators. While this represents a great opportunity, as NNs look deeper into the structure of jets they become increasingly sensitive both to perturbative and non-perturbative theoretical uncertainties. It is therefore important to be able to control which aspects of both regimes the networks focus on, and to develop techniques for quantifying these uncertainties. In this paper we take two steps in this direction: First, we introduce EnFNs, a generalization of the Energy Flow Networks (EFNs) which directly probes higher point correlations in jets, as motivated by recent advances in the study of energy correlators. Second, we introduce a number of techniques to quantify and visualize their robustness to non-perturbative corrections. We highlight the importance of such considerations in a toy study incorporating systematics into a search, and maximizing for the network's discovery significance, as opposed to absolute tagging performance. We hope this study continues the interest in understanding the role QCD systematics play in Machine Learning applications and opens the door to a better interplay between theory and experiment in HEP.
