Jet Substructure at the Large Hadron Collider: A Review of Recent Advances in Theory and Machine Learning
Andrew J. Larkoski, Ian Moult, Benjamin Nachman
TL;DR
This review surveys the rapid progress in jet substructure at the LHC from both theory and machine learning perspectives. It synthesizes the analytic framework for calculating jet observables, including resummation, fixed-order corrections, non-perturbative effects, and grooming, with an emphasis on prong-structure observables and new frontiers like NGLs and jet-radius logarithms. It then surveys a broad spectrum of ML approaches for jet representations, tagging, calibration, simulation, and anomaly detection, highlighting how data-driven methods complement and extend traditional QCD techniques. The article also discusses data openness, physics-driven observable design, and data-theory collaboration paths, aiming to guide precise QCD studies and robust jet tagging in the LHC era and beyond.
Abstract
Jet substructure has emerged to play a central role at the Large Hadron Collider (LHC), where it has provided numerous innovative new ways to search for new physics and to probe the Standard Model in extreme regions of phase space. In this article we provide a comprehensive review of state of the art theoretical and machine learning developments in jet substructure. This article is meant both as a pedagogical introduction, covering the key physical principles underlying the calculation of jet substructure observables, the development of new observables, and cutting edge machine learning techniques for jet substructure, as well as a comprehensive reference for experts. We hope that it will prove a useful introduction to the exciting and rapidly developing field of jet substructure at the LHC.
