A Field Guide to Event-Shape Observables Using Optimal Transport
Cari Cesarotti, Matt LeBlanc
TL;DR
This work develops a practical framework for event-shape observables defined via the Earth Mover's Distance between collider events and reference geometries, introducing manifold distances that depend on the choice of reference geometry and distance metric. By analyzing isotropic and dipole references and a family of angular distance exponents $\beta$, the authors demonstrate how these choices modulate sensitivity to different QCD structures and hadronization effects, using analytic insights and Pythia-based simulations of hard and soft QCD events. The key contributions include characterizing the dependence on $\beta$, illustrating the gains from multi-differential analyses, and providing actionable prescriptions for tuning observables to the physics of interest, with a practical demonstration that simple combinations like $\bar{\mathcal{I}}/N$ can achieve near-perfect separation of event classes. The findings offer a versatile, geometry-aware toolkit for LHC analyses and future colliders, enabling robust searches for BSM phenomena by tailoring manifold distances to the targeted event topologies, supported by open-source code for computing these observables.
Abstract
We lay out the phenomenological behavior of event-shape observables evaluated by solving optimal transport problems between collider events and reference geometries -- which we name 'manifold distances' -- to provide guidance regarding their use in future studies. This discussion considers several choices related to the metric used to quantify these distances. We explore the differences between the various options, using a combination of analytical studies and simulated minimum-bias and multi-jet events. Making judicious choices when defining the metric and reference geometry can improve sensitivity to interesting signal features and reduce sensitivity to non-perturbative effects in QCD. The goal of this article is to provide a 'field guide' that can inform how choices made when defining a manifold distance can be tailored for the analysis at-hand.
