Characterizing the hadronization of parton showers using the HOMER method
Benoit Assi, Christian Bierlich, Philip Ilten, Tony Menzo, Stephen Mrenna, Manuel Szewc, Michael K. Wilkinson, Ahmed Youssef, Jure Zupan
TL;DR
This work extends the HOMER method to the hadronization of strings with arbitrary numbers of gluons, enabling extraction of the Lund string fragmentation function $f(z)$ from data using only observable information. It introduces a gluon-aware modification to the three-step HOMER workflow and a smearing procedure over fragmentation-history neighborhoods to cope with the non-bijective mapping between fragmentation chains and observable events. Across four increasingly complex string configurations, the study demonstrates that $f(z)$ can be recovered with percent-level fidelity in simpler cases and with about $5\%$ deviations in the most realistic multi-gluon scenario, provided that the smearing hyperparameter $\sigma_{\mathrm{s}}$ is optimally chosen. The approach offers a data-driven path to constrain hadronization directly from measurements, with potential extensions to unbinned observables and full parton-shower simulations in real data analyses.
Abstract
We update the HOMER method, a technique to solve a restricted version of the inverse problem of hadronization -- extracting the Lund string fragmentation function $f(z)$ from data using only observable information. Here, we demonstrate its utility by extracting $f(z)$ from synthetic Pythia simulations using high-level observables constructed on an event-by-event basis, such as multiplicities and shape variables. Four cases of increasing complexity are considered, corresponding to $e^+e^-$ collisions at a center-of-mass energy of $90$ GeV producing either a string stretched between a $q$ and $\bar{q}$ containing no gluons; the same string containing one gluon $g$ with fixed kinematics; the same but the gluon has varying kinematics; and the most realistic case, strings with an unrestricted number of gluons that is the end-result of a parton shower. We demonstrate the extraction of $f(z)$ in each case, with the result of only a relatively modest degradation in performance of the HOMER method with the increased complexity of the string system.
