Accelerating multijet-merged event generation with neural network matrix element surrogates
Tim Herrmann, Timo Janßen, Mathis Schenker, Steffen Schumann, Frank Siegert
TL;DR
The work tackles the heavy computational burden of simulating multijet final states at the HL-LHC by introducing a two-stage rejection-sampling scheme that uses neural-network surrogates to unweight matrix elements. It extends previous surrogate methods to tree-level multijet merging, integrating seamlessly with Sherpa’s phase-space biasing, colour treatments, and Sudakov vetoes, and trains surrogates on a reduced basis of process groups for efficiency. Applied to inclusive $Z+$jets production with up to six final-state partons, the approach yields large CPU-time reductions—approximately a factor of $11$ for $Z+6$ jets and $3.3$ for $Z+5$ jets—without compromising physics, as validated by Rivet. The results demonstrate the practicality of fast, unbiased, high-statistics simulations at HL-LHC, with potential extensions to NLO/one-loop elements and broader processes.
Abstract
The efficient simulation of multijet final states presents a serious computational task for analyses of LHC data and will be even more so at the HL-LHC. We here discuss means to accelerate the generation of unweighted events based on a two-stage rejection-sampling algorithm that employs neural-network surrogates for unweighting the hard-process matrix elements. To this end, we generalise the previously proposed algorithm based on factorisation-aware neural networks to the case of multijet merging at tree-level accuracy. We thereby account for several non-trivial aspects of realistic event-simulation setups, including biased phase-space sampling, partial unweighting, and the mapping of partonic subprocesses. We apply our methods to the production of Z+jets final states at the HL-LHC using the Sherpa event generator, including matrix elements with up to six final-state partons. When using neural-network surrogates for the dominant Z+5 jets and Z+6 jets partonic processes, we find a reduction in the total event-generation time by more than a factor of 10 compared to baseline Sherpa.
