Flow Matching Beyond Kinematics: Generating Jets with Particle-ID and Trajectory Displacement Information
Joschka Birk, Erik Buhmann, Cedric Ewen, Gregor Kasieczka, David Shih
TL;DR
This work develops a permutation-equivariant continuous normalizing flow (CNF) to generate jet constituents at the particle level, trained via flow matching on the expansive JetClass dataset. By conditioning on jet type, a single model captures ten jet classes and models beyond-kinematic features such as particle IDs and trajectory displacement, using EPiC layers to handle permutation-invariant point clouds. The approach achieves strong agreement with real jets across constituent kinematics and jet substructure, demonstrated through KL divergences, Fréchet distances, and a classifier test with AUCs up to 0.829, while also highlighting ongoing challenges in complex 3-prong topologies. The work expands jet generative modeling beyond kinematics, enabling richer benchmarks and potential improvements in anomaly detection and differentiable analyses, with code released for reproducibility.
Abstract
We introduce the first generative model trained on the JetClass dataset. Our model generates jets at the constituent level, and it is a permutation-equivariant continuous normalizing flow (CNF) trained with the flow matching technique. It is conditioned on the jet type, so that a single model can be used to generate the ten different jet types of JetClass. For the first time, we also introduce a generative model that goes beyond the kinematic features of jet constituents. The JetClass dataset includes more features, such as particle-ID and track impact parameter, and we demonstrate that our CNF can accurately model all of these additional features as well. Our generative model for JetClass expands on the versatility of existing jet generation techniques, enhancing their potential utility in high-energy physics research, and offering a more comprehensive understanding of the generated jets.
