Full Event Particle-Level Unfolding with Variable-Length Latent Variational Diffusion
Alexander Shmakov, Kevin Greif, Michael James Fenton, Aishik Ghosh, Pierre Baldi, Daniel Whiteson
TL;DR
This work tackles the problem of detector-effects unfolding in collider data by extending generative diffusion models to variable-length, full-event outputs. It introduces a Particle VAE, a Detector Encoder, and a Multiplicity Predictor to build a set-valued latent representation $(X|Y)$, and trains a Latent Diffusion Process with a set-aware denoising network to model $P(X|Y)$. The model is learned via an end-to-end ELBO that combines Prior Loss, Reconstruction Loss, Denoising Loss, and Multiplicity Loss, with inference sampling the event multiplicity from a Gamma distribution and performing reverse diffusion to produce particle-level observables. Applied to semi-leptonic $t\bar{t}$ events at the LHC, the method achieves close agreement with truth-level distributions across many observables and demonstrates robustness to prior changes, highlighting a path toward post-hoc re-interpretation of unfolded data while noting areas for improvement in sharp features and top-quark kinematics.
Abstract
The measurements performed by particle physics experiments must account for the imperfect response of the detectors used to observe the interactions. One approach, unfolding, statistically adjusts the experimental data for detector effects. Recently, generative machine learning models have shown promise for performing unbinned unfolding in a high number of dimensions. However, all current generative approaches are limited to unfolding a fixed set of observables, making them unable to perform full-event unfolding in the variable dimensional environment of collider data. A novel modification to the variational latent diffusion model (VLD) approach to generative unfolding is presented, which allows for unfolding of high- and variable-dimensional feature spaces. The performance of this method is evaluated in the context of semi-leptonic top quark pair production at the Large Hadron Collider.
