Communicating Likelihoods with Normalising Flows
Jack Y. Araz, Anja Beck, Méril Reboud, Michael Spannowsky, Danny van Dyk
TL;DR
This work addresses the challenge of sharing and reusing unbinned likelihoods in high-energy physics by learning a likelihood model (LM) with normalizing flows that map samples to a known base distribution and validating the joint distribution via KS tests. The method includes a standardized training pipeline, a robust loss based on the unbinned likelihood in base space, and a KS-based stopping criterion, implemented in the open-source nabu platform. Through three real-world case studies, the authors demonstrate that the LM can capture complex, non-Gaussian features with strong correlations while achieving compact storage relative to the training data. The results support reliable reinterpretation and downstream inference, and the work outlines clear paths to broader adoption and integration with differentiable workflows and future ML techniques.
Abstract
We present a machine-learning-based workflow to model an unbinned likelihood from its samples. A key advancement over existing approaches is the validation of the learned likelihood using rigorous statistical tests of the joint distribution, such as the Kolmogorov-Smirnov test of the joint distribution. Our method enables the reliable communication of experimental and phenomenological likelihoods for subsequent analyses. We demonstrate its effectiveness through three case studies in high-energy physics. To support broader adoption, we provide an open-source reference implementation, nabu.
