Profiling systematic uncertainties in Simulation-Based Inference with Factorizable Normalizing Flows
Davide Valsecchi, Mauro Donegà, Rainer Wallny
TL;DR
The paper tackles the challenge of profiling high-dimensional systematic uncertainties in unbinned likelihood analyses by introducing a Simulation-Based Inference framework based on Factorizable Normalizing Flows (FNF). It defines Distributions of Interest (DoI) as learnable invertible transformations of the feature space, enabling functional, distribution-wide measurements beyond scalar parameters. A two-tier approach combines a nominal density with a modular, linear/quadratic deformation for nuisances, together with amortized training that maps nuisance configurations to DoI deformations, and an orthogonal decomposition to interpret dominant uncertainty modes. The method is validated on a synthetic high-energy physics-like dataset, showing scalable profiling, accurate DoI recovery, and robust uncertainty quantification, with potential applications to unfolding and differential cross-section measurements in complex analyses.
Abstract
Unbinned likelihood fits aim at maximizing the information one can extract from experimental data, yet their application in realistic statistical analyses is often hindered by the computational cost of profiling systematic uncertainties. Additionally, current machine learning-based inference methods are typically limited to estimating scalar parameters in a multidimensional space rather than full differential distributions. We propose a general framework for Simulation-Based Inference (SBI) that efficiently profiles nuisance parameters while measuring multivariate Distributions of Interest (DoI), defined as learnable invertible transformations of the feature space. We introduce Factorizable Normalizing Flows to model systematic variations as parametric deformations of a nominal density, preserving tractability without combinatorial explosion. Crucially, we develop an amortized training strategy that learns the conditional dependence of the DoI on nuisance parameters in a single optimization process, bypassing the need for repetitive training during the likelihood scan. This allows for the simultaneous extraction of the underlying distribution and the robust profiling of nuisances. The method is validated on a synthetic dataset emulating a high-energy physics measurement with multiple systematic sources, demonstrating its potential for unbinned, functional measurements in complex analyses.
