Discriminative versus Generative Approaches to Simulation-based Inference
Benjamin Sluijter, Sascha Diefenbacher, Wahid Bhimji, Benjamin Nachman
TL;DR
The paper directly compares two approaches to neural simulation-based inference (NSBI) for parameter estimation in high-dimensional collider data: discriminative NSBI, which estimates likelihood ratios by classification, and generative NSBI, which estimates the density directly. The problem is formulated as a mixture model $p(x|\mu,z)=\frac{\mu}{\mu+1}p_{\text{sig}}(x|z)+\frac{1}{\mu+1}p_{\text{back}}(x|z)$ with nuisance parameter $z$. The target is $\mu$, which sets the signal fraction $\frac{\mu}{\mu+1}$, and it is inferred either by direct density estimation with conditional normalizing flows or by likelihood-ratio estimation with a parameterized classifier. On both the Gaussian and Higgs datasets, the two approaches recover $\mu$ with reasonable uncertainty. Within the explored hyperparameters, the likelihood-ratio method is generally more accurate and/or precise, though both methods show training-induced variability that motivates ensembling and calibration. The work demonstrates that NSBI can outperform histogram-based methods by exploiting unbinned, high-dimensional information, but practical deployment requires careful hyperparameter tuning, substantial computation, and ensemble strategies, especially for higher-dimensional problems. It also provides guidance for applying NSBI to collider physics and points to future directions in gradient-based inference and calibration techniques.
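To make the mixture model concrete, the following is a minimal sketch of how $\mu$ could be extracted from it by a likelihood scan, once through the density itself and once through the signal-to-background likelihood ratio. Everything specific here is an assumption for illustration and not the paper's setup: the one-dimensional Gaussian forms of `p_sig` and `p_back`, the way the nuisance `z` shifts both means, the values `Z` and `MU_TRUE`, and the use of exact densities in place of a trained normalizing flow or classifier.

```python
import numpy as np

def gauss_pdf(x, mean, sigma):
    """1D Gaussian density; stands in for a learned density (assumption)."""
    return np.exp(-0.5 * ((x - mean) / sigma) ** 2) / (sigma * np.sqrt(2.0 * np.pi))

# Hypothetical toy configuration, chosen only for illustration.
Z = 0.0        # nuisance parameter, held fixed here
MU_TRUE = 0.5  # true mu, i.e. a signal fraction of mu / (mu + 1) = 1/3

def p_sig(x, z):
    return gauss_pdf(x, 1.5 + z, 0.5)

def p_back(x, z):
    return gauss_pdf(x, z, 1.0)

def sample(n, mu, z, rng):
    """Draw n events from the mixture p(x | mu, z)."""
    is_sig = rng.random(n) < mu / (mu + 1.0)
    return np.where(is_sig, rng.normal(1.5 + z, 0.5, n), rng.normal(z, 1.0, n))

def nll_density(mu, x, z):
    """Negative log-likelihood from the mixture density (generative view)."""
    p = (mu * p_sig(x, z) + p_back(x, z)) / (mu + 1.0)
    return -np.sum(np.log(p))

def nll_ratio(mu, x, z):
    """Negative log of p(x | mu, z) / p_back(x | z) (discriminative view).

    Only the ratio r = p_sig / p_back is needed; it differs from
    nll_density by a term that does not depend on mu.
    """
    r = p_sig(x, z) / p_back(x, z)
    return -np.sum(np.log((mu * r + 1.0) / (mu + 1.0)))

rng = np.random.default_rng(0)
x = sample(20000, MU_TRUE, Z, rng)

# Scan mu on a grid and take the minimum of each objective.
mu_grid = np.linspace(0.01, 2.0, 400)
mu_hat_density = mu_grid[np.argmin([nll_density(m, x, Z) for m in mu_grid])]
mu_hat_ratio = mu_grid[np.argmin([nll_ratio(m, x, Z) for m in mu_grid])]

print("mu from density scan:", mu_hat_density)
print("mu from ratio scan:  ", mu_hat_ratio)
```

With exact densities the two scans share the same minimum, because the background density does not depend on $\mu$. In a learned setting the ratio would instead come from a classifier; for a classifier trained on balanced signal and background samples with output $s(x)$, the standard estimate is $r(x) \approx s(x)/(1-s(x))$, and the two approaches can then differ through their respective approximation errors.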
Abstract
Most of the fundamental, emergent, and phenomenological parameters of particle and nuclear physics are determined through parametric template fits. Simulations are used to populate histograms, which are then matched to data. This approach is inherently lossy, since histograms are binned and low-dimensional. Deep learning has enabled unbinned and high-dimensional parameter estimation through neural likelihood(-ratio) estimation. We compare two approaches for neural simulation-based inference (NSBI): one based on discriminative learning (classification) and one based on generative modeling. The two approaches are evaluated directly on the same datasets, with a similar level of hyperparameter optimization in both cases. In addition to a Gaussian dataset, we study NSBI using a Higgs boson dataset from the FAIR Universe Challenge. We find that both direct likelihood estimation and likelihood-ratio estimation are able to extract parameters effectively with reasonable uncertainties. For the numerical examples and within the set of hyperparameters studied, we find that the likelihood-ratio method is more accurate and/or precise. Both methods show a significant spread from the network training and would require ensembling or other mitigation strategies in practice.
