Unbinned Inference with Correlated Events
Krish Desai, Owen Long, Benjamin Nachman
TL;DR
The paper tackles the problem that event correlations induced by unfolding invalidate standard unbinned inference assumptions. It uses OmniFold to perform unbinned unfolding and then conducts parameter inference on Gaussian toy models across 1D to6D, comparing unbinned ML and binned $\chi^2$ approaches with full or diagonal covariances. It shows that ignoring unfolding-induced correlations can significantly underestimate uncertainties, while numerical approaches (e.g., bootstrap) yield valid coverage; in higher dimensions, the RMS uncertainty systematically exceeds asymptotic estimates by about 18–28%. The work provides practical guidance for analyzing unbinned unfolded data, arguing against relying on asymptotic formulas until a proper correlated formalism is developed, and it offers code and concrete recommendations for covariance-aware inference.
Abstract
Modern machine learning has enabled parameter inference from event-level data without the need to first summarize all events with a histogram. All of these unbinned inference methods make use of the fact that the events are statistically independent so that the log likelihood is a sum over events. However, this assumption is not valid for unbinned inference on unfolded data, where the deconvolution process induces a correlation between events. We explore the impact of event correlations on downstream inference tasks in the context of the OmniFold unbinned unfolding method. We find that uncertainties may be significantly underestimated when event correlations are excluded from uncertainty quantification.
