Testing the statistical compatibility of independent data sets
M. Maltoni, T. Schwetz
TL;DR
The paper tackles the limitation of standard goodness-of-fit in global, multi-parameter analyses by introducing the parameter goodness-of-fit (PG) test, which assesses compatibility across $D$ independent data sets within a single model. It derives the probability distribution of the PG statistic $\bar{\chi}^2_{\min}$ and shows it follows a $\chi^2$ distribution with $P_c$ degrees of freedom, analogous to the SG case, enabling exact p-values. Through theoretical proofs and concrete examples—including a two-experiment one-parameter scenario and a neutrino-oscillation analysis—the authors demonstrate that PG avoids dilution by data points insensitive to the crucial parameters and can reveal inter-set tensions that SG maymiss. The pull approach is discussed to extend PG to correlated data via nuisance parameters with penalty terms, broadening its applicability to realistic analyses. Overall, PG provides a principled, complementary framework for testing cross-experiment consistency in large, multi-parameter fits with practical impact for neutrino physics and beyond, where independent data subsets are common.
Abstract
We discuss a goodness-of-fit method which tests the compatibility between statistically independent data sets. The method gives sensible results even in cases where the chi^2-minima of the individual data sets are very low or when several parameters are fitted to a large number of data points. In particular, it avoids the problem that a possible disagreement between data sets becomes diluted by data points which are insensitive to the crucial parameters. A formal derivation of the probability distribution function for the proposed test statistic is given, based on standard theorems of statistics. The application of the method is illustrated on data from neutrino oscillation experiments, and its complementarity to the standard goodness-of-fit is discussed.
