On the statistical analysis of grouped data: when Pearson $χ^2$ and other divisible statistics are not goodness-of-fit tests
Sara Algeri, Estate V. Khmaladze
TL;DR
This work develops a unifying, linear-functional framework for divisible statistics in the analysis of grouped Poisson data, addressing the common practice that treats Pearson's χ^2 as a GOF benchmark. By casting many test statistics as linear functionals of a random measure and allowing parameter estimation, the authors derive projection-based representations that reveal when tests gain or lose power under contiguous alternatives. A key finding is that, in sparse regimes, no single divisible statistic is adequate for GOF; instead, power is best achieved through ensembles of partial-sums tests and optimal weighting, with parameter estimation handled via projection and a projected bootstrap. The paper further provides practically applicable methods, including an omnibus KS-type test based on partial sums and a distribution-free variant via unitary transformations, and demonstrates these methods on Chandra X-ray background data, highlighting the practical impact for physics and astronomy analyses.
Abstract
Thousands of experiments are analyzed and papers are published each year involving the statistical analysis of grouped data. While this area of statistics is often perceived -- somewhat naively -- as saturated, several misconceptions still affect everyday practice, and new frontiers have so far remained unexplored. Researchers must be aware of the limitations affecting their analyses and what are the new possibilities in their hands. Motivated by this need, the article introduces a unifying approach to the analysis of grouped data, which allows us to study the class of divisible statistics -- that includes Pearson's $χ^2$, the likelihood ratio as special cases -- with a fresh perspective. The contributions collected in this manuscript span from modeling and estimation to distribution-free goodness-of-fit tests. Perhaps the most surprising result presented here is that, in a sparse regime, all tests proposed in the literature are dominated by members of the class of weighted linear statistics.
