On Focusing Statistical Power for Searches and Measurements in Particle Physics
James Carzon, Aishik Ghosh, Rafael Izbicki, Ann Lee, Luca Masserano, Daniel Whiteson
TL;DR
The paper tackles the non-optimal power allocation of the generalized likelihood ratio test (LRT) for composite hypotheses in particle physics. It introduces a focused test statistic (FTS), $T_f(D; \mu_0) = -2 \log\left( p(D|\mu_0) / \int p(D|\mu) f(\mu) d\mu \right)$, where the Gaussian focus function $f(\mu)$ concentrates power in physics-motivated regions while preserving valid confidence intervals via a Neyman construction. Confidence intervals are built efficiently using ML-enhanced quantile regression to estimate critical values, enabling fast, nonasymptotic interval construction even in small-sample or high-dimensional settings. The authors demonstrate substantial gains in two case studies—a Higgs boson coupling measurement and a LZ-inspired WIMP search—achieving median CI length reductions of roughly 13–21% (Higgs) and 22–35% (WIMPs) at common confidence levels. The approach yields tighter bounds in no-signal scenarios and maintains gains when a signal is present, offering a practical, drop-in improvement for a wide range of collider, neutrino, and dark-mmatter analyses, including high-dimensional or unbinned cases.
Abstract
Particle physics experiments rely on the (generalised) likelihood ratio test (LRT) for searches and measurements, which consist of composite hypothesis tests. However, this test is not guaranteed to be optimal, as the Neyman-Pearson lemma pertains only to simple hypothesis tests. Any choice of test statistic thus implicitly determines how statistical power varies across the parameter space. An improvement in the core statistical testing methodology for general settings with composite tests would have widespread ramifications across experiments. We discuss an alternate test statistic that provides the data analyzer an ability to focus the power of the test on physics-motivated regions of the parameter space. We demonstrate the improvement from this technique compared to the LRT on a Higgs $\rightarrowττ$ dataset simulated by the ATLAS experiment and a dark matter dataset inspired by the LZ experiment. We also employ machine learning to efficiently perform the Neyman construction, which is essential to ensure statistically valid confidence intervals.
