On hypothesis testing, trials factor, hypertests and the BumpHunter
Georgios Choudalakis
TL;DR
The paper tackles the look-elsewhere effect in hypothesis testing by formalizing hypothesis hypertests and introducing BumpHunter, a practical method that searches for local excesses (bumps) across a spectrum with varying window sizes. It defines a robust p-value framework where the global test statistic is $t = -\log(p\text{-value}^{\min})$, ensuring that the resulting Type I error is properly controlled despite the multiple testing across locations and widths. Through the Banff Challenge and related sensitivity studies, it demonstrates how BumpHunter can detect bumps without assuming a specific signal shape or position, while also comparing its performance to targeted likelihood-based tests and exploring generalizations like TailHunter and multi-spectrum aggregation. The work highlights the balance between broad, model-independent searches and the associated efficiency cost from the trials factor, offering practical guidance for implementing hypertests in high energy physics analyses.
Abstract
A detailed presentation of hypothesis testing is given. The "look elsewhere" effect is illustrated, and a treatment of the trials factor is proposed with the introduction of hypothesis hypertests. An example of such a hypertest is presented, named BumpHunter, which is used in the recent ATLAS dijet resonance search, and in an earlier version in the CDF Global Search, to look for exotic phenomena in high energy physics. As a demonstration, the BumpHunter is used to address Problem 1 of the Banff Challenge.
