Post hoc inference via joint family-wise error rate control
Gilles Blanchard, Pierre Neuvial, Etienne Roquain
TL;DR
The paper tackles post hoc inference in large-scale multiple testing by introducing a user-agnostic bound based on a joint-family-wise error rate (JER), ensuring uniform control for any user-selected rejection set. It develops a general framework using a reference family of rejection sets, threshold templates, and $\lambda$-calibration to adapt to unknown dependence and signal sparsity, with explicit constructions under known and unknown dependence. Two template families (linear and balanced) are analyzed, including single-step and step-down procedures, and the approach connects to the Simes/Hommel inequalities as a baseline while enabling adaptive improvements. Numerical experiments demonstrate controlled JER and improved power, highlighting practical guidance on template choice, calibration, and the benefits of user-agnostic inference for exploratory analyses.
Abstract
We introduce a general methodology for post hoc inference in a large-scale multiple testing framework. The approach is called "user-agnostic" in the sense that the statistical guarantee on the number of correct rejections holds for any set of candidate items selected by the user (after having seen the data). This task is investigated by defining a suitable criterion, named the joint-family-wise-error rate (JER for short). We propose several procedures for controlling the JER, with a special focus on incorporating dependencies while adapting to the unknown quantity of signal (via a step-down approach). We show that our proposed setting incorporates as particular cases a version of the higher criticism as well as the closed testing based approach of Goeman and Solari (2011). Our theoretical statements are supported by numerical experiments.
