Inference for location and height of peaks of a standardized field after selection
Alden Green, Jonathan Taylor
TL;DR
The paper develops a rigorous, post-selection framework for inferring the location and height of peaks in a smooth, standardized random field observed with Gaussian noise. It introduces a two-stage TG-test-based peak detection procedure and then constructs post-selection confidence regions for nearby true peaks, with both conditional and marginal coverage guarantees. Central to the theory is a second-order accurate local expansion of the peak intensity near true peaks via the Kac-Rice formula, which illuminates how selection biases affect height and location and enables precise pivots for inference. To address strong selection, the authors propose randomized peak detection and data carving, showing empirical improvements in coverage and interval lengths. The results yield controlled miscoverage rates (PCMR) and robust inference for peak height and localization, with extensive proofs and simulations validating the theoretical claims.
Abstract
Peak inference concerns the use of local maxima ("peaks") of a noisy random field to detect and localize regions where underlying signal is present. We propose a peak inference method that first subjects observed peaks to a significance test of the null hypothesis that no signal is present, and then uses the peaks that are declared significant to construct post-selectively valid confidence regions for the location and height of nearby true peaks. We analyze the performance of this method in a smooth signal plus constant variance noise model under a high-curvature asymptotic assumption, and prove that it asymptotically controls both the number of false discoveries, and the number of confidence regions that do not contain a true peak, relative to the number of points at which inference is conducted. An important intermediate theoretical result uses the Kac-Rice formula to derive a novel approximation to the intensity function of a point process that counts local maxima, which is second-order accurate under the alternative, nearby high-curvature true peaks.
