Table of Contents
Fetching ...

A fast algorithm to compute a curve of confidence upper bounds for the False Discovery Proportion using a reference family with a forest structure

Guillermo Durand

TL;DR

The paper tackles the challenge of computing a full curve of post hoc FDP bounds $V^*_{\mathfrak{R}}(S_t)$ along a nested sequence of hypothesis sets, by exploiting forest-structured reference families. It introduces pruning of the forest and, most importantly, a fast $O(m|\mathcal{K}|)$ curve-computation algorithm that maintains per-region counters and a partition to update the curve efficiently; the core identity $V^*_{\mathfrak{R}}(S_t)=\sum_{k\in\mathcal{P}^t} \zeta_k \wedge |S_t\cap R_k|$ enables linear-time progression along the path. The authors provide a rigorous proof framework for the curve updates, implement the methods in the RR-base package, and demonstrate substantial speedups via numerical experiments across large-scale scenarios. This work significantly improves the practicality of extensive FDP-bound exploration in high-dimensional multiple testing, enabling exact curve computation and broader empirical study of post hoc inference strategies.

Abstract

This paper presents a new algorithm (and an additional trick) that allows to compute fastly an entire curve of post hoc bounds for the False Discovery Proportion when the underlying bound $V^*_{\mathfrak{R}}$ construction is based on a reference family $\mathfrak{R}$ with a forest structure {à} la Durand et al. (2020). By an entire curve, we mean the values $V^*_{\mathfrak{R}}(S_1),\dotsc,V^*_{\mathfrak{R}}(S_m)$ computed on a path of increasing selection sets $S_1\subsetneq\dotsb\subsetneq S_m$, $|S_t|=t$. The new algorithm leverages the fact that going from $S_t$ to $S_{t+1}$ is done by adding only one hypothesis.

A fast algorithm to compute a curve of confidence upper bounds for the False Discovery Proportion using a reference family with a forest structure

TL;DR

The paper tackles the challenge of computing a full curve of post hoc FDP bounds along a nested sequence of hypothesis sets, by exploiting forest-structured reference families. It introduces pruning of the forest and, most importantly, a fast curve-computation algorithm that maintains per-region counters and a partition to update the curve efficiently; the core identity enables linear-time progression along the path. The authors provide a rigorous proof framework for the curve updates, implement the methods in the RR-base package, and demonstrate substantial speedups via numerical experiments across large-scale scenarios. This work significantly improves the practicality of extensive FDP-bound exploration in high-dimensional multiple testing, enabling exact curve computation and broader empirical study of post hoc inference strategies.

Abstract

This paper presents a new algorithm (and an additional trick) that allows to compute fastly an entire curve of post hoc bounds for the False Discovery Proportion when the underlying bound construction is based on a reference family with a forest structure {à} la Durand et al. (2020). By an entire curve, we mean the values computed on a path of increasing selection sets , . The new algorithm leverages the fact that going from to is done by adding only one hypothesis.

Paper Structure

This paper contains 20 sections, 4 theorems, 51 equations, 11 figures, 4 algorithms.

Key Result

Proposition 3.1

For any $S\subseteq \mathbb{N}_m^*$, $V^*_{\mathfrak{R}}(S)=V^*_{\mathfrak{R}^{\mathfrak{pr}}}(S)$.

Figures (11)

  • Figure 1: The regions of Example \ref{['exm-toy-forest']}.
  • Figure 2: The regions of Example \ref{['exm-toy-forest']} but with the labels of Example \ref{['exm-toy-leaves']}.
  • Figure 3: The regions of Example \ref{['exm-toy-forest']} with the $\zeta_k$ values.
  • Figure 4: The regions of Example \ref{['exm-toy-forest']} after pruning.
  • Figure 5: The regions of Example \ref{['exm-toy-forest']} at $t=0$ in Algorithm \ref{['alg-formal-curve']} .
  • ...and 6 more figures

Theorems & Definitions (14)

  • Example 2.1: Gaussian one-sided
  • Example 2.2
  • Example 2.3: Continuation of Example \ref{['exm-toy-forest']}
  • Remark 2.1
  • Definition 3.1: Pruning
  • Proposition 3.1
  • proof
  • Proposition 3.2
  • proof
  • Example 3.1: Continuation of Example \ref{['exm-toy-forest']} and Example \ref{['exm-toy-leaves']}
  • ...and 4 more