Table of Contents
Fetching ...

Risk level dependent Minimax Quantile lower bounds for Interactive Statistical Decision Making

Raghav Bongole, Amirreza Zamani, Tobias J. Oechtering, Mikael Skoglund

TL;DR

This work addresses tail risk in interactive statistical decision making by developing a delta-explicit minimax-quantile framework that complements traditional minimax risk. It introduces high-probability Fano and Le Cam tools tailored to ISDM and establishes a quantile-to-expectation conversion, along with a tight link between strict and lower minimax quantiles. The authors show that, for essentially all confidence levels, minimax and lower minimax quantiles coincide, enabling uniform, delta-explicit tail guarantees that translate into expectation bounds. They demonstrate the practicality of the approach by instantiating it on a two-armed Gaussian bandit, where the delta-explicit bound scales as $\sqrt{T\log(1/\delta)}$, matching optimal rates and showcasing the framework’s potential for broader interactive settings, including RL.

Abstract

Minimax risk and regret focus on expectation, missing rare failures critical in safety-critical bandits and reinforcement learning. Minimax quantiles capture these tails. Three strands of prior work motivate this study: minimax-quantile bounds restricted to non-interactive estimation; unified interactive analyses that focus on expected risk rather than risk level specific quantile bounds; and high-probability bandit bounds that still lack a quantile-specific toolkit for general interactive protocols. To close this gap, within the interactive statistical decision making framework, we develop high-probability Fano and Le Cam tools and derive risk level explicit minimax-quantile bounds, including a quantile-to-expectation conversion and a tight link between strict and lower minimax quantiles. Instantiating these results for the two-armed Gaussian bandit immediately recovers optimal-rate bounds.

Risk level dependent Minimax Quantile lower bounds for Interactive Statistical Decision Making

TL;DR

This work addresses tail risk in interactive statistical decision making by developing a delta-explicit minimax-quantile framework that complements traditional minimax risk. It introduces high-probability Fano and Le Cam tools tailored to ISDM and establishes a quantile-to-expectation conversion, along with a tight link between strict and lower minimax quantiles. The authors show that, for essentially all confidence levels, minimax and lower minimax quantiles coincide, enabling uniform, delta-explicit tail guarantees that translate into expectation bounds. They demonstrate the practicality of the approach by instantiating it on a two-armed Gaussian bandit, where the delta-explicit bound scales as , matching optimal rates and showcasing the framework’s potential for broader interactive settings, including RL.

Abstract

Minimax risk and regret focus on expectation, missing rare failures critical in safety-critical bandits and reinforcement learning. Minimax quantiles capture these tails. Three strands of prior work motivate this study: minimax-quantile bounds restricted to non-interactive estimation; unified interactive analyses that focus on expected risk rather than risk level specific quantile bounds; and high-probability bandit bounds that still lack a quantile-specific toolkit for general interactive protocols. To close this gap, within the interactive statistical decision making framework, we develop high-probability Fano and Le Cam tools and derive risk level explicit minimax-quantile bounds, including a quantile-to-expectation conversion and a tight link between strict and lower minimax quantiles. Instantiating these results for the two-armed Gaussian bandit immediately recovers optimal-rate bounds.

Paper Structure

This paper contains 9 sections, 6 theorems, 9 equations.

Key Result

Theorem 1

Let $\mathfrak{M}(\delta)$ be the minimax quantile and let $\mathfrak{M}$ be the minimax risk; then for every $\delta\in(0,1]$, we have This shows that a $(1-\delta)$–quantile lower bound immediately implies a $\delta$–scaled expectation lower bound.

Theorems & Definitions (19)

  • Definition 1: Minimax Risk
  • Definition 2: Quantile
  • Definition 3: Minimax Quantile
  • Definition 4: Lower Minimax Quantile
  • Theorem 1: Quantile-to-expectation in ISDM
  • proof : Proof
  • Theorem 2: Lower minimax quantile relation in ISDM
  • proof
  • Theorem 3: High-probability interactive Fano $\Rightarrow$ lower minimax quantile
  • proof : Proof sketch
  • ...and 9 more