Table of Contents
Fetching ...

Mode Estimation with Partial Feedback

Charles Arnal, Vivien Cabannes, Vianney Perchet

TL;DR

This work tackles mode estimation under partial feedback, formulating a minimal-query framework that blends weak supervision with active information gathering. It develops an empirical mode estimator and proves tight information-theoretic bounds, then introduces a spectrum of coding- and search-based algorithms that progressively exploit entropy coding and coarse statistics. The culminating Set Elimination algorithm unifies truncated search and bandit-inspired elimination to achieve near-optimal query complexity, with practical implications for scalable weakly supervised online learning. A public codebase accompanies the results, enabling practitioners to apply these strategies to real-world partial-feedback problems.

Abstract

The combination of lightly supervised pre-training and online fine-tuning has played a key role in recent AI developments. These new learning pipelines call for new theoretical frameworks. In this paper, we formalize core aspects of weakly supervised and active learning with a simple problem: the estimation of the mode of a distribution using partial feedback. We show how entropy coding allows for optimal information acquisition from partial feedback, develop coarse sufficient statistics for mode identification, and adapt bandit algorithms to our new setting. Finally, we combine those contributions into a statistically and computationally efficient solution to our problem.

Mode Estimation with Partial Feedback

TL;DR

This work tackles mode estimation under partial feedback, formulating a minimal-query framework that blends weak supervision with active information gathering. It develops an empirical mode estimator and proves tight information-theoretic bounds, then introduces a spectrum of coding- and search-based algorithms that progressively exploit entropy coding and coarse statistics. The culminating Set Elimination algorithm unifies truncated search and bandit-inspired elimination to achieve near-optimal query complexity, with practical implications for scalable weakly supervised online learning. A public codebase accompanies the results, enabling practitioners to apply these strategies to real-world partial-feedback problems.

Abstract

The combination of lightly supervised pre-training and online fine-tuning has played a key role in recent AI developments. These new learning pipelines call for new theoretical frameworks. In this paper, we formalize core aspects of weakly supervised and active learning with a simple problem: the estimation of the mode of a distribution using partial feedback. We show how entropy coding allows for optimal information acquisition from partial feedback, develop coarse sufficient statistics for mode identification, and adapt bandit algorithms to our new setting. Finally, we combine those contributions into a statistically and computationally efficient solution to our problem.
Paper Structure (27 sections, 43 theorems, 222 equations, 1 table, 9 algorithms)

This paper contains 27 sections, 43 theorems, 222 equations, 1 table, 9 algorithms.

Key Result

Theorem 1

Let $(Y_j)_{j\in[n]}$ be $n$ independent variables sampled according to $p\in\prob{\cY}$. Then the probability of error of eq:mode-emp satisfies where $\Delta_2$ is defined in Eq. eq:delta-*, and $c_p$ is some constant that depends on $p$.

Theorems & Definitions (72)

  • Theorem 1: Empirical mode performance
  • Corollary 2: Minimax lower bound
  • Lemma 1
  • Definition 1: Binary Tree etc.
  • Definition 2: Vertex code
  • Definition 3: Node ordering
  • Definition 4: Code ordering
  • Proposition 2: Vitter1987
  • Definition 5
  • Lemma 3
  • ...and 62 more