Active clustering with bandit feedback

Victor Thuot; Alexandra Carpentier; Christophe Giraud; Nicolas Verzelen

Active clustering with bandit feedback

Victor Thuot, Alexandra Carpentier, Christophe Giraud, Nicolas Verzelen

TL;DR

This work tackles ACP, where N arms with d-dimensional subGaussian means are partitioned into K hidden groups and the goal is δ-PAC exact recovery with minimal budget τ. The authors establish a non-asymptotic lower bound on the minimal budget that separates a dimension-free term and a high-dimensional term, and introduce ACB, a computationally efficient algorithm whose budget matches the lower bound in many high-dimensional regimes. ACB decomposes the task into Sequential Representatives Identification (SRI) and Active Distance-based Classification (ADC), with ACB achieving near-optimal budgets and no computation-information gap in the active setting. An adaptive variant, ACB^*, handles unknown Δ_* and θ_* via a multiscale search, and numerical experiments on high-dimensional synthetic data validate the theoretical gains and δ-PAC guarantees. Overall, the paper advances understanding of efficient active clustering in high dimensions and demonstrates practical gains over batch or uniform sampling approaches.

Abstract

We investigate the Active Clustering Problem (ACP). A learner interacts with an $N$-armed stochastic bandit with $d$-dimensional subGaussian feedback. There exists a hidden partition of the arms into $K$ groups, such that arms within the same group, share the same mean vector. The learner's task is to uncover this hidden partition with the smallest budget - i.e., the least number of observation - and with a probability of error smaller than a prescribed constant $δ$. In this paper, (i) we derive a non-asymptotic lower bound for the budget, and (ii) we introduce the computationally efficient ACB algorithm, whose budget matches the lower bound in most regimes. We improve on the performance of a uniform sampling strategy. Importantly, contrary to the batch setting, we establish that there is no computation-information gap in the active setting.

Active clustering with bandit feedback

TL;DR

Abstract

We investigate the Active Clustering Problem (ACP). A learner interacts with an

-armed stochastic bandit with

-dimensional subGaussian feedback. There exists a hidden partition of the arms into

groups, such that arms within the same group, share the same mean vector. The learner's task is to uncover this hidden partition with the smallest budget - i.e., the least number of observation - and with a probability of error smaller than a prescribed constant

. In this paper, (i) we derive a non-asymptotic lower bound for the budget, and (ii) we introduce the computationally efficient ACB algorithm, whose budget matches the lower bound in most regimes. We improve on the performance of a uniform sampling strategy. Importantly, contrary to the batch setting, we establish that there is no computation-information gap in the active setting.

Paper Structure (31 sections, 30 theorems, 214 equations, 3 algorithms)

This paper contains 31 sections, 30 theorems, 214 equations, 3 algorithms.

Introduction
Setting and notation
Lower bound on the budget
ACB and Upper bound on the budget
Warm-up: optimal active clustering with known $\Delta,\theta$
Main algorithm ACB$^*$
Numerical experiments
Discussion
Acknowledgements
Details on the numerical experiments
Proof of the Lower Bound
From active clustering to binary classification
Construction of a family of environments
Symmetrization
First Lower bound : proof of \ref{['lemma:LB1']}
...and 16 more sections

Key Result

Theorem 3.1

There exists a numerical constant $c>0$, such that we have for any $\sigma>0$, any $\Delta >0$, any $d\geq 1$, any $\theta>0$, any $\delta\in(0,1/12)$, and any $N\geqslant 2K \geq 4$ such that $\mathcal{E}(\Delta,\theta,\sigma,N,K,d) \neq \emptyset$

Theorems & Definitions (64)

Remark 2.2
Theorem 3.1
proof : Sketch of proof of Theorem \ref{['thm:LB']}
Theorem 4.1
Lemma B.1
Lemma B.2
Remark B.3
proof : Proof of \ref{['thm:LB']}
Lemma B.4
Remark B.5
...and 54 more

Active clustering with bandit feedback

TL;DR

Abstract

Active clustering with bandit feedback

Authors

TL;DR

Abstract

Table of Contents

Key Result

Theorems & Definitions (64)