Learning with a Budget: Identifying the Best Arm with Resource Constraints
Zitian Li, Wang Chi Cheung
TL;DR
The SH-RR algorithm is proposed, which integrates resource-aware allocation into the classical successive halving framework on best arm identification and unifies the theoretical analysis for both the stochastic and deterministic consumption settings.
Abstract
In many applications, evaluating the effectiveness of different alternatives comes with varying costs or resource usage. Motivated by such heterogeneity, we study the Best Arm Identification with Resource Constraints (BAIwRC) problem, where an agent seeks to identify the best alternative (aka arm) in the presence of resource constraints. Each arm pull consumes one or more types of limited resources. We make two key contributions. First, we propose the Successive Halving with Resource Rationing (SH-RR) algorithm, which integrates resource-aware allocation into the classical successive halving framework on best arm identification. The SH-RR algorithm unifies the theoretical analysis for both the stochastic and deterministic consumption settings, with a new \textit{effective consumption measure
