Fair Best Arm Identification with Fixed Confidence
Alessio Russo, Filippo Vannella
TL;DR
This work defines Fair Best Arm Identification (F-BAI), a PAC-style BAI problem augmented with per-arm fairness constraints. It derives an instance-specific lower bound on sample complexity and introduces the price of fairness, quantifying the extra samples required to satisfy fairness. The authors propose F-TaS, a Track-and-Stop-like algorithm that asymptotically matches the lower bound while guaranteeing fairness, with both model-agnostic and model-dependent fairness variants. The approach is validated through synthetic experiments and a wireless scheduling application, showing efficient sample usage and controlled fairness violations. Overall, the paper bridges fairness considerations and pure-exploration bandits, offering practical guarantees and insights for fair resource allocation problems.
Abstract
In this work, we present a novel framework for Best Arm Identification (BAI) under fairness constraints, a setting that we refer to as \textit{F-BAI} (fair BAI). Unlike traditional BAI, which solely focuses on identifying the optimal arm with minimal sample complexity, F-BAI also includes a set of fairness constraints. These constraints impose a lower limit on the selection rate of each arm and can be either model-agnostic or model-dependent. For this setting, we establish an instance-specific sample complexity lower bound and analyze the \textit{price of fairness}, quantifying how fairness impacts sample complexity. Based on the sample complexity lower bound, we propose F-TaS, an algorithm provably matching the sample complexity lower bound, while ensuring that the fairness constraints are satisfied. Numerical results, conducted using both a synthetic model and a practical wireless scheduling application, show the efficiency of F-TaS in minimizing the sample complexity while achieving low fairness violations.
