Harnessing the Power of Federated Learning in Federated Contextual Bandits
Chengshuai Shi, Ruida Zhou, Kun Yang, Cong Shen
TL;DR
The paper addresses the gap between canonical Federated Learning (FL) and Federated Contextual Bandits (FCB) by proposing FedIGW, an exploring design that couples inverse gap weighting (IGW) CB with flexible FL protocols. FedIGW operates in epochs where the FL routine learns a reward model from distributed interaction data, which then informs the IGW CB step for the next epoch; this decouples FL and CB while allowing plug-in of any FL protocol (e.g., FedAvg, SCAFFOLD) and FL appendages like personalization, robustness, and privacy. Theoretical guarantees express the global regret Reg(T) in terms of FL excess risk, enabling seamless incorporation of advances in FL convergence analyses; concrete corollaries cover finite and linear reward function classes and generalize to non-linear settings. Empirical results on Bibtex and Delicious demonstrate that FedIGW with various FL backbones outperforms baselines such as FN-UCB, validating its flexibility and practical impact. Overall, FedIGW provides a principled, modular bridge that unlocks the broader FL literature for FCB, with meaningful implications for personalized, private, and robust federated sequential decision-making.
Abstract
Federated learning (FL) has demonstrated great potential in revolutionizing distributed machine learning, and tremendous efforts have been made to extend it beyond the original focus on supervised learning. Among many directions, federated contextual bandits (FCB), a pivotal integration of FL and sequential decision-making, has garnered significant attention in recent years. Despite substantial progress, existing FCB approaches have largely employed their tailored FL components, often deviating from the canonical FL framework. Consequently, even renowned algorithms like FedAvg remain under-utilized in FCB, let alone other FL advancements. Motivated by this disconnection, this work takes one step towards building a tighter relationship between the canonical FL study and the investigations on FCB. In particular, a novel FCB design, termed FedIGW, is proposed to leverage a regression-based CB algorithm, i.e., inverse gap weighting. Compared with existing FCB approaches, the proposed FedIGW design can better harness the entire spectrum of FL innovations, which is concretely reflected as (1) flexible incorporation of (both existing and forthcoming) FL protocols; (2) modularized plug-in of FL analyses in performance guarantees; (3) seamless integration of FL appendages (such as personalization, robustness, and privacy). We substantiate these claims through rigorous theoretical analyses and empirical evaluations.
