Federated Linear Contextual Bandits with Heterogeneous Clients

Ethan Blaser; Chuanhao Li; Hongning Wang

Federated Linear Contextual Bandits with Heterogeneous Clients

Ethan Blaser, Chuanhao Li, Hongning Wang

TL;DR

This work extends federated bandit learning to heterogeneous client populations by clustering clients and performing cluster-wise collaborative learning under a standard FL single-model broadcast. The HetoFedBandit framework comprises a pure-exploration phase for clustering, followed by an optimistic learning phase that leverages cross-client information within identified clusters, all coordinated via a FIFO cluster-level communication protocol. The authors establish clustering correctness with high probability, derive confidence ellipsoids, and bound both regret and communication cost, while also proposing empirical enhancements (data-dependent clustering and a priority queue) that improve performance in practice. Experiments on synthetic and LastFM datasets demonstrate that HetoFedBandit and its enhanced variant achieve sub-linear regret and lower communication costs compared to strong baselines, highlighting the approach's practical potential for private, distributed bandit learning with heterogeneity.

Abstract

The demand for collaborative and private bandit learning across multiple agents is surging due to the growing quantity of data generated from distributed systems. Federated bandit learning has emerged as a promising framework for private, efficient, and decentralized online learning. However, almost all previous works rely on strong assumptions of client homogeneity, i.e., all participating clients shall share the same bandit model; otherwise, they all would suffer linear regret. This greatly restricts the application of federated bandit learning in practice. In this work, we introduce a new approach for federated bandits for heterogeneous clients, which clusters clients for collaborative bandit learning under the federated learning setting. Our proposed algorithm achieves non-trivial sub-linear regret and communication cost for all clients, subject to the communication protocol under federated learning that at anytime only one model can be shared by the server.

Federated Linear Contextual Bandits with Heterogeneous Clients

TL;DR

Abstract

Paper Structure (34 sections, 15 theorems, 58 equations, 4 figures, 1 table, 5 algorithms)

This paper contains 34 sections, 15 theorems, 58 equations, 4 figures, 1 table, 5 algorithms.

Introduction
Related Work
Methodology
Problem Setting
Algorithm: HetoFedBandit
Pure Exploration Phase
Cluster Estimation
Optimistic Learning Phase
Communication Protocol
Theoretical Results
Empirical Enhancements: HetoFedBandit-E
Data-Dependent Clustering
Priority Queue
Experiments
Baselines
...and 19 more sections

Key Result

Theorem 1

Under the condition that we set the homogeneity test threshold $\upsilon^c\geq F^{-1}(1-\frac{\delta}{N^2}, df, \psi^c)$, with probability at least $1-\delta$, we have $\hat{\mathcal{C}} = \mathcal{C}$.

Figures (4)

Figure 1: Execution of HetoFedBandit after pure exploration phase $T_0$. Each client $i \in N$ is represented by a node in the client graph $\mathcal{G}$ on the left hand side. Edges between clients indicate potential collaborators, as defined by the homogeneity test. The colored ellipsoids represent the estimated clusters $\hat{\mathcal{C}} = \{\hat{C}_1,...,\hat{C}_5\}$, which are the maximal cliques of $\mathcal{G}$. Clients exceeding their communication threshold are highlighted in orange. Currently, client $\theta_6$ has exceeded the communication threshold $D_1$ for cluster $\hat{C}_1$, which causes cluster $\hat{C}_1$ to be added to the queue. The server pops cluster $\hat{C}_2$ from the queue and facilitates collaboration among $\{\theta_1, \theta_3, \theta_7\}$. In the next timestep, the server will serve cluster $\hat{C}_4$, queued for removal.
Figure 2: Experimental Results on Simulated Dataset
Figure 3: Experimental Results on LastFM Dataset
Figure 4: Experimental Results on Imbalanced Synthetic Dataset

Theorems & Definitions (16)

Theorem 1: Clustering Correctness
Lemma 2: Confidence Ellipsoids
Theorem 3: Regret and Communication Cost
Remark 4
Lemma 5: Lemma 11 in abbasi2011improved
Lemma 6: Theorem 1 of abbasi2011improved
Lemma 7: Determinant-Trace Inequality
Lemma 8: Lemma 12 from lidyclu
Lemma 9: Lemma B1 from Liasynch
Lemma 10
...and 6 more

Federated Linear Contextual Bandits with Heterogeneous Clients

TL;DR

Abstract

Federated Linear Contextual Bandits with Heterogeneous Clients

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (4)

Theorems & Definitions (16)