Fair Allocation of Bandwidth At Edge Servers For Concurrent Hierarchical Federated Learning
Md Anwar Hossen, Fatema Siddika, Wensheng Zhang
TL;DR
The paper tackles uplink bandwidth bottlenecks in concurrent three-tier Hierarchical Federated Learning by modeling the interaction between edge servers and multiple FL servers as a Stackelberg game. It develops both a distributed heuristic and a centralized algorithm to approximate a Nash Equilibrium, aiming for full utilization of edge uplink capacity $b_j$ and fairness across FL servers with funds $f_i$. Key contributions include the formal three-tier architecture, the Stackelberg formulation with allocations $x_{i,j}$ and prices $p_j$, and extensive MNIST-based evaluations showing that the proposed schemes achieve fair client allocation and competitive FL accuracy versus a centralized baseline, while outperforming a naive proportional baseline. The results demonstrate practical potential for scalable, privacy-preserving FL with multiple concurrent processes by leveraging edge mediation to balance efficiency and fairness under heterogeneity of client distributions, funds, and bandwidth needs.
Abstract
This paper explores concurrent FL processes within a three-tier system, with edge servers between edge devices and FL servers. A challenge in this setup is the limited bandwidth from edge devices to edge servers. Thus, allocating the bandwidth efficiently and fairly to support simultaneous FL processes becomes crucial. We propose a game-theoretic approach to model the bandwidth allocation problem and develop distributed and centralized heuristic schemes to find an approximate Nash Equilibrium of the game. We proposed the approach mentioned above using centralized and entirely distributed assumptions. Through rigorous analysis and experimentation, we demonstrate that our schemes efficiently and fairly assign the bandwidth to the FL processes for centralized and distributed solutions and outperform a baseline scheme where each edge server assigns bandwidth proportionally to the FL servers' requests that it receives. The proposed distributed and centralized schemes have comptetive performance.
