FedRTS: Federated Robust Pruning via Combinatorial Thompson Sampling
Hong Huang, Hai Yang, Yuan Chen, Jiaxun Ye, Dapeng Wu
TL;DR
FedRTS addresses the challenges of federated pruning in resource-constrained, non-IID settings by reframing topology adjustment as a combinatorial multi-armed bandit and introducing a Thompson Sampling-based Adjustment (TSAdj) mechanism. TSAdj maintains per-link posterior distributions, samples probabilistic actions, and fuses global and client-specific information to stabilize sparse topologies while reducing communication to top-gradient indices. The framework includes a two-loop training process with outer-loop pruning guided by TSAdj and a theoretical regret bound, and it demonstrates state-of-the-art accuracy and lower communication costs on CV and NLP tasks under heterogeneous data and partial client participation. The work provides both practical performance gains and theoretical insights, with broad implications for efficient, robust federated pruning in real-world deployments.
Abstract
Federated Learning (FL) enables collaborative model training across distributed clients without data sharing, but its high computational and communication demands strain resource-constrained devices. While existing methods use dynamic pruning to improve efficiency by periodically adjusting sparse model topologies while maintaining sparsity, these approaches suffer from issues such as greedy adjustments, unstable topologies, and communication inefficiency, resulting in less robust models and suboptimal performance under data heterogeneity and partial client availability. To address these challenges, we propose Federated Robust pruning via combinatorial Thompson Sampling (FedRTS), a novel framework designed to develop robust sparse models. FedRTS enhances robustness and performance through its Thompson Sampling-based Adjustment (TSAdj) mechanism, which uses probabilistic decisions informed by stable, farsighted information instead of deterministic decisions reliant on unstable and myopic information in previous methods. Extensive experiments demonstrate that FedRTS achieves state-of-the-art performance in computer vision and natural language processing tasks while reducing communication costs, particularly excelling in scenarios with heterogeneous data distributions and partial client participation. Our codes are available at: https://github.com/Little0o0/FedRTS
