FedSUM Family: Efficient Federated Learning Methods under Arbitrary Client Participation
Runze You, Shi Pu
TL;DR
The paper tackles federated learning under arbitrary client participation by introducing two delay metrics, τ_max and τ_avg, to quantify participation variability. It proposes the FedSUM family (FedSUM-B, FedSUM, FedSUM-CR) which uses Stochastic Uplink-Merge to integrate updates from intermittently active clients and counteract data heterogeneity. The authors provide unified nonconvex convergence guarantees that degrade gracefully with participation delays and demonstrate that the methods match or exceed the efficiency of existing baselines, with FedSUM-CR offering further communication savings. Empirical results on standard benchmarks with varying participation patterns validate the theoretical findings and show robust performance across heterogeneous settings and tasks, including NLP with SST-2 and image classification.
Abstract
Federated Learning (FL) methods are often designed for specific client participation patterns, limiting their applicability in practical deployments. We introduce the FedSUM family of algorithms, which supports arbitrary client participation without additional assumptions on data heterogeneity. Our framework models participation variability with two delay metrics, the maximum delay $τ_{\max}$ and the average delay $τ_{\text{avg}}$. The FedSUM family comprises three variants: FedSUM-B (basic version), FedSUM (standard version), and FedSUM-CR (communication-reduced version). We provide unified convergence guarantees demonstrating the effectiveness of our approach across diverse participation patterns, thereby broadening the applicability of FL in real-world scenarios.
