Asynchronous Federated Optimization
Cong Xie, Sanmi Koyejo, Indranil Gupta
TL;DR
This work tackles scalability and straggler issues in federated learning by introducing FedAsync, an asynchronous optimization framework that solves regularized local objectives and updates the global model via adaptive mixing to mitigate staleness. The authors prove convergence for a restricted class of non-convex problems under standard smoothness and delay assumptions, and demonstrate through experiments on CIFAR-10 and WikiText-2 that FedAsync achieves fast convergence and robustness to stale updates, often outperforming synchronous FedAvg. The combination of a regularized local objective, adaptive mixing, and asynchronous server–worker communication offers practical gains for large-scale, non-IID federated setups, with future work focusing on refining adaptive strategies. These insights are relevant for developers and researchers seeking scalable, robust federated optimization algorithms capable of handling heterogeneous devices and network conditions.
Abstract
Federated learning enables training on a massive number of edge devices. To improve flexibility and scalability, we propose a new asynchronous federated optimization algorithm. We prove that the proposed approach has near-linear convergence to a global optimum, for both strongly convex and a restricted family of non-convex problems. Empirical results show that the proposed algorithm converges quickly and tolerates staleness in various applications.
