Exact and Linear Convergence for Federated Learning under Arbitrary Client Participation is Attainable
Bicheng Ying, Zhe Li, Haibo Yang
TL;DR
This paper addresses the fundamental FL challenge of exact convergence under arbitrary client participation and data heterogeneity. It introduces a stochastic-matrix and time-varying-graph framework to model participation and local updates, and reformulates FL as a constrained optimization solved by a push-pull strategy (FOCUS). The authors prove that FOCUS achieves exact convergence with a linear rate for both strongly convex and PL-condition nonconvex cases, without decaying the learning rate, and extend the framework to SG-FOCUS for stochastic gradients. They also provide an interpretation of FedAvg within this decentralized perspective and demonstrate the practical viability through theoretical rates and supporting experiments. The work establishes a principled connection between FL and decentralized optimization, offering a scalable path to exact convergence under arbitrary participation patterns.
Abstract
This work tackles the fundamental challenges in Federated Learning (FL) posed by arbitrary client participation and data heterogeneity, prevalent characteristics in practical FL settings. It is well-established that popular FedAvg-style algorithms struggle with exact convergence and can suffer from slow convergence rates since a decaying learning rate is required to mitigate these scenarios. To address these issues, we introduce the concept of stochastic matrix and the corresponding time-varying graphs as a novel modeling tool to accurately capture the dynamics of arbitrary client participation and the local update procedure. Leveraging this approach, we offer a fresh decentralized perspective on designing FL algorithms and present FOCUS, Federated Optimization with Exact Convergence via Push-pull Strategy, a provably convergent algorithm designed to effectively overcome the previously mentioned two challenges. More specifically, we provide a rigorous proof demonstrating that FOCUS achieves exact convergence with a linear rate regardless of the arbitrary client participation, establishing it as the first work to demonstrate this significant result.
