Convergence Analysis of Sequential Federated Learning on Heterogeneous Data
Yipeng Li, Xinchen Lyu
TL;DR
This work analyzes the convergence of sequential federated learning (SFL) on heterogeneous data, establishing guarantees for strongly convex, general convex, and non-convex objectives. By introducing an effective learning rate $\tilde{\eta}=\eta MK$ and carefully bounding stochasticity and heterogeneity, it derives upper bounds on $\mathbb{E}[F(\bar{x}^{(R)})-F(x^*)]$ that reveal when SFL outperforms parallel FL (PFL) under data non-i.i.d. settings. The results are complemented by experiments on quadratic functions and real datasets with Extended Dirichlet-based heterogeneity, validating the counterintuitive finding that SFL can surpass PFL in extremely heterogeneous cross-device scenarios. The findings have practical implications for choosing sequential versus parallel training in FL and tuning the local update count $K$ to balance optimization progress and heterogeneity error terms.
Abstract
There are two categories of methods in Federated Learning (FL) for joint training across multiple clients: i) parallel FL (PFL), where clients train models in a parallel manner; and ii) sequential FL (SFL), where clients train models in a sequential manner. In contrast to that of PFL, the convergence theory of SFL on heterogeneous data is still lacking. In this paper, we establish the convergence guarantees of SFL for strongly/general/non-convex objectives on heterogeneous data. The convergence guarantees of SFL are better than that of PFL on heterogeneous data with both full and partial client participation. Experimental results validate the counterintuitive analysis result that SFL outperforms PFL on extremely heterogeneous data in cross-device settings.
