On the Convergence of a Federated Expectation-Maximization Algorithm
Zhixu Tao, Rajita Chandak, Sanjeev Kulkarni
TL;DR
The paper analyzes the convergence of a federated EM algorithm for a Federated Mixture of $K$ Linear Regressions (FMLR) under data heterogeneity across clients. It develops population- and empirical-EM theory that holds across all regimes of the number of clients $m$ and per-client samples $n$, establishing a SNR threshold $Ω(√K)$ for uniform contraction and showing that, in many regimes, the EM algorithm converges in a constant number of iterations as $m$ grows relative to $n$. It also proves minimax-optimality in the SNR scaling and reveals that larger cluster separation $Δ_{\max}$ does not universally improve convergence. Complemented by synthetic experiments, the results demonstrate that heterogeneity can even accelerate convergence of iterative federated algorithms, offering practical guidance for FL deployments with heterogeneous data.
Abstract
Data heterogeneity has been a long-standing bottleneck in studying the convergence rates of Federated Learning algorithms. In order to better understand the issue of data heterogeneity, we study the convergence rate of the Expectation-Maximization (EM) algorithm for the Federated Mixture of $K$ Linear Regressions model (FMLR). We completely characterize the convergence rate of the EM algorithm under all regimes of $m/n$ where $m$ is the number of clients and $n$ is the number of data points per client. We show that with a signal-to-noise-ratio (SNR) of order $Ω(\sqrt{K})$, the well-initialized EM algorithm converges within the minimax distance of the ground truth under all regimes. Interestingly, we identify that when the number of clients grows reasonably with respect to the number of data points per client, the EM algorithm only requires a constant number of iterations to converge. We perform experiments on synthetic data to illustrate our results. In line with our theoretical findings, the simulations show that rather than being a bottleneck, data heterogeneity can accelerate the convergence of iterative federated algorithms.
