On the Convergence of a Federated Expectation-Maximization Algorithm

Zhixu Tao; Rajita Chandak; Sanjeev Kulkarni

TL;DR

The paper analyzes the convergence of a federated EM algorithm for a Federated Mixture of $K$ Linear Regressions (FMLR) under data heterogeneity across clients. It develops population- and empirical-EM theory that holds across all regimes of the number of clients $m$ and per-client samples $n$, establishing an SNR threshold of order $\Omega(\sqrt{K})$ for uniform contraction and showing that, in many regimes, the EM algorithm converges in a constant number of iterations as $m$ grows relative to $n$. It also proves minimax optimality in the SNR scaling and reveals that a larger cluster separation $\Delta_{\max}$ does not universally improve convergence. Complemented by synthetic experiments, the results demonstrate that heterogeneity can even accelerate the convergence of iterative federated algorithms, offering practical guidance for FL deployments with heterogeneous data.

Abstract

Data heterogeneity has been a long-standing bottleneck in studying the convergence rates of Federated Learning algorithms. In order to better understand the issue of data heterogeneity, we study the convergence rate of the Expectation-Maximization (EM) algorithm for the Federated Mixture of $K$ Linear Regressions model (FMLR). We completely characterize the convergence rate of the EM algorithm under all regimes of $m/n$ where $m$ is the number of clients and $n$ is the number of data points per client. We show that with a signal-to-noise ratio (SNR) of order $\Omega(\sqrt{K})$, the well-initialized EM algorithm converges within the minimax distance of the ground truth under all regimes. Interestingly, we identify that when the number of clients grows reasonably with respect to the number of data points per client, the EM algorithm only requires a constant number of iterations to converge. We perform experiments on synthetic data to illustrate our results. In line with our theoretical findings, the simulations show that rather than being a bottleneck, data heterogeneity can accelerate the convergence of iterative federated algorithms.
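To make the setting concrete, the following is a minimal sketch of EM for a mixture of $K$ linear regressions in which all $n$ points on a client share one latent cluster. It is an illustration under stated assumptions, not the paper's algorithm: the abstract does not spell out the update rules, so the Gaussian covariates, uniform cluster prior, known noise level `sigma`, client-level softmax responsibilities, weighted least-squares M-step, and every function name below (`simulate_fmlr`, `em_step`, `run_em`) are hypothetical choices made here.

```python
import numpy as np


def simulate_fmlr(m, n, d, K, sigma, rng):
    """Draw m clients with n points each; a client's points share one of K vectors.

    Assumed generative model (not taken from the paper): Gaussian covariates,
    uniformly drawn client clusters, additive Gaussian noise.
    """
    theta_star = 3.0 * rng.normal(size=(K, d))   # hypothetical ground-truth vectors
    z = rng.integers(K, size=m)                  # latent cluster label per client
    X = rng.normal(size=(m, n, d))               # covariates, shape (m, n, d)
    noise = sigma * rng.normal(size=(m, n))
    y = np.einsum("mnd,md->mn", X, theta_star[z]) + noise
    return X, y, theta_star, z


def em_step(X, y, theta, sigma):
    """One EM iteration with one responsibility vector per client."""
    K, d = theta.shape
    # E-step: residual sum of squares of every client under every candidate vector.
    preds = np.einsum("mnd,kd->mkn", X, theta)            # (m, K, n)
    rss = ((y[:, None, :] - preds) ** 2).sum(axis=2)      # (m, K)
    logits = -rss / (2.0 * sigma**2)
    logits -= logits.max(axis=1, keepdims=True)           # stabilise the softmax
    w = np.exp(logits)
    w /= w.sum(axis=1, keepdims=True)                     # rows sum to 1
    # M-step: responsibility-weighted least squares for each cluster.
    new_theta = np.empty_like(theta)
    for k in range(K):
        A = np.einsum("m,mnd,mne->de", w[:, k], X, X)
        b = np.einsum("m,mnd,mn->d", w[:, k], X, y)
        # Tiny ridge term is a numerical safeguard added for this sketch only.
        new_theta[k] = np.linalg.solve(A + 1e-8 * np.eye(d), b)
    return new_theta, w


def run_em(X, y, theta0, sigma, iters):
    """Iterate em_step from an initial guess theta0 for a fixed number of rounds."""
    theta = theta0.copy()
    w = None
    for _ in range(iters):
        theta, w = em_step(X, y, theta, sigma)
    return theta, w
```

In a federated deployment, each client could compute its own responsibilities and its weighted sufficient statistics locally, leaving the server to sum them and solve the $K$ linear systems; the sketch above simply performs those sums in one process. Because the responsibilities pool all $n$ residuals of a client, more points per client make the soft assignment sharper, which is one intuition for why the ratio $m/n$ matters in the analysis.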

Paper Structure (19 sections, 14 theorems, 115 equations, 5 figures, 1 algorithm)

Introduction
Our contributions
Related Work
Problem Setup and EM Algorithm
Notation
The FMLR model
EM Algorithm
Main Results
Experiments
Conclusions and Discussions
Proofs for Section \ref{sec:setup}
Proof of Proposition \ref{prop:population_EM}
Proof of Proposition \ref{prop:empirical_EM}
Proofs for Section \ref{sec:results}
Proof of Theorem \ref{theorem.1}
...and 4 more sections

Key Result

Proposition 2

Suppose Assumption \ref{as:dgp} holds and $\{(x_i, y_i)\}_{i=1}^n$ are generated by the FMLR model as given in Algorithm \ref{alg:fmlr}. Then for each $k \in [K]$, one iteration of the population EM, given the current estimates $\boldsymbol{\theta}$, is given by

Figures (5)

Figure 1: Effect of number of data points $n$
Figure 2: Effect of number of clusters $K$
Figure 3: Effect of dimension $d$
Figure 4: Effect of SNR
Figure 5: Effect of $\Delta_{\max}$

Theorems & Definitions (14)

Proposition 2: Population EM
Proposition 3: Empirical EM
Theorem 5: Uniform consistency
Theorem 6: Empirical uniform consistency
Corollary 7
Lemma 8
Proposition 9
Lemma 10
Lemma 11
Lemma 12
...and 4 more
