Network EM Algorithm for Gaussian Mixture Model in Decentralized Federated Learning

Shuyuan Wu; Bin Du; Xuetong Li; Hansheng Wang

Network EM Algorithm for Gaussian Mixture Model in Decentralized Federated Learning

Shuyuan Wu, Bin Du, Xuetong Li, Hansheng Wang

TL;DR

Rigorous theoretical analysis demonstrates that MNEM can achieve statistical efficiency comparable to that of the whole sample estimator when the mixture components satisfy certain separation conditions, even in heterogeneous scenarios, even in heterogeneous scenarios.

Abstract

We systematically study various network Expectation-Maximization (EM) algorithms for the Gaussian mixture model within the framework of decentralized federated learning. Our theoretical investigation reveals that directly extending the classical decentralized supervised learning method to the EM algorithm exhibits poor estimation accuracy with heterogeneous data across clients and struggles to converge numerically when Gaussian components are poorly-separated. To address these issues, we propose two novel solutions. First, to handle heterogeneous data, we introduce a momentum network EM (MNEM) algorithm, which uses a momentum parameter to combine information from both the current and historical estimators. Second, to tackle the challenge of poorly-separated Gaussian components, we develop a semi-supervised MNEM (semi-MNEM) algorithm, which leverages partially labeled data. Rigorous theoretical analysis demonstrates that MNEM can achieve statistical efficiency comparable to that of the whole sample estimator when the mixture components satisfy certain separation conditions, even in heterogeneous scenarios. Moreover, the semi-MNEM estimator enhances the convergence speed of the MNEM algorithm, effectively addressing the numerical convergence challenges in poorly-separated scenarios. Extensive simulation and real data analyses are conducted to justify our theoretical findings.

Network EM Algorithm for Gaussian Mixture Model in Decentralized Federated Learning

TL;DR

Abstract

Network EM Algorithm for Gaussian Mixture Model in Decentralized Federated Learning

TL;DR

Abstract

Paper Structure

Key Result

Figures (4)

Theorems & Definitions (4)