Federated ADMM from Bayesian Duality

Thomas Möllenhoff; Siddharth Swaroop; Finale Doshi-Velez; Mohammad Emtiyaz Khan

Federated ADMM from Bayesian Duality

Thomas Möllenhoff, Siddharth Swaroop, Finale Doshi-Velez, Mohammad Emtiyaz Khan

TL;DR

The paper introduces Bayesian Duality as a unifying lens to derive and extend federated ADMM. By reformulating federated learning with variational Bayes over exponential-family posteriors, it recovers ADMM for isotropic Gaussians and naturally yields Newton-like and Adam-like variants (e.g., IVON-ADMM) when richer posteriors are used. The BayesADMM framework leverages dual updates in natural-parameter space, enabling uncertainty-aware updates that improve performance under client heterogeneity while preserving communication efficiency. Empirical results across diverse federated benchmarks show IVON-ADMM often outperforming strong baselines, sometimes achieving near-one-round convergence in convex settings and robust performance in deep learning tasks. Overall, the work opens a path to extending primal-dual methods with Bayesian principles, suggesting broad applicability to other distributed optimization techniques.

Abstract

We propose a new Bayesian approach to derive and extend the federated Alternating Direction Method of Multipliers (ADMM). We show that the solutions of variational-Bayesian objectives are associated with a duality structure that not only resembles ADMM but also extends it. For example, ADMM-like updates are recovered when the objective is optimized over the isotropic-Gaussian family, and new non-trivial extensions are obtained for other more flexible exponential families. Examples include a Newton-like variant that converges in one step on quadratics and an Adam-like variant called IVON-ADMM that has the same cost as Adam but yields up to 7% accuracy boosts in heterogeneous deep learning. Our work opens a new direction to use Bayes to extend ADMM and other primal-dual methods.

Federated ADMM from Bayesian Duality

TL;DR

Abstract

Federated ADMM from Bayesian Duality

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (6)

Theorems & Definitions (6)