Federated ADMM from Bayesian Duality
Thomas Möllenhoff, Siddharth Swaroop, Finale Doshi-Velez, Mohammad Emtiyaz Khan
TL;DR
The paper introduces Bayesian Duality as a unifying lens to derive and extend federated ADMM. By reformulating federated learning with variational Bayes over exponential-family posteriors, it recovers ADMM for isotropic Gaussians and naturally yields Newton-like and Adam-like variants (e.g., IVON-ADMM) when richer posteriors are used. The BayesADMM framework leverages dual updates in natural-parameter space, enabling uncertainty-aware updates that improve performance under client heterogeneity while preserving communication efficiency. Empirical results across diverse federated benchmarks show IVON-ADMM often outperforming strong baselines, sometimes achieving near-one-round convergence in convex settings and robust performance in deep learning tasks. Overall, the work opens a path to extending primal-dual methods with Bayesian principles, suggesting broad applicability to other distributed optimization techniques.
Abstract
We propose a new Bayesian approach to derive and extend the federated Alternating Direction Method of Multipliers (ADMM). We show that the solutions of variational-Bayesian objectives are associated with a duality structure that not only resembles ADMM but also extends it. For example, ADMM-like updates are recovered when the objective is optimized over the isotropic-Gaussian family, and new non-trivial extensions are obtained for other more flexible exponential families. Examples include a Newton-like variant that converges in one step on quadratics and an Adam-like variant called IVON-ADMM that has the same cost as Adam but yields up to 7% accuracy boosts in heterogeneous deep learning. Our work opens a new direction to use Bayes to extend ADMM and other primal-dual methods.
