Distributionally Robust Federated Learning: An ADMM Algorithm
Wen Bai, Yi Wong, Xiao Qiao, Chin Pang Ho
TL;DR
This work tackles distributional heterogeneity and ambiguity in federated learning by introducing Distributionally Robust Federated Learning (DRFL) with per-client Wasserstein ambiguity sets. It derives a tractable reformulation that converts the inner worst-case expectation into a finite-dimensional optimization and embeds it into a min-max-min structure amenable to ADMM-based optimization (AD-LPMM). The proposed algorithm provides convergence guarantees in the convex setting and practical convergence to critical points otherwise, while enabling client-local worst-case loss updates in a distributed fashion. Empirical results on SVM and huber-regression tasks across several datasets show that DRFL consistently outperforms AFL, WAFL, and other robust baselines under data heterogeneity and distributional ambiguity, highlighting its potential for robust FL in real-world heterogeneous environments.
Abstract
Federated learning (FL) aims to train machine learning (ML) models collaboratively using decentralized data, bypassing the need for centralized data aggregation. Standard FL models often assume that all data come from the same unknown distribution. However, in practical situations, decentralized data frequently exhibit heterogeneity. We propose a novel FL model, Distributionally Robust Federated Learning (DRFL), that applies distributionally robust optimization to overcome the challenges posed by data heterogeneity and distributional ambiguity. We derive a tractable reformulation for DRFL and develop a novel solution method based on the alternating direction method of multipliers (ADMM) algorithm to solve this problem. Our experimental results demonstrate that DRFL outperforms standard FL models under data heterogeneity and ambiguity.
