Hierarchical Federated ADMM
Seyed Mohammad Azimi-Abarghouyi, Nicola Bastianello, Karl H. Johansson, Viktoria Fodor
TL;DR
This work replaces gradient-descent-based hierarchical federated learning (FL) with a framework that uses the alternating direction method of multipliers (ADMM) at the top layer, introducing two algorithms: HierFADMM, which uses ADMM at the top layer and gradient-descent updates at the bottom, and HierF2ADMM, which uses ADMM on both layers. The authors derive the cloud aggregation and lower-layer update rules, show privacy benefits from sharing linear combinations of parameters, and prove convergence of the inexact ADMM formulations as the number of inner iterations grows. Experiments on logistic regression with the Adult dataset demonstrate improved convergence and accuracy for the ADMM-based methods, especially under non-i.i.d. data, with HierF2ADMM offering additional privacy gains. The results establish a modular hierarchical FL approach that can integrate alternative optimization methods at either layer, with practical implications for scalable and privacy-aware distributed learning.
Abstract
In this paper, we depart from the widely used gradient-descent-based hierarchical federated learning (FL) algorithms to develop a novel hierarchical FL framework based on the alternating direction method of multipliers (ADMM). Within this framework, we propose two novel FL algorithms, both of which use ADMM in the top layer: one employs ADMM in the lower layer as well, and the other uses the conventional gradient-descent-based approach there. The proposed framework enhances privacy, and experiments demonstrate the superiority of the proposed algorithms over the conventional algorithms in terms of learning convergence and accuracy. Additionally, gradient descent in the lower layer performs well even when the number of local steps is very limited, while ADMM in both layers leads to better performance otherwise.
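
To make the two-layer structure concrete, the following is a minimal sketch of the general idea behind an ADMM top layer combined with gradient-descent updates in the lower layer. It is not the authors' HierFADMM: it uses the textbook scaled-form consensus ADMM, a least-squares loss instead of logistic regression, synthetic data instead of the Adult dataset, and equal-sized clients and edges. All function names, the penalty `rho`, the step size `lr`, and the other hyperparameters are assumptions chosen for illustration.

```python
import numpy as np

def client_gradient(w, A, b):
    """Gradient of the loss 0.5 * mean((A w - b)^2) on one client."""
    return A.T @ (A @ w - b) / len(b)

def edge_update(x, z, u, clients, rho, lr, local_steps):
    """Lower layer: a few gradient steps on the edge's augmented Lagrangian.

    Each step averages the client gradients and adds the proximal term
    rho * (x - z + u) that couples the edge to the top-layer ADMM.
    """
    for _ in range(local_steps):
        grad = np.mean([client_gradient(x, A, b) for A, b in clients], axis=0)
        x = x - lr * (grad + rho * (x - z + u))
    return x

def hierarchical_admm(edges, dim, rho=1.0, lr=0.2, local_steps=5, rounds=300):
    """Top layer: scaled-form consensus ADMM between cloud and edge servers."""
    z = np.zeros(dim)                       # cloud (global) model
    xs = [np.zeros(dim) for _ in edges]     # edge models
    us = [np.zeros(dim) for _ in edges]     # scaled dual variables
    for _ in range(rounds):
        # Inexact primal update: warm-started gradient steps at each edge.
        xs = [edge_update(x, z, u, clients, rho, lr, local_steps)
              for x, u, clients in zip(xs, us, edges)]
        # In this sketch each edge uploads x + u, a combination of its
        # primal and dual variables, rather than the raw model x.
        z = np.mean([x + u for x, u in zip(xs, us)], axis=0)
        # Dual update.
        us = [u + x - z for x, u in zip(xs, us)]
    return z

def make_problem(num_edges=3, clients_per_edge=4, samples=20, dim=3, seed=0):
    """Synthetic data; each edge has its own ground truth (non-i.i.d.)."""
    rng = np.random.default_rng(seed)
    edges = []
    for _ in range(num_edges):
        w_edge = rng.normal(size=dim)
        clients = []
        for _ in range(clients_per_edge):
            A = rng.normal(size=(samples, dim))
            b = A @ w_edge + 0.1 * rng.normal(size=samples)
            clients.append((A, b))
        edges.append(clients)
    return edges

edges = make_problem()
z = hierarchical_admm(edges, dim=3)

# Reference: centralized least-squares solution on the pooled data.
A_all = np.vstack([A for clients in edges for A, _ in clients])
b_all = np.concatenate([b for clients in edges for _, b in clients])
w_star = np.linalg.lstsq(A_all, b_all, rcond=None)[0]
print("distance to centralized solution:", np.linalg.norm(z - w_star))
```

Because every client and edge holds the same number of samples here, the consensus point of the sketch coincides with the pooled least-squares solution, which gives a simple correctness check. Replacing the gradient loop in `edge_update` with a second consensus ADMM among the clients would give a rough analogue of using ADMM in both layers.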
