Noise-Robust and Resource-Efficient ADMM-based Federated Learning
Ehsan Lari, Reza Arablouei, Vinay Chakravarthi Gogineni, Stefan Werner
TL;DR
This work tackles federated learning under noisy communication channels and limited client participation by formulating an ADMM-based WLS solver that is robust to additive noise and reduces communication overhead. It introduces dual-variable elimination and random client scheduling, plus a continual local-update variant, to boost robustness and efficiency. The authors provide rigorous mean and mean-square convergence analyses and derive a closed-form steady-state MSE, validated by simulations that confirm the theory and demonstrate significant performance gains and resource savings. The approach offers a practical path to reliable, scalable FL in wireless environments with heterogeneous devices.
Abstract
Federated learning (FL) leverages client-server communications to train global models on decentralized data. However, communication noise or errors can impair model accuracy. To address this problem, we propose a novel FL algorithm that enhances robustness against communication noise while also reducing communication load. We derive the proposed algorithm through solving the weighted least-squares (WLS) regression problem as an illustrative example. We first frame WLS regression as a distributed convex optimization problem over a federated network employing random scheduling for improved communication efficiency. We then apply the alternating direction method of multipliers (ADMM) to iteratively solve this problem. To counteract the detrimental effects of cumulative communication noise, we introduce a key modification by eliminating the dual variable and implementing a new local model update at each participating client. This subtle yet effective change results in using a single noisy global model update at each client instead of two, improving robustness against additive communication noise. Furthermore, we incorporate another modification enabling clients to continue local updates even when not selected by the server, leading to substantial performance improvements. Our theoretical analysis confirms the convergence of our algorithm in both mean and the mean-square senses, even when the server communicates with a random subset of clients over noisy links at each iteration. Numerical results validate the effectiveness of our proposed algorithm and corroborate our theoretical findings.
