Adaptive Decentralized Federated Learning for Robust Optimization
Shuyuan Wu, Feifei Wang, Yuan Gao, Rui Wang, Hansheng Wang
TL;DR
The paper addresses robustness in decentralized federated learning (DFL), where abnormal clients can degrade the learned model. It introduces adaptive decentralized federated learning (aDFL), which assigns per-client learning-rate weights based on gradient behavior to down-weight suspicious updates, without requiring prior knowledge of which neighbors are reliable. The authors prove convergence results, including an oracle-property guarantee, and demonstrate superior robustness and efficiency through extensive synthetic and real-data experiments. The method preserves the original network topology and applies to heterogeneous data; privacy, communication efficiency, and broader robustness extensions are noted as directions for future work.
Abstract
In decentralized federated learning (DFL), abnormal clients, often caused by noisy or poisoned data, can significantly disrupt the learning process and degrade the robustness of the model. Existing methods for this problem often require a sufficiently large number of normal neighboring clients or prior knowledge of which clients are reliable, which limits the practical applicability of DFL. To address these limitations, we develop a novel adaptive DFL (aDFL) approach for robust estimation. The key idea is to adaptively adjust the learning rates of clients. By assigning smaller rates to suspicious clients and larger rates to normal clients, aDFL mitigates the negative impact of abnormal clients on the global model in a fully adaptive way. Our theory imposes no stringent conditions on neighboring nodes and requires no prior knowledge. A rigorous convergence analysis is provided to guarantee the oracle property of aDFL. Extensive numerical experiments demonstrate the superior performance of the aDFL method.
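To make the key idea concrete, the following is a minimal sketch of one decentralized update in which each client scales its own learning rate by how large its gradient is. The weighting rule (an exponential decay in the gradient norm), the ring topology, and the names `adaptive_rates`, `ring_mixing`, `adfl_step`, `base_lr`, and `temperature` are all illustrative assumptions; the abstract does not specify the weighting function the authors use, so this should not be read as their algorithm.

```python
import numpy as np


def adaptive_rates(grad_norms, base_lr=0.1, temperature=1.0):
    """Hypothetical rule: the larger a client's gradient norm, the smaller its rate.

    Each rate depends only on that client's own gradient, so no knowledge
    about neighbors is needed. This is an assumed rule, not the paper's.
    """
    grad_norms = np.asarray(grad_norms, dtype=float)
    return base_lr * np.exp(-temperature * grad_norms)


def ring_mixing(n_clients):
    """Symmetric, doubly stochastic mixing matrix for a ring (assumes n_clients >= 3)."""
    w = np.zeros((n_clients, n_clients))
    for i in range(n_clients):
        for j in (i - 1, i, i + 1):
            w[i, j % n_clients] = 1.0 / 3.0
    return w


def adfl_step(thetas, grads, mixing, base_lr=0.1, temperature=1.0):
    """One step: average parameters with neighbors, then take a re-weighted gradient step.

    thetas, grads: arrays of shape (n_clients, dim), one row per client.
    mixing: (n_clients, n_clients) matrix encoding the fixed network topology.
    """
    thetas = np.asarray(thetas, dtype=float)
    grads = np.asarray(grads, dtype=float)
    rates = adaptive_rates(np.linalg.norm(grads, axis=1), base_lr, temperature)
    return mixing @ thetas - rates[:, None] * grads
```

Under this assumed rule, a client whose local gradient is very large (for example, because its data are corrupted) takes an almost negligible step, while clients with small gradients step at close to `base_lr`. The mixing matrix is left unchanged, so the network topology is preserved.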
