An Element-Wise Weights Aggregation Method for Federated Learning
Yi Hu, Hanchi Ren, Chen Hu, Jingjing Deng, Xianghua Xie
TL;DR
EWWA-FL advances federated learning by introducing element-wise aggregation, where each parameter across local models receives a distinct, client-specific weight rather than a single global proportion. Building on FedAvg, it uses Adam-like first- and second-moment estimates to compute per-parameter contributions and applies Softmax to obtain parameter-level aggregation weights, enabling finer alignment to heterogeneous client data. Empirical results show improved accuracy and faster convergence, particularly on large-class datasets and non-IID settings, with high stability and reduced variance compared to peers like FedOpt and FedCAMS. The approach is implemented in an open-source framework, indicating practical applicability and reproducibility for diverse backbone networks and benchmarks.
Abstract
Federated learning (FL) is a powerful Machine Learning (ML) paradigm that enables distributed clients to collaboratively learn a shared global model while keeping the data on the original device, thereby preserving privacy. A central challenge in FL is the effective aggregation of local model weights from disparate and potentially unbalanced participating clients. Existing methods often treat each client indiscriminately, applying a single proportion to the entire local model. However, it is empirically advantageous for each weight to be assigned a specific proportion. This paper introduces an innovative Element-Wise Weights Aggregation Method for Federated Learning (EWWA-FL) aimed at optimizing learning performance and accelerating convergence speed. Unlike traditional FL approaches, EWWA-FL aggregates local weights to the global model at the level of individual elements, thereby allowing each participating client to make element-wise contributions to the learning process. By taking into account the unique dataset characteristics of each client, EWWA-FL enhances the robustness of the global model to different datasets while also achieving rapid convergence. The method is flexible enough to employ various weighting strategies. Through comprehensive experiments, we demonstrate the advanced capabilities of EWWA-FL, showing significant improvements in both accuracy and convergence speed across a range of backbones and benchmarks.
