FedPara: Low-Rank Hadamard Product for Communication-Efficient Federated Learning
Nam Hyeon-Woo, Moon Ye-Bin, Tae-Hyun Oh
TL;DR
<3-5 sentence high-level summary> FedPara tackles the communication bottleneck in Federated Learning by re-parameterizing neural network layers with a low-rank Hadamard product, enabling near-full expressiveness with far fewer transmitted parameters. The core idea W = (X1Y1^T) ⊙ (X2Y2^T) achieves substantial parameter and communication reductions, while preserving or even enhancing accuracy in IID and non-IID settings; a personalized variant pFedPara further splits parameters into global and local components for robust non-IID performance. The approach is compatible with existing FL optimizers and can be extended to various architectures, including CNNs and LSTMs, with demonstrated 3.4× to 10× communication savings in experiments. Overall, FedPara/pFedPara offer practical, scalable improvements for edge devices and heterogeneous networks, with realistic implications for energy use and global FL accessibility.
Abstract
In this work, we propose a communication-efficient parameterization, FedPara, for federated learning (FL) to overcome the burdens on frequent model uploads and downloads. Our method re-parameterizes weight parameters of layers using low-rank weights followed by the Hadamard product. Compared to the conventional low-rank parameterization, our FedPara method is not restricted to low-rank constraints, and thereby it has a far larger capacity. This property enables to achieve comparable performance while requiring 3 to 10 times lower communication costs than the model with the original layers, which is not achievable by the traditional low-rank methods. The efficiency of our method can be further improved by combining with other efficient FL optimizers. In addition, we extend our method to a personalized FL application, pFedPara, which separates parameters into global and local ones. We show that pFedPara outperforms competing personalized FL methods with more than three times fewer parameters.
