Exploring the Practicality of Federated Learning: A Survey Towards the Communication Perspective
Khiem Le, Nhan Luong-Ha, Manh Nguyen-Duc, Danh Le-Phuoc, Cuong Do, Kok-Seng Wong
TL;DR
This survey addresses communication efficiency as a practical obstacle in Federated Learning (FL). It defines quantitative measures of communication overhead, identifies its primary sources, and presents a taxonomy of methods that reduce the number of communication rounds, the number of participating clients, and the network burden. It reviews both centralized and decentralized FL frameworks, assesses open-source programming environments, and surveys an extensive set of techniques, including regularization, aggregation correction, one-shot updating, dynamic client selection, quantization, sparsification, factorization, and distillation. The authors also discuss FL architectures (hierarchical and peer-to-peer) and outline future directions such as hybrid communication strategies, dynamic participation, transfer learning, and privacy-preserving mechanisms, all aimed at making FL scalable in real-world deployments. The work brings together metrics, methodological trends, and architectural considerations relevant to adopting FL in domains such as IoT, healthcare, and finance, and highlights the trade-offs between model performance, privacy, and communication cost. Overall, the paper provides a structured roadmap for designing, evaluating, and deploying communication-efficient FL systems at scale.
Abstract
Federated Learning (FL) is a promising paradigm that offers significant advancements in privacy-preserving, decentralized machine learning by enabling collaborative training of models across distributed devices without centralizing data. However, the practical deployment of FL systems faces a major bottleneck: the communication overhead caused by frequently exchanging large model updates between numerous devices and a central server. This communication inefficiency can hinder training speed, model performance, and the overall feasibility of real-world FL applications. In this survey, we investigate strategies and advancements in communication-efficient FL, highlighting their impact and their potential to overcome the communication challenges inherent in FL systems. Specifically, we define measures for communication efficiency, analyze sources of communication inefficiency in FL systems, and provide a taxonomy and comprehensive review of state-of-the-art communication-efficient FL methods. Additionally, we discuss promising future research directions for enhancing the communication efficiency of FL systems. Addressing the communication bottleneck would allow FL to be deployed scalably and practically across diverse applications that require privacy-preserving, decentralized machine learning, such as IoT, healthcare, and finance.
