Accelerating Energy-Efficient Federated Learning in Cell-Free Networks with Adaptive Quantization
Afsaneh Mahmoudi, Ming Xiao, Emil Björnson
TL;DR
The paper tackles energy- and latency-constrained federated learning over cell-free MIMO networks by introducing Exponent-Mantissa Quantization (EMQ) and AdaDelta-based local updates. The approach decomposes the training into two subproblems per global iteration: local model development with adaptive local iterations and EMQ-driven communication, and a SQP-based uplink power allocation to trade off energy consumption and latency within budgets. Theoretical convergence is established for FedAvg with AdaDelta under standard assumptions, and extensive simulations on IID and non-IID data show that EMQ can match full-precision accuracy while dramatically reducing computation, with power-allocation gains yielding up to 7–36% improvements in test accuracy over various baselines. The results demonstrate practical benefits for scalable, energy-efficient FL in CFmMIMO settings, offering a pathway to real-time, resource-constrained distributed learning in wireless networks.
Abstract
Federated Learning (FL) enables clients to share learning parameters instead of local data, reducing communication overhead. Traditional wireless networks face latency challenges with FL. In contrast, Cell-Free Massive MIMO (CFmMIMO) can serve multiple clients on shared resources, boosting spectral efficiency and reducing latency for large-scale FL. However, clients' communication resource limitations can hinder the completion of the FL training. To address this challenge, we propose an energy-efficient, low-latency FL framework featuring optimized uplink power allocation for seamless client-server collaboration. Our framework employs an adaptive quantization scheme, dynamically adjusting bit allocation for local gradient updates to reduce communication costs. We formulate a joint optimization problem covering FL model updates, local iterations, and power allocation, solved using sequential quadratic programming (SQP) to balance energy and latency. Additionally, clients use the AdaDelta method for local FL model updates, enhancing local model convergence compared to standard SGD, and we provide a comprehensive analysis of FL convergence with AdaDelta local updates. Numerical results show that, within the same energy and latency budgets, our power allocation scheme outperforms the Dinkelbach and max-sum rate methods by increasing the test accuracy up to $7$\% and $19$\%, respectively. Moreover, for the three power allocation methods, our proposed quantization scheme outperforms AQUILA and LAQ by increasing test accuracy by up to $36$\% and $35$\%, respectively.
