Hierarchical Federated Learning in Wireless Networks: Pruning Tackles Bandwidth Scarcity and System Heterogeneity
Md Ferdous Pervej, Richeng Jin, Huaiyu Dai
TL;DR
This paper tackles the challenge of bandwidth-scarce, heterogeneous wireless networks by proposing pruning-enabled hierarchical federated learning (PHFL) that injects model pruning into a four-tier FL stack (UE-VC-sBS-mBS-cloud). It derives a convergence bound that separates pruning errors and wireless-link effects, and then uses successive convex approximation (SCA) to jointly optimize pruning ratios, CPU frequency, and transmit power under strict delay and energy constraints. The authors demonstrate, through simulations on CIFAR-10/100 with CNN and ResNet architectures, that PHFL can substantially reduce training time, energy consumption, and bandwidth requirements while incurring only modest or negligible accuracy degradation, especially for bulky models. Overall, PHFL offers a practical pathway to enable efficient, privacy-preserving distributed learning in resource-constrained wireless networks with hierarchical aggregation and pruning-driven efficiency gains.
Abstract
While a practical wireless network has many tiers where end users do not directly communicate with the central server, the users' devices have limited computation and battery powers, and the serving base station (BS) has a fixed bandwidth. Owing to these practical constraints and system models, this paper leverages model pruning and proposes a pruning-enabled hierarchical federated learning (PHFL) in heterogeneous networks (HetNets). We first derive an upper bound of the convergence rate that clearly demonstrates the impact of the model pruning and wireless communications between the clients and the associated BS. Then we jointly optimize the model pruning ratio, central processing unit (CPU) frequency and transmission power of the clients in order to minimize the controllable terms of the convergence bound under strict delay and energy constraints. However, since the original problem is not convex, we perform successive convex approximation (SCA) and jointly optimize the parameters for the relaxed convex problem. Through extensive simulation, we validate the effectiveness of our proposed PHFL algorithm in terms of test accuracy, wall clock time, energy consumption and bandwidth requirement.
