A Hierarchical Federated Learning Approach for the Internet of Things
Seyed Mohammad Azimi-Abarghouyi, Viktoria Fodor
TL;DR
This work addresses scalable federated learning for large-scale IoT deployments by introducing QHetFed, a two-level hierarchical FL algorithm that combines intra-set gradient aggregation with inter-set model aggregation and uses end-of-interval gamma steps to accelerate learning under data heterogeneity and quantization. It provides a heterogeneity-aware convergence analysis that yields an optimality gap bound dependent on quantization noise and data heterogeneity, and derives closed-form expressions for optimal intra-set iterations and gradient steps under a deadline. System optimization links communication and computation times to learning parameters, enabling closed-form selection of tau and gamma to minimize gradient error within time constraints. Empirical results on CIFAR-10 with a three-set IoT-like topology show QHetFed is more robust to non-iid data and quantization than conventional hierarchical schemes, with optimized parameter choices delivering faster convergence and higher final accuracy in heterogeneous settings.
Abstract
This paper presents a novel federated learning solution, QHetFed, suitable for large-scale Internet of Things deployments, addressing the challenges of large geographic span, communication resource limitation, and data heterogeneity. QHetFed is based on hierarchical federated learning over multiple device sets, where the learning process and learning parameters take the necessary data quantization and the data heterogeneity into consideration to achieve high accuracy and fast convergence. Unlike conventional hierarchical federated learning algorithms, the proposed approach combines gradient aggregation in intra-set iterations with model aggregation in inter-set iterations. We offer a comprehensive analytical framework to evaluate its optimality gap and convergence rate, and give a closed form expression for the optimal learning parameters under a deadline, that accounts for communication and computation times. Our findings reveal that QHetFed consistently achieves high learning accuracy and significantly outperforms other hierarchical algorithms, particularly in scenarios with heterogeneous data distributions.
