Sparse Incremental Aggregation in Multi-Hop Federated Learning
Sourav Mukherjee, Nasrin Razmi, Armin Dekorsy, Petar Popovski, Bho Matthiesen
TL;DR
The paper tackles federated learning over multi-hop networks where in-network incremental aggregation (IA) can dramatically cut communication, but gradient sparsification degrades IA gains. It proposes correlated sparsification methods—RE-SIA, CL-SIA, TC-SIA, and CL-TC-SIA—to preserve IA efficiency under Top-$Q$ sparsification and, when used with time-correlation (tcs), control the growth of transmitted nonzeros across hops. The authors analyze error minimization, derive cost bounds, and demonstrate via MNIST-based experiments that constant-length (CL) and time-correlated (TC) variants achieve substantial bandwidth reductions while maintaining near-IA convergence, with CL-SIA/CL-TC-SIA performing best under equal bandwidth constraints. The results indicate strong potential for efficient, scalable IA in multi-hop FL, especially in satellite constellations and related networks, and point to future work on rigorous convergence guarantees for the new schemes.
Abstract
This paper investigates federated learning (FL) in a multi-hop communication setup, such as in constellations with inter-satellite links. In this setup, part of the FL clients are responsible for forwarding other client's results to the parameter server. Instead of using conventional routing, the communication efficiency can be improved significantly by using in-network model aggregation at each intermediate hop, known as incremental aggregation (IA). Prior works [1] have indicated diminishing gains for IA under gradient sparsification. Here we study this issue and propose several novel correlated sparsification methods for IA. Numerical results show that, for some of these algorithms, the full potential of IA is still available under sparsification without impairing convergence. We demonstrate a 15x improvement in communication efficiency over conventional routing and a 11x improvement over state-of-the-art (SoA) sparse IA.
