FedMeld: A Model-dispersal Federated Learning Framework for Space-ground Integrated Networks
Qian Chen, Xianhao Chen, Kaibin Huang
TL;DR
FedMeld introduces an infrastructure-free federated learning framework for space-ground integrated networks by leveraging the store-ccarry-forward mobility of satellites to disperse and mix model parameters across adjacent regions. The authors derive convergence bounds for both full and partial participation, and formulate a joint SC-MR optimization to minimize training loss under latency constraints, delivering closed-form solutions for the inter-region round interval and semi-closed-form mixing ratio. Through extensive simulations on CIFAR-10 and MNIST with Starlink-like constellations, FedMeld achieves higher accuracy with lower communication costs compared to centralized and ISL-based baselines, while maintaining robustness to data heterogeneity. The work highlights a practical path to global FL in satellite-enabled networks, balancing latency, bandwidth, and heterogeneity considerations, and opens avenues for region-specific timing and adaptive mixing strategies.
Abstract
To bridge the digital divide, space-ground integrated networks (SGINs) are expected to deliver artificial intelligence (AI) services to every corner of the world. One key mission of SGINs is to support federated learning (FL) at a global scale. However, existing space-ground integrated FL frameworks involve ground stations or costly inter-satellite links, entailing excessive training latency and communication costs. To overcome these limitations, we propose an infrastructure-free federated learning framework based on a model dispersal (FedMeld) strategy, which exploits periodic movement patterns and store-carry-forward capabilities of satellites to enable parameter mixing across large-scale geographical regions. We theoretically show that FedMeld leads to global model convergence and quantify the effects of round interval and mixing ratio between adjacent areas on its learning performance. Based on the theoretical results, we formulate a joint optimization problem to design the staleness control and mixing ratio (SC-MR) for minimizing the training loss. By decomposing the problem into sequential SC and MR subproblems without compromising the optimality, we derive the round interval solution in a closed form and the mixing ratio in a semi-closed form to achieve the optimal latency-accuracy tradeoff. Experiments using various datasets demonstrate that FedMeld achieves superior model accuracy while significantly reducing communication costs as compared with traditional FL schemes for SGINs.
