Table of Contents
Fetching ...

FedSN: A Federated Learning Framework over Heterogeneous LEO Satellite Networks

Zheng Lin, Zhe Chen, Zihan Fang, Xianhao Chen, Xiong Wang, Yue Gao

TL;DR

FedSN is proposed as a general FL framework to enable heterogeneous local model training considering different computing, memory, and communication constraints on LEO satellites and a pseudo-synchronous model aggregation strategy to dynamically schedule model aggregation for compensating model staleness is proposed.

Abstract

Recently, a large number of Low Earth Orbit (LEO) satellites have been launched and deployed successfully in space by commercial companies, such as SpaceX. Due to multimodal sensors equipped by the LEO satellites, they serve not only for communication but also for various machine learning applications, such as space modulation recognition, remote sensing image classification, etc. However, the ground station (GS) may be incapable of downloading such a large volume of raw sensing data for centralized model training due to the limited contact time with LEO satellites (e.g. 5 minutes). Therefore, federated learning (FL) has emerged as the promising solution to address this problem via on-device training. Unfortunately, to enable FL on LEO satellites, we still face three critical challenges that are i) heterogeneous computing and memory capabilities, ii) limited uplink rate, and iii) model staleness. To this end, we propose FedSN as a general FL framework to tackle the above challenges, and fully explore data diversity on LEO satellites. Specifically, we first present a novel sub-structure scheme to enable heterogeneous local model training considering different computing, memory, and communication constraints on LEO satellites. Additionally, we propose a pseudo-synchronous model aggregation strategy to dynamically schedule model aggregation for compensating model staleness. To further demonstrate the effectiveness of the FedSN, we evaluate it using space modulation recognition and remote sensing image classification tasks by leveraging the data from real-world satellite networks. Extensive experimental results demonstrate that FedSN framework achieves higher accuracy, lower computing, and communication overhead than the state-of-the-art benchmarks and the effectiveness of each components in FedSN.

FedSN: A Federated Learning Framework over Heterogeneous LEO Satellite Networks

TL;DR

FedSN is proposed as a general FL framework to enable heterogeneous local model training considering different computing, memory, and communication constraints on LEO satellites and a pseudo-synchronous model aggregation strategy to dynamically schedule model aggregation for compensating model staleness is proposed.

Abstract

Recently, a large number of Low Earth Orbit (LEO) satellites have been launched and deployed successfully in space by commercial companies, such as SpaceX. Due to multimodal sensors equipped by the LEO satellites, they serve not only for communication but also for various machine learning applications, such as space modulation recognition, remote sensing image classification, etc. However, the ground station (GS) may be incapable of downloading such a large volume of raw sensing data for centralized model training due to the limited contact time with LEO satellites (e.g. 5 minutes). Therefore, federated learning (FL) has emerged as the promising solution to address this problem via on-device training. Unfortunately, to enable FL on LEO satellites, we still face three critical challenges that are i) heterogeneous computing and memory capabilities, ii) limited uplink rate, and iii) model staleness. To this end, we propose FedSN as a general FL framework to tackle the above challenges, and fully explore data diversity on LEO satellites. Specifically, we first present a novel sub-structure scheme to enable heterogeneous local model training considering different computing, memory, and communication constraints on LEO satellites. Additionally, we propose a pseudo-synchronous model aggregation strategy to dynamically schedule model aggregation for compensating model staleness. To further demonstrate the effectiveness of the FedSN, we evaluate it using space modulation recognition and remote sensing image classification tasks by leveraging the data from real-world satellite networks. Extensive experimental results demonstrate that FedSN framework achieves higher accuracy, lower computing, and communication overhead than the state-of-the-art benchmarks and the effectiveness of each components in FedSN.
Paper Structure (29 sections, 12 equations, 17 figures)

This paper contains 29 sections, 12 equations, 17 figures.

Figures (17)

  • Figure 1: A scenario of FL over LEO satellite networks.
  • Figure 2: The impact of heterogeneous computing and memory resources on FL. Fig. \ref{['sfig:hetero_compute']} and Fig. \ref{['sfig:hetero_memory']} show the performance for test accuracy versus under-training satellite rates under computing and memory constraints. The experiment is conducted on the GBSense gbsense under the IID setting using VGG-16 simonyan2014very.
  • Figure 3: Uplink communication between LEO satellite and GS becomes a major bottleneck for FL. Fig. \ref{['sfig:exStarlink']} and Fig. \ref{['sfig:gsStarlink']} show Starlink's GS and experimental setup for average uplink and downlink rate measurements. Fig. \ref{['sfig:data_rate']} presents the CDF of the uplink and downlink rates. Fig. \ref{['sfig:update_fail']} illustrates the performance for test accuracy versus the contact time, which is obtained from conducting experiments on GBSense dataset under IID setting using VGG-16, where $\Delta t$ denotes the contact time between GS and satellites.
  • Figure 4: The illustration of the intra-group and inter-group sets. The satellites in the triangles of different colors belong to distinct inter-group sets, while satellites within each orbit constitute an intra-group set.
  • Figure 5: The impact of intra-group and inter-group staleness on FL. Fig. \ref{['sfig:staleness1']} and Fig. \ref{['sfig:staleness2']} show the performance for test accuracy versus different intra-group satellite orbital period ratios and inter-group model aggregation schemes, respectively.
  • ...and 12 more figures