Have Your Cake and Eat It Too: Toward Efficient and Accurate Split Federated Learning
Dengke Yan, Ming Hu, Zeke Xia, Yanxin Yang, Jun Xia, Xiaofei Xie, Mingsong Chen
TL;DR
S^2FL tackles the twin challenges of stragglers and non-IID data in Split Federated Learning by introducing adaptive sliding model splitting and a data balance-based training mechanism. A three-part model (client-side, shared, server-side) is trained with device-aware partitions, feature-grouping by labels, and a novel aggregation method that respects per-device data weights. The authors provide convergence guarantees with an $O(1/t)$ rate under standard smoothness and convexity assumptions, and show through extensive experiments that S^2FL yields up to 16.5% accuracy gains and up to 3.54× training speedups over baselines across diverse AIoT datasets and models. This approach offers practical improvements for resource-constrained edge environments by aligning device workloads and homogenizing server-side training data distributions, thereby enabling efficient and accurate large-model federated learning.
Abstract
Due to its advantages in resource constraint scenarios, Split Federated Learning (SFL) is promising in AIoT systems. However, due to data heterogeneity and stragglers, SFL suffers from the challenges of low inference accuracy and low efficiency. To address these issues, this paper presents a novel SFL approach, named Sliding Split Federated Learning (S$^2$FL), which adopts an adaptive sliding model split strategy and a data balance-based training mechanism. By dynamically dispatching different model portions to AIoT devices according to their computing capability, S$^2$FL can alleviate the low training efficiency caused by stragglers. By combining features uploaded by devices with different data distributions to generate multiple larger batches with a uniform distribution for back-propagation, S$^2$FL can alleviate the performance degradation caused by data heterogeneity. Experimental results demonstrate that, compared to conventional SFL, S$^2$FL can achieve up to 16.5\% inference accuracy improvement and 3.54X training acceleration.
