Hierarchical Split Federated Learning: Convergence Analysis and System Optimization
Zheng Lin, Wei Wei, Zhe Chen, Chan-Tong Lam, Xianhao Chen, Yue Gao, Jun Luo
TL;DR
HSFL extends split federated learning to multi-tier cloud-edge systems by partitioning a neural network across tiers and optimizing where to split (MS) and how often to aggregate (MA). The authors derive a convergence bound that accounts for tiered aggregation frequencies and layer cuts, then formulate and solve a joint MS/MA latency-minimization problem using a BCD-based approach with approximations and MILFP/Dinkelbach steps. The proposed framework, validated on CIFAR-10/MNIST with VGG-16, shows faster convergence and higher accuracy than several baselines, particularly under non-IID data and constrained network resources, while maintaining robustness to resource variations. These results indicate HSFL can significantly accelerate on-device training in large-scale, heterogeneous edge environments, enabling practical deployment of large models at the edge.
Abstract
As AI models expand in size, it has become increasingly challenging to deploy federated learning (FL) on resource-constrained edge devices. To tackle this issue, split federated learning (SFL) has emerged as an FL framework with reduced workload on edge devices via model splitting; it has received extensive attention from the research community in recent years. Nevertheless, most prior works on SFL focus only on a two-tier architecture without harnessing multi-tier cloudedge computing resources. In this paper, we intend to analyze and optimize the learning performance of SFL under multi-tier systems. Specifically, we propose the hierarchical SFL (HSFL) framework and derive its convergence bound. Based on the theoretical results, we formulate a joint optimization problem for model splitting (MS) and model aggregation (MA). To solve this rather hard problem, we then decompose it into MS and MA subproblems that can be solved via an iterative descending algorithm. Simulation results demonstrate that the tailored algorithm can effectively optimize MS and MA for SFL within virtually any multi-tier system.
