Makespan Minimization in Split Learning: From Theory to Practice
Robert Ganian, Fionn Mc Inerney, Dimitra Tsigkari
TL;DR
The paper tackles makespan minimization in Split Learning under memory constraints, introducing SL-Makespan and GenSL-Makespan to capture client-helper scheduling with task precedences. It proves strong hardness results (including NP-hardness and inapproximability) for these problems, even under highly restricted instances, and provides a practical 5-approximation for SL-Makespan. For the more general GenSL-Makespan, it shows CH-Assign is strongly NP-hard, and proposes EquiDistributed (EquiD), an IP-guided heuristic drawing on the 5-approximation framework that performs well in experiments. Numerical evaluations on open-source and synthetic data demonstrate that EquiD achieves near-optimal makespans with substantial speedups over exact solvers and outperforms existing heuristics, highlighting its potential for scalable SL deployment in heterogeneous IoT settings.
Abstract
Split learning recently emerged as a solution for distributed machine learning with heterogeneous IoT devices, where clients can offload part of their training to computationally-powerful helpers. The core challenge in split learning is to minimize the training time by jointly devising the client-helper assignment and the schedule of tasks at the helpers. We first study the model where each helper has a memory cardinality constraint on how many clients it may be assigned, which represents the case of homogeneous tasks. Through complexity theory, we rule out exact polynomial-time algorithms and approximation schemes even for highly restricted instances of this problem. We complement these negative results with a non-trivial polynomial-time 5-approximation algorithm. Building on this, we then focus on the more general heterogeneous task setting considered by Tirana et al. [INFOCOM 2024], where helpers have memory capacity constraints and clients have variable memory costs. In this case, we prove that, unless P=NP, the problem cannot admit a polynomial-time approximation algorithm for any approximation factor. However, by adapting our aforementioned 5-approximation algorithm, we develop a novel heuristic for the heterogeneous task setting and show that it outperforms heuristics from prior works through extensive experiments.
