Towards Optimal Heterogeneous Client Sampling in Multi-Model Federated Learning

Haoran Zhang; Zejun Gong; Zekai Li; Marie Siew; Carlee Joe-Wong; Rachid El-Azouzi

TL;DR

This paper tackles multi-model federated learning (MMFL) under heterogeneous client constraints. It first establishes a convergence analysis for MMFL with arbitrary client sampling, then proposes a loss-based variance-reduced sampling method (MMFL-LVR) that minimizes per-round variance while honoring server and client budgets. To further stabilize training, it introduces MMFL-StaleVR, which optimally leverages stale updates, and MMFL-StaleVRE, a low-overhead variant that approximates the optimal stale weighting using only active clients. Experiments on Fashion-MNIST, EMNIST, CIFAR-10, and Shakespeare show that MMFL-LVR and especially MMFL-StaleVR achieve up to 19.1% higher average accuracy than random sampling and come within 5.4% of full participation, across diverse data distributions and levels of resource heterogeneity. Overall, the work provides a principled, scalable framework for coordinating concurrent model training across heterogeneous clients, with practical implications for edge deployments under limited bandwidth and compute.

Abstract

Federated learning (FL) allows edge devices to collaboratively train models without sharing local data. As FL gains popularity, clients may need to train multiple unrelated FL models, but communication constraints limit their ability to train all models simultaneously. While clients could train FL models sequentially, opportunistically having FL clients concurrently train different models -- termed multi-model federated learning (MMFL) -- can reduce the overall training time. Prior work uses simple client-to-model assignments that do not optimize the contribution of each client to each model over the course of its training. Prior work on single-model FL shows that intelligent client selection can greatly accelerate convergence, but naïve extensions to MMFL can violate heterogeneous resource constraints at both the server and the clients. In this work, we develop a novel convergence analysis of MMFL with arbitrary client sampling methods, theoretically demonstrating the strengths and limitations of previous well-established gradient-based methods. Motivated by this analysis, we propose MMFL-LVR, a loss-based sampling method that minimizes training variance while explicitly respecting communication limits at the server and reducing computational costs at the clients. We extend this to MMFL-StaleVR, which incorporates stale updates for improved efficiency and stability, and MMFL-StaleVRE, a lightweight variant suitable for low-overhead deployment. Experiments show our methods improve average accuracy by up to 19.1% over random sampling, with only a 5.4% gap from the theoretical optimum (full client participation).
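To make the idea of budget-constrained, loss-based sampling concrete, the following is a minimal single-model sketch. It is not the paper's MMFL-LVR algorithm: it illustrates generic importance sampling in which each client's inclusion probability is proportional to a score (here assumed to be its local loss), capped at 1, with the probabilities summing to an expected per-round budget. The function names, the Bernoulli sampling scheme, and the inverse-probability weighting are assumptions chosen for illustration.

```python
import random


def capped_probabilities(scores, budget):
    """Inclusion probabilities proportional to `scores`, capped at 1,
    summing to `budget` (the expected number of sampled clients).
    Hypothetical helper; scores must be strictly positive."""
    n = len(scores)
    if not 0 < budget <= n:
        raise ValueError("budget must be in (0, number of clients]")
    if any(s <= 0 for s in scores):
        raise ValueError("scores must be strictly positive")
    probs = [0.0] * n
    remaining = list(range(n))
    left = float(budget)
    while remaining:
        scale = left / sum(scores[i] for i in remaining)
        capped = {i for i in remaining if scores[i] * scale >= 1.0}
        if not capped:
            # No remaining probability exceeds 1: assign proportionally.
            for i in remaining:
                probs[i] = scores[i] * scale
            break
        # Clients whose share would exceed 1 are always sampled; the rest
        # of the budget is redistributed among the other clients.
        for i in capped:
            probs[i] = 1.0
        left -= len(capped)
        remaining = [i for i in remaining if i not in capped]
    return probs


def sample_clients(probs, rng):
    """Independent Bernoulli sampling with the given probabilities."""
    return [i for i, p in enumerate(probs) if rng.random() < p]


def unbiased_average(updates, probs, sampled):
    """Inverse-probability-weighted estimate of the mean of all updates."""
    return sum(updates[i] / probs[i] for i in sampled) / len(updates)


if __name__ == "__main__":
    losses = [4.0, 1.0, 1.0, 2.0]   # assumed per-client local losses
    updates = [1.0, 2.0, 3.0, 4.0]  # scalar stand-ins for model updates
    probs = capped_probabilities(losses, budget=2)
    chosen = sample_clients(probs, random.Random(0))
    print(probs, chosen, unbiased_average(updates, probs, chosen))
```

Dividing each sampled update by its inclusion probability keeps the aggregate unbiased for the full-participation average, so the choice of probabilities affects only the variance of the estimate. In the multi-model setting described above, the probabilities would additionally have to respect per-client limits across models, which this sketch does not attempt.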
