CoBo: Collaborative Learning via Bilevel Optimization
Diba Hashemi, Lie He, Martin Jaggi
TL;DR
We address collaborative learning with heterogeneous clients by a bilevel optimization framework that jointly selects collaborators and trains personalized models. The outer problem optimizes personalized models while the inner problem yields adaptive pairwise collaboration weights based on gradient alignment. CoBo, an SGD-style alternating algorithm, achieves convergence guarantees and scales to large numbers of clients, delivering up to $9.3\%$ accuracy improvement on a highly heterogeneous 80-client task. Empirical results across cross-silo, cross-device, and language-model fine-tuning demonstrate competitive performance against strong personalized baselines and reveal interpretable cluster-aware collaboration patterns.
Abstract
Collaborative learning is an important tool to train multiple clients more effectively by enabling communication among clients. Identifying helpful clients, however, presents challenging and often introduces significant overhead. In this paper, we model client-selection and model-training as two interconnected optimization problems, proposing a novel bilevel optimization problem for collaborative learning. We introduce CoBo, a scalable and elastic, SGD-type alternating optimization algorithm that efficiently addresses these problem with theoretical convergence guarantees. Empirically, CoBo achieves superior performance, surpassing popular personalization algorithms by 9.3% in accuracy on a task with high heterogeneity, involving datasets distributed among 80 clients.
