Robust and Efficient Collaborative Learning

Abdellah El Mrini; Sadegh Farhadkhan; Rachid Guerraoui

TL;DR

This work tackles robust decentralized collaborative learning in the presence of Byzantine adversaries without relying on a central server. It introduces Robust Pull-based Epidemic Learning (RPEL), a pull-based epidemic communication scheme in which each node pulls model updates from a small random subset of peers, so that total communication scales as $\mathcal{O}(n \log n)$. The authors define the effective adversarial fraction and provide convergence guarantees for non-convex objectives under mild assumptions, using robust aggregation and variance reduction, with robustness against omniscient attacks holding with high probability. Empirically, RPEL achieves competitive accuracy on MNIST and CIFAR-10 with up to 20% adversaries while significantly reducing communication compared to all-to-all robust methods, and it scales effectively to large networks. The work thus offers a practical, scalable, serverless approach to robust distributed learning with both theoretical and empirical support.
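To make the communication claim concrete, the following minimal sketch compares per-round message counts for all-to-all exchange and for pulling from a small sample of peers. The function names, the choice $s = \lceil c \log_2 n \rceil$, and the convention of counting one model transfer per pull are assumptions made for illustration; the summary above does not state the exact sample size RPEL uses.

```python
import math


def all_to_all_messages(n: int) -> int:
    """Model transfers per round when every node sends to every other node."""
    return n * (n - 1)


def pull_messages(n: int, s: int) -> int:
    """Model transfers per round when every node pulls from s peers.

    Assumption: each pull is counted as one model transfer (the reply).
    """
    return n * s


def log_sample_size(n: int, c: float = 1.0) -> int:
    """Hypothetical sample size s = ceil(c * log2(n)), capped at n - 1 peers."""
    return min(n - 1, max(1, math.ceil(c * math.log2(n))))


if __name__ == "__main__":
    for n in (16, 128, 1024):
        s = log_sample_size(n)
        print(f"n={n} s={s} pull={pull_messages(n, s)} "
              f"all_to_all={all_to_all_messages(n)}")
```

Under these assumptions the pull-based count grows as $n \log n$, while the all-to-all count grows as $n^2$, so the gap widens as the network grows.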

Abstract

Collaborative machine learning is challenged by training-time adversarial behaviors. Existing approaches to tolerate such behaviors either rely on a central server or induce high communication costs. We propose Robust Pull-based Epidemic Learning (RPEL), a novel, scalable collaborative approach to ensure robust learning despite adversaries. RPEL does not rely on any central server and, unlike traditional methods, where communication costs grow as $\mathcal{O}(n^2)$ with the number of nodes $n$, RPEL employs a pull-based epidemic communication strategy that scales as $\mathcal{O}(n \log n)$. By pulling model parameters from small random subsets of nodes, RPEL significantly lowers the number of required messages without compromising convergence guarantees, which hold with high probability. Empirical results demonstrate that RPEL maintains robustness in adversarial settings, achieves accuracy competitive with all-to-all communication, and scales efficiently across large networks.
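The pull-then-aggregate idea described in the abstract can be sketched as follows. This is an illustrative sketch only, not the authors' algorithm: the coordinate-wise trimmed mean is an assumed stand-in for whichever robust aggregation rule RPEL uses, the names `trimmed_mean` and `pull_round` are hypothetical, and local gradient steps and variance reduction are omitted.

```python
import random
from typing import List, Sequence

Vector = List[float]


def trimmed_mean(vectors: Sequence[Vector], trim: int) -> Vector:
    """Coordinate-wise trimmed mean (assumed aggregator, for illustration).

    Per coordinate, drops the `trim` smallest and `trim` largest values
    and averages the rest.
    """
    if 2 * trim >= len(vectors):
        raise ValueError("trim too large for the number of vectors")
    dim = len(vectors[0])
    out = []
    for j in range(dim):
        column = sorted(v[j] for v in vectors)
        kept = column[trim:len(column) - trim]
        out.append(sum(kept) / len(kept))
    return out


def pull_round(models: Sequence[Vector], s: int, trim: int,
               rng: random.Random) -> List[Vector]:
    """One communication round: every node pulls s random peers' models
    and robustly aggregates them together with its own model."""
    n = len(models)
    new_models = []
    for i in range(n):
        peers = rng.sample([j for j in range(n) if j != i], s)
        received = [models[i]] + [models[j] for j in peers]
        new_models.append(trimmed_mean(received, trim))
    return new_models


if __name__ == "__main__":
    rng = random.Random(0)
    honest = [[1.0 + 0.01 * i, -1.0] for i in range(8)]
    adversarial = [[1000.0, 1000.0], [1000.0, 1000.0]]  # arbitrary bad models
    updated = pull_round(honest + adversarial, s=5, trim=2, rng=rng)
    print(updated[0])  # model of the first honest node after one round
```

In this toy run each honest node receives at most two adversarial models among its six inputs, so trimming two values per side removes them in every coordinate. With random sampling such a bound on the sampled adversaries can only hold with some probability, which matches the high-probability form of the guarantees stated above.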
