VALID: a Validated Algorithm for Learning in Decentralized Networks with Possible Adversarial Presence
Mayank Bakshi, Sara Ghasvarianjahromi, Yauhen Yakimenka, Allison Beemer, Oliver Kosut, Joerg Kliewer
TL;DR
The paper introduces VALID, a validated learning protocol for decentralized networks facing potential Byzantine agents and heterogeneous data. By combining a learning phase of distributed SGD with a rigorous validation phase leveraging validated broadcast and polynomial hashing, VALID either achieves an admissible consensus or flags adversaries, while maintaining an $O(1/T)$ convergence rate and comparable complexity to non-adversarial SGD. A heterogeneity-based optimality test and a formal admissible consensus framework underpin the detection guarantees, enabling fast performance without sacrificing robustness in adversary-free settings. Empirical results on a 20-node network with MNIST data show that VALID outperforms other Byzantine-robust methods when no attacks are present and reliably detects adversaries when attacks occur, highlighting practical impact for secure, scalable decentralized learning.
Abstract
We introduce the paradigm of validated decentralized learning for undirected networks with heterogeneous data and possible adversarial infiltration. We require (a) convergence to a global empirical loss minimizer when adversaries are absent, and (b) either detection of adversarial presence of convergence to an admissible consensus irrespective of the adversarial configuration. To this end, we propose the VALID protocol which, to the best of our knowledge, is the first to achieve a validated learning guarantee. Moreover, VALID offers an O(1/T) convergence rate (under pertinent regularity assumptions), and computational and communication complexities comparable to non-adversarial distributed stochastic gradient descent. Remarkably, VALID retains optimal performance metrics in adversary-free environments, sidestepping the robustness penalties observed in prior byzantine-robust methods. A distinctive aspect of our study is a heterogeneity metric based on the norms of individual agents' gradients computed at the global empirical loss minimizer. This not only provides a natural statistic for detecting significant byzantine disruptions but also allows us to prove the optimality of VALID in wide generality. Lastly, our numerical results reveal that, in the absence of adversaries, VALID converges faster than state-of-the-art byzantine robust algorithms, while when adversaries are present, VALID terminates with each honest either converging to an admissible consensus of declaring adversarial presence in the network.
