Resilient Peer-to-peer Learning based on Adaptive Aggregation
Chandreyee Bhowmick, Xenofon Koutsoukos
TL;DR
This work tackles resilient decentralized P2P learning in the presence of Byzantine workers, non-iid data, and non-convex losses. It introduces a loss-based adaptive aggregation that assigns neighbor weights based on the inverse of each neighbor's local risk $r_k(\hat{w}_l^t)$, computed from the neighbor's parameters and private data, with a variant that uses only neighbors with lower risk to enhance robustness. The authors prove convergence guarantees under local strong convexity and Lipschitz gradients, yielding a bounded optimality gap for fixed stepsize $\mu$ and aggregation complexity $O(d|\mathcal{N}_k|)$. Empirical evaluations on HAR, MNIST, and Spambase under several attack models show improved accuracy over baselines, demonstrating robust performance in non-iid, non-convex settings and highlighting the method's practicality for decentralized ML without a central server.
Abstract
Collaborative learning in peer-to-peer networks offers the benefits of distributed learning while mitigating the risks associated with single points of failure inherent in centralized servers. However, adversarial workers pose potential threats by attempting to inject malicious information into the network. Thus, ensuring the resilience of peer-to-peer learning emerges as a pivotal research objective. The challenge is exacerbated in the presence of non-convex loss functions and non-iid data distributions. This paper introduces a resilient aggregation technique tailored for such scenarios, aimed at fostering similarity among peers' learning processes. The aggregation weights are determined through an optimization procedure, and use the loss function computed using the neighbor's models and individual private data, thereby addressing concerns regarding data privacy in distributed machine learning. Theoretical analysis demonstrates convergence of parameters with non-convex loss functions and non-iid data distributions. Empirical evaluations across three distinct machine learning tasks support the claims. The empirical findings, which encompass a range of diverse attack models, also demonstrate improved accuracy when compared to existing methodologies.
