Decentralized Adversarial Training over Graphs
Ying Cao, Elsa Rizk, Stefan Vlaski, Ali H. Sayed
TL;DR
Decentralized Adversarial Training over Graphs tackles robustness of multi-agent learning under worst-case perturbations by formulating $J(w)=\sum_k \pi_k J_k(w)$ with $J_k(w)=\mathbb{E}_{x_k,y_k}\{\max_{\|\delta\|_{p_k}\le \epsilon_k} Q_k(w;x_k+\delta,y_k)\}$. It introduces two fully decentralized schemes based on diffusion (ATC) and consensus, analyzes convergence in strongly-convex, convex, and non-convex settings using affine-Lipschitz gradient properties and the Moreau envelope, and demonstrates empirically that graph topology enhances robustness relative to centralized and non-cooperative baselines. Key results show linear convergence to a small neighborhood in strongly-convex cases, sublinear and $O(1/(\mu N))$ behavior in convex settings, and provable near-stationarity in non-convex regimes with an $O(1/(\mu N))$ rate and $O(\epsilon^2)$ error terms. Across simulations on MNIST and CIFAR-10, diffusion and consensus strategies consistently improve robustness against a range of attacks, with heterogeneous perturbation models further illustrating the benefits of distributed, graph-structured collaboration.
Abstract
The vulnerability of machine learning models to adversarial attacks has been attracting considerable attention in recent years. Most existing studies focus on the behavior of stand-alone single-agent learners. In comparison, this work studies adversarial training over graphs, where individual agents are subjected to perturbations of varied strength levels across space. It is expected that interactions by linked agents, and the heterogeneity of the attack models that are possible over the graph, can help enhance robustness in view of the coordination power of the group. Using a min-max formulation of distributed learning, we develop a decentralized adversarial training framework for multi-agent systems. Specifically, we devise two decentralized adversarial training algorithms by relying on two popular decentralized learning strategies--diffusion and consensus. We analyze the convergence properties of the proposed framework for strongly-convex, convex, and non-convex environments, and illustrate the enhanced robustness to adversarial attacks.
