Graph Agent Network: Empowering Nodes with Inference Capabilities for Adversarial Resilience
Ao Liu, Wenshan Li, Tao Li, Beibei Li, Guangquan Xu, Pan Zhou, Wengang Ma, Hanyuan Huang
TL;DR
The paper addresses vulnerabilities of graph neural networks to adversarial edge perturbations arising from global end-to-end training. It introduces Graph Agent Network (GAgN), a decentralized framework where each node is a 1-hop-view agent that learns embeddings, degree distributions, and neighbor relationships through restricted local communication to filter adversarial edges during classification. The authors prove that a single hidden layer MLP suffices for these tasks and demonstrate, through extensive experiments on real-world datasets, that GAgN achieves state-of-the-art accuracy under both primary and secondary edge-perturbation attacks. The approach offers a scalable, robust alternative to global-defense strategies by constraining information propagation and enabling distributed adversarial edge detection and filtering, with practical implications for secure graph-based learning in adversarial settings.
Abstract
End-to-end training with global optimization have popularized graph neural networks (GNNs) for node classification, yet inadvertently introduced vulnerabilities to adversarial edge-perturbing attacks. Adversaries can exploit the inherent opened interfaces of GNNs' input and output, perturbing critical edges and thus manipulating the classification results. Current defenses, due to their persistent utilization of global-optimization-based end-to-end training schemes, inherently encapsulate the vulnerabilities of GNNs. This is specifically evidenced in their inability to defend against targeted secondary attacks. In this paper, we propose the Graph Agent Network (GAgN) to address the aforementioned vulnerabilities of GNNs. GAgN is a graph-structured agent network in which each node is designed as an 1-hop-view agent. Through the decentralized interactions between agents, they can learn to infer global perceptions to perform tasks including inferring embeddings, degrees and neighbor relationships for given nodes. This empowers nodes to filtering adversarial edges while carrying out classification tasks. Furthermore, agents' limited view prevents malicious messages from propagating globally in GAgN, thereby resisting global-optimization-based secondary attacks. We prove that single-hidden-layer multilayer perceptrons (MLPs) are theoretically sufficient to achieve these functionalities. Experimental results show that GAgN effectively implements all its intended capabilities and, compared to state-of-the-art defenses, achieves optimal classification accuracy on the perturbed datasets.
