Distributed Backdoor Attacks on Federated Graph Learning and Certified Defenses

Yuxin Yang; Qiang Li; Jinyuan Jia; Yuan Hong; Binghui Wang

Distributed Backdoor Attacks on Federated Graph Learning and Certified Defenses

Yuxin Yang, Qiang Li, Jinyuan Jia, Yuan Hong, Binghui Wang

TL;DR

The paper addresses backdoor vulnerabilities in Federated Graph Learning by introducing Opt-GDBA, an adaptive, graph-aware distributed backdoor attack that learns per-graph trigger location and shape through an adaptive trigger generator. It also presents a certified defense against such backdoors built on deterministic graph partitioning and a majority-vote ensemble, deriving tight robustness guarantees for both clean and backdoored graphs. Empirical results demonstrate that Opt-GDBA achieves high backdoor accuracy across datasets and trigger configurations, while the certified defense can achieve near-clean performance and zero certified backdoor accuracy in many settings. The work advances FedGL security by providing provable, scalable defense guarantees applicable to arbitrary graph structures and perturbations, with practical implications for safety-critical graph-based applications. Overall, the combination of a sophisticated, graph-aware attack and a rigorous, certified defense offers a comprehensive framework for evaluating and improving FedGL robustness."

Abstract

Federated graph learning (FedGL) is an emerging federated learning (FL) framework that extends FL to learn graph data from diverse sources. FL for non-graph data has shown to be vulnerable to backdoor attacks, which inject a shared backdoor trigger into the training data such that the trained backdoored FL model can predict the testing data containing the trigger as the attacker desires. However, FedGL against backdoor attacks is largely unexplored, and no effective defense exists. In this paper, we aim to address such significant deficiency. First, we propose an effective, stealthy, and persistent backdoor attack on FedGL. Our attack uses a subgraph as the trigger and designs an adaptive trigger generator that can derive the effective trigger location and shape for each graph. Our attack shows that empirical defenses are hard to detect/remove our generated triggers. To mitigate it, we further develop a certified defense for any backdoored FedGL model against the trigger with any shape at any location. Our defense involves carefully dividing a testing graph into multiple subgraphs and designing a majority vote-based ensemble classifier on these subgraphs. We then derive the deterministic certified robustness based on the ensemble classifier and prove its tightness. We extensively evaluate our attack and defense on six graph datasets. Our attack results show our attack can obtain > 90% backdoor accuracy in almost all datasets. Our defense results show, in certain cases, the certified accuracy for clean testing graphs against an arbitrary trigger with size 20 can be close to the normal accuracy under no attack, while there is a moderate gap in other cases. Moreover, the certified backdoor accuracy is always 0 for backdoored testing graphs generated by our attack, implying our defense can fully mitigate the attack. Source code is available at: https://github.com/Yuxin104/Opt-GDBA.

Distributed Backdoor Attacks on Federated Graph Learning and Certified Defenses

TL;DR

Abstract

Paper Structure (34 sections, 3 theorems, 16 equations, 13 figures, 14 tables, 3 algorithms)

This paper contains 34 sections, 3 theorems, 16 equations, 13 figures, 14 tables, 3 algorithms.

Introduction
Background and Problem Definition
Federated Graph Learning (FedGL)
Backdoor Attacks on FedGL
Threat Model
Optimized DBAs on FedGL
Adaptive Trigger Generator
FedGL Training with Optimized Backdoor
Attack Results
Experimental Setup
Experimental Results
Main results of the compared attacks
Impact of hyperparameters on our Opt-GDBA
Persistence and stealthiness of the triggers
Ablation study
...and 19 more sections

Key Result

Theorem 1

Given a backdoored graph classifier $f_B$ and our ensemble graph classifier $g_B$. Given a clean testing graph $G$ with a label $y$ and its $T$ subgraphs $\{G^t\}_{t=1}^T$ produced by our graph division strategy. Suppose $T_y$ and $T_z$ are the largest and second largest frequency outputted by $f_B$

Figures (13)

Figure 1: Comparing the triggers of the backdoor attacks on FedGL: (b) Rand-GCBA, (c) Rand-GDBA, and (d) our Opt-GDBA. Opt-GDBA strategically selects critical nodes and their connected edges in individual graphs, resulting in more effective local triggers and the combined global trigger.
Figure 2: Pipeline of our proposed Opt-GDBA on FedGL (a client $i$ perspective).
Figure 3: Examples of original clean graphs on the six datasets and their corresponding backdoored ones by our Opt-GDBA.
Figure 4: MA/BA vs. $\rho$ on all compared attacks in all datasets.
Figure 5: MA/BA vs. $n_{tri}$ ($n^*_{tri}=5$) on all compared attacks in all datasets.
...and 8 more figures

Theorems & Definitions (3)

Theorem 1: Certified perturbation size w.r.t. clean graph
Theorem 2: Tightness of $m^*$
Theorem 3: Certified (non-)backdoored graph

Distributed Backdoor Attacks on Federated Graph Learning and Certified Defenses

TL;DR

Abstract

Distributed Backdoor Attacks on Federated Graph Learning and Certified Defenses

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (13)

Theorems & Definitions (3)