Cross-Context Backdoor Attacks against Graph Prompt Learning
Xiaoting Lyu, Yufei Han, Wei Wang, Hangwei Qian, Ivor Tsang, Xiangliang Zhang
TL;DR
This work reveals a persistent vulnerability in Graph Prompt Learning: backdoors embedded in pretrained GNN encoders can transfer to downstream tasks through graph prompts. The proposed CrossBA framework jointly optimizes a small trigger graph and the backdoored encoder during pretraining, using a feature-collision objective and alignment losses to maintain clean-task utility while steering certain inputs to attacker-chosen embeddings. Theoretical analysis shows why GPL’s prompt mechanism facilitates backdoor transfer, and extensive experiments across five cross-context scenarios and multiple GPL methods demonstrate high attack success rates (ASR > 0.85) with minimal accuracy loss, outperforming baselines like GCBA. The findings raise trust and security concerns for GPL deployments and point to the need for defense strategies tailored to cross-context graph learning, including beyond existing GPL defenses.
Abstract
Graph Prompt Learning (GPL) bridges significant disparities between pretraining and downstream applications to alleviate the knowledge transfer bottleneck in real-world graph learning. While GPL offers superior effectiveness in graph knowledge transfer and computational efficiency, the security risks posed by backdoor poisoning effects embedded in pretrained models remain largely unexplored. Our study provides a comprehensive analysis of GPL's vulnerability to backdoor attacks. We introduce \textit{CrossBA}, the first cross-context backdoor attack against GPL, which manipulates only the pretraining phase without requiring knowledge of downstream applications. Our investigation reveals both theoretically and empirically that tuning trigger graphs, combined with prompt transformations, can seamlessly transfer the backdoor threat from pretrained encoders to downstream applications. Through extensive experiments involving 3 representative GPL methods across 5 distinct cross-context scenarios and 5 benchmark datasets of node and graph classification tasks, we demonstrate that \textit{CrossBA} consistently achieves high attack success rates while preserving the functionality of downstream applications over clean input. We also explore potential countermeasures against \textit{CrossBA} and conclude that current defenses are insufficient to mitigate \textit{CrossBA}. Our study highlights the persistent backdoor threats to GPL systems, raising trustworthiness concerns in the practices of GPL techniques.
