Efficient Backdoor Attacks for Deep Neural Networks in Real-world Scenarios
Ziqiang Li, Hong Sun, Pengfei Xia, Heng Li, Beihao Xia, Yi Wu, Bin Li
TL;DR
This work tackles backdoor attacks under realistic data collection conditions where adversaries lack full access to training data, introducing the data-constrained backdoor attack paradigm. It identifies entanglement between benign and poisoning features as a key bottleneck and proposes a CLIP-guided framework with three techniques: CLIP-CFE for clean feature suppression, and CLIP-UAP and CLIP-CFA for poisoning feature augmentation. Through extensive experiments on CIFAR-10/100 and ImageNet-50 across multiple architectures, the authors demonstrate substantial performance gains over baseline attacks, with some settings achieving over 100% improvement in data-constrained scenarios, while preserving Benign Accuracy. The results underscore practical threat potential and offer actionable insights for defense and robust evaluation in multi-source data settings, while also suggesting future work in domain-specific CLIP adaptations and broader applicability.
Abstract
Recent deep neural networks (DNNs) have came to rely on vast amounts of training data, providing an opportunity for malicious attackers to exploit and contaminate the data to carry out backdoor attacks. However, existing backdoor attack methods make unrealistic assumptions, assuming that all training data comes from a single source and that attackers have full access to the training data. In this paper, we introduce a more realistic attack scenario where victims collect data from multiple sources, and attackers cannot access the complete training data. We refer to this scenario as data-constrained backdoor attacks. In such cases, previous attack methods suffer from severe efficiency degradation due to the entanglement between benign and poisoning features during the backdoor injection process. To tackle this problem, we introduce three CLIP-based technologies from two distinct streams: Clean Feature Suppression and Poisoning Feature Augmentation.effective solution for data-constrained backdoor attacks. The results demonstrate remarkable improvements, with some settings achieving over 100% improvement compared to existing attacks in data-constrained scenarios. Code is available at https://github.com/sunh1113/Efficient-backdoor-attacks-for-deep-neural-networks-in-real-world-scenarios
