Backdoor Attacks and Countermeasures on Deep Learning: A Comprehensive Review

Yansong Gao; Bao Gia Doan; Zhi Zhang; Siqi Ma; Jiliang Zhang; Anmin Fu; Surya Nepal; Hyoungshick Kim

Backdoor Attacks and Countermeasures on Deep Learning: A Comprehensive Review

Yansong Gao, Bao Gia Doan, Zhi Zhang, Siqi Ma, Jiliang Zhang, Anmin Fu, Surya Nepal, Hyoungshick Kim

TL;DR

This comprehensive review maps backdoor threats in deep learning across six attack surfaces—code poisoning, outsourcing, pretrained, data collection, collaborative learning, and post-deployment—and catalogs corresponding countermeasures into blind removal, offline/online inspections, and post-removal strategies. It emphasizes that defenses lag behind evolving attacks and that adaptive adversaries can bypass many existing methods. The analysis highlights the diverse variants of triggers (class-specific, multi-trigger, dynamic, blended, etc.) and extends discussion to the flip side of backdoors, including watermarking and data-deletion verification. The paper calls for practical, domain-general defenses, empirical evaluations of physical triggers, and better alignment with defender capabilities, while acknowledging the challenges of cross-domain applicability and resource constraints.

Abstract

This work provides the community with a timely comprehensive review of backdoor attacks and countermeasures on deep learning. According to the attacker's capability and affected stage of the machine learning pipeline, the attack surfaces are recognized to be wide and then formalized into six categorizations: code poisoning, outsourcing, pretrained, data collection, collaborative learning and post-deployment. Accordingly, attacks under each categorization are combed. The countermeasures are categorized into four general classes: blind backdoor removal, offline backdoor inspection, online backdoor inspection, and post backdoor removal. Accordingly, we review countermeasures, and compare and analyze their advantages and disadvantages. We have also reviewed the flip side of backdoor attacks, which are explored for i) protecting intellectual property of deep learning models, ii) acting as a honeypot to catch adversarial example attacks, and iii) verifying data deletion requested by the data contributor.Overall, the research on defense is far behind the attack, and there is no single defense that can prevent all types of backdoor attacks. In some cases, an attacker can intelligently bypass existing defenses with an adaptive attack. Drawing the insights from the systematic review, we also present key areas for future research on the backdoor, such as empirical security evaluations from physical trigger attacks, and in particular, more efficient and practical countermeasures are solicited.

Backdoor Attacks and Countermeasures on Deep Learning: A Comprehensive Review

TL;DR

Abstract

Backdoor Attacks and Countermeasures on Deep Learning: A Comprehensive Review

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (9)

Theorems & Definitions (2)