DPatch: An Adversarial Patch Attack on Object Detectors
Xin Liu, Huanrui Yang, Ziwei Liu, Linghao Song, Hai Li, Yiran Chen
TL;DR
This work addresses the vulnerability of modern object detectors to physical-world adversarial patches by introducing DPatch, a small 40x40 patch trained to disrupt both region proposals and classification in Faster R-CNN and YOLO under black-box conditions. It formalizes untargeted and targeted objectives with dedicated training goals and random shifts to enforce location invariance, and demonstrates that the patch can drastically reduce mean average precision (mAP) on multiple detectors and datasets. The results show strong cross-detector and cross-dataset transferability, patch-size effects, and ROI concentration around the patch, underscoring a fundamental security risk in contemporary detection systems. The findings motivate the development of robust defenses for detectors used in safety-critical applications such as surveillance and autonomous driving.
Abstract
Object detectors have emerged as an indispensable module in modern computer vision systems. In this work, we propose DPatch -- a black-box adversarial-patch-based attack towards mainstream object detectors (i.e. Faster R-CNN and YOLO). Unlike the original adversarial patch that only manipulates image-level classifier, our DPatch simultaneously attacks the bounding box regression and object classification so as to disable their predictions. Compared to prior works, DPatch has several appealing properties: (1) DPatch can perform both untargeted and targeted effective attacks, degrading the mAP of Faster R-CNN and YOLO from 75.10% and 65.7% down to below 1%, respectively. (2) DPatch is small in size and its attacking effect is location-independent, making it very practical to implement real-world attacks. (3) DPatch demonstrates great transferability among different detectors as well as training datasets. For example, DPatch that is trained on Faster R-CNN can effectively attack YOLO, and vice versa. Extensive evaluations imply that DPatch can perform effective attacks under black-box setup, i.e., even without the knowledge of the attacked network's architectures and parameters. Successful realization of DPatch also illustrates the intrinsic vulnerability of the modern detector architectures to such patch-based adversarial attacks.
