Network transferability of adversarial patches in real-time object detection
Jens Bayer, Stefan Becker, David Münch, Michael Arens
TL;DR
This work addresses the problem of how adversarial patches designed for real-time object detectors transfer across architectures and datasets. It conducts an extensive empirical study by training 280 patches on INRIA and evaluating them on 28 detectors pretrained on COCO, using a patch-optimization procedure that follows prior methods while tailoring loss components to different architectures. The key findings show that patches optimized with larger models tend to transfer more effectively across networks, with YOLOv9 and YOLOv10 delivering the strongest cross-model impact, while YOLO-NAS and RT-DETR exhibit greater robustness. The results highlight cross-architecture vulnerabilities in modern real-time detectors and suggest directions for defense, including further investigation into grayscale patch effects tied to training-time padding practices.
Abstract
Adversarial patches in computer vision can be used, to fool deep neural networks and manipulate their decision-making process. One of the most prominent examples of adversarial patches are evasion attacks for object detectors. By covering parts of objects of interest, these patches suppress the detections and thus make the target object 'invisible' to the object detector. Since these patches are usually optimized on a specific network with a specific train dataset, the transferability across multiple networks and datasets is not given. This paper addresses these issues and investigates the transferability across numerous object detector architectures. Our extensive evaluation across various models on two distinct datasets indicates that patches optimized with larger models provide better network transferability than patches that are optimized with smaller models.
