Steal Now and Attack Later: Evaluating Robustness of Object Detection against Black-box Adversarial Attacks
Erh-Chung Chen, Pin-Yu Chen, I-Hsin Chung, Che-Rung Lee
TL;DR
Steal Now and Attack Later addresses the vulnerability of object detectors to latency-based adversarial attacks in a black-box setting by forcing the model to predict numerous ghost objects, thereby increasing inference time within an $L_ extinf$ perturbation budget. The method builds a data-driven, patch-based attack pipeline—data collection from open-world datasets, position-centric object insertion, and color-space perturbation projected onto the $L_ extinf$ ball with the perturbation bounded by $\epsilon d$—to craft adversarial examples without model access. Empirical results show successful attacks across diverse detectors (e.g., Faster R-CNN, RetinaNet, FCOS, YOLOv8, DERT) and public vision APIs (GCP, Azure) with ASR rising with larger perturbations and modest data collection costs (roughly $3 on local GPU and under $1 per API query). The work highlights practical security implications and discusses defenses such as multi-dimension inference, context consistency checks, and image-quality screening, while advocating private locally deployed models as an economical mitigation.
Abstract
Latency attacks against object detection represent a variant of adversarial attacks that aim to inflate the inference time by generating additional ghost objects in a target image. However, generating ghost objects in the black-box scenario remains a challenge since information about these unqualified objects remains opaque. In this study, we demonstrate the feasibility of generating ghost objects in adversarial examples by extending the concept of "steal now, decrypt later" attacks. These adversarial examples, once produced, can be employed to exploit potential vulnerabilities in the AI service, giving rise to significant security concerns. The experimental results demonstrate that the proposed attack achieves successful attacks across various commonly used models and Google Vision API without any prior knowledge about the target model. Additionally, the average cost of each attack is less than \$ 1 dollars, posing a significant threat to AI security.
