BB-Patch: BlackBox Adversarial Patch-Attack using Zeroth-Order Optimization

Satyadwyoom Kumar; Saurabh Gupta; Arun Balaji Buduru

BB-Patch: BlackBox Adversarial Patch-Attack using Zeroth-Order Optimization

Satyadwyoom Kumar, Saurabh Gupta, Arun Balaji Buduru

TL;DR

The paper tackles black-box adversarial patches that can be printed and placed anywhere on an image. It introduces BB-Patch, a patch crafted without access to model gradients by optimizing a patch parameter $z$ with a zeroth-order adaptive momentum method under an Expectation Over Transformations (EOT) objective. It shows that BB-Patch is scalable to MNIST, CIFAR-10, and ImageNet and transferable across architectures such as ResNet50, VGG16, and MobileNet, with patch-trained-on-one-model still reducing accuracy on others. A real-world demonstration on a distracted driving classifier shows the patch can shift predictions from “unsafe” to “safe” in practical settings. The findings imply that true black-box patches with printable constraints pose a realistic and significant threat to deployed vision systems.

Abstract

Deep Learning has become popular due to its vast applications in almost all domains. However, models trained using deep learning are prone to failure for adversarial samples and carry a considerable risk in sensitive applications. Most of these adversarial attack strategies assume that the adversary has access to the training data, the model parameters, and the input during deployment, hence, focus on perturbing the pixel level information present in the input image. Adversarial Patches were introduced to the community which helped in bringing out the vulnerability of deep learning models in a much more pragmatic manner but here the attacker has a white-box access to the model parameters. Recently, there has been an attempt to develop these adversarial attacks using black-box techniques. However, certain assumptions such as availability large training data is not valid for a real-life scenarios. In a real-life scenario, the attacker can only assume the type of model architecture used from a select list of state-of-the-art architectures while having access to only a subset of input dataset. Hence, we propose an black-box adversarial attack strategy that produces adversarial patches which can be applied anywhere in the input image to perform an adversarial attack.

BB-Patch: BlackBox Adversarial Patch-Attack using Zeroth-Order Optimization

TL;DR

with a zeroth-order adaptive momentum method under an Expectation Over Transformations (EOT) objective. It shows that BB-Patch is scalable to MNIST, CIFAR-10, and ImageNet and transferable across architectures such as ResNet50, VGG16, and MobileNet, with patch-trained-on-one-model still reducing accuracy on others. A real-world demonstration on a distracted driving classifier shows the patch can shift predictions from “unsafe” to “safe” in practical settings. The findings imply that true black-box patches with printable constraints pose a realistic and significant threat to deployed vision systems.

Abstract

Paper Structure (13 sections, 2 equations, 2 figures, 5 tables, 1 algorithm)

This paper contains 13 sections, 2 equations, 2 figures, 5 tables, 1 algorithm.

Introduction
Related Work
Methodology
Universal Patch
BB-Patch
Experiments and Results
Dataset Description
Image classification models used for comparison
Patch applicability and scalability
Transferability
Real Life: Distracted Driving
Comparing Universal Patch and BB-Patch
Conclusion

Figures (2)

Figure 1: BB-Patch Optimization Workflow: we randomly initialize the patch and pass it into the BB-patch optimizer that uses the expectation over transformation loss function and zeroth order optimisation technique modified for adversarial patches, we train the patch until the model is able to classify the input image incorrectly.
Figure 2: An example where Universal Patch Brown2017AdversarialPatch and BB-Patch are applied to a distracted driver image classified as 'Safe Driving'.

BB-Patch: BlackBox Adversarial Patch-Attack using Zeroth-Order Optimization

TL;DR

Abstract

BB-Patch: BlackBox Adversarial Patch-Attack using Zeroth-Order Optimization

Authors

TL;DR

Abstract

Table of Contents

Figures (2)