Adversarial Training against Location-Optimized Adversarial Patches

Sukrut Rao; David Stutz; Bernt Schiele

Adversarial Training against Location-Optimized Adversarial Patches

Sukrut Rao, David Stutz, Bernt Schiele

TL;DR

This paper tackles robustness to clearly visible adversarial patches by introducing location-optimized patches and adversarial patch training. It defines image-specific, untargeted patches and develops strategies to optimize patch location, including full and random location optimization. Through extensive experiments on CIFAR-10 and GTSRB, the authors demonstrate that adversarial patch training with location optimization significantly improves robustness without sacrificing clean accuracy and even enhances robustness to universal patches. The findings suggest practical defense benefits for real-world scenarios like autonomous driving and highlight the importance of patch placement in adversarial robustness.

Abstract

Deep neural networks have been shown to be susceptible to adversarial examples -- small, imperceptible changes constructed to cause mis-classification in otherwise highly accurate image classifiers. As a practical alternative, recent work proposed so-called adversarial patches: clearly visible, but adversarially crafted rectangular patches in images. These patches can easily be printed and applied in the physical world. While defenses against imperceptible adversarial examples have been studied extensively, robustness against adversarial patches is poorly understood. In this work, we first devise a practical approach to obtain adversarial patches while actively optimizing their location within the image. Then, we apply adversarial training on these location-optimized adversarial patches and demonstrate significantly improved robustness on CIFAR10 and GTSRB. Additionally, in contrast to adversarial training on imperceptible adversarial examples, our adversarial patch training does not reduce accuracy.

Adversarial Training against Location-Optimized Adversarial Patches

TL;DR

Abstract

Adversarial Training against Location-Optimized Adversarial Patches

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (5)