Mitigating Adversarial Effects Through Randomization

Cihang Xie; Jianyu Wang; Zhishuai Zhang; Zhou Ren; Alan Yuille

Mitigating Adversarial Effects Through Randomization

Cihang Xie, Jianyu Wang, Zhishuai Zhang, Zhou Ren, Alan Yuille

TL;DR

CNNs are highly vulnerable to adversarial examples, motivating a defense based on inference-time randomization. The authors introduce random resizing and random padding as light, training-free layers that generate many input patterns, reducing perturbation transfer and boosting robustness—especially against iterative attacks—when combined with adversarial training. Across ImageNet-scale experiments and the NIPS 2017 defense challenge, the approach markedly improves resilience and achieves state-of-the-art-performance when paired with ensemble adversarial training, while preserving clean-image accuracy. The method is simple, fast, and compatible with various architectures, making it a practical module for adversarial defense with publicly available code.

Abstract

Convolutional neural networks have demonstrated high accuracy on various tasks in recent years. However, they are extremely vulnerable to adversarial examples. For example, imperceptible perturbations added to clean images can cause convolutional neural networks to fail. In this paper, we propose to utilize randomization at inference time to mitigate adversarial effects. Specifically, we use two randomization operations: random resizing, which resizes the input images to a random size, and random padding, which pads zeros around the input images in a random manner. Extensive experiments demonstrate that the proposed randomization method is very effective at defending against both single-step and iterative attacks. Our method provides the following advantages: 1) no additional training or fine-tuning, 2) very few additional computations, 3) compatible with other adversarial defense methods. By combining the proposed randomization method with an adversarially trained model, it achieves a normalized score of 0.924 (ranked No.2 among 107 defense teams) in the NIPS 2017 adversarial examples defense challenge, which is far better than using adversarial training alone with a normalized score of 0.773 (ranked No.56). The code is public available at https://github.com/cihangxie/NIPS2017_adv_challenge_defense.

Mitigating Adversarial Effects Through Randomization

TL;DR

Abstract

Mitigating Adversarial Effects Through Randomization

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (5)