Stylized Synthetic Augmentation further improves Corruption Robustness
Georg Siedel, Rojan Regmi, Abhirami Anand, Weijia Shao, Silvia Vock, Andrey Morozov
TL;DR
Conventional robustification relies on rule based augmentations; The authors propose a pipeline that combines synthetic data with neural style transfer to improve corruption robustness. They show that stylized synthetic images, despite higher FID, improve training outcomes and achieve state of the art robustness on CIFAR-10-C, CIFAR-100-C and TinyImageNet-C. The work highlights that FID is not a reliable predictor of augmentation usefulness and discusses practical considerations including hyperparameter tuning and training overhead. Overall, the method demonstrates strong robustness gains across multiple architectures and datasets, with implications for scalable, texture-agnostic learning in the presence of common corruptions.
Abstract
This paper proposes a training data augmentation pipeline that combines synthetic image data with neural style transfer in order to address the vulnerability of deep vision models to common corruptions. We show that although applying style transfer on synthetic images degrades their quality with respect to the common FID metric, these images are surprisingly beneficial for model training. We conduct a systematic empirical analysis of the effects of both augmentations and their key hyperparameters on the performance of image classifiers. Our results demonstrate that stylization and synthetic data complement each other well and can be combined with popular rule-based data augmentation techniques such as TrivialAugment, while not working with others. Our method achieves state-of-the-art corruption robustness on several small-scale image classification benchmarks, reaching 93.54%, 74.9% and 50.86% robust accuracy on CIFAR-10-C, CIFAR-100-C and TinyImageNet-C, respectively
