Investigating the Corruption Robustness of Image Classifiers with Random Lp-norm Corruptions
Georg Siedel, Weijia Shao, Silvia Vock, Andrey Morozov
TL;DR
This work addresses the brittleness of image classifiers to real-world random corruptions by formalizing robustness under $L_p$ distances and introducing a scalable sampling method for random $p$-norm perturbations. It proposes the imperceptible Corruption Error ($ ext{iCE}$) and mean Corruption Error for $p$-norms ($ ext{mCE}_{L_p}$) as metrics, and demonstrates that training with combinations of $p$-norm corruptions substantially enhances corruption robustness beyond state-of-the-art augmentations. The study finds that robustness transfers across non-$L_0$ norms and to some real-world corruptions, with lower $p$ values generally yielding stronger benefits, while $L_0$ remains a special case. Practically, the results offer guidance for designing data augmentation pipelines that improve safety and reliability of vision systems in the presence of imperceptible and real-world distortions.
Abstract
Robustness is a fundamental property of machine learning classifiers required to achieve safety and reliability. In the field of adversarial robustness of image classifiers, robustness is commonly defined as the stability of a model to all input changes within a p-norm distance. However, in the field of random corruption robustness, variations observed in the real world are used, while p-norm corruptions are rarely considered. This study investigates the use of random p-norm corruptions to augment the training and test data of image classifiers. We evaluate the model robustness against imperceptible random p-norm corruptions and propose a novel robustness metric. We empirically investigate whether robustness transfers across different p-norms and derive conclusions on which p-norm corruptions a model should be trained and evaluated. We find that training data augmentation with a combination of p-norm corruptions significantly improves corruption robustness, even on top of state-of-the-art data augmentation schemes.
