Detecting and Corrupting Convolution-based Unlearnable Examples
Minghui Li, Xianlong Wang, Zhifei Yu, Shengshan Hu, Ziqi Zhou, Longling Zhang, Leo Yu Zhang
TL;DR
This paper tackles the vulnerability of deep networks to convolution-based unlearnable examples (UEs) that embed class-wise multiplicative noise via convolution. It introduces COIN, a defense that uses random multiplicative interpolation in the image domain, and EPD, an edge-pixel detector to identify convolution-based UEs; it also expands the UE space with VUDA and HUDA for generalization testing. The authors show COIN outperforms 11 SOTA defenses on CIFAR and ImageNet, while EPD achieves high detection accuracy and AUC, establishing a practical detection-and-defense framework. Overall, the work highlights a new, effective approach to counter convolution-based UEs and broadens the understanding of UE design and defense.
Abstract
Convolution-based unlearnable examples (UEs) employ class-wise multiplicative convolutional noise to training samples, severely compromising model performance. This fire-new type of UEs have successfully countered all defense mechanisms against UEs. The failure of such defenses can be attributed to the absence of norm constraints on convolutional noise, leading to severe blurring of image features. To address this, we first design an Edge Pixel-based Detector (EPD) to identify convolution-based UEs. Upon detection of them, we propose the first defense scheme against convolution-based UEs, COrrupting these samples via random matrix multiplication by employing bilinear INterpolation (COIN) such that disrupting the distribution of class-wise multiplicative noise. To evaluate the generalization of our proposed COIN, we newly design two convolution-based UEs called VUDA and HUDA to expand the scope of convolution-based UEs. Extensive experiments demonstrate the effectiveness of detection scheme EPD and that our defense COIN outperforms 11 state-of-the-art (SOTA) defenses, achieving a significant improvement on the CIFAR and ImageNet datasets.
