Texture Re-scalable Universal Adversarial Perturbation
Yihao Huang, Qing Guo, Felix Juefei-Xu, Ming Hu, Xiaojun Jia, Xiaochun Cao, Geguang Pu, Yang Liu
TL;DR
This paper tackles the limitations of fixed texture and scale in universal adversarial perturbations by introducing Texture Scale Constrained UAP (TSC-UAP). TSC-UAP learns a small local texture patch v and tiles it into a full-size perturbation δ = 𝒯(v, α) with a split ratio α, optimizing over v under a norm constraint to maximize an attack loss. Across multiple datasets and models, including CNNs and ViT, TSC-UAP consistently improves fooling ratios and cross-model/data transferability, while maintaining low computational overhead and enabling data-efficient training. The approach demonstrates that category-specific local textures and controlled texture scale are a practical, general enhancement to both data-dependent and data-free UAP methods with broad potential impacts and future theoretical insights.
Abstract
Universal adversarial perturbation (UAP), also known as image-agnostic perturbation, is a fixed perturbation map that can fool the classifier with high probabilities on arbitrary images, making it more practical for attacking deep models in the real world. Previous UAP methods generate a scale-fixed and texture-fixed perturbation map for all images, which ignores the multi-scale objects in images and usually results in a low fooling ratio. Since the widely used convolution neural networks tend to classify objects according to semantic information stored in local textures, it seems a reasonable and intuitive way to improve the UAP from the perspective of utilizing local contents effectively. In this work, we find that the fooling ratios significantly increase when we add a constraint to encourage a small-scale UAP map and repeat it vertically and horizontally to fill the whole image domain. To this end, we propose texture scale-constrained UAP (TSC-UAP), a simple yet effective UAP enhancement method that automatically generates UAPs with category-specific local textures that can fool deep models more easily. Through a low-cost operation that restricts the texture scale, TSC-UAP achieves a considerable improvement in the fooling ratio and attack transferability for both data-dependent and data-free UAP methods. Experiments conducted on two state-of-the-art UAP methods, eight popular CNN models and four classical datasets show the remarkable performance of TSC-UAP.
