Exploring the Robustness and Transferability of Patch-Based Adversarial Attacks in Quantized Neural Networks
Amira Guesmi, Bassem Ouni, Muhammad Shafique
TL;DR
This work demonstrates that patch-based adversarial attacks remain highly effective and transferable in quantized neural networks across multiple architectures and bitwidths, undermining the notion that quantization alone can provide robust defense. By systematically varying quantization levels, patch size, and model architecture, the authors show that patches preserve high-visibility signals and gradient structure, enabling strong cross-model transfer and resilience to spatial perturbations. The study introduces Quantization-Aware Defense Training (QADT) and outlines practical defense strategies to counter patch-based threats in low-precision regimes. The findings have significant implications for deploying secure, quantized models in resource-constrained environments and guide future defense development for real-world applications.
Abstract
Quantized neural networks (QNNs) are increasingly used for efficient deployment of deep learning models on resource-constrained platforms, such as mobile devices and edge computing systems. While quantization reduces model size and computational demands, its impact on adversarial robustness-especially against patch-based attacks-remains inadequately addressed. Patch-based attacks, characterized by localized, high-visibility perturbations, pose significant security risks due to their transferability and resilience. In this study, we systematically evaluate the vulnerability of QNNs to patch-based adversarial attacks across various quantization levels and architectures, focusing on factors that contribute to the robustness of these attacks. Through experiments analyzing feature representations, quantization strength, gradient alignment, and spatial sensitivity, we find that patch attacks consistently achieve high success rates across bitwidths and architectures, demonstrating significant transferability even in heavily quantized models. Contrary to the expectation that quantization might enhance adversarial defenses, our results show that QNNs remain highly susceptible to patch attacks due to the persistence of distinct, localized features within quantized representations. These findings underscore the need for quantization-aware defenses that address the specific challenges posed by patch-based attacks. Our work contributes to a deeper understanding of adversarial robustness in QNNs and aims to guide future research in developing secure, quantization-compatible defenses for real-world applications.
