Verification of Bit-Flip Attacks against Quantized Neural Networks
Yedi Zhang, Lei Huang, Pengfei Gao, Fu Song, Jun Sun, Jin Song Dong
TL;DR
BFAVerifier tackles the problem of verifying the absence of bit-flip attacks on quantized neural networks by proving $\mathcal{N} \models^\rho_{\mathfrak{m},\mathfrak{n}} \langle \phi,\psi \rangle$ for all admissible attack vectors $\rho$. It introduces SymPoly, an abstract domain for networks with symbolic parameters, combined with a MILP-based solver to achieve sound and complete verification across architectures, quantization schemes, and attacker capabilities. The framework demonstrates substantial efficiency gains over naive approaches (up to 30x) on benchmarks such as MNIST and ACAS Xu, while providing formal guarantees and identifying vulnerable parameters to prioritize protection. Overall, BFAVerifier offers a practical path toward robust, BF A-aware deployment of quantized neural networks in security-sensitive applications.
Abstract
In the rapidly evolving landscape of neural network security, the resilience of neural networks against bit-flip attacks (i.e., an attacker maliciously flips an extremely small amount of bits within its parameter storage memory system to induce harmful behavior), has emerged as a relevant area of research. Existing studies suggest that quantization may serve as a viable defense against such attacks. Recognizing the documented susceptibility of real-valued neural networks to such attacks and the comparative robustness of quantized neural networks (QNNs), in this work, we introduce BFAVerifier, the first verification framework designed to formally verify the absence of bit-flip attacks or to identify all vulnerable parameters in a sound and rigorous manner. BFAVerifier comprises two integral components: an abstraction-based method and an MILP-based method. Specifically, we first conduct a reachability analysis with respect to symbolic parameters that represent the potential bit-flip attacks, based on a novel abstract domain with a sound guarantee. If the reachability analysis fails to prove the resilience of such attacks, then we encode this verification problem into an equivalent MILP problem which can be solved by off-the-shelf solvers. Therefore, BFAVerifier is sound, complete, and reasonably efficient. We conduct extensive experiments, which demonstrate its effectiveness and efficiency across various network architectures, quantization bit-widths, and adversary capabilities.
