ARQ: A Mixed-Precision Quantization Framework for Accurate and Certifiably Robust DNNs

Yuchen Yang; Shubham Ugare; Yifan Zhao; Gagandeep Singh; Sasa Misailovic

ARQ: A Mixed-Precision Quantization Framework for Accurate and Certifiably Robust DNNs

Yuchen Yang, Shubham Ugare, Yifan Zhao, Gagandeep Singh, Sasa Misailovic

TL;DR

ARQ introduces a novel mixed-precision quantization framework that directly optimizes for certifiable robustness by integrating randomized smoothing into a reinforcement-learning search. It uses a DDPG-based policy to assign per-layer bit-widths under a resource constraint, with robustness quantified by the Average Certified Radius (ACR) and accelerated via Incremental Randomized Smoothing. Empirically, ARQ outperforms state-of-the-art MPQ baselines on CIFAR-10 and ImageNet, often matching or surpassing the original FP32 models in both clean accuracy and certified robustness while reducing compute by up to a few percent of the original operations. This approach enables scalable, certifiably robust quantization for large vision models, offering practical benefits for deployment on resource-constrained devices.

Abstract

Mixed precision quantization has become an important technique for optimizing the execution of deep neural networks (DNNs). Certified robustness, which provides provable guarantees about a model's ability to withstand different adversarial perturbations, has rarely been addressed in quantization due to unacceptably high cost of certifying robustness. This paper introduces ARQ, an innovative mixed-precision quantization method that not only preserves the clean accuracy of the smoothed classifiers but also maintains their certified robustness. ARQ uses reinforcement learning to find accurate and robust DNN quantization, while efficiently leveraging randomized smoothing, a popular class of statistical DNN verification algorithms. ARQ consistently performs better than multiple state-of-the-art quantization techniques across all the benchmarks and the input perturbation levels. The performance of ARQ quantized networks reaches that of the original DNN with floating-point weights, but with only 1.5% instructions and the highest certified radius. ARQ code is available at https://anonymous.4open.science/r/ARQ-FE4B.

ARQ: A Mixed-Precision Quantization Framework for Accurate and Certifiably Robust DNNs

TL;DR

Abstract

ARQ: A Mixed-Precision Quantization Framework for Accurate and Certifiably Robust DNNs

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (6)

Theorems & Definitions (1)