Certifying Global Robustness for Deep Neural Networks
You Li, Guannan Zhao, Shuyu Kong, Yunqi He, Hai Zhou
TL;DR
This work tackles the challenge of certifying global robustness for deep neural networks by shifting from local neighborhood verification to probabilistic modeling of meaningful input regions. It combines PAC robustness verification with probabilistic programs to define a global input distribution and introduces the cumulative robustness function as a comprehensive robustness metric. The ACE framework couples local robustness estimation with adaptive multi-level splitting and regression to efficiently and accurately quantify global robustness, even for rare violations, while enabling mining of diverse counterexamples for adversarial training. The results demonstrate that ACE yields faster, more accurate global robustness estimates and practical counterexamples, offering scalable, rigorous guarantees with meaningful real-world input distributions.
Abstract
A globally robust deep neural network resists perturbations on all meaningful inputs. Current robustness certification methods emphasize local robustness, struggling to scale and generalize. This paper presents a systematic and efficient method to evaluate and verify global robustness for deep neural networks, leveraging the PAC verification framework for solid guarantees on verification results. We utilize probabilistic programs to characterize meaningful input regions, setting a realistic standard for global robustness. Additionally, we introduce the cumulative robustness curve as a criterion in evaluating global robustness. We design a statistical method that combines multi-level splitting and regression analysis for the estimation, significantly reducing the execution time. Experimental results demonstrate the efficiency and effectiveness of our verification method and its capability to find rare and diversified counterexamples for adversarial training.
