PatchCURE: Improving Certifiable Robustness, Model Utility, and Computation Efficiency of Adversarial Patch Defenses
Chong Xiang, Tong Wu, Sihui Dai, Jonathan Petit, Suman Jana, Prateek Mittal
TL;DR
PatchCURE tackles the three-way trade-off among certifiable robustness, model utility, and computation efficiency in adversarial patch defenses. It unifies small-receptive-field (SRF) and large-receptive-field (LRF) approaches into a three-module framework consisting of an SRF sub-model, an LRF sub-model, and a secure operation, with a tunable split point $k$ that controls where a given instance sits on the trade-off. Certifiable guarantees come from PCURE-Certify, which maps image-space attacks to feature-space threat models and applies a robust prediction procedure. Efficient instantiations achieve throughput near that of undefended models while delivering state-of-the-art clean and robust accuracy across efficiency levels. PatchCURE is demonstrated on ImageNet-1k with ViT-SRF and BagNet SRF options, and it can be adapted to practical constraints and to future advances in SRF and LRF techniques. Overall, PatchCURE is a versatile framework for certifiably robust defenses that can interpolate between efficiency and robustness as needed.
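To make the role of the split point $k$ and the image-to-feature-space mapping more concrete, the following is a minimal sketch under stated assumptions, not the authors' implementation. The helper names `split_at` and `corrupted_feature_window`, the 12-block backbone, and the example numbers are all hypothetical. The window bound uses standard receptive-field arithmetic (a patch of `patch_px` pixels, seen through features with receptive field `rf_px` and stride `stride_px`, can touch at most `ceil((patch_px + rf_px - 1) / stride_px)` features along one axis); it is offered as an illustration of how a feature-space threat model could be sized, not as the procedure used by PCURE-Certify.

```python
import math


def split_at(layers, k):
    """Partition a layer list at split point k.

    Assumption: the first k layers play the role of the SRF sub-model and
    the remaining layers play the role of the LRF sub-model.
    """
    if not 0 <= k <= len(layers):
        raise ValueError("k must lie in [0, len(layers)]")
    return layers[:k], layers[k:]


def corrupted_feature_window(patch_px, rf_px, stride_px):
    """Upper bound on the number of features along one axis whose
    receptive field can overlap a patch of width patch_px."""
    return math.ceil((patch_px + rf_px - 1) / stride_px)


# Hypothetical 12-block backbone, split after the fourth block.
layers = [f"block{i}" for i in range(12)]
srf_layers, lrf_layers = split_at(layers, 4)

# Hypothetical numbers: 32-pixel patch, receptive field 33, stride 8.
window = corrupted_feature_window(32, 33, 8)  # -> 8
```

In this toy setting, a larger $k$ moves more layers into the SRF part; if the receptive field grows with depth, the bound above grows too, which is one way to see why the split point trades robustness against utility and efficiency.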
Abstract
State-of-the-art defenses against adversarial patch attacks can now achieve strong certifiable robustness with a marginal drop in model utility. However, this impressive performance typically comes at the cost of 10-100x more inference-time computation compared to undefended models: the research community has witnessed an intense three-way trade-off between certifiable robustness, model utility, and computation efficiency. In this paper, we propose a defense framework named PatchCURE to address this trade-off problem. PatchCURE provides sufficient "knobs" for tuning defense performance and allows us to build a family of defenses: the most robust PatchCURE instance can match the performance of any existing state-of-the-art defense (without efficiency considerations); the most efficient PatchCURE instance has inference efficiency similar to that of undefended models. Notably, PatchCURE achieves state-of-the-art robustness and utility performance across all efficiency levels, e.g., absolute advantages of 16-23% in clean accuracy and certified robust accuracy over prior defenses when computation efficiency is required to be close to that of undefended models. The family of PatchCURE defenses enables us to flexibly choose appropriate defenses to satisfy given computation and/or utility constraints in practice.
