Localized Randomized Smoothing for Collective Robustness Certification
Jan Schuchardt, Tom Wollschläger, Aleksandar Bojchevski, Stephan Günnemann
TL;DR
The paper addresses collective robustness for multi-output models by introducing localized randomized smoothing, where each output is smoothed with anisotropic noise tailored to its input locality. A MILP-based collective certificate aggregates per-output base certificates, enabling provable bounds on how many predictions can be compromised by a single adversarial input under a budget constraint. Key contributions include a formal base-certificate interface, an efficient LP formulation, efficiency-enhancing strategies (sharing noise, partitioning, and binning), and variance-constrained certification for discrete data. Empirical results on semantic segmentation and graph node classification show that localized smoothing can achieve stronger average certifiable radii and competitive accuracy compared to isotropic baselines, particularly for softly local models. The work offers a general, scalable certificate for softly local multi-output tasks and highlights the role of locality in achieving cooperative robustness with practical implications for safer deployment of complex models.
Abstract
Models for image segmentation, node classification and many other tasks map a single input to multiple labels. By perturbing this single shared input (e.g. the image) an adversary can manipulate several predictions (e.g. misclassify several pixels). Collective robustness certification is the task of provably bounding the number of robust predictions under this threat model. The only dedicated method that goes beyond certifying each output independently is limited to strictly local models, where each prediction is associated with a small receptive field. We propose a more general collective robustness certificate for all types of models. We further show that this approach is beneficial for the larger class of softly local models, where each output is dependent on the entire input but assigns different levels of importance to different input regions (e.g. based on their proximity in the image). The certificate is based on our novel localized randomized smoothing approach, where the random perturbation strength for different input regions is proportional to their importance for the outputs. Localized smoothing Pareto-dominates existing certificates on both image segmentation and node classification tasks, simultaneously offering higher accuracy and stronger certificates.
