Enhanced Estimation Techniques for Certified Radii in Randomized Smoothing
Zixuan Liang
TL;DR
The paper tackles the challenge of certifying neural network robustness under adversarial perturbations by advancing radius estimation in randomized smoothing. It develops discrete and continuous estimation methods that tighten lower bounds on the certified radii through refined Monte Carlo sampling, confidence intervals, and novel approximations (including signomial programming and Taylor-based inverses). Key contributions include Bonferroni-corrected exact intervals, empirical Bernstein and confidence-sequence approaches, and betting-based strategies for continuous monitoring, evaluated on CIFAR-10 and ImageNet to demonstrate tighter certified test-set accuracy and reduced radii estimation discrepancies. The work also analyzes hyperparameters such as sample size $n$, noise level $\sigma$, and temperature $T$ (for tempered softmax), highlighting practical trade-offs between robustness guarantees and computational cost. Overall, the proposed techniques advance scalable, tighter, and more reliable probabilistic certificates for randomized smoothing with directions for future theoretical and empirical refinement.
Abstract
This paper presents novel methods for estimating certified radii in randomized smoothing, a technique crucial for certifying the robustness of neural networks against adversarial perturbations. Our proposed techniques significantly improve the accuracy of certified test-set accuracy by providing tighter bounds on the certified radii. We introduce advanced algorithms for both discrete and continuous domains, demonstrating their effectiveness on CIFAR-10 and ImageNet datasets. The new methods show considerable improvements over existing approaches, particularly in reducing discrepancies in certified radii estimates. We also explore the impact of various hyperparameters, including sample size, standard deviation, and temperature, on the performance of these methods. Our findings highlight the potential for more efficient certification processes and pave the way for future research on tighter confidence sequences and improved theoretical frameworks. The study concludes with a discussion of potential future directions, including enhanced estimation techniques for discrete domains and further theoretical advancements to bridge the gap between empirical and theoretical performance in randomized smoothing.
