Tight Verification of Probabilistic Robustness in Bayesian Neural Networks

Ben Batten; Mehran Hosseini; Alessio Lomuscio

Tight Verification of Probabilistic Robustness in Bayesian Neural Networks

Ben Batten, Mehran Hosseini, Alessio Lomuscio

TL;DR

This work tackles the challenge of verifying probabilistic robustness in Bayesian Neural Networks, where weight uncertainty makes exact certification intractable. It introduces two iterative, dynamic-region expansion methods—Pure Dynamic Scaling (PIE) and Gradient-Guided Dynamic Scaling (GIE)—to compute provably tighter lower bounds than state-of-the-art sampling approaches. Empirical results on MNIST and CIFAR-10 show substantial improvements in probabilistic certification, with bounds often doubling those of prior methods; gradient guidance generally yields the strongest guarantees. The methods are designed to be compatible with any BNN verification backend, enabling tighter safety assurances for probabilistic models in safety-critical contexts while highlighting practical considerations such as computation time and iteration depth.

Abstract

We introduce two algorithms for computing tight guarantees on the probabilistic robustness of Bayesian Neural Networks (BNNs). Computing robustness guarantees for BNNs is a significantly more challenging task than verifying the robustness of standard Neural Networks (NNs) because it requires searching the parameters' space for safe weights. Moreover, tight and complete approaches for the verification of standard NNs, such as those based on Mixed-Integer Linear Programming (MILP), cannot be directly used for the verification of BNNs because of the polynomial terms resulting from the consecutive multiplication of variables encoding the weights. Our algorithms efficiently and effectively search the parameters' space for safe weights by using iterative expansion and the network's gradient and can be used with any verification algorithm of choice for BNNs. In addition to proving that our algorithms compute tighter bounds than the SoA, we also evaluate our algorithms against the SoA on standard benchmarks, such as MNIST and CIFAR10, showing that our algorithms compute bounds up to 40% tighter than the SoA.

Tight Verification of Probabilistic Robustness in Bayesian Neural Networks

TL;DR

Abstract

Paper Structure (17 sections, 2 theorems, 12 equations, 5 figures, 2 tables, 2 algorithms)

This paper contains 17 sections, 2 theorems, 12 equations, 5 figures, 2 tables, 2 algorithms.

Introduction
Preliminaries
Bayesian Neural Networks
The Probabilistic Robustness Problem for BNNs
Computing the Probabilistic Robustness for BNNs
Pure Dynamic Scaling
Gradient-Guided Dynamic Scaling
Theoretical Comparison
Evaluation
Comparison of Pure Sampling, PIE, and GIE Approaches
Ablation Study on the Use of Gradient-Based Dynamic Scaling Factors
Ablation Study on the Number of Iterations
Limitations
Correction to WickerLPK20 implementation
Evaluation of timing for PIE algorithm
...and 2 more sections

Key Result

Proposition 2.1

Given a BNN $f_{{\mathbf{w}}}$, such that ${\mathbf{w}} \sim q(\bm{w})$ for a probability density function $q$, and a safety specification $\langle\mathcal{X}, \mathcal{S}\rangle$, we have that

Figures (5)

Figure 1: A comparison between the area of the set of safe weights $\mathcal{C}$ in Example \ref{['ex: Iterative Expansion']} that is covered by the pure sampling (\ref{['subfig: Pure Sampling']}), PIE after two iterations (\ref{['subfig: Iterative Expansion']}), and GIE after two iterations (\ref{['subfig: Gradient-Guided']}) for the same initial weights $\bm{w}_1, \dots, \bm{w}_4 \sim \mathcal{N}(\bm{0}, \bm{1})$. See Section \ref{['sec: Evaluation']} for detailed comparisons, including comparison with equal compute.
Figure 2: A 2-dimensional cross-section of the safe region $\mathcal{C}$ covered by pure sampling (black), PIE (blue), and GIE (Red) on the same image in CIFAR dataset.
Figure 3: Probabilistic certification as a function of $\rho$, the gradient-based scaling factor in Algorithm \ref{['alg: GIE']}.
Figure 4: The number of expansion iterations before PIE reaches max probabilistic certifications in Table \ref{['tbl: Evaluation']}.
Figure 5: Comparison of the pure sampling, PIE, and GIE approaches against the empirical probabilistic robustness, obtained by sampling parameters from the BNN posterior and using IBP to verify whether they result in correct classification. 50 samples were used for calculating each empirical accuracy.

Theorems & Definitions (8)

Definition 2.1: Bayesian Neural Network
Definition 2.2: Probabilistic Robustness Problem
Definition 2.3: Set of Safe Weights
Proposition 2.1
Example 3.1
Proposition 3.1
proof
Example 3.2

Tight Verification of Probabilistic Robustness in Bayesian Neural Networks

TL;DR

Abstract

Tight Verification of Probabilistic Robustness in Bayesian Neural Networks

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (5)

Theorems & Definitions (8)