Risk-Averse Certification of Bayesian Neural Networks
Xiyue Zhang, Zifan Wang, Yulong Gao, Licio Romao, Alessandro Abate, Marta Kwiatkowska
TL;DR
This work tackles certification of Bayesian neural networks under uncertainty by introducing RAC-BNN, a risk-averse framework that combines sampling-based output-set approximation with template polytopes and CVaR-based performance evaluation. By constructing provably sound convex approximations of the BNN output space and providing probabilistic guarantees on CVaR estimates, RAC-BNN enables flexible risk-level analysis (via the parameter $\alpha$) and improves tightness and efficiency over state-of-the-art methods. The approach is implemented as a prototype tool and validated on regression and classification benchmarks, showing tighter certified bounds, reduced computation time, and robust handling of worst-case scenarios. This risk-aware certification has practical impact for deploying BNNs in safety- and security-critical settings, and the framework can be extended to closed-loop decision systems in future work.
Abstract
In light of the inherently complex and dynamic nature of real-world environments, incorporating risk measures is crucial for the robustness evaluation of deep learning models. In this work, we propose a Risk-Averse Certification framework for Bayesian neural networks called RAC-BNN. Our method leverages sampling and optimisation to compute a sound approximation of the output set of a BNN, represented using a set of template polytopes. To enhance robustness evaluation, we integrate a coherent distortion risk measure--Conditional Value at Risk (CVaR)--into the certification framework, providing probabilistic guarantees based on empirical distributions obtained through sampling. We validate RAC-BNN on a range of regression and classification benchmarks and compare its performance with a state-of-the-art method. The results show that RAC-BNN effectively quantifies robustness under worst-performing risky scenarios, and achieves tighter certified bounds and higher efficiency in complex tasks.
