Probabilistic Robustness in Deep Learning: A Concise yet Comprehensive Guide
Xingyu Zhao
TL;DR
Probabilistic robustness (PR) addresses robustness in deep learning under stochastic perturbations, offering a practical alternative to traditional adversarial robustness (AR) by focusing on the probability of unaffected predictions within perturbation neighborhoods. The chapter formally defines PR, surveys both black-box and white-box evaluation methods, and introduces AT-PR, a dedicated min-max training framework to enlarge PR-safe regions around inputs. It further discusses how to embed PR evidence into system-level safety assurance, including translating DL-level robustness metrics into reliability claims and propagating uncertainty through reliability models. The work highlights open challenges—benchmarking PR estimators, extending PR to frontier AI tasks, and developing end-to-end case studies for system integration—signaling important directions for safer, more trustworthy AI in safety-critical domains.
Abstract
Deep learning (DL) has demonstrated significant potential across various safety-critical applications, yet ensuring its robustness remains a key challenge. While adversarial robustness has been extensively studied in worst-case scenarios, probabilistic robustness (PR) offers a more practical perspective by quantifying the likelihood of failures under stochastic perturbations. This paper provides a concise yet comprehensive overview of PR, covering its formal definitions, evaluation and enhancement methods. We introduce a reformulated ''min-max'' optimisation framework for adversarial training specifically designed to improve PR. Furthermore, we explore the integration of PR verification evidence into system-level safety assurance, addressing challenges in translating DL model-level robustness to system-level claims. Finally, we highlight open research questions, including benchmarking PR evaluation methods, extending PR to generative AI tasks, and developing rigorous methodologies and case studies for system-level integration.
