Efficiently Computing Compact Formal Explanations
Min Wu, Xiaofu Li, Haoze Wu, Clark Barrett
TL;DR
VeriX+ tackles the challenge of producing minimal, formally guaranteed explanations for neural networks under bounded perturbations. It fuses bound-propagation-based sensitivity for compact explanations with batch-oriented traversals (binary search and QuickXplain adaptations) and a confidence-ranking heuristic to accelerate generation, achieving sizable gains on standard benchmarks and real-world tasks. The approach also demonstrates applicability to transformers and practical scenarios such as autonomous aircraft taxiing and sentiment analysis, with additional insights into adversarial training and OOD detection. Overall, VeriX+ advances scalable, verifiable explanations and highlights their practical value in safety-critical AI contexts.
Abstract
Building on VeriX (Verified eXplainability, arXiv:2212.01051), a system for producing optimal verified explanations for machine learning models, we present VeriX+, which significantly improves both the size and the generation time of formal explanations. We introduce a bound propagation-based sensitivity technique to improve the size, and a binary search-based traversal with confidence ranking for improving time -- the two techniques are orthogonal and can be used independently or together. We also show how to adapt the QuickXplain algorithm to our setting to provide a trade-off between size and time. Experimental evaluations on standard benchmarks demonstrate significant improvements on both metrics, e.g., a size reduction of $38\%$ on the GTSRB dataset and a time reduction of $90\%$ on MNIST. We demonstrate that our approach is scalable to transformers and real-world scenarios such as autonomous aircraft taxiing and sentiment analysis. We conclude by showcasing several novel applications of formal explanations.
