Proof Minimization in Neural Network Verification
Omri Isac, Idan Refaeli, Haoze Wu, Clark Barrett, Guy Katz
TL;DR
This paper tackles the challenge of prohibitively large UNSAT proofs produced by DNN verifiers and proposes a two-stage proof minimization workflow: a proof-dependency analysis to drop unused lemmas, followed by two greedy/global minimization strategies to further prune dependencies. The authors implement these techniques on the Marabou proof-producing verifier and evaluate them across six benchmarks, achieving average proof-size reductions of 37%–82% and proof-checking-time reductions up to 88%, with a modest verifier overhead of 7%–20%. The approach leverages a Farkas-lemma-based UNSAT certificate and formalizes proof dependencies through $w^T A V = \sum c_i x_i$, enabling on-the-fly lemma deletion and memory savings. Overall, the method significantly improves the scalability and reliability of proof-producing DNN verifiers, facilitating safer deployment in safety-critical applications.
Abstract
The widespread adoption of deep neural networks (DNNs) requires efficient techniques for verifying their safety. DNN verifiers are complex tools, which might contain bugs that could compromise their soundness and undermine the reliability of the verification process. This concern can be mitigated using proofs: artifacts that are checkable by an external and reliable proof checker, and which attest to the correctness of the verification process. However, such proofs tend to be extremely large, limiting their use in many scenarios. In this work, we address this problem by minimizing proofs of unsatisfiability produced by DNN verifiers. We present algorithms that remove facts which were learned during the verification process, but which are unnecessary for the proof itself. Conceptually, our method analyzes the dependencies among facts used to deduce UNSAT, and removes facts that did not contribute. We then further minimize the proof by eliminating remaining unnecessary dependencies, using two alternative procedures. We implemented our algorithms on top of a proof producing DNN verifier, and evaluated them across several benchmarks. Our results show that our best-performing algorithm reduces proof size by 37%-82% and proof checking time by 30%-88%, while introducing a runtime overhead of 7%-20% to the verification process itself.
