Towards Optimal Branching of Linear and Semidefinite Relaxations for Neural Network Robustness Certification
Brendon G. Anderson, Ziye Ma, Jingqi Li, Somayeh Sojoudi
TL;DR
This work enhances robustness certification of ReLU networks by introducing partitioned LP and SDP relaxations within a branch-and-bound framework. By partitioning the input uncertainty set and solving convex relaxations on each part, the authors prove both validity and tightening of the relaxations, and derive principled, worst-case-guided branching schemes for single-hidden-layer networks. They establish that optimizing the true LP relaxation error is NP-hard, motivating tractable worst-case partitioning rules, and derive a rank-1 gap based SDP partition strategy that favors uniform coordinate-wise partitions. Empirical results on MNIST, CIFAR-10, and Wisconsin Breast Cancer demonstrate substantial gains in certified robustness for both LP and SDP approaches, with the SDP method particularly advantageous in deeper networks and large-scale benchmarks, achieving performance competitive with state-of-the-art verifiers. The paper also extends LP branching to multi-layer settings, offering guidance on regime-specific applicability of branched LP vs branched SDP in practical certification tasks.
Abstract
In this paper, we study certifying the robustness of ReLU neural networks against adversarial input perturbations. To diminish the relaxation error suffered by the popular linear programming (LP) and semidefinite programming (SDP) certification methods, we take a branch-and-bound approach to propose partitioning the input uncertainty set and solving the relaxations on each part separately. We show that this approach reduces relaxation error, and that the error is eliminated entirely upon performing an LP relaxation with a partition intelligently designed to exploit the nature of the ReLU activations. To scale this approach to large networks, we consider using a coarser partition whereby the number of parts in the partition is reduced. We prove that computing such a coarse partition that directly minimizes the LP relaxation error is NP-hard. By instead minimizing the worst-case LP relaxation error, we develop a closed-form branching scheme in the single-hidden layer case. We extend the analysis to the SDP, where the feasible set geometry is exploited to design a branching scheme that minimizes the worst-case SDP relaxation error. Experiments on MNIST, CIFAR-10, and Wisconsin breast cancer diagnosis classifiers demonstrate significant increases in the percentages of test samples certified. By independently increasing the input size and the number of layers, we empirically illustrate under which regimes the branched LP and branched SDP are best applied. Finally, we extend our LP branching method into a multi-layer branching heuristic, which attains comparable performance to prior state-of-the-art heuristics on large-scale, deep neural network certification benchmarks.
