Computational Tradeoffs of Optimization-Based Bound Tightening in ReLU Networks
Fabian Badilla, Marcos Goycoolea, Gonzalo Muñoz, Thiago Serra
TL;DR
This work investigates the tradeoff between activation-bound tightness and MILP solve time in ReLU networks, focusing on how bound quality affects downstream optimization tasks like verification. It compares strong (exact/OBBT) and weak (LP-relaxed) bounds, as well as naive bounds, across architectures, regularization, and pruning, using a rigorous MNIST experimental setup. The findings show that LP-relaxation-based weak bounds often achieve a favorable balance between computational cost and bound quality, while strong bounds can dominate in deep networks for verification. The results provide actionable guidance for embedding neural networks in MILP-based optimization and verification pipelines, including hybrid bound strategies that adapt to layer depth and network conditioning.
Abstract
The use of Mixed-Integer Linear Programming (MILP) models to represent neural networks with Rectified Linear Unit (ReLU) activations has become increasingly widespread in the last decade. This has enabled the use of MILP technology to test-or stress-their behavior, to adversarially improve their training, and to embed them in optimization models leveraging their predictive power. Many of these MILP models rely on activation bounds. That is, bounds on the input values of each neuron. In this work, we explore the tradeoff between the tightness of these bounds and the computational effort of solving the resulting MILP models. We provide guidelines for implementing these models based on the impact of network structure, regularization, and rounding.
