Normalization of ReLU Dual for Cut Generation in Stochastic Mixed-Integer Programs
Akul Bansal, Simge Küçükyavuz
TL;DR
The paper addresses weak cuts arising from dual degeneracy in multistage stochastic programs with mixed-integer state variables by normalizing the ReLU dual in an extended space. It proves that the resulting normalized cuts are tight and Pareto-optimal in the original state space and shows that normalization can reproduce any cut from regularization while offering greater flexibility. The authors derive theoretical results connecting the normalized ReLU dual to lifted Lagrangian cuts and provide a comprehensive computational study on DCAP and CLSP problems, demonstrating stronger cuts and faster convergence, especially when paired with an alternating cut strategy. An open-source implementation is released, highlighting practical impact for scalable, convergent MSIP solution methods. The work advances cut-generation theory and offers a robust, flexible tool for improving decomposition-based solution methods in stochastic integer optimization.
Abstract
We study the Rectified Linear Unit (ReLU) dual, an existing dual formulation for stochastic programs that reformulates non-anticipativity constraints using ReLU functions to generate tight, non-convex, and mixed-integer representable cuts. While this dual reformulation guarantees convergence with mixed-integer state variables, it admits multiple optimal solutions that can yield weak cuts. To address this issue, we propose normalizing the dual in the extended space to identify solutions that yield stronger cuts. We prove that the resulting normalized cuts are tight and Pareto-optimal in the original state space. We further compare normalization with existing regularization-based approaches for handling dual degeneracy and explain why normalization offers key advantages. In particular, we show that normalization can recover any cut obtained via regularization, whereas the converse does not hold. Computational experiments demonstrate that the proposed approach outperforms existing methods by consistently yielding stronger cuts and reducing solution times on harder instances.
