A Complete Set of Quadratic Constraints for Repeated ReLU and Generalizations
Sahel Vahedi Noori, Bin Hu, Geir Dullerud, Peter Seiler
TL;DR
The paper addresses tightening stability and robustness analysis for networks with ReLU activations by deriving a complete class of quadratic constraints (QCs) for the repeated ReLU, characterized by a copositivity-based set $\mathcal{M}_c$ with $2^{n_v}$ conditions. It proves that only the repeated ReLU $\Phi$ and its flipped version $\Phi^{\text{flip}}$ satisfy all QCs in $\mathcal{M}_c$, and it further develops a complete incremental QC set $\mathcal{M}_c^{\text{inc}}$ along with a new incremental QC set $\mathcal{M}^{\text{inc}}_2$, enabling less conservative Lipschitz bounds and $\ell_2$-gain analyses via SDP relaxations. The framework extends to affine transformations and other piecewise linear activations (e.g., leaky ReLU, Householder, MaxMin) and is demonstrated through examples showing reduced conservatism compared to standard QC sets. This leads to practical SDP-based tools for stability and performance assessment of RNNs/NNs with ReLU activations and offers a unifying view of ReLU QCs with potential scalability improvements. Overall, the work significantly tightens the characterization of ReLU nonlinearities and provides actionable methods for rigorous analysis of Lipschitz bounds and stability in neural networks.
Abstract
This paper derives a complete set of quadratic constraints (QCs) for the repeated ReLU. The complete set of QCs is described by a collection of matrix copositivity conditions. We also show that only two functions satisfy all QCs in our complete set: the repeated ReLU and flipped ReLU. Thus our complete set of QCs bounds the repeated ReLU as tight as possible up to the sign invariance inherent in quadratic forms. We derive a similar complete set of incremental QCs for repeated ReLU, which can potentially lead to less conservative Lipschitz bounds for ReLU networks than the standard LipSDP approach. The basic constructions are also used to derive the complete sets of QCs for other piecewise linear activation functions such as leaky ReLU, MaxMin, and HouseHolder. Finally, we illustrate the use of the complete set of QCs to assess stability and performance for recurrent neural networks with ReLU activation functions. We rely on a standard copositivity relaxation to formulate the stability/performance condition as a semidefinite program. Simple examples are provided to illustrate that the complete sets of QCs and incremental QCs can yield less conservative bounds than existing sets.
