LDLT $\mathcal{L}$-Lipschitz Network: Generalized Deep End-To-End Lipschitz Network Construction
Marius F. R. Juston, Ramavarapu S. Sreenivas, Dustin Nottage, Ahmet Soylemezoglu
TL;DR
This work introduces a novel LDL$^\\top$-based Linear Matrix Inequality framework to design general $\\mathcal{L}$-Lipschitz deep residual networks, enabling end-to-end parameterization that preserves expressiveness while guaranteeing Lipschitz continuity. By reformulating ResNets as LMI constraints and exploiting block LDL$^\\top$ decomposition with positive diagonal blocks, the authors provide closed-form parameterizations and efficient computations (via Cholesky/Woodbury identities) that extend Lipschitz guarantees to inner layers and CNNs. Empirical results on 121 UCI datasets show competitive or superior certified robustness and accuracy compared to SDP-based layers, with LDLT variants achieving 3–13% gains in certified settings. The framework supports a broad range of architectures (including linear nets and U-Nets) and offers a scalable path toward robust, certifiable neural networks for adversarial defense and safety-critical control tasks.
Abstract
Deep residual networks (ResNets) have demonstrated outstanding success in computer vision tasks, attributed to their ability to maintain gradient flow through deep architectures. Simultaneously, controlling the Lipschitz constant in neural networks has emerged as an essential area of research to enhance adversarial robustness and network certifiability. This paper presents a rigorous approach to the general design of $\mathcal{L}$-Lipschitz deep residual networks using a Linear Matrix Inequality (LMI) framework. Initially, the ResNet architecture was reformulated as a cyclic tridiagonal LMI, and closed-form constraints on network parameters were derived to ensure $\mathcal{L}$-Lipschitz continuity; however, using a new $LDL^\top$ decomposition approach for certifying LMI feasibility, we extend the construction of $\mathcal{L}$-Lipchitz networks to any other nonlinear architecture. Our contributions include a provable parameterization methodology for constructing Lipschitz-constrained residual networks and other hierarchical architectures. Cholesky decomposition is also used for efficient parameterization. These findings enable robust network designs applicable to adversarial robustness, certified training, and control systems. The $LDL^\top$ formulation is shown to be a tight relaxation of the SDP-based network, maintaining full expressiveness and achieving 3\%-13\% accuracy gains over SLL Layers on 121 UCI data sets.
