Compositional Curvature Bounds for Deep Neural Networks
Taha Entesari, Sina Sharifi, Mahyar Fazlyab
TL;DR
This work addresses adversarial vulnerability by introducing a curvature-based framework for neural networks, where the curvature constant $L_{Df}$ (the Jacobian’s Lipschitz constant) governs robustness beyond traditional Lipschitz bounds. It develops an analytical, scalable algorithm that propagates curvature bounds through model compositions and residual blocks, yielding differentiable curvature regularizers and anchored Lipschitz constants to tighten robustness certificates. The paper derives curvature-based robustness and attack certificates with closed-form radii, and furnishes a practical algorithm to bound the end-to-end Jacobian while enabling curvature-aware training. Empirical results on MNIST and CIFAR-10 demonstrate improved certified radii and reduced certification gaps, and the approach supports 1-Lipschitz architectures to further enhance robustness with practical training strategies.
Abstract
A key challenge that threatens the widespread use of neural networks in safety-critical applications is their vulnerability to adversarial attacks. In this paper, we study the second-order behavior of continuously differentiable deep neural networks, focusing on robustness against adversarial perturbations. First, we provide a theoretical analysis of robustness and attack certificates for deep classifiers by leveraging local gradients and upper bounds on the second derivative (curvature constant). Next, we introduce a novel algorithm to analytically compute provable upper bounds on the second derivative of neural networks. This algorithm leverages the compositional structure of the model to propagate the curvature bound layer-by-layer, giving rise to a scalable and modular approach. The proposed bound can serve as a differentiable regularizer to control the curvature of neural networks during training, thereby enhancing robustness. Finally, we demonstrate the efficacy of our method on classification tasks using the MNIST and CIFAR-10 datasets.
