Provable Bounds on the Hessian of Neural Networks: Derivative-Preserving Reachability Analysis

Sina Sharifi; Mahyar Fazlyab

Provable Bounds on the Hessian of Neural Networks: Derivative-Preserving Reachability Analysis

Sina Sharifi, Mahyar Fazlyab

TL;DR

This work develops a derivative-preserving reachability framework for neural networks with differentiable activations by deriving analytical bounds on the gradient and Hessian through a Taylor-model abstraction. It leverages a loop-transformation of activations to tighten local Lipschitz constants (local LipLT) and couples these bounds with a branch-and-bound scheme to obtain provable reachability for general input sets, including zonotopes. The main contributions are scalable gradient and Hessian bounds, a derivative-informed bounding strategy, and a method that combines end-to-end Taylor approximations with curvature information to improve verification over ReLU-centric approaches. The approach yields tighter, less conservative certificates for both open-loop and closed-loop systems, demonstrated on control benchmarks, with public code released for broader use and extension to curvature-aware robustness and safety analyses.

Abstract

We propose a novel reachability analysis method tailored for neural networks with differentiable activations. Our idea hinges on a sound abstraction of the neural network map based on first-order Taylor expansion and bounding the remainder. To this end, we propose a method to compute analytical bounds on the network's first derivative (gradient) and second derivative (Hessian). A key aspect of our method is loop transformation on the activation functions to exploit their monotonicity effectively. The resulting end-to-end abstraction locally preserves the derivative information, yielding accurate bounds on small input sets. Finally, we employ a branch and bound framework for larger input sets to refine the abstraction recursively. We evaluate our method numerically via different examples and compare the results with relevant state-of-the-art methods.

Provable Bounds on the Hessian of Neural Networks: Derivative-Preserving Reachability Analysis

TL;DR

Abstract

Paper Structure (29 sections, 13 theorems, 101 equations, 9 figures, 3 tables, 2 algorithms)

This paper contains 29 sections, 13 theorems, 101 equations, 9 figures, 3 tables, 2 algorithms.

Introduction
Contributions
Related Work
Preliminaries and Notation
Problem Statement
Sound Taylor Approximation of Neural Networks
Zeroth-order approximation
First-order approximation
Local Lipschitz Approximation for general neural networks
Two-layer neural networks
Multi-layer neural networks
Hessian approximation of smooth neural networks
Two-layer neural networks
Multi-layer neural networks
Reachability Analysis
...and 14 more sections

Key Result

Lemma 1

The loop-transformed two-layer neural network $f$ given in eq:liplt2layer is Lipschitz continuous with constant

Figures (9)

Figure 1: Overview of proposed abstraction.
Figure 2: Loop Transformation on an activation layer.
Figure 3: Figure of the modified neural network. The yellow part is the sub-network consisting of the first $l$ layers and the blue part is the sub-network consisting of the final $(L-l)$ layers.
Figure 4: Zeroth and first-order upper bounds.
Figure 5: Closed-loop system reachability
...and 4 more figures

Theorems & Definitions (28)

Lemma 1
Theorem 1
proof
Example 1
Theorem 2
proof
Corollary 1
Proposition 1
Remark 1: Comparison with global LipLT fazlyab2024certified
Lemma 2
...and 18 more

Provable Bounds on the Hessian of Neural Networks: Derivative-Preserving Reachability Analysis

TL;DR

Abstract

Provable Bounds on the Hessian of Neural Networks: Derivative-Preserving Reachability Analysis

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (9)

Theorems & Definitions (28)