Table of Contents
Fetching ...

Provable Bounds on the Hessian of Neural Networks: Derivative-Preserving Reachability Analysis

Sina Sharifi, Mahyar Fazlyab

TL;DR

This work develops a derivative-preserving reachability framework for neural networks with differentiable activations by deriving analytical bounds on the gradient and Hessian through a Taylor-model abstraction. It leverages a loop-transformation of activations to tighten local Lipschitz constants (local LipLT) and couples these bounds with a branch-and-bound scheme to obtain provable reachability for general input sets, including zonotopes. The main contributions are scalable gradient and Hessian bounds, a derivative-informed bounding strategy, and a method that combines end-to-end Taylor approximations with curvature information to improve verification over ReLU-centric approaches. The approach yields tighter, less conservative certificates for both open-loop and closed-loop systems, demonstrated on control benchmarks, with public code released for broader use and extension to curvature-aware robustness and safety analyses.

Abstract

We propose a novel reachability analysis method tailored for neural networks with differentiable activations. Our idea hinges on a sound abstraction of the neural network map based on first-order Taylor expansion and bounding the remainder. To this end, we propose a method to compute analytical bounds on the network's first derivative (gradient) and second derivative (Hessian). A key aspect of our method is loop transformation on the activation functions to exploit their monotonicity effectively. The resulting end-to-end abstraction locally preserves the derivative information, yielding accurate bounds on small input sets. Finally, we employ a branch and bound framework for larger input sets to refine the abstraction recursively. We evaluate our method numerically via different examples and compare the results with relevant state-of-the-art methods.

Provable Bounds on the Hessian of Neural Networks: Derivative-Preserving Reachability Analysis

TL;DR

This work develops a derivative-preserving reachability framework for neural networks with differentiable activations by deriving analytical bounds on the gradient and Hessian through a Taylor-model abstraction. It leverages a loop-transformation of activations to tighten local Lipschitz constants (local LipLT) and couples these bounds with a branch-and-bound scheme to obtain provable reachability for general input sets, including zonotopes. The main contributions are scalable gradient and Hessian bounds, a derivative-informed bounding strategy, and a method that combines end-to-end Taylor approximations with curvature information to improve verification over ReLU-centric approaches. The approach yields tighter, less conservative certificates for both open-loop and closed-loop systems, demonstrated on control benchmarks, with public code released for broader use and extension to curvature-aware robustness and safety analyses.

Abstract

We propose a novel reachability analysis method tailored for neural networks with differentiable activations. Our idea hinges on a sound abstraction of the neural network map based on first-order Taylor expansion and bounding the remainder. To this end, we propose a method to compute analytical bounds on the network's first derivative (gradient) and second derivative (Hessian). A key aspect of our method is loop transformation on the activation functions to exploit their monotonicity effectively. The resulting end-to-end abstraction locally preserves the derivative information, yielding accurate bounds on small input sets. Finally, we employ a branch and bound framework for larger input sets to refine the abstraction recursively. We evaluate our method numerically via different examples and compare the results with relevant state-of-the-art methods.
Paper Structure (29 sections, 13 theorems, 101 equations, 9 figures, 3 tables, 2 algorithms)

This paper contains 29 sections, 13 theorems, 101 equations, 9 figures, 3 tables, 2 algorithms.

Key Result

Lemma 1

The loop-transformed two-layer neural network $f$ given in eq:liplt2layer is Lipschitz continuous with constant

Figures (9)

  • Figure 1: Overview of proposed abstraction.
  • Figure 2: Loop Transformation on an activation layer.
  • Figure 3: Figure of the modified neural network. The yellow part is the sub-network consisting of the first $l$ layers and the blue part is the sub-network consisting of the final $(L-l)$ layers.
  • Figure 4: Zeroth and first-order upper bounds.
  • Figure 5: Closed-loop system reachability
  • ...and 4 more figures

Theorems & Definitions (28)

  • Lemma 1
  • Theorem 1
  • proof
  • Example 1
  • Theorem 2
  • proof
  • Corollary 1
  • Proposition 1
  • Remark 1: Comparison with global LipLT fazlyab2024certified
  • Lemma 2
  • ...and 18 more