Learning a Formally Verified Control Barrier Function in Stochastic Environment

Manan Tayal; Hongchao Zhang; Pushpak Jagtap; Andrew Clark; Shishir Kolathaya

Learning a Formally Verified Control Barrier Function in Stochastic Environment

Manan Tayal, Hongchao Zhang, Pushpak Jagtap, Andrew Clark, Shishir Kolathaya

TL;DR

The paper addresses safety guarantees for continuous-time control in stochastic environments by learning a Stochastic Neural Control Barrier Function (SNCBF) that is formally verifiable in a single training step. It introduces a data-driven SOP framework with Lipschitz-certified neural networks to ensure safety constraints hold across the state space, eliminating post hoc verification. The approach yields a SNCBF and an accompanying SNCBF-QP controller, demonstrated on an inverted pendulum and obstacle avoidance tasks, achieving larger safe regions than baselines. This work advances safe learning-based control by providing formal guarantees under stochastic disturbances and scalable, offline training suitable for real-time deployment. The methods have practical impact for safety-critical autonomous systems where data-driven policies must be verifiably safe in continuous-time stochastic settings.

Abstract

Safety is a fundamental requirement of control systems. Control Barrier Functions (CBFs) are proposed to ensure the safety of the control system by constructing safety filters or synthesizing control inputs. However, the safety guarantee and performance of safe controllers rely on the construction of valid CBFs. Inspired by universal approximatability, CBFs are represented by neural networks, known as neural CBFs (NCBFs). This paper presents an algorithm for synthesizing formally verified continuous-time neural Control Barrier Functions in stochastic environments in a single step. The proposed training process ensures efficacy across the entire state space with only a finite number of data points by constructing a sample-based learning framework for Stochastic Neural CBFs (SNCBFs). Our methodology eliminates the need for post hoc verification by enforcing Lipschitz bounds on the neural network, its Jacobian, and Hessian terms. We demonstrate the effectiveness of our approach through case studies on the inverted pendulum system and obstacle avoidance in autonomous driving, showcasing larger safe regions compared to baseline methods.

Learning a Formally Verified Control Barrier Function in Stochastic Environment

TL;DR

Abstract

Paper Structure (15 sections, 5 theorems, 30 equations, 3 figures, 1 algorithm)

This paper contains 15 sections, 5 theorems, 30 equations, 3 figures, 1 algorithm.

Introduction
Preliminaries
Notations
System Description
Control Barrier Functions (CBFs)
Controller Synthesis for Real-time Safety
Problem Formulation
Method
Synthesis of SNCBF
Training with Safety Guarantee
Simulations
Inverted Pendulum
Obstacle Avoidance
Conclusions
Proof of Theorem \ref{['thm: effective_weights']}

Key Result

Lemma 1

The set $\mathcal{C} \subset \mathbb{R}^n$ be a set defined on the super-level set of a continuously differentiable function $h : \mathcal{X} \subset \mathbb{R}^n \rightarrow \mathbb{R}$. The function $h$ defined on the set $\mathcal{X}$ is a CBF for stochastic system in eq:state-sde, if there exist

Figures (3)

Figure 1: This figure presents the experimental results on the inverted pendulum system. Fig. \ref{['fig:2d_sample']} visualizes of $\tilde{h}\left(x \mid \theta\right)$ over $X$. Blue and red regions denote the safe region ($\tilde{h} \geq -\psi^*$) and the unsafe region ($\tilde{h} < \psi^*$), respectively. The initial safety region boundary and unsafe region boundary is denoted by black boxes. We observe that the boundary of trained SNCBF (black dots) successfully separate the unsafe and safe region. Fig. \ref{['fig:3d_sncbf']} shows the 3D plot of $\tilde{h}\left(x \mid \theta\right)$ over $X$. We observe that the safe region is greater than zero while unsafe region has negative function value. Fig. \ref{['fig:ip_traj']} presents trajectories initiating inside the safe set following SNCBF-QP, following different reference controllers
Figure 2: Proposed safe control comparison among different reaction distance. We let the vehicle to adjust its orientation to maneuver in its lane. We show three trajectories to demonstrate our proposed SNCBF-based controller under different initial state, namely, $6$, $8$ and $12$ meter away from the pedestrian, respectively. Three trajectories of the vehicle under control shows our proposed method succeeds in maneuvering the vehicle to avoid the pedestrian.
Figure 3: The left and right figures show the unsafe region and the zero-level sets of the SNCBF $\tilde{h}$ trained by baseline and the proposed method, respectively. Both zero-level sets (in yellow) do not overlap with the unsafe region in red color. The SNCBF trained by the baseline ensures a guaranteed safe subset over the state space with the ratio $11.8\%$ only, whereas the proposed method ensures a guaranteed safe subset with a ratio up to $77.6\%$.

Theorems & Definitions (12)

Definition 1: Safety
Definition 2: Control barrier function (CBF)
Lemma 1: clark2021verification
Theorem 1
proof
Lemma 2: pauli2021training
Remark 1
Theorem 2
proof
Theorem 3
...and 2 more

Learning a Formally Verified Control Barrier Function in Stochastic Environment

TL;DR

Abstract

Learning a Formally Verified Control Barrier Function in Stochastic Environment

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (3)

Theorems & Definitions (12)