Learning a Formally Verified Control Barrier Function in Stochastic Environment
Manan Tayal, Hongchao Zhang, Pushpak Jagtap, Andrew Clark, Shishir Kolathaya
TL;DR
The paper addresses safety guarantees for continuous-time control in stochastic environments by learning a Stochastic Neural Control Barrier Function (SNCBF) that is formally verifiable in a single training step. It introduces a data-driven SOP framework with Lipschitz-certified neural networks to ensure safety constraints hold across the state space, eliminating post hoc verification. The approach yields a SNCBF and an accompanying SNCBF-QP controller, demonstrated on an inverted pendulum and obstacle avoidance tasks, achieving larger safe regions than baselines. This work advances safe learning-based control by providing formal guarantees under stochastic disturbances and scalable, offline training suitable for real-time deployment. The methods have practical impact for safety-critical autonomous systems where data-driven policies must be verifiably safe in continuous-time stochastic settings.
Abstract
Safety is a fundamental requirement of control systems. Control Barrier Functions (CBFs) are proposed to ensure the safety of the control system by constructing safety filters or synthesizing control inputs. However, the safety guarantee and performance of safe controllers rely on the construction of valid CBFs. Inspired by universal approximatability, CBFs are represented by neural networks, known as neural CBFs (NCBFs). This paper presents an algorithm for synthesizing formally verified continuous-time neural Control Barrier Functions in stochastic environments in a single step. The proposed training process ensures efficacy across the entire state space with only a finite number of data points by constructing a sample-based learning framework for Stochastic Neural CBFs (SNCBFs). Our methodology eliminates the need for post hoc verification by enforcing Lipschitz bounds on the neural network, its Jacobian, and Hessian terms. We demonstrate the effectiveness of our approach through case studies on the inverted pendulum system and obstacle avoidance in autonomous driving, showcasing larger safe regions compared to baseline methods.
