Learning Control Barrier Functions with Deterministic Safety Guarantees
Amy K. Strong, Ali Kashani, Claus Danielson, Leila Bridgeman
TL;DR
This work tackles safe set synthesis with deterministic guarantees for Lipschitz, discrete-time dynamics from data by learning a continuous piecewise affine barrier function $W$ over a triangulation and a data-driven classifier $\gamma$ to enforce the BF condition $W(g(\mathbf{x},\mathbf{u}))-\gamma(W(\mathbf{x}),\mathbf{x})\le0$. The nonconvex learning problem is solved via Iterative Convex Overbounding (ICO), which alternates convex updates to maximize the safe-set volume while ensuring the BF condition holds on each simplex; this yields a provably safe, contractive sublevel set ${\mathcal{S}}=\{\mathbf{x}\mid W(\mathbf{x})\le0\}$ contained in $\mathcal{X}$. The CPA/triangle-based representation affords hard BF constraints and data efficiency, with convergence to a local minimum and the option to adaptively add data in regions where the constraint is tight. Experiments on 2D autonomous and non-autonomous systems show the learned safe sets can outperform model-based baselines in volume and closely approach maximal invariant sets, validating the approach's practicality for data-driven safety guarantees.
Abstract
Barrier functions (BFs) characterize safe sets of dynamical systems, where hard constraints are never violated as the system evolves over time. Computing a valid safe set and BF for a nonlinear (and potentially unmodeled), non-autonomous dynamical system is a difficult task. This work explores the design of BFs using data to obtain safe sets with deterministic assurances of control invariance. We leverage ReLU neural networks (NNs) to create continuous piecewise affine (CPA) BFs with deterministic safety guarantees for Lipschitz continuous, discrete-time dynamical system using sampled one-step trajectories. The CPA structure admits a novel classifier term to create a relaxed \ac{bf} condition and construction via a data driven constrained optimization. We use iterative convex overbounding (ICO) to solve this nonconvex optimization problem through a series of convex optimization steps. We then demonstrate our method's efficacy on two-dimensional autonomous and non-autonomous dynamical systems.
