Estimating Control Barriers from Offline Data

Hongzhan Yu; Seth Farrell; Ryo Yoshimitsu; Zhizhen Qin; Henrik I. Christensen; Sicun Gao

Estimating Control Barriers from Offline Data

Hongzhan Yu, Seth Farrell, Ryo Yoshimitsu, Zhizhen Qin, Henrik I. Christensen, Sicun Gao

TL;DR

The paper tackles learning neural control barrier functions from offline, sparsely labeled data by introducing an offline framework where OOD-based annotation propagates information from limited labeled samples to unlabeled trajectories. It combines a rejection-based OOD detector, a maximally-safe actor to generate in-distribution controls, and a barrier-critic objective that enforces the CBF Lie-derivative condition, all while using a surrogate normalization to prevent training collapse. The approach is validated in simulation and on real hardware, achieving state-of-the-art dynamic obstacle avoidance with limited offline data and demonstrating safer, less conservative maneuvers compared to existing offline methods. The zero-superlevel set $\{x: B(x)\ge 0\}$ is maintained as a control-invariant region, enabling effective safety guarantees within the learned boundary.

Abstract

Learning-based methods for constructing control barrier functions (CBFs) are gaining popularity for ensuring safe robot control. A major limitation of existing methods is their reliance on extensive sampling over the state space or online system interaction in simulation. In this work we propose a novel framework for learning neural CBFs through a fixed, sparsely-labeled dataset collected prior to training. Our approach introduces new annotation techniques based on out-of-distribution analysis, enabling efficient knowledge propagation from the limited labeled data to the unlabeled data. We also eliminate the dependency on a high-performance expert controller, and allow multiple sub-optimal policies or even manual control during data collection. We evaluate the proposed method on real-world platforms. With limited amount of offline data, it achieves state-of-the-art performance for dynamic obstacle avoidance, demonstrating statistically safer and less conservative maneuvers compared to existing methods.

Estimating Control Barriers from Offline Data

TL;DR

Abstract

Estimating Control Barriers from Offline Data

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (8)

Theorems & Definitions (1)