A Learning-Based Control Barrier Function for Car-Like Robots: Toward Less Conservative Collision Avoidance
Jianye Xu, Bassam Alrifaee
TL;DR
The work tackles conservatism in collision avoidance for car-like robots by replacing circle-based safety margins with an MTV-based margin that accounts for actual geometry and heading. A relative-dynamics framework enables learning a differentiable neural network surrogate, $h_\theta(\boldsymbol{x}^{j\underline{i}})$, to approximate the non-differentiable MTV margin, yielding a CBF $h_{MTV}(\boldsymbol{x}^{j\underline{i}})=h_\theta(\boldsymbol{x}^{j\underline{i}})-e_{\max}$ with relative degree $r=2$. The authors provide a theoretical foundation for applying this margin to the nonlinear kinematic bicycle model, train the margin on a large dataset (≈80k points) with an ~2.8% width error, and implement a CBF-QP that minimally adjusts a nominal RL controller to enforce safety. Case studies on overtaking and bypassing demonstrate that the MTV-based margin reduces conservatism (e.g., 33.5% less lateral space for bypassing) while maintaining comparable computation times, enabling more efficient maneuvers in dense environments.
Abstract
We propose a learning-based Control Barrier Function (CBF) to reduce conservatism in collision avoidance for car-like robots. Traditional CBFs often use the Euclidean distance between robots' centers as a safety margin, which neglects their headings and approximates their geometries as circles. Although this simplification meets the smoothness and differentiability requirements of CBFs, it may result in overly conservative behavior in dense environments. We address this by designing a safety margin that considers both the robot's heading and actual shape, thereby enabling a more precise estimation of safe regions. Because this safety margin is non-differentiable, we approximate it with a neural network to ensure differentiability. In addition, we propose a notion of relative dynamics that makes the learning process tractable. In a case study, we establish the theoretical foundation for applying this notion to a nonlinear kinematic bicycle model. Numerical experiments in overtaking and bypassing scenarios show that our approach reduces conservatism (e.g., requiring 33.5% less lateral space for bypassing) without incurring significant extra computation time. Code: https://github.com/bassamlab/sigmarl
