Learning Control Barrier Functions and their application in Reinforcement Learning: A Survey
Maeva Guerrier, Hassan Fouad, Giovanni Beltrame
TL;DR
This survey tackles the problem of safe reinforcement learning for robotics by focusing on Control Barrier Functions (CBFs) as a principled tool to enforce forward invariance of safe state sets during learning and deployment. It surveys soft, hard, and probabilistic safety constraints in SRL, and systematically reviews how CBFs can be constructed or learned from data, including demonstrations, inverse RL, and priors, as well as safety-filter/shield architectures and GP-based uncertainty handling. The contributions include a comprehensive taxonomy of SRL approaches using CBFs, analysis of data-driven CBF construction methods, and discussion of practical challenges such as sim2real transfer, generalization, and deployment certification. The work highlights that CBF-based safety can improve sample efficiency and safety in RL, while underscoring the need for robust, transferable, and less conservative methods to bridge the gap to real-world, lifelong robotic systems.
Abstract
Reinforcement learning is a powerful technique for developing new robot behaviors. However, typical lack of safety guarantees constitutes a hurdle for its practical application on real robots. To address this issue, safe reinforcement learning aims to incorporate safety considerations, enabling faster transfer to real robots and facilitating lifelong learning. One promising approach within safe reinforcement learning is the use of control barrier functions. These functions provide a framework to ensure that the system remains in a safe state during the learning process. However, synthesizing control barrier functions is not straightforward and often requires ample domain knowledge. This challenge motivates the exploration of data-driven methods for automatically defining control barrier functions, which is highly appealing. We conduct a comprehensive review of the existing literature on safe reinforcement learning using control barrier functions. Additionally, we investigate various techniques for automatically learning the Control Barrier Functions, aiming to enhance the safety and efficacy of Reinforcement Learning in practical robot applications.
