A Survey of Constraint Formulations in Safe Reinforcement Learning
Akifumi Wachi, Xun Shen, Yanan Sui
TL;DR
This survey addresses the fragmentation in safe RL by focusing on constraint formulations within the constrained Markov decision process (CMDP) framework, formalizing the problem as max $V_r^\pi(\rho)$ subject to a constraint $f_{\mathcal{C}}(\pi)\le 0$ and cataloguing seven representative constraint representations. It introduces three theoretical notions—transformability, generalizability, and conservative approximation—and shows that many formulations are IoMG-SafeRL variants of others, e.g., an instantaneous or per-step constraint can be related to a cumulative, long-horizon constraint via budget transformations $\eta_h$, with key results including that Problem 3.4 is IoMG-SafeRL over (3.1,3.2) and Problem 3.7 is IoMG-SafeRL over (3.5,3.6); furthermore, a gamma-corrected instantaneous formulation can conservatively approximate joint chance constraints. The paper provides a curated map of representative algorithms aligned to each formulation, discusses practical considerations for algorithm choice and safety guarantees during training versus post-convergence, and highlights online versus offline safe RL as a practical axis for deployment. Overall, the work offers a systematic understanding of constraint formulations, guides formulation- and algorithm-selection for real-world safety-critical RL, and sketches avenues for extending safe RL beyond the standard cumulative-additive paradigms.
Abstract
Safety is critical when applying reinforcement learning (RL) to real-world problems. As a result, safe RL has emerged as a fundamental and powerful paradigm for optimizing an agent's policy while incorporating notions of safety. A prevalent safe RL approach is based on a constrained criterion, which seeks to maximize the expected cumulative reward subject to specific safety constraints. Despite recent effort to enhance safety in RL, a systematic understanding of the field remains difficult. This challenge stems from the diversity of constraint representations and little exploration of their interrelations. To bridge this knowledge gap, we present a comprehensive review of representative constraint formulations, along with a curated selection of algorithms designed specifically for each formulation. In addition, we elucidate the theoretical underpinnings that reveal the mathematical mutual relations among common problem formulations. We conclude with a discussion of the current state and future directions of safe reinforcement learning research.
