A Comprehensive Survey on Inverse Constrained Reinforcement Learning: Definitions, Progress and Challenges
Guiliang Liu, Sheng Xu, Shicheng Liu, Ashish Gaurav, Sriram Ganapathi Subramanian, Pascal Poupart
TL;DR
This survey formalizes Inverse Constrained Reinforcement Learning (ICRL) as the problem of recovering implicit constraints from expert demonstrations within a Constrained Markov Decision Process framework, then reviews a spectrum of methods across deterministic and stochastic environments, limited demonstrations, and multi-agent settings. It analyzes maximum entropy and maximum causal entropy formulations, discusses hard versus soft constraints, and covers Bayesian, variational, data-augmentation, and offline strategies to address epistemic uncertainty in constraint inference. The authors also present approaches for simultaneous reward and constraint learning, constraint inference from multiple experts and multi-agent systems, and they benchmark ICRL methods on grid-world, MuJoCo, and HighD-like realistic environments, highlighting applications in autonomous driving, robotics, healthcare, and sports analytics. Open questions span theoretical identifiability, dynamic and transferable constraints, and real-world deployment, aiming to bridge theory and industrial practice with robust, generalizable constraint inference. $ICRL$ thus provides a comprehensive taxonomy, formal definitions, and practical guidelines to advance safe, interpretable constraint-aware RL systems across diverse domains.
Abstract
Inverse Constrained Reinforcement Learning (ICRL) is the task of inferring the implicit constraints that expert agents adhere to, based on their demonstration data. As an emerging research topic, ICRL has received considerable attention in recent years. This article presents a categorical survey of the latest advances in ICRL. It serves as a comprehensive reference for machine learning researchers and practitioners, as well as starters seeking to comprehend the definitions, advancements, and important challenges in ICRL. We begin by formally defining the problem and outlining the algorithmic framework that facilitates constraint inference across various scenarios. These include deterministic or stochastic environments, environments with limited demonstrations, and multiple agents. For each context, we illustrate the critical challenges and introduce a series of fundamental methods to tackle these issues. This survey encompasses discrete, virtual, and realistic environments for evaluating ICRL agents. We also delve into the most pertinent applications of ICRL, such as autonomous driving, robot control, and sports analytics. To stimulate continuing research, we conclude the survey with a discussion of key unresolved questions in ICRL that can effectively foster a bridge between theoretical understanding and practical industrial applications. The papers referenced in this survey can be found at https://github.com/Jasonxu1225/Awesome-Constraint-Inference-in-RL.
