Cooptimizing Safety and Performance with a Control-Constrained Formulation
Hao Wang, Adityaya Dhande, Somil Bansal
TL;DR
This paper addresses cooptimizing safety and performance for autonomous systems by converting a state-constrained optimal control problem into a time-dependent, control-constrained problem via Hamilton-Jacobi reachability. It proves the equivalence of the two formulations and shows the value function is a viscosity solution to an HJB-PDE under the control-constrained view, enabling practical synthesis through safe-control sets. A 2D case study demonstrates consistent safety-performance improvements over baselines, with clear tradeoffs in offline/online computation. Limitations include scalability to high-dimensional systems and differentiability assumptions, with future work pointing to deep-learning value-function approximations and smooth overapproximations to address these issues.
Abstract
Autonomous systems have witnessed a rapid increase in their capabilities, but it remains a challenge for them to perform tasks both effectively and safely. The fact that performance and safety can sometimes be competing objectives renders the cooptimization between them difficult. One school of thought is to treat this cooptimization as a constrained optimal control problem with a performance-oriented objective function and safety as a constraint. However, solving this constrained optimal control problem for general nonlinear systems remains challenging. In this work, we use the general framework of constrained optimal control, but given the safety state constraint, we convert it into an equivalent control constraint, resulting in a state and time-dependent control-constrained optimal control problem. This equivalent optimal control problem can readily be solved using the dynamic programming principle. We show the corresponding value function is a viscosity solution of a certain Hamilton-Jacobi-Bellman Partial Differential Equation (HJB-PDE). Furthermore, we demonstrate the effectiveness of our method with a two-dimensional case study, and the experiment shows that the controller synthesized using our method consistently outperforms the baselines, both in safety and performance.
