A Physics-Informed Machine Learning Framework for Safe and Optimal Control of Autonomous Systems
Manan Tayal, Aditya Singh, Shishir Kolathaya, Somil Bansal
TL;DR
This work tackles co-optimizing safety and performance for nonlinear autonomous systems by formulating a state-constrained optimal control problem (SC-OCP) and solving it via an epigraph reformulation. A physics-informed neural network learns the auxiliary value function $\hat{V}$ satisfying the HJB-PDE, while conformal prediction provides high-confidence safety guarantees and performance bounds. The final value function $V_{\theta}$ and policy $\pi_{\theta}$ are recovered by enforcing a safety margin $\delta$ and optimizing over an augmented state, ensuring robust, safe, and near-optimal control. Experiments on 2D boat navigation, 8D pursuer-evader tracking, and 20D multi-agent navigation demonstrate scalable, real-time performance with provable safety guarantees, outperforming CRL and safety-filter baselines in both safety and efficiency.
Abstract
As autonomous systems become more ubiquitous in daily life, ensuring high performance with guaranteed safety is crucial. However, safety and performance could be competing objectives, which makes their co-optimization difficult. Learning-based methods, such as Constrained Reinforcement Learning (CRL), achieve strong performance but lack formal safety guarantees due to safety being enforced as soft constraints, limiting their use in safety-critical settings. Conversely, formal methods such as Hamilton-Jacobi (HJ) Reachability Analysis and Control Barrier Functions (CBFs) provide rigorous safety assurances but often neglect performance, resulting in overly conservative controllers. To bridge this gap, we formulate the co-optimization of safety and performance as a state-constrained optimal control problem, where performance objectives are encoded via a cost function and safety requirements are imposed as state constraints. We demonstrate that the resultant value function satisfies a Hamilton-Jacobi-Bellman (HJB) equation, which we approximate efficiently using a novel physics-informed machine learning framework. In addition, we introduce a conformal prediction-based verification strategy to quantify the learning errors, recovering a high-confidence safety value function, along with a probabilistic error bound on performance degradation. Through several case studies, we demonstrate the efficacy of the proposed framework in enabling scalable learning of safe and performant controllers for complex, high-dimensional autonomous systems.
