Strong Duality and Dual Ascent Approach to Continuous-Time Chance-Constrained Stochastic Optimal Control
Apurva Patil, Alfredo Duarte, Fabrizio Bisetti, Takashi Tanaka
TL;DR
The paper tackles continuous-time, continuous-space chance-constrained stochastic optimal control by formulating an exit-time based CC-SOC and deriving a dual problem via Lagrangian relaxation. It proves strong duality under a structural Assumption 1 and solves the dual using a path-integral dual ascent algorithm, enabling online optimization with open-loop trajectory samples. The dual function is computed through a transformed HJB PDE, with risk estimated either by a PDE-based approach or an importance-sampling scheme. Numerical experiments on 2D and 5D robotic models demonstrate the method’s effectiveness and its advantage over traditional finite-difference methods in higher dimensions, highlighting potential for real-time risk-aware planning in uncertain environments.
Abstract
The paper addresses a continuous-time continuous-space chance-constrained stochastic optimal control (SOC) problem where the probability of failure to satisfy given state constraints is explicitly bounded. We leverage the notion of exit time from continuous-time stochastic calculus to formulate a chance-constrained SOC problem. Without any conservative approximation, the chance constraint is transformed into an expectation of an indicator function which can be incorporated into the cost function by considering a dual formulation. We then express the dual function in terms of the solution to a Hamilton-Jacobi-Bellman partial differential equation parameterized by the dual variable. Under a certain assumption on the system dynamics and cost function, it is shown that a strong duality holds between the primal chance-constrained problem and its dual. The Path integral approach is utilized to numerically solve the dual problem via gradient ascent using open-loop samples of system trajectories. We present simulation studies on chance-constrained motion planning for spatial navigation of mobile robots and the solution of the path integral approach is compared with that of the finite difference method.
