Temporal Parallelisation of the HJB Equation and Continuous-Time Linear Quadratic Control
Simo Särkkä, Ángel F. García-Fernández
TL;DR
This paper introduces a temporal parallelisation framework for continuous-time optimal control problems: the time horizon is partitioned into sub-intervals, and a conditional value function is solved on each one. An associative operator combines these conditional solutions in parallel, giving the HJB solution a span that is logarithmic in the number of sub-intervals, $O(\log T)$ for $T$ sub-intervals, and enabling parallel recovery of the optimal trajectory. The approach is specialised to the continuous-time linear quadratic tracking (LQT) problem, where the backward and forward conditional HJB equations are solved in closed form and combined with block-wise scans on multi-core CPUs and GPUs. Numerical experiments demonstrate speedups over sequential methods. The paper also discusses storage requirements and extensions to stochastic settings. The framework paves the way for scalable offline computation of optimal controls in high-dimensional, time-critical applications.
Abstract
This paper presents a mathematical formulation for the temporal parallelisation of continuous-time optimal control problems, which can be solved via the Hamilton--Jacobi--Bellman (HJB) equation. We divide the time interval of the control problem into sub-intervals, and define a control problem in each sub-interval, conditioned on the start and end states, leading to conditional value functions for the sub-intervals. By defining an associative operator as the minimisation of the sum of conditional value functions, we obtain the elements and associative operators for a parallel associative scan operation. This allows the optimal control problem on the whole time interval to be solved in parallel, with time complexity that is logarithmic in the number of sub-intervals. We derive HJB-type backward and forward equations for the conditional value functions and solve them in closed form for linear quadratic problems. We also discuss numerical methods for computing the conditional value functions. The computational advantages of the proposed parallel methods are demonstrated via simulations run on a multi-core central processing unit and a graphics processing unit.
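The associative operator described above can be illustrated with a minimal sketch. It is not the paper's implementation: it assumes the state space is discretised onto a small grid, so that each conditional value function becomes a matrix whose entry (i, j) is the cost of moving from start state i to end state j over one sub-interval. Under that assumption, minimising the sum of two conditional value functions over the shared intermediate state is a min-plus matrix product. The names `combine`, `sequential_combine`, and `tree_combine` are hypothetical, and the element matrices are random placeholders rather than solutions of any HJB equation.

```python
import numpy as np


def combine(V_left, V_right):
    """Min-plus product: V[i, k] = min_j (V_left[i, j] + V_right[j, k]).

    The minimisation runs over the intermediate state j shared by the
    two adjacent sub-intervals.
    """
    return np.min(V_left[:, :, None] + V_right[None, :, :], axis=1)


def sequential_combine(elements):
    """Combine the elements left to right, one at a time."""
    out = elements[0]
    for V in elements[1:]:
        out = combine(out, V)
    return out


def tree_combine(elements):
    """Combine adjacent pairs level by level.

    The combines within one level are independent of each other, so they
    could run in parallel; the number of levels grows logarithmically
    with the number of elements. Here they run in a plain Python loop.
    """
    level = list(elements)
    while len(level) > 1:
        nxt = [combine(level[i], level[i + 1])
               for i in range(0, len(level) - 1, 2)]
        if len(level) % 2 == 1:
            nxt.append(level[-1])  # unpaired last element keeps its position
        level = nxt
    return level[0]


# Placeholder conditional value functions on a 5-point state grid
# for 8 sub-intervals (random costs, purely illustrative).
rng = np.random.default_rng(0)
elements = [rng.uniform(0.0, 1.0, size=(5, 5)) for _ in range(8)]

V_seq = sequential_combine(elements)
V_tree = tree_combine(elements)
```

Because the operator is associative, the pairwise reduction and the sequential pass agree up to floating-point rounding. A full parallel scan would also return the intermediate prefix results, which this sketch omits for brevity.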
