Convergence Analysis of the Upwind Difference Methods for Hamilton-Jacobi-Bellman Equations
Daisuke Inoue, Yuji Ito, Takahito Kashiwabara, Norikazu Saito, Hiroaki Yoshida
TL;DR
This work analyzes the upwind difference scheme for Hamilton-Jacobi-Bellman equations arising in optimal control. It shows that, under a classical (smooth) solution, the value function converges with a first-order rate in both time and space, i.e., $O(\Delta t^{\eta}+\Delta x^{\eta})$ for some $\eta\in(0,1]$, while the standard viscosity-solution theory gives $O(\sqrt{\Delta t})$ in general. The authors further establish convergence of the numerical control input in the one-dimensional setting by exploiting the link between HJB equations and scalar conservation laws, using $L^1$ convergence of spatial derivatives and epi-convergence to guarantee that the discrete minimizers converge to the true optimal input $a^*(x,t)$. Numerical experiments in 1D and 2D corroborate the theoretical rates and demonstrate the scheme’s practical reliability for control synthesis. The results provide rigorous guarantees for using upwind schemes in discrete HJB-based control and highlight an open problem of extending control-input convergence to higher dimensions.
Abstract
This paper investigates the convergence properties of the upwind difference scheme for the Hamilton--Jacobi--Bellman (HJB) equation, a central partial differential equation in optimal control theory. First, assuming the existence of a classical solution, we show that the numerical solution converges to the true solution with a first-order rate with respect to the time step. This result complements the square-root rate established in previous studies for viscosity solutions. Second, by exploiting the correspondence between HJB equations and conservation laws, we prove the convergence of the optimal control input. This analysis is crucial for practical applications where the control input is the primary quantity of interest, yet it has rarely been addressed in previous studies. Finally, we confirm the validity of our theoretical results through numerical experiments on typical control problems.
