Bounding the Error of Value Functions in Sobolev Norm Yields Bounds on Suboptimality of Controller Performance
Morgan Jones, Matthew Peet
TL;DR
The paper tackles suboptimality guarantees for controllers synthesized from approximate value functions (VFs) in finite-horizon nonlinear control. It proves a sharp suboptimality bound: the cost gap $L(u_J,x_0)$ is bounded by a constant times the Sobolev norm error $\|J - V^*\|_{W^{1,\infty}(B_R(0)\times[0,T])}$, enabling arbitrarily small performance loss when VF approximations converge in $W^{1,\infty}$; this does not hold for $L^ty$ error. The core argument uses a modified OCP in which $J$ plays the role of the VF and the HJB framework for viscosity solutions to establish the bound, with explicit dependence on horizon and system growth parameters. Numerical examples illustrate that uniform convergence of $J$ to $V^*$ can fail to guarantee near-optimal performance unless the convergence is in the Sobolev sense, and confirm that the bound behaves as predicted with respect to the Sobolev error. The results justify prioritizing Sobolev-error control in numerical VF approximations (e.g., PINNs) to obtain provable controller performance guarantees in continuous-time, finite-horizon settings.
Abstract
Optimal feedback controllers for nonlinear systems can be derived by solving the Hamilton-Jacobi-Bellman (HJB) equation. However, because the HJB is a nonlinear partial differential equation, in general only approximate solutions can be numerically found. While numerical error bounds on approximate HJB solutions are often available, we show that these bounds do not necessarily translate into guarantees on the suboptimality of the resulting controllers. In this paper, we establish that if the numerical error in the HJB solution can be bounded in a Sobolev norm, a norm involving spatial derivatives, then the suboptimality of the corresponding feedback controller can also be bounded, and this bound can be made arbitrarily small. In contrast, we demonstrate that such guarantees do not hold when the error is measured in more typical norms, such as the uniform norm ($L^\infty$). Our results apply to systems governed by locally Lipschitz continuous dynamics over a finite time horizon with a compact input space. Numerical examples are provided to illustrate the theoretical findings.
