Deep neural network approximations for the stable manifolds of the Hamilton-Jacobi-Bellman equations
Guoyuan Chen
TL;DR
The paper tackles infinite-horizon optimal control by representing the optimal feedback through the stable manifold of the Hamiltonian system associated with the stationary HJB equation. It proves that, under suitable accuracy, a neural network approximation of the stable manifold yields a stabilizing, nearly optimal closed-loop and provides exponential convergence bounds. Building on this theory, the authors develop a grid-free, data-driven algorithm that uses adaptive trajectory generation via two-point boundary-value problems and forward integration, a composite loss, and rescaling to train an LSTM-type network that approximates the manifold. They validate the approach on two high-dimensional problems—the Reaction Wheel Pendulum and the parabolic Allen-Cahn equation—demonstrating real-time control performance and substantial improvements over traditional methods, with the method remaining scalable to high-dimensional systems.
Abstract
For an infinite-horizon control problem, the optimal control can be represented by the stable manifold of the characteristic Hamiltonian system of Hamilton-Jacobi-Bellman (HJB) equation in a semiglobal domain. In this paper, we first theoretically prove that if an approximation is sufficiently close to the exact stable manifold of the HJB equation in a certain sense, then the control derived from this approximation stabilizes the system and is nearly optimal. Then, based on the theoretical result, we propose a deep learning algorithm to approximate the stable manifold and compute optimal feedback control numerically. The algorithm relies on adaptive data generation through finding trajectories randomly within the stable manifold. Such kind of algorithm is grid-free basically, making it potentially applicable to a wide range of high-dimensional nonlinear systems. We demonstrate the effectiveness of our method through two examples: stabilizing the Reaction Wheel Pendulums and controlling the parabolic Allen-Cahn equation.
