Martingale deep learning for very high dimensional quasi-linear partial differential equations and stochastic optimal controls
Wei Cai, Shuixin Fang, Wenzhong Zhang, Tao Zhou
TL;DR
The paper tackles the challenge of solving very high-dimensional parabolic PDEs and Hamilton-Jacobi-Bellman equations arising in stochastic optimal control. It introduces a derivative-free martingale deep learning framework that reframes the PDEs as martingale conditions using a pilot process, enabling offline path generation and parallel training across time and space. A weak Galerkin formulation with adversarial test functions eliminates the need for conditional expectations, while a Policy Improvement Algorithm extension yields simultaneous learning of the value function and optimal control without explicit minimization. Numerical results demonstrate accurate solutions up to $d=10^4$ across quasilinear PDEs and HJB equations, with favorable runtimes and robustness to nonlinearities, highlighting practical potential for high-dimensional SOCPs.
Abstract
In this paper, a highly parallel and derivative-free martingale neural network learning method is proposed to solve Hamilton-Jacobi-Bellman (HJB) equations arising from stochastic optimal control problems (SOCPs), as well as general quasilinear parabolic partial differential equations (PDEs). In both cases, the PDEs are reformulated into a martingale formulation such that loss functions will not require the computation of the gradient or Hessian matrix of the PDE solution, while its implementation can be parallelized in both time and spatial domains. Moreover, the martingale conditions for the PDEs are enforced using a Galerkin method in conjunction with adversarial learning techniques, eliminating the need for direct computation of the conditional expectations associated with the martingale property. For SOCPs, a derivative-free implementation of the maximum principle for optimal controls is also introduced. The numerical results demonstrate the effectiveness and efficiency of the proposed method, which is capable of solving HJB and quasilinear parabolic PDEs accurately in dimensions as high as 10,000.
