Convergence of the deep BSDE method for stochastic control problems formulated through the stochastic maximum principle
Zhipeng Huang, Balint Negyesi, Cornelis W. Oosterlee
TL;DR
This paper addresses high-dimensional stochastic control problems formulated via the stochastic maximum principle (SMP) and develops a deep SMP-BSDE method to solve the resulting vector-valued FBSDEs. The authors prove an a-posteriori convergence bound that relates the numerical error to the time discretization and to the terminal loss, generalizing scalar results to multi-dimensional settings. They also compare the SMP-based approach with HJB- and DP-based deep BSDE methods, highlighting its ability to handle diffusion control. Numerical experiments on linear-quadratic-type problems show accurate, scalable performance and practical convergence guarantees in high dimensions.
Abstract
It is well-known that decision-making problems from stochastic control can be formulated by means of a forward-backward stochastic differential equation (FBSDE). Recently, the authors of Ji et al. 2022 proposed an efficient deep learning algorithm based on the stochastic maximum principle (SMP). In this paper, we provide a convergence result for this deep SMP-BSDE algorithm and compare its performance with other existing methods. In particular, by adopting a strategy as in Han and Long 2020, we derive a-posteriori estimate, and show that the total approximation error can be bounded by the value of the loss functional and the discretization error. We present numerical examples for high-dimensional stochastic control problems, both in case of drift- and diffusion control, which showcase superior performance compared to existing algorithms.
