Time-reversal solution of BSDEs in stochastic optimal control: a linear quadratic study
Yuhang Mei, Amirhossein Taghvaei
TL;DR
The paper addresses numerical solutions for BSDEs arising in stochastic optimal control by comparing least-squares Monte Carlo (LSMC) and time-reversal (TR) approaches within a unified framework, focusing on both value-function and co-state BSDEs and benchmarking against the linear-quadratic (LQ) case. It demonstrates that the TR method delivers significantly higher accuracy and stability, particularly for the co-state BSDE and in higher dimensions, and provides explicit LQ formulas to validate results. The findings suggest that time-reversal techniques, including the score-based drift estimation, offer a robust alternative to conditional-expectation regression for solving SOC BSDEs with potential applicability beyond the LQ setting. The work highlights practical implications for efficiently solving stochastic control problems and motivates further exploration into nonlinear extensions.
Abstract
This paper addresses the numerical solution of backward stochastic differential equations (BSDEs) arising in stochastic optimal control. Specifically, we investigate two BSDEs: one derived from the Hamilton-Jacobi-Bellman equation and the other from the stochastic maximum principle. For both formulations, we analyze and compare two numerical methods. The first utilizes the least-squares Monte-Carlo (LSMC) approach for approximating conditional expectations, while the second leverages a time-reversal (TR) of diffusion processes. Although both methods extend to nonlinear settings, our focus is on the linear-quadratic case, where analytical solutions provide a benchmark. Numerical results demonstrate the superior accuracy and efficiency of the TR approach across both BSDE representations, highlighting its potential for broader applications in stochastic control.
