Solving high-dimensional partial differential equations using deep learning
Jiequn Han, Arnulf Jentzen, Weinan E
TL;DR
Problem: solving high-dimensional semilinear parabolic PDEs is computationally intractable with traditional methods due to the curse of dimensionality. Approach: reformulate the PDEs as backward stochastic differential equations and use deep neural networks to approximate the gradient of the solution, integrating these approximations into a forward-time computation via a deep BSDE network trained with terminal-condition losses. Contributions: the method is demonstrated on 100-dimensional nonlinear Black-Scholes with default risk, a 100-dimensional Hamilton-Jacobi-Bellman equation, and a 100-dimensional Allen-Cahn equation, achieving relative errors under 0.5% and practical runtimes on standard hardware; the architecture supports evaluating u and its gradient along stochastic paths. Significance: enables efficient, scalable solution of complex high-dimensional PDEs in economics, finance, operations research, and physics, broadening the scope of multi-agent and multi-asset models.
Abstract
Developing algorithms for solving high-dimensional partial differential equations (PDEs) has been an exceedingly difficult task for a long time, due to the notoriously difficult problem known as the "curse of dimensionality". This paper introduces a deep learning-based approach that can handle general high-dimensional parabolic PDEs. To this end, the PDEs are reformulated using backward stochastic differential equations and the gradient of the unknown solution is approximated by neural networks, very much in the spirit of deep reinforcement learning with the gradient acting as the policy function. Numerical results on examples including the nonlinear Black-Scholes equation, the Hamilton-Jacobi-Bellman equation, and the Allen-Cahn equation suggest that the proposed algorithm is quite effective in high dimensions, in terms of both accuracy and cost. This opens up new possibilities in economics, finance, operational research, and physics, by considering all participating agents, assets, resources, or particles together at the same time, instead of making ad hoc assumptions on their inter-relationships.
