Generalization Error Analysis of Deep Backward Dynamic Programming for Solving Nonlinear PDEs
Du Ouyang, Jichang Xiao, Xiaoqun Wang
TL;DR
The paper analyzes how generalization error contributes to the total error in solving high-dimensional nonlinear PDEs via Deep Backward Dynamic Programming (DBDP) and shows that quasi-Monte Carlo (QMC) sampling yields a superior convergence rate for the generalization error compared to Monte Carlo (MC). By formulating PDEs as BSDEs and training neural networks to approximate the solution and its gradient, the authors decompose the total error into scheme, approximation, and generalization components, proving that QMC achieves an $O(m^{-1+ ext{ε}})$ rate under suitable conditions. They provide rigorous MC-based bounds and show that, under certain approximation assumptions, MC can break the curse of dimensionality, while QMC further accelerates convergence. Numerical experiments across nonlinear heat, HJB, and nonlinear Black-Scholes equations corroborate that QMC-based training yields smaller errors and lower variance than MC, confirming the practical benefits for high-dimensional nonlinear PDEs.
Abstract
We explore the application of the quasi-Monte Carlo (QMC) method in deep backward dynamic programming (DBDP) (Hure et al. 2020) for numerically solving high-dimensional nonlinear partial differential equations (PDEs). Our study focuses on examining the generalization error as a component of the total error in the DBDP framework, discovering that the rate of convergence for the generalization error is influenced by the choice of sampling methods. Specifically, for a given batch size $m$, the generalization error under QMC methods exhibits a convergence rate of $O(m^{-1+\varepsilon})$, where $\varepsilon$ can be made arbitrarily small. This rate is notably more favorable than that of the traditional Monte Carlo (MC) methods, which is $O(m^{-1/2+\varepsilon})$. Our theoretical analysis shows that the generalization error under QMC methods achieves a higher order of convergence than their MC counterparts. Numerical experiments demonstrate that QMC indeed surpasses MC in delivering solutions that are both more precise and stable.
