Deep Learning Based on Randomized Quasi-Monte Carlo Method for Solving Linear Kolmogorov Partial Differential Equation
Jichang Xiao, Fengjiang Fu, Xiaoqun Wang
TL;DR
This work addresses solving high-dimensional linear Kolmogorov PDEs via deep learning by replacing MC-based data sampling with randomized quasi-Monte Carlo (RQMC) sampling in the empirical risk minimization framework. The authors derive a decomposition of the learning error into approximation and generalization components and prove that, under boundary-growth conditions, the mean generalization error decays as $O(n^{-1+ε})$ for RQMC and as $O(n^{-1/2+ε})$ for MC, with $ε>0$ arbitrarily small. They validate the theory through numerical experiments on the heat equation and Black–Scholes PDEs, showing that RQMC consistently yields smaller relative $L^{2}$ errors and more stable training, especially in lower dimensions. The results suggest that using scrambled digital nets in DL-based PDE solvers can substantially improve accuracy and efficiency for high-dimensional problems, though gains diminish with increasing dimension.
Abstract
Deep learning algorithms have been widely used to solve linear Kolmogorov partial differential equations~(PDEs) in high dimensions, where the loss function is defined as a mathematical expectation. We propose to use the randomized quasi-Monte Carlo (RQMC) method instead of the Monte Carlo (MC) method for computing the loss function. In theory, we decompose the error from empirical risk minimization~(ERM) into the generalization error and the approximation error. Notably, the approximation error is independent of the sampling methods. We prove that the convergence order of the mean generalization error for the RQMC method is $O(n^{-1+ε})$ for arbitrarily small $ε>0$, while for the MC method it is $O(n^{-1/2+ε})$ for arbitrarily small $ε>0$. Consequently, we find that the overall error for the RQMC method is asymptotically smaller than that for the MC method as $n$ increases. Our numerical experiments show that the algorithm based on the RQMC method consistently achieves smaller relative $L^{2}$ error than that based on the MC method.
