Deep Learning Based on Randomized Quasi-Monte Carlo Method for Solving Linear Kolmogorov Partial Differential Equation

Jichang Xiao; Fengjiang Fu; Xiaoqun Wang

Deep Learning Based on Randomized Quasi-Monte Carlo Method for Solving Linear Kolmogorov Partial Differential Equation

Jichang Xiao, Fengjiang Fu, Xiaoqun Wang

TL;DR

This work addresses solving high-dimensional linear Kolmogorov PDEs via deep learning by replacing MC-based data sampling with randomized quasi-Monte Carlo (RQMC) sampling in the empirical risk minimization framework. The authors derive a decomposition of the learning error into approximation and generalization components and prove that, under boundary-growth conditions, the mean generalization error decays as $O(n^{-1+ε})$ for RQMC and as $O(n^{-1/2+ε})$ for MC, with $ε>0$ arbitrarily small. They validate the theory through numerical experiments on the heat equation and Black–Scholes PDEs, showing that RQMC consistently yields smaller relative $L^{2}$ errors and more stable training, especially in lower dimensions. The results suggest that using scrambled digital nets in DL-based PDE solvers can substantially improve accuracy and efficiency for high-dimensional problems, though gains diminish with increasing dimension.

Abstract

Deep learning algorithms have been widely used to solve linear Kolmogorov partial differential equations~(PDEs) in high dimensions, where the loss function is defined as a mathematical expectation. We propose to use the randomized quasi-Monte Carlo (RQMC) method instead of the Monte Carlo (MC) method for computing the loss function. In theory, we decompose the error from empirical risk minimization~(ERM) into the generalization error and the approximation error. Notably, the approximation error is independent of the sampling methods. We prove that the convergence order of the mean generalization error for the RQMC method is $O(n^{-1+ε})$ for arbitrarily small $ε>0$, while for the MC method it is $O(n^{-1/2+ε})$ for arbitrarily small $ε>0$. Consequently, we find that the overall error for the RQMC method is asymptotically smaller than that for the MC method as $n$ increases. Our numerical experiments show that the algorithm based on the RQMC method consistently achieves smaller relative $L^{2}$ error than that based on the MC method.

Deep Learning Based on Randomized Quasi-Monte Carlo Method for Solving Linear Kolmogorov Partial Differential Equation

TL;DR

for RQMC and as

for MC, with

arbitrarily small. They validate the theory through numerical experiments on the heat equation and Black–Scholes PDEs, showing that RQMC consistently yields smaller relative

errors and more stable training, especially in lower dimensions. The results suggest that using scrambled digital nets in DL-based PDE solvers can substantially improve accuracy and efficiency for high-dimensional problems, though gains diminish with increasing dimension.

Abstract

for arbitrarily small

, while for the MC method it is

for arbitrarily small

. Consequently, we find that the overall error for the RQMC method is asymptotically smaller than that for the MC method as

increases. Our numerical experiments show that the algorithm based on the RQMC method consistently achieves smaller relative

error than that based on the MC method.

Paper Structure (15 sections, 15 theorems, 106 equations, 6 figures, 2 tables)

This paper contains 15 sections, 15 theorems, 106 equations, 6 figures, 2 tables.

Introduction
Preliminaries
General deep learning framework for solving linear Kolmogorov PDEs
Randomized quasi-Monte Carlo methods
Analysis of the estimation error
Main Results
RQMC-based convergence rate
MC-based convergence rate
Applications for linear Kolmogorov PDE with affine drift and diffusion
Numerical Experiments
Heat equation
Black-Scholes model
Conclusion
Low variation extension
Rademacher complexity

Key Result

Lemma 2.1

Let $d \in \mathbb{N}$, $T,L \in (0,\infty)$, $a\in \mathbb{R}$, $b \in (a,\infty)$, let $\mu(x)$ and $\sigma(x)$ in eq:target satisfy for every $x, y \in \mathbb{R}^d$ that where $\left\Vert\cdot\right\Vert_{2}$ is the Euclidean norm and $\left\Vert\cdot\right\Vert_{HS}$ is the Hilbert-Schmidt norm. Let the function $u^*(t,x) \in C^{1,2}([0,T]\times\mathbb{R}^{d},\mathbb{R})$ be the solution of

Figures (6)

Figure 1: Average relative $L^{2}$ error vs. batchsize for solving heat equation in dimension 5 and 50.
Figure 2: The relative $L^{2}$ error of the training process for solving heat equation in dimension 5 and 50, the batchsize is chosen to be $2^{18}$.
Figure 3: The projection on $(0,1)^{2}$ with $x_{i} = 1/2,i=3,\dots,50$. The exact solution and the absolute error are calculated point-wisely on a $50\times 50$ uniform grid.
Figure 4: Average relative $L^{2}$ error vs. batchsize for solving Black-Scholes PDE in dimension 5 and 50.
Figure 5: The relative $L^{2}$ error of the training process for solving Black-Scholes PDE in dimension 5 and 50, the batchsize is chosen to be $2^{18}$.
...and 1 more figures

Theorems & Definitions (25)

Lemma 2.1
Lemma 2.2
proof
Lemma 2.3
proof
Theorem 3.1
proof
Lemma 3.1
proof
Theorem 3.2
...and 15 more

Deep Learning Based on Randomized Quasi-Monte Carlo Method for Solving Linear Kolmogorov Partial Differential Equation

TL;DR

Abstract

Deep Learning Based on Randomized Quasi-Monte Carlo Method for Solving Linear Kolmogorov Partial Differential Equation

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (6)

Theorems & Definitions (25)