Quasi-Monte Carlo Methods: What, Why, and How?
Fred J. Hickernell, Nathan Kirk, Aleksei G. Sorokin
TL;DR
Quasi-Monte Carlo methods replace IID sampling with low-discrepancy sequences to accelerate convergence in estimating $μ = \mathbb{E}(Y)$. The paper surveys LD sequence constructions (lattice, digital nets, Halton), randomization techniques, discrepancy measures via RKHS, stopping criteria, and problem reformulation strategies, highlighting how to maximize performance gains. It emphasizes reducing effective dimension through variable transforms, control variates, and multilevel methods, enabling near $O(1/ε)$ sampling costs under favorable conditions. The work also discusses extensions beyond mean estimation to density estimation, PDEs with uncertainty, and applications in graphics, robotics, and ML, and points to practical software implementations for practitioners.
Abstract
Many questions in quantitative finance, uncertainty quantification, and other disciplines are answered by computing the population mean, $μ:= \mathbb{E}(Y)$, where instances of $Y:=f(\boldsymbol{X})$ may be generated by numerical simulation and $\boldsymbol{X}$ has a simple probability distribution. The population mean can be approximated by the sample mean, $\hatμ_n := n^{-1} \sum_{i=0}^{n-1} f(\boldsymbol{x}_i)$ for a well chosen sequence of nodes, $\{\boldsymbol{x}_0, \boldsymbol{x}_1, \ldots\}$ and a sufficiently large sample size, $n$. Computing $μ$ is equivalent to computing a $d$-dimensional integral, $\int f(\boldsymbol{x}) \varrho(\boldsymbol{x}) \, \mathrm{d} \boldsymbol{x}$, where $\varrho$ is the probability density for $\boldsymbol{X}$. Quasi-Monte Carlo methods replace independent and identically distributed sequences of random vector nodes, $\{\boldsymbol{x}_i \}_{i = 0}^{\infty}$, by low discrepancy sequences. This accelerates the convergence of $\hatμ_n$ to $μ$ as $n \to \infty$. This tutorial describes low discrepancy sequences and their quality measures. We demonstrate the performance gains possible with quasi-Monte Carlo methods. Moreover, we describe how to formulate problems to realize the greatest performance gains using quasi-Monte Carlo. We also briefly describe the use of quasi-Monte Carlo methods for problems beyond computing the mean, $μ$.
