When are Unbiased Monte Carlo Estimators More Preferable than Biased Ones?
Guanyang Wang, Jose Blanchet, Peter W. Glynn
TL;DR
This paper formulates a rigorous framework to compare unbiased and biased Monte Carlo estimators under massively parallel computation. It shows that unbiased methods tend to have favorable completion times, tightly linked to the tail behavior of their per-replication running times, but do not automatically reduce total computational cost relative to biased methods. The framework is applied to multilevel and Markov chain Monte Carlo, revealing that unbiased randomized MLMC (rMLMC) can achieve the optimal $O(\epsilon^{-2})$ cost with favorable completion-time properties, while standard MLMC remains optimal for total cost in non-parallel settings. In MCMC, debiasing via coupling yields unbiased estimators with competitive cost and substantially improved completion-time performance in parallel environments. The numerical case study on Gaussian mean estimation demonstrates the practical benefits of unbiased MCMC under heavy parallelization, offering guidance on when to adopt unbiased versus biased strategies and highlighting opportunities for hybrids and variance reduction techniques.
Abstract
Due to the potential benefits of parallelization, designing unbiased Monte Carlo estimators, primarily in the setting of randomized multilevel Monte Carlo, has recently become very popular in operations research and computational statistics. However, existing work primarily substantiates the benefits of unbiased estimators at an intuitive level or using empirical evaluations. The intuition being that unbiased estimators can be replicated in parallel enabling fast estimation in terms of wall-clock time. This intuition ignores that, typically, bias will be introduced due to impatience because most unbiased estimators necesitate random completion times. This paper provides a mathematical framework for comparing these methods under various metrics, such as completion time and overall computational cost. Under practical assumptions, our findings reveal that unbiased methods typically have superior completion times - the degree of superiority being quantifiable through the tail behavior of their running time distribution - but they may not automatically provide substantial savings in overall computational costs. We apply our findings to Markov Chain Monte Carlo and Multilevel Monte Carlo methods to identify the conditions and scenarios where unbiased methods have an advantage, thus assisting practitioners in making informed choices between unbiased and biased methods.
