Fast Compute via MC Boosting
Sarah Polson, Vadim Sokolov
TL;DR
The paper addresses the bottleneck of repeated linear solves in modern statistical learning by proposing Monte Carlo boosting, a framework that leverages Neumann-series representations and random-walk estimators to efficiently estimate selected solution components or linear functionals. It unifies forward and adjoint Monte Carlo estimators, and connects Halton-style sequential residual correction with IRLS-like updates used in data augmentation and EM/ECM algorithms. Through theoretical formulations and empirical scaling studies, the work delineates regimes where plain MC, exact sequential MC, and subsampled sequential MC offer compute advantages over traditional direct or stationary methods. The results demonstrate that sequential correction can dramatically reduce compute for large-scale problems or when only a subset of outputs is required, with variance reduction and trajectory reuse playing key roles in practical performance. This framework enables practitioners to trade accuracy for compute in a principled way and suggests concrete extensions to structured matrices and nonlinear/incremental inference workflows.
Abstract
Modern training and inference pipelines in statistical learning and deep learning repeatedly invoke linear-system solves as inner loops, yet high-accuracy deterministic solvers can be prohibitively expensive when solves must be repeated many times or when only partial information (selected components or linear functionals) is required. We position \emph{Monte Carlo boosting} as a practical alternative in this regime, surveying random-walk estimators and sequential residual correction in a unified notation (Neumann-series representation, forward/adjoint estimators, and Halton-style sequential correction), with extensions to overdetermined/least-squares problems and connections to IRLS-style updates in data augmentation and EM/ECM algorithms. Empirically, we compare Jacobi and Gauss--Seidel iterations with plain Monte Carlo, exact sequential Monte Carlo, and a subsampled sequential variant, illustrating scaling regimes that motivate when Monte Carlo boosting can be an enabling compute primitive for modern statistical learning workflows.
