Orthogonal Bootstrap: Efficient Simulation of Input Uncertainty
Kaizhao Liu, Jose Blanchet, Lexing Ying, Yiping Lu
TL;DR
This paper introduces Orthogonal Bootstrap, a debiasing strategy for bootstrap-based uncertainty quantification under input uncertainty. By decomposing the bootstrap target into a non-orthogonal part captured by the influence function (via Infinitesimal Jackknife) and an orthogonal remainder, the method uses the non-orthogonal component as a control variate and simulates only the orthogonal part, dramatically reducing the required Monte Carlo replications. The authors prove that, under mild smoothness assumptions in a kernel mean embedding (Fréchet derivative) framework, the simulation variance drops from $O_p(1/n^{1+ abla})$ for standard bootstrap to $O_p(1/n^{2+ abla})$ for the orthogonal method, enabling $O(1)$ replications. They also extend the approach to variance estimation and provide extensive numerical experiments on debiasing, confidence/prediction interval construction, and real-data prediction, showing improved accuracy and competitiveness with much lower computational cost. The work offers a practical, theoretically grounded path to fast, reliable uncertainty quantification for large-scale applications in statistics and ML.
Abstract
Bootstrap is a popular methodology for simulating input uncertainty. However, it can be computationally expensive when the number of samples is large. We propose a new approach called \textbf{Orthogonal Bootstrap} that reduces the number of required Monte Carlo replications. We decomposes the target being simulated into two parts: the \textit{non-orthogonal part} which has a closed-form result known as Infinitesimal Jackknife and the \textit{orthogonal part} which is easier to be simulated. We theoretically and numerically show that Orthogonal Bootstrap significantly reduces the computational cost of Bootstrap while improving empirical accuracy and maintaining the same width of the constructed interval.
