Linear-Cost Vecchia Approximation of Multivariate Normal Probabilities
Jian Cao, Matthias Katzfuss
TL;DR
This work addresses the computational bottleneck of evaluating high-dimensional MVN probabilities and sampling from TMVN distributions. It introduces VMET, a Vecchia-based reparameterization of the minimax exponential tilting framework, achieving linear-time Monte Carlo sampling and gradient/Hessian estimation while preserving MET's accuracy, even in large-scale problems. Key contributions include a sparse inverse-Cholesky representation via Vecchia, a linear-time solver for the exponential-tilting parameters, a Vecchia-based variable reordering, and optional multi-level MC and regional TMVN sampling for partially censored data. Empirical results on simulations and a 24,701-point groundwater dataset demonstrate substantial speedups, improved accuracy, numerical stability, and practical applicability to partially censored Gaussian-process models; the authors also provide an R package VeccTMVN for public use.
Abstract
Multivariate normal (MVN) probabilities arise in myriad applications, but they are analytically intractable and need to be evaluated via Monte-Carlo-based numerical integration. For the state-of-the-art minimax exponential tilting (MET) method, we show that the complexity of each of its components can be greatly reduced through an integrand parameterization that utilizes the sparse inverse Cholesky factor produced by the Vecchia approximation, whose approximation error is often negligible relative to the Monte-Carlo error. Based on this idea, we derive algorithms that can estimate MVN probabilities and sample from truncated MVN distributions in linear time (and that are easily parallelizable) at the same convergence or acceptance rate as MET, whose complexity is cubic in the dimension of the MVN probability. We showcase the advantages of our methods relative to existing approaches using several simulated examples. We also analyze a groundwater-contamination dataset with over twenty thousand censored measurements to demonstrate the scalability of our method for partially censored Gaussian-process models.
