Tensor train based sampling algorithms for approximating regularized Wasserstein proximal operators
Fuqun Han, Stanley Osher, Wuchen Li
TL;DR
This work develops a tensor-train (TT) based sampling framework that leverages a regularized Wasserstein proximal operator to approximate the density evolution of overdamped Langevin dynamics in high dimensions. By expressing the kernel through a TT representation, the authors achieve scalable computation and storage benefits, with rigorous unbiasedness and linear convergence demonstrated in the Gaussian setting. They introduce TT-BRWP, a noise-free, unbiased variant that uses a carefully chosen covariance update to stabilize density estimation, and provide theoretical analyses for Gaussian and simplified Bayesian inverse problems along with practical computational considerations. Extensive numerical experiments across Gaussian, multimodal, nonconvex, and Bayesian inverse problems show that TT-BRWP outperforms classical Langevin-type samplers and BRWP with MC integration in accuracy, convergence speed, and robustness. The proposed method has the potential to impact high-dimensional Bayesian inference and inverse problems by enabling efficient, accurate sampling in settings where traditional MCMC approaches struggle.
Abstract
We present a tensor train (TT) based algorithm designed for sampling from a target distribution and employ TT approximation to capture the high-dimensional probability density evolution of overdamped Langevin dynamics. This involves utilizing the regularized Wasserstein proximal operator, which exhibits a simple kernel integration formulation, i.e., the softmax formula of the traditional proximal operator. The integration, performed in $\mathbb{R}^d$, poses a challenge in practical scenarios, making the algorithm practically implementable only with the aid of TT approximation. In the specific context of Gaussian distributions, we rigorously establish the unbiasedness and linear convergence of our sampling algorithm towards the target distribution. To assess the effectiveness of our proposed methods, we apply them to various scenarios, including Gaussian families, Gaussian mixtures, bimodal distributions, and Bayesian inverse problems in numerical examples. The sampling algorithm exhibits superior accuracy and faster convergence when compared to classical Langevin dynamics-type sampling algorithms.
