Path-Guided Particle-based Sampling
Mingzhou Fan, Ruida Zhou, Chao Tian, Xiaoning Qian
TL;DR
This work tackles multimodal posterior inference in Bayesian settings by introducing Path-Guided Particle-based Sampling (PGPS), which steers particles along a partition-free density path from an initial distribution to the target using a neural network to learn the guiding vector field. The core novelty is the Log-weighted Shrinkage (LwS) density path, which enables efficient mode discovery and coverage of the target distribution, coupled with a PDE-inspired training objective for the vector field. The authors prove a Wasserstein-distance bound between PGPS outputs and the target that decomposes into approximation and discretization errors, and demonstrate improved mode search, weight recovery, and calibration over SVGD, Langevin dynamics, and related methods in synthetic Gaussian mixtures and Bayesian neural network tasks, including UCI benchmarks and noisy MNIST. They also discuss a training-free variant based on Langevin steps and outline future directions for density-path design and convergence analysis. Overall, PGPS provides a principled, path-guided alternative to traditional gradient-flow samplers for efficient, multimodal Bayesian inference with practical gains in accuracy and uncertainty calibration.
Abstract
Particle-based Bayesian inference methods by sampling from a partition-free target (posterior) distribution, e.g., Stein variational gradient descent (SVGD), have attracted significant attention. We propose a path-guided particle-based sampling~(PGPS) method based on a novel Log-weighted Shrinkage (LwS) density path linking an initial distribution to the target distribution. We propose to utilize a Neural network to learn a vector field motivated by the Fokker-Planck equation of the designed density path. Particles, initiated from the initial distribution, evolve according to the ordinary differential equation defined by the vector field. The distribution of these particles is guided along a density path from the initial distribution to the target distribution. The proposed LwS density path allows for an efficient search of modes of the target distribution while canonical methods fail. We theoretically analyze the Wasserstein distance of the distribution of the PGPS-generated samples and the target distribution due to approximation and discretization errors. Practically, the proposed PGPS-LwS method demonstrates higher Bayesian inference accuracy and better calibration ability in experiments conducted on both synthetic and real-world Bayesian learning tasks, compared to baselines, such as SVGD and Langevin dynamics, etc.
