General-purpose post-sampling reweighting method for multimodal target measures
Pierre Monmarché
TL;DR
The paper tackles the problem of obtaining correct relative weights for multiple modes after sampling a multimodal distribution, focusing on a purely post-sampling reweighting step. It formulates a variational-inference-like objective that minimizes the KL divergence between a weighted empirical mixture $\pi(p)=\sum_k p_k\nu_k$ and the target measure $\mu$, with $\nu_k$ representing cluster-local densities. The method optimizes the weights $p$ in the simplex via projected exponential gradient descent, using stochastic estimates of gradients from cluster data, and it provides exact or near-exact solutions in the disjoint-support case. Through extensive numerical experiments on Gaussian mixtures, tempered Langevin, and high-dimensional setups, the approach demonstrates reliable weight recovery and improved estimations, especially when cluster supports are well separated. The proposed post-sampling VI-style reweighting offers a practical, sample-efficient tool for correcting mode weights without generating new samples, enabling accurate observable estimates from existing multimodal samples in moderate dimensions and with multiple modes.
Abstract
When sampling multi-modal probability distributions, correctly estimating the relative probability of each mode, even when the modes have been discovered and locally sampled, remains challenging. We test a simple reweighting scheme designed for this situation, which consists in minimizing (in terms of weights) the Kullback-Leibler divergence of a weighted (regularized) empirical distribution of the samples with respect to the target measure.
