Optimal Particle-based Approximation of Discrete Distributions (OPAD)

Hadi Mohasel Afshar; Gilad Francis; Sally Cripps

Optimal Particle-based Approximation of Discrete Distributions (OPAD)

Hadi Mohasel Afshar, Gilad Francis, Sally Cripps

TL;DR

This work proves that for discrete target distributions, the KL divergence of any particle-based approximation is minimized when particle weights are proportional to the target probabilities, defining the Optimal Particle-based Approximation of Discrete Distributions (OPAD). The authors establish a main theorem showing the minimum $D_{KL}$ equals $-\log(\pi^*(\mathcal{X}^P))$ and provide a Jensen-based proof, with OPAD+ further leveraging rejected proposals. The approach requires no extra computation and can be applied to existing MCMC outputs by reweighting, yielding substantial reductions in approximation error. Empirical evaluations on Ising models, Bayesian Variable Selection, and Bayesian Structure Learning demonstrate that OPAD/OPAD+ consistently outperform standard MCMC in KL divergence by orders of magnitude, highlighting the practical impact for high-dimensional discrete inference.

Abstract

Particle-based methods include a variety of techniques, such as Markov Chain Monte Carlo (MCMC) and Sequential Monte Carlo (SMC), for approximating a probabilistic target distribution with a set of weighted particles. In this paper, we prove that for any set of particles, there is a unique weighting mechanism that minimizes the Kullback-Leibler (KL) divergence of the (particle-based) approximation from the target distribution, when that distribution is discrete -- any other weighting mechanism (e.g. MCMC weighting that is based on particles' repetitions in the Markov chain) is sub-optimal with respect to this divergence measure. Our proof does not require any restrictions either on the target distribution, or the process by which the particles are generated, other than the discreteness of the target. We show that the optimal weights can be determined based on values that any existing particle-based method already computes; As such, with minimal modifications and no extra computational costs, the performance of any particle-based method can be improved. Our empirical evaluations are carried out on important applications of discrete distributions including Bayesian Variable Selection and Bayesian Structure Learning. The results illustrate that our proposed reweighting of the particles improves any particle-based approximation to the target distribution consistently and often substantially.

Optimal Particle-based Approximation of Discrete Distributions (OPAD)

TL;DR

Abstract

Optimal Particle-based Approximation of Discrete Distributions (OPAD)

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (4)

Theorems & Definitions (3)