Electrostatics-based particle sampling and approximate inference
Yongchao Huang
TL;DR
The paper addresses the challenge of sampling and inference for complex target densities that may be difficult to access with gradients. It introduces EParVI, a deterministic, gradient-free interacting particle system inspired by electrostatics, where negative charges migrate under repulsion and attraction to fixed positives whose magnitudes encode the target density $p(oldsymbol{x})$, driving the particle distribution toward the target. The authors formalize repulsive and attractive forces, propose three discrete-time update rules (Euler, Verlet, damped-Verlet), and demonstrate competitive performance on low-dimensional densities, Neal's funnel, Bayesian logistic regression, and Lotka–Volterra parameter inference, while providing discussion on complexity, diagnostics, and extensions. They argue that this physics-based, non-gradient-based approach offers a simple, non-parametric, extensible alternative to traditional MCMC/VI methods, with potential for broader probabilistic machine learning applications, albeit with notable dimensionality challenges and opportunities for efficiency gains.
Abstract
A new particle-based sampling and approximate inference method, based on electrostatics and Newton mechanics principles, is introduced with theoretical ground, algorithm design and experimental validation. This method simulates an interacting particle system (IPS) where particles, i.e. the freely-moving negative charges and spatially-fixed positive charges with magnitudes proportional to the target distribution, interact with each other via attraction and repulsion induced by the resulting electric fields described by Poisson's equation. The IPS evolves towards a steady-state where the distribution of negative charges conforms to the target distribution. This physics-inspired method offers deterministic, gradient-free sampling and inference, achieving comparable performance as other particle-based and MCMC methods in benchmark tasks of inferring complex densities, Bayesian logistic regression and dynamical system identification. A discrete-time, discrete-space algorithmic design, readily extendable to continuous time and space, is provided for usage in more general inference problems occurring in probabilistic machine learning scenarios such as Bayesian inference, generative modelling, and beyond.
