Accelerating Particle-based Energetic Variational Inference
Xuelian Bao, Lulu Kang, Chun Liu, Yiwei Wang
TL;DR
This work targets efficient particle-based variational inference (ParVI) by accelerating the Energetic Variational Inference (EVI) framework. It introduces ImEQ, an implicit scheme that applies energy quadratization to only part of the objective, reducing the frequency of inter-particle term evaluations while preserving energy stability in a modified energy. The method decomposes the objective into $F=G+H$ with $G$ bounded below and quadratizes $G$ via an auxiliary variable $r=q(z)$, yielding a coupled, well-conditioned update involving a positive-definite operator ${oldsymbol{B}}^n$ and a single inner optimization per time step. Numerical experiments across toy distributions, Bayesian logistic regression, and Bayesian neural networks demonstrate that ImEQ achieves comparable accuracy to EVI-Im with substantially lower CPU time, and generally outperforms AEGD and AdaGrad in robustness and efficiency. The results suggest ImEQ as a practical, extensible approach for accelerating gradient-based sampling in high-dimensional, interacting-particle settings.
Abstract
In this work, we propose a novel particle-based variational inference (ParVI) method that accelerates the EVI-Im. Inspired by energy quadratization (EQ) and operator splitting techniques for gradient flows, our approach efficiently drives particles towards the target distribution. Unlike EVI-Im, which employs the implicit Euler method to solve variational-preserving particle dynamics for minimizing the KL divergence, derived using a "discretize-then-variational" approach, the proposed algorithm avoids repeated evaluation of inter-particle interaction terms, significantly reducing computational cost. The framework is also extensible to other gradient-based sampling techniques. Through several numerical experiments, we demonstrate that our method outperforms existing ParVI approaches in efficiency, robustness, and accuracy.
