Variational Bayesian Inference with Stochastic Search
John Paisley, David Blei, Michael Jordan
TL;DR
This paper tackles the challenge of variational Bayesian inference with mean-field approximations when not all log-joint expectations are tractable. It introduces stochastic search variational Bayes (SSVB), which directly optimizes the variational lower bound $\mathcal{L}$ using unbiased Monte Carlo gradients and variance reduction via control variates, rather than relying on tightened bounds. The authors demonstrate the method on two nonconjugate models—logistic regression and a finite HDP approximation—showing that suitable control variates, including a second-order Taylor (delta) method, substantially reduce gradient variance and improve the objective with fewer samples. The proposed framework generalizes MFVB inference to nonconjugate settings and can integrate existing bounds as control variates, offering a scalable, flexible approach for Bayesian learning with large or intractable models.
Abstract
Mean-field variational inference is a method for approximate Bayesian posterior inference. It approximates a full posterior distribution with a factorized set of distributions by maximizing a lower bound on the marginal likelihood. This requires the ability to integrate a sum of terms in the log joint likelihood using this factorized distribution. Often not all integrals are in closed form, which is typically handled by using a lower bound. We present an alternative algorithm based on stochastic optimization that allows for direct optimization of the variational lower bound. This method uses control variates to reduce the variance of the stochastic search gradient, in which existing lower bounds can play an important role. We demonstrate the approach on two non-conjugate models: logistic regression and an approximation to the HDP.
