Fast Gradient-Based Inference with Continuous Latent Variable Models in Auxiliary Form
Diederik P Kingma
TL;DR
The paper tackles the challenge of efficient gradient-based inference in Bayesian networks with many continuous latent layers, where exact inference is intractable. It introduces an auxiliary-form transformation that replaces latent variables with conditionally deterministic counterparts and auxiliary variables, yielding larger Markov blankets and faster gradient updates. The method is shown to be equivalent to the original model after marginalization and is implemented with practical steps, including generating functions and inversion-based sampling schemes. Empirical results on MNIST-based generative networks and dynamical Bayesian networks demonstrate substantial speedups in MAP inference, validating the approach and its potential for scalable gradient-based learning in deep latent structures.
Abstract
We propose a technique for increasing the efficiency of gradient-based inference and learning in Bayesian networks with multiple layers of continuous latent vari- ables. We show that, in many cases, it is possible to express such models in an auxiliary form, where continuous latent variables are conditionally deterministic given their parents and a set of independent auxiliary variables. Variables of mod- els in this auxiliary form have much larger Markov blankets, leading to significant speedups in gradient-based inference, e.g. rapid mixing Hybrid Monte Carlo and efficient gradient-based optimization. The relative efficiency is confirmed in ex- periments.
