Delta-AI: Local objectives for amortized inference in sparse graphical models
Jean-Pierre Falet, Hae Beom Lee, Nikolay Malkin, Chen Sun, Dragos Secrieru, Thomas Jiralerspong, Dinghuai Zhang, Guillaume Lajoie, Yoshua Bengio
TL;DR
Δ-AI introduces a local, structure-aware objective for amortized inference in sparse PGMs by enforcing local equality of conditional distributions between a Markov network and a chordal Bayesian network. By training a single parametric sampler with losses that depend only on a variable and its Markov blanket, it achieves fast, off-policy learning and can amortize over multiple DAG orders. The approach yields faster wall-clock convergence than traditional GFlowNets and outperforms unstructured amortized methods and MCMC in synthetic experiments, while enabling partial-subset inference in real-data VAEs. The work also connects to continuous-space score matching and offers a bilevel training framework where the amortized sampler aids the training of energy-based or latent-variable models. Overall, Δ-AI provides a scalable, locality-driven path to accurate amortized inference in sparse graphical models with practical implications for deep generative modeling and structure-aware learning.
Abstract
We present a new algorithm for amortized inference in sparse probabilistic graphical models (PGMs), which we call $Δ$-amortized inference ($Δ$-AI). Our approach is based on the observation that when the sampling of variables in a PGM is seen as a sequence of actions taken by an agent, sparsity of the PGM enables local credit assignment in the agent's policy learning objective. This yields a local constraint that can be turned into a local loss in the style of generative flow networks (GFlowNets) that enables off-policy training but avoids the need to instantiate all the random variables for each parameter update, thus speeding up training considerably. The $Δ$-AI objective matches the conditional distribution of a variable given its Markov blanket in a tractable learned sampler, which has the structure of a Bayesian network, with the same conditional distribution under the target PGM. As such, the trained sampler recovers marginals and conditional distributions of interest and enables inference of partial subsets of variables. We illustrate $Δ$-AI's effectiveness for sampling from synthetic PGMs and training latent variable models with sparse factor structure.
