Variational Transdimensional Inference
Laurence Davies, Dan Mackinlay, Rafael Oliveira, Scott A. Sisson
TL;DR
This work tackles variational inference over transdimensional spaces by introducing CoSMIC, a contextually masked normalizing flow that enables a single variational density to approximate a transdimensional posterior $\pi(m,\boldsymbol{\theta}_m)$. A dimension-saturation trick augments all model-specific parameter spaces to a common dimension $d_{\max}$ with auxiliary variables, enabling exact factorization and tractable training. The authors present two training pathways: a GP surrogate-based approach with UCB to approximate model weights and scalable parametric mass functions via MADE for large model spaces, plus Monte Carlo gradient estimators with variance reduction for discrete variables. They provide theoretical convergence guarantees under sub-Gaussian noise and demonstrate strong performance on robust variable selection and non-linear DAG discovery with high-cardinality model spaces, indicating practical applicability to model selection and structure learning tasks. Overall, the method broadens the applicability of flow-based VI to transdimensional problems and offers scalable strategies for complex model spaces.
Abstract
The expressiveness of flow-based models combined with stochastic variational inference (SVI) has expanded the application of optimization-based Bayesian inference to highly complex problems. However, despite the importance of multi-model Bayesian inference for problems defined on a transdimensional joint model and parameter space, such as Bayesian structure learning and model selection, flow-based SVI has been limited to problems defined on a fixed-dimensional parameter space. We introduce CoSMIC, normalizing flows (COntextually-Specified Masking for Identity-mapped Components), an extension to neural autoregressive conditional normalizing flow architectures that enables use of a single flow-based variational density for inference over a transdimensional (multi-model) conditional target distribution. We propose a combined stochastic variational transdimensional inference (VTI) approach to training CoSMIC, flows using ideas from Bayesian optimization and Monte Carlo gradient estimation. Numerical experiments show the performance of VTI on challenging problems that scale to high-cardinality model spaces.
