Vector Copula Variational Inference and Dependent Block Posterior Approximations
Yu Fu, Michael Stanley Smith, Anastasios Panagiotelis
TL;DR
This paper addresses the accuracy limitations of traditional VI that assumes block independence by introducing VCVI, a framework that uses vector copulas to tie together heterogeneous, learnable marginals across blocks. It defines two learnable, scalable families—Gaussian vector copulas (GVC) and Kendall vector copulas (KVC)—and supports two marginal transformation schemes (M1 and M2) implemented as normalizing flows to enable efficient stochastic gradient optimization. Across four example models and 16 datasets, VCVI consistently delivers more accurate posterior approximations than block-independent or factor-based methods, with only modest additional computation and a practical Python package available. The approach is highly modular, allowing users to tailor marginals and between-block dependence to the posterior structure, making it applicable to large-scale econometric and statistical models.
Abstract
The key to VI is the selection of a tractable density to approximate the Bayesian posterior. For large and complex models a common choice is to assume independence between multivariate blocks in a partition of the parameter space. While this simplifies the problem it can reduce accuracy. This paper proposes using vector copulas to capture dependence between the blocks parsimoniously. Tailored multivariate marginals are constructed using learnable transport maps. We call the resulting joint distribution a ``dependent block posterior'' approximation. Vector copula models are suggested that make tractable and flexible variational approximations. They allow for differing marginals, numbers of blocks, block sizes and forms of between block dependence. They also allow for solution of the variational optimization using efficient stochastic gradient methods. The approach is demonstrated using four different statistical models and 16 datasets which have posteriors that are challenging to approximate. This includes models that use global-local shrinkage priors for regularization, and hierarchical models for smoothing and heteroscedastic time series. In all cases, our method produces more accurate posterior approximations than benchmark VI methods that either assume block independence or factor-based dependence, at limited additional computational cost. A python package implementing the method is available on GitHub at https://github.com/YuFuOliver/VCVI_Rep_PyPackage.
