Ai-Sampler: Adversarial Learning of Markov kernels with involutive maps
Evgenii Egorov, Ricardo Valperga, Efstratios Gavves
TL;DR
Ai-Sampler proposes an adversarial MCMC framework in which transition kernels are parameterized by involutive maps built from time-reversible neural networks, ensuring detailed balance by construction. The core idea is to train a deterministic involutive proposal via a discriminator that approximates the density ratio, with a bootstrap process to progressively improve sampling quality; the objective upper-bounds the total variation distance to the target using Pinsker’s inequality. A $C_2$-equivariant discriminator enforces symmetry under the involution, and two discriminator parameterizations are offered: a product form and a more general linear–nonlinear composition. Empirical results on 2D multimodal densities and Bayesian logistic regression demonstrate competitive ESS and favorable running times, with strong mixing and scalability on accelerators, highlighting Ai-Sampler as a robust alternative to baselines like HMC and NICE-based methods.
Abstract
Markov chain Monte Carlo methods have become popular in statistics as versatile techniques to sample from complicated probability distributions. In this work, we propose a method to parameterize and train transition kernels of Markov chains to achieve efficient sampling and good mixing. This training procedure minimizes the total variation distance between the stationary distribution of the chain and the empirical distribution of the data. Our approach leverages involutive Metropolis-Hastings kernels constructed from reversible neural networks that ensure detailed balance by construction. We find that reversibility also implies $C_2$-equivariance of the discriminator function which can be used to restrict its function space.
