Bayesian Experimental Design via Contrastive Diffusions
Jacopo Iollo, Christophe Heinkelé, Pierre Alliez, Florence Forbes
TL;DR
CoDiff introduces a pooled-posterior gradient and diffusion-based sampling to scale Bayesian Optimal Experimental Design, enabling efficient, single-loop optimization of the Expected Information Gain. By deriving EIG gradient expressions via a reparameterization trick and employing a pooled posterior as a sampling proxy, it circumvents nested Monte Carlo and leverages diffusion models for data-based priors. The method is demonstrated on sequential source localization and MNIST-based inverse problems, showing substantial gains over state-of-the-art approaches in information gains and posterior quality, and it extends BOED to diffusion-based generative models. While effective, it remains a greedy, likelihood-reliant approach; future work includes nonlinear forward models and simulation-based inference to broaden applicability and non-myopic planning.
Abstract
Bayesian Optimal Experimental Design (BOED) is a powerful tool to reduce the cost of running a sequence of experiments. When based on the Expected Information Gain (EIG), design optimization corresponds to the maximization of some intractable expected contrast between prior and posterior distributions. Scaling this maximization to high dimensional and complex settings has been an issue due to BOED inherent computational complexity. In this work, we introduce a pooled posterior distribution with cost-effective sampling properties and provide a tractable access to the EIG contrast maximization via a new EIG gradient expression. Diffusion-based samplers are used to compute the dynamics of the pooled posterior and ideas from bi-level optimization are leveraged to derive an efficient joint sampling-optimization loop. The resulting efficiency gain allows to extend BOED to the well-tested generative capabilities of diffusion models. By incorporating generative models into the BOED framework, we expand its scope and its use in scenarios that were previously impractical. Numerical experiments and comparison with state-of-the-art methods show the potential of the approach.
