Optimizing Likelihoods via Mutual Information: Bridging Simulation-Based Inference and Bayesian Optimal Experimental Design
Vincent D. Zaballa, Elliot E. Hui
TL;DR
The paper addresses the challenge of integrating Simulation-Based Inference (SBI) with Bayesian Optimal Experimental Design (BOED) by formulating a mutual information objective, and introduces the InfoNCE-λ bound to jointly optimize experimental designs and amortized SBI surrogates for non-differentiable simulators. It establishes theoretical connections between SBI objectives and MI-based design gains, and presents a practical framework that uses a design distribution to stabilize gradient-based optimization. The authors demonstrate improved calibration and predictive accuracy over baselines on SBI-based epidemiology and Bone Morphogenetic Protein (BMP) models, while detailing trade-offs between MI tightness and likelihood fidelity through the regularization parameter λ. The work broadens the applicability of BOED to SBI tasks and provides concrete guidelines (design distributions, checkpoints, and calibration metrics) to ensure robust, interpretable experimental design decisions in complex scientific simulators.
Abstract
Simulation-based inference (SBI) is a method to perform inference on a variety of complex scientific models with challenging inference (inverse) problems. Bayesian Optimal Experimental Design (BOED) aims to efficiently use experimental resources to make better inferences. Various stochastic gradient-based BOED methods have been proposed as an alternative to Bayesian optimization and other experimental design heuristics to maximize information gain from an experiment. We demonstrate a link via mutual information bounds between SBI and stochastic gradient-based variational inference methods that permits BOED to be used in SBI applications as SBI-BOED. This link allows simultaneous optimization of experimental designs and optimization of amortized inference functions. We evaluate the pitfalls of naive design optimization using this method in a standard SBI task and demonstrate the utility of a well-chosen design distribution in BOED. We compare this approach on SBI-based models in real-world simulators in epidemiology and biology, showing notable improvements in inference.
