Online Bayesian Experimental Design for Partially Observed Dynamical Systems
Sara Pérez-Vieites, Sahel Iqbal, Simo Särkkä, Dominik Baumann
TL;DR
This work addresses Bayesian adaptive design for partially observable dynamical systems by deriving EIG estimators and gradients that marginalize latent states, enabling online, gradient-based design. The authors integrate nested particle filters to jointly infer states and parameters online, achieving linear-time scaling and providing consistency guarantees. The BAD-PODS framework combines these estimators with online SGD to optimize continuous designs while updating the posterior via an online NPF, demonstrated on two realistic tasks (a two-group SIR model and moving-source localization). The results show that online adaptation substantially improves information gain and design quality compared to random or static designs, with practical implications for adaptive experiments in complex dynamical settings.
Abstract
Bayesian experimental design (BED) provides a principled framework for optimizing data collection, but existing approaches do not apply to crucial real-world settings such as dynamical systems with partial observability, where only noisy and incomplete observations are available. These systems are naturally modeled as state-space models (SSMs), where latent states mediate the link between parameters and data, making the likelihood -- and thus information-theoretic objectives like the expected information gain (EIG) -- intractable. In addition, the dynamical nature of the system requires online algorithms that update posterior distributions and select designs sequentially in a computationally efficient manner. We address these challenges by deriving new estimators of the EIG and its gradient that explicitly marginalize latent states, enabling scalable stochastic optimization in nonlinear SSMs. Our approach leverages nested particle filters (NPFs) for efficient online inference with convergence guarantees. Applications to realistic models, such as the susceptible-infected-recovered (SIR) and a moving source location task, show that our framework successfully handles both partial observability and online computation.
