Uncertainty quantification of neural network models of evolving processes via Langevin sampling
Cosmin Safta, Reese E. Jones, Ravi G. Patel, Raelynn Wonnacot, Dan S. Bolintineanu, Craig M. Hamel, Sharlotte L. B. Kramer
TL;DR
This work addresses uncertainty quantification for history-dependent processes modeled by neural ordinary differential equations, coupling a data model to a trainable weight sampler within a differentiable hypernetwork. It uses Langevin sampling to draw posterior weight ensembles with a learnable score-based drift, enabling flexible trade-offs between data-model cost and posterior accuracy, and combines this with an ELBO objective and a pathwise KL bound. The method is demonstrated on chemical kinetics and material physics problems, showing advantages over variational inference and enabling efficient Bayesian last-layer variants. The approach supports epistemic and aleatory uncertainty quantification and is designed to be integrable with larger simulation workflows and experimental design settings.
Abstract
We propose a scalable, approximate inference hypernetwork framework for a general model of history-dependent processes. The flexible data model is based on a neural ordinary differential equation (NODE) representing the evolution of internal states together with a trainable observation model subcomponent. The posterior distribution corresponding to the data model parameters (weights and biases) follows a stochastic differential equation with a drift term related to the score of the posterior that is learned jointly with the data model parameters. This Langevin sampling approach offers flexibility in balancing the computational budget between the evaluation cost of the data model and the approximation of the posterior density of its parameters. We demonstrate performance of the ensemble sampling hypernetwork on chemical reaction and material physics data and compare it to standard variational inference.
