Active Learning with Fully Bayesian Neural Networks for Discontinuous and Nonstationary Data
Maxim Ziatdinov
TL;DR
This work tackles active learning in physical sciences where data are scarce and system behavior exhibits discontinuities and nonstationarity. It advocates Fully Bayesian Neural Networks (FBNNs) trained with No-U-Turn Sampling as a probabilistic surrogate, offering improved uncertainty quantification over traditional Gaussian Processes (GPs). Through a battery of 1D and 2D test problems modeling phase transitions and nonstationary dynamics, FBNNs demonstrate competitive or superior predictive accuracy and uncertainty estimates, particularly in regions with abrupt changes. The findings suggest FBNNs are a practical, more reliable option for autonomous experimentation and modeling in small-data, low-to-moderate dimensional problems, with potential for extension to higher-dimensional applications.
Abstract
Active learning optimizes the exploration of large parameter spaces by strategically selecting which experiments or simulations to conduct, thus reducing resource consumption and potentially accelerating scientific discovery. A key component of this approach is a probabilistic surrogate model, typically a Gaussian Process (GP), which approximates an unknown functional relationship between control parameters and a target property. However, conventional GPs often struggle when applied to systems with discontinuities and non-stationarities, prompting the exploration of alternative models. This limitation becomes particularly relevant in physical science problems, which are often characterized by abrupt transitions between different system states and rapid changes in physical property behavior. Fully Bayesian Neural Networks (FBNNs) serve as a promising substitute, treating all neural network weights probabilistically and leveraging advanced Markov Chain Monte Carlo techniques for direct sampling from the posterior distribution. This approach enables FBNNs to provide reliable predictive distributions, crucial for making informed decisions under uncertainty in the active learning setting. Although traditionally considered too computationally expensive for 'big data' applications, many physical sciences problems involve small amounts of data in relatively low-dimensional parameter spaces. Here, we assess the suitability and performance of FBNNs with the No-U-Turn Sampler for active learning tasks in the 'small data' regime, highlighting their potential to enhance predictive accuracy and reliability on test functions relevant to problems in physical sciences.
