Bayesian Entropy Neural Networks for Physics-Aware Prediction
Rahul Rathnakumar, Jiayu Huang, Hao Yan, Yongming Liu
TL;DR
The paper tackles the challenge of embedding physical and informational constraints into deep neural surrogates while preserving meaningful uncertainty quantification. It introduces Bayesian Entropy Neural Networks (BENN), which merge maximum entropy principles with Bayesian neural networks and employ variational inference plus the Modified Differential Method of Multipliers to enforce multiple, potentially conflicting constraints on predictions, derivatives, and predictive variance. The approach is demonstrated on 1D regression, beam deflection, and microstructure generation via a BE-CVAE, showing improved constraint satisfaction and competitive performance relative to constrained baselines. This framework offers a principled, scalable route to physics-aware, uncertainty-aware surrogate modeling with broad applicability in engineering and materials science.
Abstract
This paper addresses the need for deep learning models to integrate well-defined constraints into their outputs, driven by their application in surrogate models, learning with limited data and partial information, and scenarios requiring flexible model behavior to incorporate non-data sample information. We introduce Bayesian Entropy Neural Networks (BENN), a framework grounded in Maximum Entropy (MaxEnt) principles, designed to impose constraints on Bayesian Neural Network (BNN) predictions. BENN is capable of constraining not only the predicted values but also their derivatives and variances, ensuring a more robust and reliable model output. To achieve simultaneous uncertainty quantification and constraint satisfaction, we employ the method of multipliers approach. This allows for the concurrent estimation of neural network parameters and the Lagrangian multipliers associated with the constraints. Our experiments, spanning diverse applications such as beam deflection modeling and microstructure generation, demonstrate the effectiveness of BENN. The results highlight significant improvements over traditional BNNs and showcase competitive performance relative to contemporary constrained deep learning methods.
