Uncertainty-Aware Diagnostics for Physics-Informed Machine Learning
Mara Daniels, Liam Hodgkinson, Michael Mahoney
TL;DR
This work addresses multi-objective model selection in physics-informed machine learning (PIML) by introducing the Physics-Informed Log Evidence (PILE) within a Gaussian-process (GP) based Physics-Informed Kernel Learning (PIKL) framework. PILE combines data fidelity, physics constraints, and regularization into a single uncertainty-aware metric derived from the GP Bayes free energy, which supports hyperparameter tuning and kernel selection, including kernel assessment without data. The authors demonstrate that PILE can guide the choice of bandwidth, regularization weights, and kernel, and can diagnose model misspecification. In the data-free limit, PILE relates to Fredholm determinants, which give a principled criterion for choosing a kernel before any data are acquired. Case studies on Poisson and convection PDEs illustrate how PILE can improve predictive accuracy and physics adherence in PIML, and the authors anticipate applicability to nonlinear operators and to settings beyond the GP formalism.
Abstract
Physics-informed machine learning (PIML) integrates prior physical information, often in the form of differential equation constraints, into the process of fitting machine learning models to physical data. Popular PIML approaches, including neural operators, physics-informed neural networks, neural ordinary differential equations, and neural discrete equilibria, are typically fit to objectives that simultaneously include both data and physical constraints. However, the multi-objective nature of this approach creates ambiguity in the measurement of model quality. This is related to a poor understanding of epistemic uncertainty, and it can lead to surprising failure modes, even when existing statistical metrics suggest strong fits. Working within a Gaussian process regression framework, we introduce the Physics-Informed Log Evidence (PILE) score. Bypassing the ambiguities of test losses, the PILE score is a single, uncertainty-aware metric that provides a selection principle for hyperparameters of a PIML model. We show that PILE minimization yields excellent choices for a wide variety of model parameters, including kernel bandwidth, least squares regularization weights, and even kernel function selection. We also show that, even prior to data acquisition, a special 'data-free' case of the PILE score identifies a priori kernel choices that are 'well-adapted' to a given PDE. Beyond the kernel setting, we anticipate that the PILE score can be extended to PIML at large, and we outline approaches to do so.
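To make the idea of evidence-based hyperparameter selection concrete, the following is a minimal sketch of the standard GP negative log marginal likelihood (negative log evidence) used to pick a kernel bandwidth. It is not the PILE score itself, which per the abstract also accounts for physical constraints; it only illustrates the general principle of minimizing a single evidence-based score over candidate hyperparameters. The RBF kernel, the toy sine data, the noise variance, the candidate grid, and all function names are assumptions made for illustration.

```python
import math


def rbf_kernel(x, y, bandwidth):
    """Squared-exponential kernel (an assumed choice, not taken from the paper)."""
    return math.exp(-0.5 * ((x - y) / bandwidth) ** 2)


def cholesky(a):
    """Lower-triangular Cholesky factor of a symmetric positive-definite matrix."""
    n = len(a)
    low = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1):
            s = a[i][j] - sum(low[i][k] * low[j][k] for k in range(j))
            low[i][j] = math.sqrt(s) if i == j else s / low[j][j]
    return low


def forward_solve(low, b):
    """Solve low @ z = b for z by forward substitution."""
    z = [0.0] * len(b)
    for i in range(len(b)):
        z[i] = (b[i] - sum(low[i][k] * z[k] for k in range(i))) / low[i][i]
    return z


def negative_log_evidence(xs, ys, bandwidth, noise_var):
    """Standard zero-mean GP negative log marginal likelihood.

    Equals 0.5 * y^T K^{-1} y + 0.5 * log det K + 0.5 * n * log(2 pi),
    where K is the kernel matrix plus noise_var on the diagonal.
    """
    n = len(xs)
    k_mat = [
        [rbf_kernel(xs[i], xs[j], bandwidth) + (noise_var if i == j else 0.0)
         for j in range(n)]
        for i in range(n)
    ]
    low = cholesky(k_mat)
    z = forward_solve(low, ys)                      # z = L^{-1} y
    data_fit = 0.5 * sum(v * v for v in z)          # 0.5 * y^T K^{-1} y
    complexity = sum(math.log(low[i][i]) for i in range(n))  # 0.5 * log det K
    return data_fit + complexity + 0.5 * n * math.log(2.0 * math.pi)


# Toy data (assumed): one period of a sine wave on a uniform grid.
xs = [i / 19 for i in range(20)]
ys = [math.sin(2.0 * math.pi * x) for x in xs]

# Hypothetical candidate grid and noise variance.
candidates = [0.01, 0.03, 0.1, 0.3, 1.0, 3.0]
noise_var = 1e-2

scores = {b: negative_log_evidence(xs, ys, b, noise_var) for b in candidates}
best_bandwidth = min(scores, key=scores.get)

for b in candidates:
    print(f"bandwidth={b:<5} negative log evidence={scores[b]:.3f}")
print("selected bandwidth:", best_bandwidth)
```

The score trades a data-fit term against a log-determinant complexity term, so very small and very large bandwidths are both penalized without a held-out test loss. A physics-informed variant would modify the kernel or the objective to reflect the PDE constraint, which this sketch does not attempt.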
