Improving the performance of Stein variational inference through extreme sparsification of physically-constrained neural network models
Govinda Anantha Padmanabha, Jan Niklas Fuhg, Cosmin Safta, Reese E. Jones, Nikolaos Bouklas
TL;DR
Uncertainty quantification for high-dimensional neural networks is hindered by the curse of dimensionality. The paper proposes an $L_0$ sparsification prior applied before Stein variational gradient descent ($L_0$+SVGD) to create a compact nonlinear parameter manifold and perform nonparametric UQ on this manifold. Across hyperelasticity and mechanochemistry, $L_0$+SVGD achieves accurate predictive distributions with far fewer parameters and lower computational cost than dense SVGD, $p$SVGD$, or HMC, while preserving physical structure via ICNN-based polyconvex constraints. This framework offers a robust, efficient approach for physics-informed uncertainty quantification and enables potential integration with large-scale finite element workflows and active-learning strategies.
Abstract
Most scientific machine learning (SciML) applications of neural networks involve hundreds to thousands of parameters, and hence, uncertainty quantification for such models is plagued by the curse of dimensionality. Using physical applications, we show that $L_0$ sparsification prior to Stein variational gradient descent ($L_0$+SVGD) is a more robust and efficient means of uncertainty quantification, in terms of computational cost and performance than the direct application of SGVD or projected SGVD methods. Specifically, $L_0$+SVGD demonstrates superior resilience to noise, the ability to perform well in extrapolated regions, and a faster convergence rate to an optimal solution.
