Condensed Stein Variational Gradient Descent for Uncertainty Quantification of Neural Networks
Govinda Anantha Padmanabha, Cosmin Safta, Nikolaos Bouklas, Reese E. Jones
TL;DR
This work targets uncertainty quantification for highly parameterized neural networks by introducing condensed SVGD (cSVGD), which couples concurrent sparsification with Stein gradient flow and a graph reconciliation procedure. By using sparsifying priors (e.g., |θ|^α with multiplier λ) and kernel-based repulsion, the method evolves a particle ensemble that encodes a posterior over parameters while aligning ensemble representations on a common graph, reducing degeneracy and computational cost. Applied to a physics-informed ICNN for hyperelastic constitutive modeling, cSVGD achieves drastic reductions in active parameters (from 1020 to a few hundred) without sacrificing accuracy, and an adaptive penalty further improves sparsity-accuracy tradeoffs. The approach advances scalable Bayesian inference for scientific neural networks by exploiting parameter fungibility and graph-structured alignment to enable interpretable, efficient parameter UQ in large-scale, physics-guided models.
Abstract
We propose a Stein variational gradient descent method to concurrently sparsify, train, and provide uncertainty quantification of a complexly parameterized model such as a neural network. It employs a graph reconciliation and condensation process to reduce complexity and increase similarity in the Stein ensemble of parameterizations. Therefore, the proposed condensed Stein variational gradient (cSVGD) method provides uncertainty quantification on parameters, not just outputs. Furthermore, the parameter reduction speeds up the convergence of the Stein gradient descent as it reduces the combinatorial complexity by aligning and differentiating the sensitivity to parameters. These properties are demonstrated with an illustrative example and an application to a representation problem in solid mechanics.
