Condensed Stein Variational Gradient Descent for Uncertainty Quantification of Neural Networks

Govinda Anantha Padmanabha; Cosmin Safta; Nikolaos Bouklas; Reese E. Jones

Condensed Stein Variational Gradient Descent for Uncertainty Quantification of Neural Networks

Govinda Anantha Padmanabha, Cosmin Safta, Nikolaos Bouklas, Reese E. Jones

TL;DR

This work targets uncertainty quantification for highly parameterized neural networks by introducing condensed SVGD (cSVGD), which couples concurrent sparsification with Stein gradient flow and a graph reconciliation procedure. By using sparsifying priors (e.g., |θ|^α with multiplier λ) and kernel-based repulsion, the method evolves a particle ensemble that encodes a posterior over parameters while aligning ensemble representations on a common graph, reducing degeneracy and computational cost. Applied to a physics-informed ICNN for hyperelastic constitutive modeling, cSVGD achieves drastic reductions in active parameters (from 1020 to a few hundred) without sacrificing accuracy, and an adaptive penalty further improves sparsity-accuracy tradeoffs. The approach advances scalable Bayesian inference for scientific neural networks by exploiting parameter fungibility and graph-structured alignment to enable interpretable, efficient parameter UQ in large-scale, physics-guided models.

Abstract

We propose a Stein variational gradient descent method to concurrently sparsify, train, and provide uncertainty quantification of a complexly parameterized model such as a neural network. It employs a graph reconciliation and condensation process to reduce complexity and increase similarity in the Stein ensemble of parameterizations. Therefore, the proposed condensed Stein variational gradient (cSVGD) method provides uncertainty quantification on parameters, not just outputs. Furthermore, the parameter reduction speeds up the convergence of the Stein gradient descent as it reduces the combinatorial complexity by aligning and differentiating the sensitivity to parameters. These properties are demonstrated with an illustrative example and an application to a representation problem in solid mechanics.

Condensed Stein Variational Gradient Descent for Uncertainty Quantification of Neural Networks

TL;DR

Abstract

Condensed Stein Variational Gradient Descent for Uncertainty Quantification of Neural Networks

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (13)