Student-t processes as infinite-width limits of posterior Bayesian neural networks

Francesco Caporali; Stefano Favaro; Dario Trevisan

Student-t processes as infinite-width limits of posterior Bayesian neural networks

Francesco Caporali, Stefano Favaro, Dario Trevisan

TL;DR

This proof shows that, if the parameters of a BNN follow a Gaussian prior distribution, and the variance of both the last hidden layer and the Gaussian likelihood function follows an Inverse-Gamma prior distribution, the resulting posterior BNN converges to a Student-t process in the infinite-width limit.

Abstract

The asymptotic properties of Bayesian Neural Networks (BNNs) have been extensively studied, particularly regarding their approximations by Gaussian processes in the infinite-width limit. We extend these results by showing that posterior BNNs can be approximated by Student-t processes, which offer greater flexibility in modeling uncertainty. Specifically, we show that, if the parameters of a BNN follow a Gaussian prior distribution, and the variance of both the last hidden layer and the Gaussian likelihood function follows an Inverse-Gamma prior distribution, then the resulting posterior BNN converges to a Student-t process in the infinite-width limit. Our proof leverages the Wasserstein metric to establish control over the convergence rate of the Student-t process approximation.

Student-t processes as infinite-width limits of posterior Bayesian neural networks

TL;DR

Abstract

Student-t processes as infinite-width limits of posterior Bayesian neural networks

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (4)

Theorems & Definitions (38)