On Neural Networks as Infinite Tree-Structured Probabilistic Graphical Models
Boyao Li, Alexander J. Thomson, Houssam Nassif, Matthew M. Engelhard, David Page
TL;DR
The paper addresses the lack of precise probabilistic semantics for deep neural networks by constructing an infinite-width tree-structured probabilistic graphical model (PGM) that corresponds exactly to any given DNN architecture. It proves that, for sigmoid activations, forward propagation in a DNN matches exact inference in this PGM, and it further shows the corresponding gradients align with backpropagation (Theorems 1–2). The authors extend the framework to nonnegative activations and outline a practical Hamiltonian Monte Carlo (HMC)–based fine-tuning algorithm (with CD-like updates) that leverages the PGM perspective to improve calibration. Empirical results on synthetic data and the Covertype dataset demonstrate calibration gains from HMC-based fine-tuning, suggesting a viable path to integrating PGMs and DNNs for uncertainty quantification and interpretability in hybrid models.
Abstract
Deep neural networks (DNNs) lack the precise semantics and definitive probabilistic interpretation of probabilistic graphical models (PGMs). In this paper, we propose an innovative solution by constructing infinite tree-structured PGMs that correspond exactly to neural networks. Our research reveals that DNNs, during forward propagation, indeed perform approximations of PGM inference that are precise in this alternative PGM structure. Not only does our research complement existing studies that describe neural networks as kernel machines or infinite-sized Gaussian processes, it also elucidates a more direct approximation that DNNs make to exact inference in PGMs. Potential benefits include improved pedagogy and interpretation of DNNs, and algorithms that can merge the strengths of PGMs and DNNs.
