Learning in Deep Factor Graphs with Gaussian Belief Propagation
Seth Nabarro, Mark van der Wilk, Andrew J Davison
TL;DR
This work addresses learning in deep neural-like models under continual and distributed settings by modeling all relevant quantities as random variables within Gaussian factor graphs and performing training and inference with Gaussian Belief Propagation (GBP). By linearising non-linear factors and leveraging local, parallel message updates, GBP enables scalable, asynchronous training and continual learning via Bayesian filtering of parameters. The authors demonstrate learning with learnable parameters across CNN-like architectures, achieving competitive results on video denoising and strong performance in MNIST and CIFAR10 with single-pass or limited replay data, while outperforming prior GBP-based methods. The approach integrates energy-based modelling with probabilistic inference, offering a flexible, hardware-friendly alternative to backpropagation for distributed, continual learning in deep architectures.
Abstract
We propose an approach to do learning in Gaussian factor graphs. We treat all relevant quantities (inputs, outputs, parameters, latents) as random variables in a graphical model, and view both training and prediction as inference problems with different observed nodes. Our experiments show that these problems can be efficiently solved with belief propagation (BP), whose updates are inherently local, presenting exciting opportunities for distributed and asynchronous training. Our approach can be scaled to deep networks and provides a natural means to do continual learning: use the BP-estimated parameter marginals of the current task as parameter priors for the next. On a video denoising task we demonstrate the benefit of learnable parameters over a classical factor graph approach and we show encouraging performance of deep factor graphs for continual image classification.
