Bayesian Predictive Coding
Alexander Tschantz, Magnus Koudahl, Hampus Linander, Lancelot Da Costa, Conor Heins, Jeff Beck, Christopher Buckley
TL;DR
Bayesian Predictive Coding (BPC) extends predictive coding by placing a Bayesian posterior over neural network parameters, enabling explicit uncertainty quantification while preserving PC's locality through Hebbian-like updates. By using a Matrix Normal-Wishart prior, BPC achieves closed-form updates for posterior parameters, improving convergence in full-batch settings and remaining competitive in mini-batch regimes. Empirical results show that BPC maintains competitive accuracy relative to PC and backpropagation, and it provides robust epistemic and aleatoric uncertainty estimates, outperforming some Bayesian baselines on several tasks. The work suggests that biologically plausible Bayesian learning can support uncertainty-aware deep learning and motivates future work on scalable posterior approximations and uncertainty propagation in large networks.
Abstract
Predictive coding (PC) is an influential theory of information processing in the brain, providing a biologically plausible alternative to backpropagation. It is motivated in terms of Bayesian inference, as hidden states and parameters are optimised via gradient descent on variational free energy. However, implementations of PC rely on maximum \textit{a posteriori} (MAP) estimates of hidden states and maximum likelihood (ML) estimates of parameters, limiting their ability to quantify epistemic uncertainty. In this work, we investigate a Bayesian extension to PC that estimates a posterior distribution over network parameters. This approach, termed Bayesian Predictive coding (BPC), preserves the locality of PC and results in closed-form Hebbian weight updates. Compared to PC, our BPC algorithm converges in fewer epochs in the full-batch setting and remains competitive in the mini-batch setting. Additionally, we demonstrate that BPC offers uncertainty quantification comparable to existing methods in Bayesian deep learning, while also improving convergence properties. Together, these results suggest that BPC provides a biologically plausible method for Bayesian learning in the brain, as well as an attractive approach to uncertainty quantification in deep learning.
