Brain in the Dark: Design Principles for Neuromimetic Inference under the Free Energy Principle
Mehran H. Bazargani, Szymon Urbas, Karl Friston
TL;DR
The paper tackles the weaknesses of deep learning—out-of-distribution generalisation, forgetting, and interpretability—by advocating neuromimetic AI built on the Free Energy Principle (FEP) and Predictive Coding. It provides a practical roadmap and a CPU-based PyTorch implementation to enable brain-like inference via variational methods, notably using Variational Free Energy $F$ as an objective. The authors formalise inference through $F(q;y^t)=D_{KL}[q(z^t)||p(z^t)]-\mathbb{E}_{q(z^t)}[\ln p(y^t|z^t)]$ and discuss discrete/continuous state-space formulations, including generalised coordinates of motion. A one-layer predictive coding experiment with Lotka-Volterra dynamics demonstrates improved state inference and model selection (e.g., $BF_{1,2}=1.36$ in favour of a nonlinear model $M_2$) and highlights the practical viability of FEP-inspired neuromimetic AI for robust perception tasks.
Abstract
Deep learning has revolutionised artificial intelligence (AI) by enabling automatic feature extraction and function approximation from raw data. However, it faces challenges such as a lack of out-of-distribution generalisation, catastrophic forgetting and poor interpretability. In contrast, biological neural networks, such as those in the human brain, do not suffer from these issues, inspiring AI researchers to explore neuromimetic deep learning, which aims to replicate brain mechanisms within AI models. A foundational theory for this approach is the Free Energy Principle (FEP), which despite its potential, is often considered too complex to understand and implement in AI as it requires an interdisciplinary understanding across a variety of fields. This paper seeks to demystify the FEP and provide a comprehensive framework for designing neuromimetic models with human-like perception capabilities. We present a roadmap for implementing these models and a Pytorch code repository for applying FEP in a predictive coding network.
