Structural Plasticity as Active Inference: A Biologically-Inspired Architecture for Homeostatic Control
Brennen A. Hill
TL;DR
This work introduces SAPIN, a biologically inspired architecture that combines local Hebbian-like synaptic updates with global structural plasticity, enabling a 2D grid to both compute and reconfigure its topology in pursuit of homeostatic prediction. Grounded in the Free Energy Principle and predictive coding, SAPIN demonstrates that a dynamically reconfiguring substrate can learn control policies without external rewards, as shown by solving CartPole and achieving robust performance after parameter locking. Key contributions include a concrete implementation of simultaneous content and topology optimization, a locking mechanism to stabilize learned policies, and evidence that intrinsic prediction-error minimization can suffice for certain control tasks. The results highlight the potential of embodied, structurally adaptive networks as a proof-of-concept for active inference in physically dynamic substrates and motivate future work on longer-horizon planning and more complex environments.
Abstract
Traditional neural networks, while powerful, rely on biologically implausible learning mechanisms such as global backpropagation. This paper introduces the Structurally Adaptive Predictive Inference Network (SAPIN), a novel computational model inspired by the principles of active inference and the morphological plasticity observed in biological neural cultures. SAPIN operates on a 2D grid where processing units, or cells, learn by minimizing local prediction errors. The model features two primary, concurrent learning mechanisms: a local, Hebbian-like synaptic plasticity rule based on the temporal difference between a cell's actual activation and its learned expectation, and a structural plasticity mechanism where cells physically migrate across the grid to optimize their information-receptive fields. This dual approach allows the network to learn both how to process information (synaptic weights) and also where to position its computational resources (network topology). We validated the SAPIN model on the classic Cart Pole reinforcement learning benchmark. Our results demonstrate that the architecture can successfully solve the CartPole task, achieving robust performance. The network's intrinsic drive to minimize prediction error and maintain homeostasis was sufficient to discover a stable balancing policy. We also found that while continual learning led to instability, locking the network's parameters after achieving success resulted in a stable policy. When evaluated for 100 episodes post-locking (repeated over 100 successful agents), the locked networks maintained an average 82% success rate.
