Self-orthogonalizing attractor neural networks emerging from the free energy principle
Tamas Spisak, Karl Friston
TL;DR
This work derives self-organizing attractor neural networks directly from the Free Energy Principle by employing deep particular partitions, showing how local free energy minimization yields both inference and learning rules that produce approximately orthogonal attractor representations. The framework unifies Boltzmann-machine–like stationary dynamics with non-equilibrium, sequence-capable flows through symmetric and antisymmetric couplings, enabling macro-scale Bayesian inference via sampling. Empirically, simulations demonstrate orthogonal basis formation, robust generalization, sequence learning, and resistance to catastrophic forgetting, underscoring potential benefits for brain-inspired AI and neuromorphic architectures. Overall, the paper provides a principled, hierarchical account of how adaptive, self-orthogonalizing attractor networks can emerge from first principles and solve core computational challenges in perception, memory, and learning.
Abstract
Attractor dynamics are a hallmark of many complex systems, including the brain. Understanding how such self-organizing dynamics emerge from first principles is crucial for advancing our understanding of neuronal computations and the design of artificial intelligence systems. Here we formalize how attractor networks emerge from the free energy principle applied to a universal partitioning of random dynamical systems. Our approach obviates the need for explicitly imposed learning and inference rules and identifies emergent, but efficient and biologically plausible inference and learning dynamics for such self-organizing systems. These result in a collective, multi-level Bayesian active inference process. Attractors on the free energy landscape encode prior beliefs; inference integrates sensory data into posterior beliefs; and learning fine-tunes couplings to minimize long-term surprise. Analytically and via simulations, we establish that the proposed networks favor approximately orthogonalized attractor representations, a consequence of simultaneously optimizing predictive accuracy and model complexity. These attractors efficiently span the input subspace, enhancing generalization and the mutual information between hidden causes and observable effects. Furthermore, while random data presentation leads to symmetric and sparse couplings, sequential data fosters asymmetric couplings and non-equilibrium steady-state dynamics, offering a natural extension to conventional Boltzmann Machines. Our findings offer a unifying theory of self-organizing attractor networks, providing novel insights for AI and neuroscience.
