Active Inference as a Model of Agency
Lancelot Da Costa, Samuel Tenka, Dominic Zhao, Noor Sajid
TL;DR
The paper questions reward maximisation as the sole basis of agency and proposes active inference as a canonical, physics-grounded alternative that unifies exploration and exploitation through minimising risk and ambiguity via the expected free energy $-\\\\log P(a_{>t} \\mid h_{\\le t})$. It derives agency from first principles, showing a decomposition into a KL term and an expected observation-entropy term, and then outlines a concrete algorithm (preferential inference, perceptual inference, planning as inference) that realizes this objective in POMDP-like settings. The approach yields a principled, information-aware, and risk-averse decision-making framework that can explain a wide range of behaviours and be rewritten to resemble traditional RL under explicit world-models. It also discusses scalability via hierarchical deep models and acknowledges limitations regarding representations, pointing to future work in learning and developmental aspects of the world model, with implications for neuroscience, robotics, and AI safety.
Abstract
Is there a canonical way to think of agency beyond reward maximisation? In this paper, we show that any type of behaviour complying with physically sound assumptions about how macroscopic biological agents interact with the world canonically integrates exploration and exploitation in the sense of minimising risk and ambiguity about states of the world. This description, known as active inference, refines the free energy principle, a popular descriptive framework for action and perception originating in neuroscience. Active inference provides a normative Bayesian framework to simulate and model agency that is widely used in behavioural neuroscience, reinforcement learning (RL) and robotics. The usefulness of active inference for RL is three-fold. \emph{a}) Active inference provides a principled solution to the exploration-exploitation dilemma that usefully simulates biological agency. \emph{b}) It provides an explainable recipe to simulate behaviour, whence behaviour follows as an explainable mixture of exploration and exploitation under a generative world model, and all differences in behaviour are explicit in differences in world model. \emph{c}) This framework is universal in the sense that it is theoretically possible to rewrite any RL algorithm conforming to the descriptive assumptions of active inference as an active inference algorithm. Thus, active inference can be used as a tool to uncover and compare the commitments and assumptions of more specific models of agency.
