Active Inference in Discrete State Spaces from First Principles
Patrick Kenny
TL;DR
This paper separates active inference in discrete state spaces from the Free Energy Principle and shows that a unified, monotone objective can be obtained by minimizing a constrained KL divergence via standard mean-field variational methods. Perception and action are cast within a single framework using Hidden Markov Models and dynamic Bayesian networks, avoiding reliance on expected free energy and instead deriving predictive and posterior distributions from KL optimization. The work provides comprehensive treatment of learning with Dirichlet priors, policy beliefs, planning, and time-domain renormalization, and it discusses the relation and trade-offs with traditional expected free energy formulations. Overall, it argues that discrete active inference can be implemented with robust, conventional variational techniques, with clear implications for scalable perception, action, and learning in AI systems.
Abstract
We seek to clarify the concept of active inference by disentangling it from the Free Energy Principle. We show how the optimizations that need to be carried out in order to implement active inference in discrete state spaces can be formulated as constrained divergence minimization problems which can be solved by standard mean field methods that do not appeal to the idea of expected free energy. When it is used to model perception, the perception/action divergence criterion that we propose coincides with variational free energy. When it is used to model action, it differs from an expected free energy functional by an entropy regularizer.
