Deep hybrid models: infer and plan in a dynamic world
Matteo Priorelli, Ivilin Peev Stoianov
TL;DR
The paper addresses planning in dynamic, hierarchical environments by reframing control as inference within a deep hybrid active-inference framework. It combines predictive coding with discrete policy-based planning, utilizing variational free energy $\mathcal{F}$ for perception and expected free energy $\mathcal{G}_\pi$ for action selection to operate across continuous and discrete levels. The main contributions are a deep hierarchical architecture of hybrid units that encode factorial, hierarchical, and temporal depths, a representation of potential body configurations via intrinsic/extrinsic modules, and a demonstration on a moving-tool/moving-ball reach task showing robust inference and dynamic planning under varying conditions. The work offers an interpretable, data-efficient alternative to traditional optimal control and some deep RL methods, with potential advantages in explainability and flexible multi-timescale planning.
Abstract
To determine an optimal plan for complex tasks, one often deals with dynamic and hierarchical relationships between several entities. Traditionally, such problems are tackled with optimal control, which relies on the optimization of cost functions; instead, a recent biologically-motivated proposal casts planning and control as an inference process. Active inference assumes that action and perception are two complementary aspects of life whereby the role of the former is to fulfill the predictions inferred by the latter. Here, we present an active inference approach that exploits discrete and continuous processing, based on three features: the representation of potential body configurations in relation to the objects of interest; the use of hierarchical relationships that enable the agent to easily interpret and flexibly expand its body schema for tool use; the definition of potential trajectories related to the agent's intentions, used to infer and plan with dynamic elements at different temporal scales. We evaluate this deep hybrid model on a habitual task: reaching a moving object after having picked a moving tool. We show that the model can tackle the presented task under different conditions. This study extends past work on planning as inference and advances an alternative direction to optimal control.
