Active Sensing with Predictive Coding and Uncertainty Minimization
Abdelrahman Sharafeldin, Nabil Imam, Hannah Choi
TL;DR
The paper presents a unified, end-to-end differentiable framework that integrates predictive coding-based perception with uncertainty-minimizing action to enable intrinsically driven embodied exploration. It demonstrates two instantiations of the model: a discrete controllable Markov chain setting and a continuous active vision task with band-limited sensing, both trained without external rewards. The perception module learns generative models of the environment via variational inference, while the action module selects informative actions through Bayesian Action Selection that reduces uncertainty, improving sample efficiency and data utilization for downstream classification. This work advances embodied AI by linking perception and action in a scalable, interpretable way and providing code to reproduce the results, with potential applications to more complex real-world perception-action problems.
Abstract
We present an end-to-end procedure for embodied exploration inspired by two biological computations: predictive coding and uncertainty minimization. The procedure can be applied to exploration settings in a task-independent and intrinsically driven manner. We first demonstrate our approach in a maze navigation task and show that it can discover the underlying transition distributions and spatial features of the environment. Second, we apply our model to a more complex active vision task, where an agent actively samples its visual environment to gather information. We show that our model builds unsupervised representations through exploration that allow it to efficiently categorize visual scenes. We further show that using these representations for downstream classification leads to superior data efficiency and learning speed compared to other baselines while maintaining lower parameter complexity. Finally, the modularity of our model allows us to probe its internal mechanisms and analyze the interaction between perception and action during exploration.
