Hybrid Recurrent Models Support Emergent Descriptions for Hierarchical Planning and Control
Poppy Collis, Ryan Singh, Paul F Kinghorn, Christopher L Buckley
TL;DR
Addresses the challenge of learning useful discrete abstractions for inherently continuous control problems by introducing a Hybrid Hierarchical Agent (HHA) that combines an rSLDS with a discrete MDP and a finite-horizon LQR controller, guided by Active Inference-inspired exploration. The recurrent transitions in the rSLDS yield $K$ discrete modes indexing piecewise affine dynamics, enabling temporally abstract sub-goals and information-theoretic exploration bonuses, while caching low-level solutions for efficient planning. The approach is demonstrated on the Continuous Mountain Car task, showing faster reward discovery compared to model-free baselines and revealing meaningful, non-grid discretizations of the state-space that support planning and control. Overall, the work provides a interpretable, scalable framework that integrates discrete planning with continuous control through emergent hierarchical representations, with potential implications for safe and efficient autonomous systems.
Abstract
An open problem in artificial intelligence is how systems can flexibly learn discrete abstractions that are useful for solving inherently continuous problems. Previous work has demonstrated that a class of hybrid state-space model known as recurrent switching linear dynamical systems (rSLDS) discover meaningful behavioural units via the piecewise linear decomposition of complex continuous dynamics (Linderman et al., 2016). Furthermore, they model how the underlying continuous states drive these discrete mode switches. We propose that the rich representations formed by an rSLDS can provide useful abstractions for planning and control. We present a novel hierarchical model-based algorithm inspired by Active Inference in which a discrete MDP sits above a low-level linear-quadratic controller. The recurrent transition dynamics learned by the rSLDS allow us to (1) specify temporally-abstracted sub-goals in a method reminiscent of the options framework, (2) lift the exploration into discrete space allowing us to exploit information-theoretic exploration bonuses and (3) `cache' the approximate solutions to low-level problems in the discrete planner. We successfully apply our model to the sparse Continuous Mountain Car task, demonstrating fast system identification via enhanced exploration and non-trivial planning through the delineation of abstract sub-goals.
