ExoPredicator: Learning Abstract Models of Dynamic Worlds for Robot Planning
Yichao Liang, Dat Nguyen, Cambridge Yang, Tianyang Li, Joshua B. Tenenbaum, Carl Edward Rasmussen, Adrian Weller, Zenna Tavares, Tom Silver, Kevin Ellis
TL;DR
ExoPredicator addresses long-horizon robot planning in environments with concurrent exogenous dynamics by learning abstract world models that combine symbolic state predicates with causal processes. It jointly learns (i) predicates that abstract observations into a compact state representation and (ii) the timecourse of both endogenous actions and exogenous mechanisms, using variational inference and LLM-guided proposals. A big-step planner operates over these abstractions, enabling efficient lookahead that accounts for delayed effects. Across five simulated tabletop domains, the approach generalizes to unseen tasks with more objects and complex goals and outperforms baselines, demonstrating the value of combining symbolic predicates, learned temporal dynamics, and foundation-model guidance for robust, sample-efficient planning. This framework broadly advances planning under uncertainty by integrating learning, causality, and symbolic reasoning into a single, scalable pipeline.
Abstract
Long-horizon embodied planning is challenging because the world does not only change through an agent's actions: exogenous processes (e.g., water heating, dominoes cascading) unfold concurrently with the agent's actions. We propose a framework for abstract world models that jointly learns (i) symbolic state representations and (ii) causal processes for both endogenous actions and exogenous mechanisms. Each causal process models the time course of a stochastic cause-effect relation. We learn these world models from limited data via variational Bayesian inference combined with LLM proposals. Across five simulated tabletop robotics environments, the learned models enable fast planning that generalizes to held-out tasks with more objects and more complex goals, outperforming a range of baselines.
