Affordances Enable Partial World Modeling with LLMs
Khimya Khetarpal, Gheorghe Comanici, Jonathan Richens, Jeremy Shar, Fei Xia, Laurent Orseau, Aleksandra Faust, Doina Precup
TL;DR
Full world models for multi-task planning are costly and brittle; this work investigates using large language models as partial world models guided by affordances. It develops a Generalized Affordance Framework that separates task-agnostic and task-specific intents, introduces distribution-robust affordances, and proves that achieving language-conditioned intents yields predictive partial world models while enabling provably faster search. Theoretical results (Theorem 1 and Theorem 2) link intents to partial world models and to planning efficiency, and empirical validation on tabletop robotics shows that affordance-informed partial models reduce search branching and improve rewards compared to full models. The approach offers a scalable path to efficient, transferable planning in multi-task robotic domains by constraining dynamics to relevant, robust affordances discovered via LLMs.
Abstract
Full models of the world require complex knowledge of immense detail. While pre-trained large models have been hypothesized to contain similar knowledge due to extensive pre-training on vast amounts of internet scale data, using them directly in a search procedure is inefficient and inaccurate. Conversely, partial models focus on making high quality predictions for a subset of state and actions: those linked through affordances that achieve user intents~\citep{khetarpal2020can}. Can we posit large models as partial world models? We provide a formal answer to this question, proving that agents achieving task-agnostic, language-conditioned intents necessarily possess predictive partial-world models informed by affordances. In the multi-task setting, we introduce distribution-robust affordances and show that partial models can be extracted to significantly improve search efficiency. Empirical evaluations in tabletop robotics tasks demonstrate that our affordance-aware partial models reduce the search branching factor and achieve higher rewards compared to full world models.
