From Real World to Logic and Back: Learning Generalizable Relational Concepts For Long Horizon Robot Planning
Naman Shah, Jayesh Nagpal, Siddharth Srivastava
TL;DR
This work addresses the challenge of enabling robots to generalize from limited demonstrations to long-horizon planning in unseen environments by autonomously inventing symbolic relational concepts and world models. The authors introduce LAMP, which learns relational critical regions from raw trajectories, constructs a symbolic vocabulary, and invents high-level actions and interpreters to form a transferable PDDL-like world model, enabling zero-shot solving of tasks far beyond training. Key contributions include the Relational Critical Regions framework, the Relation Inventor for semantic predicates, and the Action Inventor for automatic action models, which together yield interpretable, scalable planning that transfers across domains and object counts, achieving up to 18× generalization and competitive performance with hand-crafted baselines. The results demonstrate substantial gains in sample efficiency and generalization, with code and data released to support reuse; future work targets stochastic environments, broader object categories, and robust perception integration.
Abstract
Robots still lag behind humans in their ability to generalize from limited experience, particularly when transferring learned behaviors to long-horizon tasks in unseen environments. We present the first method that enables robots to autonomously invent symbolic, relational concepts directly from a small number of raw, unsegmented, and unannotated demonstrations. From these, the robot learns logic-based world models that support zero-shot generalization to tasks of far greater complexity than those in training. Our framework achieves performance on par with hand-engineered symbolic models, while scaling to execution horizons far beyond training and handling up to 18$\times$ more objects than seen during learning. The results demonstrate a framework for autonomously acquiring transferable symbolic abstractions from raw robot experience, contributing toward the development of interpretable, scalable, and generalizable robot planning systems. Project website and code: https://aair-lab.github.io/r2l-lamp.
