One Demo Is All It Takes: Planning Domain Derivation with LLMs from A Single Demonstration
Jinbang Huang, Yixin Xiao, Zhanguang Zhang, Mark Coates, Jianye Hao, Yingxue Zhang
TL;DR
PDDLLM addresses the challenge of long-horizon robotic planning by deriving symbolic planning domains from a single demonstration using LLM reasoning and physics-based simulation. It automatically generates predicates and actions without manual domain initialization, then interfaces with low-level motion planners via LoCA to execute plans. Evaluated on 1,200 tasks across nine environments and deployed on multiple real robots, PDDLLM outperforms six LLM-based baselines, reduces token costs, and approaches expert-designed domain quality. This work significantly reduces human effort in domain engineering and enables scalable, robust TAMP for real-world robotics.
Abstract
Pre-trained large language models (LLMs) show promise for robotic task planning but often struggle to guarantee correctness in long-horizon problems. Task and motion planning (TAMP) addresses this by grounding symbolic plans in low-level execution, yet it relies heavily on manually engineered planning domains. To improve long-horizon planning reliability and reduce human intervention, we present Planning Domain Derivation with LLMs (PDDLLM), a framework that automatically induces symbolic predicates and actions directly from demonstration trajectories by combining LLM reasoning with physical simulation roll-outs. Unlike prior domain-inference methods that rely on partially predefined or language descriptions of planning domains, PDDLLM constructs domains without manual domain initialization and automatically integrates them with motion planners to produce executable plans, enhancing long-horizon planning automation. Across 1,200 tasks in nine environments, PDDLLM outperforms six LLM-based planning baselines, achieving at least 20\% higher success rates, reduced token costs, and successful deployment on multiple physical robot platforms.
