Pretraining a Unified PDDL Domain from Real-World Demonstrations for Generalizable Robot Task Planning
Haoming Ye, Yunxiao Xiao, Cewu Lu, Panpan Cai
TL;DR
UniDomain addresses the challenge of grounding long-horizon robot task planning in real-world constraints by pretraining a unified PDDL domain from thousands of demonstrations. It blends energy-based keyframe extraction, VLM/LLM-driven domain construction with closed-loop verification, and hierarchical domain fusion to produce task-relevant meta-domains for online planning. Empirical results across four unseen task domains show substantial improvements in success and plan optimality over strong LLM-only and hybrid baselines, highlighting the value of data-driven symbolic grounding for compositional generalization. The framework promises scalable, zero-shot symbolic planning in real-world manipulation, with future work aimed at richer PDDL variants and handling perceptual uncertainty.
Abstract
Robotic task planning in real-world environments requires reasoning over implicit constraints from language and vision. While LLMs and VLMs offer strong priors, they struggle with long-horizon structure and symbolic grounding. Existing methods that combine LLMs with symbolic planning often rely on handcrafted or narrow domains, limiting generalization. We propose UniDomain, a framework that pretrains a PDDL domain from robot manipulation demonstrations and applies it to online robotic task planning. It extracts atomic domains from 12,393 manipulation videos to form a unified domain with 3,137 operators, 2,875 predicates, and 16,481 causal edges. Given a target class of tasks, it retrieves relevant atomic domains from the unified domain and systematically fuses them into high-quality meta-domains to support compositional generalization in planning. Experiments on diverse real-world tasks show that UniDomain solves complex, unseen tasks in a zero-shot manner, achieving up to 58% higher task success and 160% improvement in plan optimality over state-of-the-art LLM and LLM-PDDL baselines.
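To make the retrieve-then-fuse idea concrete, here is a minimal Python sketch. It is an illustration under simplifying assumptions, not UniDomain's actual procedure: predicates are treated as plain strings without parameters, retrieval is a keyword match on operator names, fusion is a plain union, and a "causal edge" is assumed to mean that one operator's effect appears in another operator's precondition. The names `Operator`, `Domain`, `retrieve`, `fuse`, and `causal_edges` are all hypothetical.

```python
from dataclasses import dataclass, field


@dataclass(frozen=True)
class Operator:
    name: str
    preconditions: frozenset  # predicates that must hold before the action
    effects: frozenset        # predicates made true by the action


@dataclass
class Domain:
    predicates: set = field(default_factory=set)
    operators: dict = field(default_factory=dict)  # operator name -> Operator


def retrieve(atomic_domains, task_keywords):
    """Assumed retrieval rule: keep domains with an operator name matching a keyword."""
    return [
        d for d in atomic_domains
        if any(k in name for name in d.operators for k in task_keywords)
    ]


def fuse(domains):
    """Naive fusion: union predicates; merge same-named operators by unioning conditions."""
    meta = Domain()
    for d in domains:
        meta.predicates |= d.predicates
        for name, op in d.operators.items():
            if name in meta.operators:
                old = meta.operators[name]
                op = Operator(name,
                              old.preconditions | op.preconditions,
                              old.effects | op.effects)
            meta.operators[name] = op
    return meta


def causal_edges(domain):
    """Assumed definition: edge (a, b) when some effect of a is a precondition of b."""
    ops = list(domain.operators.values())
    return sorted(
        (a.name, b.name)
        for a in ops for b in ops
        if a.name != b.name and a.effects & b.preconditions
    )


# Three toy atomic domains, each standing in for one demonstration.
pick = Domain({"hand_empty", "holding"},
              {"pick": Operator("pick", frozenset({"hand_empty"}), frozenset({"holding"}))})
place = Domain({"holding", "on", "hand_empty"},
               {"place": Operator("place", frozenset({"holding"}), frozenset({"on", "hand_empty"}))})
pour = Domain({"holding", "poured"},
              {"pour": Operator("pour", frozenset({"holding"}), frozenset({"poured"}))})

relevant = retrieve([pick, place, pour], ["pick", "place"])
meta = fuse(relevant)
edges = causal_edges(meta)
```

In this toy setting, the fused meta-domain contains only the `pick` and `place` operators, and the derived edges link each operator to the one whose precondition it enables. A real system would operate on parameterized PDDL operators and would need to reconcile synonymous predicates before merging, which this sketch does not attempt.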
