LLM-guided Task and Motion Planning using Knowledge-based Reasoning

Muhayy Ud Din; Jan Rosell; Waseem Akram; Isiah Zaplana; Maximo A Roa; Irfan Hussain

LLM-guided Task and Motion Planning using Knowledge-based Reasoning

Muhayy Ud Din, Jan Rosell, Waseem Akram, Isiah Zaplana, Maximo A Roa, Irfan Hussain

TL;DR

The paper tackles the fragility of LLM-based task and motion planning (TAMP) caused by static, template prompts in dynamic environments. It introduces Onto-LLM-TAMP, a knowledge-based framework that enriches prompts with ontology-driven context, semantic tagging, and environment-state descriptions, feeding into an LLM to produce semantically correct action sequences. The architecture combines an Ontological Prompt Construction Layer with a Planning Layer, employing SPARQL-enabled contextual inference, SpaCy tagging, YOLO/FoundationPose perception, and RRTConnect motion planning, with a feedback loop to replan on failures. Empirical results in simulation and real-world scenarios show robust planning under ambiguous prompts, improved task/execution success, and competitive planning times across multiple LLMs. The work demonstrates practical improvements in adaptive, semantically accurate TAMP by integrating domain knowledge with LLM reasoning, enabling more reliable robotic manipulation in dynamic settings.

Abstract

Performing complex manipulation tasks in dynamic environments requires efficient Task and Motion Planning (TAMP) approaches that combine high-level symbolic plans with low-level motion control. Advances in Large Language Models (LLMs), such as GPT-4, are transforming task planning by offering natural language as an intuitive and flexible way to describe tasks, generate symbolic plans, and reason. However, the effectiveness of LLM-based TAMP approaches is limited due to static and template-based prompting, which limits adaptability to dynamic environments and complex task contexts. To address these limitations, this work proposes a novel Onto-LLM-TAMP framework that employs knowledge-based reasoning to refine and expand user prompts with task-contextual reasoning and knowledge-based environment state descriptions. Integrating domain-specific knowledge into the prompt ensures semantically accurate and context-aware task plans. The proposed framework demonstrates its effectiveness by resolving semantic errors in symbolic plan generation, such as maintaining logical temporal goal ordering in scenarios involving hierarchical object placement. The proposed framework is validated through both simulation and real-world scenarios, demonstrating significant improvements over the baseline approach in terms of adaptability to dynamic environments and the generation of semantically correct task plans.

LLM-guided Task and Motion Planning using Knowledge-based Reasoning

TL;DR

Abstract

LLM-guided Task and Motion Planning using Knowledge-based Reasoning

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (11)