DAG-Plan: Generating Directed Acyclic Dependency Graphs for Dual-Arm Cooperative Planning
Zeyu Gao, Yao Mu, Jinye Qu, Mengkang Hu, Shijia Peng, Chengkai Hou, Lingyue Guo, Ping Luo, Shanghang Zhang, Yanfeng Lu
TL;DR
DAG-Plan tackles the challenge of long-horizon dual-arm task planning by replacing linear task sequences with a Directed Acyclic Graph (DAG) representation generated by LLMs. It dynamically assigns sub-tasks to the appropriate arm using real-time environment observations and a cost-based selection mechanism, enabling parallel execution and adaptive behavior. The framework introduces occupy-release DAGs, two candidate sets (common and priority), and a cost function to guide feasible pairings, with execution coded via four foundation-model skills and perceptual backends for pose estimation. Evaluations on the Dual-arm Kitchen Benchmark show DAG-Plan achieves higher efficiency and robustness than single-arm planning and other dual-arm baselines, including substantial gains in plan success rate and reduced token/query costs, while maintaining real-world viability. Overall, DAG-Plan demonstrates that DAG-based, environment-aware, cooperative planning can significantly advance the practicality of autonomous dual-arm manipulation in complex tasks.
Abstract
Dual-arm robots offer enhanced versatility and efficiency over single-arm counterparts by enabling concurrent manipulation of multiple objects or cooperative execution of tasks using both arms. However, the coordination of dual-arm systems for long-horizon tasks continues to pose significant challenges, stemming from the intricate temporal and spatial dependencies among sub-tasks, necessitating intelligent decisions regarding the allocation of actions between arms and their optimal execution order. Existing task planning methods predominantly focus on single-arm robots or rely on predefined bimanual operations to use large language models (LLMs) generate task sequence with linear temporal dependency, failing to fully leverage the capabilities of dual-arm systems. To address this limitation, we introduce DAG-Plan, a structured task planning framework tailored for dual-arm robots. DAG-Plan harnesses LLMs to decompose intricate tasks into actionable sub-tasks represented as nodes within a directed acyclic graph (DAG). Critically, DAG-Plan dynamically assigns these sub-tasks to the appropriate arm based on real-time environmental observations, enabling parallel and adaptive execution. We evaluate DAG-Plan on the Dual-Arm Kitchen Benchmark, comprising 5 sequential tasks with 44 sub-tasks. Extensive experiments demonstrate the superiority of DAG-Plan over directly using LLM to generate linear task sequence, achieving 52.8% higher efficiency compared to the single-arm task planning and 48% higher success rate of the dual-arm task planning. Compared to iterative methods, DAG-Plan improving execution efficiency 84.1% due to its fewer query time. More demos and information are available on https://sites.google.com/view/dag-plan.
