Plan-over-Graph: Towards Parallelable LLM Agent Schedule
Shiqi Zhang, Xinbei Ma, Zouying Cao, Zhuosheng Zhang, Hai Zhao
TL;DR
Plan-over-Graph introduces a parallelizable planning paradigm for LLM agents by first extracting an executable task graph from natural language and then planning over this graph under a time-cost objective. It combines synthetic data generation of DAGs and a two-stage training regime (SFT with LoRA and Direct Preference Optimization) to improve graph understanding and parallel planning across API-based and open-source LLMs. Empirical results show substantial gains in optimal and success rates, and clear time-efficiency benefits from parallel execution, across both graph-only tasks and real-world textual queries. The approach provides a scalable, graph-centric framework for robust, efficient LLM-driven planning, with code and data released for reproducibility. It also analyzes how graph topology and size affect performance, informing future directions for graph-aware reasoning in LLMs.
Abstract
Large Language Models (LLMs) have demonstrated exceptional abilities in reasoning for task planning. However, challenges remain under-explored for parallel schedules. This paper introduces a novel paradigm, plan-over-graph, in which the model first decomposes a real-life textual task into executable subtasks and constructs an abstract task graph. The model then understands this task graph as input and generates a plan for parallel execution. To enhance the planning capability of complex, scalable graphs, we design an automated and controllable pipeline to generate synthetic graphs and propose a two-stage training scheme. Experimental results show that our plan-over-graph method significantly improves task performance on both API-based LLMs and trainable open-sourced LLMs. By normalizing complex tasks as graphs, our method naturally supports parallel execution, demonstrating global efficiency. The code and data are available at https://github.com/zsq259/Plan-over-Graph.
