TwoStep: Multi-agent Task Planning using Classical Planners and Large Language Models
David Bai, Ishika Singh, David Traum, Jesse Thomason
TL;DR
The paper introduces TwoStep, a framework that converts a multi-agent planning problem into parallel single-agent planning tasks by using LLMs to generate helper subgoals and translating them into PDDL. This hybrid approach leverages the strengths of classical planners for execution guarantees while harnessing LLM commonsense to partition goals and enable concurrency. Empirical results across five symbolic domains and an embodied AlfWorld/AI2THOR setup show faster planning times and shorter execution lengths than traditional multi-agent PDDL planning in most cases, with subgoals that closely match human expert specifications. The work demonstrates practical gains in multi-agent coordination and provides prompts for subgoal generation and translation to facilitate future development.
Abstract
Classical planning formulations like the Planning Domain Definition Language (PDDL) admit action sequences guaranteed to achieve a goal state given an initial state if any are possible. However, reasoning problems defined in PDDL do not capture temporal aspects of action taking, such as concurrent actions between two agents when there are no conflicting conditions, without significant modification and definition to existing PDDL domains. A human expert aware of such constraints can decompose a goal into subgoals, each reachable through single agent planning, to take advantage of simultaneous actions. In contrast to classical planning, large language models (LLMs) directly used for inferring plan steps rarely guarantee execution success, but are capable of leveraging commonsense reasoning to assemble action sequences. We combine the strengths of both classical planning and LLMs by approximating human intuitions for multi-agent planning goal decomposition. We demonstrate that LLM-based goal decomposition leads to faster planning times than solving multi-agent PDDL problems directly while simultaneously achieving fewer plan execution steps than a single agent plan alone, as well as most multiagent plans, while guaranteeing execution success. Additionally, we find that LLM-based approximations of subgoals result in similar multi-agent execution lengths to those specified by human experts. Website and resources at https://glamor-usc.github.io/twostep
