Fast and Accurate Task Planning using Neuro-Symbolic Language Models and Multi-level Goal Decomposition
Minseo Kwon, Yaesol Kim, Young J. Kim
TL;DR
This work addresses the bottleneck of long-horizon robotic task planning by introducing a neuro-symbolic planner that uses LLMs both as L-Model (for multi-level subgoal decomposition) and L-Policy (to drive subgoal planning via Monte Carlo Tree Search). Planning is formulated as a multi-valued planning task $P \equiv \langle \mathcal{S}, \mathcal{O}, \mathcal{A}, \mathcal{T}, s_0, S^\star \rangle$, with subgoals generating subproblems $P_i$ that are solved either symbolically or with MCTS-LLM depending on complexity. The pipeline combines planning formulation from multimodal perception, subgoal generation with LLMs, and a selective subproblem solver (symbolic or MCTS-LLM), ultimately concatenating subplans into a complete PDDL-based plan. Across three IPC domains and real/simulation tests, the approach yields high success rates (up to $100\%$) and demonstrates favorable scalability and robustness, supported by ablations showing the effectiveness of goal decomposition and by real-robot demonstrations. The work advances scalable, robust task planning for robotics by leveraging subgoal decomposition and a hybrid search strategy that fuses symbolic and LLM-guided planning.
Abstract
In robotic task planning, symbolic planners using rule-based representations like PDDL are effective but struggle with long-sequential tasks in complicated environments due to exponentially increasing search space. Meanwhile, LLM-based approaches, which are grounded in artificial neural networks, offer faster inference and commonsense reasoning but suffer from lower success rates. To address the limitations of the current symbolic (slow speed) or LLM-based approaches (low accuracy), we propose a novel neuro-symbolic task planner that decomposes complex tasks into subgoals using LLM and carries out task planning for each subgoal using either symbolic or MCTS-based LLM planners, depending on the subgoal complexity. This decomposition reduces planning time and improves success rates by narrowing the search space and enabling LLMs to focus on more manageable tasks. Our method significantly reduces planning time while maintaining high success rates across task planning domains, as well as real-world and simulated robotics environments. More details are available at http://graphics.ewha.ac.kr/LLMTAMP/.
