HiCRISP: An LLM-based Hierarchical Closed-Loop Robotic Intelligent Self-Correction Planner
Chenlin Ming, Jiacheng Lin, Pangkit Fong, Han Wang, Xiaoming Duan, Jianping He
TL;DR
This work addresses the lack of stepwise self-correction in LLM-based robotic planning by introducing HiCRISP, a Hierarchical Closed-loop Robotic Intelligent Self-correction Planner. It frames task execution as a finite MDP, with the LLM decomposing instructions into a state-action path $igl\langle \mathcal{S}, \mathcal{A}, \mathcal{P}, \mathcal{R} \bigr\rangle$ and using perception to detect high-level and low-level failures, while employing a stack-based, depth-limited error-correction mechanism. High-level feedback re-plans via rectified actions $a_{\text{correction}}$ to bridge from $s_{\text{error}}$ toward the desired $s_{i+1}$; low-level feedback relies on movement primitives with built-in error checks, invoking LLM-driven corrections only for novel issues. Across virtual (Virtual Home) and robot-simulation (Gazebo, Bullet) setups, and real-world AGV experiments, HiCRISP achieves higher execution and success rates than baselines like ProgPrompt, demonstrating improved robustness and adaptability in dynamic environments.
Abstract
The integration of Large Language Models (LLMs) into robotics has revolutionized human-robot interactions and autonomous task planning. However, these systems are often unable to self-correct during the task execution, which hinders their adaptability in dynamic real-world environments. To address this issue, we present a Hierarchical Closed-loop Robotic Intelligent Self-correction Planner (HiCRISP), an innovative framework that enables robots to correct errors within individual steps during the task execution. HiCRISP actively monitors and adapts the task execution process, addressing both high-level planning and low-level action errors. Extensive benchmark experiments, encompassing virtual and real-world scenarios, showcase HiCRISP's exceptional performance, positioning it as a promising solution for robotic task planning with LLMs.
