Learning to Recover from Plan Execution Errors during Robot Manipulation: A Neuro-symbolic Approach
Namasivayam Kalithasan, Arnav Tuli, Vishal Bindal, Himanshu Gaurav Singh, Parag Singla, Rohan Paul
TL;DR
This work tackles the challenge of detecting and recovering from execution errors in long-horizon robot manipulation. It introduces a neuro-symbolic framework that uses a dense scene-graph representation, consisting of a scene-graph encoder $H_\psi$, a scene-graph predictor $T^{ideal}_\theta$, and a scene-graph discriminator $K_\phi$, to imagine nominal plan execution and identify discrepancies without annotated failure data. When errors are detected, a neuro-symbolic search crafts a recovery plan that reuses parts of the original plan and targets sub-goals $S_{t_l}$, with an anytime variant allowing adjustable planning budgets. The approach demonstrates superior recovery efficiency and accuracy compared to RL baselines and full re-planning strategies in a simulated PyBullet environment with varied error types. Overall, the method offers a practical, self-supervised pipeline for robust long-horizon manipulation by coupling perception, prediction, and discrepancy-guided planning.
Abstract
Automatically detecting and recovering from failures is an important but challenging problem for autonomous robots. Most of the recent work on learning to plan from demonstrations lacks the ability to detect and recover from errors in the absence of an explicit state representation and/or a (sub-) goal check function. We propose an approach (blending learning with symbolic search) for automated error discovery and recovery, without needing annotated data of failures. Central to our approach is a neuro-symbolic state representation, in the form of dense scene graph, structured based on the objects present within the environment. This enables efficient learning of the transition function and a discriminator that not only identifies failures but also localizes them facilitating fast re-planning via computation of heuristic distance function. We also present an anytime version of our algorithm, where instead of recovering to the last correct state, we search for a sub-goal in the original plan minimizing the total distance to the goal given a re-planning budget. Experiments on a physics simulator with a variety of simulated failures show the effectiveness of our approach compared to existing baselines, both in terms of efficiency as well as accuracy of our recovery mechanism.
