RAIDER: Tool-Equipped Large Language Model Agent for Robotic Action Issue Detection, Explanation and Recovery

Silvia Izquierdo-Badiola; Carlos Rizzo; Guillem Alenyà

TL;DR

RAIDER addresses the detection, explanation, and recovery of action-related issues for embodied robots operating around humans. It integrates an LLM with a grounded toolset under a "Ground, Ask&Answer, Issue" procedure to selectively gather context and identify precondition ambiguities or infeasibilities, and then generates recoveries that often involve human input. In simulated AI2THOR household tasks, RAIDER outperforms baselines in grounding, issue detection, and explanation, and its explanations substantially improve interactive recovery planning. The framework is modular and extensible, enabling adaptation to real-world assistive scenarios with minimal reconfiguration.

Abstract

As robots increasingly operate in dynamic human-centric environments, improving their ability to detect, explain, and recover from action-related issues becomes crucial. Traditional model-based and data-driven techniques lack adaptability, while more flexible generative AI methods struggle with grounding extracted information to real-world constraints. We introduce RAIDER, a novel agent that integrates Large Language Models (LLMs) with grounded tools for adaptable and efficient issue detection and explanation. Using a unique "Ground, Ask&Answer, Issue" procedure, RAIDER dynamically generates context-aware precondition questions and selects appropriate tools for resolution, achieving targeted information gathering. Our results within a simulated household environment surpass methods relying on predefined models, full scene descriptions, or standalone trained models. Additionally, RAIDER's explanations enhance recovery success, including cases requiring human interaction. Its modular architecture, featuring self-correction mechanisms, enables straightforward adaptation to diverse scenarios, as demonstrated in a real-world human-assistive task. This showcases RAIDER's potential as a versatile agentic AI solution for robotic issue detection and explanation, while addressing the problem of grounding generative AI for its effective application in embodied agents. Project website: https://eurecat.github.io/raider-llmagent/
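
The "Ground, Ask&Answer, Issue" procedure named in the abstract can be illustrated with a minimal Python sketch. This is not RAIDER's implementation: the rule table in `ask` stands in for the LLM that generates precondition questions, and every name here (`Question`, `TOOLS`, `ground`, `ask`, `detect_issue`, the layout of the `world` dictionary) is a hypothetical chosen for illustration. It only shows the control flow of grounding the action target, asking targeted precondition questions, answering each with a selected tool, and issuing an explanation when an answer contradicts the expected precondition.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Optional

World = Dict[str, dict]  # hypothetical scene state: {"objects": {...}, "robot": {...}}


@dataclass
class Question:
    text: str       # precondition question in natural language
    tool: str       # name of the tool selected to answer it
    expected: bool  # answer required for the action to be feasible


# Hypothetical grounded tools; each one inspects only the part of the scene it needs.
TOOLS: Dict[str, Callable[[World, str], bool]] = {
    "is_visible": lambda world, obj: bool(world["objects"][obj].get("visible", False)),
    "gripper_empty": lambda world, obj: world["robot"].get("holding") is None,
}


def ground(target: str, world: World) -> bool:
    """Ground: check that the action target refers to an object in the scene."""
    return target in world["objects"]


def ask(action: str, target: str) -> List[Question]:
    """Ask: stand-in for the LLM that would generate precondition questions."""
    if action == "pick_up":
        return [
            Question(f"Is the {target} visible?", "is_visible", True),
            Question("Is the gripper empty?", "gripper_empty", True),
        ]
    return []


def detect_issue(action: str, target: str, world: World) -> Optional[str]:
    """Run Ground, then Ask&Answer, then Issue; return an explanation or None."""
    if not ground(target, world):
        return f"Cannot ground '{target}' in the current scene."
    for question in ask(action, target):
        answer = TOOLS[question.tool](world, target)  # Answer via the selected tool
        if answer != question.expected:
            return (
                f"Precondition failed for {action}({target}): "
                f"'{question.text}' was answered {answer}."
            )
    return None
```

Calling `detect_issue("pick_up", "mug", world)` returns `None` when every precondition holds and an explanation string otherwise. Only the tools tied to the generated questions are invoked, which reflects the targeted information gathering described above, as opposed to passing a full scene description to the model.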
