INPROVF: Leveraging Large Language Models to Repair High-level Robot Controllers from Assumption Violations

Qian Meng; Jin Peng Zhou; Kilian Q. Weinberger; Hadas Kress-Gazit

INPROVF: Leveraging Large Language Models to Repair High-level Robot Controllers from Assumption Violations

Qian Meng, Jin Peng Zhou, Kilian Q. Weinberger, Hadas Kress-Gazit

TL;DR

This paper tackles the challenge of repairing high-level robot controllers at runtime when environment assumptions are violated and state spaces are too large for purely formal methods. It introduces INPROVF, a hybrid framework that leverages large language models to generate repair candidates and uses formal verification to ensure correctness, with an explicit Informalization step that translates abstractions into natural language to improve prompting. The approach includes iterative Repair and Feedback stages, where formal analyses provide targeted guidance to the LLM, and verification checks ensure the updated specification is realizable. Empirical results across 12 violations in diverse workspaces show that INPROVF significantly improves scalability over a purely formal-baseline, with ablation studies confirming the value of informalization and feedback in achieving higher success rates.

Abstract

This paper presents INPROVF, an automatic framework that combines large language models (LLMs) and formal methods to speed up the repair process of high-level robot controllers. Previous approaches based solely on formal methods are computationally expensive and cannot scale to large state spaces. In contrast, INPROVF uses LLMs to generate repair candidates, and formal methods to verify their correctness. To improve the quality of these candidates, our framework first translates the symbolic representations of the environment and controllers into natural language descriptions. If a candidate fails the verification, INPROVF provides feedback on potential unsafe behaviors or unsatisfied tasks, and iteratively prompts LLMs to generate improved solutions. We demonstrate the effectiveness of INPROVF through 12 violations with various workspaces, tasks, and state space sizes.

INPROVF: Leveraging Large Language Models to Repair High-level Robot Controllers from Assumption Violations

TL;DR

Abstract

INPROVF: Leveraging Large Language Models to Repair High-level Robot Controllers from Assumption Violations

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (7)