Table of Contents
Fetching ...

Probabilistically Correct Language-based Multi-Robot Planning using Conformal Prediction

Jun Wang, Guocheng He, Yiannis Kantaros

TL;DR

This work tackles language-based mission planning for robot teams by introducing S-ATLAS, a distributed planner that uses conformal prediction to create local, uncertainty-aware prediction sets for each robot. By solving sequential MCQA problems and selectively seeking help when predictions are uncertain, S-ATLAS achieves user-defined mission success rates with reduced reliance on human input and improved scalability over centralized CP baselines. The authors provide theoretical guarantees on achievement probabilities and demonstrate substantial empirical gains in planning accuracy and help-rate reductions on AI2THOR-based tasks as team size grows. Overall, the approach offers a practical, scalable framework for reliable NL-based multi-robot planning with provable performance assurances.

Abstract

This paper addresses task planning problems for language-instructed robot teams. Tasks are expressed in natural language (NL), requiring the robots to apply their capabilities at various locations and semantic objects. Several recent works have addressed similar planning problems by leveraging pre-trained Large Language Models (LLMs) to design effective multi-robot plans. However, these approaches lack performance guarantees. To address this challenge, we introduce a new distributed LLM-based planner, called S-ATLAS for Safe plAnning for Teams of Language-instructed AgentS, that is capable of achieving user-defined mission success rates. This is accomplished by leveraging conformal prediction (CP), a distribution-free uncertainty quantification tool in black-box models. CP allows the proposed multi-robot planner to reason about its inherent uncertainty in a distributed fashion, enabling robots to make individual decisions when they are sufficiently certain and seek help otherwise. We show, both theoretically and empirically, that the proposed planner can achieve user-specified task success rates, assuming successful plan execution, while minimizing the overall number of help requests. We provide comparative experiments against related works showing that our method is significantly more computational efficient and achieves lower help rates. The advantage of our algorithm over baselines becomes more pronounced with increasing robot team size.

Probabilistically Correct Language-based Multi-Robot Planning using Conformal Prediction

TL;DR

This work tackles language-based mission planning for robot teams by introducing S-ATLAS, a distributed planner that uses conformal prediction to create local, uncertainty-aware prediction sets for each robot. By solving sequential MCQA problems and selectively seeking help when predictions are uncertain, S-ATLAS achieves user-defined mission success rates with reduced reliance on human input and improved scalability over centralized CP baselines. The authors provide theoretical guarantees on achievement probabilities and demonstrate substantial empirical gains in planning accuracy and help-rate reductions on AI2THOR-based tasks as team size grows. Overall, the approach offers a practical, scalable framework for reliable NL-based multi-robot planning with provable performance assurances.

Abstract

This paper addresses task planning problems for language-instructed robot teams. Tasks are expressed in natural language (NL), requiring the robots to apply their capabilities at various locations and semantic objects. Several recent works have addressed similar planning problems by leveraging pre-trained Large Language Models (LLMs) to design effective multi-robot plans. However, these approaches lack performance guarantees. To address this challenge, we introduce a new distributed LLM-based planner, called S-ATLAS for Safe plAnning for Teams of Language-instructed AgentS, that is capable of achieving user-defined mission success rates. This is accomplished by leveraging conformal prediction (CP), a distribution-free uncertainty quantification tool in black-box models. CP allows the proposed multi-robot planner to reason about its inherent uncertainty in a distributed fashion, enabling robots to make individual decisions when they are sufficiently certain and seek help otherwise. We show, both theoretically and empirically, that the proposed planner can achieve user-specified task success rates, assuming successful plan execution, while minimizing the overall number of help requests. We provide comparative experiments against related works showing that our method is significantly more computational efficient and achieves lower help rates. The advantage of our algorithm over baselines becomes more pronounced with increasing robot team size.
Paper Structure (17 sections, 2 theorems, 19 equations, 5 figures, 3 tables, 1 algorithm)

This paper contains 17 sections, 2 theorems, 19 equations, 5 figures, 3 tables, 1 algorithm.

Key Result

Proposition 4.1

The prediction set $\bar{{\mathcal{C}}}(\bar{\ell}_{\text{test}})$ defined in eq:pred3 is the same as the on-the-fly constructed prediction set $\hat{{\mathcal{C}}}(\bar{\ell}_{\text{test}})$ defined in eq:causalPred3, i.e., $\bar{{\mathcal{C}}}(\bar{\ell}_{\text{test}})=\hat{{\mathcal{C}}}(\bar{\el

Figures (5)

  • Figure 1: We propose a distributed planner for language-instructed multi-robot systems being capable of achieving user-specified mission success rates. The robots are delegated to pre-trained Large Language Models (LLMs), enabling them to select actions while coordinating in a conversational manner. Our LLM-based planner reasons about its inherent uncertainty, using conformal prediction, in a decentralized fashion. This capability allows the planner to determine when and which robot is uncertain about correctness of its next action. In cases of high uncertainty, the respective robots seek assistance.
  • Figure 2: Given a task scenario $\xi$, the robots select decisions at each time $t$ sequentially as per an ordered set ${\mathcal{I}}$ of robot indices. Each robot $j={\mathcal{I}}(i)$ (gray disk) is delegated to an LLM that generates a decision $s_j(t)$ given textual context $\ell_{{\mathcal{I}}(i-1)}(t)$ containing the task description and past robot decisions provided by the previous robot ${\mathcal{I}}(i-1)$. If the LLM for robot $j$ is not certain enough about what the correct decision $s_j(t)$ is, the robots seek assistance (not shown).
  • Figure 3: Example of the constructed prompt in Section \ref{['sec:experiments']} for GPT 3.5. This prompt refers to $t=1$ when the action history is empty.
  • Figure 4: Execution of a single-robot plan, generated by S-ATLAS, for a mission: 'Deliver a tomato and an apple to the sink. Also, deliver the kettle and the bread to the table'.
  • Figure 5: Execution of a three-robot plan, generated by S-ATLAS, for the mission: 'Deliver the bread, apple and tomato to the table; Deliver the potato to the sink. Also, Robot 1 should never pick up a knife'. The first three snapshots left illustrate the plans, generated by S-ATLAS, followed by each robot to achieve certain sub-tasks of the mission. The fourth snapshot shows the environment when all the sub-tasks are completed.

Theorems & Definitions (14)

  • Remark 3.1: Enhancing Computational Efficiency
  • Remark 3.2: Unknown/Dynamic Environments
  • Remark 3.3: Prediction sets
  • Remark 3.4: Efficiency Benefits
  • Remark 3.5: Multiple Feasible Solutions
  • Remark 3.6: i.i.d. Assumption
  • Remark 3.7: Dataset-Conditional Guarantee
  • Proposition 4.1
  • proof
  • Theorem 4.2: Mission Success Rate
  • ...and 4 more