Towards Human Awareness in Robot Task Planning with Large Language Models
Yuchen Liu, Luigi Palmieri, Sebastian Koch, Ilche Georgievski, Marco Aiello
TL;DR
The paper tackles proactive robot task planning in dynamic environments by integrating humans into a hierarchical 3D scene graph and leveraging Large Language Models to predict future human activities and ground them into formal planning language. It introduces an architecture that encodes humans as scene-graph nodes, uses LLMs to predict human goals with probabilities, and transforms the problem from single-robot planning to a multi-agent setting with humans as planning agents. A key contribution is an algorithm that extracts domain knowledge, accounts for past observations, generates or augments actions when needed, assigns goal states to humans, and solves the resulting multi-agent planning problem with a standard planner. The approach aims to enable proactive, human-aware robot decision-making in dynamic environments, with future work including implementation, evaluation, and benchmarking in photorealistic simulations and real-world scenarios.
Abstract
The recent breakthroughs in the research on Large Language Models (LLMs) have triggered a transformation across several research domains. Notably, the integration of LLMs has greatly enhanced performance in robot Task And Motion Planning (TAMP). However, previous approaches often neglect the consideration of dynamic environments, i.e., the presence of dynamic objects such as humans. In this paper, we propose a novel approach to address this gap by incorporating human awareness into LLM-based robot task planning. To obtain an effective representation of the dynamic environment, our approach integrates humans' information into a hierarchical scene graph. To ensure the plan's executability, we leverage LLMs to ground the environmental topology and actionable knowledge into formal planning language. Most importantly, we use LLMs to predict future human activities and plan tasks for the robot considering the predictions. Our contribution facilitates the development of integrating human awareness into LLM-driven robot task planning, and paves the way for proactive robot decision-making in dynamic environments.
