Anticipate & Act : Integrating LLMs and Classical Planning for Efficient Task Execution in Household Environments
Raghav Arora, Shivam Singh, Karthik Swaminathan, Ahana Datta, Snehasis Banerjee, Brojeshwar Bhowmick, Krishna Murthy Jatavallabhula, Mohan Sridharan, Madhava Krishna
TL;DR
This work addresses the inefficiency of executing household tasks sequentially by integrating LLM-driven high-level task anticipation with a classical planning system (PDDL/Fast Downward) to produce joint, fine-grained action sequences. By prompting LLMs with routine examples, the agent anticipates upcoming tasks and grounds them as joint goals in a domain model, enabling the planner to interleave actions across tasks. Experimental results in VirtualHome show a notable 31% reduction in execution time and a 12% reduction in plan length when anticipating multiple tasks, with GPT-4 achieving robust task-order predictions given contextual prompts. The framework demonstrates practical benefits for assistive agents in dynamic home environments and points to future work on scalability, probabilistic planning, and real-robot deployment.
Abstract
Assistive agents performing household tasks such as making the bed or cooking breakfast often compute and execute actions that accomplish one task at a time. However, efficiency can be improved by anticipating upcoming tasks and computing an action sequence that jointly achieves these tasks. State-of-the-art methods for task anticipation use data-driven deep networks and Large Language Models (LLMs), but they do so at the level of high-level tasks and/or require many training examples. Our framework leverages the generic knowledge of LLMs through a small number of prompts to perform high-level task anticipation, using the anticipated tasks as goals in a classical planning system to compute a sequence of finer-granularity actions that jointly achieve these goals. We ground and evaluate our framework's abilities in realistic scenarios in the VirtualHome environment and demonstrate a 31% reduction in execution time compared with a system that does not consider upcoming tasks.
