$How^{2}$: How to learn from procedural How-to questions
Gautier Dagan, Frank Keller, Alex Lascarides
TL;DR
The paper introduces the memory-augmented framework $How^2$ for lifelong learning from procedural how-to questions to improve planning in interactive environments. By decoupling guidance from the current state through abstraction and parsing, it enables reusable memory entries across tasks. In Plancraft, fully executable teacher answers yield high immediate success, while abstracted subgoals enhance long-term reuse, with the complete $How^2$ pipeline balancing immediate utility and autonomous learning. These results demonstrate the potential of memory-guided planning for LLM-based agents in complex, open-ended domains.
Abstract
An agent facing a planning problem can use answers to how-to questions to reduce uncertainty and fill knowledge gaps, helping it solve both current and future tasks. However, their open ended nature, where valid answers to "How do I X?" range from executable actions to high-level descriptions of X's sub-goals, makes them challenging for AI agents to ask, and for AI experts to answer, in ways that support efficient planning. We introduce $How^{2}$, a memory agent framework that enables agents to ask how-to questions, store the answers, and reuse them for lifelong learning in interactive environments. We evaluate our approach in Plancraft, a Minecraft crafting environment, where agents must complete an assembly task by manipulating inventory items. Using teacher models that answer at varying levels of abstraction, from executable action sequences to high-level subgoal descriptions, we show that lifelong learning agents benefit most from answers that are abstracted and decoupled from the current state. $How^{2}$ offers a way for LLM-based agents to improve their planning capabilities over time by asking questions in interactive environments.
