VAL: Interactive Task Learning with GPT Dialog Parsing
Lane Lawley, Christopher J. MacLellan
TL;DR
VAL tackles the brittleness of natural language interfaces in interactive task learning by integrating a neuro-symbolic HTN-based planning framework with narrowly scoped GPT subroutines. It uses the VALgorithm to ground natural language in symbolic actions, enabling incremental, interpretable task knowledge that generalizes to new tasks with few examples. The approach is validated via a user study in a video game environment, showing usable and interpretable learning with measurable subroutine success and actionable feedback mechanisms like confirmatory dialogs and an undo feature. The work demonstrates practical implications for human-centered AI that can learn from limited natural language guidance while maintaining reliability and interpretability, pointing to future expansion across modalities and open-model deployments.
Abstract
Machine learning often requires millions of examples to produce static, black-box models. In contrast, interactive task learning (ITL) emphasizes incremental knowledge acquisition from limited instruction provided by humans in modalities such as natural language. However, ITL systems often suffer from brittle, error-prone language parsing, which limits their usability. Large language models (LLMs) are resistant to brittleness but are not interpretable and cannot learn incrementally. We present VAL, an ITL system with a new philosophy for LLM/symbolic integration. By using LLMs only for specific tasks--such as predicate and argument selection--within an algorithmic framework, VAL reaps the benefits of LLMs to support interactive learning of hierarchical task knowledge from natural language. Acquired knowledge is human interpretable and generalizes to support execution of novel tasks without additional training. We studied users' interactions with VAL in a video game setting, finding that most users could successfully teach VAL using language they felt was natural.
