Continual Learning of Domain Knowledge from Human Feedback in Text-to-SQL
Thomas Cook, Kelly Patel, Sivapriya Vellaichamy, Udari Madhushani Sehwag, Saba Rahimi, Zhen Zeng, Sumitra Ganesh
TL;DR
The paper tackles the problem of tacit domain knowledge missing from formal documentation in Text-to-SQL by introducing a continual-learning framework driven by human feedback. It proposes a memory-augmented Learning Agent with four granular memory levels and two modes of procedural reasoning, complemented by a Human Proxy Agent for scalable feedback. Empirical results on the BIRD Dev benchmark show that memory enhancements—especially with Procedural Agent configurations—improve execution accuracy and generalization to new questions, while distilling actionable tacit knowledge across interactions. The work provides a practical blueprint for adaptive, domain-aware text-to-SQL systems that continually learn from human input, with implications for broader structured-reasoning tasks.
Abstract
Large Language Models (LLMs) can generate SQL queries from natural language questions but struggle with database-specific schemas and tacit domain knowledge. We introduce a framework for continual learning from human feedback in text-to-SQL, where a learning agent receives natural language feedback to refine queries and distills the revealed knowledge for reuse on future tasks. This distilled knowledge is stored in a structured memory, enabling the agent to improve execution accuracy over time. We design and evaluate multiple variations of a learning agent architecture that vary in how they capture and retrieve past experiences. Experiments on the BIRD benchmark Dev set show that memory-augmented agents, particularly the Procedural Agent, achieve significant accuracy gains and error reduction by leveraging human-in-the-loop feedback. Our results highlight the importance of transforming tacit human expertise into reusable knowledge, paving the way for more adaptive, domain-aware text-to-SQL systems that continually learn from a human-in-the-loop.
