Robust Text-to-SQL Generation with Execution-Guided Decoding
Chenglong Wang, Kedar Tatwawadi, Marc Brockschmidt, Po-Sen Huang, Yi Mao, Oleksandr Polozov, Rishabh Singh
TL;DR
This work introduces execution-guided decoding, a test-time mechanism that prunes invalid partial SQL programs by executing them during generation. By applying this approach to four diverse text-to-SQL models across WikiSQL, ATIS, and GeoQuery, the authors achieve consistent improvements and set a new state of the art on WikiSQL with 83.8% execution accuracy. Ablation studies show that fine-grained guidance on conditional generation yields the most benefit, while the method remains simple and broadly applicable across model families. The work highlights a promising direction for combining neural generation with symbolic execution to enforce semantic correctness without retraining.
Abstract
We consider the problem of neural semantic parsing, which translates natural language questions into executable SQL queries. We introduce a new mechanism, execution guidance, to leverage the semantics of SQL. It detects and excludes faulty programs during the decoding procedure by conditioning on the execution of partially generated program. The mechanism can be used with any autoregressive generative model, which we demonstrate on four state-of-the-art recurrent or template-based semantic parsing models. We demonstrate that execution guidance universally improves model performance on various text-to-SQL datasets with different scales and query complexity: WikiSQL, ATIS, and GeoQuery. As a result, we achieve new state-of-the-art execution accuracy of 83.8% on WikiSQL.
