Large Language Model Enhanced Text-to-SQL Generation: A Survey
Xiaohu Zhu, Qian Li, Lizhen Cui, Yongkang Liu
TL;DR
The paper surveys the emergence of large language model–driven Text-to-SQL generation, organizing approaches into prompt engineering, fine-tuning, task-training, and agent paradigms. It highlights how LLMs, combined with structured prompts, pretraining, and external tools, advance SQL generation across single- and cross-domain databases, while detailing evaluation metrics and diverse datasets. Key contributions include a taxonomy of methods, synthesis of datasets and benchmarks, and discussion of challenges such as schema linking, robustness, and efficiency. The work clarifies practical implications for deploying LLM-based Text-to-SQL systems in real-world, enterprise-like environments and identifies avenues for future research toward smarter, scalable, and generalizable SQL generation.
Abstract
Text-to-SQL translates natural language queries into Structured Query Language (SQL) commands, enabling users to interact with databases using natural language. Essentially, the text-to-SQL task is a text generation task, and its development is primarily dependent on changes in language models. Especially with the rapid development of Large Language Models (LLMs), the pattern of text-to-SQL has undergone significant changes. Existing survey work mainly focuses on rule-based and neural-based approaches, but it still lacks a survey of Text-to-SQL with LLMs. In this paper, we survey the large language model enhanced text-to-SQL generations, classifying them into prompt engineering, fine-tuning, pre-trained, and Agent groups according to training strategies. We also summarize datasets and evaluation metrics comprehensively. This survey could help people better understand the pattern, research status, and challenges of LLM-based text-to-SQL generations.
