Prompt Tuning for Natural Language to SQL with Embedding Fine-Tuning and RAG
Jisoo Jang, Tien-Cuong Bui, Yunjun Choi, Wen-Syan Li
TL;DR
The paper tackles NL-to-SQL conversion by introducing Error Correction through Prompt Tuning (ECPT), which combines diagnosis-based prompts, embedding-fine-tuned retrieval, and Retrieval-Augmented Generation to repair SQL errors. Central to ECPT is a three-step process (Diagnose, Write Prescription, Apply Treatment) supported by a fine-tuned Sentence Transformer to organize correction cases and FAISS-based retrieval, enabling targeted corrections guided by external knowledge. Evaluation on the Spider dev set using GPT-3.5-turbo and GPT-4-turbo shows a substantial improvement, achieving up to about $88.18\%$ execution accuracy with full ECPT, a roughly 12% gain over baselines. The work demonstrates the value of integrating error-diagnosis, embedding-focused retrieval, and external knowledge sources to enhance NLIDB reliability, while highlighting avenues for future improvements such as human-in-the-loop checks and cost-aware RAG deployment.
Abstract
This paper introduces an Error Correction through Prompt Tuning for NL-to-SQL, leveraging the latest advancements in generative pre-training-based LLMs and RAG. Our work addresses the crucial need for efficient and accurate translation of natural language queries into SQL expressions in various settings with the growing use of natural language interfaces. We explore the evolution of NLIDBs from early rule-based systems to advanced neural network-driven approaches. Drawing inspiration from the medical diagnostic process, we propose a novel framework integrating an error correction mechanism that diagnoses error types, identifies their causes, provides fixing instructions, and applies these corrections to SQL queries. This approach is further enriched by embedding fine-tuning and RAG, which harnesses external knowledge bases for improved accuracy and transparency. Through comprehensive experiments, we demonstrate that our framework achieves a significant 12 percent accuracy improvement over existing baselines, highlighting its potential to revolutionize data access and handling in contemporary data-driven environments.
