VeriEQL: Bounded Equivalence Verification for Complex SQL Queries with Integrity Constraints
Yang He, Pinhan Zhao, Xinyu Wang, Yuepeng Wang
TL;DR
VeriEQL tackles the practical problem of bounded SQL query equivalence under integrity constraints by introducing an SMT-based encoding that models query semantics with symbolic tuples over integers and uninterpreted functions. It formalizes a larger, real-world fragment of SQL (including ORDER BY, CASE, WITH, and complex constraints) and proves correctness with respect to a three-valued, NULL-aware semantics. The approach builds a symbolic database under a bound, encodes integrity constraints and query operators into a single SMT formula, and uses SAT/SMT solving to either certify bounded equivalence or produce counterexamples. Empirically, VeriEQL dramatically outperforms state-of-the-art baselines across 24,455 benchmark pairs, proving or disproving significantly more cases and revealing real bugs in systems like MySQL and Calcite. The work demonstrates the practical impact of formal equivalence checking for real-world SQL, including automated debugging, test generation, and validation of rewrite rules.
Abstract
The task of SQL query equivalence checking is important in various real-world applications (including query rewriting and automated grading) that involve complex queries with integrity constraints; yet, state-of-the-art techniques are very limited in their capability of reasoning about complex features (e.g., those that involve sorting, case statement, rich integrity constraints, etc.) in real-life queries. To the best of our knowledge, we propose the first SMT-based approach and its implementation, VeriEQL, capable of proving and disproving bounded equivalence of complex SQL queries. VeriEQL is based on a new logical encoding that models query semantics over symbolic tuples using the theory of integers with uninterpreted functions. It is simple yet highly practical -- our comprehensive evaluation on over 20,000 benchmarks shows that VeriEQL outperforms all state-of-the-art techniques by more than one order of magnitude in terms of the number of benchmarks that can be proved or disproved. VeriEQL can also generate counterexamples that facilitate many downstream tasks (such as finding serious bugs in systems like MySQL and Apache Calcite).
