The Unreasonable Effectiveness of LLMs for Query Optimization

Peter Akioyamen; Zixuan Yi; Ryan Marcus

TL;DR

The paper shows that, surprisingly, LLM embeddings of query text contain useful semantic information for query optimization: a simple binary classifier that chooses between alternative query plans, trained on only a small number of labeled embedded query vectors, can outperform existing heuristic systems.

Abstract

Recent work in database query optimization has used complex machine learning strategies, such as customized reinforcement learning schemes. Surprisingly, we show that LLM embeddings of query text contain useful semantic information for query optimization. Specifically, we show that a simple binary classifier deciding between alternative query plans, trained only on a small number of labeled embedded query vectors, can outperform existing heuristic systems. Although we only present some preliminary results, an LLM-powered query optimizer could provide significant benefits, both in terms of performance and simplicity.
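To make the idea of a binary classifier over embedded query vectors concrete, here is a minimal sketch. It is not the authors' implementation, and everything in it is an assumption made for illustration: the choice of logistic regression, the helper names (`train_classifier`, `use_alternative_plan`, `make_synthetic_data`), and above all the data, which are synthetic 4-dimensional vectors standing in for real LLM embeddings of query text. Label 1 is assumed to mean "the alternative plan was faster than the default plan".

```python
import math
import random


def sigmoid(z):
    # Numerically stable logistic function.
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    ez = math.exp(z)
    return ez / (1.0 + ez)


def train_classifier(vectors, labels, epochs=200, lr=0.1):
    # Plain batch gradient descent on the logistic loss.
    # ASSUMPTION: logistic regression is used here only as one example
    # of a "simple binary classifier".
    dim = len(vectors[0])
    w = [0.0] * dim
    b = 0.0
    n = len(vectors)
    for _ in range(epochs):
        grad_w = [0.0] * dim
        grad_b = 0.0
        for x, y in zip(vectors, labels):
            err = sigmoid(sum(wi * xi for wi, xi in zip(w, x)) + b) - y
            for i in range(dim):
                grad_w[i] += err * x[i]
            grad_b += err
        w = [wi - lr * g / n for wi, g in zip(w, grad_w)]
        b -= lr * grad_b / n
    return w, b


def use_alternative_plan(w, b, embedding):
    # True means: predict that the alternative plan beats the default plan.
    return sigmoid(sum(wi * xi for wi, xi in zip(w, embedding)) + b) > 0.5


def make_synthetic_data(n_per_class=20, dim=4, seed=0):
    # ASSUMPTION: two well-separated Gaussian clusters stand in for real
    # query embeddings; a real system would embed the SQL text with an LLM
    # and label each query by timing both candidate plans.
    rng = random.Random(seed)
    vectors, labels = [], []
    for label, center in ((0, -1.0), (1, 1.0)):
        for _ in range(n_per_class):
            vectors.append([rng.gauss(center, 0.5) for _ in range(dim)])
            labels.append(label)
    return vectors, labels


vectors, labels = make_synthetic_data()
w, b = train_classifier(vectors, labels)
predictions = [use_alternative_plan(w, b, x) for x in vectors]
accuracy = sum(int(p) == y for p, y in zip(predictions, labels)) / len(labels)
print(f"training accuracy on synthetic data: {accuracy:.2f}")
```

In a real setting, the decision returned by `use_alternative_plan` would determine which of the two candidate plans is executed for a new query. The synthetic clusters are deliberately easy to separate, so the accuracy printed here says nothing about how such a classifier performs on real workloads.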
