A New Paradigm in Tuning Learned Indexes: A Reinforcement Learning Enhanced Approach
Taiyi Wang, Liang Liang, Guang Yang, Thomas Heinis, Eiko Yoneki
TL;DR
This work addresses the challenging problem of tuning learned index structures (LIS) to workloads and distributions. It introduces LITune, an end-to-end automatic tuner that combines a tailor-made Deep Reinforcement Learning pipeline with an online-offline (O2) updating mechanism and a safety-focused ET-MDP framework. Through offline meta-trained initialization and online adaptation, LITune achieves substantial runtime reductions and throughput gains, while maintaining safety during exploration and rapid responsiveness to data distribution shifts. The approach is validated on multiple LIS instances and workloads, demonstrating strong adaptability, efficiency, and robustness with significant improvements over traditional tuning methods. The findings suggest LITune can significantly broaden the practical deployment of LIS in real-world systems by automating complex tuning under dynamic conditions.
Abstract
Learned Index Structures (LIS) have significantly advanced data management by leveraging machine learning models to optimize data indexing. However, designing these structures often involves critical trade-offs, making it challenging for both designers and end-users to find an optimal balance tailored to specific workloads and scenarios. While some indexes offer adjustable parameters that demand intensive manual tuning, others rely on fixed configurations based on heuristic auto-tuners or expert knowledge, which may not consistently deliver optimal performance. This paper introduces LITune, a novel framework for end-to-end automatic tuning of Learned Index Structures. LITune employs an adaptive training pipeline equipped with a tailor-made Deep Reinforcement Learning (DRL) approach to ensure stable and efficient tuning. To accommodate long-term dynamics arising from online tuning, we further enhance LITune with an on-the-fly updating mechanism termed the O2 system. These innovations allow LITune to effectively capture state transitions in online tuning scenarios and dynamically adjust to changing data distributions and workloads, marking a significant improvement over other tuning methods. Our experimental results demonstrate that LITune achieves up to a 98% reduction in runtime and a 17-fold increase in throughput compared to default parameter settings given a selected Learned Index instance. These findings highlight LITune's effectiveness and its potential to facilitate broader adoption of LIS in real-world applications.
