IA2: Leveraging Instance-Aware Index Advisor with Reinforcement Learning for Diverse Workloads

Taiyi Wang; Eiko Yoneki

IA2: Leveraging Instance-Aware Index Advisor with Reinforcement Learning for Diverse Workloads

Taiyi Wang, Eiko Yoneki

TL;DR

The paper tackles the Index Selection Problem (ISP) in databases where the space of candidate indexes is large and workloads vary widely. It introduces IA2, a deep reinforcement learning-based index advisor that uses the TD3-TD-SWAR architecture with instance-aware action masking and a workload-rich state representation to efficiently navigate vast action spaces. The approach is evaluated on the TPC-H benchmark, where IA2 achieves substantial runtime reductions (approximately 40% versus no indexes) and outperforms prior state-of-the-art DRL-based advisors by about 20%. The work demonstrates strong generalization to unseen workloads, rapid training efficiency, and storage-aware optimization, offering a practical and robust solution for diversified database environments.

Abstract

This study introduces the Instance-Aware Index Advisor (IA2), a novel deep reinforcement learning (DRL)-based approach for optimizing index selection in databases facing large action spaces of potential candidates. IA2 introduces the Twin Delayed Deep Deterministic Policy Gradient - Temporal Difference State-Wise Action Refinery (TD3-TD-SWAR) model, enabling efficient index selection by understanding workload-index dependencies and employing adaptive action masking. This method includes a comprehensive workload model, enhancing its ability to adapt to unseen workloads and ensuring robust performance across diverse database environments. Evaluation on benchmarks such as TPC-H reveals IA2's suggested indexes' performance in enhancing runtime, securing a 40% reduction in runtime for complex TPC-H workloads compared to scenarios without indexes, and delivering a 20% improvement over existing state-of-the-art DRL-based index advisors.

IA2: Leveraging Instance-Aware Index Advisor with Reinforcement Learning for Diverse Workloads

TL;DR

Abstract

Paper Structure (22 sections, 4 equations, 6 figures, 1 table, 1 algorithm)

This paper contains 22 sections, 4 equations, 6 figures, 1 table, 1 algorithm.

Introduction
Related Works
Traditional Index Selection Approaches
RL-based Index Selection Approaches
Index Selection Problem
Methodology
Formulation of the DRL Problem
Instance-Aware Deep Reinforcement Learning for Efficient Index Selection
System Framework of IA2
Preprocessing Phase
RL Training and Application Phase
Experiments
Experimental Setting
Experimental Results
Training Efficiency of TD3-TD-SWAR:
...and 7 more sections

Figures (6)

Figure 1: Unique challenges to RL-based Index Advisors due to diverse and complex workloads
Figure 2: Block diagram of TD3-TD-SWAR in temporal difference learning. States and actions sampled from replay buffer are fed into the selector network that predicts the selection probabilities of different dimensions of actions. A selection mask is then generated according to such a selection probability vector.
Figure 3: Framework and working flow of IA2
Figure 4: Workloads' Outline, W1-W7 with the increasing complexity and diverse patterns
Figure 5: Comparative Analysis of IA2's (a) TrainingEfficiency and (b) Action Pruning Efficiency.
...and 1 more figures

IA2: Leveraging Instance-Aware Index Advisor with Reinforcement Learning for Diverse Workloads

TL;DR

Abstract

IA2: Leveraging Instance-Aware Index Advisor with Reinforcement Learning for Diverse Workloads

Authors

TL;DR

Abstract

Table of Contents

Figures (6)