Table of Contents
Fetching ...

IA2: Leveraging Instance-Aware Index Advisor with Reinforcement Learning for Diverse Workloads

Taiyi Wang, Eiko Yoneki

TL;DR

The paper tackles the Index Selection Problem (ISP) in databases where the space of candidate indexes is large and workloads vary widely. It introduces IA2, a deep reinforcement learning-based index advisor that uses the TD3-TD-SWAR architecture with instance-aware action masking and a workload-rich state representation to efficiently navigate vast action spaces. The approach is evaluated on the TPC-H benchmark, where IA2 achieves substantial runtime reductions (approximately 40% versus no indexes) and outperforms prior state-of-the-art DRL-based advisors by about 20%. The work demonstrates strong generalization to unseen workloads, rapid training efficiency, and storage-aware optimization, offering a practical and robust solution for diversified database environments.

Abstract

This study introduces the Instance-Aware Index Advisor (IA2), a novel deep reinforcement learning (DRL)-based approach for optimizing index selection in databases facing large action spaces of potential candidates. IA2 introduces the Twin Delayed Deep Deterministic Policy Gradient - Temporal Difference State-Wise Action Refinery (TD3-TD-SWAR) model, enabling efficient index selection by understanding workload-index dependencies and employing adaptive action masking. This method includes a comprehensive workload model, enhancing its ability to adapt to unseen workloads and ensuring robust performance across diverse database environments. Evaluation on benchmarks such as TPC-H reveals IA2's suggested indexes' performance in enhancing runtime, securing a 40% reduction in runtime for complex TPC-H workloads compared to scenarios without indexes, and delivering a 20% improvement over existing state-of-the-art DRL-based index advisors.

IA2: Leveraging Instance-Aware Index Advisor with Reinforcement Learning for Diverse Workloads

TL;DR

The paper tackles the Index Selection Problem (ISP) in databases where the space of candidate indexes is large and workloads vary widely. It introduces IA2, a deep reinforcement learning-based index advisor that uses the TD3-TD-SWAR architecture with instance-aware action masking and a workload-rich state representation to efficiently navigate vast action spaces. The approach is evaluated on the TPC-H benchmark, where IA2 achieves substantial runtime reductions (approximately 40% versus no indexes) and outperforms prior state-of-the-art DRL-based advisors by about 20%. The work demonstrates strong generalization to unseen workloads, rapid training efficiency, and storage-aware optimization, offering a practical and robust solution for diversified database environments.

Abstract

This study introduces the Instance-Aware Index Advisor (IA2), a novel deep reinforcement learning (DRL)-based approach for optimizing index selection in databases facing large action spaces of potential candidates. IA2 introduces the Twin Delayed Deep Deterministic Policy Gradient - Temporal Difference State-Wise Action Refinery (TD3-TD-SWAR) model, enabling efficient index selection by understanding workload-index dependencies and employing adaptive action masking. This method includes a comprehensive workload model, enhancing its ability to adapt to unseen workloads and ensuring robust performance across diverse database environments. Evaluation on benchmarks such as TPC-H reveals IA2's suggested indexes' performance in enhancing runtime, securing a 40% reduction in runtime for complex TPC-H workloads compared to scenarios without indexes, and delivering a 20% improvement over existing state-of-the-art DRL-based index advisors.
Paper Structure (22 sections, 4 equations, 6 figures, 1 table, 1 algorithm)

This paper contains 22 sections, 4 equations, 6 figures, 1 table, 1 algorithm.

Figures (6)

  • Figure 1: Unique challenges to RL-based Index Advisors due to diverse and complex workloads
  • Figure 2: Block diagram of TD3-TD-SWAR in temporal difference learning. States and actions sampled from replay buffer are fed into the selector network that predicts the selection probabilities of different dimensions of actions. A selection mask is then generated according to such a selection probability vector.
  • Figure 3: Framework and working flow of IA2
  • Figure 4: Workloads' Outline, W1-W7 with the increasing complexity and diverse patterns
  • Figure 5: Comparative Analysis of IA2's (a) TrainingEfficiency and (b) Action Pruning Efficiency.
  • ...and 1 more figures