Benchmarking RL-Enhanced Spatial Indices Against Traditional, Advanced, and Learned Counterparts
Guanli Liu, Renata Borovica-Gajic, Hai Lan, Zhifeng Bao
TL;DR
This paper presents the first modular benchmark framework for RL-enhanced spatial indices (RLESIs), designed to compare RLESIs with traditional, advanced, and learned spatial indices across diverse datasets and workloads. The framework separates index training (ITM) from index building (IBM) and uses grid-search parameter tuning to optimize performance under build-time constraints. Through evaluations on six datasets and twelve indices, the study finds that while RLESIs can reduce query latency with tuning, they generally underperform learned spatial indices and advanced baselines, and they incur substantial training and build-time costs. The work identifies training cost, parameter sensitivity, and generalization gaps as key barriers, and suggests improvements such as cost-based reward functions and incremental learning to make RLESIs more practical in real systems.
Abstract
Reinforcement learning has recently been used to enhance index structures, giving rise to reinforcement learning-enhanced spatial indices (RLESIs) that aim to improve query efficiency during index construction. However, their practical benefits remain unclear due to the lack of unified implementations and comprehensive evaluations, especially in disk-based settings. We present the first modular and extensible benchmark for RLESIs. Built on top of an existing spatial index library, our framework decouples index training from building, supports parameter tuning, and enables consistent comparison with traditional, advanced, and learned spatial indices. We evaluate 12 representative spatial indices across six datasets and diverse workloads, including point, range, kNN, spatial join, and mixed read/write queries. Using latency, I/O, and index statistics as metrics, we find that while RLESIs can reduce query latency with tuning, they consistently underperform learned spatial indices and advanced variants in both query efficiency and index build cost. These findings highlight that although RLESIs offer promising architectural compatibility, their high tuning costs and limited generalization hinder practical adoption.
