A Survey of Learned Indexes for the Multi-dimensional Space

Abdullah Al-Mamun; Hao Wu; Qiyang He; Jianguo Wang; Walid G. Aref

A Survey of Learned Indexes for the Multi-dimensional Space

Abdullah Al-Mamun, Hao Wu, Qiyang He, Jianguo Wang, Walid G. Aref

TL;DR

This survey presents a taxonomy that classifies and categorizes both learned one- and multi-dimensional indexes, and surveys the existing literature on learned indexes according to this taxonomy with an emphasis on learned multi-dimensional index structures.

Abstract

A recent research trend involves treating database index structures as Machine Learning (ML) models. In this domain, single or multiple ML models are trained to learn the mapping from keys to positions inside a data set. This class of indexes is known as "Learned Indexes." Learned indexes have demonstrated improved search performance and reduced space requirements for one-dimensional data. The concept of one-dimensional learned indexes has naturally been extended to multi-dimensional (e.g., spatial) data, leading to the development of "Learned Multi-dimensional Indexes". This survey focuses on learned multi-dimensional index structures. Specifically, it reviews the current state of this research area, explains the core concepts behind each proposed method, and classifies these methods based on several well-defined criteria. We present a taxonomy that classifies and categorizes each learned multi-dimensional index, and survey the existing literature on learned multi-dimensional indexes according to this taxonomy. Additionally, we present a timeline to illustrate the evolution of research on learned indexes. Finally, we highlight several open challenges and future research directions in this emerging and highly active field.

A Survey of Learned Indexes for the Multi-dimensional Space

TL;DR

Abstract

Paper Structure (105 sections, 4 figures, 4 tables)

This paper contains 105 sections, 4 figures, 4 tables.

Introduction
Related Surveys and Tutorials
Evolution of Learned Indexes
Indexing the Learned Models
Learned Immutable Indexes
The One-dimensional Case
Pure Learned Indexes
RMI kraska2018case
Hybrid Learned Indexes
RS kipf2020radixspline
The Multi-dimensional Case
Pure Learned Indexes in the Projected Space
SageDB-LMI kraska2019sagedb
ZM-index wang2019learned
ML-index davitkova2020ml
...and 90 more sections

Figures (4)

Figure 1: Spectrum of learned indexes
Figure 2: Taxonomy of learned indexes. The end of a branch indicates that there are no papers in that category as of the time this article has been written. An asterisk (*) symbol is used if the index natively supports concurrency. The hybrid learned indexes are categorized based on their underlying traditional data structures.
Figure 3: Illustration of the taxonomy criteria
Figure 4: Timeline of the evolution of learned indexes. Lines connecting between the various learned multi-dimensional indexes reflect dependence of these indexes on earlier work.

A Survey of Learned Indexes for the Multi-dimensional Space

TL;DR

Abstract

A Survey of Learned Indexes for the Multi-dimensional Space

Authors

TL;DR

Abstract

Table of Contents

Figures (4)