Foundations of Vector Retrieval
Sebastian Bruch
TL;DR
Foundations of Vector Retrieval surveys the core theory and algorithmic toolkit for retrieving top-$k$ similar vectors across modalities. It cohesively presents four main algorithmic families—branch-and-bound trees, locality-sensitive hashing, graph-based methods, and clustering—and augments them with a detailed treatment of vector compression via quantization (PQ and AQ) to enable scalable indexing. A central thread is the tension between exactness and scalability in high dimensions, explored through intrinsic dimensionality, doubling dimension, and instability results, while offering principled guarantees under doubling-measure assumptions and constructive bounds for various methods. The work emphasizes the relevance of vector retrieval to real-time search, recommender systems, and retrieval-augmented generation, and frames future directions around improved theory-experiment alignment, scalable graph structures, and advanced compression techniques for billion-scale vector databases.
Abstract
Vectors are universal mathematical objects that can represent text, images, speech, or a mix of these data modalities. That happens regardless of whether data is represented by hand-crafted features or learnt embeddings. Collect a large enough quantity of such vectors and the question of retrieval becomes urgently relevant: Finding vectors that are more similar to a query vector. This monograph is concerned with the question above and covers fundamental concepts along with advanced data structures and algorithms for vector retrieval. In doing so, it recaps this fascinating topic and lowers barriers of entry into this rich area of research.
