QPAD: Quantile-Preserving Approximate Dimension Reduction for Nearest Neighbors Preservation in High-Dimensional Vector Search
Jiuzhou Fu, Dongfang Zhao
TL;DR
QPAD addresses the problem of deploying dimensionality reduction in high-dimensional vector search without sacrificing nearest-neighbor fidelity. It introduces a margin-based, unsupervised objective that preserves true $k$-NN relationships by maximizing the lower-tail distance gaps, with a soft orthogonality penalty that diversifies projection directions. The paper provides two algorithms, Naive-QPAD and Fast-QPAD; Fast-QPAD achieves $O(N\log N + Nn)$ per-evaluation complexity. The theoretical guarantees include Lipschitz continuity, KKT stationarity, and quantization-stable neighborhood preservation. Empirically, QPAD consistently outperforms 11 standard DR methods across five real-world domains and remains indexable across several ANN backends, which makes it practical for scalable, retrieval-centric vector databases.
Abstract
High-dimensional vector embeddings are widely used in retrieval systems, but they often suffer from noise, the curse of dimensionality, and slow runtime. Even so, dimensionality reduction (DR) is rarely applied because it tends to distort the nearest-neighbor (NN) structure that is critical for search. Existing DR techniques such as PCA and UMAP optimize global or manifold-preserving criteria rather than retrieval-specific objectives. We present QPAD (Quantile-Preserving Approximate Dimension Reduction), an unsupervised DR method that explicitly preserves approximate NN relations by maximizing the margin between $k$-NNs and non-$k$-NNs under a soft orthogonality constraint. We analyze its complexity and favorable properties. This design enables QPAD to retain ANN-relevant geometry without supervision or changes to the original embedding model, while scaling to large vector collections and remaining indexable for ANN search. Experiments across five domains show that QPAD consistently outperforms eleven standard DR methods in preserving neighborhood structure, enabling more accurate search in reduced dimensions.
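To make the idea of a margin between $k$-NNs and non-$k$-NNs under a soft orthogonality constraint more concrete, the following NumPy sketch shows one plausible reading of such an objective. It is not the authors' formulation: the function names `knn_indices` and `margin_objective`, the quantile level `alpha`, the penalty weight `lam`, the use of squared Euclidean distances, and the brute-force $O(N^2)$ evaluation are all assumptions made for illustration, and the sketch does not implement Naive-QPAD or Fast-QPAD.

```python
import numpy as np

def knn_indices(X, k):
    """Exact k-NN indices in the original space (brute force, self excluded)."""
    sq = np.sum(X**2, axis=1)
    d2 = sq[:, None] + sq[None, :] - 2.0 * X @ X.T
    np.fill_diagonal(d2, np.inf)  # a point is never its own neighbor
    return np.argsort(d2, axis=1)[:, :k]

def margin_objective(W, X, nbrs, alpha=0.1, lam=1.0):
    """Illustrative score (higher is better): mean margin minus orthogonality penalty.

    W    : (n, D) linear projection to n dimensions (assumed form)
    X    : (N, D) original vectors
    nbrs : (N, k) indices of the true k-NNs in the original space
    """
    N = X.shape[0]
    Y = X @ W.T  # projected points, shape (N, n)
    sq = np.sum(Y**2, axis=1)
    d2 = np.maximum(sq[:, None] + sq[None, :] - 2.0 * Y @ Y.T, 0.0)

    # Mask of "neighbor or self" entries; everything else is a non-neighbor.
    is_nbr = np.zeros((N, N), dtype=bool)
    is_nbr[np.arange(N)[:, None], nbrs] = True
    np.fill_diagonal(is_nbr, True)

    # Distance to the farthest true neighbor after projection.
    far_nbr = np.take_along_axis(d2, nbrs, axis=1).max(axis=1)
    # Lower-tail (alpha-quantile) distance to non-neighbors after projection.
    near_non = np.nanquantile(np.where(is_nbr, np.nan, d2), alpha, axis=1)
    margin = np.mean(near_non - far_nbr)

    # Soft orthogonality penalty: squared Frobenius norm of W W^T - I.
    n = W.shape[0]
    penalty = np.sum((W @ W.T - np.eye(n)) ** 2)
    return margin - lam * penalty
```

In this sketch a projection scores well when, for each point, the lower tail of its distances to non-neighbors stays above the distance to its farthest true neighbor, while the penalty term discourages redundant projection directions without enforcing exact orthogonality. A gradient-free or gradient-based optimizer over `W` could then maximize the score; that choice is outside what the text above specifies.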
