Compass: General Filtered Search across Vector and Structured Data
Chunxiao Ye, Xiao Yan, Eric Lo
TL;DR
Compass addresses the challenge of general filtered search over hybrid vector and structured data by integrating a vector proximity graph with clustered B+-trees for relational attributes, all driven by a shared candidate queue. Its core innovation is a progressive, predicate-aware search that adaptively expands the graph search or consults relational indices to maintain connectivity and predicate satisfaction, without building new specialized indices. Empirical results show Compass outperforming NaviX across conjunctions and disjunctions, while approaching the efficiency of specialized single-attribute indices in single-attribute scenarios, and maintaining DBMS compatibility with modest storage overhead. The approach offers a practical, flexible, and robust solution for truly general filtered search in vector DBMSs, enabling scalable multi-attribute filtering with high recall at competitive throughput.
Abstract
The increasing prevalence of hybrid vector and relational data necessitates efficient, general support for queries that combine high-dimensional vector search with complex relational filtering. However, existing filtered search solutions are fundamentally limited by specialized indices, which restrict arbitrary filtering and hinder integration with general-purpose DBMSs. This work introduces \textsc{Compass}, a unified framework that enables general filtered search across vector and structured data without relying on new index designs. Compass leverages established index structures -- such as HNSW and IVF for vector attributes, and B+-trees for relational attributes -- implementing a principled cooperative query execution strategy that coordinates candidate generation and predicate evaluation across modalities. Uniquely, Compass maintains generality by allowing arbitrary conjunctions, disjunctions, and range predicates, while ensuring robustness even with highly-selective or multi-attribute filters. Comprehensive empirical evaluations demonstrate that Compass consistently outperforms NaviX, the only existing performant general framework, across diverse hybrid query workloads. It also matches the query throughput of specialized single-attribute indices in their favorite settings with only a single attribute involved, all while maintaining full generality and DBMS compatibility. Overall, Compass offers a practical and robust solution for achieving truly general filtered search in vector database systems.
