New Compressed Indices for Multijoins on Graph Databases
Diego Arroyuelo, Fabrizio Barisione, Antonio Fariña, Adrián Gómez-Brandón, Gonzalo Navarro
TL;DR
The paper tackles efficient evaluation of worst-case-optimal multijoins on graph databases by proposing compact index structures that approach raw data space while supporting LTJ in $ ilde{O}(Q^*)$ time. It introduces The Ring and RDFCSA, along with URing, to provide strong space-time tradeoffs, and combines them with adaptive variable elimination orders and refined intersection estimators. Empirical results on Wikidata show substantial speedups for obtaining the first 1000 results (up to 13x faster) and robust performance with larger savings in space, often surpassing traditional wco indices. The work highlights adaptive query planning and sophisticated cost estimators as key levers for improving practical performance, while noting open challenges around disk-based storage and dynamic updates.
Abstract
A recent surprising result in the implementation of worst-case-optimal (wco) multijoins in graph databases (specifically, basic graph patterns) is that they can be supported on graph representations that take even less space than a plain representation, and orders of magnitude less space than classical indices, while offering comparable performance. In this paper we uncover a wide set of new wco space-time tradeoffs: we (1) introduce new compact indices that handle multijoins in wco time, and (2) combine them with new query resolution strategies that offer better times in practice. As a result, we improve the average query times of current compact representations by a factor of up to 13 to produce the first 1000 results, and using twice their space, reduce their total average query time by a factor of 2. Our experiments suggest that there is more room for improvement in terms of generating better query plans for multijoins.
