Optimal Non-Adaptive Cell Probe Dictionaries and Hashing
Kasper Green Larsen, Rasmus Pagh, Giuseppe Persiano, Toniann Pitassi, Kevin Yeo, Or Zamir
TL;DR
The paper resolves the static dictionary and membership problems in the non-adaptive cell-probe model by giving a simple data structure that stores $n$ key–value pairs in $s$ cells of $w=\Theta(\log u)$ bits and answers queries in $t=O\left(\dfrac{\log(u/n)}{\log(s/n)}\right)$ probes, together with matching lower bounds. It shows that non-adaptivity incurs a fundamental time-space trade-off and establishes an adaptivity gap: a single round of adaptivity suffices to achieve constant-time queries, while non-adaptive schemes require super-constant time. The construction leverages $(\le n)$-non-contractive expanders and Hall's marriage theorem, and its Hashing extension yields an optimal non-adaptive evaluation of $n$-wise independent hash functions in the cell-probe model using similar probe counts. Although non-explicit, these results imply that explicit expanders with suitable parameters would enable RAM implementations, and they highlight the pivotal role of expanders and dispersers in non-adaptive data-structure design.
Abstract
We present a simple and provably optimal non-adaptive cell probe data structure for the static dictionary problem. Our data structure supports storing a set of n key-value pairs from [u]x[u] using s words of space and answering key lookup queries in t = O(lg(u/n)/ lg(s/n)) nonadaptive probes. This generalizes a solution to the membership problem (i.e., where no values are associated with keys) due to Buhrman et al. We also present matching lower bounds for the non-adaptive static membership problem in the deterministic setting. Our lower bound implies that both our dictionary algorithm and the preceding membership algorithm are optimal, and in particular that there is an inherent complexity gap in these problems between no adaptivity and one round of adaptivity (with which hashing-based algorithms solve these problems in constant time). Using the ideas underlying our data structure, we also obtain the first implementation of a n-wise independent family of hash functions with optimal evaluation time in the cell probe model.
