Robust Learning-Augmented Dictionaries
Ali Zeynali, Shahin Kamali, Mohammad Hajiesmaili
TL;DR
The paper tackles the problem of designing dictionary data structures that are simultaneously statically optimal and robust to adversarial predictions. It introduces RobustSL, a skip-list implementation that fuses an optimistic (learning-augmented) treap with a pessimistic balanced BST to achieve $\mathcal{O}(m H(p))$ consistency when predictions are accurate and $\mathcal{O}(m \log n)$ robustness in general. The static and dynamic variants, RobustSL_s and RobustSL_d, are proven to provide optimal consistency and robustness, with dynamic operations supported in $O(\log n)$ time and total costs containment under varying prediction quality. Comprehensive experiments on synthetic and real data corroborate the theory, showing RobustSL outperforms prior learned structures under noise and remains competitive when predictions are reliable, highlighting its practical impact for real-world workloads with imperfect predictions.
Abstract
We present the first learning-augmented data structure for implementing dictionaries with optimal consistency and robustness. Our data structure, named RobustSL, is a skip list augmented by predictions of access frequencies of elements in a data sequence. With proper predictions, RobustSL has optimal consistency (achieves static optimality). At the same time, it maintains a logarithmic running time for each operation, ensuring optimal robustness, even if predictions are generated adversarially. Therefore, RobustSL has all the advantages of the recent learning-augmented data structures of Lin, Luo, and Woodruff (ICML 2022) and Cao et al. (arXiv 2023), while providing robustness guarantees that are absent in the previous work. Numerical experiments show that RobustSL outperforms alternative data structures using both synthetic and real datasets.
