Bounds and Algorithms for Alphabetic Codes and Binary Search Trees
Roberto Bruno, Roberto De Prisco, Alfredo De Santis, Ugo Vaccaro
TL;DR
The paper addresses the cost of navigating ordered spaces under probabilistic models by studying alphabetic codes and binary search trees. It introduces a linear-time construction framework for near-optimal alphabetic codes and derives refined upper bounds on the average length $L_{\min}$, improving on prior bounds such as the Dagan bound. These alphabetic-code insights are then leveraged to obtain tighter upper bounds on the cost of optimal binary search trees, and a linear-time method is presented to produce BSTs whose performance approaches the improved theoretical limits. The work thus links coding-theoretic constructions with search-data-structure optimization, delivering practical linear-time algorithms and tighter theoretical guarantees for both alphabets and BSTs.
Abstract
Alphabetic codes and binary search trees are combinatorial structures that abstract search procedures in ordered sets endowed with probability distributions. In this paper, we design new linear-time algorithms to construct alphabetic codes, and we show that the obtained codes are not too far from being optimal. Moreover, we exploit our results on alphabetic codes to provide new bounds on the average cost of optimal binary search trees. Our results improve on the best-known bounds on the average cost of optimal binary search trees present in the literature.
