Old and New Results on Alphabetic Codes

Roberto Bruno; Roberto De Prisco; Ugo Vaccaro

Old and New Results on Alphabetic Codes

Roberto Bruno, Roberto De Prisco, Ugo Vaccaro

TL;DR

The paper surveys alphabetic codes and their deep connections to comparison-based search, detailing classical optimal-code algorithms with their time complexities ($O(n^3)$ for Gilbert–Moore, $O(n^2)$ for Knuth, and $O(n\log n)$ for Hu–Tucker and Garsia–Wachs) and the foundational existence conditions from Yeung, Nakatsu, and Sheinwald, all relative to the entropy bound $H(P)$. It then surveys a broad landscape of variations and generalizations, including height limits, partial orders, AIFV variants, and $k$-ary trees, along with linear-time solutions for special cases and extensions to nonstandard objective functions. The survey synthesizes upper-bounding results on $E[C]$ around $H(P)$ plus corrective terms, highlighting both early bounds and modern refinements (e.g., $E[C]<H(P)+2$, $E[C]\le H(P)+1-p_1-p_n$ for dyadic distributions). Collectively, the work maps a rich set of algorithmic ideas, theoretical conditions, and practical applications across data compression, routing, and search, and it identifies numerous open problems and directions for future research.

Abstract

This comprehensive survey examines the field of alphabetic codes, tracing their development from the 1960s to the present day. We explore classical alphabetic codes and their variants, analyzing their properties and the underlying mathematical and algorithmic principles. The paper covers the fundamental relationship between alphabetic codes and comparison-based search procedures and their applications in data compression, routing, and testing. We review optimal alphabetic code construction algorithms, necessary and sufficient conditions for their existence, and upper bounds on the average code length of optimal alphabetic codes. The survey also discusses variations and generalizations of the classical problem of constructing minimum average length alphabetic codes. By elucidating both classical results and recent findings, this paper aims to serve as a valuable resource for researchers and students, concluding with promising future research directions in this still-active field.

Old and New Results on Alphabetic Codes

TL;DR

The paper surveys alphabetic codes and their deep connections to comparison-based search, detailing classical optimal-code algorithms with their time complexities (

for Gilbert–Moore,

for Knuth, and

for Hu–Tucker and Garsia–Wachs) and the foundational existence conditions from Yeung, Nakatsu, and Sheinwald, all relative to the entropy bound

. It then surveys a broad landscape of variations and generalizations, including height limits, partial orders, AIFV variants, and

-ary trees, along with linear-time solutions for special cases and extensions to nonstandard objective functions. The survey synthesizes upper-bounding results on

around

plus corrective terms, highlighting both early bounds and modern refinements (e.g.,

for dyadic distributions). Collectively, the work maps a rich set of algorithmic ideas, theoretical conditions, and practical applications across data compression, routing, and search, and it identifies numerous open problems and directions for future research.

Old and New Results on Alphabetic Codes

TL;DR

Abstract

Old and New Results on Alphabetic Codes

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (21)

Theorems & Definitions (17)