Edit and Alphabet-Ordering Sensitivity of Lex-parse
Yuto Nakashima, Dominik Köppl, Mitsuru Funakoshi, Shunsuke Inenaga, Hideo Bannai
TL;DR
This work analyzes the sensitivity of lex-parse to two perturbations: single-character edits and alphabet-ordering changes. It develops tight upper and lower bounds by leveraging Fibonacci words and Lyndon factorizations, establishing that both edit-sensitivity and alphabet-ordering sensitivity of lex-parse scale as $\Theta(\log n)$. The results connect lex-parse behavior to bidirectional macro schemes and deepen understanding of dictionary compressors and repetitiveness measures. Overall, the findings reveal fundamental limits on how input modifications and alphabet ordering influence lex-parse structure and size.
Abstract
We investigate the compression sensitivity [Akagi et al., 2023] of lex-parse [Navarro et al., 2021] for two operations: (1) single character edit and (2) modification of the alphabet ordering, and give tight upper and lower bounds for both operations. For both lower bounds, we use the family of Fibonacci words. For the bounds on edit operations, our analysis makes heavy use of properties of the Lyndon factorization of Fibonacci words to characterize the structure of lex-parse.
