Probabilistic automatic complexity of finite strings
Kenneth Gill
TL;DR
This work introduces $A_P(x)$, the probabilistic automatic complexity of a finite string, defined via PFAs as the smallest number of states that yield a unique most-likely accepted string of a given length; it also defines a gap-parameterized variant $A_{P,δ}(x)$. The authors establish tight bounds relating $A_P$ to $A_N$ and $A_D$, prove δ-computability of $A_{P,δ}$ for all $δ$, and present a complete binary-string classification for strings with $A_P=2$, using an affine iterated function system (IFS) correspondence to analyze witnesses. They develop a computable-analysis framework proving uniform computability of $A_{P,δ}$ across inputs and noting a countable set of discontinuities tied to maximal gaps; they also explore extensions, variations, and potential refinements via GPFA generalizations, gap-structure functions, witness-bit-length, and measures over witnesses. Together, these results illuminate the algorithmic content and computability of probabilistic automata-based string complexity, and suggest rich avenues for further refinement and applications in formal language and complexity theory.
Abstract
We introduce a new complexity measure for finite strings using probabilistic finite-state automata (PFAs), in the same spirit as existing notions employing DFAs and NFAs, and explore its properties. The PFA complexity $A_P(x)$ is the least number of states of a PFA for which $x$ is the most likely string of its length to be accepted. The variant $A_{P,δ}(x)$ adds a real-valued parameter $δ$ specifying a required lower bound on the gap in acceptance probabilities between $x$ and other strings. We prove $A_{P,δ}$ is $δ$-computable for all $δ$, relate $A_P$ to the DFA and NFA complexities, and obtain a complete classification of binary strings with $A_P=2$. Finally, we discuss several other variations on $A_P$ with a view to obtaining additional desirable properties.
