Optimal Approximate Minimization of One-Letter Weighted Finite Automata
Clara Lacroce, Borja Balle, Prakash Panangaden, Guillaume Rabusseau
TL;DR
The paper addresses the problem of optimally approximating a weighted finite automaton (WFA) under a fixed state bound by reframing the task as a low-rank Hankel operator approximation and applying Adamyan-Arov-Krein (AAK) theory to achieve a spectral-norm optimum. It confines to irredundant WFAs with real weights over a one-letter alphabet, deriving a closed-form symbol, an algorithm, and a rigorous error analysis, all executable in polynomial time. The core contributions include a detailed linkage between WFA parameters and Hankel symbols, a constructive procedure to obtain the size-$k$ WFA that minimizes the spectral norm error with bound $\ig|\mathbf H - \mathbf G\big| = \sigma_k(\mathbf H)$, and an explicit example demonstrating the method. This framework supports principled model compression and learning, with extensions toward removing key assumptions and extending to broader alphabets and model classes in future work.
Abstract
In this paper, we study the approximate minimization problem of weighted finite automata (WFAs): to compute the best possible approximation of a WFA given a bound on the number of states. By reformulating the problem in terms of Hankel matrices, we leverage classical results on the approximation of Hankel operators, namely the celebrated Adamyan-Arov-Krein (AAK) theory. We solve the optimal spectral-norm approximate minimization problem for irredundant WFAs with real weights, defined over a one-letter alphabet. We present a theoretical analysis based on AAK theory, and bounds on the quality of the approximation in the spectral norm and $\ell^2$ norm. Moreover, we provide a closed-form solution, and an algorithm, to compute the optimal approximation of a given size in polynomial time.
