Formal Languages and TQFTs with Defects
Luisa Boateng, Matilde Marcolli
TL;DR
The paper extends the Boolean 1D TQFT with defects associated to a finite state automaton to a categorical setting: it proves that the construction is functorial with respect to transducers and relates subregular language classes to cohomological data on the TQFT. It then generalizes the construction to context-free grammars through a categorical Chomsky–Schützenberger framework based on operads of spliced arrows and of cobordisms with defects, so that the resulting TQFTs are morphisms of operads. Subregular classes such as the strictly $k$-local languages ${\rm SL}_k$ and the locally $k$-testable languages ${\rm LT}_k$ yield additional structures on the TQFTs, for example cohomologies of defect operators. The approach further extends to categorical FSAs and CF grammars, with tree-contour grammars providing a universal Dyck-language basis, and the paper outlines future directions toward higher-dimensional automata and 2D TQFTs with defects.
Abstract
A construction that assigns a Boolean 1D TQFT with defects to a finite state automaton was recently developed by Gustafson, Im, Kaldawy, Khovanov, and Lihn. We show that the construction is functorial with respect to the category of finite state automata with transducers as morphisms. Certain classes of subregular languages correspond to additional cohomological structures on the associated TQFTs. We also show that the construction generalizes to context-free grammars through a categorical version of the Chomsky-Schützenberger representation theorem, due to Melliès and Zeilberger. The corresponding TQFTs are then described as morphisms of colored operads on an operad of cobordisms with defects.
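To make the starting point concrete, the following is a minimal sketch of how a finite state automaton can be read as a Boolean 1D TQFT with defects. It assumes the usual reading of the construction: a point is sent to the free Boolean module on the set of states, a defect labeled by a letter acts by the Boolean transition matrix of that letter, and the two endpoints of an interval are given by the initial-state vector and the accepting-state covector. The function names and the example automaton (words over {a, b} with an even number of a's) are illustrative assumptions and are not taken from the paper.

```python
# Hypothetical sketch: Boolean matrices over the semiring ({False, True}, or, and).

def bool_matmul(a, b):
    """Product of Boolean matrices: entry (i, j) is OR_k (a[i][k] AND b[k][j])."""
    inner = len(b)
    cols = len(b[0])
    return [[any(row[k] and b[k][j] for k in range(inner)) for j in range(cols)]
            for row in a]

def identity(n):
    """n x n Boolean identity matrix (the operator of a defect-free interval)."""
    return [[i == j for j in range(n)] for i in range(n)]

def word_operator(transitions, n, word):
    """Compose the defect operators along a word, in reading order."""
    op = identity(n)
    for letter in word:
        op = bool_matmul(op, transitions[letter])
    return op

def evaluate_interval(initial, transitions, accepting, word):
    """Value of an interval decorated by `word`: True iff the automaton accepts it."""
    n = len(initial)
    row = bool_matmul([initial], word_operator(transitions, n, word))[0]
    return any(row[j] and accepting[j] for j in range(n))

def evaluate_circle(transitions, n, word):
    """Value of a circle decorated by `word`: the Boolean trace of its operator,
    True iff some state has a cycle labeled by `word`."""
    op = word_operator(transitions, n, word)
    return any(op[i][i] for i in range(n))

# Example automaton (an assumption for illustration): state 0 = even number of
# a's seen, state 1 = odd. transitions[x][i][j] is True iff i --x--> j.
TRANSITIONS = {
    "a": [[False, True], [True, False]],   # 'a' swaps the two states
    "b": [[True, False], [False, True]],   # 'b' fixes both states
}
INITIAL = [True, False]     # start in state 0
ACCEPTING = [True, False]   # accept in state 0

if __name__ == "__main__":
    for w in ["", "a", "ab", "abab"]:
        print(repr(w), evaluate_interval(INITIAL, TRANSITIONS, ACCEPTING, w))
```

In this sketch the value of an interval recovers membership in the regular language, while the value of a circle depends only on the transition matrices, not on the choice of initial and accepting states.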
