Characterizing the Expressivity of Fixed-Precision Transformer Language Models
Jiaoda Li, Ryan Cotterell
TL;DR
This work provides a precise characterization of the expressivity of fixed-precision transformers with soft attention, strict masking, and no positional encodings, proving equivalence to the past-fragment of linear temporal logic $\text{LTL}[\mathrel{P}]$ and to $\text{PFO}^2[<]$ (via PODFAs and $\mathcal{R}$-trivial monoids). It further shows that transformer language models share this expressivity, and it offers a constructive bridge between logical formalisms and transformer architectures. Empirical evaluations on a range of formal languages demonstrate perfect generalization for tasks within the characterized capacity and consistent failure on languages beyond it, aligning theory with practice. The results illuminate the limitations of fixed-precision transformers and provide a principled framework for understanding when such models can reliably generalize across sequence lengths.
Abstract
Transformer-based language models (LMs) have achieved widespread empirical success, but their theoretical expressive power remains only partially understood. In this work, we analyze a restricted idealization of fixed-precision transformers with strict future masking, soft attention, and no positional encodings. We establish that this class of models is exactly as expressive as a specific fragment of linear temporal logic that contains only a single temporal operator: the past operator. We further connect this fragment to established classes in formal language theory, automata theory, and algebra, yielding a unified framework for understanding transformer expressivity under this idealization. Finally, we present empirical results that align closely with our theory: transformers trained on languages within their characterized expressive capacity generalize reliably across sequence lengths, while they consistently fail to generalize on languages beyond it.
