A Logic for Expressing Log-Precision Transformers
William Merrill, Ashish Sabharwal
TL;DR
The paper addresses how to formally characterize the reasoning of transformer classifiers that operate with log-precision on context lengths, proposing first-order logic augmented with majority quantifiers, $FO(M)$, as a target descriptive framework. It proves that any log-precision transformer can be translated into an $FO(M)$ sentence, establishing the tightest known upper bound and connecting transformer expressiveness to ${ t TC}^0$ circuit classes. By contrasting with finite-precision transformers, it highlights the necessity of increasing precision to capture uniform attention and broad input dependencies, while offering a mechanistic interpretability angle via a simple logical specification. The results pave the way for a logic-based view of transformer computation and suggest avenues for transformer-complete programming languages and systematic interpretability of neural nets.
Abstract
One way to interpret the reasoning power of transformer-based language models is to describe the types of logical rules they can resolve over some input text. Recently, Chiang et al. (2023) showed that finite-precision transformers can be equivalently expressed in a generalization of first-order logic. However, finite-precision transformers are a weak transformer variant because, as we show, a single head can only attend to a constant number of tokens and, in particular, cannot represent uniform attention. Since attending broadly is a core capability for transformers, we ask whether a minimally more expressive model that can attend universally can also be characterized in logic. To this end, we analyze transformers whose forward pass is computed in $\log n$ precision on contexts of length $n$. We prove that any log-precision transformer can be equivalently expressed as a first-order logic sentence that, in addition to standard universal and existential quantifiers, may also contain majority-vote quantifiers. This is the tightest known upper bound and first logical characterization of log-precision transformers.
