Epistemic Throughput: Fundamental Limits of Attention-Constrained Inference
Lei You
TL;DR
The paper addresses the challenge of inferring truth from large pools of public records when verification attention is scarce. It introduces Attention-Constrained Inference (ACI) and defines epistemic throughput as the maximum reduction in posterior uncertainty under Bayes log-loss, in a two-stage screening ($K$) followed by verification ($B$) framework. The authors derive a JaKoB scaling law, showing that the information gain per window scales as $Gain = I_{ver} B \big( p + c \sqrt{J K / B} \big)$, and prove both converse bounds and achievable policies; tail-leverage analyses reveal when expanding screening is beneficial, with heavy-tailed screening scores enabling polynomial gains. These results inform retrieval-augmented and tool-using systems and motivate a broader paradigm of epistemic communication, where truthfulness is pursued through efficiently reusable verification artifacts and tail-driven screening strategies.
Abstract
Recent generative and tool-using AI systems can surface a large volume of candidates at low marginal cost, yet only a small fraction can be checked carefully. This creates a decoder-side bottleneck: downstream decision-makers must form reliable posteriors from many public records under scarce attention. We formalize this regime via Attention-Constrained Inference (ACI), in which a cheap screening stage processes $K$ records and an expensive verification stage can follow up on at most $B$ of them. Under Bayes log-loss, we study the maximum achievable reduction in posterior uncertainty per window, which we call \emph{epistemic throughput}. Our main result is a ``JaKoB'' scaling law showing that epistemic throughput has a baseline term that grows linearly with verification and prevalence, and an additional \emph{information-leverage} term that scales as $\sqrt{JKB}$, where $J$ summarizes screening quality. Thus, expanding cheap screening can nonlinearly amplify scarce verification, even when informative records are rare. We further show that this scaling is tight in a weak-screening limit, and that in the sparse-verification regime ($B \ll K$), substantial leverage requires heavy-tailed score distributions; for light-tailed scores the amplification is only logarithmic.
