Intrinsic Stability Limits of Autoregressive Reasoning: Structural Consequences for Long-Horizon Execution
Hsien-Jyh Liao
TL;DR
This work argues that long-horizon autoregressive reasoning suffers from an intrinsic process-level instability, not just task complexity. It formalizes a stability theorem (Theorem A) showing that the decision advantage along a single autoregressive trajectory decays exponentially with horizon, implying a finite stability horizon $L^*$ and necessitating segmentation into sub-edges to maintain coherence. The authors show that stable long-horizon reasoning naturally leads to graph-like topologies, such as DAGs, with stabilization nodes that consolidate state and control entropy. Empirically, synthetic tasks and TextWorld experiments reveal performance cliffs and demonstrate that structural governance—through segmentation and resets—mitigates instability, whileBranching-based search approaches incur exponential sample costs. The results advocate a shift from pure scaling to structural governance in designing future reasoning systems, with diagnostic indicators for monitoring stability and guidance for endogenous stabilization mechanisms.
Abstract
Large language models (LLMs) demonstrate remarkable reasoning capabilities, yet their performance often deteriorates sharply in long-horizon tasks, exhibiting systematic breakdown beyond certain scales. Conventional explanations primarily attribute this phenomenon to task complexity, such as combinatorial search explosion or long-term credit assignment challenges. In this work, we argue that these explanations are incomplete: even in linear, unbranched tasks without semantic ambiguity, autoregressive execution is subject to an intrinsic stability limit. We propose that the fundamental constraint on long-horizon reasoning arises from process-level instability in autoregressive generation rather than solely from search or task complexity, reframing long-horizon reasoning as a problem of structural governance. We derive Theorem~A, showing that decision advantage in single-path autoregressive reasoning decays exponentially with execution length, imposing a fundamental bound on maintainable reasoning chains. This result implies a structural consequence: stable long-horizon reasoning requires discrete segmentation, naturally inducing graph-like execution structures such as directed acyclic graphs (DAGs). Empirical studies in both synthetic environments and real TextWorld tasks reveal observable performance cliffs consistent with theoretical predictions. Our findings provide a dynamical perspective on long-horizon reasoning failure and suggest new limitations on maintaining long-term coherence under purely autoregressive architectures. Furthermore, we highlight that short-horizon evaluation protocols may obscure structural instability, indicating a potential shift from scaling toward structured governance in future reasoning systems.
