Algorithmic Thinking Theory
MohammadHossein Bateni, Vincent Cohen-Addad, Yuzhou Gu, Silvio Lattanzi, Simon Meierhans, Christopher Mohri
TL;DR
This work develops a formal theory of algorithmic thinking for reasoning with large language models by modeling a reasoning oracle and a context-dependent transfer function. It analyzes three core strategies—Branching, Genetic, and Random Sampling—under decaying and uniform models, establishing optimality results, monotonicity implications, and convergence rates. The framework captures how adding correct solutions to context can boost performance but may exhibit diminishing returns or detrimental correlations, guiding efficient design of iterative reasoning pipelines. The results provide rigorous benchmarks for designing and analyzing next-generation reasoning methods that synthesize information across multiple intermediate outputs rather than rely on single-shot accuracy.
Abstract
Large language models (LLMs) have proven to be highly effective for solving complex reasoning tasks. Surprisingly, their capabilities can often be improved by iterating on previously generated solutions. In this context, a reasoning plan for generating and combining a set of solutions can be thought of as an algorithm for reasoning using a probabilistic oracle. We introduce a theoretical framework for analyzing such reasoning algorithms. This framework formalizes the principles underlying popular techniques for iterative improvement and answer aggregation, providing a foundation for designing a new generation of more powerful reasoning methods. Unlike approaches for understanding models that rely on architectural specifics, our model is grounded in experimental evidence. As a result, it offers a general perspective that may extend to a wide range of current and future reasoning oracles.
