Hierarchical Reasoning Models: Perspectives and Misconceptions
Renee Ge, Qianli Liao, Tomaso Poggio
TL;DR
The paper analyzes Hierarchical Reasoning Model (HRM) as a latent-space, recurrent approach to improve logical reasoning in transformers. It interrogates key design choices—L/H modules, one-step gradient training, and Adaptive Computation Time—via ablations on a Sudoku task. Findings show the high-level H module adds little beyond a strong L-module, HRM's training aligns with diffusion-like latent consistency, and ACT halting does not enhance inference when maximum steps are used. Collectively, these results challenge the necessity of hierarchical recurrence and encourage further exploration of latent-consistency and adaptive computation methods for reasoning tasks.
Abstract
Transformers have demonstrated remarkable performance in natural language processing and related domains, as they largely focus on sequential, autoregressive next-token prediction tasks. Yet, they struggle in logical reasoning, not necessarily because of a fundamental limitation of these models, but possibly due to the lack of exploration of more creative uses, such as latent space and recurrent reasoning. An emerging exploration in this direction is the Hierarchical Reasoning Model (Wang et. al., 2025), which introduces a novel type of recurrent reasoning in the latent space of transformers, achieving remarkable performance on a wide range of 2D reasoning tasks. Despite the promising results, this line of models is still at an early stage and calls for in-depth investigation. In this work, we review this class of models, examine key design choices, test alternative variants and clarify common misconceptions.
