Table of Contents
Fetching ...

Do LLMs Really Think Step-by-step In Implicit Reasoning?

Yijiong Yu

TL;DR

The paper probes whether implicit chain-of-thought (CoT) reasoning—either triggered by prompts or learned through training—truly imitates explicit CoT in multi-step arithmetic tasks. By training a linear probe on hidden-layer representations, it reveals that prompted implicit CoT rarely encodes intermediate steps, while internally trained CoT shows stepwise computation, though with limited capacity and sensitivity to problem formatting. The findings indicate implicit CoT cannot yet replace explicit CoT for reliability and robustness, particularly under format changes, reinforcing the continued practicality of explicit CoT for demanding reasoning tasks. The work highlights a latency-accuracy trade-off and underscores the need for robust, format-insensitive reasoning strategies in LLMs.

Abstract

It has been well-known that Chain-of-Thought can remarkably enhance LLMs' performance on complex tasks. However, because it also introduces slower inference speeds and higher computational costs, many researches have attempted to use implicit CoT, which does not need LLMs to explicitly generate the intermediate steps. However, the invisible reasoning process leaves us a doubt that, can implicit CoT really be equal to explicit CoT? Therefore, in this study, we address this question through experiments. We probe the information of intermediate steps from the model's hidden states when it is either trained or prompted to perform implicit CoT. The results surprisingly indicate that when prompted, LLMs hardly think about intermediate steps, suggesting they may just rely on experience rather than strict step-by-step reasoning. But when trained, they indeed calculate intermediate steps. Moreover, in both situations, we find the effect of using implicit CoT is susceptible to the format of the problem, reaffirming the current deficiency of implicit CoT.

Do LLMs Really Think Step-by-step In Implicit Reasoning?

TL;DR

The paper probes whether implicit chain-of-thought (CoT) reasoning—either triggered by prompts or learned through training—truly imitates explicit CoT in multi-step arithmetic tasks. By training a linear probe on hidden-layer representations, it reveals that prompted implicit CoT rarely encodes intermediate steps, while internally trained CoT shows stepwise computation, though with limited capacity and sensitivity to problem formatting. The findings indicate implicit CoT cannot yet replace explicit CoT for reliability and robustness, particularly under format changes, reinforcing the continued practicality of explicit CoT for demanding reasoning tasks. The work highlights a latency-accuracy trade-off and underscores the need for robust, format-insensitive reasoning strategies in LLMs.

Abstract

It has been well-known that Chain-of-Thought can remarkably enhance LLMs' performance on complex tasks. However, because it also introduces slower inference speeds and higher computational costs, many researches have attempted to use implicit CoT, which does not need LLMs to explicitly generate the intermediate steps. However, the invisible reasoning process leaves us a doubt that, can implicit CoT really be equal to explicit CoT? Therefore, in this study, we address this question through experiments. We probe the information of intermediate steps from the model's hidden states when it is either trained or prompted to perform implicit CoT. The results surprisingly indicate that when prompted, LLMs hardly think about intermediate steps, suggesting they may just rely on experience rather than strict step-by-step reasoning. But when trained, they indeed calculate intermediate steps. Moreover, in both situations, we find the effect of using implicit CoT is susceptible to the format of the problem, reaffirming the current deficiency of implicit CoT.

Paper Structure

This paper contains 10 sections, 1 equation, 3 figures, 1 table.

Figures (3)

  • Figure 1: The accuracy of probing the results of each step in multi-step arithmetic problems. Qwen2.5 represents we prompt Qwen2.5-72b-instruct to perform implicit CoT. Mistral represents the mistral-7b-v0.1 model which has been trained to internalize CoT.
  • Figure 2: The examples of explicit CoT and implicit CoT. Explicit CoT is commonly used, which is completed by step-by-step output tokens. The process of implicit CoT is just a hypothetical or conceptual situation, which could be a layer-by-layer way.
  • Figure 3: The accuracy of probing the results of each step in multi-step arithmetic problems from the hidden states of mistral-internal-CoT, when the order of the equations is reversed.