ThoughtProbe: Classifier-Guided LLM Thought Space Exploration via Probing Representations
Zijian Wang, Chang Xu
TL;DR
ThoughtProbe presents an inference-time framework that leverages discriminative signals from LLM hidden representations to guide tree-structured reasoning exploration. It probes representations to train lightweight classifiers, uses classifier-guided beam search to generate diverse reasoning paths, and applies branch-aggregation to select final answers, all without fine-tuning. Across multiple arithmetic benchmarks and LLMs, it delivers substantial performance gains over prompting, sampling, and activation-steering baselines, demonstrating the practical viability of linear representation probing for robust reasoning. The work highlights a shift toward inference-time, representation-based guidance as a scalable approach to enhance LLM reasoning in real-world deployments.
Abstract
This paper introduces ThoughtProbe, a novel inference time framework that leverages the hidden reasoning features of Large Language Models (LLMs) to improve their reasoning performance. Unlike previous works that manipulate the hidden representations to steer LLM generation, we harness them as discriminative signals to guide the tree structured response space exploration. In each node expansion, a classifier serves as a scoring and ranking mechanism that efficiently allocates computational resources by prioritizing higher score candidates for continuation. After completing the tree expansion, we collect answers from all branches to form a candidate answer pool. We then propose a branch aggregation method that marginalizes over all supporting branches by aggregating their CoT scores, thereby identifying the optimal answer from the pool. Experimental results show that our framework's comprehensive exploration not only covers valid reasoning chains but also effectively identifies them, achieving significant improvements across multiple arithmetic reasoning benchmarks.
