Prototype-Based Dynamic Steering for Large Language Models
Ceyhun Efe Kayan, Li Zhang
TL;DR
Prototype-Based Dynamic Steering (PDS) introduces a lightweight, inference-time mechanism to amplify reasoning in large language models without fine-tuning or prompt redesign. By collecting activation differences between CoT and neutral prompts, clustering them into reasoning prototypes, and projecting input activations onto these prototypes to form a context-sensitive steering vector, PDS yields input-adaptive guidance injected into the residual stream. Across GSM8K, AQuA-RAT, and BIG-Bench with LLaMA-3-Instruct models, PDS improves accuracy and remains effective even under Anti-CoT prompts, indicating enhancement of latent reasoning rather than superficial behavior change. The work demonstrates that reasoning behaviors occupy a structured subspace in activation space and that dynamic, prototype-guided steering can serve as a practical, scalable alternative to training-time approaches for improving LLM reasoning.
Abstract
Despite impressive breadth, LLMs still rely on explicit reasoning instructions or static, one-fits-all steering methods, leaving a gap for adaptive, instruction-free reasoning amplification. We present Prototype-Based Dynamic Steering (PDS), a test-time method that amplifies large language model (LLM) reasoning without adding or altering instructions. We introduce "reasoning prototypes" by clustering activation differences between Chain-of-Thought (CoT) and neutral prompts. At inference, an input's hidden state is projected onto these prototypes to form an instance-specific steering vector. Evaluated on GSM8K, AQuA-RAT, and BIG-Bench tasks, PDS consistently improves accuracy without fine-tuning or prompt engineering. Notably, the gains persist even when CoT is explicitly suppressed to improve cost-efficiency, indicating that the intervention strengthens latent reasoning processes rather than inducing a superficial behavioral shift. These results position dynamic, prototype-guided steering as a lightweight alternative to training-time approaches for enhancing LLM reasoning.
