Focused Chain-of-Thought: Efficient LLM Reasoning via Structured Input Information
Lukas Struppek, Dominik Hintersdorf, Hannah Struppek, Daniel Neider, Kristian Kersting
TL;DR
The paper introduces Focused Chain-of-Thought (F-CoT), a training-free prompting strategy that splits information extraction from reasoning by providing a structured context (fixed XML-like blocks) before reasoning. This input-centric approach yields 2–3x reductions in generated tokens with comparable reasoning accuracy to standard zero-shot CoT on arithmetic problems, demonstrating that structured inputs can significantly improve inference efficiency. The authors validate F-CoT across multiple model sizes and datasets, explore pre-computed versus self-generated contexts, and show robustness to prompt and format variations while identifying limitations and future directions for integrating structure with broader prompting and multimodal settings. Overall, the work argues that input representation is a powerful, orthogonal lever for efficient, faithful LLM reasoning.
Abstract
Recent large language models achieve strong reasoning performance by generating detailed chain-of-thought traces, but this often leads to excessive token use and high inference latency. Existing efficiency approaches typically focus on model-centric interventions, such as reinforcement learning or supervised fine-tuning, to reduce verbosity. In contrast, we propose a training-free, input-centric approach. Inspired by cognitive psychology, we introduce Focused Chain-of-Thought (F-CoT), which separates information extraction from the reasoning process. F-CoT first organizes the essential information from a query into a concise, structured context and then guides the model to reason exclusively over this context. By preventing attention to irrelevant details, F-CoT naturally produces shorter reasoning paths. On arithmetic word problems, F-CoT reduces generated tokens by 2-3x while maintaining accuracy comparable to standard zero-shot CoT. These results highlight structured input as a simple yet effective lever for more efficient LLM reasoning.
