CoT-X: An Adaptive Framework for Cross-Model Chain-of-Thought Transfer and Optimization
Ziqian Bi, Kaijie Chen, Tianyang Wang, Junfeng Hao, Benji Peng, Xinyuan Song
TL;DR
The paper tackles the high computational cost of chain-of-thought reasoning by introducing an adaptive summarization framework that transfers reasoning traces across model scales under token budgets. It combines semantic segmentation, importance propagation, and coherence reconstruction with Gaussian Process-based Bayesian optimization to balance accuracy and robustness, achieving up to 40% higher accuracy than truncation under the same token budgets and an ~84% reduction in evaluation cost. The work also uncovers a power-law relationship between average accuracy and cross-domain robustness, providing Pareto-frontier guidance for deployment. Across 7,501 medical questions and multilingual datasets, the framework demonstrates strong cross-model transferability, practical efficiency, and clear pathways for real-world medical AI applications.
Abstract
Chain-of-Thought (CoT) reasoning enhances the problem-solving ability of large language models (LLMs) but leads to substantial inference overhead, limiting deployment in resource-constrained settings. This paper investigates efficient CoT transfer across models of different scales and architectures through an adaptive reasoning summarization framework. The proposed method compresses reasoning traces via semantic segmentation with importance scoring, budget-aware dynamic compression, and coherence reconstruction, preserving critical reasoning steps while significantly reducing token usage. Experiments on 7,501 medical examination questions across 10 specialties show up to 40% higher accuracy than truncation under the same token budgets. Evaluations on 64 model pairs from eight LLMs (1.5B-32B parameters, including DeepSeek-R1 and Qwen3) confirm strong cross-model transferability. Furthermore, a Gaussian Process-based Bayesian optimization module reduces evaluation cost by 84% and reveals a power-law relationship between model size and cross-domain robustness. These results demonstrate that reasoning summarization provides a practical path toward efficient CoT transfer, enabling advanced reasoning under tight computational constraints. Code will be released upon publication.
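
To make the contrast between truncation and budget-aware summarization concrete, the following is a minimal toy sketch, not the authors' implementation. Everything in it is an assumption made for illustration: whitespace tokens stand in for a real tokenizer, a sentence split stands in for semantic segmentation, and the hypothetical `importance` function scores a segment by word overlap with the question. Coherence reconstruction and the Bayesian optimization module are not modeled.

```python
import re


def count_tokens(text: str) -> int:
    # Assumption: whitespace-separated words approximate tokens.
    return len(text.split())


def split_segments(trace: str) -> list[str]:
    # Assumption: a naive sentence split stands in for semantic segmentation.
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", trace) if s.strip()]


def importance(segment: str, question: str) -> float:
    # Hypothetical score: fraction of question words that occur in the segment.
    q_words = set(re.findall(r"\w+", question.lower()))
    s_words = set(re.findall(r"\w+", segment.lower()))
    return len(q_words & s_words) / max(len(q_words), 1)


def truncate(trace: str, budget: int) -> str:
    # Baseline: keep only the first `budget` tokens of the trace.
    return " ".join(trace.split()[:budget])


def summarize(trace: str, question: str, budget: int) -> str:
    # Greedily keep the highest-scoring segments that fit in the budget,
    # then emit them in their original order.
    segments = split_segments(trace)
    ranked = sorted(
        range(len(segments)),
        key=lambda i: importance(segments[i], question),
        reverse=True,
    )
    kept, used = [], 0
    for i in ranked:
        cost = count_tokens(segments[i])
        if used + cost <= budget:
            kept.append(i)
            used += cost
    return " ".join(segments[i] for i in sorted(kept))


question = "Which drug treats bacterial pneumonia?"
trace = (
    "Let me think about this carefully. "
    "The patient has bacterial pneumonia. "
    "Viruses are not relevant here. "
    "An antibiotic drug treats bacterial infection."
)
print(truncate(trace, 11))
print(summarize(trace, question, 11))
```

In this toy example both outputs use the same 11-token budget, but truncation spends six of those tokens on an uninformative opening sentence and never reaches the conclusion, whereas the selection step keeps the two segments that mention the question's terms.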
