Self-Harmonized Chain of Thought
Ziqi Jin, Wei Lu
TL;DR
This work addresses two weaknesses of chain-of-thought (CoT) prompting: zero-shot reasoning is unstable, and few-shot CoT depends on labor-intensive, human-crafted demonstrations. It introduces ECHO, a self-harmonized CoT approach motivated by cognitive load theory, which clusters questions, samples representative demonstrations, and iteratively unifies their rationales into a coherent reasoning pattern. Across arithmetic, commonsense, and symbolic reasoning, ECHO outperforms Auto-CoT and rivals Few-shot-CoT, and ablations confirm the benefits of diversity reduction and demonstration unification. The results suggest that unifying diverse reasoning patterns yields more robust automated reasoning in large language models, at the price of higher inference cost and some domain-dependent limitations.
Abstract
Chain-of-thought (CoT) prompting has demonstrated the capacity of large language models to perform complex reasoning through intermediate steps. While effective, current CoT methods face challenges: Zero-shot-CoT can lead to reasoning errors, and Few-shot-CoT requires labor-intensive manual demonstrations. Auto-CoT attempts to address these issues by automatically generating diverse demonstrations, but this diversity can lead to inconsistent reasoning patterns. We propose ECHO (Self-Harmonized Chain of Thought), a novel method that unifies diverse solution paths into a consistent and effective reasoning pattern. ECHO employs an iterative process to refine and harmonize automatically generated demonstrations, mitigating the limitations of existing approaches. Our comprehensive experiments across arithmetic, commonsense, and symbolic reasoning tasks demonstrate that ECHO outperforms Auto-CoT by an average of 2.8%. These findings suggest that ECHO represents a significant step towards more robust and generalizable automated reasoning in large language models.
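The cluster, sample, and unify loop summarized above can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the questions are taken as already clustered, the first question of each cluster stands in for the sampled representative, and `generate` is a hypothetical callable wrapping whatever language model is used. The names `build_prompt` and `harmonize`, the prompt format, and the default of three iterations are likewise illustrative choices.

```python
import random
from typing import Callable, Dict, List

Demo = Dict[str, str]  # {"question": ..., "rationale": ...}


def build_prompt(demos: List[Demo], question: str) -> str:
    """Format in-context demonstrations followed by the target question."""
    parts = [f"Q: {d['question']}\nA: {d['rationale']}" for d in demos]
    parts.append(f"Q: {question}\nA: Let's think step by step.")
    return "\n\n".join(parts)


def harmonize(
    clusters: List[List[str]],
    generate: Callable[[str], str],  # hypothetical LLM wrapper (assumption)
    iterations: int = 3,             # illustrative default (assumption)
    seed: int = 0,
) -> List[Demo]:
    """Build one demonstration per cluster, then iteratively unify them."""
    rng = random.Random(seed)

    # Step 1: pick a representative question per cluster and generate an
    # initial rationale with no demonstrations (zero-shot).
    # Simplification: the representative is just the first question.
    demos: List[Demo] = []
    for cluster in clusters:
        question = cluster[0]
        rationale = generate(build_prompt([], question))
        demos.append({"question": question, "rationale": rationale})

    # Step 2: repeatedly regenerate each rationale while showing the model
    # all the other demonstrations, so the rationales drift toward a
    # shared reasoning pattern.
    for _ in range(iterations):
        order = list(range(len(demos)))
        rng.shuffle(order)
        for i in order:
            others = [d for j, d in enumerate(demos) if j != i]
            prompt = build_prompt(others, demos[i]["question"])
            demos[i]["rationale"] = generate(prompt)

    return demos
```

The returned demonstrations would then be prepended to each test question, for example with `build_prompt(demos, test_question)`. Regenerating every rationale once per iteration is also where the extra inference cost noted in the TL;DR would come from in a sketch like this.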
