AlignedCoT: Prompting Large Language Models via Native-Speaking Demonstrations
Zhicheng Yang, Yinya Huang, Jing Xiong, Liang Feng, Xiaodan Liang, Yiwei Wang, Jing Tang
TL;DR
AlignedCoT addresses prompt sensitivity in LLM chain-of-thought prompting by eliciting the model’s native reasoning style through probing, refinement, and formatting, achieving high-quality zero-shot CoTs without handcrafted demonstrations. The method yields consistent reasoning improvements across multiple benchmarks and models, enhances detection of logical pitfalls, and remains compatible with retrieval-augmented generation and smaller LMs. Empirical results show notable gains over baselines like standard CoT, Auto-CoT, and Complex CoT, and GSM8K-Align enables improved RAG performance. The work suggests that prompting LLMs in their native linguistic style can unlock embedded knowledge more effectively, reducing the need for manual exemplars and enabling broader applicability.
Abstract
Large Language Models prompting, such as using in-context demonstrations, is a mainstream technique for invoking LLMs to perform high-performance and solid complex reasoning (e.g., mathematical reasoning, commonsense reasoning), and has the potential for further human-machine collaborative scientific findings. However, current LLMs are delicate and elusive in prompt words and styles. And there is an unseen gap between LLM understanding and human-written prompts. This paper introduces Alignedcot, an LLM-acquainted prompting technique that includes proficient ``native-speaking'' in in-context learning for the LLMs. Specifically, it achieves consistent and correct step-wise prompts in zero-shot scenarios by progressively probing, refining, and formatting the LLM chain of thoughts so that free from handcrafted few-shot demonstrations while maintaining the prompt quality. We conduct experiments on mathematical reasoning and commonsense reasoning. We find that LLMs with Alignedcot perform significantly superior to them with human-crafted demonstrations. We further apply Alignedcot for rewriting the GSM8K training set, resulting in a GSM8K-Align dataset. We observe its benefits for retrieval augmented generation. The code and data can be found at https://github.com/yangzhch6/AlignedCoT.
