Eliciting Causal Abilities in Large Language Models for Reasoning Tasks
Yajing Wang, Zongwei Luo, Jingzhe Wang, Zhanke Zhou, Yongqiang Chen, Bo Han
TL;DR
The paper addresses the challenge of improving LLM-based reasoning by eliciting causal inference through prompting. It introduces Self-Causal Instruction Enhancement (SCIE), a pipeline that generates high-quality observational data, estimates causal effects of prompting instructions using LLMs, and produces enhanced instructions, with an Object-Relational (OR) module for reusing causal templates across tasks. Empirical results across diverse reasoning tasks show SCIE improves accuracy while reducing prompt-training cost and providing interpretable, proxy-feature–driven insights; OR further demonstrates cost-effective reusability and generalization of learned prompting patterns. The work highlights the potential of causal meta-prompting to guide instruction design and paves the way for robust, interpretable prompt optimization in real-world applications.
Abstract
Prompt optimization automatically refines prompting expressions, unlocking the full potential of LLMs in downstream tasks. However, current prompt optimization methods are costly to train and lack sufficient interpretability. This paper proposes enhancing LLMs' reasoning performance by eliciting their causal inference ability from prompting instructions to correct answers. Specifically, we introduce the Self-Causal Instruction Enhancement (SCIE) method, which enables LLMs to generate high-quality, low-quantity observational data, then estimates the causal effect based on these data, and ultimately generates instructions with the optimized causal effect. In SCIE, the instructions are treated as the treatment, and textual features are used to process natural language, establishing causal relationships through treatments between instructions and downstream tasks. Additionally, we propose applying Object-Relational (OR) principles, where the uncovered causal relationships are treated as the inheritable class across task objects, ensuring low-cost reusability. Extensive experiments demonstrate that our method effectively generates instructions that enhance reasoning performance with reduced training cost of prompts, leveraging interpretable textual features to provide actionable insights.
