Unveiling and Causalizing CoT: A Causal Pespective
Jiarun Fu, Lizhong Ding, Hao Li, Pengqi Li, Qiuning Wei, Xu Chen
TL;DR
This work reframes Chain-of-Thought reasoning in LLMs as a causal process modeled by structural causal models (SCM). It defines metrics such as CoT Average Causal Effect (CACE) and First-Step Causal Effect (FSCE) to quantify causal links between CoT steps and the final answer, and introduces CauCoT, a two-stage, role-playing causalization algorithm that iteratively enforces correct causal relations across all reasoning steps. Across open- and closed-source LLMs on the PROCESSBENCH suite, CauCoT not only improves answer accuracy on complex problems but also increases the causal goodfulness of the reasoning chain by correcting common causal errors. The approach provides a principled, interpretable pathway to make CoT reasoning both correct and understandable, with potential broad impact on the reliability and transparency of LLM-based reasoning systems.
Abstract
Although Chain-of-Thought (CoT) has achieved remarkable success in enhancing the reasoning ability of large language models (LLMs), the mechanism of CoT remains a ``black box''. Even if the correct answers can frequently be obtained, existing CoTs struggle to make the reasoning understandable to human. In this paper, we unveil and causalize CoT from a causal perspective to ensure both correctness and understandability of all reasoning steps (to the best of our knowledge, the first such). We model causality of CoT via structural causal models (SCM) to unveil the reasoning mechanism of CoT. To measure the causality of CoT, we define the CoT Average Causal Effect (CACE) to test the causal relations between steps. For those steps without causality (wrong or unintelligible steps), we design a role-playing causal query algorithm to causalize these steps, resulting a causalized CoT with all steps correct and understandable. Experimental results on both open-source and closed-source LLMs demonstrate that the causal errors commonly in steps are effectively corrected and the reasoning ability of LLMs is significantly improved.
