KisMATH: Do LLMs Have Knowledge of Implicit Structures in Mathematical Reasoning?

Soumadeep Saha; Akshay Chaturvedi; Saptarshi Saha; Utpal Garain; Nicholas Asher

KisMATH: Do LLMs Have Knowledge of Implicit Structures in Mathematical Reasoning?

Soumadeep Saha, Akshay Chaturvedi, Saptarshi Saha, Utpal Garain, Nicholas Asher

TL;DR

This work addresses how chain of thought traces aid mathematical reasoning by proposing a causal graph abstraction of CoT traces and releasing KisMATH, a dataset of 1671 problems paired with LLM solutions and CCGraphs. It introduces a scalable CCGraph construction algorithm, enabling graph aligned interventions that test mediation and causal structure using attention suppression and path probability analyses across 15 open-weight LLMs. The key findings show that reasoning nodes in CCGraphs mediate the final answer and that LLMs preferentially traverse CCGraph aligned reasoning paths, with two distinct behavior regimes observed across models. The results highlight the practical significance of uncovering latent, graph-like structures in LLM reasoning and point to directions for more principled evaluation and intervention of CoT in mathematical domains.

Abstract

Chain-of-thought (CoT) traces have been shown to improve performance of large language models on a plethora of reasoning tasks, yet there is no consensus on the mechanism by which this boost is achieved. To shed more light on this, we introduce Causal CoT Graphs (CCGraphs), which are directed acyclic graphs automatically extracted from reasoning traces that model fine-grained causal dependencies in language-model outputs. A collection of 1671 mathematical reasoning problems from MATH500, GSM8K, and AIME, together with their associated CCGraphs, has been compiled into our dataset -- KisMATH. Our detailed empirical analysis with 15 open-weight LLMs shows that (i) reasoning nodes in the CCGraphs are causal contributors to the final answer, which we argue is constitutive of reasoning; and (ii) LLMs emphasize the reasoning paths captured by the CCGraphs, indicating that the models internally realize structures similar to our graphs. KisMATH enables controlled, graph-aligned interventions and opens avenues for further investigation into the role of CoT in LLM reasoning.

KisMATH: Do LLMs Have Knowledge of Implicit Structures in Mathematical Reasoning?

TL;DR

Abstract

KisMATH: Do LLMs Have Knowledge of Implicit Structures in Mathematical Reasoning?

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (11)