Table of Contents
Fetching ...

Graph of Verification: Structured Verification of LLM Reasoning with Directed Acyclic Graphs

Jiwei Fang, Bin Zhang, Changwei Wang, Jin Wan, Zhiwei Xu

TL;DR

This work presents Graph of Verification (GoV), a training-free framework that verifies LLM reasoning by modeling it as a directed acyclic graph and using a flexible node-block architecture to adapt verification granularity to the task. It formalizes a two-dimensional design space—Verification Granularity and Contextual Scope—to unify verification across well-structured and loosely-structured reasoning. GoV employs atomic nodes or node blocks within topologically ordered verification, guided by Premise provisions, to localize errors and improve robustness. Empirical results on Number Triangle Summation and ProcessBench show GoV outperforms holistic verification and prior decomposition-based methods, demonstrating strong applicability to both formal and natural-language reasoning. The framework advances transparent, scalable verification and opens avenues toward interactive correction workflows.

Abstract

Verifying the complex and multi-step reasoning of Large Language Models (LLMs) is a critical challenge, as holistic methods often overlook localized flaws. Step-by-step validation is a promising alternative, yet existing methods are often rigid. They struggle to adapt to diverse reasoning structures, from formal proofs to informal natural language narratives. To address this adaptability gap, we propose the Graph of Verification (GoV), a novel framework for adaptable and multi-granular verification. GoV's core innovation is its flexible "node block" architecture. This mechanism allows GoV to adaptively adjust its verification granularity--from atomic steps for formal tasks to entire paragraphs for natural language--to match the native structure of the reasoning process. This flexibility allows GoV to resolve the fundamental trade-off between verification precision and robustness. Experiments on both well-structured and loosely-structured benchmarks demonstrate GoV's versatility. The results show that GoV's adaptive approach significantly outperforms both holistic baselines and other state-of-the-art decomposition-based methods, establishing a new standard for training-free reasoning verification.

Graph of Verification: Structured Verification of LLM Reasoning with Directed Acyclic Graphs

TL;DR

This work presents Graph of Verification (GoV), a training-free framework that verifies LLM reasoning by modeling it as a directed acyclic graph and using a flexible node-block architecture to adapt verification granularity to the task. It formalizes a two-dimensional design space—Verification Granularity and Contextual Scope—to unify verification across well-structured and loosely-structured reasoning. GoV employs atomic nodes or node blocks within topologically ordered verification, guided by Premise provisions, to localize errors and improve robustness. Empirical results on Number Triangle Summation and ProcessBench show GoV outperforms holistic verification and prior decomposition-based methods, demonstrating strong applicability to both formal and natural-language reasoning. The framework advances transparent, scalable verification and opens avenues toward interactive correction workflows.

Abstract

Verifying the complex and multi-step reasoning of Large Language Models (LLMs) is a critical challenge, as holistic methods often overlook localized flaws. Step-by-step validation is a promising alternative, yet existing methods are often rigid. They struggle to adapt to diverse reasoning structures, from formal proofs to informal natural language narratives. To address this adaptability gap, we propose the Graph of Verification (GoV), a novel framework for adaptable and multi-granular verification. GoV's core innovation is its flexible "node block" architecture. This mechanism allows GoV to adaptively adjust its verification granularity--from atomic steps for formal tasks to entire paragraphs for natural language--to match the native structure of the reasoning process. This flexibility allows GoV to resolve the fundamental trade-off between verification precision and robustness. Experiments on both well-structured and loosely-structured benchmarks demonstrate GoV's versatility. The results show that GoV's adaptive approach significantly outperforms both holistic baselines and other state-of-the-art decomposition-based methods, establishing a new standard for training-free reasoning verification.

Paper Structure

This paper contains 34 sections, 2 equations, 4 figures, 5 tables, 1 algorithm.

Figures (4)

  • Figure 1: Both humans and GoV approach reasoning validation by decomposing the process into a directed acyclic graph. The granularity of these validation units is highly flexible, allowing for adaptation to different tasks.
  • Figure 2: The GoV Four-Stage Verification Pipeline. GoV operationalizes structured validation by first modeling reasoning as a (1) Directed Acyclic Graph (DAG). A (2) topological sort then enforces causal consistency, ensuring premises are verified before conclusions. Following this order, (3) sequential verification is performed by an LLM which assesses each unit based on previously validated antecedents. The process (4) terminates at the first detected error, enabling precise fault localization and guaranteeing the logical soundness of any fully verified reasoning chain.
  • Figure 3: Illustration of the numerical validation procedure for Number Triangle Summation.
  • Figure 4: The performance of GoV and holistic verification on Number Triangle Summation.