Graph-Memoized Reasoning: Foundations Structured Workflow Reuse in Intelligent Systems
Yash Raj Singh
TL;DR
The paper addresses the inefficiency of stateless, recomputation-heavy LLM reasoning by proposing Graph-Memoized Reasoning, a framework that stores reasoning workflows as labeled DAGs and retrieves reusable subgraphs via structural and semantic similarity. It formalizes a cost–consistency optimization, $\min_G \mathrm{Cost}(G) + \lambda \mathrm{Inconsistency}(G)$, to balance efficiency with fidelity, and outlines a conceptual evaluation protocol to assess reuse benefits. Key contributions include a formal graph-based model, a memoization operator $\mathrm{Memo}$ with a reusable policy $\pi$, and a roadmap for conceptual evaluation and safety considerations. This approach aims to enable interpretable, cost-efficient, and self-improving reasoning architectures that act as persistent memory substrates for large-scale agentic systems.
Abstract
Modern large language model-based reasoning systems frequently recompute similar reasoning steps across tasks, wasting computational resources, inflating inference latency, and limiting reproducibility. These inefficiencies underscore the need for persistent reasoning mechanisms that can recall and reuse prior computational traces. We introduce Graph-Memoized Reasoning, a formal framework for representing, storing, and reusing reasoning workflows as graph-structured memory. By encoding past decision graphs and retrieving them through structural and semantic similarity, our approach enables compositional reuse of subgraphs across new reasoning tasks. We formulate an optimization objective that minimizes total reasoning cost regularized by inconsistency between stored and generated workflows, providing a theoretical foundation for efficiency-consistency trade-offs in intelligent systems. We outline a conceptual evaluation protocol aligned with the proposed optimization objective. This framework establishes the groundwork for interpretable, cost-efficient, and self-improving reasoning architectures, offering a step toward persistent memory in large-scale agentic systems.
