SIT-Graph: State Integrated Tool Graph for Multi-Turn Agents

Sijia Li; Yuchen Huang; Zifan Liu; Zijian Li; Jingjing fu; Lei Song; Jiang Bian; Jun Zhang; Rui Wang

SIT-Graph: State Integrated Tool Graph for Multi-Turn Agents

Sijia Li, Yuchen Huang, Zifan Liu, Zijian Li, Jingjing fu, Lei Song, Jiang Bian, Jun Zhang, Rui Wang

TL;DR

The paper tackles the challenge of multi-turn tool use under partial observability by introducing SIT-Graph, a State-Integrated Tool Graph that unifies episodic-like state fragments with procedural tool dependencies. It augments a tool graph with compact edge-level state summaries and a dedicated state-summarization tool, enabling adaptive retrieval between episodic recall and procedural execution. The method is evaluated on four stateful benchmarks, showing consistent gains over memory-based and tool-graph baselines, with pronounced improvements for weaker base models and in online, training-free settings. These results suggest SIT-Graph enables more robust tool selection and effective experience transfer in open-ended, evolving task environments, approaching human-like decision-making in multi-turn interactions.

Abstract

Despite impressive advances in agent systems, multi-turn tool-use scenarios remain challenging. It is mainly because intent is clarified progressively and the environment evolves with each tool call. While reusing past experience is natural, current LLM agents either treat entire trajectories or pre-defined subtasks as indivisible units, or solely exploit tool-to-tool dependencies, hindering adaptation as states and information evolve across turns. In this paper, we propose a State Integrated Tool Graph (SIT-Graph), which enhances multi-turn tool use by exploiting partially overlapping experience. Inspired by human decision-making that integrates episodic and procedural memory, SIT-Graph captures both compact state representations (episodic-like fragments) and tool-to-tool dependencies (procedural-like routines) from historical trajectories. Specifically, we first build a tool graph from accumulated tool-use sequences, and then augment each edge with a compact state summary of the dialog and tool history that may shape the next action. At inference time, SIT-Graph enables a human-like balance between episodic recall and procedural execution: when the next decision requires recalling prior context, the agent retrieves the state summaries stored on relevant edges and uses them to guide its next action; when the step is routine, it follows high-confidence tool dependencies without explicit recall. Experiments across multiple stateful multi-turn tool-use benchmarks show that SIT-Graph consistently outperforms strong memory- and graph-based baselines, delivering more robust tool selection and more effective experience transfer.

SIT-Graph: State Integrated Tool Graph for Multi-Turn Agents

TL;DR

Abstract

SIT-Graph: State Integrated Tool Graph for Multi-Turn Agents

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (3)