GraphScout: Empowering Large Language Models with Intrinsic Exploration Ability for Agentic Graph Reasoning

Yuchen Ying; Weiqi Jiang; Tongya Zheng; Yu Wang; Shunyu Liu; Kaixuan Chen; Mingli Song

TL;DR

GraphScout lets models autonomously interact with knowledge graphs to synthesize structured training data, which is then used to post-train LLMs. This internalizes agentic graph reasoning ability without laborious manual annotation or task curation.

Abstract

Knowledge graphs provide structured and reliable information for many real-world applications, motivating increasing interest in combining large language models (LLMs) with graph-based retrieval to improve factual grounding. Recent Graph-based Retrieval-Augmented Generation (GraphRAG) methods therefore introduce iterative interaction between LLMs and knowledge graphs to enhance reasoning capability. However, existing approaches typically depend on manually designed guidance and interact with knowledge graphs through a limited set of predefined tools, which substantially constrains graph exploration. To address these limitations, we propose GraphScout, a training-centric agentic graph reasoning framework equipped with more flexible graph exploration tools. GraphScout enables models to autonomously interact with knowledge graphs to synthesize structured training data, which is then used to post-train LLMs, thereby internalizing agentic graph reasoning ability without laborious manual annotation or task curation. Extensive experiments across five knowledge-graph domains show that a small model (e.g., Qwen3-4B) augmented with GraphScout outperforms baseline methods built on leading LLMs (e.g., Qwen-Max) by an average of 16.7% while requiring significantly fewer inference tokens. Moreover, GraphScout exhibits robust cross-domain transfer performance. Our code will be made publicly available at https://github.com/Ying-Yuchen/_GraphScout_.
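To make the idea of synthesizing training data by exploring a graph concrete, here is a minimal sketch under stated assumptions. It is not the authors' implementation: the toy graph, the function names (`neighbors`, `explore`, `synthesize_example`), and the record format are all hypothetical, and a plain random walk stands in for the model-driven exploration that the paper performs with its own tools.

```python
import random

# Hypothetical toy knowledge graph: (head, relation) -> list of tail entities.
# Entities and relations are invented for illustration only.
TOY_KG = {
    ("aspirin", "treats"): ["headache"],
    ("headache", "symptom_of"): ["migraine"],
    ("migraine", "studied_in"): ["neurology"],
}


def neighbors(node):
    """Return all (relation, tail) pairs reachable from `node` in one hop."""
    return [
        (rel, tail)
        for (head, rel), tails in TOY_KG.items()
        if head == node
        for tail in tails
    ]


def explore(seed, max_hops, rng):
    """Walk up to `max_hops` edges from `seed`; return the visited triples.

    Assumption: a random walk replaces the LLM's exploration policy.
    """
    path, node = [], seed
    for _ in range(max_hops):
        options = neighbors(node)
        if not options:
            break  # dead end: no outgoing edges
        rel, tail = rng.choice(options)
        path.append((node, rel, tail))
        node = tail
    return path


def synthesize_example(path):
    """Turn an explored path into a question/answer/evidence record."""
    seed = path[0][0]
    relations = " -> ".join(rel for _, rel, _ in path)
    return {
        "question": f"Starting from '{seed}', which entity is reached via {relations}?",
        "answer": path[-1][2],
        "evidence": path,
    }


record = synthesize_example(explore("aspirin", max_hops=2, rng=random.Random(0)))
print(record["question"])
print(record["answer"])
```

Records of this kind (question, answer, supporting path) are one plausible shape for the structured training data the abstract mentions; the actual format used for post-training is not specified in this summary.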

Paper Structure (39 sections, 10 equations, 8 figures, 5 tables)

Introduction
Related Work
LLM for Graph Reasoning
Augmenting LLMs with Knowledge Graph
GraphScout
Preliminary
Agentic Graph Exploration Tools
Code Interpreter
Node Retriever
Graph Quizzer
Task specification
Exploration initialization
Exploration process
Question reporting
Graph Solver
...and 24 more sections

Figures (8)

Figure 1: Qwen3-4B-Instruct with GraphScout achieves substantial gains over leading LLMs (e.g., Qwen-Max) with prompting-based GraphRAG baselines on the Healthcare dataset.
Figure 2: Overview of GraphScout framework.
Figure 3: Cross-Domain Generalization Performance Across Training and Test Domains.
Figure 4: Performance Comparison Across Question Difficulty Levels on GRBENCH using F1-score as the metric. The Healthcare domain contains no hard questions. For the Literature domain, all methods achieve an F1-score of 0% on hard questions.
Figure 5: Behavioral analysis across question difficulty levels.
...and 3 more figures
