Align-GRAG: Reasoning-Guided Dual Alignment for Graph Retrieval-Augmented Generation

Derong Xu; Pengyue Jia; Xiaopeng Li; Yingyi Zhang; Maolin Wang; Qidong Liu; Xiangyu Zhao; Yichao Wang; Huifeng Guo; Ruiming Tang; Enhong Chen; Tong Xu

Align-GRAG: Reasoning-Guided Dual Alignment for Graph Retrieval-Augmented Generation

Derong Xu, Pengyue Jia, Xiaopeng Li, Yingyi Zhang, Maolin Wang, Qidong Liu, Xiangyu Zhao, Yichao Wang, Huifeng Guo, Ruiming Tang, Enhong Chen, Tong Xu

TL;DR

Align-GRAG addresses two core challenges in graph-based retrieval-augmented generation: noisy retrieval introducing irrelevant graph content and a representation gap between graph structures and language models. It introduces a reasoning-guided dual alignment framework with a Graph Aligner that performs node-level pruning via KL divergence and representation alignment via contrastive learning, anchored by LLM-generated summarized reasoning chains. The approach enables efficient pruning and unified graph-language representations, leading to grounded, context-aware generation. Extensive experiments on GraphQA across commonsense reasoning, scene understanding, and knowledge graph reasoning demonstrate state-of-the-art performance and robust generalization, highlighting the value of integrating reasoning-guided graph alignment into GRAG pipelines.

Abstract

Large language models (LLMs) have demonstrated remarkable capabilities, but still struggle with issues like hallucinations and outdated information. Retrieval-augmented generation (RAG) addresses these issues by grounding LLM outputs in external knowledge with an Information Retrieval (IR) system. Building on this foundation, graph-based RAG systems go a step further by retrieving subgraphs, which preserve the relationships between knowledge entities and provide more comprehensive context. However, graph RAG faces two challenges: (1) Retrieving relevant information introduces irrelevant nodes (especially in dense graph databases, where retrieval usually extends to adjacent nodes), and leads to overly lengthy inputs that hinder efficiency; (2) The representation gap between graph and language during generation with LLMs limits the ability to fully leverage graph structures for enhanced understanding. To address these limitations, we propose Align-GRAG, a novel reasoning-guided dual alignment framework in post-retrieval phrase. It first formulates a subgraph by retrieving nodes and edges. Then an Aligner is proposed to jointly optimize a graph encoder with an LLM-summarized reasoning chain. It achieves dual alignment of graph node and representation by leveraging KL divergence loss and contrastive loss, facilitating efficient pruning of irrelevant knowledge and establishing a unified semantic space. The Generator integrates the aligned graph data with LLM to produce coherent and accurate answers. Experiments on the GraphQA benchmark across three tasks (including common sense reasoning, scene graph understanding, and knowledge graph reasoning) validate the effectiveness of our method. The codes are available in this repository\footnote{https://anonymous.4open.science/r/Align-GRAG-F3D8/}.

Align-GRAG: Reasoning-Guided Dual Alignment for Graph Retrieval-Augmented Generation

TL;DR

Abstract

Align-GRAG: Reasoning-Guided Dual Alignment for Graph Retrieval-Augmented Generation

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (11)