DrKGC: Dynamic Subgraph Retrieval-Augmented LLMs for Knowledge Graph Completion across General and Biomedical Domains
Yongkang Xiao, Sinian Zhang, Yi Dai, Huixue Zhou, Jue Hou, Jie Ding, Rui Zhang
TL;DR
DrKGC introduces a dynamic, retrieval-augmented LLM framework for knowledge graph completion that preserves graph structure through a lightweight pretraining stage, rules-based subgraph retrieval, and a GCN adapter to produce local embeddings that enrich LLM prompts. By converting KG queries into question templates, ranking candidate entities, and constructing a bottom-up subgraph guided by learned rules, DrKGC achieves state-of-the-art results on four datasets, including two biomedical KGs, while offering improved interpretability via explicit subgraph reasoning. The approach demonstrates robustness under inductive and noisy conditions and highlights the importance of combining structural signals with prompt design and LLM reasoning. Overall, DrKGC advances KGC by tightly integrating structural graph cues with generation-based models, enabling accurate, context-aware predictions in both general and biomedical domains.
Abstract
Knowledge graph completion (KGC) aims to predict missing triples in knowledge graphs (KGs) by leveraging existing triples and textual information. Recently, generative large language models (LLMs) have been increasingly employed for graph tasks. However, current approaches typically encode graph context in textual form, which fails to fully exploit the potential of LLMs for perceiving and reasoning about graph structures. To address this limitation, we propose DrKGC (Dynamic Subgraph Retrieval-Augmented LLMs for Knowledge Graph Completion). DrKGC employs a flexible lightweight model training strategy to learn structural embeddings and logical rules within the KG. It then leverages a novel bottom-up graph retrieval method to extract a subgraph for each query guided by the learned rules. Finally, a graph convolutional network (GCN) adapter uses the retrieved subgraph to enhance the structural embeddings, which are then integrated into the prompt for effective LLM fine-tuning. Experimental results on two general domain benchmark datasets and two biomedical datasets demonstrate the superior performance of DrKGC. Furthermore, a realistic case study in the biomedical domain highlights its interpretability and practical utility.
