UniRel-R1: RL-tuned LLM Reasoning for Knowledge Graph Relational Question Answering
Yinxu Tang, Chengsong Huang, Jiaxin Huang, William Yeoh
TL;DR
This work defines relation-centric KGQA, where answers are subgraphs capturing semantic connections among seed entities rather than single entities.It introduces UniRel-R1, a unified pipeline combining subgraph selection, multi-stage pruning using hub penalties, and an RL-tuned LLM with a composite reward that prefers compact, informative subgraphs.Across seven benchmark KG datasets and multiple LLM families (Qwen and Llama), UniRel-R1 achieves large gains in connectivity and reward and demonstrates generalization to unseen entities and relations.The study also analyzes model differences, finding Qwen models rely more on semantic cues while Llama models leverage structural connectivity, and shows scalability to three- and four-entity queries.
Abstract
Knowledge Graph Question Answering (KGQA) has traditionally focused on entity-centric queries that return a single answer entity. However, real-world queries are often relational, seeking to understand how entities are associated. In this work, we introduce relation-centric KGQA, a complementary setting where the answer is a subgraph capturing the semantic connections among entities rather than an individual entity. The main challenge lies in the abundance of candidate subgraphs, where trivial or overly common connections often obscure the identification of unique and informative answers. To tackle this, we propose UniRel-R1, a unified framework that integrates subgraph selection, multi-stage graph pruning, and an LLM fine-tuned with reinforcement learning. The reward function is designed to encourage compact and specific subgraphs with more informative relations and lower-degree intermediate entities. Extensive experiments show that UniRel-R1 achieves significant gains in connectivity and reward over Vanilla baselines and generalizes effectively to unseen entities and relations.
