ChatRule: Mining Logical Rules with Large Language Models for Knowledge Graph Reasoning
Linhao Luo, Jiaxin Ju, Bo Xiong, Yuan-Fang Li, Gholamreza Haffari, Shirui Pan
TL;DR
ChatRule tackles scalable, interpretable knowledge graph reasoning by integrating large language models with knowledge graph structure. It introduces an LLM-based rule generator (including a BFS-rule sampler and multi-query prompting), a PCA-confidence–driven rule ranking module, and a rule-based reasoning component that performs forward chaining to complete knowledge graphs. Empirical results on four large KGs show state-of-the-art performance for KG completion and robust rule-quality evaluation, highlighting improved generalization in incomplete KGs. The approach delivers interpretable, rule-based reasoning without requiring training of additional KG models, with potential for broader applicability and further enhancements leveraging advanced structural understanding in LLMs.
Abstract
Logical rules are essential for uncovering the logical connections between relations, which could improve reasoning performance and provide interpretable results on knowledge graphs (KGs). Although there have been many efforts to mine meaningful logical rules over KGs, existing methods suffer from computationally intensive searches over the rule space and a lack of scalability for large-scale KGs. Besides, they often ignore the semantics of relations which is crucial for uncovering logical connections. Recently, large language models (LLMs) have shown impressive performance in the field of natural language processing and various applications, owing to their emergent ability and generalizability. In this paper, we propose a novel framework, ChatRule, unleashing the power of large language models for mining logical rules over knowledge graphs. Specifically, the framework is initiated with an LLM-based rule generator, leveraging both the semantic and structural information of KGs to prompt LLMs to generate logical rules. To refine the generated rules, a rule ranking module estimates the rule quality by incorporating facts from existing KGs. Last, the ranked rules can be used to conduct reasoning over KGs. ChatRule is evaluated on four large-scale KGs, w.r.t. different rule quality metrics and downstream tasks, showing the effectiveness and scalability of our method.
