Improving Complex Reasoning over Knowledge Graph with Logic-Aware Curriculum Tuning
Tianle Xia, Liang Ding, Guojia Wan, Yibing Zhan, Bo Du, Dacheng Tao
TL;DR
This work presents Logic-Aware Curriculum Tuning (LACT), a fine-tuning framework that enables 7B-scale LLMs to perform complex logical reasoning over incomplete knowledge graphs. By combining Binary Tree Decomposition to convert $EFO_{1}$ queries into a logic-rich computation tree and curriculum learning to balance difficulty across query types, LACT achieves state-of-the-art results on FB15K, FB15K-237, and NELL995, with an average MRR improvement of approximately $+5.5\%$. The approach emphasizes knowledge sharing from KGs during training and demonstrates strong transferability across datasets and model scales, offering a scalable solution for KG reasoning without resorting to large closed-source models. Its practical impact lies in enabling accurate, scalable reasoning over incomplete KGs using accessible 7B models, which broadens the deployment potential of KG-aware AI systems.
Abstract
Answering complex queries over incomplete knowledge graphs (KGs) is a challenging job. Most previous works have focused on learning entity/relation embeddings and simulating first-order logic operators with various neural networks. However, they are bottlenecked by the inability to share world knowledge to improve logical reasoning, thus resulting in suboptimal performance. In this paper, we propose a complex reasoning schema over KG upon large language models (LLMs), containing a curriculum-based logical-aware instruction tuning framework, named LACT. Specifically, we augment the arbitrary first-order logical queries via binary tree decomposition, to stimulate the reasoning capability of LLMs. To address the difficulty gap among different types of complex queries, we design a simple and flexible logic-aware curriculum learning framework. Experiments across widely used datasets demonstrate that LACT has substantial improvements~(brings an average +5.5% MRR score) over advanced methods, achieving the new state-of-the-art.
