UniOQA: A Unified Framework for Knowledge Graph Question Answering with Large Language Models
Zhuoyang Li, Liran Deng, Hui Liu, Qiaoqiao Liu, Junzhao Du
TL;DR
This work tackles KGQA over the large Chinese OwnThink knowledge graph by introducing UniOQA, a unified framework that runs two parallel workflows: a Translator that fine-tunes an LLM to generate executable Cypher queries (CQL) and a Searcher that employs Retrieval-Augmented Generation (GRAG) to retrieve direct answers. The Translator is augmented by an Entity and Relation Replacement step to align CQL with the KG, while the Searcher provides complementary answers from retrieved subgraphs; final answers are produced via a Dynamic Decision Algorithm that fuses both sources. Empirical results on SpCQL show state-of-the-art performance with ACC_LX 21.2% and ACC_EX 54.9%, outperforming all baselines, and ablation studies attribute gains to improved representation (ERR, Neft) and the effectiveness of combining translation and retrieval. The approach demonstrates the value of integrating precise CQL generation with retrieval-based supplementation, offering practical improvements for KGQA and guiding future work on unstructured data integration and efficiency optimization.
Abstract
OwnThink stands as the most extensive Chinese open-domain knowledge graph introduced in recent times. Despite prior attempts in question answering over OwnThink (OQA), existing studies have faced limitations in model representation capabilities, posing challenges in further enhancing overall accuracy in question answering. In this paper, we introduce UniOQA, a unified framework that integrates two complementary parallel workflows. Unlike conventional approaches, UniOQA harnesses large language models (LLMs) for precise question answering and incorporates a direct-answer-prediction process as a cost-effective complement. Initially, to bolster representation capacity, we fine-tune an LLM to translate questions into the Cypher query language (CQL), tackling issues associated with restricted semantic understanding and hallucinations. Subsequently, we introduce the Entity and Relation Replacement algorithm to ensure the executability of the generated CQL. Concurrently, to augment overall accuracy in question answering, we further adapt the Retrieval-Augmented Generation (RAG) process to the knowledge graph. Ultimately, we optimize answer accuracy through a dynamic decision algorithm. Experimental findings illustrate that UniOQA notably advances SpCQL Logical Accuracy to 21.2% and Execution Accuracy to 54.9%, achieving the new state-of-the-art results on this benchmark. Through ablation experiments, we delve into the superior representation capacity of UniOQA and quantify its performance breakthrough.
