One Model, Any Conjunctive Query: Graph Neural Networks for Answering Queries over Incomplete Knowledge Graphs

Krzysztof Olejniczak; Xingyue Huang; Mikhail Galkin; İsmail İlkan Ceylan

One Model, Any Conjunctive Query: Graph Neural Networks for Answering Queries over Incomplete Knowledge Graphs

Krzysztof Olejniczak, Xingyue Huang, Mikhail Galkin, İsmail İlkan Ceylan

TL;DR

This work reframes complex query answering over incomplete knowledge graphs as two binary tasks—Query Answer Classification and Query Answer Retrieval—and introduces AnyCQ, a neuro-symbolic, reinforcement-learning-guided GNN framework that scores existential Boolean conjunctive queries by searching over variable assignments. By constructing a query-conditioned computational graph with PE and LE edge labels guided by a link predictor, AnyCQ efficiently handles arbitrarily structured, cyclic queries beyond the reach of prior CQA methods. The approach delivers strong results on new high-complexity benchmarks, demonstrates transferability to unseen KGs, and shows substantial potential under a perfect predictor, underscoring practical applicability for querying incomplete data. The work also provides theoretical guarantees (completeness and, with a perfect predictor, soundness) and discusses limitations and avenues for future extension, including inductive, multi-dataset training and alternative search strategies.

Abstract

Motivated by the incompleteness of modern knowledge graphs, a new setup for query answering has emerged, where the goal is to predict answers that do not necessarily appear in the knowledge graph, but are present in its completion. In this paper, we formally introduce and study two query answering problems, namely, query answer classification and query answer retrieval. To solve these problems, we propose AnyCQ, a model that can classify answers to any conjunctive query on any knowledge graph. At the core of our framework lies a graph neural network trained using a reinforcement learning objective to answer Boolean queries. Trained only on simple, small instances, AnyCQ generalizes to large queries of arbitrary structure, reliably classifying and retrieving answers to queries that existing approaches fail to handle. This is empirically validated through our newly proposed, challenging benchmarks. Finally, we empirically show that AnyCQ can effectively transfer to completely novel knowledge graphs when equipped with an appropriate link prediction model, highlighting its potential for querying incomplete data.

One Model, Any Conjunctive Query: Graph Neural Networks for Answering Queries over Incomplete Knowledge Graphs

TL;DR

Abstract

Paper Structure (52 sections, 3 theorems, 51 equations, 2 figures, 14 tables)

This paper contains 52 sections, 3 theorems, 51 equations, 2 figures, 14 tables.

Introduction
Related work
Preliminaries
Query answering on incomplete KGs
Query Answer Classification & Query Answer Retrieval
: framework for query answering
Query representation
$\anycq$ search process
Training
Theoretical and conceptual properties
Experimental evaluation
Experimental setup
Main experiments results over QAC and QAR
Query answer classification (QAC) experiments
Query answer retrieval (QAR) experiments
...and 37 more sections

Key Result

theorem 1

Let $Q = \exists\vec{y} . \Phi(\vec{y})$ be a conjunctive Boolean query and let $\Theta$ be any $\anycq$ model equipped with a predictor $\pi$. For any execution of $\Theta$ on $Q$, running for $T$ steps:

Figures (2)

Figure 1: Examples of undirected query graphs of formulas from the FB15k-237-QAR '3-hub' split. Blue nodes represent constant terms, while grey - to the existentially quantified variables. The orange node corresponds to the free variable.
Figure 2: $\anycq$ search step time analysis for queries of different complexities: a) average step time (AST) per the number of variables $|\vec{y}|$, b) AST divided by the number of variables $|\vec{y}|$, c) AST divided by the number of literals $|Q|$, d) AST divided by $|\vec{y}| + 2|Q|$, the complexity factor indicated by the theoretical analysis.

Theorems & Definitions (6)

theorem 1
proof
proposition 1: Scores of a Perfect Link Predictor
proof
theorem 2
proof

One Model, Any Conjunctive Query: Graph Neural Networks for Answering Queries over Incomplete Knowledge Graphs

TL;DR

Abstract

One Model, Any Conjunctive Query: Graph Neural Networks for Answering Queries over Incomplete Knowledge Graphs

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (2)

Theorems & Definitions (6)