Counting Still Counts: Understanding Neural Complex Query Answering Through Query Relaxation
Yannick Brunink, Daniel Daza, Yunjie He, Michael Cochez
TL;DR
This work questions whether neural complex query answering (CQA) over knowledge graphs truly extends beyond symbolic reasoning by contrasting state-of-the-art neural CQA models with a training-free query relaxation baseline that counts matching paths. Across diverse datasets and query types, neural methods do not consistently surpass RELAX, and the top answers differ substantially between approaches, indicating complementary reasoning signals. An oracle-based combination reveals substantial gains, especially for longer path queries, underscoring the value of integrating relaxation-like structural reasoning with neural methods. The findings call for stronger non-neural baselines and pave the way for hybrid CQA models that merge symbolic relaxation with learned representations.
Abstract
Neural methods for Complex Query Answering (CQA) over knowledge graphs (KGs) are widely believed to learn patterns that generalize beyond explicit graph structure, allowing them to infer answers that are unreachable through symbolic query processing. In this work, we critically examine this assumption through a systematic analysis comparing neural CQA models with an alternative, training-free query relaxation strategy that retrieves possible answers by relaxing query constraints and counting resulting paths. Across multiple datasets and query structures, we find several cases where neural and relaxation-based approaches perform similarly, with no neural model consistently outperforming the latter. Moreover, a similarity analysis reveals that their retrieved answers exhibit little overlap, and that combining their outputs consistently improves performance. These results call for a re-evaluation of progress in neural query answering: despite their complexity, current models fail to subsume the reasoning patterns captured by query relaxation. Our findings highlight the importance of stronger non-neural baselines and suggest that future neural approaches could benefit from incorporating principles of query relaxation.
