Neurosymbolic Methods for Rule Mining
Agnieszka Lawrynowicz, Luis Galarraga, Mehwish Alam, Berenice Jaulmes, Vaclav Zeman, Tomas Kliegr
TL;DR
The chapter surveys rule mining and rule learning for knowledge graphs, framing the problem around Horn rules and multiple evaluation metrics under open-world assumptions. It contrasts traditional ILP approaches (e.g., AMIE, AMIE3), path-based and LP-based rule mining (RuDiK, AnyBURL, RARL, LPRules) with neurosymbolic methods that fuse deep learning, embeddings, and matrix-based rule evaluation (Neural-LP, TensorLog, RLvLR, ChatRule, HtT). It highlights the strengths and trade-offs of each paradigm, including explainability, scalability, and data-completeness considerations, and discusses how large language models are being integrated to generate and reason over rule libraries. The practical significance lies in advancing scalable, interpretable, and accurate KG inference by combining symbolic rules with neural representations and language-model-driven rule generation. The coverage points to a future where neuro-symbolic rule learning enables robust, explainable KG completion and reasoning in the presence of incomplete or evolving data.
Abstract
In this chapter, we address the problem of rule mining, beginning with essential background information, including measures of rule quality. We then explore various rule mining methodologies, categorized into three groups: inductive logic programming, path sampling and generalization, and linear programming. Following this, we delve into neurosymbolic methods, covering topics such as the integration of deep learning with rules, the use of embeddings for rule learning, and the application of large language models in rule learning.
