Table of Contents
Fetching ...

Honey, I shrunk the hypothesis space (through logical preprocessing)

Andrew Cropper, Filipe Gouveia, David M. Cerna

TL;DR

The paper addresses the prohibitive size of hypothesis spaces in inductive logic programming by introducing shrinker, a preprocessing tool that uses background knowledge to automatically identify and remove pointless rules (unsatisfiable, implication reducible, recall reducible, and singleton reducible). Implemented via answer set programming and integrated with a constraint-based ILP system (Popper), shrinker preserves optimal hypotheses while dramatically reducing learning times across diverse domains, often with only seconds of preprocessing. The approach is task- and system-agnostic, offering a principled, formal method to constrain hypothesis generation before learning, and it highlights both empirical gains and clear limitations (finite BK, closed-world assumption, monotonic ILP). Overall, shrinker offers a practical pathway to faster ILP with provable soundness, enabling scalable rule learning and reuse across tasks and domains.

Abstract

Inductive logic programming (ILP) is a form of logical machine learning. The goal is to search a hypothesis space for a hypothesis that generalises training examples and background knowledge. We introduce an approach that 'shrinks' the hypothesis space before an ILP system searches it. Our approach uses background knowledge to find rules that cannot be in an optimal hypothesis regardless of the training examples. For instance, our approach discovers relationships such as "even numbers cannot be odd" and "prime numbers greater than 2 are odd". It then removes violating rules from the hypothesis space. We implement our approach using answer set programming and use it to shrink the hypothesis space of a constraint-based ILP system. Our experiments on multiple domains, including visual reasoning and game playing, show that our approach can substantially reduce learning times whilst maintaining predictive accuracies. For instance, given just 10 seconds of preprocessing time, our approach can reduce learning times from over 10 hours to only 2 seconds.

Honey, I shrunk the hypothesis space (through logical preprocessing)

TL;DR

The paper addresses the prohibitive size of hypothesis spaces in inductive logic programming by introducing shrinker, a preprocessing tool that uses background knowledge to automatically identify and remove pointless rules (unsatisfiable, implication reducible, recall reducible, and singleton reducible). Implemented via answer set programming and integrated with a constraint-based ILP system (Popper), shrinker preserves optimal hypotheses while dramatically reducing learning times across diverse domains, often with only seconds of preprocessing. The approach is task- and system-agnostic, offering a principled, formal method to constrain hypothesis generation before learning, and it highlights both empirical gains and clear limitations (finite BK, closed-world assumption, monotonic ILP). Overall, shrinker offers a practical pathway to faster ILP with provable soundness, enabling scalable rule learning and reuse across tasks and domains.

Abstract

Inductive logic programming (ILP) is a form of logical machine learning. The goal is to search a hypothesis space for a hypothesis that generalises training examples and background knowledge. We introduce an approach that 'shrinks' the hypothesis space before an ILP system searches it. Our approach uses background knowledge to find rules that cannot be in an optimal hypothesis regardless of the training examples. For instance, our approach discovers relationships such as "even numbers cannot be odd" and "prime numbers greater than 2 are odd". It then removes violating rules from the hypothesis space. We implement our approach using answer set programming and use it to shrink the hypothesis space of a constraint-based ILP system. Our experiments on multiple domains, including visual reasoning and game playing, show that our approach can substantially reduce learning times whilst maintaining predictive accuracies. For instance, given just 10 seconds of preprocessing time, our approach can reduce learning times from over 10 hours to only 2 seconds.

Paper Structure

This paper contains 41 sections, 12 theorems, 7 equations, 7 figures, 3 algorithms.

Key Result

Proposition 1

Let $B$ be BK, $h$ be a hypothesis, $r \in h$, $r' \subseteq r$, and $r'$ be unsatisfiable with respect to $B$. Then $h$ is not optimal.

Figures (7)

  • Figure 1: Learning time improvements when using shrinker. The left figure shows all tasks and the right figure shows tasks where the two approaches significantly $(p < 0.05)$ differ. The tasks are ordered by the improvement.
  • Figure 2: Learning time improvements when using shrinker with only unsatisfiable rules. The left figure shows all tasks and the right figure shows tasks where the two approaches significantly $(p < 0.05)$ differ. The tasks are ordered by the improvement.
  • Figure 3: Learning time improvements when using shrinker with only implication reducible rules. The left figure shows all tasks and the right figure shows tasks where the two approaches significantly $(p < 0.05)$ differ. The tasks are ordered by the improvement.
  • Figure 4: Learning time improvements when using shrinker with only recall reducible rules. The left figure shows all tasks and the right figure shows tasks where the two approaches significantly $(p < 0.05)$ differ. The tasks are ordered by the improvement.
  • Figure 5: Learning time improvements when using shrinker with only singleton reducible rules. The left figure shows all tasks and the right figure shows tasks where the two approaches significantly $(p < 0.05)$ differ. The tasks are ordered by the improvement.
  • ...and 2 more figures

Theorems & Definitions (51)

  • Definition 1: Sub-rule
  • Definition 2: Sub-hypothesis
  • Definition 3: Basic rule
  • Definition 4: ILP input
  • Definition 5: Cost function
  • Definition 6: Optimal hypothesis
  • Definition 7: Hypothesis space reduction problem
  • Definition 8: Unsatisfiable rule
  • Proposition 1: Unsatisfiability soundness
  • proof
  • ...and 41 more