Table of Contents
Fetching ...

Differentiable Rule Induction from Raw Sequence Inputs

Kun Gao, Katsumi Inoue, Yongzhi Cao, Hanpin Wang, Feng Yang

TL;DR

This work tackles the problem of extracting interpretable logical rules directly from raw data without explicit label leakage. It introduces NeurRL, a fully differentiable neuro-symbolic ILP framework that combines autoencoder-based representations, differentiable clustering, and a neural rule-learning module to induce rules from raw sequences and images. By formalizing ILP from raw inputs via interpretation transitions and using a program tensor $\mathbf{M}_P$ to encode rules, NeurRL achieves competitive accuracy on time-series benchmarks and provides interpretable rule bodies built from pattern-region predicates. The results demonstrate effective rule discovery with precise and recallful explanations, along with favorable training efficiency and robust ablations, suggesting practical impact for explainable AI in domains like healthcare and surveillance.

Abstract

Rule learning-based models are widely used in highly interpretable scenarios due to their transparent structures. Inductive logic programming (ILP), a form of machine learning, induces rules from facts while maintaining interpretability. Differentiable ILP models enhance this process by leveraging neural networks to improve robustness and scalability. However, most differentiable ILP methods rely on symbolic datasets, facing challenges when learning directly from raw data. Specifically, they struggle with explicit label leakage: The inability to map continuous inputs to symbolic variables without explicit supervision of input feature labels. In this work, we address this issue by integrating a self-supervised differentiable clustering model with a novel differentiable ILP model, enabling rule learning from raw data without explicit label leakage. The learned rules effectively describe raw data through its features. We demonstrate that our method intuitively and precisely learns generalized rules from time series and image data.

Differentiable Rule Induction from Raw Sequence Inputs

TL;DR

This work tackles the problem of extracting interpretable logical rules directly from raw data without explicit label leakage. It introduces NeurRL, a fully differentiable neuro-symbolic ILP framework that combines autoencoder-based representations, differentiable clustering, and a neural rule-learning module to induce rules from raw sequences and images. By formalizing ILP from raw inputs via interpretation transitions and using a program tensor to encode rules, NeurRL achieves competitive accuracy on time-series benchmarks and provides interpretable rule bodies built from pattern-region predicates. The results demonstrate effective rule discovery with precise and recallful explanations, along with favorable training efficiency and robust ablations, suggesting practical impact for explainable AI in domains like healthcare and surveillance.

Abstract

Rule learning-based models are widely used in highly interpretable scenarios due to their transparent structures. Inductive logic programming (ILP), a form of machine learning, induces rules from facts while maintaining interpretability. Differentiable ILP models enhance this process by leveraging neural networks to improve robustness and scalability. However, most differentiable ILP methods rely on symbolic datasets, facing challenges when learning directly from raw data. Specifically, they struggle with explicit label leakage: The inability to map continuous inputs to symbolic variables without explicit supervision of input feature labels. In this work, we address this issue by integrating a self-supervised differentiable clustering model with a novel differentiable ILP model, enabling rule learning from raw data without explicit label leakage. The learned rules effectively describe raw data through its features. We demonstrate that our method intuitively and precisely learns generalized rules from time series and image data.
Paper Structure (18 sections, 9 equations, 6 figures, 3 tables)

This paper contains 18 sections, 9 equations, 6 figures, 3 tables.

Figures (6)

  • Figure 1: The learning pipeline of NeurRL and the rule-learning module.
  • Figure 2: The synthetic data and the learned rules.
  • Figure 3: Selected rules from two UCR datasets.
  • Figure 4: Learned rules from MNIST datasets.
  • Figure 5: Results of ablation study. Hyperparameter values vs. accuracy.
  • ...and 1 more figures

Theorems & Definitions (1)

  • Definition 1: Learning from Binary Raw Input