Learning Interpretable Differentiable Logic Networks

Chang Yue; Niraj K. Jha

Learning Interpretable Differentiable Logic Networks

Chang Yue, Niraj K. Jha

TL;DR

This work introduces Differentiable Logic Networks (DLNs), a class of architectures that learn interpretable Boolean rules through gradient-based optimization. By decomposing training into two phases—searching neuron functions and then learning connections—and by employing ThresholdLayer, LogicLayer, and SumLayer with differentiable relaxations and straight-through estimators, DLNs achieve competitive accuracy with far fewer gate-level operations, enabling edge deployment. Post-training simplification yields compact, human-readable logic expressions, enhancing interpretability without sacrificing performance on many tabular tasks. While DLNs excel in many settings, training cost and occasional dataset-specific limitations motivate future work on ensembles and prior-rule integration to broaden applicability.

Abstract

The ubiquity of neural networks (NNs) in real-world applications, from healthcare to natural language processing, underscores their immense utility in capturing complex relationships within high-dimensional data. However, NNs come with notable disadvantages, such as their "black-box" nature, which hampers interpretability, as well as their tendency to overfit the training data. We introduce a novel method for learning interpretable differentiable logic networks (DLNs) that are architectures that employ multiple layers of binary logic operators. We train these networks by softening and differentiating their discrete components, e.g., through binarization of inputs, binary logic operations, and connections between neurons. This approach enables the use of gradient-based learning methods. Experimental results on twenty classification tasks indicate that differentiable logic networks can achieve accuracies comparable to or exceeding that of traditional NNs. Equally importantly, these networks offer the advantage of interpretability. Moreover, their relatively simple structure results in the number of logic gate-level operations during inference being up to a thousand times smaller than NNs, making them suitable for deployment on edge devices.

Learning Interpretable Differentiable Logic Networks

TL;DR

Abstract

Paper Structure (22 sections, 11 equations, 8 figures, 6 tables, 4 algorithms)

This paper contains 22 sections, 11 equations, 8 figures, 6 tables, 4 algorithms.

Introduction
Related Work
Methodology
Logic Operations
Phase I: Determining Neuron Functions
ThresholdLayer
LogicLayer
Phase II: Determining Connections
LogicLayer
SumLayer
Searching over Subspaces
Using Straight-Through Estimators
Concatenating Inputs
Model Simplification
Experiments
...and 7 more sections

Figures (8)

Figure 1: A simplified DLN example. It takes input samples and binarizes continuous variables through a ThresholdLayer. It then passes the binary vector to layers of two-input Boolean logic operators. Finally, it counts logic rule triggers to determine the sample's class.
Figure 2: DLN training flowchart.
Figure 3: Illustration of the two-phase training algorithm. During Phase I, we discretize and fix the connections between neurons while training their parameters within ThresholdLayer and LogicLayer. In Phase II, we hold the neuron operations constant and focus on optimizing the connections between them. Key details are highlighted for clarification.
Figure 4: Example training process of a ThresholdLayer neuron.
Figure 5: Example training of a LogicLayer neuron. Note that colors are displayed on a logarithmic scale.
...and 3 more figures

Learning Interpretable Differentiable Logic Networks

TL;DR

Abstract

Learning Interpretable Differentiable Logic Networks

Authors

TL;DR

Abstract

Table of Contents

Figures (8)