Incorporating Expert Rules into Neural Networks in the Framework of Concept-Based Learning
Andrei V. Konstantinov, Lev V. Utkin
TL;DR
This work addresses integrating expert rules into neural networks within concept-based learning (CBL). By representing feasible concept distributions as either a convex polytope (with vertices or faces) or a set of linear inequalities derived from the rules, the authors design neural heads that guarantee rule-consistent concept probabilities. They propose four neural-head strategies—Base, Admissible State, Vertex-based, and Constraints heads—along with state-space reduction to scale to larger rule sets. Numerical experiments on synthetic data and a modified MNIST task demonstrate improved performance under partial labeling, validating the approach as a practical bridge between inductive and deductive learning with interpretable predictions.
Abstract
A problem of incorporating the expert rules into machine learning models for extending the concept-based learning is formulated in the paper. It is proposed how to combine logical rules and neural networks predicting the concept probabilities. The first idea behind the combination is to form constraints for a joint probability distribution over all combinations of concept values to satisfy the expert rules. The second idea is to represent a feasible set of probability distributions in the form of a convex polytope and to use its vertices or faces. We provide several approaches for solving the stated problem and for training neural networks which guarantee that the output probabilities of concepts would not violate the expert rules. The solution of the problem can be viewed as a way for combining the inductive and deductive learning. Expert rules are used in a broader sense when any logical function that connects concepts and class labels or just concepts with each other can be regarded as a rule. This feature significantly expands the class of the proposed results. Numerical examples illustrate the approaches. The code of proposed algorithms is publicly available.
