A Learnability Analysis on Neuro-Symbolic Learning
Hao-Yuan He, Ming Li
TL;DR
The paper develops a principled learnability framework for neuro-symbolic learning by recasting NeSy tasks as derived constraint satisfaction problems (DCSPs). It proves that a NeSy task is learnable if and only if the DCSP has a unique solution, with a concrete sample-complexity bound N ≥ (1/κ) log(|B|/ε) ensuring small concept error; when the DCSP admits multiple solutions (disagreement d > 0), the task is unlearnable and the asymptotic error scales as E* ≤ d/L. The authors further show that the NeSy risk and concept risk align under a unique DCSP solution, and that ensembles of unlearnable tasks can become learnable by enforcing mutual constraints that reduce solution-space ambiguity. Empirical validation on MNIST-like and real-world datasets demonstrates the surrogate risks (PNL/ABL) effectively minimize the NeSy risk for learnable tasks, and showcases how DCSP disagreement and ensemble configurations impact learnability. The work offers a framework to diagnose learnability and guides the design of new NeSy algorithms and task ensembles with practical impact for hybrid AI systems.
Abstract
This paper analyzes the learnability of neuro-symbolic (NeSy) tasks within hybrid systems. We show that the learnability of NeSy tasks can be characterized by their derived constraint satisfaction problems (DCSPs). Specifically, a task is learnable if the corresponding DCSP has a unique solution; otherwise, it is unlearnable. For learnable tasks, we establish error bounds by exploiting the clustering property of the hypothesis space. Additionally, we analyze the asymptotic error for general NeSy tasks, showing that the expected error scales with the disagreement among solutions. Our results offer a principled approach to determining learnability and provide insights into the design of new algorithms.
