Right for the Right Reasons: Avoiding Reasoning Shortcuts via Prototypical Neurosymbolic AI
Luca Andolfi, Eleonora Giunchiglia
TL;DR
Prototypical Neurosymbolic AI tackles reasoning shortcuts in neurosymbolic learning by grounding neural predicates to class prototypes and updating embeddings with both background knowledge and proximity to labelled exemplars. The approach yields a disentangled, annotation-efficient NeSy model that reduces deterministic shortcuts and improves concept–semantic alignment, as demonstrated on rsbench tasks MNIST-EvenOdd, Kand-Logic, and BDD-OIA. Theoretical analysis quantifies shortcut possibilities and empirical results show substantial improvements under very limited supervision. This yields a practical, scalable strategy for safe and reliable neurosymbolic reasoning, especially in low-data regimes.
Abstract
Neurosymbolic AI is growing in popularity thanks to its ability to combine neural perception and symbolic reasoning in end-to-end trainable models. However, recent findings reveal these are prone to shortcut reasoning, i.e., to learning unindented concepts--or neural predicates--which exploit spurious correlations to satisfy the symbolic constraints. In this paper, we address reasoning shortcuts at their root cause and we introduce prototypical neurosymbolic architectures. These models are able to satisfy the symbolic constraints (be right) because they have learnt the correct basic concepts (for the right reasons) and not because of spurious correlations, even in extremely low data regimes. Leveraging the theory of prototypical learning, we demonstrate that we can effectively avoid reasoning shortcuts by training the models to satisfy the background knowledge while taking into account the similarity of the input with respect to the handful of labelled datapoints. We extensively validate our approach on the recently proposed rsbench benchmark suite in a variety of settings and tasks with very scarce supervision: we show significant improvements in learning the right concepts both in synthetic tasks (MNIST-EvenOdd and Kand-Logic) and real-world, high-stake ones (BDD-OIA). Our findings pave the way to prototype grounding as an effective, annotation-efficient strategy for safe and reliable neurosymbolic learning.
