LogicCBMs: Logic-Enhanced Concept-Based Learning
Deepika SN Vemuri, Gautham Bellamkonda, Aditya Pola, Vineeth N Balasubramanian
TL;DR
LogicCBMs introduce differentiable fuzzy logic gates to concept-based models, enabling expressive predicates that connect intermediate concepts to classes while preserving end-to-end learnability. The approach yields improved accuracy, better concept alignment, and more effective interventions, validated on standard CBM benchmarks and synthetic datasets like XOR, CLEVR-Logic, and SUN-scale scenes. A new worst-case interventional metric, Concept Correction Gain (CCG), assesses robustness to misleading cues, and results demonstrate that diverse logic gates enhance predictive capacity without sacrificing interpretability. The work additionally provides extensive datasets, analyses, and ablations to support the viability and scalability of logic-enhanced concept-based learning. Overall, LogicCBMs offer a principled neurosymbolic pathway for integrating logic reasoning with semantic concepts in vision tasks.
Abstract
Concept Bottleneck Models (CBMs) provide a basis for semantic abstractions within a neural network architecture. Such models have primarily been seen through the lens of interpretability so far, wherein they offer transparency by inferring predictions as a linear combination of semantic concepts. However, a linear combination is inherently limiting. So we propose the enhancement of concept-based learning models through propositional logic. We introduce a logic module that is carefully designed to connect the learned concepts from CBMs through differentiable logic operations, such that our proposed LogicCBM can go beyond simple weighted combinations of concepts to leverage various logical operations to yield the final predictions, while maintaining end-to-end learnability. Composing concepts using a set of logic operators enables the model to capture inter-concept relations, while simultaneously improving the expressivity of the model in terms of logic operations. Our empirical studies on well-known benchmarks and synthetic datasets demonstrate that these models have better accuracy, perform effective interventions and are highly interpretable.
