Teaching Probabilistic Logical Reasoning to Transformers
Aliakbar Nafar, Kristen Brent Venable, Parisa Kordjamshidi
TL;DR
This work addresses probabilistic logical reasoning under uncertainty in text by introducing Probabilistic Constraint Training (PCT), a training-time constraint mechanism that enforces probabilistic reasoning rules without constraining inference. It pairs PCT with RuleTaker-pro, a dataset featuring instance-specific probabilistic rules, to study generalization, transfer, and explainability in transformer models like RoBERTa Large. Empirical results show that PCT improves intrinsic reasoning, intermediate-inference consistency (constraint satisfaction), and transfer to higher depths and new domains, while large language models struggle with exact probabilistic inferences despite few-shot prompts. The findings highlight the value of integrating probabilistic constraints during training for robust, explainable reasoning in uncertain domains and point to future work in better utilizing textual rules and prompting for LLMs.
Abstract
In this paper, we evaluate the capability of transformer-based language models in making inferences over uncertain text that includes uncertain rules of reasoning. We cover both Pre-trained Language Models (PLMs) and generative Large Language Models (LLMs). Our evaluation results show that both generations of language models struggle with reasoning over uncertain text. We propose a novel end-to-end fine-tuning approach, Probabilistic Constraint Training (PCT), that utilizes probabilistic logical rules as constraints in the fine-tuning phase without relying on these rules in the inference stage. To assess the effectiveness of PCT, we utilize the related corpora and, additionally, create a new and more challenging benchmark that, unlike the previous ones, uses instance-specific rules. Our study demonstrates that PCT improves the transformer-based language model's intrinsic reasoning and makes their probabilistic logical reasoning process more explicit and explainable. Furthermore, PCT equips these models to effectively handle novel situations, including higher reasoning depth, new domains, and complex probabilistic structures.
