Reframing Tax Law Entailment as Analogical Reasoning
Xinrui Zou, Ming Zhang, Nathaniel Weir, Benjamin Van Durme, Nils Holzenberger
TL;DR
This work reframes statutory reasoning as an analogy task by constructing quadruples (S1,C1,S2,C2) from SARA, enabling scalable data generation and interpretability through cross-pair analogy labels. It evaluates multiple approaches, including GPT prompting, vector-offset based analogies with SBERT embeddings, binary classification with T5-Large, and retrieval-augmented reasoning that uses BM25 or DPR to fetch similar pairs. The findings show that the analogy task remains challenging for current models, with vector-offset and GPT-based methods achieving modest gains and retrieval-augmented approaches offering the most promising improvements (best around 57% accuracy) though not always statistically significant on small test sets. Overall, the study suggests that integrating robust analogy capabilities with principled retrieval can advance statutory reasoning, while highlighting the need for stronger analogical models and larger evaluation suites in legal NLP.
Abstract
Statutory reasoning refers to the application of legislative provisions to a series of case facts described in natural language. We re-frame statutory reasoning as an analogy task, where each instance of the analogy task involves a combination of two instances of statutory reasoning. This increases the dataset size by two orders of magnitude, and introduces an element of interpretability. We show that this task is roughly as difficult to Natural Language Processing models as the original task. Finally, we come back to statutory reasoning, solving it with a combination of a retrieval mechanism and analogy models, and showing some progress on prior comparable work.
