Argumentative Large Language Models for Explainable and Contestable Claim Verification
Gabriel Freedman, Adam Dejl, Deniz Gorur, Xiang Yin, Antonio Rago, Francesca Toni
TL;DR
ArgLLMs address the lack of explainability and contestability in large language models by augmenting them with formal argumentative reasoning via quantitative bipolar argumentation frameworks (QBAFs) and the DF-QuAD semantics. The approach decomposes the task of claim verification into argument generation, intrinsic strength attribution, and dialectical strength calculation, producing a final decision and an interpretable reasoning trace. Empirical results show ArgLLMs achieve competitive accuracy with baselines while offering faithful, contestable explanations, and formal proofs establish contestability properties for the underlying semantics. This framework enables robust, explainable decision support suitable for high-stakes domains, with avenues for improvement through retrieval augmentation, ensemble methods, and deeper human evaluations.
Abstract
The profusion of knowledge encoded in large language models (LLMs) and their ability to apply this knowledge zero-shot in a range of settings makes them promising candidates for use in decision-making. However, they are currently limited by their inability to provide outputs which can be faithfully explained and effectively contested to correct mistakes. In this paper, we attempt to reconcile these strengths and weaknesses by introducing \emph{argumentative LLMs (ArgLLMs)}, a method for augmenting LLMs with argumentative reasoning. Concretely, ArgLLMs construct argumentation frameworks, which then serve as the basis for formal reasoning in support of decision-making. The interpretable nature of these argumentation frameworks and formal reasoning means that any decision made by ArgLLMs may be explained and contested. We evaluate ArgLLMs' performance experimentally in comparison with state-of-the-art techniques, in the context of the decision-making task of claim verification. We also define novel properties to characterise contestability and assess ArgLLMs formally in terms of these properties.
