Table of Contents
Fetching ...

Explaining Arguments' Strength: Unveiling the Role of Attacks and Supports (Technical Report)

Xiang Yin, Potyka Nico, Francesca Toni

TL;DR

This work introduces Relation Attribution Explanations (RAEs) to explain the strength of topic arguments in Quantitative Bipolar AFs (QBAFs) under gradual semantics by adapting Shapley values to edge contributions. RAEs assign edge-level contributions to both attacks and supports, including indirect paths, and are equipped with a suite of properties (Shapley-based and argumentative) along with a probabilistic approximation algorithm that converges to the true values. Two case studies—Fraud Detection and Large Language Models (LLMs)—demonstrate RAEs’ practical utility, revealing nuanced, path-specific influences that go beyond traditional argument-level attributions. The approach provides a principled, interpretable framework for explaining QBAFs and suggests future work on joint Shapley analyses, edge-weighted QBAFs, and user-centered evaluation.

Abstract

Quantitatively explaining the strength of arguments under gradual semantics has recently received increasing attention. Specifically, several works in the literature provide quantitative explanations by computing the attribution scores of arguments. These works disregard the importance of attacks and supports, even though they play an essential role when explaining arguments' strength. In this paper, we propose a novel theory of Relation Attribution Explanations (RAEs), adapting Shapley values from game theory to offer fine-grained insights into the role of attacks and supports in quantitative bipolar argumentation towards obtaining the arguments' strength. We show that RAEs satisfy several desirable properties. We also propose a probabilistic algorithm to approximate RAEs efficiently. Finally, we show the application value of RAEs in fraud detection and large language models case studies.

Explaining Arguments' Strength: Unveiling the Role of Attacks and Supports (Technical Report)

TL;DR

This work introduces Relation Attribution Explanations (RAEs) to explain the strength of topic arguments in Quantitative Bipolar AFs (QBAFs) under gradual semantics by adapting Shapley values to edge contributions. RAEs assign edge-level contributions to both attacks and supports, including indirect paths, and are equipped with a suite of properties (Shapley-based and argumentative) along with a probabilistic approximation algorithm that converges to the true values. Two case studies—Fraud Detection and Large Language Models (LLMs)—demonstrate RAEs’ practical utility, revealing nuanced, path-specific influences that go beyond traditional argument-level attributions. The approach provides a principled, interpretable framework for explaining QBAFs and suggests future work on joint Shapley analyses, edge-weighted QBAFs, and user-centered evaluation.

Abstract

Quantitatively explaining the strength of arguments under gradual semantics has recently received increasing attention. Specifically, several works in the literature provide quantitative explanations by computing the attribution scores of arguments. These works disregard the importance of attacks and supports, even though they play an essential role when explaining arguments' strength. In this paper, we propose a novel theory of Relation Attribution Explanations (RAEs), adapting Shapley values from game theory to offer fine-grained insights into the role of attacks and supports in quantitative bipolar argumentation towards obtaining the arguments' strength. We show that RAEs satisfy several desirable properties. We also propose a probabilistic algorithm to approximate RAEs efficiently. Finally, we show the application value of RAEs in fraud detection and large language models case studies.
Paper Structure (35 sections, 35 theorems, 16 equations, 13 figures, 6 tables, 1 algorithm)

This paper contains 35 sections, 35 theorems, 16 equations, 13 figures, 6 tables, 1 algorithm.

Key Result

Proposition 1

DF-QuAD, QE, REB satisfy monotonicity in acyclic QBAFs.

Figures (13)

  • Figure 1: Graphical view of (elements of) a QBAF resulting from aggregating movie reviews (here, nodes are arguments, edges labelled + are supports, edges labelled - are attacks, and the $r_i$ are identifiers for the edges (for ease of reference)).
  • Figure 2: An example of QBAF (where all base scores are set to $0.5$).
  • Figure 3: Contributions, drawn from RAEs, for topic argument $\alpha$ for the QBAF in Figure \ref{['fig_counter']}. (Blue/red edges denote positive/negative contributions, respectively. The thickness of edges represents the magnitude of their contributions, i.e. their RAE value.)
  • Figure 4: Convergence of Algorithm \ref{['alg:algorithm']} for random cyclic QBAFs with 15 arguments and various numbers of edges.
  • Figure 5: Fraud Detection example from chi2021optimized.
  • ...and 8 more figures

Theorems & Definitions (68)

  • Definition 1: QBAF
  • Definition 2: Well-definedness
  • Example 1
  • Definition 3
  • Definition 4: Monotonicity
  • Proposition 1
  • Conjecture 1
  • Definition 5: RAEs
  • Definition 6: (Relation) Contribution
  • Example 2: Cont
  • ...and 58 more