Heterogeneous Graph Neural Networks for Assumption-Based Argumentation
Preesha Gehlot, Anna Rapberger, Fabrizio Russo, Francesca Toni
TL;DR
This work tackles the computational intractability of credulous acceptance under stable semantics in Assumption-Based Argumentation (ABA) by introducing native heterogeneous Graph Neural Networks (GNNs) operating on a faithful dependency-graph encoding of ABA frameworks. It presents two architectures, ABAGCN and ABAGAT, which leverage relation-specific GCN and GAT layers to learn node embeddings for assumptions, rules, and claims, achieving strong accuracy on ICCMA 2023 benchmarks and synthetic ABAFs. A sound polynomial-time extension-reconstruction algorithm uses the predictor to assemble stable extensions, delivering high F1 on small instances and substantial runtime gains over exact solvers on large frameworks. The approach demonstrates that GNN-based approximate reasoning can significantly bridge the gap between scalability and accuracy in structured argumentation, with practical implications for learning ABAFs and causal/discovery tasks.
Abstract
Assumption-Based Argumentation (ABA) is a powerful structured argumentation formalism, but exact computation of extensions under stable semantics is intractable for large frameworks. We present the first Graph Neural Network (GNN) approach to approximate credulous acceptance in ABA. To leverage GNNs, we model ABA frameworks via a dependency graph representation encoding assumptions, claims and rules as nodes, with heterogeneous edge labels distinguishing support, derive and attack relations. We propose two GNN architectures - ABAGCN and ABAGAT - that stack residual heterogeneous convolution or attention layers, respectively, to learn node embeddings. Our models are trained on the ICCMA 2023 benchmark, augmented with synthetic ABAFs, with hyperparameters optimised via Bayesian search. Empirically, both ABAGCN and ABAGAT outperform a state-of-the-art GNN baseline that we adapt from the abstract argumentation literature, achieving a node-level F1 score of up to 0.71 on the ICCMA instances. Finally, we develop a sound polynomial time extension-reconstruction algorithm driven by our predictor: it reconstructs stable extensions with F1 above 0.85 on small ABAFs and maintains an F1 of about 0.58 on large frameworks. Our work opens new avenues for scalable approximate reasoning in structured argumentation.
