Shapley Revisited: Tractable Responsibility Measures for Query Answers
Meghyn Bienvenu, Diego Figueira, Pierre Lafourcade
TL;DR
The paper challenges the dominance of the drastic Shapley value for non-numeric, monotone queries by introducing WSMS, a family of responsibility measures based on weighted sums of minimal supports. It shows WSMS are equivalent to Shapley values for suitably defined wealth functions, and proves tractability of WSMS computation for UCQs and many bounded query classes, while identifying hardness for certain CQ subclasses. The authors provide a comprehensive complexity landscape, including data- and combined-complexity results, reductions to counting homomorphisms, and tractable cases under bounded generalized hypertree width and self-join width. They also connect WSMS to related measures in inconsistency, homomorphism counting, and SHAP explainability, and discuss extensions to explainable AI and ontology-mediated query answering. Overall, WSMS offer a principled, computable, and interpretable alternative to drastic Shapley for explaining query answers, with broad implications for semantic explanations and data-provenance tools.
Abstract
The Shapley value, originating from cooperative game theory, has been employed to define responsibility measures that quantify the contributions of database facts to obtaining a given query answer. For non-numeric queries, this is done by considering a cooperative game whose players are the facts and whose wealth function assigns 1 or 0 to each subset of the database, depending on whether the query answer holds in the given subset. While conceptually simple, this approach suffers from a notable drawback: the problem of computing such Shapley values is #P-hard in data complexity, even for simple conjunctive queries. This motivates us to revisit the question of what constitutes a reasonable responsibility measure and to introduce a new family of responsibility measures -- weighted sums of minimal supports (WSMS) -- which satisfy intuitive properties. Interestingly, while the definition of WSMSs is simple and bears no obvious resemblance to the Shapley value formula, we prove that every WSMS measure can be equivalently seen as the Shapley value of a suitably defined cooperative game. Moreover, WSMS measures enjoy tractable data complexity for a large class of queries, including all unions of conjunctive queries. We further explore the combined complexity of WSMS computation and establish (in)tractability results for various subclasses of conjunctive queries.
