Counterfactual explainability and analysis of variance
Zijun Gao, Qingyuan Zhao
TL;DR
The paper introduces counterfactual explainability, a causal attribution framework that generalizes Sobol-like global sensitivity measures to dependent explanatory variables via DAGs. It links the concept to genetic heritability, uses comonotonicity to achieve point identification, and provides an axiomatic foundation that unifies existing variable-importance notions under an explanation algebra. Identification and estimation are discussed, with a practical estimation strategy based on conditional quantile functions and Monte Carlo sampling, demonstrated on income inequality data where education and gender show substantial explainability and interact in meaningful ways across age groups. The approach offers a principled, causally informed alternative to purely associational explainability methods and highlights both its potential and limitations, including partial identification under general settings and extensions to more complex causal structures.
Abstract
Existing tools for explaining complex models and systems are associational rather than causal and do not provide mechanistic understanding. We propose a new notion called counterfactual explainability for causal attribution that is motivated by the concept of genetic heritability in twin studies. Counterfactual explainability extends methods for global sensitivity analysis (including the functional analysis of variance and Sobol's indices), which assumes independent explanatory variables, to dependent explanations by using a directed acyclic graphs to describe their causal relationship. Therefore, this explanability measure directly incorporates causal mechanisms by construction. Under a comonotonicity assumption, we discuss methods for estimating counterfactual explainability and apply them to a real dataset dataset to explain income inequality by gender, race, and educational attainment.
