Counterfactual Fairness Is Not Demographic Parity, and Other Observations
Ricardo Silva
TL;DR
This note clarifies that counterfactual fairness (CF) is not generally equivalent to demographic parity (DP); CF sits on Pearl’s causal ladder as an individual fairness notion, while DP is a purely probabilistic, group-level criterion. It shows that DP does not imply CF and CF does not imply DP, even under seemingly favorable conditions, using both constructive counterexamples and analysis of prior work. The author reframes CF as an information-filtering (information bottleneck) procedure that selects which information from $X$ to use for prediction, independent of the loss function or downstream constraints, and stresses that path-specific variants and identifiability issues matter. The discussion also separates common misunderstandings about notational conventions, ancestral closure, and the role CF plays in fairness, highlighting the need for careful causal modeling in fairness research with practical implications for algorithm design. Overall, the paper urges cautious interpretation of claimed equivalences and clarifies the proper scope and purpose of CF within causal fairness frameworks.
Abstract
Blanket statements of equivalence between causal concepts and purely probabilistic concepts should be approached with care. In this short note, I examine a recent claim that counterfactual fairness is equivalent to demographic parity. The claim fails to hold up upon closer examination. I will take the opportunity to address some broader misunderstandings about counterfactual fairness.
