Fairness under Graph Uncertainty: Achieving Interventional Fairness with Partially Known Causal Graphs over Clusters of Variables

Yoichi Chikahara

Fairness under Graph Uncertainty: Achieving Interventional Fairness with Partially Known Causal Graphs over Clusters of Variables

Yoichi Chikahara

TL;DR

This work proposes a learning framework that achieves interventional fairness by leveraging a causal graph over clusters of variables, which is substantially easier to estimate than a variable-level graph.

Abstract

Algorithmic decisions about individuals require predictions that are not only accurate but also fair with respect to sensitive attributes such as gender and race. Causal notions of fairness align with legal requirements, yet many methods assume access to detailed knowledge of the underlying causal graph, which is a demanding assumption in practice. We propose a learning framework that achieves interventional fairness by leveraging a causal graph over \textit{clusters of variables}, which is substantially easier to estimate than a variable-level graph. With possible \textit{adjustment cluster sets} identified from such a cluster causal graph, our framework trains a prediction model by reducing the worst-case discrepancy between interventional distributions across these sets. To this end, we develop a computationally efficient barycenter kernel maximum mean discrepancy (MMD) that scales favorably with the number of sensitive attribute values. Extensive experiments show that our framework strikes a better balance between fairness and accuracy than existing approaches, highlighting its effectiveness under limited causal graph knowledge.

Fairness under Graph Uncertainty: Achieving Interventional Fairness with Partially Known Causal Graphs over Clusters of Variables

TL;DR

Abstract

Paper Structure (81 sections, 5 theorems, 47 equations, 9 figures, 6 tables, 1 algorithm)

This paper contains 81 sections, 5 theorems, 47 equations, 9 figures, 6 tables, 1 algorithm.

Introduction
Preliminaries
Problem Setup
Fairness via Lens of Causality
Cluster Causal Graphs
Cluster DAGs
Cluster CPDAGs
Proposed Method
Challenges in Cluster CPDAGs
Adjustment Set Enumeration
Assumptions and Enumeration Strategy
Parent Set Enumeration
Adjustment Cluster Completion
Graph Refinement for Unidentifiable Cases.
Penalizing Worst-Case Unfairness
...and 66 more sections

Key Result

Theorem 3.5

A set of clusters $S \subseteq \mathrm{sib}(\textbf{A}, \mathcal{G}) \coloneqq \{\textbf{C} \in \mathcal{C} \colon \textbf{C} - \textbf{A} \hbox{in} \mathcal{G}\}$ in a cluster CPDAG $\mathcal{G}$ is a possible parent set of $\textbf{A}$ if and only if it satisfies the following two conditions:

Figures (9)

Figure 1: Causal graphs representing a scenario of hiring decisions: (a) DAG over $\textbf{X}$, (b) DAG over $\textbf{X}$ and $\hat{\textbf{Y}}$, (c) CPDAG over $\textbf{X}$, (d) cluster DAG over $\textbf{X}$, and (e) cluster CPDAG over $\textbf{X}$ with independence arcs and connection/separation marks.
Figure 2: Independence arcs for cluster triplet $\langle \textbf{X}, \textbf{Z}, \textbf{Y} \rangle$
Figure 3: Performance on linear datasets when varying $\lambda$ from $0$ to $200$: (a): $d = 5$, (b): $d = 10$, and (c): $d = 15$ clusters.
Figure 4: Probability difference between intervened values $\textbf{a}'=[0,1], [1,0], [1,1]$ and $\textbf{a}=[0,0]$ on Adult dataset
Figure 5: Log-scale box plot of number of adjustment sets $M$ for expected node degree $p$ on linear datasets with $d=10$ clusters. Whiskers, boxes, and orange lines denote minimum and maximum, interquartile range, and median, respectively.
...and 4 more figures

Theorems & Definitions (19)

Definition 2.1: salimi2019interventional
Remark 2.2: Connection to other causality-based notions
Definition 3.4: anand2025causal
Theorem 3.5
Definition A.1: Independence arcs anand2025causal
Definition A.2: Separation marks anand2025causal
Definition A.3: Analogous paths anand2025causal
Definition A.4: Connection marks anand2025causal
Definition A.5: Cluster Causal DAGs anand2025causal
Definition A.6: Cluster CPDAGs anand2025causal
...and 9 more

Fairness under Graph Uncertainty: Achieving Interventional Fairness with Partially Known Causal Graphs over Clusters of Variables

TL;DR

Abstract

Fairness under Graph Uncertainty: Achieving Interventional Fairness with Partially Known Causal Graphs over Clusters of Variables

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (9)

Theorems & Definitions (19)