Table of Contents
Fetching ...

Fairness under Graph Uncertainty: Achieving Interventional Fairness with Partially Known Causal Graphs over Clusters of Variables

Yoichi Chikahara

TL;DR

This work proposes a learning framework that achieves interventional fairness by leveraging a causal graph over clusters of variables, which is substantially easier to estimate than a variable-level graph.

Abstract

Algorithmic decisions about individuals require predictions that are not only accurate but also fair with respect to sensitive attributes such as gender and race. Causal notions of fairness align with legal requirements, yet many methods assume access to detailed knowledge of the underlying causal graph, which is a demanding assumption in practice. We propose a learning framework that achieves interventional fairness by leveraging a causal graph over \textit{clusters of variables}, which is substantially easier to estimate than a variable-level graph. With possible \textit{adjustment cluster sets} identified from such a cluster causal graph, our framework trains a prediction model by reducing the worst-case discrepancy between interventional distributions across these sets. To this end, we develop a computationally efficient barycenter kernel maximum mean discrepancy (MMD) that scales favorably with the number of sensitive attribute values. Extensive experiments show that our framework strikes a better balance between fairness and accuracy than existing approaches, highlighting its effectiveness under limited causal graph knowledge.

Fairness under Graph Uncertainty: Achieving Interventional Fairness with Partially Known Causal Graphs over Clusters of Variables

TL;DR

This work proposes a learning framework that achieves interventional fairness by leveraging a causal graph over clusters of variables, which is substantially easier to estimate than a variable-level graph.

Abstract

Algorithmic decisions about individuals require predictions that are not only accurate but also fair with respect to sensitive attributes such as gender and race. Causal notions of fairness align with legal requirements, yet many methods assume access to detailed knowledge of the underlying causal graph, which is a demanding assumption in practice. We propose a learning framework that achieves interventional fairness by leveraging a causal graph over \textit{clusters of variables}, which is substantially easier to estimate than a variable-level graph. With possible \textit{adjustment cluster sets} identified from such a cluster causal graph, our framework trains a prediction model by reducing the worst-case discrepancy between interventional distributions across these sets. To this end, we develop a computationally efficient barycenter kernel maximum mean discrepancy (MMD) that scales favorably with the number of sensitive attribute values. Extensive experiments show that our framework strikes a better balance between fairness and accuracy than existing approaches, highlighting its effectiveness under limited causal graph knowledge.
Paper Structure (81 sections, 5 theorems, 47 equations, 9 figures, 6 tables, 1 algorithm)

This paper contains 81 sections, 5 theorems, 47 equations, 9 figures, 6 tables, 1 algorithm.

Key Result

Theorem 3.5

A set of clusters $S \subseteq \mathrm{sib}(\textbf{A}, \mathcal{G}) \coloneqq \{\textbf{C} \in \mathcal{C} \colon \textbf{C} - \textbf{A} \hbox{in} \mathcal{G}\}$ in a cluster CPDAG $\mathcal{G}$ is a possible parent set of $\textbf{A}$ if and only if it satisfies the following two conditions:

Figures (9)

  • Figure 1: Causal graphs representing a scenario of hiring decisions: (a) DAG over $\textbf{X}$, (b) DAG over $\textbf{X}$ and $\hat{\textbf{Y}}$, (c) CPDAG over $\textbf{X}$, (d) cluster DAG over $\textbf{X}$, and (e) cluster CPDAG over $\textbf{X}$ with independence arcs and connection/separation marks.
  • Figure 2: Independence arcs for cluster triplet $\langle \textbf{X}, \textbf{Z}, \textbf{Y} \rangle$
  • Figure 3: Performance on linear datasets when varying $\lambda$ from $0$ to $200$: (a): $d = 5$, (b): $d = 10$, and (c): $d = 15$ clusters.
  • Figure 4: Probability difference between intervened values $\textbf{a}'=[0,1], [1,0], [1,1]$ and $\textbf{a}=[0,0]$ on Adult dataset
  • Figure 5: Log-scale box plot of number of adjustment sets $M$ for expected node degree $p$ on linear datasets with $d=10$ clusters. Whiskers, boxes, and orange lines denote minimum and maximum, interquartile range, and median, respectively.
  • ...and 4 more figures

Theorems & Definitions (19)

  • Definition 2.1: salimi2019interventional
  • Remark 2.2: Connection to other causality-based notions
  • Definition 3.4: anand2025causal
  • Theorem 3.5
  • Definition A.1: Independence arcs anand2025causal
  • Definition A.2: Separation marks anand2025causal
  • Definition A.3: Analogous paths anand2025causal
  • Definition A.4: Connection marks anand2025causal
  • Definition A.5: Cluster Causal DAGs anand2025causal
  • Definition A.6: Cluster CPDAGs anand2025causal
  • ...and 9 more