Causal Fair Metric: Bridging Causality, Individual Fairness, and Adversarial Robustness
Ahmad-Reza Ehyaei, Golnoosh Farnadi, Samira Samadi
TL;DR
This work tackles the challenge of integrating causality, individual fairness, and adversarial robustness by introducing a causal fair metric defined on structural causal models. It develops a semi-latent space embedding to separate sensitive from non-sensitive factors, enabling a push-forward distance that remains zero for counterfactual twins and continuous with respect to non-sensitive perturbations. Since SCMs are often unknown, the paper proposes data-driven metric learning methods and introduces ECAPIFY for causality-aware fair adversarial learning that does not require full SCM knowledge, achieving competitive or superior results to oracle-based approaches. The findings advance counterfactual fairness while enhancing robustness, with practical implications for deploying fair, robust AI systems in real-world, causally structured data settings; the authors also outline avenues for future theoretical guarantees and broader causal ML applications.
Abstract
Despite the essential need for comprehensive considerations in responsible AI, factors like robustness, fairness, and causality are often studied in isolation. Adversarial perturbation, used to identify vulnerabilities in models, and individual fairness, aiming for equitable treatment of similar individuals, despite initial differences, both depend on metrics to generate comparable input data instances. Previous attempts to define such joint metrics often lack general assumptions about data or structural causal models and were unable to reflect counterfactual proximity. To address this, our paper introduces a causal fair metric formulated based on causal structures encompassing sensitive attributes and protected causal perturbation. To enhance the practicality of our metric, we propose metric learning as a method for metric estimation and deployment in real-world problems in the absence of structural causal models. We also demonstrate the application of our novel metric in classifiers. Empirical evaluation of real-world and synthetic datasets illustrates the effectiveness of our proposed metric in achieving an accurate classifier with fairness, resilience to adversarial perturbations, and a nuanced understanding of causal relationships.
