Causally Consistent Normalizing Flow
Qingyang Zhou, Kangjie Lu, Meng Xu
TL;DR
The paper tackles causal inconsistency between generative models and structured causal models, which can yield unfair outcomes. It introduces Causally Consistent Normalizing Flows (CCNF), leveraging a sequential SCM representation and partial causal transformations to preserve the SCM’s causal graph while maintaining multi-layer expressiveness, with at least $d$ layers where $d$ is the longest path length. CCNF is a multi-layer universal approximator and is causally consistent with the SCM, ensuring the induced graph $G_{GM}$ matches $G_M$ and enabling interventions and counterfactuals. Empirically, CCNF outperforms prior causally consistent GMs on synthetic benchmarks and improves real-world fairness on the German credit dataset, increasing accuracy from $73.00\%$ to $75.80\%$ and reducing individual unfairness from $9.00\%$ to $0.00\%$.
Abstract
Causal inconsistency arises when the underlying causal graphs captured by generative models like \textit{Normalizing Flows} (NFs) are inconsistent with those specified in causal models like \textit{Struct Causal Models} (SCMs). This inconsistency can cause unwanted issues including the unfairness problem. Prior works to achieve causal consistency inevitably compromise the expressiveness of their models by disallowing hidden layers. In this work, we introduce a new approach: \textbf{C}ausally \textbf{C}onsistent \textbf{N}ormalizing \textbf{F}low (CCNF). To the best of our knowledge, CCNF is the first causally consistent generative model that can approximate any distribution with multiple layers. CCNF relies on two novel constructs: a sequential representation of SCMs and partial causal transformations. These constructs allow CCNF to inherently maintain causal consistency without sacrificing expressiveness. CCNF can handle all forms of causal inference tasks, including interventions and counterfactuals. Through experiments, we show that CCNF outperforms current approaches in causal inference. We also empirically validate the practical utility of CCNF by applying it to real-world datasets and show how CCNF addresses challenges like unfairness effectively.
