From Identifiable Causal Representations to Controllable Counterfactual Generation: A Survey on Causal Generative Modeling

Aneesh Komanduri; Xintao Wu; Yongkai Wu; Feng Chen

From Identifiable Causal Representations to Controllable Counterfactual Generation: A Survey on Causal Generative Modeling

Aneesh Komanduri, Xintao Wu, Yongkai Wu, Feng Chen

TL;DR

This survey presents a comprehensive overview of causal generative modeling, focusing on identifiable causal representation learning (CRL) and controllable counterfactual generation (CCG). It articulates a Pearl-based causal hierarchy and reviews methodological families across observational, interventional, and counterfactual data regimes, including VAE-, flow-, and diffusion-based approaches. Key contributions include a taxonomy of CRL methods (e.g., CausalVAE, DEAR, SCM-VAE, ICM-VAE) with identifiability guarantees under auxiliary signals, multi-domain data, and temporal settings, plus a taxonomy of CCG techniques (GAN-, flow-, and diffusion-based) enabling counterfactual generation with tractable inference. The paper also surveys datasets, evaluation metrics, and real-world applications in fairness, privacy, robustness, and precision medicine, and outlines open problems and future directions for scalable, verifiable CRL/CCG with weaker supervision and stronger benchmarks.

Abstract

Deep generative models have shown tremendous capability in data density estimation and data generation from finite samples. While these models have shown impressive performance by learning correlations among features in the data, some fundamental shortcomings are their lack of explainability, tendency to induce spurious correlations, and poor out-of-distribution extrapolation. To remedy such challenges, recent work has proposed a shift toward causal generative models. Causal models offer several beneficial properties to deep generative models, such as distribution shift robustness, fairness, and interpretability. Structural causal models (SCMs) describe data-generating processes and model complex causal relationships and mechanisms among variables in a system. Thus, SCMs can naturally be combined with deep generative models. We provide a technical survey on causal generative modeling categorized into causal representation learning and controllable counterfactual generation methods. We focus on fundamental theory, methodology, drawbacks, datasets, and metrics. Then, we cover applications of causal generative models in fairness, privacy, out-of-distribution generalization, precision medicine, and biological sciences. Lastly, we discuss open problems and fruitful research directions for future work in the field.

From Identifiable Causal Representations to Controllable Counterfactual Generation: A Survey on Causal Generative Modeling

TL;DR

Abstract

Paper Structure (69 sections, 87 equations, 27 figures, 4 tables)

This paper contains 69 sections, 87 equations, 27 figures, 4 tables.

Introduction
Background
Structural Causal Model
Variational Autoencoders
Normalizing Flows
Generative Adversarial Networks
Diffusion Models
Disentangled Representation Learning
The Identifiability Problem
Towards Identifiability in Disentangled Representation Learning
Causal Representation Learning
Formulating Causal Representation Learning
Linking Causal Representations to Nonlinear Independent Component Analysis
Principles of Causal Representations
Intuition behind Causal Disentanglement
...and 54 more sections

Figures (27)

Figure 1: Taxonomy of Causal Generative Modeling
Figure 2: Types of Interventions
Figure 3: Pearl's Causal Hierarchy
Figure 4: Overview of generative models (a) Variational Autoencoder (VAE), (b) Normalizing Flow, (c) Generative Adversarial Network (GAN), and (d) Diffusion models
Figure 5: An illustration of (a) the Darmois construction and (b) rotated Gaussian MPA that are counterexamples to blind-source separation of nonlinear ICA. The Darmois CDF transform maps the data to independent intervals of a uniform distribution. The Gaussian is invariant to rotations and thus represents the same observational distribution after rotation (the left circle and right circle are equivalent distributions).
...and 22 more figures

Theorems & Definitions (16)

Definition 1: Structural Causal Model
Definition 2: Pearl's Causal Hierarchy pearls_hierarchy
Definition 3: locatello_weakly-supervised_2020
Definition 4: Identifiability
Example 1: Darmois Construction - HYVARINEN1999429
Example 2: Rotated Gaussian MPA - gresele2021independent
Definition 5: Affine equivalence
Definition 6: Permutation equivalence
Example 3: Unidentifiability in CRL Simple Linear Example
Definition 7: Interventional Discrepancy liang2023causal
...and 6 more

From Identifiable Causal Representations to Controllable Counterfactual Generation: A Survey on Causal Generative Modeling

TL;DR

Abstract

From Identifiable Causal Representations to Controllable Counterfactual Generation: A Survey on Causal Generative Modeling

Authors

TL;DR

Abstract

Table of Contents

Figures (27)

Theorems & Definitions (16)