Emerging Synergies in Causality and Deep Generative Models: A Survey
Guanglin Zhou, Shaoan Xie, Guang-Yuan Hao, Shiming Chen, Biwei Huang, Xiwei Xu, Chen Wang, Liming Zhu, Lina Yao, Kun Zhang
TL;DR
The paper tackles the challenge of building more interpretable and generalizable generative systems by tying together causality and deep generative models (DGMs). It presents a two-pronged agenda: (i) integrating causal principles into DGMs (via GANs, VAEs, and diffusion) to create causal DGMs (CGMs) and to enable interventions and counterfactuals, and (ii) using DGMs to identify and reason about causal structures, including causal discovery, counterfactual inference, and representation learning. A substantial portion is devoted to how large-scale DGMs, especially generative LLMs, can participate in causal tasks, including evaluating memorization versus reasoning, prompting strategies, and data-driven pipelines that combine LLMs with traditional causal methods. The survey covers trustworthy properties (generalization, fairness, interpretability) and applications in healthcare, social science, and interdisciplinary domains, while outlining open challenges (scalability to large causal graphs, identifiability, evaluation of counterfactuals) and future directions. Overall, the work provides a comprehensive guide to a rapidly evolving area at the intersection of causality and generative modeling, with practical implications for robust and interpretable AI systems.
Abstract
In the field of artificial intelligence (AI), the quest to understand and model data-generating processes (DGPs) is of paramount importance. Deep generative models (DGMs) have proven adept in capturing complex data distributions but often fall short in generalization and interpretability. On the other hand, causality offers a structured lens to comprehend the mechanisms driving data generation and highlights the causal-effect dynamics inherent in these processes. While causality excels in interpretability and the ability to extrapolate, it grapples with intricacies of high-dimensional spaces. Recognizing the synergistic potential, we delve into the confluence of causality and DGMs. We elucidate the integration of causal principles within DGMs, investigate causal identification using DGMs, and navigate an emerging research frontier of causality in large-scale generative models, particularly generative large language models (LLMs). We offer insights into methodologies, highlight open challenges, and suggest future directions, positioning our comprehensive review as an essential guide in this swiftly emerging and evolving area.
