Causal Deep Learning
Jeroen Berrevoets, Krzysztof Kacprzyk, Zhaozhi Qian, Mihaela van der Schaar
TL;DR
Causal Deep Learning (CDL) proposes a pragmatic framework to integrate causality with deep learning by explicitly modeling partial causal knowledge, functional forms, and temporal dynamics. It introduces a three-dimensional CDL scale—structural, parametric, and temporal—to organize assumptions and guide model pipelines, including transitions between knowledge states and the construction of cascaded models. The paper uses treatment effects, exemplified by IHDP, to illustrate how a CDL perspective separates a priori causal structure from learned posteriors and clarifies which aspects are testable. By providing a topology to map methods, pipelines, and data requirements, CDL aims to increase real-world impact in domains like healthcare, economics, environment, and education while offering practical guidelines for reporting assumptions and results.
Abstract
Causality has the potential to truly transform the way we solve a large number of real-world problems. Yet, so far, its potential largely remains to be unlocked as causality often requires crucial assumptions which cannot be tested in practice. To address this challenge, we propose a new way of thinking about causality -- we call this causal deep learning. Our causal deep learning framework spans three dimensions: (1) a structural dimension, which incorporates partial yet testable causal knowledge rather than assuming either complete or no causal knowledge among the variables of interest; (2) a parametric dimension, which encompasses parametric forms that capture the type of relationships among the variables of interest; and (3) a temporal dimension, which captures exposure times or how the variables of interest interact (possibly causally) over time. Causal deep learning enables us to make progress on a variety of real-world problems by leveraging partial causal knowledge (including independencies among variables) and quantitatively characterising causal relationships among variables of interest (possibly over time). Our framework clearly identifies which assumptions are testable and which ones are not, such that the resulting solutions can be judiciously adopted in practice. Using our formulation we can combine or chain together causal representations to solve specific problems without losing track of which assumptions are required to build these solutions, pushing real-world impact in healthcare, economics and business, environmental sciences and education, through causal deep learning.
