Factored space models: Towards causality between levels of abstraction
Scott Garrabrant, Matthias Georg Mayer, Magdalena Wache, Leon Lang, Sam Eisenstat, Holger Dell
TL;DR
The paper introduces factored space models (FSMs) to model causality across abstraction levels, including both probabilistic and deterministic relationships, addressing a key limitation of traditional causal graphs. It defines structural independence within FSMs and proves a generalized soundness and completeness theorem, extending $d$-separation to settings with deterministic dependencies. The authors show how to construct FSMs from Bayesian networks, argue that FSMs can be more expressive than DAG-based models, and establish a formal equivalence between FSM structural independence and probabilistic independence across all product-form distributions. These results provide a robust framework for reasoning about cross-level causality and abstraction in intelligent systems, with potential implications for neural networks, representation learning, and interpretable AI.
Abstract
Causality plays an important role in understanding intelligent behavior, and there is a wealth of literature on mathematical models for causality, most of which is focused on causal graphs. Causal graphs are a powerful tool for a wide range of applications, in particular when the relevant variables are known and at the same level of abstraction. However, the given variables can also be unstructured data, like pixels of an image. Meanwhile, the causal variables, such as the positions of objects in the image, can be arbitrary deterministic functions of the given variables. Moreover, the causal variables may form a hierarchy of abstractions, in which the macro-level variables are deterministic functions of the micro-level variables. Causal graphs are limited when it comes to modeling this kind of situation. In the presence of deterministic relationships there is generally no causal graph that satisfies both the Markov condition and the faithfulness condition. We introduce factored space models as an alternative to causal graphs which naturally represent both probabilistic and deterministic relationships at all levels of abstraction. Moreover, we introduce structural independence and establish that it is equivalent to statistical independence in every distribution that factorizes over the factored space. This theorem generalizes the classical soundness and completeness theorem for d-separation.
