Disentangled Representations for Causal Cognition
Filippo Torresan, Manuel Baltieri
TL;DR
This paper advances a unifying framework that connects causal cognition in natural agents with causal machine learning through disentangled representations. It formalizes three core dimensions—explicitness, sources, and integration of causal information—and defines weak and strong disentanglement to capture varying levels of causal structure. By articulating a mathematical base (disentanglement, structural causal models, and reinforcement learning) and a computational framework that uses trajectories and multi-source integration, the work lays groundwork for causal RL that better mirrors animal cognition and supports transfer, zero-shot learning, and interventions. The proposed synthesis aims to drive practical advances in AI and provide a principled lens for interpreting causal reasoning in natural agents, with concrete avenues for future benchmarks, algorithms, and cross-domain experiments.
Abstract
Complex adaptive agents consistently achieve their goals by solving problems that seem to require an understanding of causal information, information pertaining to the causal relationships that exist among elements of combined agent-environment systems. Causal cognition studies and describes the main characteristics of causal learning and reasoning in human and non-human animals, offering a conceptual framework to discuss cognitive performances based on the level of apparent causal understanding of a task. Despite the use of formal intervention-based models of causality, including causal Bayesian networks, psychological and behavioural research on causal cognition does not yet offer a computational account that operationalises how agents acquire a causal understanding of the world. Machine and reinforcement learning research on causality, especially involving disentanglement as a candidate process to build causal representations, represent on the one hand a concrete attempt at designing causal artificial agents that can shed light on the inner workings of natural causal cognition. In this work, we connect these two areas of research to build a unifying framework for causal cognition that will offer a computational perspective on studies of animal cognition, and provide insights in the development of new algorithms for causal reinforcement learning in AI.
