Relation-First Modeling Paradigm for Causal Representation Learning toward the Development of AGI
Jia Li, Xiang Li
TL;DR
This work argues that traditional object-first, i.i.d.-based learning struggles to capture causal dynamics, especially under interventional questions and timing. It introduces the relation-first paradigm and formalizes dynamic causal relations using $X\xrightarrow{\theta}Y$ with timing-augmented variables $\mathcal{X}=\langle X,t\rangle$ and $\mathcal{Y}=\langle Y,\tau\rangle$, including definitions and theorems (e.g., EI, dynamic timing, and sequential causality). As a practical instantiation, the paper proposes Relation-Indexed Representation Learning (RIRL), featuring a micro-causal architecture with invertible autoencoders, stacking of relation-indexed representations, and a latent-space exploration algorithm to uncover DAG-like causal routines. Empirical demonstrations on synthetic hydrology data show that RIRL can reconstruct high-dimensional dynamics, disentangle hierarchical causal components, and discover underlying DAG structures, albeit with data requirements and multi-timeline challenges acknowledged. Collectively, the work outlines a forward-looking framework for causality in AI that emphasizes relational information, dynamic timing, and reusable latent indices, with potential implications for developing AGI systems capable of forward-looking, dynamic reasoning.
Abstract
The traditional i.i.d.-based learning paradigm faces inherent challenges in addressing causal relationships, which has become increasingly evident with the rise of applications in causal representation learning. Our understanding of causality naturally requires a perspective as the creator rather than observer, as the ``what...if'' questions only hold within the possible world we conceive. The traditional perspective limits capturing dynamic causal outcomes and leads to compensatory efforts such as the reliance on hidden confounders. This paper lays the groundwork for the new perspective, which enables the \emph{relation-first} modeling paradigm for causality. Also, it introduces the Relation-Indexed Representation Learning (RIRL) as a practical implementation, supported by experiments that validate its efficacy.
