Neural Spacetimes for DAG Representation Learning
Haitz Sáez de Ocáriz Borde, Anastasis Kratsios, Marc T. Law, Xiaowen Dong, Michael Bronstein
TL;DR
Neural Spacetimes (NSTs) address DAG representation learning by embedding nodes as events in a trainable spacetime manifold, decomposing the representation into a space via a neural quasi-metric $\mathcal{D}$ and a time via a neural partial order $\mathcal{T}$, all guided by a learnable encoder $\mathcal{E}$. The framework provides a global embedding guarantee: for any finite DAG with $k$ nodes, there exists an NST with distortion $1+\mathcal{O}(\log k)$ that preserves causality, with parameter complexity $\tilde{\mathcal{O}}(D+W+k^{5/2}D^4N)$; planar posets can be embedded with $D=\mathcal{O}(\log k)$ spatial dimensions and $W=2$ time dimensions. NSTs are trained end-to-end and show lower embedding distortions than fixed-spacetime baselines on both synthetic and real-world graphs, illustrating the practical benefits of learning geometry. By jointly optimizing space and time, NSTs enable scalable, data-driven causal embeddings for directed graphs with potential downstream impact on causal inference and network analysis via richer, adaptive representations.
Abstract
We propose a class of trainable deep learning-based geometries called Neural Spacetimes (NSTs), which can universally represent nodes in weighted directed acyclic graphs (DAGs) as events in a spacetime manifold. While most works in the literature focus on undirected graph representation learning or causality embedding separately, our differentiable geometry can encode both graph edge weights in its spatial dimensions and causality in the form of edge directionality in its temporal dimensions. We use a product manifold that combines a quasi-metric (for space) and a partial order (for time). NSTs are implemented as three neural networks trained in an end-to-end manner: an embedding network, which learns to optimize the location of nodes as events in the spacetime manifold, and two other networks that optimize the space and time geometries in parallel, which we call a neural (quasi-)metric and a neural partial order, respectively. The latter two networks leverage recent ideas at the intersection of fractal geometry and deep learning to shape the geometry of the representation space in a data-driven fashion, unlike other works in the literature that use fixed spacetime manifolds such as Minkowski space or De Sitter space to embed DAGs. Our main theoretical guarantee is a universal embedding theorem, showing that any $k$-point DAG can be embedded into an NST with $1+\mathcal{O}(\log(k))$ distortion while exactly preserving its causal structure. The total number of parameters defining the NST is sub-cubic in $k$ and linear in the width of the DAG. If the DAG has a planar Hasse diagram, this is improved to $\mathcal{O}(\log(k)) + 2)$ spatial and 2 temporal dimensions. We validate our framework computationally with synthetic weighted DAGs and real-world network embeddings; in both cases, the NSTs achieve lower embedding distortions than their counterparts using fixed spacetime geometries.
