Convolutional Learning on Directed Acyclic Graphs
Samuel Rey, Hamed Ajorlou, Gonzalo Mateos
TL;DR
The paper tackles learning from signals on directed acyclic graphs (DAGs), where standard graph convolutions struggle due to nilpotent adjacency. It introduces the DAG Convolutional Network (DCN), built on causal graph filters with a reachability-based operator $W=(I-\mathbf{A})^{-1}$ and node-specific $\mathbf{T}_k=\mathbf{W}\mathbf{D}_k\mathbf{W}^{-1}$, enabling a principled, spectral convolution on DAGs. Each DCN layer uses $\mathbf{X}^{(\ell+1)}=\sigma(\sum_k \mathbf{T}_k \mathbf{X}^{(\ell)} \boldsymbol{\Theta}_k^{(\ell)})$, and complexity can be managed by restricting the GSO set. Empirical results on synthetic network-diffusion tasks show DCN outperforming baselines, highlighting the value of honoring DAG directionality and providing a scalable, causal-aware learning tool for DAG-supported data.
Abstract
We develop a novel convolutional architecture tailored for learning from data defined over directed acyclic graphs (DAGs). DAGs can be used to model causal relationships among variables, but their nilpotent adjacency matrices pose unique challenges towards developing DAG signal processing and machine learning tools. To address this limitation, we harness recent advances offering alternative definitions of causal shifts and convolutions for signals on DAGs. We develop a novel convolutional graph neural network that integrates learnable DAG filters to account for the partial ordering induced by the graph topology, thus providing valuable inductive bias to learn effective representations of DAG-supported data. We discuss the salient advantages and potential limitations of the proposed DAG convolutional network (DCN) and evaluate its performance on two learning tasks using synthetic data: network diffusion estimation and source identification. DCN compares favorably relative to several baselines, showcasing its promising potential.
