Diffusion-Convolutional Neural Networks
James Atwood, Don Towsley
TL;DR
This work introduces diffusion-convolutional neural networks (DCNNs), extending CNNs to graph-structured data by applying a diffusion-convolution operation that yields diffusion-based latent representations. The model uses a compact, ($O(HF)$) parameterization and is designed to be invariant to graph isomorphism, enabling efficient GPU-accelerated learning and prediction in polynomial time. Empirically, DCNNs achieve strong node classification performance on CORA and PubMed, and offer competitive results for graph classification across several datasets, though graph-level diffusion requires improved aggregation. The study also transparently discusses scalability limitations due to memory demands and locality constraints, and it situates DCNNs relative to PRMs and kernel methods, highlighting a distinct, differentiable, learnable diffusion-based approach.
Abstract
We present diffusion-convolutional neural networks (DCNNs), a new model for graph-structured data. Through the introduction of a diffusion-convolution operation, we show how diffusion-based representations can be learned from graph-structured data and used as an effective basis for node classification. DCNNs have several attractive qualities, including a latent representation for graphical data that is invariant under isomorphism, as well as polynomial-time prediction and learning that can be represented as tensor operations and efficiently implemented on the GPU. Through several experiments with real structured datasets, we demonstrate that DCNNs are able to outperform probabilistic relational models and kernel-on-graph methods at relational node classification tasks.
