Cooperative Graph Neural Networks
Ben Finkelshtein, Xingyue Huang, Michael Bronstein, İsmail İlkan Ceylan
TL;DR
Co-GNNs restructure graph neural computation as a multi-agent, dynamic message-passing process where each node chooses per-layer actions (listen, broadcast, isolate, or standard) via a dedicated action network ${\pi}$, while an environment network ${\eta}$ updates states based on these choices. This yields a learned, layer-dependent computational graph that can be directed and asynchronous, enabling flexible propagation of task-relevant information and mitigating issues like over-smoothing and long-range bottlenecks. The framework is shown to be more expressive than 1-WL in expectation due to stochastic action sampling, and theoretical results demonstrate how Co-GNNs can approximate complex functions of distant node features. Empirically, Co-GNNs improve over baselines on synthetic tasks and, notably, achieve state-of-the-art performance on heterophilic graphs, while providing insights into how actions and environment choices shape information flow and topological rewiring. Overall, Co-GNNs offer a principled approach to adaptively shape computation graphs for improved long-range reasoning and robustness across graph types, with potential extensions to directed and multi-relational graphs.
Abstract
Graph neural networks are popular architectures for graph machine learning, based on iterative computation of node representations of an input graph through a series of invariant transformations. A large class of graph neural networks follow a standard message-passing paradigm: at every layer, each node state is updated based on an aggregate of messages from its neighborhood. In this work, we propose a novel framework for training graph neural networks, where every node is viewed as a player that can choose to either 'listen', 'broadcast', 'listen and broadcast', or to 'isolate'. The standard message propagation scheme can then be viewed as a special case of this framework where every node 'listens and broadcasts' to all neighbors. Our approach offers a more flexible and dynamic message-passing paradigm, where each node can determine its own strategy based on their state, effectively exploring the graph topology while learning. We provide a theoretical analysis of the new message-passing scheme which is further supported by an extensive empirical analysis on a synthetic dataset and on real-world datasets.
