Port-Hamiltonian Architectural Bias for Long-Range Propagation in Deep Graph Networks
Simon Heilig, Alessio Gravina, Alessandro Trenta, Claudio Gallicchio, Davide Bacciu
TL;DR
The paper addresses the challenge of long-range information diffusion in deep graph networks by introducing port-Hamiltonian Deep Graph Networks (PH-DGN), a framework that models neural information flow as port-Hamiltonian dynamics to balance conservation and dissipation. By representing node states with momentum and position and evolving them through a Hamiltonian with optional damping and external forcing, PH-DGN enables both purely conservative long-range propagation and task-driven non-conservative behavior. The authors prove energy conservation and non-vanishing gradient properties in the conservative regime, and show how dissipative components can be learned to improve performance. Empirically, PH-DGN achieves state-of-the-art results on synthetic and real-world long-range propagation tasks, including graph property prediction and the Long-Range Graph Benchmark, while maintaining competitive runtimes and offering clear interpretability from a physics perspective.
Abstract
The dynamics of information diffusion within graphs is a critical open issue that heavily influences graph representation learning, especially when considering long-range propagation. This calls for principled approaches that control and regulate the degree of propagation and dissipation of information throughout the neural flow. Motivated by this, we introduce (port-)Hamiltonian Deep Graph Networks, a novel framework that models neural information flow in graphs by building on the laws of conservation of Hamiltonian dynamical systems. We reconcile under a single theoretical and practical framework both non-dissipative long-range propagation and non-conservative behaviors, introducing tools from mechanical systems to gauge the equilibrium between the two components. Our approach can be applied to general message-passing architectures, and it provides theoretical guarantees on information conservation in time. Empirical results prove the effectiveness of our port-Hamiltonian scheme in pushing simple graph convolutional architectures to state-of-the-art performance in long-range benchmarks.
