Lying Graph Convolution: Learning to Lie for Node Classification Tasks
Daniele Castellana
TL;DR
This work addresses node classification across graphs with varying homophily by introducing Lying-GCN, an adaptive propagation scheme inspired by opinion dynamics where nodes may lie about their private embeddings when sharing messages. The method defines per-edge, per-channel lying weights that modulate messages, resulting in an asymmetric diffusion process whose dynamics are analyzed via a spectral view of the operator E = \tilde{L}^{sym} ⊙ (Z+I) and the resulting complex eigenvalues that induce early oscillations. Empirically, Lying-GCN improves performance on heterophilic graphs and remains competitive on homophilic graphs, with Lying-GCNII delivering robust results in deeper architectures and often matching or surpassing state-of-the-art baselines. The work highlights that adaptively modulated diffusion per channel can address heterophily without sacrificing homophily performance, offering a complementary tool alongside existing techniques like GCNII and attention-based methods.
Abstract
In the context of machine learning for graphs, many researchers have empirically observed that Deep Graph Networks (DGNs) perform favourably on node classification tasks when the graph structure is homophilic (\ie adjacent nodes are similar). In this paper, we introduce Lying-GCN, a new DGN inspired by opinion dynamics that can adaptively work in both the heterophilic and the homophilic setting. At each layer, each agent (node) shares its own opinions (node embeddings) with its neighbours. Instead of sharing its opinion directly as in GCN, we introduce a mechanism which allows agents to lie. Such a mechanism is adaptive, thus the agents learn how and when to lie according to the task that should be solved. We provide a characterisation of our proposal in terms of dynamical systems, by studying the spectral property of the coefficient matrix of the system. While the steady state of the system collapses to zero, we believe the lying mechanism is still usable to solve node classification tasks. We empirically prove our belief on both synthetic and real-world datasets, by showing that the lying mechanism allows to increase the performances in the heterophilic setting without harming the results in the homophilic one.
