Hierarchical Representation Learning in Graph Neural Networks with Node Decimation Pooling
Filippo Maria Bianchi, Daniele Grattarola, Lorenzo Livi, Cesare Alippi
TL;DR
The paper tackles efficient hierarchical representation learning in GNNs by introducing Node Decimation Pooling (NDP), a topological pooling operator that pre-computes a pyramid of coarsened graphs offline. NDP combines a MAXCUT-based spectral node decimation, Kron reduction to build coarsened graphs, and a sparsification step to control density, enabling fast, scalable pooling across multiple GNN layers. The authors provide theoretical analyses of MAXCUT approximations, spectral preservation under sparsification, and practical implementation details, showing that NDP offers competitive accuracy with significantly reduced computation and memory compared to both topological and feature-based pooling methods. Empirical results on graph classification and graph signal classification demonstrate that NDP achieves higher or comparable accuracy while training faster than alternatives, with particularly strong performance on tasks like MNIST graph signals where purely topological methods excel. These findings suggest that pre-computed, topology-focused pooling can be a robust and efficient choice for deploying GNNs in resource-constrained settings.
Abstract
In graph neural networks (GNNs), pooling operators compute local summaries of input graphs to capture their global properties, and they are fundamental for building deep GNNs that learn hierarchical representations. In this work, we propose the Node Decimation Pooling (NDP), a pooling operator for GNNs that generates coarser graphs while preserving the overall graph topology. During training, the GNN learns new node representations and fits them to a pyramid of coarsened graphs, which is computed offline in a pre-processing stage. NDP consists of three steps. First, a node decimation procedure selects the nodes belonging to one side of the partition identified by a spectral algorithm that approximates the \maxcut{} solution. Afterwards, the selected nodes are connected with Kron reduction to form the coarsened graph. Finally, since the resulting graph is very dense, we apply a sparsification procedure that prunes the adjacency matrix of the coarsened graph to reduce the computational cost in the GNN. Notably, we show that it is possible to remove many edges without significantly altering the graph structure. Experimental results show that NDP is more efficient compared to state-of-the-art graph pooling operators while reaching, at the same time, competitive performance on a significant variety of graph classification tasks.
