MDL-Pool: Adaptive Multilevel Graph Pooling Based on Minimum Description Length
Jan von Pichowski, Christopher Blöcker, Ingo Scholtes
TL;DR
MDL-Pool introduces an adaptive multilevel graph pooling operator driven by the minimum description length principle and the multilevel map equation. By jointly optimizing hierarchical clustering across levels and enabling per-graph depth selection, it overcomes fixed-depth limitations of prior pooling methods without requiring explicit regularization. Empirical results on community detection and graph classification show competitive or state-of-the-art performance, with the model automatically inferring the number of clusters and depth per graph. This approach offers a principled, parameter-free mechanism for concise graph representations and interpretable hierarchical structure.
Abstract
Graph pooling compresses graphs and summarises their topological properties and features in a vectorial representation. It is an essential part of deep graph representation learning and is indispensable in graph-level tasks like classification or regression. Current approaches pool hierarchical structures in graphs by iteratively applying shallow pooling operators up to a fixed depth. However, they disregard the interdependencies between structures at different hierarchical levels and do not adapt to datasets that contain graphs with different sizes that may require pooling with various depths. To address these issues, we propose MDL-Pool, a pooling operator based on the minimum description length (MDL) principle, whose loss formulation explicitly models the interdependencies between different hierarchical levels and facilitates a direct comparison between multiple pooling alternatives with different depths. MDP-Pool builds on the map equation, an information-theoretic objective function for community detection, which naturally implements Occam's razor and balances between model complexity and goodness-of-fit via the MDL. We demonstrate MDL-Pool's competitive performance in an empirical evaluation against various baselines across standard graph classification datasets.
