Learning Network Dismantling Without Handcrafted Inputs
Haozhe Tian, Pietro Ferraro, Robert Shorten, Mahdi Jalili, Homayoun Hamedmoghadam
TL;DR
This work tackles network dismantling, an NP-hard problem, by learning a purely data-driven dismantling policy that operates on adjacency information alone. The proposed model, MIND, employs an all-ones, multi-head attention encoder (MIND-AM) and a history-aware mechanism (MIND-MP) to capture structural roles without handcrafted features, coupled with an omni-node in the decoder for global state awareness. A diversified synthetic training pipeline with degree-preserving rewiring and entropy-regularized RL (SAC) enables strong generalization to large real networks, achieving state-of-the-art dismantling performance with linear-time complexity. The approach reduces reliance on hand-engineered inputs, demonstrates scalability to networks with millions of nodes, and offers potential applicability to a broad class of graph-inference problems.
Abstract
The application of message-passing Graph Neural Networks has been a breakthrough for important network science problems. However, the competitive performance often relies on using handcrafted structural features as inputs, which increases computational cost and introduces bias into the otherwise purely data-driven network representations. Here, we eliminate the need for handcrafted features by introducing an attention mechanism and utilizing message-iteration profiles, in addition to an effective algorithmic approach to generate a structurally diverse training set of small synthetic networks. Thereby, we build an expressive message-passing framework and use it to efficiently solve the NP-hard problem of Network Dismantling, virtually equivalent to vital node identification, with significant real-world applications. Trained solely on diversified synthetic networks, our proposed model -- MIND: Message Iteration Network Dismantler -- generalizes to large, unseen real networks with millions of nodes, outperforming state-of-the-art network dismantling methods. Increased efficiency and generalizability of the proposed model can be leveraged beyond dismantling in a range of complex network problems.
