Neural Subgraph Matching
Rex, Ying, Zhaoyu Lou, Jiaxuan You, Chengtao Wen, Arquimedes Canedo, Jure Leskovec
TL;DR
NeuroMatch tackles subgraph isomorphism with a neural embedding approach that decomposes target graphs into k-hop neighborhoods and learns an order-embedding space reflecting subgraph containment. The method comprises an offline embedding stage and a fast online query stage using a subgraph prediction function and a voting scheme, trained with curriculum learning and a max-margin objective. It achieves substantial performance gains over both exact and neural baselines, including up to 100x speedups over exact methods and about 18% AUROC improvements on neural baselines, while generalizing to unseen query distributions and real-world datasets. The work demonstrates the practicality of neural subgraph matching for large-scale graphs and lays groundwork for further improvements in expressiveness and feature-aware transferability.
Abstract
Subgraph matching is the problem of determining the presence and location(s) of a given query graph in a large target graph. Despite being an NP-complete problem, the subgraph matching problem is crucial in domains ranging from network science and database systems to biochemistry and cognitive science. However, existing techniques based on combinatorial matching and integer programming cannot handle matching problems with both large target and query graphs. Here we propose NeuroMatch, an accurate, efficient, and robust neural approach to subgraph matching. NeuroMatch decomposes query and target graphs into small subgraphs and embeds them using graph neural networks. Trained to capture geometric constraints corresponding to subgraph relations, NeuroMatch then efficiently performs subgraph matching directly in the embedding space. Experiments demonstrate NeuroMatch is 100x faster than existing combinatorial approaches and 18% more accurate than existing approximate subgraph matching methods.
