GED-Consistent Disentanglement of Aligned and Unaligned Substructures for Graph Similarity Learning

Zhentao Zhan; Xiaoliang Xu; Jingjing Wang; Junmei Wang

GED-Consistent Disentanglement of Aligned and Unaligned Substructures for Graph Similarity Learning

Zhentao Zhan, Xiaoliang Xu, Jingjing Wang, Junmei Wang

TL;DR

This work addresses the mismatch between node-centric GED approximations and GED's global alignment by introducing GCGSim, a GED-consistent graph similarity framework. It relies on three novel components: GNCM for pair-aware graph representations, PSGD for principled disentanglement of aligned and unaligned substructures, and IIR for canonical alignment semantics, combined with NTN-based substructure interactions and dual prediction of edit costs and overall similarity. Theoretical justification via a variational ELBO with an informed, data-dependent prior links similarity to the posterior over substructure sources, while empirical results on four benchmarks show state-of-the-art accuracy and strong disentanglement signals. The approach delivers GED-aligned similarity with efficient inference, enabling reliable graph retrieval and analysis in practice.

Abstract

Graph Similarity Computation (GSC) is a fundamental graph related task where Graph Edit Distance (GED) serves as a prevalent metric. GED is determined by an optimal alignment between a pair of graphs that partitions each into aligned (zero-cost) and unaligned (cost-incurring) substructures. Due to NP-hard nature of exact GED computation, GED approximations based on Graph Neural Network(GNN) have emerged. Existing GNN-based GED approaches typically learn node embeddings for each graph and then aggregate pairwise node similarities to estimate the final similarity. Despite their effectiveness, we identify a mismatch between this prevalent node-centric matching paradigm and the core principles of GED. This discrepancy leads to two critical limitations: (1) a failure to capture the global structural correspondence for optimal alignment, and (2) a misattribution of edit costs driven by spurious node level signals. To address these limitations, we propose GCGSim, a GED-consistent graph similarity learning framework centering on graph-level matching and substructure-level edit costs. Specifically, we make three core technical contributions. Extensive experiments on four benchmark datasets show that GCGSim achieves state-of-the-art performance. Our comprehensive analyses further validate that the framework effectively learns disentangled and semantically meaningful substructure representations.

GED-Consistent Disentanglement of Aligned and Unaligned Substructures for Graph Similarity Learning

TL;DR

Abstract

GED-Consistent Disentanglement of Aligned and Unaligned Substructures for Graph Similarity Learning

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (6)

Theorems & Definitions (9)