Flexible Graph Similarity Computation With A Proactive Optimization Strategy
Zhouyang Liu, Ning Liu, Yixin Chen, Jiezhong He, Dongsheng Li
TL;DR
This work tackles flexible GED approximation by introducing GEN, which (1) integrates operation costs before mapping to adapt mappings to cost settings, and (2) employs a proactive guidance strategy that propagates graph-level alignment difficulties for informed one-shot matching. The approach combines a shared GNN encoder, cost-sensitive scoring via an Estimator, and difficulty propagation to produce graph-level guidance, significantly reducing GED approximation error and inference time while remaining robust to varying graph sizes and cost settings. Empirical results on real-world and synthetic data show substantial gains over state-of-the-art methods, with strong adaptability to non-uniform costs and improved scalability. The proposed method has practical implications for fast, accurate graph retrieval and reranking in systems where edit operations carry different semantic costs.
Abstract
Graph Edit Distance (GED) offers a principled and flexible measure of graph similarity, as it quantifies the minimum cost needed to transform one graph into another with customizable edit operation costs. Despite recent learning-based efforts to approximate GED via vector space representations, existing methods struggle with adapting to varying operation costs. Furthermore, they suffer from inefficient, reactive mapping refinements due to reliance on isolated node-level distance as guidance. To address these issues, we propose GEN, a novel learning-based approach for flexible GED approximation. GEN addresses the varying costs adaptation by integrating operation costs prior to match establishment, enabling mappings to dynamically adapt to cost variations. Furthermore, GEN introduces a proactive guidance optimization strategy that captures graph-level dependencies between matches, allowing informed matching decisions in a single step without costly iterative refinements. Extensive evaluations on real-world and synthetic datasets demonstrate that GEN achieves up to 37.8% reduction in GED approximation error and 72.7% reduction in inference time compared with state-of-the-art methods, while consistently maintaining robustness under diverse cost settings and graph sizes.
