Synergistic Graph Fusion via Encoder Embedding
Cencheng Shen, Carey E. Priebe, Jonathan Larson, Ha Trinh
TL;DR
The paper tackles multi-graph data on a common vertex set under supervised vertex classification. It introduces graph fusion embedding, which encodes each graph via a label-informed encoder, normalizes, and concatenates per-graph embeddings to form an $n \times MK$ representation that supports downstream classifiers. Theoretical results under both DC-SBM and a general graph model show convergence to class-conditioned means with an $O(1/\sqrt{n})$ rate and establish an asymptotic condition for perfect separation, along with a synergistic effect where adding graphs cannot deteriorate—and can improve—classification performance. Empirical evidence from simulations and real data across two, three, and four graphs demonstrates strong, robust improvements and practical scalability, highlighting the method's potential for data fusion across diverse domains.
Abstract
In this paper, we introduce a method called graph fusion embedding, designed for multi-graph embedding with shared vertex sets. Under the framework of supervised learning, our method exhibits a remarkable and highly desirable synergistic effect: for sufficiently large vertex size, the accuracy of vertex classification consistently benefits from the incorporation of additional graphs. We establish the mathematical foundation for the method, including the asymptotic convergence of the embedding, a sufficient condition for asymptotic optimal classification, and the proof of the synergistic effect for vertex classification. Our comprehensive simulations and real data experiments provide compelling evidence supporting the effectiveness of our proposed method, showcasing the pronounced synergistic effect for multiple graphs from disparate sources.
