Bures-Wasserstein Means of Graphs
Isabel Haasler, Pascal Frossard
TL;DR
This work introduces the Bures-Wasserstein mean of graphs by embedding each graph as a zero-mean Gaussian with covariance $L^†$ (where $L$ is the graph Laplacian) and formulating the graph mean as a BW barycenter in this embedding. The core result is that the BW mean corresponds to a Laplacian $L = S^†$ with $S$ solving $S = \sum_j \lambda_j (S^{1/2} L_j^† S^{1/2})^{1/2}$ subject to range$(S)= (\mathrm{span}\{\mathbf{1}_N\})^{\perp}$, and it can be computed via a convergent fixed-point iteration using a PD transform. The theory extends to graph filters and geodesic interpolation for two graphs, and empirical evaluations on graph fusion, k-means clustering, brain-network classification, and multi-layer learning show robust, competitive improvements over baselines. These results demonstrate the practical value of a principled OT-based graph mean that preserves both local and global structural information through smooth graph signal embeddings, and point to future work on unaligned graphs and generative modeling.
Abstract
Finding the mean of sampled data is a fundamental task in machine learning and statistics. However, in cases where the data samples are graph objects, defining a mean is an inherently difficult task. We propose a novel framework for defining a graph mean via embeddings in the space of smooth graph signal distributions, where graph similarity can be measured using the Wasserstein metric. By finding a mean in this embedding space, we can recover a mean graph that preserves structural information. We establish the existence and uniqueness of the novel graph mean, and provide an iterative algorithm for computing it. To highlight the potential of our framework as a valuable tool for practical applications in machine learning, it is evaluated on various tasks, including k-means clustering of structured aligned graphs, classification of functional brain networks, and semi-supervised node classification in multi-layer graphs. Our experimental results demonstrate that our approach achieves consistent performance, outperforms existing baseline approaches, and improves the performance of state-of-the-art methods.
