Gotta match 'em all: Solution diversification in graph matching matched filters
Zhirui Li, Ben Johnson, Daniel L. Sussman, Carey E. Priebe, Vince Lyzinski
TL;DR
This work tackles the problem of finding multiple noisily embedded templates in a large background graph by extending the graph-matching matched-filter framework with solution diversification. It introduces a Multiple Correlated Erdős-Rényi model to capture multiple embedded templates and a node-feature similarity term through a matrix $S$, integrated into a padded GMP objective to encourage diverse recoveries. The authors prove that, under mild conditions, down-weighting strong templates enables recovery of weaker templates with high probability, and they implement scalable speedups for the optimization, including reduced complexity for the linear assignment subproblem and random restarts with masking. Empirically, the approach yields multiple recovered templates in synthetic MCER settings and demonstrates practical utility on MRI brain connectomes and a large Transactional Knowledge Base, highlighting both the benefits and trade-offs of padding choices and penalty tuning for diversification.
Abstract
We present a novel approach for finding multiple noisily embedded template graphs in a very large background graph. Our method builds upon the graph-matching-matched-filter technique proposed in Sussman et al., with the discovery of multiple diverse matchings being achieved by iteratively penalizing a suitable node-pair similarity matrix in the matched filter algorithm. In addition, we propose algorithmic speed-ups that greatly enhance the scalability of our matched-filter approach. We present theoretical justification of our methodology in the setting of correlated Erdos-Renyi graphs, showing its ability to sequentially discover multiple templates under mild model conditions. We additionally demonstrate our method's utility via extensive experiments both using simulated models and real-world dataset, include human brain connectomes and a large transactional knowledge base.
