Topology-Aware Active Learning on Graphs
Harris Hardiman-Mostow, Jack Mauro, Adrien Weihs, Andrea L. Bertozzi
TL;DR
This work targets label-efficient learning on graphs by leveraging topology through Balanced Forman Curvature (BFC) and multiscale graph Laplacian regularization. It introduces Curvature Coreset (CC) to select a diverse initial labeled set and a data-driven stopping signal, plus a curvature-based mechanism to switch from exploration to exploitation within PWLL-$\tau$. It further proposes a localized graph rewiring strategy to incorporate multiscale information around labeled nodes, significantly improving label propagation while preserving sparsity. Across benchmarks, CC and curvature-driven PWLL-$\tau$ show strong improvements at low label rates, and localized rewiring achieves substantial accuracy gains with orders-of-magnitude speedups over full multiscale methods, highlighting practical gains for graph-based active learning.
Abstract
We propose a graph-topological approach to active learning that directly targets the core challenge of exploration versus exploitation under scarce label budgets. To guide exploration, we introduce a coreset construction algorithm based on Balanced Forman Curvature (BFC), which selects representative initial labels that reflect the graph's cluster structure. This method includes a data-driven stopping criterion that signals when the graph has been sufficiently explored. We further use BFC to dynamically trigger the shift from exploration to exploitation within active learning routines, replacing hand-tuned heuristics. To improve exploitation, we introduce a localized graph rewiring strategy that efficiently incorporates multiscale information around labeled nodes, enhancing label propagation while preserving sparsity. Experiments on benchmark classification tasks show that our methods consistently outperform existing graph-based semi-supervised baselines at low label rates.
