Revisiting FastMap: New Applications
Ang Li
TL;DR
The dissertation presents a generalization of FastMap to embed graph vertices and complex objects into Euclidean space in near-linear time, enabling efficient geometric solutions to large-scale graph problems. It introduces a FastMap+LSH framework that solves facility location, top-K centrality, community detection, and graph-convex-hull tasks by solving Euclidean-space relaxations and mapping results back to graphs. A novel FastMapSVM framework combines FastMap embeddings with SVMs, achieving interpretable, data-efficient classification for seismograms and CSP satisfiability, with strong empirical performance against neural baselines. The work also introduces PASPD-based distances for block modeling, iterative FM-based graph convex hulls, and demonstrates substantial speedups and competitive quality across AI, ML, and computational-geometry domains. Together, these contributions establish FastMap as a versatile tool for scalable graph reasoning and learning on complex objects, with broad practical impact and avenues for future enhancements.
Abstract
FastMap was first introduced in the Data Mining community for generating Euclidean embeddings of complex objects. In this dissertation, we first present FastMap to generate Euclidean embeddings of graphs in near-linear time: The pairwise Euclidean distances approximate a desired graph-based distance function on the vertices. We then apply the graph version of FastMap to efficiently solve various graph-theoretic problems of significant interest in AI: including facility location, top-K centrality computations, community detection and block modeling, and graph convex hull computations. We also present a novel learning framework, called FastMapSVM, by combining FastMap and Support Vector Machines. We then apply FastMapSVM to predict the satisfiability of Constraint Satisfaction Problems and to classify seismograms in Earthquake Science.
