Distance-Preserving Graph Compression Techniques
Amirali Madani, Anil Maheshwari
TL;DR
This paper tackles distance-preserving graph compression by focusing on optimally redistributing edge weights after contracting a given set of edges, with a rigorous emphasis on weighted paths and trees. It develops tight bounds and exact optimal strategies for a variety of contraction scenarios on paths (single edge, independent edges, contiguous subpaths) and trees (single edge via a marking problem, with equal-sized and varying-size subtree analyses; fractional versus integral markings). A key contribution is the demonstration that, in many cases, optimality is achieved by marking only neighboring edges on one side of the contracted edge, and in trees the optimal choice reduces to a side-selective, linear-time procedure. The results advance distance-preserving compression by providing provable guarantees on the error and efficient algorithms, while outlining open problems for general graphs and alternative error models with potential practical impact on scalable graph analytics.
Abstract
We study the problem of distance-preserving graph compression for weighted paths and trees. The problem entails a weighted graph $G = (V, E)$ with non-negative weights, and a subset of edges $E^{\prime} \subset E$ which needs to be removed from G (with their endpoints merged as a supernode). The goal is to redistribute the weights of the deleted edges in a way that minimizes the error. The error is defined as the sum of the absolute differences of the shortest path lengths between different pairs of nodes before and after contracting $E^{\prime}$. Based on this error function, we propose optimal approaches for merging any subset of edges in a path and a single edge in a tree. Previous works on graph compression techniques aimed at preserving different graph properties (such as the chromatic number) or solely focused on identifying the optimal set of edges to contract. However, our focus in this paper is on achieving optimal edge contraction (when the contracted edges are provided as input) specifically for weighted trees and paths.
