Efficient algorithms for optimal homology problems and their applications
Kostiantyn Lyman
TL;DR
This work develops a unified, flow-based framework for solving Optimal Homology Problems, notably the Optimal Homologous Cycle Problem (OHCP) and Multiscale SImplicial Flat Norm (MSFN), by embedding finite complexes in Euclidean space and utilizing dual-flow networks. A key innovation is the hat-augmentation that preserves homology while enabling strong duality results, along with extended analyses for directed (nonnegative) coefficients. The authors derive weak and strong duality theorems, optimality and complementary slackness conditions, and show how these lead to polynomial-time algorithms on embedded complexes via min-cost flow implementations. They further study the stability of the flat norm as a geometry-distance measure, comparing it with Hausdorff distance and establishing perturbation bounds with practical implications for validating synthetic power-distribution networks against real data. The practical impact is demonstrated through 2D planar network comparisons and a detailed structural validation pipeline for synthetic power grids, combining rigorous theory with scalable computational tools.
Abstract
The multiscale simplicial flat norm (MSFN) of a d-cycle is a family of optimal homology problems indexed by a scale parameter λ >= 0. Each instance (mSFN) optimizes the total weight of a homologous d-cycle and a bounded (d + 1)-chain, with one of the components being scaled by λ.We propose a min-cost flow formulation for solving instances of mSFN at a given scale λ in polynomial time in the case of (d + 1)-dimensional simplicial complexes embedded in {R^(d + 1)} and homology over Z. Furthermore, we establish the weak and strong dualities for mSFN, as well as the complementary slackness conditions. Additionally, we prove optimality conditions for directed flow formulations with cohomology over Z+. Next, we propose an approach based on the multiscale flat norm, a notion of distance between objects defined in the field of geometric measure theory, to compute the distance between a pair of planar geometric networks. Using a triangulation of the domain containing the input networks, the flat norm distance between two networks at a given scale can be computed by solving a linear program. In addition, this computation automatically identifies the 2D regions (patches) that capture where the two networks are different. We demonstrate through 2D examples that the flat norm distance can capture the variations of inputs more accurately than the commonly used Hausdorff distance. As a notion of stability, we also derive upper bounds on the flat norm distance between a simple 1D curve and its perturbed version as a function of the radius of perturbation for a restricted class of perturbations. We demonstrate our approach on a set of actual power networks from a county in the USA. Our approach can be extended to validate synthetic networks created for multiple infrastructures such as transportation, communication, water, and gas networks.
