Dual-Bounded Nonlinear Optimal Transport for Size Constrained Min Cut Clustering
Fangyuan Xie, Jinghui Yuan, Feiping Nie, Xuelong Li
TL;DR
This work targets size-constrained Min Cut clustering by recasting it as a dual-bounded nonlinear optimal transport problem and solving it with a new Dual-bounded Nonlinear Frank-Wolfe (DNF) method. The authors introduce the DB-NOT formulation, proving convergence guarantees for convex and Lipschitz-smooth non-convex objectives, and apply it to the Min Cut objective with gradient $-2SF$. The main contributions include the DB-NOT framework, the DNF algorithm with norm-based and inner product-based gradient approximation, and a corollary giving a convergence bound for the size-constrained Min Cut problem. Empirical results on eight real-world datasets show state-of-the-art performance in clustering quality and speed, with a parameter-free, stable optimization process, suggesting broad applicability to nonlinear bounded OT problems in graph clustering and beyond.
Abstract
Min cut is an important graph partitioning method. However, current solutions to the min cut problem suffer from slow speeds, difficulty in solving, and often converge to simple solutions. To address these issues, we relax the min cut problem into a dual-bounded constraint and, for the first time, treat the min cut problem as a dual-bounded nonlinear optimal transport problem. Additionally, we develop a method for solving dual-bounded nonlinear optimal transport based on the Frank-Wolfe method (abbreviated as DNF). Notably, DNF not only solves the size constrained min cut problem but is also applicable to all dual-bounded nonlinear optimal transport problems. We prove that for convex problems satisfying Lipschitz smoothness, the DNF method can achieve a convergence rate of \(\mathcal{O}(\frac{1}{t})\). We apply the DNF method to the min cut problem and find that it achieves state-of-the-art performance in terms of both the loss function and clustering accuracy at the fastest speed, with a convergence rate of \(\mathcal{O}(\frac{1}{\sqrt{t}})\). Moreover, the DNF method for the size constrained min cut problem requires no parameters and exhibits better stability.
