Min-Max-Jump distance and its applications
Gangli Liu
TL;DR
The paper introduces Min-Max-Jump distance (MMJ distance), a context-sensitive metric defined as the minimum, over all non-looping paths within a given context, of the maximum jump along the path. It presents three computation strategies—recursion, estimation with copy, and calculation with copy—and demonstrates MMJ’s utility across clustering and labeling tasks, including MMJ-K-means, MMJ-SC, and CNNI enhanced by MMJ-SC, as well as a MMJ-based label-prediction method. Key contributions include a formal MMJ framework, three practical algorithms with varying accuracy and complexity, and a suite of applications showing robust performance on non-spherical and non-convex data, with extensions to directed graphs and widest-path problems. Overall, MMJ distance provides a versatile, context-aware tool for clustering, evaluation, and predictive labeling, with potential for broad applicability in complex data analysis scenarios.
Abstract
We explore three applications of Min-Max-Jump distance (MMJ distance). MMJ-based K-means revises K-means with MMJ distance. MMJ-based Silhouette coefficient revises Silhouette coefficient with MMJ distance. We also tested the Clustering with Neural Network and Index (CNNI) model with MMJ-based Silhouette coefficient. In the last application, we tested using Min-Max-Jump distance for predicting labels of new points, after a clustering analysis of data. Result shows Min-Max-Jump distance achieves good performances in all the three proposed applications. In addition, we devise several algorithms for calculating or estimating the distance.
