DelTriC: A Novel Clustering Method with Accurate Outlier
Tomas Javurek, Michal Gregor, Sebastian Kula, Marian Simko
TL;DR
DelTriC tackles clustering in high-dimensional data with irregular shapes and simultaneous outlier detection. It projects data to a 2D proxy, builds a Delaunay triangulation, and back-projects to the original space to prune edges and merge clusters, aiming to preserve high-dimensional geometry. Key contributions include a sigma-based pruning framework with MAD normalization, a conservative centroid-based merging step, and a back-projection mechanism that improves anomaly detection. Empirical results on synthetic and real-world datasets show competitive clustering performance with stronger anomaly detection and favorable scalability compared to DBSCAN/HDBSCAN.
Abstract
The paper introduces DelTriC (Delaunay Triangulation Clustering), a clustering algorithm which integrates PCA/UMAP-based projection, Delaunay triangulation, and a novel back-projection mechanism to form clusters in the original high-dimensional space. DelTriC decouples neighborhood construction from decision-making by first triangulating in a low-dimensional proxy to index local adjacency, and then back-projecting to the original space to perform robust edge pruning, merging, and anomaly detection. DelTriC can outperform traditional methods such as k-means, DBSCAN, and HDBSCAN in many scenarios; it is both scalable and accurate, and it also significantly improves outlier detection.
