SpEx: A Spectral Approach to Explainable Clustering
Tal Argov, Tal Wagner
TL;DR
SpEx introduces a spectral framework for explainable clustering that can wrap around any reference clustering or operate directly on data, by constructing explainable trees through coordinate cuts tied to graph conductance. The core idea rests on a Cheeger-type bound that guarantees the existence of low-conductance coordinate splits, enabling iterative tree construction that optimizes a multi-way normalized cut. By mapping prior methods (IMM, EMN, CART) into the non-uniform sparsest-cut framework, SpEx provides a unified analytic lens and offers two practical instantiations: SpEx-Clique (reference-based) and SpEx-kNN (reference-free). Empirically, SpEx-Clique is consistently strong across datasets, while SpEx-kNN performs particularly well on low-dimensional data, though the approach remains generic enough to cover a broad range of clustering objectives; the paper also notes the absence of universal theoretical bounds and points to directions for future theory and graph choices.
Abstract
Explainable clustering by axis-aligned decision trees was introduced by Moshkovitz et al. (2020) and has gained considerable interest. Prior work has focused on minimizing the price of explainability for specific clustering objectives, lacking a general method to fit an explanation tree to any given clustering, without restrictions. In this work, we propose a new and generic approach to explainable clustering, based on spectral graph partitioning. With it, we design an explainable clustering algorithm that can fit an explanation tree to any given non-explainable clustering, or directly to the dataset itself. Moreover, we show that prior algorithms can also be interpreted as graph partitioning, through a generalized framework due to Trevisan (2013) wherein cuts are optimized in two graphs simultaneously. Our experiments show the favorable performance of our method compared to baselines on a range of datasets.
