Generative Kernel Spectral Clustering
David Winant, Sonny Achten, Johan A. K. Suykens
TL;DR
GenKSC addresses interpretability in clustering by integrating kernel spectral clustering with a generative model. It learns a parametric feature map and a decoder while optimizing a spectral clustering objective, augmented by a reconstruction loss and a cosine-based cluster loss that uses simplex cluster codes. The model yields a discriminative, explorable latent space where cluster directions can be traversed to visualize and exaggerate distinguishing features, demonstrated on MNIST012 and FashionMNIST. This approach bridges explainable AI with deep clustering and opens avenues for semi-supervised or supervised extensions.
Abstract
Modern clustering approaches often trade interpretability for performance, particularly in deep learning-based methods. We present Generative Kernel Spectral Clustering (GenKSC), a novel model combining kernel spectral clustering with generative modeling to produce both well-defined clusters and interpretable representations. By augmenting weighted variance maximization with reconstruction and clustering losses, our model creates an explorable latent space where cluster characteristics can be visualized through traversals along cluster directions. Results on MNIST and FashionMNIST datasets demonstrate the model's ability to learn meaningful cluster representations.
