Scalable Multi-view Clustering via Explicit Kernel Features Maps
Chakib Fettal, Lazhar Labiod, Mohamed Nadif
TL;DR
The paper tackles the scalability of multi-view clustering on large, attributed networks by introducing a framework that uses explicit kernel feature maps to form a consensus subspace affinity without costly iterations. By leveraging kernel summation, it factorizes the consensus into a low-dimensional embedding, enabling spectral-type clustering through efficient SVD and $k$-means. The main contributions are the MvSCK framework, a kernel-sum based consensus construction, and a view-weighting mechanism that improves clustering quality, all supported by extensive experiments on real-world, large-scale networks. The approach demonstrates strong clustering performance and superior running times, making scalable multi-view clustering feasible for datasets with millions of points and numerous views.
Abstract
The proliferation of high-dimensional data from sources such as social media, sensor networks, and online platforms has created new challenges for clustering algorithms. Multi-view clustering, which integrates complementary information from multiple data perspectives, has emerged as a powerful solution. However, existing methods often struggle with scalability and efficiency, particularly on large attributed networks. In this work, we address these limitations by leveraging explicit kernel feature maps and a non-iterative optimization strategy, enabling efficient and accurate clustering on datasets with millions of points.
