S^2MVTC: a Simple yet Efficient Scalable Multi-View Tensor Clustering
Zhen Long, Qiyuan Wang, Yazhou Ren, Yipeng Liu, Ce Zhu
TL;DR
The paper tackles scalability in multi-view clustering by shifting focus from global correlations of anchor graphs or projection matrices to direct learning of embedding-feature correlations within and across views. It introduces the tensor low-frequency approximation (TLFA) applied to a rotated embedding tensor $oldsymbol{rak B}\uparrow$ to enforce intra-view similarity, while a consensus embedding $ ilde{oldsymbol B}$ enforces inter-view semantic consistency. The model optimizes over projection matrices $oldsymbol{U}_v$ and embedding features $oldsymbol{B}_v$ with an auxiliary variable, enabling a separable alternating optimization that jointly yields a robust fused representation and final clustering via $oldsymbol{D}$ and $oldsymbol{G}$. Experiments on six large-scale datasets show that S^2MVTC substantially improves clustering accuracy and reduces CPU time compared to state-of-the-art methods, demonstrating strong scalability for massive data; the authors also provide publicly available code. The approach offers practical impact for large-scale multi-view tasks in vision, neuroscience, and multimedia, where rapid and accurate clustering across many views is essential.
Abstract
Anchor-based large-scale multi-view clustering has attracted considerable attention for its effectiveness in handling massive datasets. However, current methods mainly seek the consensus embedding feature for clustering by exploring global correlations between anchor graphs or projection matrices.In this paper, we propose a simple yet efficient scalable multi-view tensor clustering (S^2MVTC) approach, where our focus is on learning correlations of embedding features within and across views. Specifically, we first construct the embedding feature tensor by stacking the embedding features of different views into a tensor and rotating it. Additionally, we build a novel tensor low-frequency approximation (TLFA) operator, which incorporates graph similarity into embedding feature learning, efficiently achieving smooth representation of embedding features within different views. Furthermore, consensus constraints are applied to embedding features to ensure inter-view semantic consistency. Experimental results on six large-scale multi-view datasets demonstrate that S^2MVTC significantly outperforms state-of-the-art algorithms in terms of clustering performance and CPU execution time, especially when handling massive data. The code of S^2MVTC is publicly available at https://github.com/longzhen520/S2MVTC.
