Deep Multiview Clustering by Contrasting Cluster Assignments
Jie Chen, Hua Mao, Wai Lok Woo, Xi Peng
TL;DR
This paper tackles multiview clustering by learning view-invariant representations through cross-view contrastive learning (CVCL). It introduces a two-module network that combines per-view autoencoders with a cluster-level CVCL objective, which aligns soft cluster assignments across views. Training follows a pretraining/fine-tuning scheme, and the method is accompanied by a theoretical analysis of alignment and complexity. Empirically, CVCL achieves state-of-the-art clustering performance on seven datasets, and ablations show that both pretraining and the cross-view consistency term $L_a$ matter for robust cross-view alignment. The approach is practically relevant to unsupervised clustering of multi-source data, offering improved discriminability and scalable performance without requiring labels.
Abstract
Multiview clustering (MVC) aims to reveal the underlying structure of multiview data by categorizing data samples into clusters. Deep learning-based methods exhibit strong feature learning capabilities on large-scale datasets. For most existing deep MVC methods, exploring the invariant representations of multiple views is still an intractable problem. In this paper, we propose a cross-view contrastive learning (CVCL) method that learns view-invariant representations and produces clustering results by contrasting the cluster assignments among multiple views. Specifically, we first employ deep autoencoders to extract view-dependent features in the pretraining stage. Then, a cluster-level CVCL strategy is presented to explore consistent semantic label information among the multiple views in the fine-tuning stage. Thus, the proposed CVCL method is able to produce more discriminative cluster assignments by virtue of this learning strategy. Moreover, we provide a theoretical analysis of soft cluster assignment alignment. Extensive experimental results obtained on several datasets demonstrate that the proposed CVCL method outperforms several state-of-the-art approaches.
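To make the idea of "contrasting the cluster assignments among multiple views" more concrete, the following is a minimal NumPy sketch of one possible cluster-level contrastive objective for two views. It is an illustration under stated assumptions, not the loss defined in the paper: the function name `cluster_contrastive_loss`, the `temperature` parameter, the use of cosine similarity, and the InfoNCE-style form are all hypothetical choices made here. Each column of a soft assignment matrix is treated as the representation of one cluster, and the same cluster index in the other view is treated as its positive pair.

```python
import numpy as np


def cluster_contrastive_loss(q_a, q_b, temperature=0.5):
    """Illustrative cluster-level contrastive loss between two views.

    q_a, q_b: arrays of shape (n_samples, n_clusters). Each row is the soft
    cluster assignment of one sample in one view (non-negative, sums to 1).
    The temperature value is an assumption, not taken from the paper.
    """
    k = q_a.shape[1]
    # Each column describes one cluster by how strongly every sample belongs to it.
    cols = np.concatenate([q_a.T, q_b.T], axis=0)                 # (2k, n_samples)
    cols = cols / (np.linalg.norm(cols, axis=1, keepdims=True) + 1e-12)
    sim = cols @ cols.T / temperature                             # scaled cosine similarity
    np.fill_diagonal(sim, -np.inf)                                # exclude self-pairs
    # The positive for cluster j in one view is cluster j in the other view.
    pos_idx = np.concatenate([np.arange(k, 2 * k), np.arange(0, k)])
    # Row-wise log-softmax, computed stably.
    row_max = sim.max(axis=1, keepdims=True)
    log_prob = sim - row_max - np.log(np.exp(sim - row_max).sum(axis=1, keepdims=True))
    return float(-log_prob[np.arange(2 * k), pos_idx].mean())


# Toy data: six samples, three clusters, slightly smoothed one-hot assignments.
labels = np.array([0, 0, 1, 1, 2, 2])
q_view1 = 0.9 * np.eye(3)[labels] + 0.1 / 3
q_view2 = q_view1.copy()                  # second view agrees with the first
q_shuffled = q_view1[:, [1, 2, 0]]        # second view with cluster indices permuted

loss_aligned = cluster_contrastive_loss(q_view1, q_view2)
loss_shuffled = cluster_contrastive_loss(q_view1, q_shuffled)
print(loss_aligned < loss_shuffled)
```

In this toy setting the loss is lower when the two views assign samples to the same cluster indices than when the indices are permuted, which is the behavior a cross-view alignment term is meant to encourage. In a full pipeline the assignment matrices would come from per-view networks (for example, pretrained autoencoders followed by a softmax layer) rather than being constructed by hand.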
