URRL-IMVC: Unified and Robust Representation Learning for Incomplete Multi-View Clustering
Ge Teng, Ting Mao, Chen Shen, Xiang Tian, Xuesong Liu, Yaowu Chen, Jieping Ye
TL;DR
URRL-IMVC tackles incomplete multi-view clustering by learning a unified embedding through an attention-based auto-encoder that fuses multi-view and neighborhood information. It avoids explicit missing-view recovery by employing KNN imputation and data augmentation to bolster robustness, complemented by a DEC-based clustering module and a Transformer-based encoder. The framework introduces CDPE, TAM, and a shallow decoder to regularize representations, and uses a two-stage training regime with three losses to ensure clustering-friendly embeddings. Across six benchmark datasets, URRL-IMVC achieves state-of-the-art performance and demonstrates stability across varying numbers of views and missing rates, offering a scalable solution for IMVC with practical impact in real-world, incomplete-data scenarios.
Abstract
Incomplete multi-view clustering (IMVC) aims to cluster multi-view data that are only partially available. This poses two main challenges: effectively leveraging multi-view information and mitigating the impact of missing views. Prevailing solutions employ cross-view contrastive learning and missing view recovery techniques. However, they either neglect valuable complementary information by focusing only on consensus between views or provide unreliable recovered views due to the absence of supervision. To address these limitations, we propose a novel Unified and Robust Representation Learning for Incomplete Multi-View Clustering (URRL-IMVC). URRL-IMVC directly learns a unified embedding that is robust to view missing conditions by integrating information from multiple views and neighboring samples. Firstly, to overcome the limitations of cross-view contrastive learning, URRL-IMVC incorporates an attention-based auto-encoder framework to fuse multi-view information and generate unified embeddings. Secondly, URRL-IMVC directly enhances the robustness of the unified embedding against view-missing conditions through KNN imputation and data augmentation techniques, eliminating the need for explicit missing view recovery. Finally, incremental improvements are introduced to further enhance the overall performance, such as the Clustering Module and the customization of the Encoder. We extensively evaluate the proposed URRL-IMVC framework on various benchmark datasets, demonstrating its state-of-the-art performance. Furthermore, comprehensive ablation studies are performed to validate the effectiveness of our design.
