RAC-DMVC: Reliability-Aware Contrastive Deep Multi-View Clustering under Multi-Source Noise
Shihao Dong, Yue Liu, Xiaotong Zhou, Yuhui Zheng, Huiying Xu, Xinzhong Zhu
TL;DR
This work tackles reliable multi-view clustering under multi-source noise by introducing RAC-DMVC, which combines a reliability graph-guided noise contrastive objective, cross-view reconstruction for data-level denoising, dual-attention imputation for missing views, and a self-supervised cluster distillation mechanism. The approach jointly addresses missingness and observation noise, yielding robust representations and improved clustering performance across five benchmarks with varying noise ratios. Empirical results demonstrate state-of-the-art performance and resilience to noise, supported by ablations and visualizations that confirm each module's contribution. The framework offers practical benefits for real-world, noisy multi-view data applications by maintaining stable clustering quality despite significant data degradation.
Abstract
Multi-view clustering (MVC), which aims to separate the multi-view data into distinct clusters in an unsupervised manner, is a fundamental yet challenging task. To enhance its applicability in real-world scenarios, this paper addresses a more challenging task: MVC under multi-source noises, including missing noise and observation noise. To this end, we propose a novel framework, Reliability-Aware Contrastive Deep Multi-View Clustering (RAC-DMVC), which constructs a reliability graph to guide robust representation learning under noisy environments. Specifically, to address observation noise, we introduce a cross-view reconstruction to enhances robustness at the data level, and a reliability-aware noise contrastive learning to mitigates bias in positive and negative pairs selection caused by noisy representations. To handle missing noise, we design a dual-attention imputation to capture shared information across views while preserving view-specific features. In addition, a self-supervised cluster distillation module further refines the learned representations and improves the clustering performance. Extensive experiments on five benchmark datasets demonstrate that RAC-DMVC outperforms SOTA methods on multiple evaluation metrics and maintains excellent performance under varying ratios of noise.
