Generative Diffusion Contrastive Network for Multi-View Clustering
Jian Zhu, Xin Zou, Xi Wang, Lei Liu, Chang Tang, Li-Rong Dai
TL;DR
This work tackles the problem of low-quality data in multi-view clustering by introducing Stochastic Generative Diffusion Fusion (SGDF) and the Generative Diffusion Contrastive Network (GDCN). SGDF robustly fuses multi-view embeddings through a diffusion-based, multi-sample mechanism that averages multiple generative reconstructions, while GDCN combines per-view autoencoders, SGDF fusion, and contrastive learning to produce a robust common representation for clustering. The model achieves state-of-the-art results on four public MVC datasets, significantly outperforming existing deep MVC methods and demonstrating strong robustness to noisy or missing views. The approach offers a scalable, effective framework for deep MVC and provides public code to facilitate further research and application in multi-view fusion tasks.
Abstract
In recent years, Multi-View Clustering (MVC) has been significantly advanced under the influence of deep learning. By integrating heterogeneous data from multiple views, MVC enhances clustering analysis, making multi-view fusion critical to clustering performance. However, there is a problem of low-quality data in multi-view fusion. This problem primarily arises from two reasons: 1) Certain views are contaminated by noisy data. 2) Some views suffer from missing data. This paper proposes a novel Stochastic Generative Diffusion Fusion (SGDF) method to address this problem. SGDF leverages a multiple generative mechanism for the multi-view feature of each sample. It is robust to low-quality data. Building on SGDF, we further present the Generative Diffusion Contrastive Network (GDCN). Extensive experiments show that GDCN achieves the state-of-the-art results in deep MVC tasks. The source code is publicly available at https://github.com/HackerHyper/GDCN.
