Deep Incomplete Multi-view Clustering with Distribution Dual-Consistency Recovery Guidance
Jiaqi Jin, Siwei Wang, Zhibin Dong, Xihong Yang, Xinwang Liu, En Zhu, Kunlun He
TL;DR
This work tackles incomplete multi-view clustering by addressing inter-view heterogeneity during recovery. It introduces BURG, a framework that performs cross-view distribution transfer using flow-based models and enforces dual-consistency—neighbor-aware and prototypical—to guide both recovery and representation learning. The method combines three components: multi-view feature extraction, distribution transfer learning, and dual-consistency guided recovery, and demonstrates state-of-the-art clustering performance across six benchmarks with varying missing rates. The results highlight BURG's ability to restore realistic missing-view representations and enhance inter-view clustering structure, offering scalable and robust performance for practical multi-view data scenarios.
Abstract
Multi-view clustering leverages complementary representations from diverse sources to enhance performance. However, real-world data often suffer incomplete cases due to factors like privacy concerns and device malfunctions. A key challenge is effectively utilizing available instances to recover missing views. Existing methods frequently overlook the heterogeneity among views during recovery, leading to significant distribution discrepancies between recovered and true data. Additionally, many approaches focus on cross-view correlations, neglecting insights from intra-view reliable structure and cross-view clustering structure. To address these issues, we propose BURG, a novel method for incomplete multi-view clustering with distriBution dUal-consistency Recovery Guidance. We treat each sample as a distinct category and perform cross-view distribution transfer to predict the distribution space of missing views. To compensate for the lack of reliable category information, we design a dual-consistency guided recovery strategy that includes intra-view alignment guided by neighbor-aware consistency and cross-view alignment guided by prototypical consistency. Extensive experiments on benchmarks demonstrate the superiority of BURG in the incomplete multi-view scenario.
