Self-Learning Symmetric Multi-view Probabilistic Clustering
Junjie Liu, Junlong Liu, Rongxin Jiang, Yaowu Chen, Chen Shen, Jieping Ye
TL;DR
This work tackles incomplete multi-view clustering by introducing SLS-MPC, a unified probabilistic framework that combines a symmetric multi-view posterior with a hyper-parameter-free self-learning probability function. It advances the field by (i) decomposing joint view-posterior probabilities into per-view components, (ii) enforcing cross-view and multi-view consistency to learn each view’s distribution without prior knowledge, (iii) refining probabilities with graph-context information via path and co-neighbor propagation, and (iv) clustering samples in an unsupervised, parameter-free manner using refined posterior probabilities. Extensive experiments across multiple benchmarks demonstrate state-of-the-art performance in both complete and incomplete MVC scenarios, with strong robustness to missing views and noise. The approach offers practical benefits for real-world, multi-source data where view availability is imperfect and prior tuning is undesirable.
Abstract
Multi-view Clustering (MVC) has achieved significant progress, with many efforts dedicated to learn knowledge from multiple views. However, most existing methods are either not applicable or require additional steps for incomplete MVC. Such a limitation results in poor-quality clustering performance and poor missing view adaptation. Besides, noise or outliers might significantly degrade the overall clustering performance, which are not handled well by most existing methods. In this paper, we propose a novel unified framework for incomplete and complete MVC named self-learning symmetric multi-view probabilistic clustering (SLS-MPC). SLS-MPC proposes a novel symmetric multi-view probability estimation and equivalently transforms multi-view pairwise posterior matching probability into composition of each view's individual distribution, which tolerates data missing and might extend to any number of views. Then, SLS-MPC proposes a novel self-learning probability function without any prior knowledge and hyper-parameters to learn each view's individual distribution. Next, graph-context-aware refinement with path propagation and co-neighbor propagation is used to refine pairwise probability, which alleviates the impact of noise and outliers. Finally, SLS-MPC proposes a probabilistic clustering algorithm to adjust clustering assignments by maximizing the joint probability iteratively without category information. Extensive experiments on multiple benchmarks show that SLS-MPC outperforms previous state-of-the-art methods.
