How to characterize imprecision in multi-view clustering?
Jinyi Xu, Zuowei Zhang, Ze Lin, Yixiang Chen, Zhe Liu, Weiping Ding
TL;DR
This work tackles imprecision in multi-view clustering by extending credal partitions to the multi-view setting, enabling meta-clusters to capture overlapping region uncertainty. It develops MvLRECM, which jointly optimizes masses across views under an entropy-weighted fusion and a low-rank regularization, using the objective $J_{MvLRECM}$ that includes a nuclear-norm term $||\boldsymbol{Z}_i||_*$ and a low-rank surrogate $\boldsymbol{M}_i \approx \boldsymbol{Z}_i$. Optimization proceeds via alternating updates: centers $\boldsymbol{V}$ from a linear system $\boldsymbol{H}^q \cdot \boldsymbol{V}^q = \boldsymbol{B}^q$, weights $\boldsymbol{w}$ from a Lagrangian, masses $\boldsymbol{M}$ with closed-form expressions, and the low-rank proxy $\boldsymbol{Z}$ through nuclear-norm minimization. Empirical results on toy data and six real-world UCI datasets show that MvLRECM improves accuracy (ACC, Purity, F-score, RI) and reduces imprecision (IR) relative to state-of-the-art baselines, while clearly illustrating the benefits and limitations of credal, uncertainty-aware multi-view clustering.
Abstract
It is still challenging to cluster multi-view data since existing methods can only assign an object to a specific (singleton) cluster when combining different view information. As a result, it fails to characterize imprecision of objects in overlapping regions of different clusters, thus leading to a high risk of errors. In this paper, we thereby want to answer the question: how to characterize imprecision in multi-view clustering? Correspondingly, we propose a multi-view low-rank evidential c-means based on entropy constraint (MvLRECM). The proposed MvLRECM can be considered as a multi-view version of evidential c-means based on the theory of belief functions. In MvLRECM, each object is allowed to belong to different clusters with various degrees of support (masses of belief) to characterize uncertainty when decision-making. Moreover, if an object is in the overlapping region of several singleton clusters, it can be assigned to a meta-cluster, defined as the union of these singleton clusters, to characterize the local imprecision in the result. In addition, entropy-weighting and low-rank constraints are employed to reduce imprecision and improve accuracy. Compared to state-of-the-art methods, the effectiveness of MvLRECM is demonstrated based on several toy and UCI real datasets.
