Incomplete Multi-view Multi-label Classification via a Dual-level Contrastive Learning Framework
Bingyan Nie, Wulin Xie, Jiang Long, Xiaohuan Lu
TL;DR
The paper tackles incomplete multi-view multi-label classification by proposing a dual-level contrastive learning (DCL) framework that decouples each view into a shared representation $S^{(m)}$ and a view-private representation $P^{(m)}$. Two contrastive losses are applied: instance-level contrastive learning on high-level features to maximize cross-view consensus, and label-level contrastive learning to exploit label correlations across views, with missing data handled via indicator matrices and masked inputs. The final fused feature $Z = \theta(\bar{P}) \cdot \bar{S}$ feeds a classifier, and the overall objective combines $L_c$, $L_s$, $L_l$, and $L_r$ to balance prediction, consistency, and reconstruction. Empirically, DCL achieves stable, superior performance across five benchmark datasets under challenging missing-data settings, with ablations showing the critical contributions of both decoupling and the two-level contrastive losses.
Abstract
Recently, multi-view and multi-label classification have become significant domains for comprehensive data analysis and exploration. However, incompleteness both in views and labels is still a real-world scenario for multi-view multi-label classification. In this paper, we seek to focus on double missing multi-view multi-label classification tasks and propose our dual-level contrastive learning framework to solve this issue. Different from the existing works, which couple consistent information and view-specific information in the same feature space, we decouple the two heterogeneous properties into different spaces and employ contrastive learning theory to fully disentangle the two properties. Specifically, our method first introduces a two-channel decoupling module that contains a shared representation and a view-proprietary representation to effectively extract consistency and complementarity information across all views. Second, to efficiently filter out high-quality consistent information from multi-view representations, two consistency objectives based on contrastive learning are conducted on the high-level features and the semantic labels, respectively. Extensive experiments on several widely used benchmark datasets demonstrate that the proposed method has more stable and superior classification performance.
