Contrastive Continual Multi-view Clustering with Filtered Structural Fusion
Xinhang Wan, Jiyuan Liu, Hao Yu, Ao Li, Xinwang Liu, Ke Liang, Zhibin Dong, En Zhu
TL;DR
This paper tackles multi-view clustering when views arrive sequentially and earlier raw views cannot be stored, a setting in which existing methods suffer catastrophic forgetting of prior knowledge. It introduces CCMVC-FSF, which keeps a fixed-size buffer of filtered structural information and uses a clustering-then-sample strategy to guide contrastive learning, so that a robust partition matrix is updated as each new view arrives. The method is optimized by a two-step alternating procedure with convergence guarantees. Key contributions include the fixed-size data buffer, an efficient scheme for generating positive and negative samples, a contrastive loss that leverages prior knowledge, and theoretical links to semi-supervised learning and knowledge distillation. Extensive experiments show robustness to the catastrophic forgetting problem (CFP) and good scalability. Because earlier raw views are never retained, the approach is privacy- and memory-friendly and well suited to streaming, real-time multi-view data.
Abstract
Multi-view clustering extracts consistent and complementary information among views and thrives in applications where all views are collected in advance. However, it overlooks scenarios where data views are collected sequentially, i.e., real-time data. In these situations, previous views become unavailable over time because of privacy issues or memory burden. Several methods have been proposed to handle this setting, but they are trapped in a stability-plasticity dilemma. Specifically, these methods suffer catastrophic forgetting of prior knowledge when a new view arrives. This catastrophic forgetting problem (CFP) makes the consistent and complementary information hard to obtain and degrades clustering performance. To tackle this, we propose a novel method termed Contrastive Continual Multi-view Clustering with Filtered Structural Fusion (CCMVC-FSF). Specifically, considering that data correlations play a vital role in clustering and that prior knowledge ought to guide the clustering process of a new view, we develop a data buffer with fixed size to store filtered structural information and utilize it to guide the generation of a robust partition matrix via contrastive learning. Furthermore, we theoretically connect CCMVC-FSF with semi-supervised learning and knowledge distillation. Extensive experiments demonstrate the effectiveness of the proposed method.
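The fixed-size buffer idea can be illustrated with a minimal sketch. It is not the paper's algorithm: the cosine similarity, the additive fusion, the top-k filter, and every function name below are assumptions chosen only to show how structural information from sequential views might be fused and filtered so that storage stays at n × k entries and raw views can be discarded.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two feature vectors (assumed similarity measure)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a > 0 and norm_b > 0 else 0.0


def top_k_filter(row, k):
    """Keep the k strongest neighbours of one sample (hypothetical filter rule)."""
    ranked = sorted(row.items(), key=lambda item: (-item[1], item[0]))
    return dict(ranked[:k])


def update_buffer(buffer, view, k):
    """Fuse a newly arrived view into the buffer, then filter back to k entries per sample.

    buffer: list of {neighbour_index: weight} dicts, or None before the first view.
    view:   list of feature vectors, one per sample; it is not stored afterwards.
    """
    n = len(view)
    if buffer is None:
        buffer = [{} for _ in range(n)]
    new_buffer = []
    for i in range(n):
        fused = dict(buffer[i])  # structural knowledge retained from earlier views
        for j in range(n):
            if j != i:
                # Additive fusion of old and new structure (an assumption of this sketch).
                fused[j] = fused.get(j, 0.0) + cosine_similarity(view[i], view[j])
        new_buffer.append(top_k_filter(fused, k))
    return new_buffer


# Two toy views of the same four samples, arriving one after the other.
view_1 = [[1.0, 0.0], [0.9, 0.1], [0.0, 1.0], [0.1, 0.9]]
view_2 = [[1.0, 0.0, 0.0], [1.0, 0.2, 0.0], [0.0, 0.0, 1.0], [0.0, 0.1, 1.0]]

buffer = update_buffer(None, view_1, k=1)
buffer = update_buffer(buffer, view_2, k=1)  # view_1 is no longer needed here
```

In a clustering-then-sample scheme, the neighbours retained in such a buffer could serve as candidate positive pairs and the remaining samples as candidate negatives for a contrastive loss; how CCMVC-FSF actually draws its samples is specified in the paper, not in this sketch.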
