Table of Contents
Fetching ...

One for all: A novel Dual-space Co-training baseline for Large-scale Multi-View Clustering

Zisen Kong, Zhiqiang Fu, Dongxia Chang, Yiming Wang, Yao Zhao

TL;DR

This work tackles large-scale multi-view clustering by addressing view heterogeneity with a dual-space co-training framework. DSCMC jointly learns a discriminative anchor graph in the original space via projections $P^v$ and a latent-space transformation via $W^v$, aligned through a latent graph $Z$ and an anchor matrix $A$, with an element-wise strategy to boost robustness. The proposed objective combines complementary and consistent information across spaces, is optimized by alternating updates ensuring nonincreasing objectives, and runs in near-linear time with respect to the number of samples. Empirically, DSCMC outperforms state-of-the-art large-scale MVC methods on nine diverse datasets and demonstrates clear improvements from ablations and novel regularization choices, indicating strong practical impact for scalable multi-view clustering.

Abstract

In this paper, we propose a novel multi-view clustering model, named Dual-space Co-training Large-scale Multi-view Clustering (DSCMC). The main objective of our approach is to enhance the clustering performance by leveraging co-training in two distinct spaces. In the original space, we learn a projection matrix to obtain latent consistent anchor graphs from different views. This process involves capturing the inherent relationships and structures between data points within each view. Concurrently, we employ a feature transformation matrix to map samples from various views to a shared latent space. This transformation facilitates the alignment of information from multiple views, enabling a comprehensive understanding of the underlying data distribution. We jointly optimize the construction of the latent consistent anchor graph and the feature transformation to generate a discriminative anchor graph. This anchor graph effectively captures the essential characteristics of the multi-view data and serves as a reliable basis for subsequent clustering analysis. Moreover, the element-wise method is proposed to avoid the impact of diverse information between different views. Our algorithm has an approximate linear computational complexity, which guarantees its successful application on large-scale datasets. Through experimental validation, we demonstrate that our method significantly reduces computational complexity while yielding superior clustering performance compared to existing approaches.

One for all: A novel Dual-space Co-training baseline for Large-scale Multi-View Clustering

TL;DR

This work tackles large-scale multi-view clustering by addressing view heterogeneity with a dual-space co-training framework. DSCMC jointly learns a discriminative anchor graph in the original space via projections and a latent-space transformation via , aligned through a latent graph and an anchor matrix , with an element-wise strategy to boost robustness. The proposed objective combines complementary and consistent information across spaces, is optimized by alternating updates ensuring nonincreasing objectives, and runs in near-linear time with respect to the number of samples. Empirically, DSCMC outperforms state-of-the-art large-scale MVC methods on nine diverse datasets and demonstrates clear improvements from ablations and novel regularization choices, indicating strong practical impact for scalable multi-view clustering.

Abstract

In this paper, we propose a novel multi-view clustering model, named Dual-space Co-training Large-scale Multi-view Clustering (DSCMC). The main objective of our approach is to enhance the clustering performance by leveraging co-training in two distinct spaces. In the original space, we learn a projection matrix to obtain latent consistent anchor graphs from different views. This process involves capturing the inherent relationships and structures between data points within each view. Concurrently, we employ a feature transformation matrix to map samples from various views to a shared latent space. This transformation facilitates the alignment of information from multiple views, enabling a comprehensive understanding of the underlying data distribution. We jointly optimize the construction of the latent consistent anchor graph and the feature transformation to generate a discriminative anchor graph. This anchor graph effectively captures the essential characteristics of the multi-view data and serves as a reliable basis for subsequent clustering analysis. Moreover, the element-wise method is proposed to avoid the impact of diverse information between different views. Our algorithm has an approximate linear computational complexity, which guarantees its successful application on large-scale datasets. Through experimental validation, we demonstrate that our method significantly reduces computational complexity while yielding superior clustering performance compared to existing approaches.
Paper Structure (21 sections, 3 theorems, 34 equations, 5 figures, 3 tables, 1 algorithm)

This paper contains 21 sections, 3 theorems, 34 equations, 5 figures, 3 tables, 1 algorithm.

Key Result

Theorem 1

The objective function of our algorithm will be non-increasing for the iteration.

Figures (5)

  • Figure 1: The framework of our DSCMC method. Our algorithm learns the consistent anchor graph collaboratively through the original space and the latent space.
  • Figure 2: Convergence of three benchmark datasets.
  • Figure 3: Complete graph of the Caltech101-20 dataset compared at different norms.
  • Figure 4: Comparison of complete graphs constructed by different algorithms.
  • Figure 5: Parameter sensitivity analysis of our method on NUSWIDE dataset, where (a) fix $\lambda_3$ to tune $\lambda_1$ and $\lambda_2$; (b)$\lambda_1$ and $\lambda_2$ are fixed and tune $\lambda_3$.

Theorems & Definitions (5)

  • Theorem 1
  • proof
  • Lemma 1
  • proof
  • Theorem 2