Adaptive Fuzzy C-Means with Graph Embedding
Qiang Chen, Weizhong Yu, Feiping Nie, Xuelong Li
TL;DR
This work tackles automatic learning of the membership degree hyper-parameter in fuzzy C-Means (FCM) and the challenge of non-Gaussian cluster shapes. It introduces Adaptive Fuzzy C-Means with Graph Embedding (AFCM), which couples FCM with a generalized Gaussian mixture model and adds a graph-embedding regularizer to operate on learned manifolds, while a degenerate version reduces to a parameter-free FCM. The model employs an efficient alternating optimization with closed-form updates for the membership matrix, cluster centers, the adaptive hyper-parameter, and a projected data representation $\\tilde{X}$ obtained from a generalized eigenproblem. Empirical results on synthetic and real-world datasets demonstrate improved clustering performance, robustness to non-Gaussian structures, and consistent convergence, with ablation studies highlighting the benefits of one-stage clustering and manifold learning.
Abstract
Fuzzy clustering algorithms can be roughly categorized into two main groups: Fuzzy C-Means (FCM) based methods and mixture model based methods. However, for almost all existing FCM based methods, how to automatically selecting proper membership degree hyper-parameter values remains a challenging and unsolved problem. Mixture model based methods, while circumventing the difficulty of manually adjusting membership degree hyper-parameters inherent in FCM based methods, often have a preference for specific distributions, such as the Gaussian distribution. In this paper, we propose a novel FCM based clustering model that is capable of automatically learning an appropriate membership degree hyper-parameter value and handling data with non-Gaussian clusters. Moreover, by removing the graph embedding regularization, the proposed FCM model can degenerate into the simplified generalized Gaussian mixture model. Therefore, the proposed FCM model can be also seen as the generalized Gaussian mixture model with graph embedding. Extensive experiments are conducted on both synthetic and real-world datasets to demonstrate the effectiveness of the proposed model.
