Matrix Factorization with Dynamic Multi-view Clustering for Recommender System
Shangde Gao, Ke Liu, Yichao Fu, Hongxia Xu, Jian Wu
TL;DR
The paper addresses the scalability and interpretability gap in traditional matrix factorization-based recommender systems by introducing Matrix Factorization with Dynamic Multi-view Clustering (MFDMC). MFDMC unifies MF with dynamic multi-view clustering to learn view-specific, weight-adjusted cluster-centered representations and employs pruning to manage cluster complexity, enabling end-to-end training on web-scale data. The method yields improved RMSE on six real-world datasets and demonstrates interpretability through visualizations and semantic cluster analysis, with demonstrated applicability to computer vision tasks. This approach offers scalable, explainable representations that adapt to evolving user-item dynamics and can extend to downstream tasks beyond recommendation.
Abstract
Matrix factorization (MF), a cornerstone of recommender systems, decomposes user-item interaction matrices into latent representations. Traditional MF approaches, however, employ a two-stage, non-end-to-end paradigm, sequentially performing recommendation and clustering, resulting in prohibitive computational costs for large-scale applications like e-commerce and IoT, where billions of users interact with trillions of items. To address this, we propose Matrix Factorization with Dynamic Multi-view Clustering (MFDMC), a unified framework that balances efficient end-to-end training with comprehensive utilization of web-scale data and enhances interpretability. MFDMC leverages dynamic multi-view clustering to learn user and item representations, adaptively pruning poorly formed clusters. Each entity's representation is modeled as a weighted projection of robust clusters, capturing its diverse roles across views. This design maximizes representation space utilization, improves interpretability, and ensures resilience for downstream tasks. Extensive experiments demonstrate MFDMC's superior performance in recommender systems and other representation learning domains, such as computer vision, highlighting its scalability and versatility.
