Personalized Coupled Tensor Decomposition for Multimodal Data Fusion: Uniqueness and Algorithms
Ricardo Augusto Borsoi, Konstantin Usevich, David Brie, Tülay Adali
TL;DR
This work introduces a personalized coupled tensor decomposition (CTD) framework for multimodal data fusion, where each observed tensor is the sum of a common component linked to a shared tensor via a multilinear measurement model and a dataset-specific distinct component, with both components admitting CPDs. The authors establish both deterministic and generic uniqueness results leveraging uni-mode uniqueness and a flexible multilinear degradation model, and propose two computational strategies: a semi-algebraic method for principled initialization and an ALS-based optimization method that handles noise and flexible couplings. Experimental results on synthetic and real hyperspectral imaging data show the approach achieves accurate recovery of shared and dataset-specific factors and outperforms state-of-the-art CTD methods, particularly under inter-image variability and higher cloud contamination, with favorable computation times. The framework generalizes several existing CTD models and provides a principled basis for subsequent work on rank estimation and extensions to higher-order tensors.
Abstract
Coupled tensor decompositions (CTDs) perform data fusion by linking factors from different datasets. Although many CTDs have been already proposed, current works do not address important challenges of data fusion, where: 1) the datasets are often heterogeneous, constituting different "views" of a given phenomena (multimodality); and 2) each dataset can contain personalized or dataset-specific information, constituting distinct factors that are not coupled with other datasets. In this work, we introduce a personalized CTD framework tackling these challenges. A flexible model is proposed where each dataset is represented as the sum of two components, one related to a common tensor through a multilinear measurement model, and another specific to each dataset. Both the common and distinct components are assumed to admit a polyadic decomposition. This generalizes several existing CTD models. We provide conditions for specific and generic uniqueness of the decomposition that are easy to interpret. These conditions employ uni-mode uniqueness of different individual datasets and properties of the measurement model. Two algorithms are proposed to compute the common and distinct components: a semi-algebraic one and a coordinate-descent optimization method. Experimental results illustrate the advantage of the proposed framework compared with the state of the art approaches.
