Harmonized Tabular-Image Fusion via Gradient-Aligned Alternating Learning

Longfei Huang, Yang Yang

Abstract

Multimodal tabular-image fusion is an emerging task that has received increasing attention in various domains. However, existing methods may be hindered by gradient conflicts between modalities, misleading the optimization of the unimodal learner. In this paper, we propose a novel Gradient-Aligned Alternating Learning (GAAL) paradigm to address this issue by aligning modality gradients. Specifically, GAAL adopts alternating unimodal learning and a shared classifier to decouple the multimodal gradient and facilitate interaction. Furthermore, we design uncertainty-based cross-modal gradient surgery to selectively align cross-modal gradients, thereby steering the shared parameters to benefit all modalities. As a result, GAAL can provide effective unimodal assistance and help boost the overall fusion performance. Empirical experiments on widely used datasets reveal the superiority of our method through comparison with various state-of-the-art (SoTA) tabular-image fusion baselines and baselines for handling missing tabular data at test time. The source code is available at https://github.com/njustkmg/ICME26-GAAL.
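The gradient conflict the abstract refers to can be detected by checking the cosine similarity between the two modalities' gradients; a common way to resolve such a conflict is to project out the conflicting component (a PCGrad-style operation). The sketch below is a minimal illustration of this idea on toy vectors, not the paper's exact uncertainty-based surgery; `g_img` and `g_tab` are hypothetical gradient values.

```python
import numpy as np

def cosine(u, v):
    """Cosine similarity between two gradient vectors."""
    return float(u @ v / (np.linalg.norm(u) * np.linalg.norm(v)))

def project_out_conflict(g, g_ref):
    """If g conflicts with g_ref (negative dot product), remove the
    component of g along g_ref, PCGrad-style, so they no longer conflict."""
    dot = g @ g_ref
    if dot < 0:
        g = g - (dot / (g_ref @ g_ref)) * g_ref
    return g

# Hypothetical toy gradients for the image and tabular learners.
g_img = np.array([1.0, 0.0])
g_tab = np.array([-1.0, 1.0])

print(cosine(g_img, g_tab))                 # negative → conflict
g_aligned = project_out_conflict(g_img, g_tab)
print(cosine(g_aligned, g_tab))             # non-negative after projection
```

After projection the aligned gradient is orthogonal to the reference, so an update along it no longer degrades the other modality's objective to first order.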

Paper Structure

This paper contains 17 sections, 10 equations, 5 figures, 2 tables, and 1 algorithm.

Figures (5)

  • Figure 1: We visualize the gradient conflicts and evaluate the performance of existing multimodal solutions on the DVM dataset. (a) Multimodal and image gradients often show negative cosine similarity in naive joint learning, indicating severe gradient conflicts. (b) Existing multimodal solutions underperform in tabular-image fusion, failing to fully exploit unimodal image potential.
  • Figure 2: The framework of the GAAL method. (a) Gradient-Aligned Alternating Learning, in which only one modality's learner is updated at each step. Uncertainty-based cross-modal gradient surgery uses gradients from cross-modal hard samples to guide the optimization of the shared classifier for the current modality. (b) Uncertainty-based Gradient Guidance, which samples hard examples from the other modality to provide cross-modal gradient guidance.
  • Figure 3: Sensitivity to hyper-parameters $\lambda^I$ and $\lambda^T$ on the DVM and SUNAttribute datasets.
  • Figure 4: Convergence results of GAAL on the DVM and SUNAttribute datasets.
  • Figure 5: Impact of constraint margin $\epsilon$.
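Figure 2(a) describes an alternating scheme in which only one modality's learner is updated at each step, while the shared classifier is updated by both modalities in turn. The toy sketch below illustrates that control flow on a simple quadratic objective; the per-modality targets, the loss, and the learning rate are all illustrative assumptions, not the paper's actual model or training setup.

```python
import numpy as np

# Toy setup: each modality has its own learner weights, plus a shared
# classifier parameter vector. Targets are hypothetical illustration values.
targets = {"image": np.array([1.0, 2.0]), "tabular": np.array([-1.0, 0.5])}
learners = {"image": np.zeros(2), "tabular": np.zeros(2)}
shared = np.zeros(2)

def grads(modality):
    # Quadratic loss ||learner + shared - target||^2 stands in for the
    # real unimodal objective; returns gradients w.r.t. the learner and
    # the shared classifier (identical here by the chain rule).
    err = learners[modality] + shared - targets[modality]
    return 2 * err, 2 * err

lr = 0.2
for step in range(60):
    m = ["image", "tabular"][step % 2]   # alternate modalities each step
    g_uni, g_shared = grads(m)
    learners[m] -= lr * g_uni            # only this modality's learner updates
    shared -= lr * g_shared              # shared classifier sees both in turn

for m in targets:
    print(m, np.round(learners[m] + shared, 3))
```

Because each step computes gradients for a single modality, the multimodal gradient is decoupled by construction; in GAAL the shared-classifier update would additionally be steered by the uncertainty-based cross-modal gradient guidance of Figure 2(b), which this sketch omits.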