Adaptive Prototype Learning for Multimodal Cancer Survival Analysis
Hong Liu, Haosen Yang, Federica Eduati, Josien P. W. Pluim, Mitko Veta
TL;DR
Adaptive Prototype Learning (APL) tackles redundancy in multimodal cancer survival analysis by learning two sets of task-relevant prototypes through learnable queries and cross-attention to bridge high-dimensional histology and genomics representations. A multimodal mixed self-attention module facilitates cross-modal interaction and information fusion, enabling robust survival prediction across five TCGA datasets. On five cancer cohorts, APL achieves a leading average C-index (around 0.72) and outperforms unimodal, multimodal, and prototype-based baselines, with ablation demonstrating the contributions of histology and genomics prototypes and the fusion mechanism. The work provides a practical, data-driven pathway to more accurate prognostic models, with code available at the provided repository for reproducibility.
Abstract
Leveraging multimodal data, particularly the integration of whole-slide histology images (WSIs) and transcriptomic profiles, holds great promise for improving cancer survival prediction. However, excessive redundancy in multimodal data can degrade model performance. In this paper, we propose Adaptive Prototype Learning (APL), a novel and effective approach for multimodal cancer survival analysis. APL adaptively learns representative prototypes in a data-driven manner, reducing redundancy while preserving critical information. Our method employs two sets of learnable query vectors that serve as a bridge between high-dimensional representations and survival prediction, capturing task-relevant features. Additionally, we introduce a multimodal mixed self-attention mechanism to enable cross-modal interactions, further enhancing information fusion. Extensive experiments on five benchmark cancer datasets demonstrate the superiority of our approach over existing methods. The code is available at https://github.com/HongLiuuuuu/APL.
