Heterogeneous graph attention network improves cancer multiomics integration
Sina Tabakhi, Charlotte Vandermeulen, Ian Sudbery, Haiping Lu
TL;DR
HeteroGATomics addresses the challenge of integrating high-dimensional multiomics data from small cohorts by combining joint feature selection with heterogeneous graph learning. The framework jointly selects informative features across omics via a multi-agent system and then learns modality-specific heterogeneous graphs with relation-aware GATs, followed by a late fusion to predict cancer types. It achieves superior diagnostic performance across BLCA, LGG, and RCC datasets and provides interpretable biomarker identification, including known cancer genes and novel targets. This dual-view, attention-based approach enhances expressivity, interpretability, and potential therapeutic insights in cancer multiomics analysis.
Abstract
The increase in high-dimensional multiomics data demands advanced integration models to capture the complexity of human diseases. Graph-based deep learning integration models, despite their promise, struggle with small patient cohorts and high-dimensional features, often applying independent feature selection without modeling relationships among omics. Furthermore, conventional graph-based omics models focus on homogeneous graphs, lacking multiple types of nodes and edges to capture diverse structures. We introduce a Heterogeneous Graph ATtention network for omics integration (HeteroGATomics) to improve cancer diagnosis. HeteroGATomics performs joint feature selection through a multi-agent system, creating dedicated networks of feature and patient similarity for each omic modality. These networks are then combined into one heterogeneous graph for learning holistic omic-specific representations and integrating predictions across modalities. Experiments on three cancer multiomics datasets demonstrate HeteroGATomics' superior performance in cancer diagnosis. Moreover, HeteroGATomics enhances interpretability by identifying important biomarkers contributing to the diagnosis outcomes.
