Pathology-genomic fusion via biologically informed cross-modality graph learning for survival analysis

Zeyu Zhang; Yuanshen Zhao; Jingxian Duan; Yaou Liu; Hairong Zheng; Dong Liang; Zhenyu Zhang; Zhi-Cheng Li

Pathology-genomic fusion via biologically informed cross-modality graph learning for survival analysis

Zeyu Zhang, Yuanshen Zhao, Jingxian Duan, Yaou Liu, Hairong Zheng, Dong Liang, Zhenyu Zhang, Zhi-Cheng Li

TL;DR

This work tackles survival prediction by fusing histology and transcriptomics through a biology-informed heterogeneous graph (PGHG). It builds pathology and genomic subgraphs guided by prior biological knowledge, supervises pathology features with GSVA pathway scores, and learns cross-modal representations via a graph attention network to produce robust prognostic signals. The approach demonstrates superior performance over unimodal and other multimodal methods across multiple TCGA and FAHZU datasets, with interpretable results highlighting tissue structures and pathways linked to prognosis. The framework offers a scalable, interpretable means to integrate multi-modal clinical data and identify potential biomarkers for cancer survival.

Abstract

The diagnosis and prognosis of cancer are typically based on multi-modal clinical data, including histology images and genomic data, due to the complex pathogenesis and high heterogeneity. Despite the advancements in digital pathology and high-throughput genome sequencing, establishing effective multi-modal fusion models for survival prediction and revealing the potential association between histopathology and transcriptomics remains challenging. In this paper, we propose Pathology-Genome Heterogeneous Graph (PGHG) that integrates whole slide images (WSI) and bulk RNA-Seq expression data with heterogeneous graph neural network for cancer survival analysis. The PGHG consists of biological knowledge-guided representation learning network and pathology-genome heterogeneous graph. The representation learning network utilizes the biological prior knowledge of intra-modal and inter-modal data associations to guide the feature extraction. The node features of each modality are updated through attention-based graph learning strategy. Unimodal features and bi-modal fused features are extracted via attention pooling module and then used for survival prediction. We evaluate the model on low-grade gliomas, glioblastoma, and kidney renal papillary cell carcinoma datasets from the Cancer Genome Atlas (TCGA) and the First Affiliated Hospital of Zhengzhou University (FAHZU). Extensive experimental results demonstrate that the proposed method outperforms both unimodal and other multi-modal fusion models. For demonstrating the model interpretability, we also visualize the attention heatmap of pathological images and utilize integrated gradient algorithm to identify important tissue structure, biological pathways and key genes.

Pathology-genomic fusion via biologically informed cross-modality graph learning for survival analysis

TL;DR

Abstract

Paper Structure (18 sections, 25 equations, 5 figures, 2 tables)

This paper contains 18 sections, 25 equations, 5 figures, 2 tables.

Introduction
Related Work
Survival Analysis with Genomic features
Survival Analysis with histology image
Survival Analysis with Multi-modal Learning
Methods
Pathological feature extraction and subgraph construction
Biological pathway feature extraction and subgraph construction
Biological prior knowledge guided representation learning
Pathology-genome heterogeneous graph learning
Multimodal Interpretability
Experiment and Results
Datasets and Preprocessing
Implementation Details
Ablation Study
...and 3 more sections

Figures (5)

Figure 1: Schematic illustration for the proposed method: pathology-genome heterogeneous graph. The blue box contains the biological knowledge guided representation learning network. The yellow box contains the pathology-genome heterogeneous graph.
Figure 2: Kaplan Meier curves of PathoGenoSurvGraph, includes pathological subgraph, genomic subgraph and PathoGenoSurvGraph w/wo biological knowledge guided representation learning module.
Figure 3: Genomic and histological interpretability in low grade gliomas. A: Global attention visualization and three high mean absolute IG biological pathways (REACTOME NON INTEGRIN MEMBRANE ECM INTERACTIONS, KEGG GLIOMA and KEGG FOCAL ADHESION) co-attention visualization for high-risk patient and low-risk patient in TCGA-LGG dataset. B: Top 10 absolute IG value RNA in three biological pathways with the color indicates the relative expression color.
Figure 4: Genomic and histological interpretability in glioblastoma. A: Global attention visualization and three high mean absolute IG biological pathways (REACTOME SIGNALING BY ERBB2, REACTOME ANTIGEN ACTIVATES B CELL RECEPTOR BCR LEADING TO GENERATION OF SECOND MESSENGERS and REACTOME GABA RECEPTOR ACTIVATION) co-attention visualization for high-risk patient and low-risk patient in FAHZU-GBM dataset. B: Top 10 RNA in each biological pathways with the color indicates the relative expression value.
Figure 5: Genomic and histological interpretability in kidney renal papillary cell carcinoma. A: Global attention visualization and three high mean absolute IG biological pathways (REACTOME FCERI MEDIATED MAPK ACTIVATION, KEGG ECM RECEPTOR INTERACTION and REACTOME SIGNALING BY MODERATE KINASE ACTIVITY BRAF MUTANTS) co-attention visualization for high-risk patient and low-risk patient in TCGA-KIRP dataset. B: Top 10 RNA in each biological pathways with the color indicates the relative expression value.

Pathology-genomic fusion via biologically informed cross-modality graph learning for survival analysis

TL;DR

Abstract

Pathology-genomic fusion via biologically informed cross-modality graph learning for survival analysis

Authors

TL;DR

Abstract

Table of Contents

Figures (5)