MEATRD: Multimodal Anomalous Tissue Region Detection Enhanced with Spatial Transcriptomics

Kaichen Xu; Qilong Wu; Yan Lu; Yinan Zheng; Wenlin Li; Xingjie Tang; Jun Wang; Xiaobo Sun

MEATRD: Multimodal Anomalous Tissue Region Detection Enhanced with Spatial Transcriptomics

Kaichen Xu, Qilong Wu, Yan Lu, Yinan Zheng, Wenlin Li, Xingjie Tang, Jun Wang, Xiaobo Sun

TL;DR

This work addresses the difficulty of detecting anomalous tissue regions when histology alone is insufficient by leveraging spatial transcriptomics as a complementary molecular modality. It introduces MEATRD, a multimodal ATR detector that blends histology and ST through a three-stage pipeline anchored by the Masked Graph Dual-Attention Transformer (MGDAT), culminating in a latent reconstruction loss-based one-class classifier. The approach achieves state-of-the-art performance across eight breast cancer and four PSC datasets, including challenging cases with minimal visual deviations, and provides theoretical insights into the informational properties of multimodal bottleneck encoding. The results highlight the practical impact of integrating molecular context with imaging for precise tissue anomaly detection and suggest broad applicability to other multimodal anomaly-detection tasks.

Abstract

The detection of anomalous tissue regions (ATRs) within affected tissues is crucial in clinical diagnosis and pathological studies. Conventional automated ATR detection methods, primarily based on histology images alone, falter in cases where ATRs and normal tissues have subtle visual differences. The recent spatial transcriptomics (ST) technology profiles gene expressions across tissue regions, offering a molecular perspective for detecting ATRs. However, there is a dearth of ATR detection methods that effectively harness complementary information from both histology images and ST. To address this gap, we propose MEATRD, a novel ATR detection method that integrates histology image and ST data. MEATRD is trained to reconstruct image patches and gene expression profiles of normal tissue spots (inliers) from their multimodal embeddings, followed by learning a one-class classification AD model based on latent multimodal reconstruction errors. This strategy harmonizes the strengths of reconstruction-based and one-class classification approaches. At the heart of MEATRD is an innovative masked graph dual-attention transformer (MGDAT) network, which not only facilitates cross-modality and cross-node information sharing but also addresses the model over-generalization issue commonly seen in reconstruction-based AD methods. Additionally, we demonstrate that modality-specific, task-relevant information is collated and condensed in multimodal bottleneck encoding generated in MGDAT, marking the first theoretical analysis of the informational properties of multimodal bottleneck encoding. Extensive evaluations across eight real ST datasets reveal MEATRD's superior performance in ATR detection, surpassing various state-of-the-art AD methods. Remarkably, MEATRD also proves adept at discerning ATRs that only show slight visual deviations from normal tissues.

MEATRD: Multimodal Anomalous Tissue Region Detection Enhanced with Spatial Transcriptomics

TL;DR

Abstract

MEATRD: Multimodal Anomalous Tissue Region Detection Enhanced with Spatial Transcriptomics

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (5)

Theorems & Definitions (10)